Cla97C02G029020 (gene) Watermelon (97103) v2

Name: Cla97C02G029020
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: UDP-glycosyltransferase 9
Location: Cla97Chr02 : 2384766 .. 2388652 (-)
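
The Location value can be read as chromosome, start, end and strand. The sketch below parses it and computes the length of the genomic span. It assumes the coordinates are 1-based and inclusive, which the record does not state; `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re


def parse_location(value):
    """Split a value such as 'Cla97Chr02 : 2384766 .. 2388652 (-)' into parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, value)
    if match is None:
        raise ValueError(f"unrecognised location: {value!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1


chrom, start, end, strand = parse_location("Cla97Chr02 : 2384766 .. 2388652 (-)")
print(chrom, strand, span_length(start, end))  # Cla97Chr02 - 3887
```

Under that assumption the feature spans 3887 bp on the minus strand of Cla97Chr02.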
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTGAGGCCGCAGCCGCCACCTCCCACCATACCTCTCTGCACATTGCCATGTACCCTTGGTTTGCCCTTGGCCATCTCACTCCATTTCTCCATCTCTCCAATAAATTAGCCAAAAAAGGCCACACAATCTCCTTCTTCATCCCCACCAAAACCCTTCCCAAATTTGAACCCTTAAATCTCTTCCCTAATCTCATCACCTTCATTCCTATCAATGTTCCTCATGTCCATGGCCTCCCACATGGGGCAGAGACCACTTCTGATGTCCCTTACCCTCTCCACAACCTCATCATGACTTCCATGGATCTCACTCAACCTCAAATCACTCACCTCCTTCAAACCCTAAAACCCCATCTCATCCTTTTTGATTTCACTCATTGGTTGCCAAAATTGGCCAGCCAATTGGCTATCAAATCAATCCATTATTGTGTCACTAGTGCAGCCATGATTGCTTATACTCTAACCCCATCAAGGCAATTTTCTAAAATTGAGTTAACTGAGGAAGATCTCATGAAACCCCCATTTGGTTACCCTAGTTCCACCATCAATCTTCATCTCCATGAGGCTAGACTTTTTGCATCAAAAAGAAAGTGGAAGTTTGGGAGTGATGTACTTTTTTATGATCGCCAATTCATTAGTTTTAGTGAATGTGATGCAATAGGGTTTAGGACATGTCATGAGATTGAGGGAGATTTTGTGAGTTATCTTCAAATTGAGTTCAAAAAACCTATTTTGCTTACTGGGTCGGTTTTGTGTGAACCACTGGACACACCTTTGGAGGAGAAATGGCAAAGTTGGCTTTCAGGGTTTAAGGAGGGTTCGGTGGTCTATTGTGCATTTGGGAGCGAGTGTACGTTACAAATGGAGCAATTTCAAGAACTTCTCATGGGGTTTGAGCTTTTAAACATGCCCTTCCTTGCTGCACTCAAACCACCATTCGGGGCAGACACAATTGAAGCTGCATTTCCTGAAGAGTTCGCGCAGAGAGTCGGAAGTCGAGGGGTGGTTTTCGGCGGGTGGATTCAACAAGAAAGGATTTTGGAGCATCCATCTGTGGGATGCTTTGTTACTCATTGTGGATCTAATTCTTTAAAGGAGGCATTGGTGAATAAGTGTCAATTGGTGTTGTTGCCTCAAGTTGGGGATCAAATTATCAATGCAAGGATGATGGGAAACAATCTTAGGGTTGGAGTGGAGGTGGAGAAAAGAGAGGAAAATGGATGGTTTACAAAGGAAAGTGTGTGTAAAGCTGTGAAGATTGTGATGGATGAAGAGAATGAAATTGGAAAAGAAGTTAGAATAAATCATGCTAAAATAAGGGATCTTTTGTTGAAAAAAGATTTGGAACAATCTTATATTGATGGTTTTAGCCAGAATCTTTGTGATTTGGTGGCATGCATGGAAAACACATCCAAAAACATGTAGAAAACAGGTAAAAGCCACATCTTTTGCTGTTGAGAATGAGACTATATGCTAGTCTATAGTGTTTTGAAGAGTTGAATCATGGCTTTAGTTTGTTGTGTTTTCTTTTCCCACAAAATGTTCCAATTTTACATTTTAAAGCAAATTTAAGGTAATGAATGGATTTATTTTTTGACAATTTTTAAATGATGCTATCTTCTTTTAAACCAAAGTATTATTGAATAGGATAATGCTTTTATGTAAGTTCAAAATACAGCTTTTAATTTTACTTTTCGTTTATTTACATCCAACCATGTTACTTTTCTGACCATGAAATTAATGGATACTCGAAATGGTGCTCAATAACCTAACAATATGATGATCAAAATCCATCATTATAAACACTTGGGTTTAACTAGGCCACCATTAATTTGTGGCCACAAAAATAATCTGTTTATCTTCCTTTAATGGGTGTTTGTTTTGTTGAGTTTGGTGAATCTTTAGTTTCCTAAAGAAATTTAACTGTTCATAGCCGAGTGGGGGAGATTTTGCAACACACCCATTAATGGAATAATGACTTAGGAACAATAAACAAATTT
TAGTTATGGATTAAACATGAGAGAGTATGTTAATAATATATCACACACCAAGTTCACAACCCTCATAACCTCTTATTGTTTCATTGTCACTTGTCTTATTCTTGCACTGATATATATAGAGAGCTTTGAACGCAAAATAAAACATCATCGTTTCATATTAATTTCCAAACATCATATGTTAAGGAGAAAAGAAAAGAGAAAAAAAAAACACGATATGGCTATGATCAAATCTAGCTTCTTAGTTACTACATTCTTGATTTCGTTGTTATCATTTACTCATATTGAGTCTTGATCTCATGTTGATATTATCTTTTATTTGAAGAAGTATGCTGACATTGTGTCCGGTGTTGATGTCACTAAGGTAAACACACAATAAAATGTATACCAAAATGATTACTTATTGAATTGATCAATTATTTTTGTTAATTTTTGTTACTTTGAAACATAGCCCTTGAAAGATGCATGGAATGATGCATGTGCATCTGTTCGTCCAAGCGCAATTGTTATTCTAAGAGGGATATTTAAAGTAAGTGAAGGAAAACATGCAAGAGTCCTGTTGAAATTCGACTTCAAGGAACATTGCAGGCTCCGAAGCTCCCTCATGGAGATAGTTGGATTACTTTTGCATATGTTGATAGATTGACAATATCAGGTGGAGGAGTTTTTGATGGTCAAGGAAAGGCAGGTTGGGAAAAGAATGATTGCCATAAAAACACAAATTGTACTGACCTACCTATTGTAAGCTTTATACCTTATATTGTTTTATTTTTCAATCTCTTTCATTTTTGTTGATTTTTGGGCAGTTTATTAAAGTTCTCTACTATTTTCAATTTTACAATTTGGTAAATGTTCTTTTATTCTTTATGACAATGTTAAGGAATTTAGTTTTGAAATAAATTTTAACCCTTAGAAGAATTAGGGTAAGGAGCTAAATTTTCAAAATATAAAGATTGTATACCTAAAGAAAATTTATGATTTTTTCCCTCCCTATGAAAATTTTTGGCTTCACACTAGCTCTGACACAAAATATTGATCATATAAAATGAGAGTTTAAGGTTCAACTTCATTACCAATTCAATTGTGAAGAGAATAACATCATCACTAAATAGCAAGAACTTCCATGTCAACATCCTAGGTTGTAACAACCTCACATTTCATGGTGTCAACATAATTGCACCAGAAAATAGTCCCAATACAGATGGAATACATATTGGTCGATCAATTGGGATCTCAATCACAAAATCTCGAATTTCAACCGGAGATGTCACCAACGTGACGTGTGGACCTGGACATGGAATAAGCATAGGAAGTCTTGGAAAATACACTAATGAAGGACATGTTGAAGGTATAATAGTAAAGAACTGCACCATAATGAACACCACGAATGGTGTTAGGATCAAAACATGGCCAGCCTCTCCCGTTGCCGGCACTGCCACCAACATGCATTTTTCGAATATTATAATGGTCAATGTTAGCAACCCAATTCTCATAGACCAAGAGTATTGCCCATGGAATCAATGCAATCGCAAGGTATTTATCACATTTTTAAAACTTCGATTTTTAAATTTCTTCATGTTAATTAACTTCCCAAATGAACTTTTTATTTTCATTTTTCAGATTCCATCAAAGATTAAGATTAGTAAAATGAGCTTCAAGAACATTAGAGGAAGTTTTGCAAATCCAATGGCAGTCAAACTTATTTGTAATAGCGACCTGCCATGTGACGAAGTAAAAGTAGCCAACATTGACCTTGTTAACAATGGAAAAGAAGGACCCAGGACCCATTACATCTCAGTGTATGAATGTGAGACCCATCATTTCGGGGATACAAAATCCTCAAACCTGCTCTTCCCCTTACATTTTTCCATGATTTGTTCATTATCCTAA

mRNA sequence

ATGGCTGAGGCCGCAGCCGCCACCTCCCACCATACCTCTCTGCACATTGCCATGTACCCTTGGTTTGCCCTTGGCCATCTCACTCCATTTCTCCATCTCTCCAATAAATTAGCCAAAAAAGGCCACACAATCTCCTTCTTCATCCCCACCAAAACCCTTCCCAAATTTGAACCCTTAAATCTCTTCCCTAATCTCATCACCTTCATTCCTATCAATGTTCCTCATGTCCATGGCCTCCCACATGGGGCAGAGACCACTTCTGATGTCCCTTACCCTCTCCACAACCTCATCATGACTTCCATGGATCTCACTCAACCTCAAATCACTCACCTCCTTCAAACCCTAAAACCCCATCTCATCCTTTTTGATTTCACTCATTGGTTGCCAAAATTGGCCAGCCAATTGGCTATCAAATCAATCCATTATTGTGTCACTAGTGCAGCCATGATTGCTTATACTCTAACCCCATCAAGGCAATTTTCTAAAATTGAGTTAACTGAGGAAGATCTCATGAAACCCCCATTTGGTTACCCTAGTTCCACCATCAATCTTCATCTCCATGAGGCTAGACTTTTTGCATCAAAAAGAAAGTGGAAGTTTGGGAGTGATGTACTTTTTTATGATCGCCAATTCATTAGTTTTAGTGAATGTGATGCAATAGGGTTTAGGACATGTCATGAGATTGAGGGAGATTTTGTGAGTTATCTTCAAATTGAGTTCAAAAAACCTATTTTGCTTACTGGGTCGGTTTTGTGTGAACCACTGGACACACCTTTGGAGGAGAAATGGCAAAGTTGGCTTTCAGGGTTTAAGGAGGGTTCGGTGGTCTATTGTGCATTTGGGAGCGAGTGTACGTTACAAATGGAGCAATTTCAAGAACTTCTCATGGGGTTTGAGCTTTTAAACATGCCCTTCCTTGCTGCACTCAAACCACCATTCGGGGCAGACACAATTGAAGCTGCATTTCCTGAAGAGTTCGCGCAGAGAGTCGGAAGTCGAGGGGTGGTTTTCGGCGGGTGGATTCAACAAGAAAGGATTTTGGAGCATCCATCTGTGGGATGCTTTGTTACTCATTGTGGATCTAATTCTTTAAAGGAGGCATTGGTGAATAAGTGTCAATTGGTGTTGTTGCCTCAAGTTGGGGATCAAATTATCAATGCAAGGATGATGGGAAACAATCTTAGGGTTGGAGTGGAGGTGGAGAAAAGAGAGGAAAATGGATGGTTTACAAAGGAAAGTGTGTGTAAAGCTGTGAAGATTGTGATGGATGAAGAGAATGAAATTGGAAAAGAAAAGTATGCTGACATTGTGTCCGGTGTTGATGTCACTAAGCCCTTGAAAGATGCATGGAATGATGCATGTGCATCTGTTCGTCCAAGCGCAATTGTTATTCTAAGAGGGATATTTAAAAACTTCCATGTCAACATCCTAGGTTGTAACAACCTCACATTTCATGGTGTCAACATAATTGCACCAGAAAATAGTCCCAATACAGATGGAATACATATTGGTCGATCAATTGGGATCTCAATCACAAAATCTCGAATTTCAACCGGAGATGTCACCAACGTGACGTGTGGACCTGGACATGGAATAAGCATAGGAAGTCTTGGAAAATACACTAATGAAGGACATGTTGAAGGTATAATAGTAAAGAACTGCACCATAATGAACACCACGAATGGTGTTAGGATCAAAACATGGCCAGCCTCTCCCGTTGCCGGCACTGCCACCAACATGCATTTTTCGAATATTATAATGGTCAATGTTAGCAACCCAATTCTCATAGACCAAGAGTATTGCCCATGGAATCAATGCAATCGCAAGATTCCATCAAAGATTAAGATTAGTAAAATGAGCTTCAAGAACATTAGAGGAAGTTTTGCAAATCCAATGGCAGTCAAACTTATTTGTAATAGCGACCTGCCATGTGACGAAGTAAAAGTAGCCAACATTGACCTTGTTAACAATGGAAAAGAAGGACCCAGGACCCATTACAT
CTCAGTGTATGAATGTGAGACCCATCATTTCGGGGATACAAAATCCTCAAACCTGCTCTTCCCCTTACATTTTTCCATGATTTGTTCATTATCCTAA

Coding sequence (CDS)

ATGGCTGAGGCCGCAGCCGCCACCTCCCACCATACCTCTCTGCACATTGCCATGTACCCTTGGTTTGCCCTTGGCCATCTCACTCCATTTCTCCATCTCTCCAATAAATTAGCCAAAAAAGGCCACACAATCTCCTTCTTCATCCCCACCAAAACCCTTCCCAAATTTGAACCCTTAAATCTCTTCCCTAATCTCATCACCTTCATTCCTATCAATGTTCCTCATGTCCATGGCCTCCCACATGGGGCAGAGACCACTTCTGATGTCCCTTACCCTCTCCACAACCTCATCATGACTTCCATGGATCTCACTCAACCTCAAATCACTCACCTCCTTCAAACCCTAAAACCCCATCTCATCCTTTTTGATTTCACTCATTGGTTGCCAAAATTGGCCAGCCAATTGGCTATCAAATCAATCCATTATTGTGTCACTAGTGCAGCCATGATTGCTTATACTCTAACCCCATCAAGGCAATTTTCTAAAATTGAGTTAACTGAGGAAGATCTCATGAAACCCCCATTTGGTTACCCTAGTTCCACCATCAATCTTCATCTCCATGAGGCTAGACTTTTTGCATCAAAAAGAAAGTGGAAGTTTGGGAGTGATGTACTTTTTTATGATCGCCAATTCATTAGTTTTAGTGAATGTGATGCAATAGGGTTTAGGACATGTCATGAGATTGAGGGAGATTTTGTGAGTTATCTTCAAATTGAGTTCAAAAAACCTATTTTGCTTACTGGGTCGGTTTTGTGTGAACCACTGGACACACCTTTGGAGGAGAAATGGCAAAGTTGGCTTTCAGGGTTTAAGGAGGGTTCGGTGGTCTATTGTGCATTTGGGAGCGAGTGTACGTTACAAATGGAGCAATTTCAAGAACTTCTCATGGGGTTTGAGCTTTTAAACATGCCCTTCCTTGCTGCACTCAAACCACCATTCGGGGCAGACACAATTGAAGCTGCATTTCCTGAAGAGTTCGCGCAGAGAGTCGGAAGTCGAGGGGTGGTTTTCGGCGGGTGGATTCAACAAGAAAGGATTTTGGAGCATCCATCTGTGGGATGCTTTGTTACTCATTGTGGATCTAATTCTTTAAAGGAGGCATTGGTGAATAAGTGTCAATTGGTGTTGTTGCCTCAAGTTGGGGATCAAATTATCAATGCAAGGATGATGGGAAACAATCTTAGGGTTGGAGTGGAGGTGGAGAAAAGAGAGGAAAATGGATGGTTTACAAAGGAAAGTGTGTGTAAAGCTGTGAAGATTGTGATGGATGAAGAGAATGAAATTGGAAAAGAAAAGTATGCTGACATTGTGTCCGGTGTTGATGTCACTAAGCCCTTGAAAGATGCATGGAATGATGCATGTGCATCTGTTCGTCCAAGCGCAATTGTTATTCTAAGAGGGATATTTAAAAACTTCCATGTCAACATCCTAGGTTGTAACAACCTCACATTTCATGGTGTCAACATAATTGCACCAGAAAATAGTCCCAATACAGATGGAATACATATTGGTCGATCAATTGGGATCTCAATCACAAAATCTCGAATTTCAACCGGAGATGTCACCAACGTGACGTGTGGACCTGGACATGGAATAAGCATAGGAAGTCTTGGAAAATACACTAATGAAGGACATGTTGAAGGTATAATAGTAAAGAACTGCACCATAATGAACACCACGAATGGTGTTAGGATCAAAACATGGCCAGCCTCTCCCGTTGCCGGCACTGCCACCAACATGCATTTTTCGAATATTATAATGGTCAATGTTAGCAACCCAATTCTCATAGACCAAGAGTATTGCCCATGGAATCAATGCAATCGCAAGATTCCATCAAAGATTAAGATTAGTAAAATGAGCTTCAAGAACATTAGAGGAAGTTTTGCAAATCCAATGGCAGTCAAACTTATTTGTAATAGCGACCTGCCATGTGACGAAGTAAAAGTAGCCAACATTGACCTTGTTAACAATGGAAAAGAAGGACCCAGGACCCATTACAT
CTCAGTGTATGAATGTGAGACCCATCATTTCGGGGATACAAAATCCTCAAACCTGCTCTTCCCCTTACATTTTTCCATGATTTGTTCATTATCCTAA

Protein sequence

MAEAAAATSHHTSLHIAMYPWFALGHLTPFLHLSNKLAKKGHTISFFIPTKTLPKFEPLNLFPNLITFIPINVPHVHGLPHGAETTSDVPYPLHNLIMTSMDLTQPQITHLLQTLKPHLILFDFTHWLPKLASQLAIKSIHYCVTSAAMIAYTLTPSRQFSKIELTEEDLMKPPFGYPSSTINLHLHEARLFASKRKWKFGSDVLFYDRQFISFSECDAIGFRTCHEIEGDFVSYLQIEFKKPILLTGSVLCEPLDTPLEEKWQSWLSGFKEGSVVYCAFGSECTLQMEQFQELLMGFELLNMPFLAALKPPFGADTIEAAFPEEFAQRVGSRGVVFGGWIQQERILEHPSVGCFVTHCGSNSLKEALVNKCQLVLLPQVGDQIINARMMGNNLRVGVEVEKREENGWFTKESVCKAVKIVMDEENEIGKEKYADIVSGVDVTKPLKDAWNDACASVRPSAIVILRGIFKNFHVNILGCNNLTFHGVNIIAPENSPNTDGIHIGRSIGISITKSRISTGDVTNVTCGPGHGISIGSLGKYTNEGHVEGIIVKNCTIMNTTNGVRIKTWPASPVAGTATNMHFSNIIMVNVSNPILIDQEYCPWNQCNRKIPSKIKISKMSFKNIRGSFANPMAVKLICNSDLPCDEVKVANIDLVNNGKEGPRTHYISVYECETHHFGDTKSSNLLFPLHFSMICSLS
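
The protein sequence corresponds to a codon-by-codon translation of the CDS. The sketch below translates the first 36 nucleotides of the CDS and compares the result with the start of the protein sequence. It assumes the standard genetic code (NCBI translation table 1), which the record does not name explicitly; `translate` is a hypothetical helper written for this illustration.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


cds_start = "ATGGCTGAGGCCGCAGCCGCCACCTCCCACCATACC"  # first 36 nt of the CDS
protein_start = "MAEAAAATSHHT"                      # first 12 residues of the protein

print(translate(cds_start) == protein_start)  # True
```

The same function can be applied to the full CDS once its two lines are joined into one string.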
BLAST of Cla97C02G029020 vs. NCBI nr
Match: KGN43443.1 (hypothetical protein Csa_7G037470 [Cucumis sativus])

HSP 1 Score: 805.1 bits (2078), Expect = 1.9e-229
Identity = 393/432 (90.97%), Positives = 411/432 (95.14%), Query Frame = 0

Query: 2   AEAAA-ATSHHTSLHIAMYPWFALGHLTPFLHLSNKLAKKGHTISFFIPTKTLPKFEPLN 61
           AEAAA AT+ HTSLHIAMYPWFALGHLTPFLHLSNKLAKKGH ISFFIPTKTLPKFEPLN
Sbjct: 3   AEAAATATARHTSLHIAMYPWFALGHLTPFLHLSNKLAKKGHKISFFIPTKTLPKFEPLN 62

Query: 62  LFPNLITFIPINVPHVHGLPHGAETTSDVPYPLHNLIMTSMDLTQPQITHLLQTLKPHLI 121
           LFPNLITFIP+ VPHVHGLPHGAETT DVPYPLHNLIMTSMDLTQPQIT LLQTLKPHLI
Sbjct: 63  LFPNLITFIPVIVPHVHGLPHGAETTCDVPYPLHNLIMTSMDLTQPQITLLLQTLKPHLI 122

Query: 122 LFDFTHWLPKLASQLAIKSIHYCVTSAAMIAYTLTPSRQFSKIELTEEDLMKPPFGYPSS 181
           LFDFTHWLPKLASQL IKSIHYCVTSAAMIAYTLTPSRQF K ELTEEDLMKPP GYPSS
Sbjct: 123 LFDFTHWLPKLASQLGIKSIHYCVTSAAMIAYTLTPSRQFYKNELTEEDLMKPPVGYPSS 182

Query: 182 TINLHLHEARLFASKRKWKFGSDVLFYDRQFISFSECDAIGFRTCHEIEGDFVSYLQIEF 241
           TINLH HEAR+FASKRKWKFGSDVLFYDRQF+SFS+CDAIGFRTCHEIEGDFV+YLQ EF
Sbjct: 183 TINLHPHEARVFASKRKWKFGSDVLFYDRQFVSFSDCDAIGFRTCHEIEGDFVNYLQFEF 242

Query: 242 KKPILLTGSVLCEPLD-TPLEEKWQSWLSGFKEGSVVYCAFGSECTLQMEQFQELLMGFE 301
           +KP+LLTGSVL E L+   LEEKW+SWL GFKEGSVVYCAFGSECTLQMEQFQELLMGFE
Sbjct: 243 RKPVLLTGSVLPETLNPEALEEKWESWLLGFKEGSVVYCAFGSECTLQMEQFQELLMGFE 302

Query: 302 LLNMPFLAALKPPFGADTIEAAFPEEFAQRVGSRGVVFGGWIQQERILEHPSVGCFVTHC 361
           LL+MPFLAALKPPFGA+T+EAA PE FA+RVG RGVV+GGWIQQERILEHPSVGCFVTHC
Sbjct: 303 LLDMPFLAALKPPFGAETVEAALPEGFAKRVGGRGVVYGGWIQQERILEHPSVGCFVTHC 362

Query: 362 GSNSLKEALVNKCQLVLLPQVGDQIINARMMGNNLRVGVEVEKREENGWFTKESVCKAVK 421
           GSNSLKEALVNKCQLVLLPQVGDQIINARMMGNNLRVGVEVEKR+E+GWFTKESVCKAVK
Sbjct: 363 GSNSLKEALVNKCQLVLLPQVGDQIINARMMGNNLRVGVEVEKRQEDGWFTKESVCKAVK 422

Query: 422 IVMDEENEIGKE 432
           IVMDE+NEIGKE
Sbjct: 423 IVMDEDNEIGKE 434
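
The percentages in each HSP summary are the matched counts divided by the alignment length. The sketch below recomputes them from a summary line and returns them next to the reported values. `check_stats` is a hypothetical helper; the pattern accepts either spelling of the "Positives" label, since both occur in BLAST-style reports.

```python
import re

STATS_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_stats(line):
    """Return (computed identity %, computed positives %, reported identity %, reported positives %)."""
    match = STATS_RE.search(line)
    if match is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    computed_ident = round(100 * int(ident) / int(ident_len), 2)
    computed_pos = round(100 * int(pos) / int(pos_len), 2)
    return computed_ident, computed_pos, float(ident_pct), float(pos_pct)


line = "Identity = 393/432 (90.97%), Positives = 411/432 (95.14%), Query Frame = 0"
print(check_stats(line))  # (90.97, 95.14, 90.97, 95.14)
```

For the KGN43443.1 hit, 393 identical positions over 432 aligned columns gives 90.97%, and 411 conservative-or-identical positions gives 95.14%, matching the reported values.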

BLAST of Cla97C02G029020 vs. NCBI nr
Match: XP_008465522.1 (PREDICTED: anthocyanidin 3-O-glucoside 2''-O-glucosyltransferase-like [Cucumis melo])

HSP 1 Score: 798.9 bits (2062), Expect = 1.4e-227
Identity = 398/471 (84.50%), Positives = 422/471 (89.60%), Query Frame = 0

Query: 1   MAEAAAAT----SHHTSLHIAMYPWFALGHLTPFLHLSNKLAKKGHTISFFIPTKTLPKF 60
           MA  AAAT    S HT LHIAMYPWFALGHLTPFLHLSNKLAKKGH ISFFIPTKTLPKF
Sbjct: 1   MAAEAAATAXXXSSHTCLHIAMYPWFALGHLTPFLHLSNKLAKKGHKISFFIPTKTLPKF 60

Query: 61  EPLNLFPNLITFIPINVPHVHGLPHGAETTSDVPYPLHNLIMTSMDLTQPQITHLLQTLK 120
           EPLNLFPNLITFIPI VPHV GLP GAETT DVPYPLHNLIMTSMDLTQPQIT LLQ+LK
Sbjct: 61  EPLNLFPNLITFIPIIVPHVDGLPRGAETTCDVPYPLHNLIMTSMDLTQPQITLLLQSLK 120

Query: 121 PHLILFDFTHWLPKLASQLAIKSIHYCVTSAAMIAYTLTPSRQFSKIELTEEDLMKPPFG 180
           PHLILFDFTHWLPKLASQL IKSIHYCVTSAAMIAYTLTPSRQFSK ELTEEDLMKPP G
Sbjct: 121 PHLILFDFTHWLPKLASQLGIKSIHYCVTSAAMIAYTLTPSRQFSKNELTEEDLMKPPIG 180

Query: 181 YPSSTINLHLHEARLFASKRKWKFGSDVLFYDRQFISFSECDAIGFRTCHEIEGDFVSYL 240
           YPSSTINLH HEAR+FASKRKWKFGSDVLFYDRQF+SFS+CDAIGFRTCHEIEGDFV+YL
Sbjct: 181 YPSSTINLHPHEARVFASKRKWKFGSDVLFYDRQFVSFSDCDAIGFRTCHEIEGDFVNYL 240

Query: 241 QIEFKKPILLTGSVLCEPLD-TPLEEKWQSWLSGFKEGSVVYCAFGSECTLQMEQFQELL 300
           Q EF+KPILLTGSVL EPL+   LEEKW+SWL GFKEGSVVYCAFGSECTLQMEQFQELL
Sbjct: 241 QTEFRKPILLTGSVLPEPLNPEALEEKWESWLLGFKEGSVVYCAFGSECTLQMEQFQELL 300

Query: 301 MGFELLNMPFLAALKPPFGADTIEAAFPEEFAQRVGSRGVVFGGWIQQERILEHPSVGCF 360
           MGFELL+MPFLAALKPPFGA+T+EAA PE F +RVG RGVV+GGWIQQERILEHPSVGCF
Sbjct: 301 MGFELLDMPFLAALKPPFGAETVEAALPEGFTKRVGGRGVVYGGWIQQERILEHPSVGCF 360

Query: 361 VTHCGSNSLKEALVNKCQLVLLPQVGDQIINARMMGNNLRVGVEVEKREENGWFTKESVC 420
           VTHCGSNSLKEALVNKCQLVLLPQVGDQIINARMMG+NLRVGVEVEKREE+GWFTKESVC
Sbjct: 361 VTHCGSNSLKEALVNKCQLVLLPQVGDQIINARMMGSNLRVGVEVEKREEDGWFTKESVC 420

Query: 421 KAVKIVMDEENEIGKE------KYADIVSGVDVTKPLKDAWNDACASVRPS 461
           KAVKIVMDE+NEIGKE      K  D++   D+ +   D+++     + PS
Sbjct: 421 KAVKIVMDEDNEIGKEVRTNHSKIRDLLLKKDLEQSYIDSFSHNLCDLVPS 471

BLAST of Cla97C02G029020 vs. NCBI nr
Match: XP_004144610.2 (PREDICTED: anthocyanidin 3-O-glucoside 2''-O-glucosyltransferase-like [Cucumis sativus])

HSP 1 Score: 785.0 bits (2026), Expect = 2.1e-223
Identity = 379/415 (91.33%), Positives = 396/415 (95.42%), Query Frame = 0

Query: 18  MYPWFALGHLTPFLHLSNKLAKKGHTISFFIPTKTLPKFEPLNLFPNLITFIPINVPHVH 77
           MYPWFALGHLTPFLHLSNKLAKKGH ISFFIPTKTLPKFEPLNLFPNLITFIP+ VPHVH
Sbjct: 1   MYPWFALGHLTPFLHLSNKLAKKGHKISFFIPTKTLPKFEPLNLFPNLITFIPVIVPHVH 60

Query: 78  GLPHGAETTSDVPYPLHNLIMTSMDLTQPQITHLLQTLKPHLILFDFTHWLPKLASQLAI 137
           GLPHGAETT DVPYPLHNLIMTSMDLTQPQIT LLQTLKPHLILFDFTHWLPKLASQL I
Sbjct: 61  GLPHGAETTCDVPYPLHNLIMTSMDLTQPQITLLLQTLKPHLILFDFTHWLPKLASQLGI 120

Query: 138 KSIHYCVTSAAMIAYTLTPSRQFSKIELTEEDLMKPPFGYPSSTINLHLHEARLFASKRK 197
           KSIHYCVTSAAMIAYTLTPSRQF K ELTEEDLMKPP GYPSSTINLH HEAR+FASKRK
Sbjct: 121 KSIHYCVTSAAMIAYTLTPSRQFYKNELTEEDLMKPPVGYPSSTINLHPHEARVFASKRK 180

Query: 198 WKFGSDVLFYDRQFISFSECDAIGFRTCHEIEGDFVSYLQIEFKKPILLTGSVLCEPLD- 257
           WKFGSDVLFYDRQF+SFS+CDAIGFRTCHEIEGDFV+YLQ EF+KP+LLTGSVL E L+ 
Sbjct: 181 WKFGSDVLFYDRQFVSFSDCDAIGFRTCHEIEGDFVNYLQFEFRKPVLLTGSVLPETLNP 240

Query: 258 TPLEEKWQSWLSGFKEGSVVYCAFGSECTLQMEQFQELLMGFELLNMPFLAALKPPFGAD 317
             LEEKW+SWL GFKEGSVVYCAFGSECTLQMEQFQELLMGFELL+MPFLAALKPPFGA+
Sbjct: 241 EALEEKWESWLLGFKEGSVVYCAFGSECTLQMEQFQELLMGFELLDMPFLAALKPPFGAE 300

Query: 318 TIEAAFPEEFAQRVGSRGVVFGGWIQQERILEHPSVGCFVTHCGSNSLKEALVNKCQLVL 377
           T+EAA PE FA+RVG RGVV+GGWIQQERILEHPSVGCFVTHCGSNSLKEALVNKCQLVL
Sbjct: 301 TVEAALPEGFAKRVGGRGVVYGGWIQQERILEHPSVGCFVTHCGSNSLKEALVNKCQLVL 360

Query: 378 LPQVGDQIINARMMGNNLRVGVEVEKREENGWFTKESVCKAVKIVMDEENEIGKE 432
           LPQVGDQIINARMMGNNLRVGVEVEKR+E+GWFTKESVCKAVKIVMDE+NEIGKE
Sbjct: 361 LPQVGDQIINARMMGNNLRVGVEVEKRQEDGWFTKESVCKAVKIVMDEDNEIGKE 415

BLAST of Cla97C02G029020 vs. NCBI nr
Match: XP_023530310.1 (UDP-glycosyltransferase 79B30-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 758.8 bits (1958), Expect = 1.6e-215
Identity = 370/451 (82.04%), Positives = 401/451 (88.91%), Query Frame = 0

Query: 4   AAAATSHHTSLHIAMYPWFALGHLTPFLHLSNKLAKKGHTISFFIPTKTLPKFEPLNLFP 63
           AAAAT   TSLH AMYPWFALGHLTPFLHLSNKLAKKGH ISFFIPTKTLPK +PLN FP
Sbjct: 2   AAAAT---TSLHFAMYPWFALGHLTPFLHLSNKLAKKGHKISFFIPTKTLPKLQPLNQFP 61

Query: 64  NLITFIPINVPHVHGLPHGAETTSDVPYPLHNLIMTSMDLTQPQITHLLQTLKPHLILFD 123
           NLI FIPI VPHV GLPHGAETTSDVPYPLH LIMT+MDLTQ QI  LL+ LKPHLI FD
Sbjct: 62  NLIAFIPITVPHVEGLPHGAETTSDVPYPLHTLIMTAMDLTQSQINLLLEDLKPHLIFFD 121

Query: 124 FTHWLPKLASQLAIKSIHYCVTSAAMIAYTLTPSRQFSKIELTEEDLMKPPFGYPSSTIN 183
           FTHWLP+LA QL IKSIHYCVTSAAMIAYTL PSRQFSK ELTEEDLM PP GYPSS I 
Sbjct: 122 FTHWLPQLARQLGIKSIHYCVTSAAMIAYTLAPSRQFSKNELTEEDLMNPPLGYPSSNIK 181

Query: 184 LHLHEARLFASKRKWKFGSDVLFYDRQFISFSECDAIGFRTCHEIEGDFVSYLQIEFKKP 243
           LH HEA++FAS+RKWKFGSDVLFYDRQFISFSECDAIGFRTCHEIEGDFV+YLQIEFKKP
Sbjct: 182 LHAHEAKVFASRRKWKFGSDVLFYDRQFISFSECDAIGFRTCHEIEGDFVNYLQIEFKKP 241

Query: 244 ILLTGSVLCEPLDTPLEEKWQSWLSGFKEGSVVYCAFGSECTLQMEQFQELLMGFELLNM 303
           I L+GSV+ EPL TPLEEKW SWL GF +GSV+YCAFGSECTL++EQFQELL+GFEL NM
Sbjct: 242 IFLSGSVIPEPLTTPLEEKWASWLLGFTKGSVIYCAFGSECTLKIEQFQELLLGFELSNM 301

Query: 304 PFLAALKPPFGADTIEAAFPEEFAQRVGSRGVVFGGWIQQERILEHPSVGCFVTHCGSNS 363
           PFLAALKPPFG D+IE A P+EF +R+G RGVV GGW+QQERILEHPSVGCFV+HCGSNS
Sbjct: 302 PFLAALKPPFGVDSIEDALPKEFVERIGGRGVVHGGWVQQERILEHPSVGCFVSHCGSNS 361

Query: 364 LKEALVNKCQLVLLPQVGDQIINARMMGNNLRVGVEVEKREENGWFTKESVCKAVKIVMD 423
           LKEALVNKCQLVLLPQVGDQIINARMMGNNLRVGVEVEKREE+G FTKESVCKAV+IVM+
Sbjct: 362 LKEALVNKCQLVLLPQVGDQIINARMMGNNLRVGVEVEKREEDGLFTKESVCKAVRIVME 421

Query: 424 EENEIGKE--KYADIVSGVDVTKPLKDAWND 453
           E+NEIGKE  K  D +  + +TK L+ ++ D
Sbjct: 422 EDNEIGKEVRKNHDKIRDLLLTKDLEQSYMD 449

BLAST of Cla97C02G029020 vs. NCBI nr
Match: XP_022930691.1 (UDP-glycosyltransferase 79B30-like [Cucurbita moschata])

HSP 1 Score: 757.7 bits (1955), Expect = 3.5e-215
Identity = 369/451 (81.82%), Positives = 401/451 (88.91%), Query Frame = 0

Query: 4   AAAATSHHTSLHIAMYPWFALGHLTPFLHLSNKLAKKGHTISFFIPTKTLPKFEPLNLFP 63
           AAAAT   TSLH AMYPWFALGHLTPFLHLSNKLAKKGH ISFFIPTKTLPK +PLN FP
Sbjct: 2   AAAAT---TSLHFAMYPWFALGHLTPFLHLSNKLAKKGHKISFFIPTKTLPKLQPLNQFP 61

Query: 64  NLITFIPINVPHVHGLPHGAETTSDVPYPLHNLIMTSMDLTQPQITHLLQTLKPHLILFD 123
           NLI FIPI VPHV GLPHGAETTSDVPYPLH LIMT+MDLTQ QI  LL+ LKPHLI FD
Sbjct: 62  NLIAFIPITVPHVEGLPHGAETTSDVPYPLHTLIMTAMDLTQSQINLLLEDLKPHLIFFD 121

Query: 124 FTHWLPKLASQLAIKSIHYCVTSAAMIAYTLTPSRQFSKIELTEEDLMKPPFGYPSSTIN 183
           FTHWLP+LA Q  IKSIHYCVT+AAMIAYTL PSRQFSK ELTEEDLM PP GYPSSTI 
Sbjct: 122 FTHWLPQLARQFGIKSIHYCVTTAAMIAYTLAPSRQFSKNELTEEDLMNPPLGYPSSTIK 181

Query: 184 LHLHEARLFASKRKWKFGSDVLFYDRQFISFSECDAIGFRTCHEIEGDFVSYLQIEFKKP 243
           LH HEA++FAS+RKWKFGSDVLFYDRQFISFSECDAIGFRTCHEIEGDFV+YLQIEFKKP
Sbjct: 182 LHAHEAKVFASRRKWKFGSDVLFYDRQFISFSECDAIGFRTCHEIEGDFVNYLQIEFKKP 241

Query: 244 ILLTGSVLCEPLDTPLEEKWQSWLSGFKEGSVVYCAFGSECTLQMEQFQELLMGFELLNM 303
           I L+GSV+ EPL TPLEEKW SWL GF +GSV+YCAFGSECTL++EQFQELL+GFEL NM
Sbjct: 242 IFLSGSVIPEPLTTPLEEKWASWLLGFTKGSVIYCAFGSECTLKIEQFQELLLGFELSNM 301

Query: 304 PFLAALKPPFGADTIEAAFPEEFAQRVGSRGVVFGGWIQQERILEHPSVGCFVTHCGSNS 363
           PFLAALKPPFG D+IE A P+EF +R+G RGVV GGW+QQERILEHPSVGCFV+HCGSNS
Sbjct: 302 PFLAALKPPFGVDSIEDALPKEFVERIGGRGVVHGGWVQQERILEHPSVGCFVSHCGSNS 361

Query: 364 LKEALVNKCQLVLLPQVGDQIINARMMGNNLRVGVEVEKREENGWFTKESVCKAVKIVMD 423
           LKEALVNKCQLVLLPQVGDQIINARMMGNNLRVGVEVEKREE+G FTKESVCKAV+IVM+
Sbjct: 362 LKEALVNKCQLVLLPQVGDQIINARMMGNNLRVGVEVEKREEDGLFTKESVCKAVRIVME 421

Query: 424 EENEIGKE--KYADIVSGVDVTKPLKDAWND 453
           E+NEIGKE  K  D +  + +TK L+ ++ D
Sbjct: 422 EDNEIGKEVRKNHDKIRDLLLTKDLEQSYID 449

BLAST of Cla97C02G029020 vs. TrEMBL
Match: tr|A0A0A0K1W3|A0A0A0K1W3_CUCSA (Glycosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_7G037470 PE=3 SV=1)

HSP 1 Score: 805.1 bits (2078), Expect = 1.3e-229
Identity = 393/432 (90.97%), Positives = 411/432 (95.14%), Query Frame = 0

Query: 2   AEAAA-ATSHHTSLHIAMYPWFALGHLTPFLHLSNKLAKKGHTISFFIPTKTLPKFEPLN 61
           AEAAA AT+ HTSLHIAMYPWFALGHLTPFLHLSNKLAKKGH ISFFIPTKTLPKFEPLN
Sbjct: 3   AEAAATATARHTSLHIAMYPWFALGHLTPFLHLSNKLAKKGHKISFFIPTKTLPKFEPLN 62

Query: 62  LFPNLITFIPINVPHVHGLPHGAETTSDVPYPLHNLIMTSMDLTQPQITHLLQTLKPHLI 121
           LFPNLITFIP+ VPHVHGLPHGAETT DVPYPLHNLIMTSMDLTQPQIT LLQTLKPHLI
Sbjct: 63  LFPNLITFIPVIVPHVHGLPHGAETTCDVPYPLHNLIMTSMDLTQPQITLLLQTLKPHLI 122

Query: 122 LFDFTHWLPKLASQLAIKSIHYCVTSAAMIAYTLTPSRQFSKIELTEEDLMKPPFGYPSS 181
           LFDFTHWLPKLASQL IKSIHYCVTSAAMIAYTLTPSRQF K ELTEEDLMKPP GYPSS
Sbjct: 123 LFDFTHWLPKLASQLGIKSIHYCVTSAAMIAYTLTPSRQFYKNELTEEDLMKPPVGYPSS 182

Query: 182 TINLHLHEARLFASKRKWKFGSDVLFYDRQFISFSECDAIGFRTCHEIEGDFVSYLQIEF 241
           TINLH HEAR+FASKRKWKFGSDVLFYDRQF+SFS+CDAIGFRTCHEIEGDFV+YLQ EF
Sbjct: 183 TINLHPHEARVFASKRKWKFGSDVLFYDRQFVSFSDCDAIGFRTCHEIEGDFVNYLQFEF 242

Query: 242 KKPILLTGSVLCEPLD-TPLEEKWQSWLSGFKEGSVVYCAFGSECTLQMEQFQELLMGFE 301
           +KP+LLTGSVL E L+   LEEKW+SWL GFKEGSVVYCAFGSECTLQMEQFQELLMGFE
Sbjct: 243 RKPVLLTGSVLPETLNPEALEEKWESWLLGFKEGSVVYCAFGSECTLQMEQFQELLMGFE 302

Query: 302 LLNMPFLAALKPPFGADTIEAAFPEEFAQRVGSRGVVFGGWIQQERILEHPSVGCFVTHC 361
           LL+MPFLAALKPPFGA+T+EAA PE FA+RVG RGVV+GGWIQQERILEHPSVGCFVTHC
Sbjct: 303 LLDMPFLAALKPPFGAETVEAALPEGFAKRVGGRGVVYGGWIQQERILEHPSVGCFVTHC 362

Query: 362 GSNSLKEALVNKCQLVLLPQVGDQIINARMMGNNLRVGVEVEKREENGWFTKESVCKAVK 421
           GSNSLKEALVNKCQLVLLPQVGDQIINARMMGNNLRVGVEVEKR+E+GWFTKESVCKAVK
Sbjct: 363 GSNSLKEALVNKCQLVLLPQVGDQIINARMMGNNLRVGVEVEKRQEDGWFTKESVCKAVK 422

Query: 422 IVMDEENEIGKE 432
           IVMDE+NEIGKE
Sbjct: 423 IVMDEDNEIGKE 434

BLAST of Cla97C02G029020 vs. TrEMBL
Match: tr|A0A1S3CP23|A0A1S3CP23_CUCME (Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103503146 PE=3 SV=1)

HSP 1 Score: 798.9 bits (2062), Expect = 9.2e-228
Identity = 398/471 (84.50%), Positives = 422/471 (89.60%), Query Frame = 0

Query: 1   MAEAAAAT----SHHTSLHIAMYPWFALGHLTPFLHLSNKLAKKGHTISFFIPTKTLPKF 60
           MA  AAAT    S HT LHIAMYPWFALGHLTPFLHLSNKLAKKGH ISFFIPTKTLPKF
Sbjct: 1   MAAEAAATAXXXSSHTCLHIAMYPWFALGHLTPFLHLSNKLAKKGHKISFFIPTKTLPKF 60

Query: 61  EPLNLFPNLITFIPINVPHVHGLPHGAETTSDVPYPLHNLIMTSMDLTQPQITHLLQTLK 120
           EPLNLFPNLITFIPI VPHV GLP GAETT DVPYPLHNLIMTSMDLTQPQIT LLQ+LK
Sbjct: 61  EPLNLFPNLITFIPIIVPHVDGLPRGAETTCDVPYPLHNLIMTSMDLTQPQITLLLQSLK 120

Query: 121 PHLILFDFTHWLPKLASQLAIKSIHYCVTSAAMIAYTLTPSRQFSKIELTEEDLMKPPFG 180
           PHLILFDFTHWLPKLASQL IKSIHYCVTSAAMIAYTLTPSRQFSK ELTEEDLMKPP G
Sbjct: 121 PHLILFDFTHWLPKLASQLGIKSIHYCVTSAAMIAYTLTPSRQFSKNELTEEDLMKPPIG 180

Query: 181 YPSSTINLHLHEARLFASKRKWKFGSDVLFYDRQFISFSECDAIGFRTCHEIEGDFVSYL 240
           YPSSTINLH HEAR+FASKRKWKFGSDVLFYDRQF+SFS+CDAIGFRTCHEIEGDFV+YL
Sbjct: 181 YPSSTINLHPHEARVFASKRKWKFGSDVLFYDRQFVSFSDCDAIGFRTCHEIEGDFVNYL 240

Query: 241 QIEFKKPILLTGSVLCEPLD-TPLEEKWQSWLSGFKEGSVVYCAFGSECTLQMEQFQELL 300
           Q EF+KPILLTGSVL EPL+   LEEKW+SWL GFKEGSVVYCAFGSECTLQMEQFQELL
Sbjct: 241 QTEFRKPILLTGSVLPEPLNPEALEEKWESWLLGFKEGSVVYCAFGSECTLQMEQFQELL 300

Query: 301 MGFELLNMPFLAALKPPFGADTIEAAFPEEFAQRVGSRGVVFGGWIQQERILEHPSVGCF 360
           MGFELL+MPFLAALKPPFGA+T+EAA PE F +RVG RGVV+GGWIQQERILEHPSVGCF
Sbjct: 301 MGFELLDMPFLAALKPPFGAETVEAALPEGFTKRVGGRGVVYGGWIQQERILEHPSVGCF 360

Query: 361 VTHCGSNSLKEALVNKCQLVLLPQVGDQIINARMMGNNLRVGVEVEKREENGWFTKESVC 420
           VTHCGSNSLKEALVNKCQLVLLPQVGDQIINARMMG+NLRVGVEVEKREE+GWFTKESVC
Sbjct: 361 VTHCGSNSLKEALVNKCQLVLLPQVGDQIINARMMGSNLRVGVEVEKREEDGWFTKESVC 420

Query: 421 KAVKIVMDEENEIGKE------KYADIVSGVDVTKPLKDAWNDACASVRPS 461
           KAVKIVMDE+NEIGKE      K  D++   D+ +   D+++     + PS
Sbjct: 421 KAVKIVMDEDNEIGKEVRTNHSKIRDLLLKKDLEQSYIDSFSHNLCDLVPS 471

BLAST of Cla97C02G029020 vs. TrEMBL
Match: tr|A0A0D5ZD63|A0A0D5ZD63_PANGI (Glycosyltransferase OS=Panax ginseng OX=4054 PE=2 SV=1)

HSP 1 Score: 588.6 bits (1516), Expect = 1.9e-164
Identity = 280/419 (66.83%), Positives = 330/419 (78.76%), Query Frame = 0

Query: 13  SLHIAMYPWFALGHLTPFLHLSNKLAKKGHTISFFIPTKTLPKFEPLNLFPNLITFIPIN 72
           S HIAM+PWFALGHLTPFLHLSNKLAK+GH +SF IPTKT PK +  NL P+LITFIPI 
Sbjct: 5   SFHIAMFPWFALGHLTPFLHLSNKLAKQGHRVSFLIPTKTQPKLQSFNLHPDLITFIPIT 64

Query: 73  VPHVHGLPHGAETTSDVPYPLHNLIMTSMDLTQPQITHLLQTLKPHLILFDFTHWLPKLA 132
           VPHV GLP G+ETTSDVP+PL  L++T+MD T+  +  LL  LK  ++LFDF HW+P LA
Sbjct: 65  VPHVDGLPRGSETTSDVPFPLQTLLVTAMDYTEDHVECLLYDLKVDVVLFDFAHWIPGLA 124

Query: 133 SQLAIKSIHYCVTSAAMIAYTLTPSRQFSKIELTEEDLMKPPFGYPSSTINLHLHEARLF 192
            +L IKSIHYC+ S A I YTL+P RQ +  ++TE DLMKPP  YP S I LH HEAR F
Sbjct: 125 RRLGIKSIHYCIISPATIGYTLSPERQLNGDKITEADLMKPPANYPGSNITLHAHEARAF 184

Query: 193 ASKRKWKFGSDVLFYDRQFISFSECDAIGFRTCHEIEGDFVSYLQIEFKKPILLTGSVLC 252
           AS+R  KFG++ LF DRQFIS S+CDA+GFRTC EIEG +  YL+ +F KP+LL+G V+ 
Sbjct: 185 ASRRVMKFGNNTLFNDRQFISLSQCDALGFRTCREIEGPYCDYLESQFGKPVLLSGPVIP 244

Query: 253 EPLDTPLEEKWQSWLSGFKEGSVVYCAFGSECTLQMEQFQELLMGFELLNMPFLAALKPP 312
           EP  +PLE KW  WLS F  GSV+YCAFGSEC L+M QFQELL G EL  MPFLAALKPP
Sbjct: 245 EPPTSPLEXKWAKWLSKFSLGSVIYCAFGSECILKMYQFQELLYGLELTGMPFLAALKPP 304

Query: 313 FGADTIEAAFPEEFAQRVGSRGVVFGGWIQQERILEHPSVGCFVTHCGSNSLKEALVNKC 372
            GA++IE A P++F +R+  RGVV  GW+QQ+ IL HPSVGCF+THCGS SL EALVNKC
Sbjct: 305 AGAESIEEALPDKFEERIKGRGVVHEGWVQQQLILGHPSVGCFITHCGSGSLAEALVNKC 364

Query: 373 QLVLLPQVGDQIINARMMGNNLRVGVEVEKREENGWFTKESVCKAVKIVMDEENEIGKE 432
           QLVLLPQVGDQIINARMM  NL+VGVEVEK EE+G FT+ESVCKAV  V  E+N++GKE
Sbjct: 365 QLVLLPQVGDQIINARMMSQNLKVGVEVEKGEEDGVFTRESVCKAVGNVTQEDNQVGKE 423

BLAST of Cla97C02G029020 vs. TrEMBL
Match: tr|A0A1S3TG55|A0A1S3TG55_VIGRR (Glycosyltransferase OS=Vigna radiata var. radiata OX=3916 GN=LOC106755140 PE=3 SV=1)

HSP 1 Score: 572.8 bits (1475), Expect = 1.1e-159
Identity = 271/418 (64.83%), Positives = 324/418 (77.51%), Query Frame = 0

Query: 14  LHIAMYPWFALGHLTPFLHLSNKLAKKGHTISFFIPTKTLPKFEPLNLFPNLITFIPINV 73
           LHIAMYPW A+GH T FLHL NKLA +GH ISF  PTK   K EP NL P+LITF+ I V
Sbjct: 6   LHIAMYPWLAMGHQTAFLHLCNKLAIRGHRISFITPTKAQAKLEPYNLHPHLITFVTITV 65

Query: 74  PHVHGLPHGAETTSDVPYPLHNLIMTSMDLTQPQITHLLQTLKPHLILFDFTHWLPKLAS 133
           PHV GLP  A+TT+DV YPL   IMT+MDLT+  I  LL  LKP L+ +DFTHW+P L  
Sbjct: 66  PHVEGLPPDAQTTADVTYPLQPNIMTAMDLTRDDIETLLTDLKPQLVFYDFTHWMPTLTK 125

Query: 134 QLAIKSIHYCVTSAAMIAYTLTPSRQFSKIELTEEDLMKPPFGYPSSTINLHLHEARLFA 193
           +L IK++HYC  S+ MI YTL P+R +  I+L+E DLM+PP GYP S+I LH HEAR FA
Sbjct: 126 KLGIKAVHYCTASSVMIGYTLPPARYYQGIDLSETDLMEPPEGYPDSSIKLHAHEARAFA 185

Query: 194 SKRKWKFGSDVLFYDRQFISFSECDAIGFRTCHEIEGDFVSYLQIEFKKPILLTGSVLCE 253
            KRK  FGSDVLFYDRQFI+ +E D + +RTC EIEG ++ Y+  +FKKP+L TG V+ E
Sbjct: 186 GKRKDTFGSDVLFYDRQFIALNEADVLAYRTCREIEGPYLDYIGSQFKKPVLATGPVILE 245

Query: 254 PLDTPLEEKWQSWLSGFKEGSVVYCAFGSECTLQMEQFQELLMGFELLNMPFLAALKPPF 313
           P  + LEEK+ +WL GF+EGSVVYC FGSECTL++EQFQEL++G EL  MPFLAA+KPP 
Sbjct: 246 PPTSELEEKFSTWLGGFEEGSVVYCCFGSECTLRLEQFQELVLGLELTGMPFLAAVKPPV 305

Query: 314 GADTIEAAFPEEFAQRVGSRGVVFGGWIQQERILEHPSVGCFVTHCGSNSLKEALVNKCQ 373
           G +T+E+A PE F +RV  RG V+GGW+ Q+ IL HPSVGCF+THCGS SL EALVNKCQ
Sbjct: 306 GFETVESAMPEGFEERVKGRGFVYGGWVMQQLILAHPSVGCFITHCGSGSLSEALVNKCQ 365

Query: 374 LVLLPQVGDQIINARMMGNNLRVGVEVEKREENGWFTKESVCKAVKIVMDEENEIGKE 432
           LVLLP VGDQI+NARMM NNL VGVEVEK ++NG +TKESVCKAV IVMDEENE  K+
Sbjct: 366 LVLLPNVGDQILNARMMANNLEVGVEVEK-DDNGNYTKESVCKAVSIVMDEENETSKK 422

BLAST of Cla97C02G029020 vs. TrEMBL
Match: tr|A0A0L9V5Y2|A0A0L9V5Y2_PHAAN (Glycosyltransferase OS=Phaseolus angularis OX=3914 GN=LR48_Vigan08g091600 PE=3 SV=1)

HSP 1 Score: 569.3 bits (1466), Expect = 1.2e-158
Identity = 270/419 (64.44%), Positives = 323/419 (77.09%), Query Frame = 0

Query: 13  SLHIAMYPWFALGHLTPFLHLSNKLAKKGHTISFFIPTKTLPKFEPLNLFPNLITFIPIN 72
           SLHIAMYPW A+GH T FLHL NKLA +GH ISF  PTK   K EP NL P+LITF+ I 
Sbjct: 5   SLHIAMYPWLAMGHQTAFLHLCNKLAIRGHRISFITPTKAQAKLEPYNLHPHLITFVTIT 64

Query: 73  VPHVHGLPHGAETTSDVPYPLHNLIMTSMDLTQPQITHLLQTLKPHLILFDFTHWLPKLA 132
           VPHV GLP  A+TT+DV YPL   IMT+MDLT+  I  LL  LKP L+ +DFTHW+P L 
Sbjct: 65  VPHVEGLPPDAQTTADVTYPLQPNIMTAMDLTKDDIETLLTDLKPELVFYDFTHWMPTLT 124

Query: 133 SQLAIKSIHYCVTSAAMIAYTLTPSRQFSKIELTEEDLMKPPFGYPSSTINLHLHEARLF 192
            +L IK++HYC  S+ MI YTL P+R +  I+L+E +LM+PP GYP S+I LH HEAR F
Sbjct: 125 KKLGIKAVHYCTASSVMIGYTLPPARYYQGIDLSESNLMEPPEGYPDSSIKLHAHEARAF 184

Query: 193 ASKRKWKFGSDVLFYDRQFISFSECDAIGFRTCHEIEGDFVSYLQIEFKKPILLTGSVLC 252
           A KRK  FGSDVLFYDRQFI+ +E D + +RTC EIEG ++ Y+  +FKKP+L TG V+ 
Sbjct: 185 AGKRKDTFGSDVLFYDRQFIALNEADVLAYRTCREIEGPYLDYIGSQFKKPVLATGPVIL 244

Query: 253 EPLDTPLEEKWQSWLSGFKEGSVVYCAFGSECTLQMEQFQELLMGFELLNMPFLAALKPP 312
           EP  + LEEK+ +WL GF+EGSVVYC FGSECTL++EQFQEL++G EL  MPFLAA+K P
Sbjct: 245 EPPTSELEEKFSTWLGGFEEGSVVYCCFGSECTLRLEQFQELVLGLELTGMPFLAAVKAP 304

Query: 313 FGADTIEAAFPEEFAQRVGSRGVVFGGWIQQERILEHPSVGCFVTHCGSNSLKEALVNKC 372
            G +T+E+A PE F +R   RG V+GGW+ Q+ IL HPSVGCF+THCGS SL EALVNKC
Sbjct: 305 VGFETVESAMPEGFEERAKGRGFVYGGWVMQQLILAHPSVGCFITHCGSGSLSEALVNKC 364

Query: 373 QLVLLPQVGDQIINARMMGNNLRVGVEVEKREENGWFTKESVCKAVKIVMDEENEIGKE 432
           QLVLLP VGDQI+NARMM NNL VGVEVEK +ENG +TKESVCKAV IVMDEENE  K+
Sbjct: 365 QLVLLPNVGDQILNARMMANNLEVGVEVEK-DENGNYTKESVCKAVSIVMDEENETSKK 422

BLAST of Cla97C02G029020 vs. Swiss-Prot
Match: sp|I1KEV6|FG3H_SOYBN (UDP-glycosyltransferase 79B30 OS=Glycine max OX=3847 GN=FG3 PE=1 SV=2)

HSP 1 Score: 553.5 bits (1425), Expect = 3.3e-156
Identity = 264/422 (62.56%), Positives = 316/422 (74.88%), Query Frame = 0

Query: 14  LHIAMYPWFALGHLTPFLHLSNKLAKKGHTISFFIPTKTLPKFEPLNLFPNLITFIPINV 73
           LHIAMYPW A+GH T FLHL NKLA +GH ISF  P K   K EP NL PN ITF+ INV
Sbjct: 6   LHIAMYPWLAMGHQTAFLHLCNKLAIRGHKISFITPPKAQAKLEPFNLHPNSITFVTINV 65

Query: 74  PHVHGLPHGAETTSDVPYPLHNLIMTSMDLTQPQITHLLQTLKPHLILFDFTHWLPKLAS 133
           PHV GLP  A+TT+DV YPL   IMT+MDLT+  I  LL  LKP L+ +DFTHW+P LA 
Sbjct: 66  PHVEGLPPDAQTTADVTYPLQPQIMTAMDLTKDDIETLLTGLKPDLVFYDFTHWMPALAK 125

Query: 134 QLAIKSIHYCVTSAAMIAYTLTPSRQFSKIELTEEDLMKPPFGYPSSTINLHLHEARLFA 193
           +L IK++HYC  S+ M+ YTLTPSR     +L E DLM+PP GYP S+I L  HEAR FA
Sbjct: 126 RLGIKAVHYCTASSVMVGYTLTPSRFHQGTDLMESDLMEPPEGYPDSSIKLQTHEARTFA 185

Query: 194 SKRKWKFGSDVLFYDRQFISFSECDAIGFRTCHEIEGDFVSYLQIEFKKPILLTGSVLCE 253
           +KRK  FGS+VLFYDRQFI+ +E D + +RTC EIEG ++ Y+  +F KP++ TG V+ +
Sbjct: 186 AKRKDTFGSNVLFYDRQFIALNEADLLAYRTCREIEGPYMDYIGKQFNKPVVATGPVILD 245

Query: 254 PLDTPLEEKWQSWLSGFKEGSVVYCAFGSECTLQMEQFQELLMGFELLNMPFLAALKPPF 313
           P    LEEK+ +WL GF+ GSVVYC FGSECTL+  QF EL++G EL  MPFLAA+K P 
Sbjct: 246 PPTLDLEEKFSTWLGGFEPGSVVYCCFGSECTLRPNQFLELVLGLELTGMPFLAAVKAPL 305

Query: 314 GADTIEAAFPEEFAQRVGSRGVVFGGWIQQERILEHPSVGCFVTHCGSNSLKEALVNKCQ 373
           G +T+E+A PE F +RV  RG V+GGW+QQ+ IL HPSVGCF+THCGS SL EALVNKCQ
Sbjct: 306 GFETVESAMPEGFQERVKGRGFVYGGWVQQQLILAHPSVGCFITHCGSGSLSEALVNKCQ 365

Query: 374 LVLLPQVGDQIINARMMGNNLRVGVEVEKREENGWFTKESVCKAVKIVMDEENEIGKEKY 433
           LVLLP VGDQI+NARMMG NL VGVEVEK +E+G +TKESVCKAV IVMD ENE  K   
Sbjct: 366 LVLLPNVGDQILNARMMGTNLEVGVEVEKGDEDGMYTKESVCKAVSIVMDCENETSKRVR 425

Query: 434 AD 436
           A+
Sbjct: 426 AN 427

BLAST of Cla97C02G029020 vs. Swiss-Prot
Match: sp|A0A0G4DBR5|FG3N_SOYBN (UDP-glycosyltransferase 79B30 OS=Glycine max OX=3847 GN=FG3 PE=1 SV=1)

HSP 1 Score: 551.2 bits (1419), Expect = 1.6e-155
Identity = 263/422 (62.32%), Positives = 316/422 (74.88%), Query Frame = 0

Query: 14  LHIAMYPWFALGHLTPFLHLSNKLAKKGHTISFFIPTKTLPKFEPLNLFPNLITFIPINV 73
           LHIAMYPW A+GH   FLHL NKLA +GH ISF  P K   K EP NL PN ITF+ INV
Sbjct: 6   LHIAMYPWLAMGHQIAFLHLCNKLAIRGHKISFITPPKAQAKLEPFNLHPNSITFVTINV 65

Query: 74  PHVHGLPHGAETTSDVPYPLHNLIMTSMDLTQPQITHLLQTLKPHLILFDFTHWLPKLAS 133
           PHV GLP  A+TT+DV YPL   IMT+MDLT+  I  LL  LKP L+ +DFTHW+P LA 
Sbjct: 66  PHVEGLPPDAQTTADVTYPLQPQIMTAMDLTKDDIETLLTGLKPDLVFYDFTHWMPALAK 125

Query: 134 QLAIKSIHYCVTSAAMIAYTLTPSRQFSKIELTEEDLMKPPFGYPSSTINLHLHEARLFA 193
           +L IK++HYC  S+ MI YTLTP+R     +L E DLM+PP GYP S+I L  HEAR+FA
Sbjct: 126 RLGIKAVHYCTASSVMIGYTLTPARFHQGTDLMESDLMEPPEGYPDSSIKLQTHEARVFA 185

Query: 194 SKRKWKFGSDVLFYDRQFISFSECDAIGFRTCHEIEGDFVSYLQIEFKKPILLTGSVLCE 253
           +KRK  FGS+VLFYDRQFI+ +E D + +RTC EIEG ++ Y+  +F KP++ TG V+ +
Sbjct: 186 AKRKDTFGSNVLFYDRQFIALNEADLLAYRTCREIEGPYMDYIGKQFNKPVVATGPVILD 245

Query: 254 PLDTPLEEKWQSWLSGFKEGSVVYCAFGSECTLQMEQFQELLMGFELLNMPFLAALKPPF 313
           P    LEEK+ +WL GF+ GSVVYC FGSECTL+  QF EL++G EL  MPFLAA+K P 
Sbjct: 246 PPTLDLEEKFSTWLGGFEPGSVVYCCFGSECTLRPNQFLELVLGLELTGMPFLAAVKAPL 305

Query: 314 GADTIEAAFPEEFAQRVGSRGVVFGGWIQQERILEHPSVGCFVTHCGSNSLKEALVNKCQ 373
           G +T+E+A PE F +RV  RG V+GGW+QQ+ IL HPSVGCF+THCGS SL EALVNKCQ
Sbjct: 306 GFETVESAMPEGFQERVKGRGFVYGGWVQQQLILAHPSVGCFITHCGSGSLSEALVNKCQ 365

Query: 374 LVLLPQVGDQIINARMMGNNLRVGVEVEKREENGWFTKESVCKAVKIVMDEENEIGKEKY 433
           LVLLP VGDQI+NARMMG NL VGVEVEK +E+G +TKESVCKAV IVMD ENE  K   
Sbjct: 366 LVLLPNVGDQILNARMMGTNLEVGVEVEKGDEDGMYTKESVCKAVSIVMDCENETSKRVR 425

Query: 434 AD 436
           A+
Sbjct: 426 AN 427

BLAST of Cla97C02G029020 vs. Swiss-Prot
Match: sp|Q53UH4|DUSKY_IPONI (Anthocyanidin 3-O-glucoside 2''-O-glucosyltransferase OS=Ipomoea nil OX=35883 GN=3GGT PE=1 SV=1)

HSP 1 Score: 511.9 bits (1317), Expect = 1.1e-143
Identity = 247/449 (55.01%), Positives = 316/449 (70.38%), Query Frame = 0

Query: 9   SHHTSLHIAMYPWFALGHLTPFLHLSNKLAKKGHTISFFIPTKTLPKFEPLNLFPNLITF 68
           S  T+ H+AMYPWF +GHLT F  L+NKLA KGH ISF IP  T  K E  NL P+LI+F
Sbjct: 3   SQATTYHMAMYPWFGVGHLTGFFRLANKLAGKGHRISFLIPKNTQSKLESFNLHPHLISF 62

Query: 69  IPINVPHVHGLPHGAETTSDVPYPLHNLIMTSMDLTQPQITHLLQTLKPHLILFDFTHWL 128
           +PI VP + GLP GAETTSDVP+P  +L+M +MD TQ  I  +L+ LK  ++ +DFTHWL
Sbjct: 63  VPIVVPSIPGLPPGAETTSDVPFPSTHLLMEAMDKTQNDIEIILKDLKVDVVFYDFTHWL 122

Query: 129 PKLASQLAIKSIHYCVTSAAMIAYTLTPSRQFSKIELTEEDLMKPPFGYPSSTINLHLHE 188
           P LA ++ IKS+ Y   S  M  Y L+P R+    +LTE D+MK P  +P  +I LH HE
Sbjct: 123 PSLARKIGIKSVFYSTISPLMHGYALSPERRVVGKQLTEADMMKAPASFPDPSIKLHAHE 182

Query: 189 ARLFASKRKWKFGSDVLFYDRQFISFSECDAIGFRTCHEIEGDFVSYLQIEFKKPILLTG 248
           AR F ++   KFG D+ F+DR F + SE D + + TC EIEG F  Y++ +F+KP+LL G
Sbjct: 183 ARGFTARTVMKFGGDITFFDRIFTAVSESDGLAYSTCREIEGQFCDYIETQFQKPVLLAG 242

Query: 249 SVLCEPLDTPLEEKWQSWLSGFKEGSVVYCAFGSECTLQMEQFQELLMGFELLNMPFLAA 308
             L  P  + +E+KW  WL  FKEGSV+YCAFGSECTL+ ++FQELL G EL  MPF AA
Sbjct: 243 PALPVPSKSTMEQKWSDWLGKFKEGSVIYCAFGSECTLRKDKFQELLWGLELTGMPFFAA 302

Query: 309 LKPPFGADTIEAAFPEEFAQRVGSRGVVFGGWIQQERILEHPSVGCFVTHCGSNSLKEAL 368
           LKPPF A++IEAA PEE  +++  RG+V G W+QQ+  L+HPSVGCFV+HCG  SL EAL
Sbjct: 303 LKPPFEAESIEAAIPEELKEKIQGRGIVHGEWVQQQLFLQHPSVGCFVSHCGWASLSEAL 362

Query: 369 VNKCQLVLLPQVGDQIINARMMGNNLRVGVEVEKREENGWFTKESVCKAVKIVMDEENEI 428
           VN CQ+VLLPQVGDQIINAR+M  +L+VGVEVEK EE+G F++ESVCKAVK VMDE++EI
Sbjct: 363 VNDCQIVLLPQVGDQIINARIMSVSLKVGVEVEKGEEDGVFSRESVCKAVKAVMDEKSEI 422

Query: 429 GKE------KYADIVSGVDVTKPLKDAWN 452
           G+E      K    +   D+     D++N
Sbjct: 423 GREVRGNHDKLRGFLLNADLDSKYMDSFN 451

BLAST of Cla97C02G029020 vs. Swiss-Prot
Match: sp|Q53UH5|DUSKY_IPOPU (Anthocyanidin 3-O-glucoside 2''-O-glucosyltransferase OS=Ipomoea purpurea OX=4121 GN=3GGT PE=2 SV=1)

HSP 1 Score: 510.4 bits (1313), Expect = 3.2e-143
Identity = 245/449 (54.57%), Positives = 315/449 (70.16%), Query Frame = 0

Query: 9   SHHTSLHIAMYPWFALGHLTPFLHLSNKLAKKGHTISFFIPTKTLPKFEPLNLFPNLITF 68
           S  T+ H+AMYPWF +GHLT F  L+NKLA KGH ISF IP  T  K E  NL P+LI+F
Sbjct: 3   SQATTYHMAMYPWFGVGHLTGFFRLANKLAGKGHRISFLIPKNTQSKLESFNLHPHLISF 62

Query: 69  IPINVPHVHGLPHGAETTSDVPYPLHNLIMTSMDLTQPQITHLLQTLKPHLILFDFTHWL 128
           +PI VP + GLP GAETTSDVP+P  +L+M +MD TQ  I  +L+ LK  ++ +DFTHWL
Sbjct: 63  VPIVVPSIPGLPPGAETTSDVPFPSTHLLMEAMDKTQNDIEIILKDLKVDVVFYDFTHWL 122

Query: 129 PKLASQLAIKSIHYCVTSAAMIAYTLTPSRQFSKIELTEEDLMKPPFGYPSSTINLHLHE 188
           P LA ++ IKS+ Y   S  M  Y L+P R+    +LTE D+MK P  +P  +I LH HE
Sbjct: 123 PSLARKIGIKSVFYSTISPLMHGYALSPERRVVGKQLTEADMMKAPASFPDPSIKLHAHE 182

Query: 189 ARLFASKRKWKFGSDVLFYDRQFISFSECDAIGFRTCHEIEGDFVSYLQIEFKKPILLTG 248
           AR F ++   KFG D+ F+DR F + SE D + + TC EIEG F  Y++ +F+KP+LL G
Sbjct: 183 ARGFTARTVMKFGGDITFFDRIFTAVSESDGLAYSTCREIEGQFCDYIETQFQKPVLLAG 242

Query: 249 SVLCEPLDTPLEEKWQSWLSGFKEGSVVYCAFGSECTLQMEQFQELLMGFELLNMPFLAA 308
             L  P  + +E+KW  WL  FKEGSV+YCAFGSECTL+ ++FQELL G EL  MPF AA
Sbjct: 243 PALPVPSKSTMEQKWSDWLGKFKEGSVIYCAFGSECTLRKDKFQELLWGLELTGMPFFAA 302

Query: 309 LKPPFGADTIEAAFPEEFAQRVGSRGVVFGGWIQQERILEHPSVGCFVTHCGSNSLKEAL 368
           LKPPF  +++EAA PEE  +++  RG+V G W+QQ+  L+HPSVGCFV+HCG  SL EAL
Sbjct: 303 LKPPFETESVEAAIPEELKEKIQGRGIVHGEWVQQQLFLQHPSVGCFVSHCGWASLSEAL 362

Query: 369 VNKCQLVLLPQVGDQIINARMMGNNLRVGVEVEKREENGWFTKESVCKAVKIVMDEENEI 428
           VN CQ+VLLPQVGDQIINAR+M  +L+VGVEVEK EE+G F++ESVCKAVK VMDE++EI
Sbjct: 363 VNDCQIVLLPQVGDQIINARIMSVSLKVGVEVEKGEEDGVFSRESVCKAVKAVMDEKSEI 422

Query: 429 GKE------KYADIVSGVDVTKPLKDAWN 452
           G+E      K    +   D+     D++N
Sbjct: 423 GREVRGNHDKLRGFLMNADLDSKYMDSFN 451

BLAST of Cla97C02G029020 vs. Swiss-Prot
Match: sp|Q9LVW3|AXYLT_ARATH (Anthocyanidin 3-O-glucoside 2'''-O-xylosyltransferase OS=Arabidopsis thaliana OX=3702 GN=A3G2XYLT PE=1 SV=1)

HSP 1 Score: 431.4 bits (1108), Expect = 1.9e-119
Identity = 213/432 (49.31%), Positives = 295/432 (68.29%), Query Frame = 0

Query: 8   TSHHTSLHIAMYPWFALGHLTPFLHLSNKLAKKGHTISFFIPTKTLPKFEPLNLFPNLIT 67
           ++  +S+ I MYPW A GH+TPFLHLSNKLA+KGH I F +P K L + EPLNL+PNLIT
Sbjct: 6   SNESSSMSIVMYPWLAFGHMTPFLHLSNKLAEKGHKIVFLLPKKALNQLEPLNLYPNLIT 65

Query: 68  FIPINVPHVHGLPHGAETTSDVPYPLHNLIMTSMDLTQPQITHLLQTLKPHLILFDFTHW 127
           F  I++P V GLP GAET SDVP+ L +L+  +MD T+P++  + +T+KP L+ +D  HW
Sbjct: 66  FHTISIPQVKGLPPGAETNSDVPFFLTHLLAVAMDQTRPEVETIFRTIKPDLVFYDSAHW 125

Query: 128 LPKLASQLAIKSIHYCVTSAAMIAYTLTPSRQFSKI---ELTEEDLMKPPFGYPSSTINL 187
           +P++A  +  K++ + + SAA IA +L PS +   I   E++ E+L K P GYPSS + L
Sbjct: 126 IPEIAKPIGAKTVCFNIVSAASIALSLVPSAEREVIDGKEMSGEELAKTPLGYPSSKVVL 185

Query: 188 HLHEAR--LFASKRKWKFGSDVLFYDRQFISFSECDAIGFRTCHEIEGDFVSYLQIEFKK 247
             HEA+   F  ++    GS   F+D +  +   CDAI  RTC E EG F  Y+  ++ K
Sbjct: 186 RPHEAKSLSFVWRKHEAIGS---FFDGKVTAMRNCDAIAIRTCRETEGKFCDYISRQYSK 245

Query: 248 PILLTGSVL--CEPLDTPLEEKWQSWLSGFKEGSVVYCAFGSECTL-QMEQFQELLMGFE 307
           P+ LTG VL   +P    L+ +W  WL+ F  GSVV+CAFGS+  + +++QFQEL +G E
Sbjct: 246 PVYLTGPVLPGSQPNQPSLDPQWAEWLAKFNHGSVVFCAFGSQPVVNKIDQFQELCLGLE 305

Query: 308 LLNMPFLAALKPPFGADTIEAAFPEEFAQRVGSRGVVFGGWIQQERILEHPSVGCFVTHC 367
               PFL A+KPP G  T+E A PE F +RV  RGVVFGGWIQQ  +L HPSVGCFV+HC
Sbjct: 306 STGFPFLVAIKPPSGVSTVEEALPEGFKERVQGRGVVFGGWIQQPLVLNHPSVGCFVSHC 365

Query: 368 GSNSLKEALVNKCQLVLLPQVGDQIINARMMGNNLRVGVEVEKREENGWFTKESVCKAVK 427
           G  S+ E+L++ CQ+VL+PQ G+QI+NAR+M   + V VEVE RE+ GWF+++S+  AVK
Sbjct: 366 GFGSMWESLMSDCQIVLVPQHGEQILNARLMTEEMEVAVEVE-REKKGWFSRQSLENAVK 425

Query: 428 IVMDEENEIGKE 432
            VM+E +EIG++
Sbjct: 426 SVMEEGSEIGEK 433

BLAST of Cla97C02G029020 vs. TAIR10
Match: AT5G54060.1 (UDP-glucose:flavonoid 3-o-glucosyltransferase)

HSP 1 Score: 431.4 bits (1108), Expect = 1.1e-120
Identity = 213/432 (49.31%), Positives = 295/432 (68.29%), Query Frame = 0

Query: 8   TSHHTSLHIAMYPWFALGHLTPFLHLSNKLAKKGHTISFFIPTKTLPKFEPLNLFPNLIT 67
           ++  +S+ I MYPW A GH+TPFLHLSNKLA+KGH I F +P K L + EPLNL+PNLIT
Sbjct: 6   SNESSSMSIVMYPWLAFGHMTPFLHLSNKLAEKGHKIVFLLPKKALNQLEPLNLYPNLIT 65

Query: 68  FIPINVPHVHGLPHGAETTSDVPYPLHNLIMTSMDLTQPQITHLLQTLKPHLILFDFTHW 127
           F  I++P V GLP GAET SDVP+ L +L+  +MD T+P++  + +T+KP L+ +D  HW
Sbjct: 66  FHTISIPQVKGLPPGAETNSDVPFFLTHLLAVAMDQTRPEVETIFRTIKPDLVFYDSAHW 125

Query: 128 LPKLASQLAIKSIHYCVTSAAMIAYTLTPSRQFSKI---ELTEEDLMKPPFGYPSSTINL 187
           +P++A  +  K++ + + SAA IA +L PS +   I   E++ E+L K P GYPSS + L
Sbjct: 126 IPEIAKPIGAKTVCFNIVSAASIALSLVPSAEREVIDGKEMSGEELAKTPLGYPSSKVVL 185

Query: 188 HLHEAR--LFASKRKWKFGSDVLFYDRQFISFSECDAIGFRTCHEIEGDFVSYLQIEFKK 247
             HEA+   F  ++    GS   F+D +  +   CDAI  RTC E EG F  Y+  ++ K
Sbjct: 186 RPHEAKSLSFVWRKHEAIGS---FFDGKVTAMRNCDAIAIRTCRETEGKFCDYISRQYSK 245

Query: 248 PILLTGSVL--CEPLDTPLEEKWQSWLSGFKEGSVVYCAFGSECTL-QMEQFQELLMGFE 307
           P+ LTG VL   +P    L+ +W  WL+ F  GSVV+CAFGS+  + +++QFQEL +G E
Sbjct: 246 PVYLTGPVLPGSQPNQPSLDPQWAEWLAKFNHGSVVFCAFGSQPVVNKIDQFQELCLGLE 305

Query: 308 LLNMPFLAALKPPFGADTIEAAFPEEFAQRVGSRGVVFGGWIQQERILEHPSVGCFVTHC 367
               PFL A+KPP G  T+E A PE F +RV  RGVVFGGWIQQ  +L HPSVGCFV+HC
Sbjct: 306 STGFPFLVAIKPPSGVSTVEEALPEGFKERVQGRGVVFGGWIQQPLVLNHPSVGCFVSHC 365

Query: 368 GSNSLKEALVNKCQLVLLPQVGDQIINARMMGNNLRVGVEVEKREENGWFTKESVCKAVK 427
           G  S+ E+L++ CQ+VL+PQ G+QI+NAR+M   + V VEVE RE+ GWF+++S+  AVK
Sbjct: 366 GFGSMWESLMSDCQIVLVPQHGEQILNARLMTEEMEVAVEVE-REKKGWFSRQSLENAVK 425

Query: 428 IVMDEENEIGKE 432
            VM+E +EIG++
Sbjct: 426 SVMEEGSEIGEK 433

BLAST of Cla97C02G029020 vs. TAIR10
Match: AT5G54010.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 424.1 bits (1089), Expect = 1.7e-118
Identity = 202/419 (48.21%), Positives = 281/419 (67.06%), Query Frame = 0

Query: 12  TSLHIAMYPWFALGHLTPFLHLSNKLAKKGHTISFFIPTKTLPKFEPLNLFPNLITFIPI 71
           +  H  M+PWF  GH+T FLHL+NKLA+K H I+F +P K   + E LNLFP+ I F  +
Sbjct: 3   SKFHAFMFPWFGFGHMTAFLHLANKLAEKDHKITFLLPKKARKQLESLNLFPDCIVFQTL 62

Query: 72  NVPHVHGLPHGAETTSDVPYPLHNLIMTSMDLTQPQITHLLQTLKPHLILFDFTHWLPKL 131
            +P V GLP GAETTSD+P  L + + ++MD T+ Q+   +   KP LI FDF HW+P++
Sbjct: 63  TIPSVDGLPDGAETTSDIPISLGSFLASAMDRTRIQVKEAVSVGKPDLIFFDFAHWIPEI 122

Query: 132 ASQLAIKSIHYCVTSAAMIAYTLTPSRQFSKIELTEEDLMKPPFGYPSSTINLHLHEARL 191
           A +  +KS+++   SAA +A +  P R       +++DL   P GYPSS + L  HE   
Sbjct: 123 AREYGVKSVNFITISAACVAISFVPGR-------SQDDLGSTPPGYPSSKVLLRGHETNS 182

Query: 192 FASKRKWKFGSDVLFYDRQFISFSECDAIGFRTCHEIEGDFVSYLQIEFKKPILLTGSVL 251
             S   + FG    FY+R  I    CD I  RTC E+EG F  +++ +F++ +LLTG +L
Sbjct: 183 L-SFLSYPFGDGTSFYERIMIGLKNCDVISIRTCQEMEGKFCDFIENQFQRKVLLTGPML 242

Query: 252 CEPLDT-PLEEKWQSWLSGFKEGSVVYCAFGSECTLQMEQFQELLMGFELLNMPFLAALK 311
            EP ++ PLE++W+ WLS F  GSV+YCA GS+  L+ +QFQEL +G EL  +PFL A+K
Sbjct: 243 PEPDNSKPLEDQWRQWLSKFDPGSVIYCALGSQIILEKDQFQELCLGMELTGLPFLVAVK 302

Query: 312 PPFGADTIEAAFPEEFAQRVGSRGVVFGGWIQQERILEHPSVGCFVTHCGSNSLKEALVN 371
           PP G+ TI+ A P+ F +RV +RGVV+GGW+QQ  IL HPS+GCFV+HCG  S+ EALVN
Sbjct: 303 PPKGSSTIQEALPKGFEERVKARGVVWGGWVQQPLILAHPSIGCFVSHCGFGSMWEALVN 362

Query: 372 KCQLVLLPQVGDQIINARMMGNNLRVGVEVEKREENGWFTKESVCKAVKIVMDEENEIG 430
            CQ+V +P +G+QI+N R+M   L+V VEV KREE GWF+KES+  AV+ VMD ++E+G
Sbjct: 363 DCQIVFIPHLGEQILNTRLMSEELKVSVEV-KREETGWFSKESLSGAVRSVMDRDSELG 412

BLAST of Cla97C02G029020 vs. TAIR10
Match: AT4G27570.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 403.7 bits (1036), Expect = 2.4e-112
Identity = 200/419 (47.73%), Positives = 277/419 (66.11%), Query Frame = 0

Query: 15  HIAMYPWFALGHLTPFLHLSNKLAKKGHTISFFIPTKTLPKFEPLNLFPNLITFIPINVP 74
           H+ MYPWFA GH+TPFL L+NKLA+KGHT++F +P K+L + E  NLFP+ I F  + VP
Sbjct: 7   HVLMYPWFATGHMTPFLFLANKLAEKGHTVTFLLPKKSLKQLEHFNLFPHNIVFRSVTVP 66

Query: 75  HVHGLPHGAETTSDVPYPLHNLIMTSMDLTQPQITHLLQTLKPHLILFDFTHWLPKLASQ 134
           HV GLP G ET S++P    +L+M++MDLT+ Q+  +++ ++P LI FDF HW+P++A  
Sbjct: 67  HVDGLPVGTETASEIPVTSTDLLMSAMDLTRDQVEAVVRAVEPDLIFFDFAHWIPEVARD 126

Query: 135 LAIKSIHYCVTSAAMIAYTLTPSRQFSKIELTEEDLMKPPFGYPSSTINLHLHEA---RL 194
             +K++ Y V SA+ IA  L P            +L  PP GYPSS + L   +A   + 
Sbjct: 127 FGLKTVKYVVVSASTIASMLVPG----------GELGVPPPGYPSSKVLLRKQDAYTMKK 186

Query: 195 FASKRKWKFGSDVLFYDRQFISFSECDAIGFRTCHEIEGDFVSYLQIEFKKPILLTGSVL 254
                    G ++L  +R   S    D I  RT  EIEG+F  Y++   +K +LLTG V 
Sbjct: 187 LEPTNTIDVGPNLL--ERVTTSLMNSDVIAIRTAREIEGNFCDYIEKHCRKKVLLTGPVF 246

Query: 255 CEPLDT-PLEEKWQSWLSGFKEGSVVYCAFGSECTLQMEQFQELLMGFELLNMPFLAALK 314
            EP  T  LEE+W  WLSG++  SVV+CA GS+  L+ +QFQEL +G EL   PFL A+K
Sbjct: 247 PEPDKTRELEERWVKWLSGYEPDSVVFCALGSQVILEKDQFQELCLGMELTGSPFLVAVK 306

Query: 315 PPFGADTIEAAFPEEFAQRVGSRGVVFGGWIQQERILEHPSVGCFVTHCGSNSLKEALVN 374
           PP G+ TI+ A PE F +RV  RG+V+GGW+QQ  IL HPSVGCFV+HCG  S+ E+L++
Sbjct: 307 PPRGSSTIQEALPEGFEERVKGRGLVWGGWVQQPLILSHPSVGCFVSHCGFGSMWESLLS 366

Query: 375 KCQLVLLPQVGDQIINARMMGNNLRVGVEVEKREENGWFTKESVCKAVKIVMDEENEIG 430
            CQ+VL+PQ+GDQ++N R++ + L+V VEV  REE GWF+KES+C AV  VM  ++E+G
Sbjct: 367 DCQIVLVPQLGDQVLNTRLLSDELKVSVEV-AREETGWFSKESLCDAVNSVMKRDSELG 412

BLAST of Cla97C02G029020 vs. TAIR10
Match: AT5G53990.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 401.4 bits (1030), Expect = 1.2e-111
Identity = 201/422 (47.63%), Positives = 274/422 (64.93%), Query Frame = 0

Query: 13  SLHIAMYPWFALGHLTPFLHLSNKLAKKGHTISFFIPTKTLPKFEPLNLFPNLITFIPIN 72
           + H  M+PWFA GH+TP+LHL+NKLA KGH ++F +P K   + E  NLFP+ I F  + 
Sbjct: 4   NFHAFMFPWFAFGHMTPYLHLANKLAAKGHRVTFLLPKKAQKQLEHHNLFPDRIIFHSLT 63

Query: 73  VPHVHGLPHGAETTSDVPYPLHNLIMTSMDLTQPQITHLLQTLKPHLILFDFTHWLPKLA 132
           +PHV GLP GAET SD+P  L   +  +MDLT+ Q+   ++ L+P LI FD  +W+P++A
Sbjct: 64  IPHVDGLPAGAETASDIPISLGKFLTAAMDLTRDQVEAAVRALRPDLIFFDTAYWVPEMA 123

Query: 133 SQLAIKSIHYCVTSAAMIAYTLTPSRQFSKIELTEEDLMKPPFGYPSSTINLHLHEARLF 192
            +  +KS+ Y V SA  IA+ L P            +L  PP GYPSS +    H+A   
Sbjct: 124 KEHRVKSVIYFVISANSIAHELVPG----------GELGVPPPGYPSSKVLYRGHDAHAL 183

Query: 193 ASKRKWKFGSDVLFYDRQF----ISFSECDAIGFRTCHEIEGDFVSYLQIEFKKPILLTG 252
            +          +FY+R           CD I  RTC EIEG F  Y++ ++++ +LLTG
Sbjct: 184 LTFS--------IFYERLHYRITTGLKNCDFISIRTCKEIEGKFCDYIERQYQRKVLLTG 243

Query: 253 SVLCEPLDT-PLEEKWQSWLSGFKEGSVVYCAFGSECTLQMEQFQELLMGFELLNMPFLA 312
            +L EP ++ PLE++W  WL+ FK GSV+YCA GS+ TL+ +QFQEL +G EL  +PFL 
Sbjct: 244 PMLPEPDNSRPLEDRWNHWLNQFKPGSVIYCALGSQITLEKDQFQELCLGMELTGLPFLV 303

Query: 313 ALKPPFGADTIEAAFPEEFAQRVGSRGVVFGGWIQQERILEHPSVGCFVTHCGSNSLKEA 372
           A+KPP GA TI+ A PE F +RV + GVV+G W+QQ  IL HPSVGCFVTHCG  S+ E+
Sbjct: 304 AVKPPKGAKTIQEALPEGFEERVKNHGVVWGEWVQQPLILAHPSVGCFVTHCGFGSMWES 363

Query: 373 LVNKCQLVLLPQVGDQIINARMMGNNLRVGVEVEKREENGWFTKESVCKAVKIVMDEENE 430
           LV+ CQ+VLLP + DQI+N R+M   L V VEV KREE GWF+KES+  A+  VMD+++E
Sbjct: 364 LVSDCQIVLLPYLCDQILNTRLMSEELEVSVEV-KREETGWFSKESLSVAITSVMDKDSE 406

BLAST of Cla97C02G029020 vs. TAIR10
Match: AT4G27560.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 401.0 bits (1029), Expect = 1.5e-111
Identity = 201/419 (47.97%), Positives = 276/419 (65.87%), Query Frame = 0

Query: 15  HIAMYPWFALGHLTPFLHLSNKLAKKGHTISFFIPTKTLPKFEPLNLFPNLITFIPINVP 74
           H+ MYPWFA GH+TPFL L+NKLA+KGHT++F IP K L + E LNLFP+ I F  + VP
Sbjct: 7   HVLMYPWFATGHMTPFLFLANKLAEKGHTVTFLIPKKALKQLENLNLFPHNIVFRSVTVP 66

Query: 75  HVHGLPHGAETTSDVPYPLHNLIMTSMDLTQPQITHLLQTLKPHLILFDFTHWLPKLASQ 134
           HV GLP G ET S++P    +L+M++MDLT+ Q+  +++ ++P LI FDF HW+P++A  
Sbjct: 67  HVDGLPVGTETVSEIPVTSADLLMSAMDLTRDQVEGVVRAVEPDLIFFDFAHWIPEVARD 126

Query: 135 LAIKSIHYCVTSAAMIAYTLTPSRQFSKIELTEEDLMKPPFGYPSSTINLHLHEA---RL 194
             +K++ Y V SA+ IA  L P            +L  PP GYPSS + L   +A   + 
Sbjct: 127 FGLKTVKYVVVSASTIASMLVPG----------GELGVPPPGYPSSKVLLRKQDAYTMKN 186

Query: 195 FASKRKWKFGSDVLFYDRQFISFSECDAIGFRTCHEIEGDFVSYLQIEFKKPILLTGSVL 254
             S      G ++L  +R   S    D I  RT  EIEG+F  Y++   +K +LLTG V 
Sbjct: 187 LESTNTINVGPNLL--ERVTTSLMNSDVIAIRTAREIEGNFCDYIEKHCRKKVLLTGPVF 246

Query: 255 CEPLDT-PLEEKWQSWLSGFKEGSVVYCAFGSECTLQMEQFQELLMGFELLNMPFLAALK 314
            EP  T  LEE+W  WLSG++  SVV+CA GS+  L+ +QFQEL +G EL   PFL A+K
Sbjct: 247 PEPDKTRELEERWVKWLSGYEPDSVVFCALGSQVILEKDQFQELCLGMELTGSPFLVAVK 306

Query: 315 PPFGADTIEAAFPEEFAQRVGSRGVVFGGWIQQERILEHPSVGCFVTHCGSNSLKEALVN 374
           PP G+ TI+ A PE F +RV  RGVV+G W+QQ  +L HPSVGCFV+HCG  S+ E+L++
Sbjct: 307 PPRGSSTIQEALPEGFEERVKGRGVVWGEWVQQPLLLSHPSVGCFVSHCGFGSMWESLLS 366

Query: 375 KCQLVLLPQVGDQIINARMMGNNLRVGVEVEKREENGWFTKESVCKAVKIVMDEENEIG 430
            CQ+VL+PQ+GDQ++N R++ + L+V VEV  REE GWF+KES+  A+  VM  ++EIG
Sbjct: 367 DCQIVLVPQLGDQVLNTRLLSDELKVSVEV-AREETGWFSKESLFDAINSVMKRDSEIG 412
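
The HSP statistics lines in the alignments above report identity and positives as a count over the aligned length, followed by a percentage. As a minimal sketch, assuming that line layout, the percentages can be re-derived from the counts in Python. The regular expression and the function name `parse_hsp_stats` are hypothetical helpers written for this illustration and are not part of any BLAST tool; the pattern also accepts the variant spelling "Postives" in case the field label is misspelled.

```python
import re

# Hypothetical pattern for the HSP statistics line layout used in this record.
# "Posi?tives" accepts both "Positives" and the variant spelling "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP statistics line and recompute its percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        # Percentages recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages exactly as printed in the line, for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    # Values copied from the FG3N_SOYBN hit in this record.
    line = "Identity = 263/422 (62.32%), Positives = 316/422 (74.88%), Query Frame = 0"
    print(parse_hsp_stats(line))
```

Comparing the recomputed and reported percentages is a cheap consistency check when these lines are copied between tools or edited by hand.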

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
KGN43443.1      1.9e-229  90.97     hypothetical protein Csa_7G037470 [Cucumis sativus]
XP_008465522.1  1.4e-227  84.50     PREDICTED: anthocyanidin 3-O-glucoside 2''-O-glucosyltransferase-like [Cucumis m...
XP_004144610.2  2.1e-223  91.33     PREDICTED: anthocyanidin 3-O-glucoside 2''-O-glucosyltransferase-like [Cucumis s...
XP_023530310.1  1.6e-215  82.04     UDP-glycosyltransferase 79B30-like [Cucurbita pepo subsp. pepo]
XP_022930691.1  3.5e-215  81.82     UDP-glycosyltransferase 79B30-like [Cucurbita moschata]
Match Name                      E-value   Identity  Description
tr|A0A0A0K1W3|A0A0A0K1W3_CUCSA  1.3e-229  90.97     Glycosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_7G037470 PE=3 SV=1
tr|A0A1S3CP23|A0A1S3CP23_CUCME  9.2e-228  84.50     Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103503146 PE=3 SV=1
tr|A0A0D5ZD63|A0A0D5ZD63_PANGI  1.9e-164  66.83     Glycosyltransferase OS=Panax ginseng OX=4054 PE=2 SV=1
tr|A0A1S3TG55|A0A1S3TG55_VIGRR  1.1e-159  64.83     Glycosyltransferase OS=Vigna radiata var. radiata OX=3916 GN=LOC106755140 PE=3 S...
tr|A0A0L9V5Y2|A0A0L9V5Y2_PHAAN  1.2e-158  64.44     Glycosyltransferase OS=Phaseolus angularis OX=3914 GN=LR48_Vigan08g091600 PE=3 S...
Match Name                E-value   Identity  Description
sp|I1KEV6|FG3H_SOYBN      3.3e-156  62.56     UDP-glycosyltransferase 79B30 OS=Glycine max OX=3847 GN=FG3 PE=1 SV=2
sp|A0A0G4DBR5|FG3N_SOYBN  1.6e-155  62.32     UDP-glycosyltransferase 79B30 OS=Glycine max OX=3847 GN=FG3 PE=1 SV=1
sp|Q53UH4|DUSKY_IPONI     1.1e-143  55.01     Anthocyanidin 3-O-glucoside 2''-O-glucosyltransferase OS=Ipomoea nil OX=35883 GN...
sp|Q53UH5|DUSKY_IPOPU     3.2e-143  54.57     Anthocyanidin 3-O-glucoside 2''-O-glucosyltransferase OS=Ipomoea purpurea OX=412...
sp|Q9LVW3|AXYLT_ARATH     1.9e-119  49.31     Anthocyanidin 3-O-glucoside 2'''-O-xylosyltransferase OS=Arabidopsis thaliana OX...
Match Name   E-value   Identity  Description
AT5G54060.1  1.1e-120  49.31     UDP-glucose:flavonoid 3-o-glucosyltransferase
AT5G54010.1  1.7e-118  48.21     UDP-Glycosyltransferase superfamily protein
AT4G27570.1  2.4e-112  47.73     UDP-Glycosyltransferase superfamily protein
AT5G53990.1  1.2e-111  47.63     UDP-Glycosyltransferase superfamily protein
AT4G27560.1  1.5e-111  47.97     UDP-Glycosyltransferase superfamily protein
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0016758  transferase activity, transferring hexosyl groups
GO:0004650  polygalacturonase activity
Vocabulary: Biological Process
Term        Definition
GO:0008152  metabolic process
GO:0005975  carbohydrate metabolic process
Vocabulary: INTERPRO
Term       Definition
IPR011050  Pectin_lyase_fold/virulence
IPR035595  UDP_glycos_trans_CS
IPR002213  UDP_glucos_trans
IPR012334  Pectin_lyas_fold
IPR000743  Glyco_hydro_28
IPR006626  PbH1
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
biological_process GO:0005982 starch metabolic process
biological_process GO:0005985 sucrose metabolic process
biological_process GO:0009813 flavonoid biosynthetic process
biological_process GO:0052696 flavonoid glucuronidation
biological_process GO:0005975 carbohydrate metabolic process
cellular_component GO:0005575 cellular_component
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0016740 transferase activity
molecular_function GO:0016758 transferase activity, transferring hexosyl groups
molecular_function GO:0004650 polygalacturonase activity
molecular_function GO:0033838 flavonol-3-O-glucoside glucosyltransferase activity
molecular_function GO:0080043 quercetin 3-O-glucosyltransferase activity
molecular_function GO:0080044 quercetin 7-O-glucosyltransferase activity
molecular_function GO:0008194 UDP-glycosyltransferase activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C02G029020.1  Cla97C02G029020.1  mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term   IPR Description                                 Source       Source Term         Source Description                              Alignment
IPR006626  Parallel beta-helix repeat                      SMART        SM00710             pbh1                                            coord: 506..536; e-value: 910.0; score: 7.5
IPR006626  Parallel beta-helix repeat                      SMART        SM00710             pbh1                                            coord: 577..598; e-value: 5800.0; score: 1.8
IPR006626  Parallel beta-helix repeat                      SMART        SM00710             pbh1                                            coord: 546..567; e-value: 0.43; score: 19.6
IPR006626  Parallel beta-helix repeat                      SMART        SM00710             pbh1                                            coord: 479..505; e-value: 70.0; score: 12.3
IPR000743  Glycoside hydrolase, family 28                  PFAM         PF00295             Glyco_hydro_28                                  coord: 470..661; e-value: 1.8E-56; score: 191.4
IPR000743  Glycoside hydrolase, family 28                  PROSITE      PS00502             POLYGALACTURONASE                               coord: 523..536
IPR012334  Pectin lyase fold                               GENE3D       G3DSA:2.160.20.10                                                   coord: 469..686; e-value: 2.8E-59; score: 203.1
None       No IPR available                                GENE3D       G3DSA:3.40.50.2000                                                  coord: 6..262; e-value: 1.2E-29; score: 105.4
None       No IPR available                                GENE3D       G3DSA:3.40.50.2000                                                  coord: 263..438; e-value: 4.3E-48; score: 165.6
None       No IPR available                                PANTHER      PTHR11926:SF356     SUBFAMILY NOT NAMED                             coord: 11..427
None       No IPR available                                PANTHER      PTHR11926           GLUCOSYL/GLUCURONOSYL TRANSFERASES              coord: 11..427
None       No IPR available                                SUPERFAMILY  SSF53756            UDP-Glycosyltransferase/glycogen phosphorylase  coord: 14..426
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase        PFAM         PF00201             UDPGT                                           coord: 234..425; e-value: 1.4E-14; score: 53.7
IPR035595  UDP-glycosyltransferase family, conserved site  PROSITE      PS00375             UDPGT                                           coord: 340..383
IPR011050  Pectin lyase fold/virulence factor              SUPERFAMILY  SSF51126            Pectin lyase-like                               coord: 441..669
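
The `coord: start..end` fields in the InterPro annotations above give the span of each domain hit on the protein. The following is a minimal sketch for turning those fields into integer ranges and comparing them. It assumes the coordinates are 1-based and inclusive, which this record does not state, and the helper names `parse_coord`, `span_length` and `overlaps` are hypothetical, introduced only for this example.

```python
import re

# Hypothetical pattern for the "coord: start..end" fields in this record.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Return (start, end) as integers from a 'coord: start..end' field."""
    match = COORD.search(text)
    if match is None:
        raise ValueError(f"no coordinate field found in: {text!r}")
    return int(match.group(1)), int(match.group(2))


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def overlaps(a, b):
    """True if two inclusive (start, end) spans share at least one position."""
    return a[0] <= b[1] and b[0] <= a[1]


if __name__ == "__main__":
    # Coordinates copied from the Pfam rows of this record.
    udpgt = parse_coord("coord: 234..425")  # PF00201, UDPGT
    gh28 = parse_coord("coord: 470..661")   # PF00295, Glyco_hydro_28
    print("UDPGT length:", span_length(*udpgt))
    print("Glyco_hydro_28 length:", span_length(*gh28))
    print("Pfam hits overlap:", overlaps(udpgt, gh28))
```

Under that coordinate assumption, the two Pfam hits occupy separate regions of the annotated protein, with the UDPGT hit ending before the Glyco_hydro_28 hit begins.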

The following gene(s) are paralogous to this gene:

None