CsGy4G020610 (gene) Cucumber (Gy14) v2

Name: CsGy4G020610
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v2)
Description: Glycosyltransferase
Location: Chr4 : 27448267 .. 27449736 (-)
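The length of the feature follows from the two coordinates in the location field. The sketch below assumes the coordinates are 1-based and inclusive, which is common for genome annotation but is not stated on this page; `span_length` is a hypothetical helper name.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the location field: Chr4 : 27448267 .. 27449736 (-)
gene_length = span_length(27448267, 27449736)

# A span whose length is a multiple of 3 can hold a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)
print(gene_length, codon_count, remainder)
```

Under that assumption the span is 1470 bp, that is 490 codons with no remainder, which would fit a 489-residue protein plus one stop codon.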
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTTGAGCTCATTTTCATACCTGCCCCCGGAATTGGTCACCTTGCATCCACCGTTGAAATGGCCAATGTTCTTGTCACTCGAGATCATCGTCTCTCTGTCACATTGCTTGCCATGAAGCTTCCCTATGATGTCAAAGTTGCTGAATGTATTGAGTCACTTTCTACATCTTTCGCAGGAAAAAATATACAATTCAACGTCCTTCCAGAACCGCCCCTTCCGGAAGAAAGTAAAAAAGACTTCATTGTGTTGGTTGAAAGCTACAAGCCTTATGTTAGAGAGGTTGTATCCAACCTTACTGCCTCGGCAGCGACAAGTATCGACTCGCCTCGGCTGGTCGGGTTAGTCATCGACATGTTTTGTACAACCATGATCGATGTGGGTAATGAATTTGGGGTTCCTTGTTATGTGTTTTATACTTGCAGTGCTAGTTTTCTTGCTTTTAGTCTTTATCTTCAAGAGCTTTATGAAGAGAATGGTAGCAATGAAGTGGTGGAACAGTTGCTGAACTCAGATAATGTTGAGTTAACTTTGCCAAATTTTGTTAATCCTATTCCTAGTAAACTCATTCCTACCCTCTTTTCTAACAAGGACAAAGCTGTTTGGTTTCATAACCATATTAAAAGGTTTAGATTGGAAATCAAAGGGATTCTTATCAATACATTTGAGGAGATGGAATCACATGTGGCGAAGTCCTACTCTCAAGTCCTCCCACCTTTATATTTTGTTGGACCTGTTTTGCACTTGAAAAATGCAGGAGTGGCAGGATCAAGTGAGGCTCAAAACAATGCAGATATAATAATGAAGTGGCTTGATGATCAACCTCCATCATCAGTGGTCCTTGTGTGCTTTGGGACTATGGTAAGCTTTGATGAGGCTCAAGTGGCAGAGATTGCGAATGCATTGGAGGAAAGTGGGGTTCGTTTCATATGGTCCCTTCGACAACCTCCACCGAAGGGTAAGTTCGAAGCGCCGAAAAACTACAATGACATCAGAAACTTCCTACCAGAGGGATTTCTCGATCGAACAATGAGTATTGGGAGAGTCATTGGATGGACATCACAAGTGGAGATATTGGCCCACCCAGCCATTGGCGGATTCATATCACATTGTGGTTGGAACTCGGTACTAGAAAGTGTGTGGCATGGCGTGCTTATTGCGACATGGCCGATGCATGCCGAGCAACAATTCAATGCCTTTGAAATGGTAGTAGAATTGGGATTAGCAGTGGAGGTCACGTTAGACTATCGAATAACTTTTGGTGAAGACAAGCCAAGATTAGTGAGTGCTGAAGAGATAAAGAGTGGGATCAAGAAATTAATGGGAGAAGAAAGTAATGAGGTAAGGAAAAAAGTGAAAGCAAAAAGTGAAGAAAGTAGGAAAAGTGTAATGGAAGGTGGTTCCTCCTTTGTCTCGTTGGGAAAATTTATAGATGATGTTTTGGCCAATTCGGCAGGAGGAGGAAACTAA

mRNA sequence

ATGTTTGAGCTCATTTTCATACCTGCCCCCGGAATTGGTCACCTTGCATCCACCGTTGAAATGGCCAATGTTCTTGTCACTCGAGATCATCGTCTCTCTGTCACATTGCTTGCCATGAAGCTTCCCTATGATGTCAAAGTTGCTGAATGTATTGAGTCACTTTCTACATCTTTCGCAGGAAAAAATATACAATTCAACGTCCTTCCAGAACCGCCCCTTCCGGAAGAAAGTAAAAAAGACTTCATTGTGTTGGTTGAAAGCTACAAGCCTTATGTTAGAGAGGTTGTATCCAACCTTACTGCCTCGGCAGCGACAAGTATCGACTCGCCTCGGCTGGTCGGGTTAGTCATCGACATGTTTTGTACAACCATGATCGATGTGGGTAATGAATTTGGGGTTCCTTGTTATGTGTTTTATACTTGCAGTGCTAGTTTTCTTGCTTTTAGTCTTTATCTTCAAGAGCTTTATGAAGAGAATGGTAGCAATGAAGTGGTGGAACAGTTGCTGAACTCAGATAATGTTGAGTTAACTTTGCCAAATTTTGTTAATCCTATTCCTAGTAAACTCATTCCTACCCTCTTTTCTAACAAGGACAAAGCTGTTTGGTTTCATAACCATATTAAAAGGTTTAGATTGGAAATCAAAGGGATTCTTATCAATACATTTGAGGAGATGGAATCACATGTGGCGAAGTCCTACTCTCAAGTCCTCCCACCTTTATATTTTGTTGGACCTGTTTTGCACTTGAAAAATGCAGGAGTGGCAGGATCAAGTGAGGCTCAAAACAATGCAGATATAATAATGAAGTGGCTTGATGATCAACCTCCATCATCAGTGGTCCTTGTGTGCTTTGGGACTATGGTAAGCTTTGATGAGGCTCAAGTGGCAGAGATTGCGAATGCATTGGAGGAAAGTGGGGTTCGTTTCATATGGTCCCTTCGACAACCTCCACCGAAGGGTAAGTTCGAAGCGCCGAAAAACTACAATGACATCAGAAACTTCCTACCAGAGGGATTTCTCGATCGAACAATGAGTATTGGGAGAGTCATTGGATGGACATCACAAGTGGAGATATTGGCCCACCCAGCCATTGGCGGATTCATATCACATTGTGGTTGGAACTCGGTACTAGAAAGTGTGTGGCATGGCGTGCTTATTGCGACATGGCCGATGCATGCCGAGCAACAATTCAATGCCTTTGAAATGGTAGTAGAATTGGGATTAGCAGTGGAGGTCACGTTAGACTATCGAATAACTTTTGGTGAAGACAAGCCAAGATTAGTGAGTGCTGAAGAGATAAAGAGTGGGATCAAGAAATTAATGGGAGAAGAAAGTAATGAGGTAAGGAAAAAAGTGAAAGCAAAAAGTGAAGAAAGTAGGAAAAGTGTAATGGAAGGTGGTTCCTCCTTTGTCTCGTTGGGAAAATTTATAGATGATGTTTTGGCCAATTCGGCAGGAGGAGGAAACTAA

Coding sequence (CDS)

ATGTTTGAGCTCATTTTCATACCTGCCCCCGGAATTGGTCACCTTGCATCCACCGTTGAAATGGCCAATGTTCTTGTCACTCGAGATCATCGTCTCTCTGTCACATTGCTTGCCATGAAGCTTCCCTATGATGTCAAAGTTGCTGAATGTATTGAGTCACTTTCTACATCTTTCGCAGGAAAAAATATACAATTCAACGTCCTTCCAGAACCGCCCCTTCCGGAAGAAAGTAAAAAAGACTTCATTGTGTTGGTTGAAAGCTACAAGCCTTATGTTAGAGAGGTTGTATCCAACCTTACTGCCTCGGCAGCGACAAGTATCGACTCGCCTCGGCTGGTCGGGTTAGTCATCGACATGTTTTGTACAACCATGATCGATGTGGGTAATGAATTTGGGGTTCCTTGTTATGTGTTTTATACTTGCAGTGCTAGTTTTCTTGCTTTTAGTCTTTATCTTCAAGAGCTTTATGAAGAGAATGGTAGCAATGAAGTGGTGGAACAGTTGCTGAACTCAGATAATGTTGAGTTAACTTTGCCAAATTTTGTTAATCCTATTCCTAGTAAACTCATTCCTACCCTCTTTTCTAACAAGGACAAAGCTGTTTGGTTTCATAACCATATTAAAAGGTTTAGATTGGAAATCAAAGGGATTCTTATCAATACATTTGAGGAGATGGAATCACATGTGGCGAAGTCCTACTCTCAAGTCCTCCCACCTTTATATTTTGTTGGACCTGTTTTGCACTTGAAAAATGCAGGAGTGGCAGGATCAAGTGAGGCTCAAAACAATGCAGATATAATAATGAAGTGGCTTGATGATCAACCTCCATCATCAGTGGTCCTTGTGTGCTTTGGGACTATGGTAAGCTTTGATGAGGCTCAAGTGGCAGAGATTGCGAATGCATTGGAGGAAAGTGGGGTTCGTTTCATATGGTCCCTTCGACAACCTCCACCGAAGGGTAAGTTCGAAGCGCCGAAAAACTACAATGACATCAGAAACTTCCTACCAGAGGGATTTCTCGATCGAACAATGAGTATTGGGAGAGTCATTGGATGGACATCACAAGTGGAGATATTGGCCCACCCAGCCATTGGCGGATTCATATCACATTGTGGTTGGAACTCGGTACTAGAAAGTGTGTGGCATGGCGTGCTTATTGCGACATGGCCGATGCATGCCGAGCAACAATTCAATGCCTTTGAAATGGTAGTAGAATTGGGATTAGCAGTGGAGGTCACGTTAGACTATCGAATAACTTTTGGTGAAGACAAGCCAAGATTAGTGAGTGCTGAAGAGATAAAGAGTGGGATCAAGAAATTAATGGGAGAAGAAAGTAATGAGGTAAGGAAAAAAGTGAAAGCAAAAAGTGAAGAAAGTAGGAAAAGTGTAATGGAAGGTGGTTCCTCCTTTGTCTCGTTGGGAAAATTTATAGATGATGTTTTGGCCAATTCGGCAGGAGGAGGAAACTAA

Protein sequence

MFELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVAECIESLSTSFAGKNIQFNVLPEPPLPEESKKDFIVLVESYKPYVREVVSNLTASAATSIDSPRLVGLVIDMFCTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVELTLPNFVNPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAKSYSQVLPPLYFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEAQVAEIANALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWTSQVEILAHPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVTLDYRITFGEDKPRLVSAEEIKSGIKKLMGEESNEVRKKVKAKSEESRKSVMEGGSSFVSLGKFIDDVLANSAGGGN
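The protein sequence can be checked against the coding sequence by translating codons with the standard genetic code. The following is a minimal sketch, not the pipeline used by the database: `translate`, `CODON_TABLE`, `cds_start` and `cds_end` are hypothetical names, the two fragments are the first 30 and last 18 nucleotides of the CDS above, and the tail fragment is assumed to be in frame.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGTTTGAGCTCATTTTCATACCTGCCCCC"  # first 30 nt of the CDS
cds_end = "GCAGGAGGAGGAAACTAA"                # last 18 nt of the CDS (assumed in frame)

print(translate(cds_start))  # MFELIFIPAP
print(translate(cds_end))    # AGGGN
```

The two translated fragments match the first ten and last five residues of the protein sequence listed above; passing the full CDS string to `translate` would be the complete check.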
BLAST of CsGy4G020610 vs. NCBI nr
Match: XP_004146065.1 (PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis sativus])

HSP 1 Score: 972.2 bits (2512), Expect = 6.4e-280
Identity = 489/489 (100.00%), Positives = 489/489 (100.00%), Query Frame = 0

Query: 1   MFELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVAECIESLSTSFAG 60
           MFELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVAECIESLSTSFAG
Sbjct: 1   MFELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVAECIESLSTSFAG 60

Query: 61  KNIQFNVLPEPPLPEESKKDFIVLVESYKPYVREVVSNLTASAATSIDSPRLVGLVIDMF 120
           KNIQFNVLPEPPLPEESKKDFIVLVESYKPYVREVVSNLTASAATSIDSPRLVGLVIDMF
Sbjct: 61  KNIQFNVLPEPPLPEESKKDFIVLVESYKPYVREVVSNLTASAATSIDSPRLVGLVIDMF 120

Query: 121 CTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVELTLPN 180
           CTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVELTLPN
Sbjct: 121 CTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVELTLPN 180

Query: 181 FVNPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAKSYSQVLPPL 240
           FVNPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAKSYSQVLPPL
Sbjct: 181 FVNPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAKSYSQVLPPL 240

Query: 241 YFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEAQVAEIAN 300
           YFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEAQVAEIAN
Sbjct: 241 YFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEAQVAEIAN 300

Query: 301 ALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWTSQVEILA 360
           ALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWTSQVEILA
Sbjct: 301 ALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWTSQVEILA 360

Query: 361 HPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVTLDYRITF 420
           HPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVTLDYRITF
Sbjct: 361 HPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVTLDYRITF 420

Query: 421 GEDKPRLVSAEEIKSGIKKLMGEESNEVRKKVKAKSEESRKSVMEGGSSFVSLGKFIDDV 480
           GEDKPRLVSAEEIKSGIKKLMGEESNEVRKKVKAKSEESRKSVMEGGSSFVSLGKFIDDV
Sbjct: 421 GEDKPRLVSAEEIKSGIKKLMGEESNEVRKKVKAKSEESRKSVMEGGSSFVSLGKFIDDV 480

Query: 481 LANSAGGGN 490
           LANSAGGGN
Sbjct: 481 LANSAGGGN 489

BLAST of CsGy4G020610 vs. NCBI nr
Match: XP_008464636.1 (PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis melo])

HSP 1 Score: 947.2 bits (2447), Expect = 2.2e-272
Identity = 470/487 (96.51%), Positives = 481/487 (98.77%), Query Frame = 0

Query: 1   MFELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVAECIESLSTSFAG 60
           MFELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVAECIESLSTSFAG
Sbjct: 1   MFELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVAECIESLSTSFAG 60

Query: 61  KNIQFNVLPEPPLPEESKKDFIVLVESYKPYVREVVSNLTASAATSIDSPRLVGLVIDMF 120
           KNIQFNVLPEPPLPEESKKDFIVLVESYKPYVRE VSN TASAATS+DSPRLVGLVIDMF
Sbjct: 61  KNIQFNVLPEPPLPEESKKDFIVLVESYKPYVREAVSNFTASAATSLDSPRLVGLVIDMF 120

Query: 121 CTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVELTLPN 180
           CTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVELTLPN
Sbjct: 121 CTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVELTLPN 180

Query: 181 FVNPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAKSYSQVLPPL 240
           F NPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESH AKSYSQVLPPL
Sbjct: 181 FANPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHAAKSYSQVLPPL 240

Query: 241 YFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEAQVAEIAN 300
           YFVGPVLHLKNAGVAGSSEAQ+NADIIMKWLDDQPPSSVVLVCFGTMVSFDEAQVAEIAN
Sbjct: 241 YFVGPVLHLKNAGVAGSSEAQDNADIIMKWLDDQPPSSVVLVCFGTMVSFDEAQVAEIAN 300

Query: 301 ALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWTSQVEILA 360
           ALEESGVRFIWSLRQPPPKGKFEAP+NYND++NFLPEGFLDRTMSIGRVIGWTSQVEILA
Sbjct: 301 ALEESGVRFIWSLRQPPPKGKFEAPRNYNDVKNFLPEGFLDRTMSIGRVIGWTSQVEILA 360

Query: 361 HPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVTLDYRITF 420
           HPAIGGF+SHCGWNS+LESVWHGV IATWPMHAEQQFNAFEMVVELGLAVEVTLDYRITF
Sbjct: 361 HPAIGGFVSHCGWNSILESVWHGVPIATWPMHAEQQFNAFEMVVELGLAVEVTLDYRITF 420

Query: 421 GEDKPRLVSAEEIKSGIKKLMGEESNEVRKKVKAKSEESRKSVMEGGSSFVSLGKFIDDV 480
           GEDKPRLVSAEE+KSGIKKLMGEES+EVRKKVKAKSEES+KSVMEGGSSF+SLGKFIDDV
Sbjct: 421 GEDKPRLVSAEEVKSGIKKLMGEESDEVRKKVKAKSEESQKSVMEGGSSFISLGKFIDDV 480

Query: 481 LANSAGG 488
           LANS GG
Sbjct: 481 LANSTGG 487
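The identity and positives percentages reported for each HSP are ratios of matching columns to aligned columns. The sketch below recomputes them for the Cucumis melo hit and counts identical columns in one alignment block; `percent` and `count_identities` are hypothetical helper names, and the two strings are the Query and Sbjct rows for residues 61 to 120 copied from the alignment above.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Share of aligned columns that match, as a percentage rounded to 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


def count_identities(query: str, sbjct: str) -> int:
    """Count columns where both rows carry the same residue (gap columns never count)."""
    if len(query) != len(sbjct):
        raise ValueError("alignment rows must have equal length")
    return sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")


# Counts reported for the XP_008464636.1 hit: 470 identical and 481 positive of 487 columns.
print(percent(470, 487))  # 96.51
print(percent(481, 487))  # 98.77

# Query and Sbjct rows for residues 61-120 of the same hit.
query_block = "KNIQFNVLPEPPLPEESKKDFIVLVESYKPYVREVVSNLTASAATSIDSPRLVGLVIDMF"
sbjct_block = "KNIQFNVLPEPPLPEESKKDFIVLVESYKPYVREAVSNFTASAATSLDSPRLVGLVIDMF"
print(count_identities(query_block, sbjct_block))  # 57
```

Counting positives as well would need a substitution matrix such as BLOSUM62, which this sketch deliberately leaves out.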

BLAST of CsGy4G020610 vs. NCBI nr
Match: KGN54989.1 (hypothetical protein Csa_4G618540 [Cucumis sativus])

HSP 1 Score: 897.9 bits (2319), Expect = 1.5e-257
Identity = 447/447 (100.00%), Positives = 447/447 (100.00%), Query Frame = 0

Query: 1   MFELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVAECIESLSTSFAG 60
           MFELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVAECIESLSTSFAG
Sbjct: 1   MFELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVAECIESLSTSFAG 60

Query: 61  KNIQFNVLPEPPLPEESKKDFIVLVESYKPYVREVVSNLTASAATSIDSPRLVGLVIDMF 120
           KNIQFNVLPEPPLPEESKKDFIVLVESYKPYVREVVSNLTASAATSIDSPRLVGLVIDMF
Sbjct: 61  KNIQFNVLPEPPLPEESKKDFIVLVESYKPYVREVVSNLTASAATSIDSPRLVGLVIDMF 120

Query: 121 CTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVELTLPN 180
           CTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVELTLPN
Sbjct: 121 CTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVELTLPN 180

Query: 181 FVNPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAKSYSQVLPPL 240
           FVNPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAKSYSQVLPPL
Sbjct: 181 FVNPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAKSYSQVLPPL 240

Query: 241 YFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEAQVAEIAN 300
           YFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEAQVAEIAN
Sbjct: 241 YFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEAQVAEIAN 300

Query: 301 ALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWTSQVEILA 360
           ALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWTSQVEILA
Sbjct: 301 ALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWTSQVEILA 360

Query: 361 HPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVTLDYRITF 420
           HPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVTLDYRITF
Sbjct: 361 HPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVTLDYRITF 420

Query: 421 GEDKPRLVSAEEIKSGIKKLMGEESNE 448
           GEDKPRLVSAEEIKSGIKKLMGEESNE
Sbjct: 421 GEDKPRLVSAEEIKSGIKKLMGEESNE 447

BLAST of CsGy4G020610 vs. NCBI nr
Match: XP_023521422.1 (anthocyanidin 3-O-glucosyltransferase 2-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 799.3 bits (2063), Expect = 7.4e-228
Identity = 396/491 (80.65%), Positives = 439/491 (89.41%), Query Frame = 0

Query: 2   FELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVAECIESLSTSFAGK 61
           FE++FIPAPG+GHLASTVEMANVLVTRD  LSVT+L MKLPYD+KVAECI+SLS SF G 
Sbjct: 4   FEMVFIPAPGMGHLASTVEMANVLVTRDPSLSVTVLVMKLPYDLKVAECIDSLSMSFTGN 63

Query: 62  NIQFNVLPEPPLPEESKKDFIVLVESYKPYVREVVSNLTASAATSIDSPRLVGLVIDMFC 121
           +IQF VLPEP LPEESKKDFIVLVESYK YVRE V+NL  S  TS+DSP+  G VIDMFC
Sbjct: 64  SIQFIVLPEPSLPEESKKDFIVLVESYKAYVREAVANLVGS-ETSLDSPQRAGFVIDMFC 123

Query: 122 TTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVELTLPNF 181
           TTMIDV NEFGVPCYVFYTCSASFLAFS++LQELY++N SNEVVEQLLNSD   +TLPNF
Sbjct: 124 TTMIDVANEFGVPCYVFYTCSASFLAFSVHLQELYDQNDSNEVVEQLLNSDTEFITLPNF 183

Query: 182 VNPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAKSYS---QVLP 241
            NPIPSKLIP+LFSNKDKA+WFHNHIKRFR EIKGILINTF EME H  +S S   +V P
Sbjct: 184 ANPIPSKLIPSLFSNKDKAIWFHNHIKRFRSEIKGILINTFMEMEFHAMESISSNGRVFP 243

Query: 242 PLYFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEAQVAEI 301
           PLYFVGP+LHLKN GVAGSSEA+N  + I++WLD +PPSSVVLVCFGTMVSFDE QV EI
Sbjct: 244 PLYFVGPILHLKNTGVAGSSEAENYEE-ILQWLDGKPPSSVVLVCFGTMVSFDEDQVVEI 303

Query: 302 ANALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWTSQVEI 361
           ANALEESGV FIWSLRQPPPKGKFEAP+NY DI+  LPEGFLDRT  IGRVIGWTSQVE+
Sbjct: 304 ANALEESGVGFIWSLRQPPPKGKFEAPRNYTDIKEVLPEGFLDRTADIGRVIGWTSQVEL 363

Query: 362 LAHPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVTLDYRI 421
           LAHP+IGGF+SHCGWNS++ESVWHGV +ATWPMHAEQQFNAF+MV ELGLAVE+TL+YRI
Sbjct: 364 LAHPSIGGFVSHCGWNSIIESVWHGVPMATWPMHAEQQFNAFQMVKELGLAVEITLEYRI 423

Query: 422 TFGEDKPRLVSAEEIKSGIKKLMGEESNEVRKKVKAKSEESRKSVMEGGSSFVSLGKFID 481
           TFGE KPRLVSAEEIK+GI+ LMGEESNE++K+VKAKSEESRKSV EGGSSF+SLGKFID
Sbjct: 424 TFGEGKPRLVSAEEIKNGIRTLMGEESNEIKKRVKAKSEESRKSVKEGGSSFISLGKFID 483

Query: 482 DVLANSAGGGN 490
           +VLANS GGGN
Sbjct: 484 NVLANSPGGGN 492

BLAST of CsGy4G020610 vs. NCBI nr
Match: XP_022936724.1 (anthocyanidin 3-O-glucosyltransferase 2-like [Cucurbita moschata])

HSP 1 Score: 770.4 bits (1988), Expect = 3.7e-219
Identity = 384/491 (78.21%), Positives = 427/491 (86.97%), Query Frame = 0

Query: 2   FELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVAECIESLSTSFAGK 61
           FE++FIPAPG+GHLASTVEMANVLVTRD RLSVT+LAMKLPYD+KVAECI+SLS SF GK
Sbjct: 4   FEMVFIPAPGMGHLASTVEMANVLVTRDPRLSVTVLAMKLPYDLKVAECIDSLSMSFTGK 63

Query: 62  NIQFNVLPEPPLPEESKKDFIVLVESYKPYVREVVSNLTASAATSIDSPRLVGLVIDMFC 121
           +IQF VLPEP LPEESKKDFIVLVESYK YVRE V+NL  S  TS+DSP+L G VIDMFC
Sbjct: 64  SIQFIVLPEPSLPEESKKDFIVLVESYKAYVREAVANLVGS-ETSLDSPQLAGFVIDMFC 123

Query: 122 TTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVELTLPNF 181
           TTMIDV NEFGVPCYVFYTCSASFLAFS++L+ELY++N SNEVVEQLLNSD   +TLPNF
Sbjct: 124 TTMIDVANEFGVPCYVFYTCSASFLAFSVHLRELYDQNDSNEVVEQLLNSDTEFITLPNF 183

Query: 182 VNPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAKSYS---QVLP 241
            NPIPSKLIP+LFSNKDKA+WFHNHIKRFR EIKGIL+NTF EME H  +S S   +V P
Sbjct: 184 ANPIPSKLIPSLFSNKDKAIWFHNHIKRFRSEIKGILVNTFMEMEFHAMESISSNGRVFP 243

Query: 242 PLYFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEAQVAEI 301
           PLYFVGP+LHLKN GVAGSSEA+N  + I++WLD +PPSSVVLVCFGTMVSFDE QV EI
Sbjct: 244 PLYFVGPILHLKNTGVAGSSEAENYEE-ILQWLDGKPPSSVVLVCFGTMVSFDEDQVVEI 303

Query: 302 ANALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWTSQVEI 361
           ANALEESGV FIWSLRQPPPKGKFEAP+NY DI++ LPEGFLDRT  IGRVIGWTSQVE+
Sbjct: 304 ANALEESGVGFIWSLRQPPPKGKFEAPRNYTDIKDVLPEGFLDRTADIGRVIGWTSQVEL 363

Query: 362 LAHPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVTLDYRI 421
           LAHP+IGGF+SHCGWNS++ESVWHGV +ATWPMHAEQQFNAFEMV EL LAVE+TL+YRI
Sbjct: 364 LAHPSIGGFVSHCGWNSIIESVWHGVPMATWPMHAEQQFNAFEMVKELELAVEITLEYRI 423

Query: 422 TFGEDKPRLVSAEEIKSGIKKLMGEESNEVRKKVKAKSEESRKSVMEGGSSFVSLGKFID 481
           TFGE KPRLVSAEEIK+GI+ LMGEESNE                 EGGSSF+SLGKFID
Sbjct: 424 TFGEGKPRLVSAEEIKNGIRTLMGEESNEXXXXXXXXXXXXXXXXXEGGSSFISLGKFID 483

Query: 482 DVLANSAGGGN 490
           +VLANS GGG+
Sbjct: 484 NVLANSPGGGS 492

BLAST of CsGy4G020610 vs. TAIR10
Match: AT3G21760.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 412.5 bits (1059), Expect = 3.5e-115
Identity = 231/490 (47.14%), Positives = 313/490 (63.88%), Query Frame = 0

Query: 3   ELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVAEC---IESLSTSFA 62
           EL+FIP+PG GHL   VE+A + V RD  LS+T++ +   +    +     I SLS S +
Sbjct: 4   ELVFIPSPGDGHLRPLVEVAKLHVDRDDHLSITIIIIPQMHGFSSSNSSSYIASLS-SDS 63

Query: 63  GKNIQFNVL--PEPPLPEESKKDFIVLVESYKPYVREVVSNLTASAATSIDSP-RLVGLV 122
            + + +NVL  P+ P  +++K  F   ++++KP V+  V  LT       DSP RL G V
Sbjct: 64  EERLSYNVLSVPDKPDSDDTKPHFFDYIDNFKPQVKATVEKLTDPGPP--DSPSRLAGFV 123

Query: 123 IDMFCTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVEL 182
           +DMFC  MIDV NEFGVP Y+FYT +A+FL   ++++ LY+    N  V  L +SD  EL
Sbjct: 124 VDMFCMMMIDVANEFGVPSYMFYTSNATFLGLQVHVEYLYDV--KNYDVSDLKDSDTTEL 183

Query: 183 TLPNFVNPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAKSYSQV 242
            +P    P+P K  P++   K+         +RFR E KGIL+NTF E+E    K +S V
Sbjct: 184 EVPCLTRPLPVKCFPSVLLTKEWLPVMFRQTRRFR-ETKGILVNTFAELEPQAMKFFSGV 243

Query: 243 ---LPPLYFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEA 302
              LP +Y VGPV++LK  G   S + Q+    I++WLD+QP  SVV +CFG+M  F E 
Sbjct: 244 DSPLPTVYTVGPVMNLKINGPNSSDDKQSE---ILRWLDEQPRKSVVFLCFGSMGGFREG 303

Query: 303 QVAEIANALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWT 362
           Q  EIA ALE SG RF+WSLR+  PKG    P+ + ++   LPEGFL+RT  IG+++GW 
Sbjct: 304 QAKEIAIALERSGHRFVWSLRRAQPKGSIGPPEEFTNLEEILPEGFLERTAEIGKIVGWA 363

Query: 363 SQVEILAHPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVT 422
            Q  ILA+PAIGGF+SHCGWNS LES+W GV +ATWP++AEQQ NAFEMV ELGLAVEV 
Sbjct: 364 PQSAILANPAIGGFVSHCGWNSTLESLWFGVPMATWPLYAEQQVNAFEMVEELGLAVEVR 423

Query: 423 LDYRITFGEDKPRLVSAEEIKSGIKKLMGEESNEVRKKVKAKSEESRKSVMEGGSSFVSL 482
             +R  F      L++AEEI+ GI+ LM E+ ++VR +VK  SE+S  ++M+GGSS V+L
Sbjct: 424 NSFRGDFMAADDELMTAEEIERGIRCLM-EQDSDVRSRVKEMSEKSHVALMDGGSSHVAL 483

Query: 483 GKFIDDVLAN 484
            KFI DV  N
Sbjct: 484 LKFIQDVTKN 483

BLAST of CsGy4G020610 vs. TAIR10
Match: AT3G21790.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 389.4 bits (999), Expect = 3.2e-108
Identity = 225/491 (45.82%), Positives = 309/491 (62.93%), Query Frame = 0

Query: 2   FELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPY----DVKVAECIESLSTS 61
           FEL+FIP PGIGHL STVEMA +LV R+ RLS++++   LP+    +V  ++ I +LS S
Sbjct: 3   FELVFIPYPGIGHLRSTVEMAKLLVDRETRLSISVII--LPFISEGEVGASDYIAALSAS 62

Query: 62  FAGKNIQFNVLPEPPLPEESKKDFIVLVESYKPYVREVVSNLTASAATSIDSPRLVGLVI 121
            +   +++ V+     P        + +++ +P VR  V+ L    ++  DSP++ G V+
Sbjct: 63  -SNNRLRYEVISAVDQPTIEMTTIEIHMKNQEPKVRSTVAKLLEDYSSKPDSPKIAGFVL 122

Query: 122 DMFCTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVELT 181
           DMFCT+M+DV NEFG P Y+FYT SA  L+ + ++Q L +EN   +V E         L 
Sbjct: 123 DMFCTSMVDVANEFGFPSYMFYTSSAGILSVTYHVQMLCDEN-KYDVSENDYADSEAVLN 182

Query: 182 LPNFVNPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAKSYSQV- 241
            P+   P P K +P   +       F N  ++FR E+KGIL+NT  E+E +V K  S   
Sbjct: 183 FPSLSRPYPVKCLPHALAANMWLPVFVNQARKFR-EMKGILVNTVAELEPYVLKFLSSSD 242

Query: 242 LPPLYFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEAQVA 301
            PP+Y VGP+LHL+N      S+ +   +II +WLD QPPSSVV +CFG+M  F E QV 
Sbjct: 243 TPPVYPVGPLLHLENQ--RDDSKDEKRLEII-RWLDQQPPSSVVFLCFGSMGGFGEEQVR 302

Query: 302 EIANALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWTSQV 361
           EIA ALE SG RF+WSLR+  P    E P  + ++   LPEGF DRT  IG+VIGW  QV
Sbjct: 303 EIAIALERSGHRFLWSLRRASPNIFKELPGEFTNLEEVLPEGFFDRTKDIGKVIGWAPQV 362

Query: 362 EILAHPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVTLDY 421
            +LA+PAIGGF++HCGWNS LES+W GV  A WP++AEQ+FNAF MV ELGLAVE+   +
Sbjct: 363 AVLANPAIGGFVTHCGWNSTLESLWFGVPTAAWPLYAEQKFNAFLMVEELGLAVEIRKYW 422

Query: 422 RITFGEDKPRL----VSAEEIKSGIKKLMGEESNEVRKKVKAKSEESRKSVMEGGSSFVS 481
           R   GE    L    V+AEEI+  I  LM E+ ++VRK+VK  SE+   ++M+GGSS  +
Sbjct: 423 R---GEHLAGLPTATVTAEEIEKAIMCLM-EQDSDVRKRVKDMSEKCHVALMDGGSSRTA 481

Query: 482 LGKFIDDVLAN 484
           L KFI++V  N
Sbjct: 483 LQKFIEEVAKN 481

BLAST of CsGy4G020610 vs. TAIR10
Match: AT3G21780.1 (UDP-glucosyl transferase 71B6)

HSP 1 Score: 387.1 bits (993), Expect = 1.6e-107
Identity = 217/484 (44.83%), Positives = 303/484 (62.60%), Query Frame = 0

Query: 3   ELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVAECIESLSTSFAGKN 62
           EL+FIP+P I HL +TVEMA  LV ++  LS+T++   + +  K    I SL+++     
Sbjct: 4   ELVFIPSPAISHLMATVEMAEQLVDKNDNLSITVII--ISFSSKNTSMITSLTSN---NR 63

Query: 63  IQFNVLPEPPLPEESKKDFIVLVESYKPYVREVVSNLTASAATSIDSPRLVGLVIDMFCT 122
           +++ ++          K     ++S KP VR+ V+ L  S  T  D+PRL G V+DM+CT
Sbjct: 64  LRYEIISGGDQQPTELKATDSHIQSLKPLVRDAVAKLVDS--TLPDAPRLAGFVVDMYCT 123

Query: 123 TMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVELTLPNFV 182
           +MIDV NEFGVP Y+FYT +A FL   L++Q +Y+     ++ E  L   +VEL +P+  
Sbjct: 124 SMIDVANEFGVPSYLFYTSNAGFLGLLLHIQFMYDAEDIYDMSE--LEDSDVELVVPSLT 183

Query: 183 NPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAKSYSQ-VLPPLY 242
           +P P K +P +F +K+   +F    +RFR E KGIL+NT  ++E       S   +P  Y
Sbjct: 184 SPYPLKCLPYIFKSKEWLTFFVTQARRFR-ETKGILVNTVPDLEPQALTFLSNGNIPRAY 243

Query: 243 FVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEAQVAEIANA 302
            VGP+LHLKN       + Q+    I++WLD+QPP SVV +CFG+M  F E QV E A A
Sbjct: 244 PVGPLLHLKNVNCDYVDKKQSE---ILRWLDEQPPRSVVFLCFGSMGGFSEEQVRETALA 303

Query: 303 LEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWTSQVEILAH 362
           L+ SG RF+WSLR+  P    E P  + ++   LPEGF DRT + G+VIGW  QV ILA 
Sbjct: 304 LDRSGHRFLWSLRRASPNILREPPGEFTNLEEILPEGFFDRTANRGKVIGWAEQVAILAK 363

Query: 363 PAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVTLDYRITFG 422
           PAIGGF+SH GWNS LES+W GV +A WP++AEQ+FNAFEMV ELGLAVE+   +R    
Sbjct: 364 PAIGGFVSHGGWNSTLESLWFGVPMAIWPLYAEQKFNAFEMVEELGLAVEIKKHWRGDLL 423

Query: 423 EDKPRLVSAEEIKSGIKKLMGEESNEVRKKVKAKSEESRKSVMEGGSSFVSLGKFIDDVL 482
             +  +V+AEEI+ GI  LM E+ ++VRK+V   SE+   ++M+GGSS  +L +FI DV 
Sbjct: 424 LGRSEIVTAEEIEKGIICLM-EQDSDVRKRVNEISEKCHVALMDGGSSETALKRFIQDVT 473

Query: 483 ANSA 486
            N A
Sbjct: 484 ENIA 473

BLAST of CsGy4G020610 vs. TAIR10
Match: AT4G15280.1 (UDP-glucosyl transferase 71B5)

HSP 1 Score: 376.7 bits (966), Expect = 2.2e-104
Identity = 208/489 (42.54%), Positives = 307/489 (62.78%), Query Frame = 0

Query: 3   ELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDV-KVAECIESLSTSFAGK 62
           EL+FIP PGIGHL  TV++A  L+  ++RLS+T++ +   +D    + CI SL+T     
Sbjct: 4   ELVFIPLPGIGHLRPTVKLAKQLIGSENRLSITIIIIPSRFDAGDASACIASLTTLSQDD 63

Query: 63  NIQF---NVLPEPPLPEESKKDFIVLVESYKPYVREVVSNLTASAATSIDSPR-LVGLVI 122
            + +   +V  +PP  +       V +E  K  VR+ V      AA  +D  R L G V+
Sbjct: 64  RLHYESISVAKQPPTSDPDPVPAQVYIEKQKTKVRDAV------AARIVDPTRKLAGFVV 123

Query: 123 DMFCTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVELT 182
           DMFC++MIDV NEFGVPCY+ YT +A+FL   L++Q++Y++   +  V +L NS   EL 
Sbjct: 124 DMFCSSMIDVANEFGVPCYMVYTSNATFLGTMLHVQQMYDQKKYD--VSELENS-VTELE 183

Query: 183 LPNFVNPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAKSYS--- 242
            P+   P P K +P + ++K+         + FR ++KGIL+NT  E+E H  K ++   
Sbjct: 184 FPSLTRPYPVKCLPHILTSKEWLPLSLAQARCFR-KMKGILVNTVAELEPHALKMFNING 243

Query: 243 QVLPPLYFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEAQ 302
             LP +Y VGPVLHL+N    G+ + +  ++ I++WLD+QP  SVV +CFG++  F E Q
Sbjct: 244 DDLPQVYPVGPVLHLEN----GNDDDEKQSE-ILRWLDEQPSKSVVFLCFGSLGGFTEEQ 303

Query: 303 VAEIANALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWTS 362
             E A AL+ SG RF+W LR   P  K + P++Y ++   LPEGFL+RT+  G+VIGW  
Sbjct: 304 TRETAVALDRSGQRFLWCLRHASPNIKTDRPRDYTNLEEVLPEGFLERTLDRGKVIGWAP 363

Query: 363 QVEILAHPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVTL 422
           QV +L  PAIGGF++HCGWNS+LES+W GV + TWP++AEQ+ NAFEMV ELGLAVE+  
Sbjct: 364 QVAVLEKPAIGGFVTHCGWNSILESLWFGVPMVTWPLYAEQKVNAFEMVEELGLAVEIRK 423

Query: 423 DYRITFGEDKPRLVSAEEIKSGIKKLMGEESNEVRKKVKAKSEESRKSVMEGGSSFVSLG 482
             +      +   V+AE+I+  I+++M E+ ++VR  VK  +E+   ++M+GGSS  +L 
Sbjct: 424 YLKGDLFAGEMETVTAEDIERAIRRVM-EQDSDVRNNVKEMAEKCHFALMDGGSSKAALE 476

Query: 483 KFIDDVLAN 484
           KFI DV+ N
Sbjct: 484 KFIQDVIEN 476

BLAST of CsGy4G020610 vs. TAIR10
Match: AT3G21750.1 (UDP-glucosyl transferase 71B1)

HSP 1 Score: 367.5 bits (942), Expect = 1.3e-101
Identity = 197/488 (40.37%), Positives = 299/488 (61.27%), Query Frame = 0

Query: 3   ELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVAECIESLSTSFAGKN 62
           EL+FIP+PG+GH+ +T  +A +LV  D+RLSVTL+ +      +V++   S   + +   
Sbjct: 4   ELVFIPSPGVGHIRATTALAKLLVASDNRLSVTLIVI----PSRVSDDASSSVYTNSEDR 63

Query: 63  IQFNVLPEPPLPEESKKDFIVLVESYKPYVREVVSNLTASAATSIDSPRLVGLVIDMFCT 122
           +++ +LP      +   D +  ++S KP VR VVS +    +T  DS RL G+V+DMFCT
Sbjct: 64  LRYILLP----ARDQTTDLVSYIDSQKPQVRAVVSKVAGDVSTRSDS-RLAGIVVDMFCT 123

Query: 123 TMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVELTLPNFV 182
           +MID+ +EF +  Y+FYT +AS+L    ++Q LY+E    E+         ++  +P   
Sbjct: 124 SMIDIADEFNLSAYIFYTSNASYLGLQFHVQSLYDE---KELDVSEFKDTEMKFDVPTLT 183

Query: 183 NPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAKSYS-----QVL 242
            P P+K +P++  NK    +     + FR   KGIL+N+  +ME      +S       +
Sbjct: 184 QPFPAKCLPSVMLNKKWFPYVLGRARSFR-ATKGILVNSVADMEPQALSFFSGGNGNTNI 243

Query: 243 PPLYFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEAQVAE 302
           PP+Y VGP++ L+++G       +     I+ WL +QP  SVV +CFG+M  F E Q  E
Sbjct: 244 PPVYAVGPIMDLESSG------DEEKRKEILHWLKEQPTKSVVFLCFGSMGGFSEEQARE 303

Query: 303 IANALEESGVRFIWSLRQPPPKGKFE--APKNYNDIRNFLPEGFLDRTMSIGRVIGWTSQ 362
           IA ALE SG RF+WSLR+  P G      P  + ++   LP+GFLDRT+ IG++I W  Q
Sbjct: 304 IAVALERSGHRFLWSLRRASPVGNKSNPPPGEFTNLEEILPKGFLDRTVEIGKIISWAPQ 363

Query: 363 VEILAHPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVTLD 422
           V++L  PAIG F++HCGWNS+LES+W GV +A WP++AEQQFNAF MV ELGLA EV  +
Sbjct: 364 VDVLNSPAIGAFVTHCGWNSILESLWFGVPMAAWPIYAEQQFNAFHMVDELGLAAEVKKE 423

Query: 423 YRITFGEDKPRLVSAEEIKSGIKKLMGEESNEVRKKVKAKSEESRKSVMEGGSSFVSLGK 482
           YR  F  ++P +V+A+EI+ GIK  M E+ +++RK+V    ++   ++++GGSS  +L K
Sbjct: 424 YRRDFLVEEPEIVTADEIERGIKCAM-EQDSKMRKRVMEMKDKLHVALVDGGSSNCALKK 471

Query: 483 FIDDVLAN 484
           F+ DV+ N
Sbjct: 484 FVQDVVDN 471

BLAST of CsGy4G020610 vs. Swiss-Prot
Match: sp|Q66PF3|UFOG3_FRAAN (Putative UDP-glucose flavonoid 3-O-glucosyltransferase 3 OS=Fragaria ananassa OX=3747 GN=GT3 PE=2 SV=1)

HSP 1 Score: 450.7 bits (1158), Expect = 2.1e-125
Identity = 243/486 (50.00%), Positives = 330/486 (67.90%), Query Frame = 0

Query: 3   ELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVAEC-IESL--STSFA 62
           EL+ IP+PGIGHL ST+E+A +LV+RD +L +T+L M  P   K  +  ++SL  S+S  
Sbjct: 6   ELVLIPSPGIGHLVSTLEIAKLLVSRDDKLFITVLIMHFPAVSKGTDAYVQSLADSSSPI 65

Query: 63  GKNIQFNVLPEPPLPEES---KKDFIVLVESYKPYVREVVSNLTASAATSIDSPRLVGLV 122
            + I F  LP   +       +   +  VES +P+V++ V+NL  S  T     RL G V
Sbjct: 66  SQRINFINLPHTNMDHTEGSVRNSLVGFVESQQPHVKDAVANLRDSKTT-----RLAGFV 125

Query: 123 IDMFCTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVEL 182
           +DMFCTTMI+V N+ GVP YVF+T  A+ L    +LQEL ++   N+   +  +SD  EL
Sbjct: 126 VDMFCTTMINVANQLGVPSYVFFTSGAATLGLLFHLQELRDQ--YNKDCTEFKDSD-AEL 185

Query: 183 TLPNFVNPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHV--AKSYS 242
            +P+F NP+P+K++P     KD A  F N IKRFR E KGIL+NTF ++ESH   A S  
Sbjct: 186 IIPSFFNPLPAKVLPGRMLVKDSAEPFLNVIKRFR-ETKGILVNTFTDLESHALHALSSD 245

Query: 243 QVLPPLYFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEAQ 302
             +PP+Y VGP+L+L +      S+     + I+KWLDDQPP SVV +CFG+M SFDE+Q
Sbjct: 246 AEIPPVYPVGPLLNLNSNESRVDSDEVKKKNDILKWLDDQPPLSVVFLCFGSMGSFDESQ 305

Query: 303 VAEIANALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWTS 362
           V EIANALE +G RF+WSLR+ PP GK   P +Y+D    LPEGFLDRT  IG+VIGW  
Sbjct: 306 VREIANALEHAGHRFLWSLRRSPPTGKVAFPSDYDDHTGVLPEGFLDRTGGIGKVIGWAP 365

Query: 363 QVEILAHPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVTL 422
           QV +LAHP++GGF+SHCGWNS LES+WHGV +ATWP++AEQQ NAF+ V EL LAVE+ +
Sbjct: 366 QVAVLAHPSVGGFVSHCGWNSTLESLWHGVPVATWPLYAEQQLNAFQPVKELELAVEIDM 425

Query: 423 DYRITFGEDKPRLVSAEEIKSGIKKLMGEESNEVRKKVKAKSEESRKSVMEGGSSFVSLG 481
            YR       P LVSA+EI+ GI+++M  +S+++RK+VK  SE+ +K++M+GGSS+ SLG
Sbjct: 426 SYR----SKSPVLVSAKEIERGIREVMELDSSDIRKRVKEMSEKGKKALMDGGSSYTSLG 478

BLAST of CsGy4G020610 vs. Swiss-Prot
Match: sp|D3UAG1|U7A16_PYRCO (UDP-glycosyltransferase 71A16 OS=Pyrus communis OX=23211 GN=UGT71A16 PE=1 SV=1)

HSP 1 Score: 446.0 bits (1146), Expect = 5.2e-124
Identity = 243/487 (49.90%), Positives = 327/487 (67.15%), Query Frame = 0

Query: 3   ELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVAECIESLSTSFAGKN 62
           +L+F+PAPGIGH+ STVEMA  LV RD +L +T+L MKLPYD        S+S       
Sbjct: 6   QLVFVPAPGIGHIVSTVEMAKQLVARDDQLFITVLVMKLPYDQPFTNTDSSIS-----HR 65

Query: 63  IQFNVLPEPPLPEESK-----KDFIVLVESYKPYVREVVSNL--TASAATSIDSPRLVGL 122
           I F  LPE  L ++         F + VE++K +VR+ V NL   +  + S   PRL G 
Sbjct: 66  INFVNLPEAQLDKQDTVPNPGSFFRMFVENHKTHVRDAVINLLPESDQSESTSKPRLAGF 125

Query: 123 VIDMFCTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVE 182
           V+DMF  ++IDV NEF VP YVF+T ++S LA   + Q L +E G  ++ E  L S   E
Sbjct: 126 VLDMFSASLIDVANEFEVPSYVFFTSNSSTLALLSHFQSLRDEGGI-DITE--LTSSTAE 185

Query: 183 LTLPNFVNPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAKSYSQ 242
           L +P+F+NP P  ++P  F +K+      N++ R++ + KGIL+NTF E+ESH       
Sbjct: 186 LAVPSFINPYPVAVLPGSFLDKESTKSTLNNVGRYK-QTKGILVNTFLELESHALHYLDS 245

Query: 243 --VLPPLYFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEA 302
              +PP+Y VGP+L+LK      SS     +D I++WLDDQPP SVV +CFG+M SF +A
Sbjct: 246 GVKIPPVYPVGPLLNLK------SSHEDKGSD-ILRWLDDQPPLSVVFLCFGSMGSFGDA 305

Query: 303 QVAEIANALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWT 362
           QV EIA  LE SG RF+WSLRQPP KGK   P +Y D++  LPEGFLDRT ++GRVIGW 
Sbjct: 306 QVKEIACTLEHSGHRFLWSLRQPPSKGKRALPSDYADLKTVLPEGFLDRTATVGRVIGWA 365

Query: 363 SQVEILAHPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVT 422
            Q  IL HPAIGGF+SHCGWNS LES+W+GV IA WPM+AEQ  NAF++VVELGLAVE+ 
Sbjct: 366 PQAAILGHPAIGGFVSHCGWNSTLESIWNGVPIAAWPMYAEQNMNAFQLVVELGLAVEIK 425

Query: 423 LDYRITFGEDKPRLVSAEEIKSGIKKLMGEESNEVRKKVKAKSEESRKSVMEGGSSFVSL 481
           +DYR    +D   +VSAE+I+ GI+++M E  ++VRK+VK  SE+S+K++++GGSS+ SL
Sbjct: 426 MDYR----KDSDVVVSAEDIERGIRQVM-ELDSDVRKRVKEMSEKSKKALVDGGSSYSSL 471

BLAST of CsGy4G020610 vs. Swiss-Prot
Match: sp|D3THI6|U7A15_MALDO (UDP-glycosyltransferase 71A15 OS=Malus domestica OX=3750 GN=UGT71A15 PE=1 SV=1)

HSP 1 Score: 438.0 bits (1125), Expect = 1.4e-121
Identity = 236/490 (48.16%), Positives = 325/490 (66.33%), Query Frame = 0

Query: 3   ELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVAECIESLSTSFAGKN 62
           +L+F+PAPGIGH+ STVEMA  L  RD +L +T+L MKLPY         S+S       
Sbjct: 6   QLVFVPAPGIGHIVSTVEMAKQLAARDDQLFITVLVMKLPYAQPFTNTDSSIS-----HR 65

Query: 63  IQFNVLPEPPLPEESKKD--------FIVLVESYKPYVREVVSNL--TASAATSIDSPRL 122
           I F  LPE    +  K+D        F + VE++K +VR+ V N+   +  + S   PRL
Sbjct: 66  INFVNLPE---AQPDKQDIVPNPGSFFRMFVENHKSHVRDAVINVLPESDQSESTSKPRL 125

Query: 123 VGLVIDMFCTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSD 182
            G V+DMF  ++IDV NEF VP Y+F+T +AS LA   + Q L +E G  ++ E  L S 
Sbjct: 126 AGFVLDMFSASLIDVANEFKVPSYLFFTSNASALALMSHFQSLRDEGGI-DITE--LTSS 185

Query: 183 NVELTLPNFVNPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAK- 242
             EL +P+F+NP P+ ++P    + +      NH+ +++ + KGIL+NTF E+ESH    
Sbjct: 186 TAELAVPSFINPYPAAVLPGSLLDMESTKSTLNHVSKYK-QTKGILVNTFMELESHALHY 245

Query: 243 -SYSQVLPPLYFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSF 302
                 +PP+Y VGP+L+LK       S  ++ A  I++WLDDQPP SVV +CFG+M SF
Sbjct: 246 LDSGDKIPPVYPVGPLLNLK-------SSDEDKASDILRWLDDQPPFSVVFLCFGSMGSF 305

Query: 303 DEAQVAEIANALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVI 362
            EAQV EIA ALE SG RF+WSLR+PPP+GK   P +Y D++  LPEGFLDRT ++G+VI
Sbjct: 306 GEAQVKEIACALEHSGHRFLWSLRRPPPQGKRAMPSDYEDLKTVLPEGFLDRTATVGKVI 365

Query: 363 GWTSQVEILAHPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAV 422
           GW  Q  IL HPA GGF+SHCGWNS LES+W+GV IA WP++AEQ  NAF++VVELGLAV
Sbjct: 366 GWAPQAAILGHPATGGFVSHCGWNSTLESLWNGVPIAAWPLYAEQNLNAFQLVVELGLAV 425

Query: 423 EVTLDYRITFGEDKPRLVSAEEIKSGIKKLMGEESNEVRKKVKAKSEESRKSVMEGGSSF 481
           E+ +DYR     D   +VSAE+I+ GI+++M E  ++VRK+VK  SE+S+K++++GGSS+
Sbjct: 426 EIKMDYR----RDSDVVVSAEDIERGIRRVM-ELDSDVRKRVKEMSEKSKKALVDGGSSY 471

BLAST of CsGy4G020610 vs. Swiss-Prot
Match: sp|Q2V6K0|UFOG6_FRAAN (UDP-glucose flavonoid 3-O-glucosyltransferase 6 OS=Fragaria ananassa OX=3747 GN=GT6 PE=1 SV=1)

HSP 1 Score: 433.7 bits (1114), Expect = 2.7e-120
Identity = 237/484 (48.97%), Positives = 326/484 (67.36%), Query Frame = 0

Query: 3   ELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVAEC-IESLST--SFA 62
           ELIFIP PGIGH+ STVE+A +L+ RD  L +T+L MK P+    ++  I+SL+   S  
Sbjct: 6   ELIFIPIPGIGHIVSTVEIAKLLLCRDDNLFITILIMKFPFTADGSDVYIKSLAVDPSLK 65

Query: 63  GKNIQFNVLPEPPLPEESKKDFIVLVESYKPYVREVVSNLTASAATSIDSPRLVGLVIDM 122
            + I+F  LP+          F   ++S+K +V++ V+ L     T  ++ R+ G VIDM
Sbjct: 66  TQRIRFVNLPQEHFQGTGATGFFTFIDSHKSHVKDAVTRL---METKSETTRIAGFVIDM 125

Query: 123 FCTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVELTLP 182
           FCT MID+ NEFG+P YVFYT  A+ L    +LQ L +E   N+   +  +SD  EL + 
Sbjct: 126 FCTGMIDLANEFGLPSYVFYTSGAADLGLMFHLQALRDE--ENKDCTEFKDSD-AELVVS 185

Query: 183 NFVNPIP-SKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAKSYSQ--V 242
           +FVNP+P ++++P++   K+   +F N  KR+R E KGIL+NTF E+E H  +S S    
Sbjct: 186 SFVNPLPAARVLPSVVFEKEGGNFFLNFAKRYR-ETKGILVNTFLELEPHAIQSLSSDGK 245

Query: 243 LPPLYFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEAQVA 302
           + P+Y VGP+L++K+ G   SSE       I++WLDDQPPSSVV +CFG+M  F E QV 
Sbjct: 246 ILPVYPVGPILNVKSEGNQVSSEKSKQKSDILEWLDDQPPSSVVFLCFGSMGCFGEDQVK 305

Query: 303 EIANALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWTSQV 362
           EIA+ALE+ G+RF+WSLRQ P K K   P +Y D +  LPEGFLDRT  +G+VIGW  Q+
Sbjct: 306 EIAHALEQGGIRFLWSLRQ-PSKEKIGFPSDYTDYKAVLPEGFLDRTTDLGKVIGWAPQL 365

Query: 363 EILAHPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVTLDY 422
            ILAHPA+GGF+SHCGWNS LES+W+GV IATWP +AEQQ NAFE+V EL LAVE+ + Y
Sbjct: 366 AILAHPAVGGFVSHCGWNSTLESIWYGVPIATWPFYAEQQVNAFELVKELKLAVEIDMGY 425

Query: 423 RITFGEDKPRLVSAEEIKSGIKKLMGEESNEVRKKVKAKSEESRKSVMEGGSSFVSLGKF 481
           R    +D   +VS E I+ GIK++M +ES E+RK+VK  S+ SRK++ E GSS+ SLG+F
Sbjct: 426 R----KDSGVIVSRENIEKGIKEVMEQES-ELRKRVKEMSQMSRKALEEDGSSYSSLGRF 476

BLAST of CsGy4G020610 vs. Swiss-Prot
Match: sp|Q9LSY8|U71B2_ARATH (UDP-glycosyltransferase 71B2 OS=Arabidopsis thaliana OX=3702 GN=UGT71B2 PE=1 SV=1)

HSP 1 Score: 412.5 bits (1059), Expect = 6.4e-114
Identity = 231/490 (47.14%), Positives = 313/490 (63.88%), Query Frame = 0

Query: 3   ELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVAEC---IESLSTSFA 62
           EL+FIP+PG GHL   VE+A + V RD  LS+T++ +   +    +     I SLS S +
Sbjct: 4   ELVFIPSPGDGHLRPLVEVAKLHVDRDDHLSITIIIIPQMHGFSSSNSSSYIASLS-SDS 63

Query: 63  GKNIQFNVL--PEPPLPEESKKDFIVLVESYKPYVREVVSNLTASAATSIDSP-RLVGLV 122
            + + +NVL  P+ P  +++K  F   ++++KP V+  V  LT       DSP RL G V
Sbjct: 64  EERLSYNVLSVPDKPDSDDTKPHFFDYIDNFKPQVKATVEKLTDPGPP--DSPSRLAGFV 123

Query: 123 IDMFCTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVEL 182
           +DMFC  MIDV NEFGVP Y+FYT +A+FL   ++++ LY+    N  V  L +SD  EL
Sbjct: 124 VDMFCMMMIDVANEFGVPSYMFYTSNATFLGLQVHVEYLYDV--KNYDVSDLKDSDTTEL 183

Query: 183 TLPNFVNPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAKSYSQV 242
            +P    P+P K  P++   K+         +RFR E KGIL+NTF E+E    K +S V
Sbjct: 184 EVPCLTRPLPVKCFPSVLLTKEWLPVMFRQTRRFR-ETKGILVNTFAELEPQAMKFFSGV 243

Query: 243 ---LPPLYFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEA 302
              LP +Y VGPV++LK  G   S + Q+    I++WLD+QP  SVV +CFG+M  F E 
Sbjct: 244 DSPLPTVYTVGPVMNLKINGPNSSDDKQSE---ILRWLDEQPRKSVVFLCFGSMGGFREG 303

Query: 303 QVAEIANALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWT 362
           Q  EIA ALE SG RF+WSLR+  PKG    P+ + ++   LPEGFL+RT  IG+++GW 
Sbjct: 304 QAKEIAIALERSGHRFVWSLRRAQPKGSIGPPEEFTNLEEILPEGFLERTAEIGKIVGWA 363

Query: 363 SQVEILAHPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVT 422
            Q  ILA+PAIGGF+SHCGWNS LES+W GV +ATWP++AEQQ NAFEMV ELGLAVEV 
Sbjct: 364 PQSAILANPAIGGFVSHCGWNSTLESLWFGVPMATWPLYAEQQVNAFEMVEELGLAVEVR 423

Query: 423 LDYRITFGEDKPRLVSAEEIKSGIKKLMGEESNEVRKKVKAKSEESRKSVMEGGSSFVSL 482
             +R  F      L++AEEI+ GI+ LM E+ ++VR +VK  SE+S  ++M+GGSS V+L
Sbjct: 424 NSFRGDFMAADDELMTAEEIERGIRCLM-EQDSDVRSRVKEMSEKSHVALMDGGSSHVAL 483

Query: 483 GKFIDDVLAN 484
            KFI DV  N
Sbjct: 484 LKFIQDVTKN 483

BLAST of CsGy4G020610 vs. TrEMBL
Match: tr|A0A1S3CLY5|A0A1S3CLY5_CUCME (Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103502474 PE=3 SV=1)

HSP 1 Score: 947.2 bits (2447), Expect = 1.5e-272
Identity = 470/487 (96.51%), Positives = 481/487 (98.77%), Query Frame = 0

Query: 1   MFELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVAECIESLSTSFAG 60
           MFELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVAECIESLSTSFAG
Sbjct: 1   MFELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVAECIESLSTSFAG 60

Query: 61  KNIQFNVLPEPPLPEESKKDFIVLVESYKPYVREVVSNLTASAATSIDSPRLVGLVIDMF 120
           KNIQFNVLPEPPLPEESKKDFIVLVESYKPYVRE VSN TASAATS+DSPRLVGLVIDMF
Sbjct: 61  KNIQFNVLPEPPLPEESKKDFIVLVESYKPYVREAVSNFTASAATSLDSPRLVGLVIDMF 120

Query: 121 CTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVELTLPN 180
           CTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVELTLPN
Sbjct: 121 CTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVELTLPN 180

Query: 181 FVNPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAKSYSQVLPPL 240
           F NPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESH AKSYSQVLPPL
Sbjct: 181 FANPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHAAKSYSQVLPPL 240

Query: 241 YFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEAQVAEIAN 300
           YFVGPVLHLKNAGVAGSSEAQ+NADIIMKWLDDQPPSSVVLVCFGTMVSFDEAQVAEIAN
Sbjct: 241 YFVGPVLHLKNAGVAGSSEAQDNADIIMKWLDDQPPSSVVLVCFGTMVSFDEAQVAEIAN 300

Query: 301 ALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWTSQVEILA 360
           ALEESGVRFIWSLRQPPPKGKFEAP+NYND++NFLPEGFLDRTMSIGRVIGWTSQVEILA
Sbjct: 301 ALEESGVRFIWSLRQPPPKGKFEAPRNYNDVKNFLPEGFLDRTMSIGRVIGWTSQVEILA 360

Query: 361 HPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVTLDYRITF 420
           HPAIGGF+SHCGWNS+LESVWHGV IATWPMHAEQQFNAFEMVVELGLAVEVTLDYRITF
Sbjct: 361 HPAIGGFVSHCGWNSILESVWHGVPIATWPMHAEQQFNAFEMVVELGLAVEVTLDYRITF 420

Query: 421 GEDKPRLVSAEEIKSGIKKLMGEESNEVRKKVKAKSEESRKSVMEGGSSFVSLGKFIDDV 480
           GEDKPRLVSAEE+KSGIKKLMGEES+EVRKKVKAKSEES+KSVMEGGSSF+SLGKFIDDV
Sbjct: 421 GEDKPRLVSAEEVKSGIKKLMGEESDEVRKKVKAKSEESQKSVMEGGSSFISLGKFIDDV 480

Query: 481 LANSAGG 488
           LANS GG
Sbjct: 481 LANSTGG 487

BLAST of CsGy4G020610 vs. TrEMBL
Match: tr|A0A0A0L1T2|A0A0A0L1T2_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G618540 PE=4 SV=1)

HSP 1 Score: 897.9 bits (2319), Expect = 1.0e-257
Identity = 447/447 (100.00%), Positives = 447/447 (100.00%), Query Frame = 0

Query: 1   MFELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVAECIESLSTSFAG 60
           MFELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVAECIESLSTSFAG
Sbjct: 1   MFELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVAECIESLSTSFAG 60

Query: 61  KNIQFNVLPEPPLPEESKKDFIVLVESYKPYVREVVSNLTASAATSIDSPRLVGLVIDMF 120
           KNIQFNVLPEPPLPEESKKDFIVLVESYKPYVREVVSNLTASAATSIDSPRLVGLVIDMF
Sbjct: 61  KNIQFNVLPEPPLPEESKKDFIVLVESYKPYVREVVSNLTASAATSIDSPRLVGLVIDMF 120

Query: 121 CTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVELTLPN 180
           CTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVELTLPN
Sbjct: 121 CTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVELTLPN 180

Query: 181 FVNPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAKSYSQVLPPL 240
           FVNPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAKSYSQVLPPL
Sbjct: 181 FVNPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAKSYSQVLPPL 240

Query: 241 YFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEAQVAEIAN 300
           YFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEAQVAEIAN
Sbjct: 241 YFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEAQVAEIAN 300

Query: 301 ALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWTSQVEILA 360
           ALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWTSQVEILA
Sbjct: 301 ALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWTSQVEILA 360

Query: 361 HPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVTLDYRITF 420
           HPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVTLDYRITF
Sbjct: 361 HPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVTLDYRITF 420

Query: 421 GEDKPRLVSAEEIKSGIKKLMGEESNE 448
           GEDKPRLVSAEEIKSGIKKLMGEESNE
Sbjct: 421 GEDKPRLVSAEEIKSGIKKLMGEESNE 447

BLAST of CsGy4G020610 vs. TrEMBL
Match: tr|K7NBW4|K7NBW4_SIRGR (Glycosyltransferase OS=Siraitia grosvenorii OX=190515 GN=UDPG7 PE=2 SV=1)

HSP 1 Score: 627.9 bits (1618), Expect = 2.0e-176
Identity = 321/493 (65.11%), Positives = 388/493 (78.70%), Query Frame = 0

Query: 2   FELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVAECIESLSTSFAGK 61
           FEL+FIP P +GHLA+ VEMAN+LVTRD RL+VT+L +KLP   K AE I+SLS SFA +
Sbjct: 4   FELVFIPLPVMGHLAAMVEMANILVTRDQRLTVTILVIKLPLYGKTAEYIQSLSASFASE 63

Query: 62  NIQFNVLPEPPLPEESKKDFIV--LVESYKPYVREVVSNLTASAATSIDSPRLVGLVIDM 121
           +++F +LPE  LPEES+K+F++   +ESYKP +RE + +LT S     DSPRL G V+DM
Sbjct: 64  SMRFIILPEVLLPEESEKEFMLKAFLESYKPIIREAIIDLTDS-QMGPDSPRLAGFVLDM 123

Query: 122 FCTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVELTLP 181
           FCTTMIDV NEFGVP YVF T +A FLA S +LQELY+EN S EVV+QL NS N E+ LP
Sbjct: 124 FCTTMIDVANEFGVPSYVFCTSNAGFLALSFHLQELYDENNSKEVVKQLQNS-NAEIALP 183

Query: 182 NFVNPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHV----AKSYSQ 241
           +FVNPIP K+IP +FSN D A WFH+ ++R+R  +KGILINTF ++ESHV    ++S S 
Sbjct: 184 SFVNPIPGKMIPDIFSNDDTASWFHDQVERYRSGVKGILINTFAKLESHVMNSMSRSSSS 243

Query: 242 VLPPLYFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEAQV 301
             PPLY +GP+LHLKN    G     +  D I+KWLD+QPP SVV +CFG+M SFDE QV
Sbjct: 244 RAPPLYSIGPILHLKNNNTVGPGGTLHCTD-ILKWLDNQPPVSVVFLCFGSMGSFDEDQV 303

Query: 302 AEIANALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWTSQ 361
            EIA+ALE SGVRF+WSLRQPPPK KFEAP  Y DI+  LPEGFL+RT  IGRVIGW  Q
Sbjct: 304 KEIAHALERSGVRFLWSLRQPPPKDKFEAPSEYTDIKYVLPEGFLERTAGIGRVIGWAPQ 363

Query: 362 VEILAHPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVTLD 421
           VEILAHPA GGF+SHCGWNS LES+WHGV +ATWP++AEQQF AFEMVVELGLAV++TLD
Sbjct: 364 VEILAHPATGGFVSHCGWNSTLESMWHGVPMATWPLYAEQQFTAFEMVVELGLAVDITLD 423

Query: 422 YRITFGEDKPRLVSAEEIKSGIKKLMGEESNEVRKKVKAKSEESRKSVMEGGSSFVSLGK 481
           Y+     ++ R+VSAEEI+SGI+KLM EE  E+RKKVKAKSEESRKS+MEGGSSF+SLG+
Sbjct: 424 YQKHPHGERSRVVSAEEIQSGIRKLM-EEGGEMRKKVKAKSEESRKSLMEGGSSFISLGR 483

Query: 482 FIDDVLANSAGGG 489
           FIDDVL N   GG
Sbjct: 484 FIDDVLGNGPEGG 492

BLAST of CsGy4G020610 vs. TrEMBL
Match: tr|A0A1S3CLZ0|A0A1S3CLZ0_CUCME (Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103502477 PE=3 SV=1)

HSP 1 Score: 617.8 bits (1592), Expect = 2.0e-173
Identity = 324/493 (65.72%), Positives = 377/493 (76.47%), Query Frame = 0

Query: 2   FELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVAECIESLSTSFAGK 61
           FEL+FIP PGIGHLASTVE+ANVL +RD RLSVT+LA+KLP D+K  E I+SLS SF GK
Sbjct: 4   FELVFIPGPGIGHLASTVELANVLASRDDRLSVTVLAIKLPNDIKTTERIQSLSASFEGK 63

Query: 62  NIQFNVLPEPPLPEESKKD----FIVLVESYKPYVREVVSNLTASAATSIDSPRLVGLVI 121
           +I+F VLPE P P +S           +ES+KP+VRE+V+NLT       DS RLVG VI
Sbjct: 64  SIRFIVLPELPFPNQSSTPPPLMLQAFLESHKPHVREIVTNLT------YDSNRLVGFVI 123

Query: 122 DMFCTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVELT 181
           DMFCT+MI+V NEF VPCY+FYT +A FLAFS +LQELY +N  N   EQL NS NVEL 
Sbjct: 124 DMFCTSMINVANEFKVPCYLFYTSNAGFLAFSFHLQELYNQN--NSTGEQLQNS-NVELA 183

Query: 182 LPNFVNPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAK---SYS 241
           LP+F+NPIPSK IP    +KD AVWFH++ KRFR  +KGILINTF EME  + K   + S
Sbjct: 184 LPSFINPIPSKAIPPFLFDKDMAVWFHDNTKRFRSGVKGILINTFVEMEPQMIKWMSNGS 243

Query: 242 QVLPPLYFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEAQ 301
             +P +Y VGP+L LK+ GV   + A N AD I+KWLDDQPP+SVV +CFG+  SFDE Q
Sbjct: 244 SKIPKVYTVGPILQLKSIGVTQCNNALNGAD-ILKWLDDQPPASVVFLCFGSKGSFDEDQ 303

Query: 302 VAEIANALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWTS 361
           V EIA ALE S VRFIWSLRQPPPKGKFE P NY DI + LPEGFL+RT  IGRVIGW  
Sbjct: 304 VLEIARALERSEVRFIWSLRQPPPKGKFEEPSNYADINDVLPEGFLNRTADIGRVIGWAP 363

Query: 362 QVEILAHPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVTL 421
           Q+EIL+HPA GGFISHCGWNS LESVWHGV +ATWP++AEQQFNAFEMVVELGLAVE+TL
Sbjct: 364 QIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTL 423

Query: 422 DYRITFGEDKPRLVSAEEIKSGIKKLMGEESNEVRKKVKAKSEESRKSVMEGGSSFVSLG 481
           DY   F   + R+VSAEEI+SGI+KLMG+  NE+RKKVK K EESRKS+M GGSSF SL 
Sbjct: 424 DYVKDFHIGRSRVVSAEEIESGIRKLMGDYGNEIRKKVKVKGEESRKSMMVGGSSFNSLD 483

Query: 482 KFIDDVLANSAGG 488
            FIDD LAN   G
Sbjct: 484 HFIDDALANLEEG 486

BLAST of CsGy4G020610 vs. TrEMBL
Match: tr|A0A0A0L321|A0A0A0L321_CUCSA (Glycosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_4G618520 PE=3 SV=1)

HSP 1 Score: 613.6 bits (1581), Expect = 3.8e-172
Identity = 320/494 (64.78%), Positives = 378/494 (76.52%), Query Frame = 0

Query: 2   FELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVK-VAECIESLSTSFAG 61
           FEL+FIP PGIGHLASTVE+ANVLV+RD RLSVT+LA+KLP D+K   E I+SLS SF G
Sbjct: 4   FELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKTTTERIQSLSASFEG 63

Query: 62  KNIQFNVLPEPPLPEESKKD----FIVLVESYKPYVREVVSNLTASAATSIDSPRLVGLV 121
           K+I+F VLPE P P +S +         +ES+KP+VRE+V+NL        DS RLVG V
Sbjct: 64  KSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIH------DSNRLVGFV 123

Query: 122 IDMFCTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVEL 181
           IDMFCT+MI+V NEF VPCY+FYT +A FL FS +LQELY +N  N   EQL NS NVEL
Sbjct: 124 IDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQN--NSTAEQLQNS-NVEL 183

Query: 182 TLPNFVNPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAK---SY 241
            LP+F+NPIP+K IP    +KD A WFH++ KRFR E+KGILINTF EME  + K   + 
Sbjct: 184 ALPSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNG 243

Query: 242 SQVLPPLYFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEA 301
           S  +P +Y VGP+L LK+ GV  S+ A N AD I+KWLDDQPP+SVV +CFG+  SFDE 
Sbjct: 244 SSKIPKVYTVGPILQLKSIGVTQSNNALNGAD-ILKWLDDQPPASVVFLCFGSKGSFDED 303

Query: 302 QVAEIANALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWT 361
           QV EIA ALE S VRF+WSLRQPPPKGKFE P NY +I + LPEGFL+RT  IGRVIGW 
Sbjct: 304 QVLEIARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWA 363

Query: 362 SQVEILAHPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVT 421
            Q+EIL+HPA GGFISHCGWNS LESVWHGV +ATWP++AEQQFNAFEMVVELGLAVE+T
Sbjct: 364 PQIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELT 423

Query: 422 LDYRITFGEDKPRLVSAEEIKSGIKKLMGEESNEVRKKVKAKSEESRKSVMEGGSSFVSL 481
           LDY   F   + R+VSAEEI+SGI+KLMG+  NE+RKK+K K EESRKS+MEGGSSF SL
Sbjct: 424 LDYVKDFHIGRSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKSMMEGGSSFNSL 483

Query: 482 GKFIDDVLANSAGG 488
             FIDD L N   G
Sbjct: 484 RHFIDDALTNLQEG 487
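
The percentages on each HSP summary line follow directly from the counts printed beside them (for example, 236/490 gives 48.16%). The following is a minimal Python sketch, not part of the database output, that parses one such summary line and recomputes both percentages. The function name `parse_hsp_stats` and the returned field names are illustrative assumptions; the pattern accepts both the "Postives" and "Positives" spellings.

```python
import re

# Matches the summary line printed under each "HSP 1 Score" header.
# "Posi?tives" tolerates the "Postives" spelling as well.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts plus reported and recomputed percentages for one HSP line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    ident, ident_len, pos, pos_len = int(ident), int(ident_len), int(pos), int(pos_len)
    return {
        "identical": ident,
        "positives": pos,
        "aligned_length": ident_len,
        "identity_reported": float(ident_pct),
        "positives_reported": float(pos_pct),
        # Recomputed from the raw counts, rounded to two decimals.
        "identity_computed": round(100.0 * ident / ident_len, 2),
        "positives_computed": round(100.0 * pos / pos_len, 2),
    }


if __name__ == "__main__":
    example = "Identity = 236/490 (48.16%), Postives = 325/490 (66.33%), Query Frame = 0"
    print(parse_hsp_stats(example))
```

Comparing the reported and recomputed values is a quick consistency check when these pages are scraped in bulk.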

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_004146065.1  6.4e-280  100.00    PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis sativus]
XP_008464636.1  2.2e-272  96.51     PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis melo]
KGN54989.1      1.5e-257  100.00    hypothetical protein Csa_4G618540 [Cucumis sativus]
XP_023521422.1  7.4e-228  80.65     anthocyanidin 3-O-glucosyltransferase 2-like [Cucurbita pepo subsp. pepo]
XP_022936724.1  3.7e-219  78.21     anthocyanidin 3-O-glucosyltransferase 2-like [Cucurbita moschata]

Match Name      E-value   Identity  Description
AT3G21760.1     3.5e-115  47.14     UDP-Glycosyltransferase superfamily protein
AT3G21790.1     3.2e-108  45.82     UDP-Glycosyltransferase superfamily protein
AT3G21780.1     1.6e-107  44.83     UDP-glucosyl transferase 71B6
AT4G15280.1     2.2e-104  42.54     UDP-glucosyl transferase 71B5
AT3G21750.1     1.3e-101  40.37     UDP-glucosyl transferase 71B1

Match Name             E-value   Identity  Description
sp|Q66PF3|UFOG3_FRAAN  2.1e-125  50.00     Putative UDP-glucose flavonoid 3-O-glucosyltransferase 3 OS=Fragaria ananassa OX...
sp|D3UAG1|U7A16_PYRCO  5.2e-124  49.90     UDP-glycosyltransferase 71A16 OS=Pyrus communis OX=23211 GN=UGT71A16 PE=1 SV=1
sp|D3THI6|U7A15_MALDO  1.4e-121  48.16     UDP-glycosyltransferase 71A15 OS=Malus domestica OX=3750 GN=UGT71A15 PE=1 SV=1
sp|Q2V6K0|UFOG6_FRAAN  2.7e-120  48.97     UDP-glucose flavonoid 3-O-glucosyltransferase 6 OS=Fragaria ananassa OX=3747 GN=...
sp|Q9LSY8|U71B2_ARATH  6.4e-114  47.14     UDP-glycosyltransferase 71B2 OS=Arabidopsis thaliana OX=3702 GN=UGT71B2 PE=1 SV=...

Match Name                      E-value   Identity  Description
tr|A0A1S3CLY5|A0A1S3CLY5_CUCME  1.5e-272  96.51     Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103502474 PE=3 SV=1
tr|A0A0A0L1T2|A0A0A0L1T2_CUCSA  1.0e-257  100.00    Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G618540 PE=4 SV=1
tr|K7NBW4|K7NBW4_SIRGR          2.0e-176  65.11     Glycosyltransferase OS=Siraitia grosvenorii OX=190515 GN=UDPG7 PE=2 SV=1
tr|A0A1S3CLZ0|A0A1S3CLZ0_CUCME  2.0e-173  65.72     Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103502477 PE=3 SV=1
tr|A0A0A0L321|A0A0A0L321_CUCSA  3.8e-172  64.78     Glycosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_4G618520 PE=3 SV=1

The following terms have been associated with this gene:
Vocabulary: Biological Process
Term        Definition
GO:0008152  metabolic process
Vocabulary: Molecular Function
Term        Definition
GO:0016758  transferase activity, transferring hexosyl groups
Vocabulary: INTERPRO
Term        Definition
IPR002213   UDP_glucos_trans
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016758 transferase activity, transferring hexosyl groups

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CsGy4G020610.1  CsGy4G020610.1  mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR Term   IPR Description                           Source       Source Term         Source Description                              Alignment
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  PFAM         PF00201             UDPGT                                           coord: 271..410; e-value: 1.2E-21; score: 77.1
None       No IPR available                          GENE3D       G3DSA:3.40.50.2000  -                                               coord: 254..461; e-value: 6.7E-132; score: 442.6
None       No IPR available                          GENE3D       G3DSA:3.40.50.2000  -                                               coord: 5..253; e-value: 6.7E-132; score: 442.6
                                                                                                                                      coord: 462..470; e-value: 6.7E-132; score: 442.6
None       No IPR available                          PANTHER      PTHR11926:SF753     UDP-GLYCOSYLTRANSFERASE 71B1-RELATED            coord: 2..481
None       No IPR available                          PANTHER      PTHR11926           GLUCOSYL/GLUCURONOSYL TRANSFERASES              coord: 2..481
None       No IPR available                          SUPERFAMILY  SSF53756            UDP-Glycosyltransferase/glycogen phosphorylase  coord: 4..481
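
Each Alignment entry above gives the matched region as `coord: start..end`. As a minimal Python sketch (an illustration, not part of the annotation pipeline), the segments can be pulled out of the text and their lengths computed. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function name `segment_lengths` is hypothetical.

```python
import re

# One "coord: start..end" segment as printed in the Alignment column.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def segment_lengths(text):
    """Return (start, end, length) for every coord segment found in text.

    Assumes 1-based inclusive coordinates, so length = end - start + 1.
    """
    segments = []
    for start, end in COORD.findall(text):
        start, end = int(start), int(end)
        segments.append((start, end, end - start + 1))
    return segments


if __name__ == "__main__":
    # PFAM PF00201 match reported above.
    print(segment_lengths("coord: 271..410"))
    # The two-segment GENE3D match reported above.
    print(segment_lengths("coord: 5..253 coord: 462..470"))
```

Under that assumption the PF00201 match covers 140 residues, and the two-part GENE3D match covers 249 and 9 residues.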

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None