CsGy4G020640 (gene) Cucumber (Gy14) v2

NameCsGy4G020640
Typegene
OrganismCucumis sativus (Cucumber (Gy14) v2)
DescriptionGlycosyltransferase
LocationChr4 : 27461254 .. 27463713 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATTCTCTGTAAATTTGTAAGTAGTTCATTACCAGATCAAGATCTTTGAGCTCATTTTCATATCTGCCCCTTGGAATTGGTCACCTTGCATTCACCATCAAAATGGCCAACTGAAAACATTTTTTTATCAAAATATATGAGACTGTTGTGTAATTTTTTTTTATCCTTTCACCATGAGAAAGAAATGACAAAATATATGAGATTGATTCTTGTAAAATGTTAAAAAAGGATATCTTCTTATAAACACTATCATTATGTCACATGCAATTCAAACCATTCAATCTTCCATAAACATTATCCACTTGGTTTCATTTCAAAACTCTCTCTCCTTTCTACAACACAACCAAATCCCATCAATTAGTTACTTCTACACATTGCAAACTAAACATATCTCCCATTTCCTTTAGTTTTCTCTGCAAGCCTGCTCCAAATGAAGAAATTTGAGCTTGTTTTCATACCAATACCGGGGTCTGGTCACCTTGCTTCCATGGTTGAGATGGCAAATACTCTCCTCGCTCGAGATCATCGTCTTGCTGTCACAATGATTGCCTTTAAGCTACCCCTTGATCCCAAAGCTAATGAATATATTCAATCCCTTTCTGCACAGTCTCTTACCAACAACAACTCCATACAATTCATTGTCCTTCCTGAATTACCTGATATCCCAAACAATGGGAACCGTTTCTTCCTGGAAGTAGTTCTTGAAAGCTACAAACCCCATGTCAAACAAGCTCTTATCTCCTTTCTTACTACCTCCACCAACCATCTTGCTGGATTCGTGTTGGACTCGTTTTGCTCAACCATGGTTGATGTAGCTAACGAATTTAAGGTCCCTTCTTATGTGTACTACACTTCTTGTGCTGCCTATCTTGCTTTTAGTTTACATCTTGAACAACTCTACACACAAGATAATAGTAGTAATGAGGTAATTCAACAATTGAAGGATTCAGATGTTAATTTGAGTGTACCAAGTTTAGTGAATCAAGTTCCAAGTAAAACCATTCCAAGTGTCTTCTTTATTAACAATTTTGCTGTTTGGTTTCATGAACAAGCTAAAAGAATTAGATTTGATGTAAAAGGTGTTCTTATCAATACATTTGAGGAGCTGGAATCACATGCGTTATCTTCTTTGTCAACTGACTCCTCTTTGCAACTCCCACCTTTGTATTCTGTCGGACCTGTTTTGCACTTGAACAAGAACACTGAGACTATGGATGATGGAGATGTGTTGAAGTGGCTTGATGATCAACCACTTTCATCGGTGGTGTTTTTGTGCTTTGGAAGTAGAGGAGCTTTCAAAAAGGATCAGGTGGAGGAGATTGCACGAGCGCTTGAGAGAAGTAGAGTTCGTTTCATTTGGTCTCTTCGACGACCAGGGAATGTGTTTCAATCATCAATCGACTATACAAATTTTGAAGACATCTTACCTAAGGGATTTCTTGATCGAACACAGAACATTGGGAGAGTCATCAGCTGGGCACCGCAAGTGGAGATATTAGGCCATCCAGCCACAGGTAAAGATCCTACAACTTGGATTTAGAAGTGAAAATATCTTAAGTGTATATATATGTATATATAAGAAAACATCCTTCAAATACAATTTTTTATTGATGTCTTTGTAAAATTGAAACACTAATACAGATATTTTAATTCTTGGTCATAGGTGGGTTCGTATCACATTGTGGTTGGAACTCGACGTTGGAAAGTTTGTGGCATGGCGTGCCGATGGCAACATGGCCAATGTATGCAGAGCAACAATTCAACGCATTTGATCTGGTGGTAGAATTGGGATTGGCTGTGGAGATCAAGATAAGTTATTGTATTGAACTTAAAGAACAAGCCAACCCAATAATAATGGCAGAAGAGATAGAAAGAGGAATTAGAAAGTTGATGGACAACAACAAAAATGAGATAAGGAAGAAAGTGAAAACAAAAAGTGAAGAATGCAGAAAAAGTGTAATAGAAGGTGGATCCTCTTTCATCTCATTAGGAAAATTTATTGATGATGTTTTGAGCAACTCTACAACAGGAGGAAACTAAGGATGTAGTAGGCAGTGACAAATTGAAAAGTTATAACTTGGTATCGAGGAAAAAGTTATAACTTGGGTGTGAAGGAGGCTAGCTAGAAGCCAAATTTATCACTAGACCTAGCTAATTGTAAGGAACGTAAAGTTGAGGAGGGATTGAGCCCCTTGAACATATAGTATATATGTGTAAGTGTCGTTAGAGTAGTTGTTTCATTTTATTAGTCAAATTTTTGGATACATCCATCGCTCTACCCATCTATCTACAACCCATCCAATTCTCCAATCATTTGTTTGCAACGTCACTTACTATTTTTTCTTTTTTTCTTAAATATTGTTTACCTTTTTCTATTTCGTTGTTGGAGTGAGATTCATAAATATCAGCGCAATTCAAGATGATATAGTTTGAGATAATAAAATGGGGCATTTGCAAAAAT

mRNA sequence

ATTCTCTGTAAATTTGTAAGTAGTTCATTACCAGATCAAGATCTTTGAGCTCATTTTCATATCTGCCCCTTGGAATTGGTCACCTTGCATTCACCATCAAAATGGCCAACTGAAAACATTTTTTTATCAAAATATATGAGACTGTTGTGTAATTTTTTTTTATCCTTTCACCATGAGAAAGAAATGACAAAATATATGAGATTGATTCTTGTAAAATGTTAAAAAAGGATATCTTCTTATAAACACTATCATTATGTCACATGCAATTCAAACCATTCAATCTTCCATAAACATTATCCACTTGGTTTCATTTCAAAACTCTCTCTCCTTTCTACAACACAACCAAATCCCATCAATTAGTTACTTCTACACATTGCAAACTAAACATATCTCCCATTTCCTTTAGTTTTCTCTGCAAGCCTGCTCCAAATGAAGAAATTTGAGCTTGTTTTCATACCAATACCGGGGTCTGGTCACCTTGCTTCCATGGTTGAGATGGCAAATACTCTCCTCGCTCGAGATCATCGTCTTGCTGTCACAATGATTGCCTTTAAGCTACCCCTTGATCCCAAAGCTAATGAATATATTCAATCCCTTTCTGCACAGTCTCTTACCAACAACAACTCCATACAATTCATTGTCCTTCCTGAATTACCTGATATCCCAAACAATGGGAACCGTTTCTTCCTGGAAGTAGTTCTTGAAAGCTACAAACCCCATGTCAAACAAGCTCTTATCTCCTTTCTTACTACCTCCACCAACCATCTTGCTGGATTCGTGTTGGACTCGTTTTGCTCAACCATGGTTGATGTAGCTAACGAATTTAAGGTCCCTTCTTATGTGTACTACACTTCTTGTGCTGCCTATCTTGCTTTTAGTTTACATCTTGAACAACTCTACACACAAGATAATAGTAGTAATGAGGTAATTCAACAATTGAAGGATTCAGATGTTAATTTGAGTGTACCAAGTTTAGTGAATCAAGTTCCAAGTAAAACCATTCCAAGTGTCTTCTTTATTAACAATTTTGCTGTTTGGTTTCATGAACAAGCTAAAAGAATTAGATTTGATGTAAAAGGTGTTCTTATCAATACATTTGAGGAGCTGGAATCACATGCGTTATCTTCTTTGTCAACTGACTCCTCTTTGCAACTCCCACCTTTGTATTCTGTCGGACCTGTTTTGCACTTGAACAAGAACACTGAGACTATGGATGATGGAGATGTGTTGAAGTGGCTTGATGATCAACCACTTTCATCGGTGGTGTTTTTGTGCTTTGGAAGTAGAGGAGCTTTCAAAAAGGATCAGGTGGAGGAGATTGCACGAGCGCTTGAGAGAAGTAGAGTTCGTTTCATTTGGTCTCTTCGACGACCAGGGAATGTGTTTCAATCATCAATCGACTATACAAATTTTGAAGACATCTTACCTAAGGGATTTCTTGATCGAACACAGAACATTGGGAGAGTCATCAGCTGGGCACCGCAAGTGGAGATATTAGGCCATCCAGCCACAGGTGGGTTCGTATCACATTGTGGTTGGAACTCGACGTTGGAAAGTTTGTGGCATGGCGTGCCGATGGCAACATGGCCAATGTATGCAGAGCAACAATTCAACGCATTTGATCTGGTGGTAGAATTGGGATTGGCTGTGGAGATCAAGATAAGTTATTGTATTGAACTTAAAGAACAAGCCAACCCAATAATAATGGCAGAAGAGATAGAAAGAGGAATTAGAAAGTTGATGGACAACAACAAAAATGAGATAAGGAAGAAAGTGAAAACAAAAAGTGAAGAATGCAGAAAAAGTGTAATAGAAGGTGGATCCTCTTTCATCTCATTAGGAAAATTTATTGATGATGTTTTGAGCAACTCTACAACAGGAGGAAACTAAGGATGTAGTAGGCAGTGACAAATTGAAAAGTTATAACTTGGTATCGAGGAAAAAGTTATAACTTGGGTGTGAAGGAGGCTAGCTAGAAGCCAAATTTATCACTAGACCTAGCTAATTGTAAGGAACGTAAAGTTGAGGAGGGATTGAGCCCCTTGAACATATAGTATATATGTGTAAGTGTCGTTAGAGTAGTTGTTTCATTTTATTAGTCAAATTTTTGGATACATCCATCGCTCTACCCATCTATCTACAACCCATCCAATTCTCCAATCATTTGTTTGCAACGTCACTTACTATTTTTTCTTTTTTTCTTAAATATTGTTTACCTTTTTCTATTTCGTTGTTGGAGTGAGATTCATAAATATCAGCGCAATTCAAGATGATATAGTTTGAGATAATAAAATGGGGCATTTGCAAAAAT

Coding sequence (CDS)

ATGAAGAAATTTGAGCTTGTTTTCATACCAATACCGGGGTCTGGTCACCTTGCTTCCATGGTTGAGATGGCAAATACTCTCCTCGCTCGAGATCATCGTCTTGCTGTCACAATGATTGCCTTTAAGCTACCCCTTGATCCCAAAGCTAATGAATATATTCAATCCCTTTCTGCACAGTCTCTTACCAACAACAACTCCATACAATTCATTGTCCTTCCTGAATTACCTGATATCCCAAACAATGGGAACCGTTTCTTCCTGGAAGTAGTTCTTGAAAGCTACAAACCCCATGTCAAACAAGCTCTTATCTCCTTTCTTACTACCTCCACCAACCATCTTGCTGGATTCGTGTTGGACTCGTTTTGCTCAACCATGGTTGATGTAGCTAACGAATTTAAGGTCCCTTCTTATGTGTACTACACTTCTTGTGCTGCCTATCTTGCTTTTAGTTTACATCTTGAACAACTCTACACACAAGATAATAGTAGTAATGAGGTAATTCAACAATTGAAGGATTCAGATGTTAATTTGAGTGTACCAAGTTTAGTGAATCAAGTTCCAAGTAAAACCATTCCAAGTGTCTTCTTTATTAACAATTTTGCTGTTTGGTTTCATGAACAAGCTAAAAGAATTAGATTTGATGTAAAAGGTGTTCTTATCAATACATTTGAGGAGCTGGAATCACATGCGTTATCTTCTTTGTCAACTGACTCCTCTTTGCAACTCCCACCTTTGTATTCTGTCGGACCTGTTTTGCACTTGAACAAGAACACTGAGACTATGGATGATGGAGATGTGTTGAAGTGGCTTGATGATCAACCACTTTCATCGGTGGTGTTTTTGTGCTTTGGAAGTAGAGGAGCTTTCAAAAAGGATCAGGTGGAGGAGATTGCACGAGCGCTTGAGAGAAGTAGAGTTCGTTTCATTTGGTCTCTTCGACGACCAGGGAATGTGTTTCAATCATCAATCGACTATACAAATTTTGAAGACATCTTACCTAAGGGATTTCTTGATCGAACACAGAACATTGGGAGAGTCATCAGCTGGGCACCGCAAGTGGAGATATTAGGCCATCCAGCCACAGGTGGGTTCGTATCACATTGTGGTTGGAACTCGACGTTGGAAAGTTTGTGGCATGGCGTGCCGATGGCAACATGGCCAATGTATGCAGAGCAACAATTCAACGCATTTGATCTGGTGGTAGAATTGGGATTGGCTGTGGAGATCAAGATAAGTTATTGTATTGAACTTAAAGAACAAGCCAACCCAATAATAATGGCAGAAGAGATAGAAAGAGGAATTAGAAAGTTGATGGACAACAACAAAAATGAGATAAGGAAGAAAGTGAAAACAAAAAGTGAAGAATGCAGAAAAAGTGTAATAGAAGGTGGATCCTCTTTCATCTCATTAGGAAAATTTATTGATGATGTTTTGAGCAACTCTACAACAGGAGGAAACTAA

Protein sequence

MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYIQSLSAQSLTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTSTNHLAGFVLDSFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNLSVPSLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTDSSLQLPPLYSVGPVLHLNKNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKKDQVEEIARALERSRVRFIWSLRRPGNVFQSSIDYTNFEDILPKGFLDRTQNIGRVISWAPQVEILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKISYCIELKEQANPIIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRKSVIEGGSSFISLGKFIDDVLSNSTTGGN
BLAST of CsGy4G020640 vs. NCBI nr
Match: XP_004146066.1 (PREDICTED: anthocyanidin 3-O-glucosyltransferase 2 [Cucumis sativus] >KGN54992.1 hypothetical protein Csa_4G620550 [Cucumis sativus])

HSP 1 Score: 967.6 bits (2500), Expect = 1.6e-278
Identity = 486/486 (100.00%), Postives = 486/486 (100.00%), Query Frame = 0

Query: 1   MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYIQSLSAQS 60
           MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYIQSLSAQS
Sbjct: 1   MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYIQSLSAQS 60

Query: 61  LTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTSTNHLAGFVLDS 120
           LTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTSTNHLAGFVLDS
Sbjct: 61  LTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTSTNHLAGFVLDS 120

Query: 121 FCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNLSVP 180
           FCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNLSVP
Sbjct: 121 FCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNLSVP 180

Query: 181 SLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTDSSL 240
           SLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTDSSL
Sbjct: 181 SLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTDSSL 240

Query: 241 QLPPLYSVGPVLHLNKNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKKDQVEEIARA 300
           QLPPLYSVGPVLHLNKNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKKDQVEEIARA
Sbjct: 241 QLPPLYSVGPVLHLNKNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKKDQVEEIARA 300

Query: 301 LERSRVRFIWSLRRPGNVFQSSIDYTNFEDILPKGFLDRTQNIGRVISWAPQVEILGHPA 360
           LERSRVRFIWSLRRPGNVFQSSIDYTNFEDILPKGFLDRTQNIGRVISWAPQVEILGHPA
Sbjct: 301 LERSRVRFIWSLRRPGNVFQSSIDYTNFEDILPKGFLDRTQNIGRVISWAPQVEILGHPA 360

Query: 361 TGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKISYCIELKEQ 420
           TGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKISYCIELKEQ
Sbjct: 361 TGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKISYCIELKEQ 420

Query: 421 ANPIIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRKSVIEGGSSFISLGKFIDDVLSN 480
           ANPIIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRKSVIEGGSSFISLGKFIDDVLSN
Sbjct: 421 ANPIIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRKSVIEGGSSFISLGKFIDDVLSN 480

Query: 481 STTGGN 487
           STTGGN
Sbjct: 481 STTGGN 486

BLAST of CsGy4G020640 vs. NCBI nr
Match: XP_008464637.1 (PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis melo])

HSP 1 Score: 836.3 bits (2159), Expect = 5.4e-239
Identity = 427/488 (87.50%), Postives = 447/488 (91.60%), Query Frame = 0

Query: 1   MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYIQSLSAQS 60
           MKKFELVFIPIPGSGHLASM EMAN+LLARDHRLAVTMIA KLPLD K NEYIQSL AQS
Sbjct: 1   MKKFELVFIPIPGSGHLASMFEMANSLLARDHRLAVTMIAIKLPLDAKVNEYIQSLYAQS 60

Query: 61  LTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTSTNHLAGFVLDS 120
           LT NNSI+FI+LPELP  PN+ N+ F EVVLESYKPHVKQALISFLTTSTNHL GFVLDS
Sbjct: 61  LT-NNSIKFIILPELPPPPNDENKIFFEVVLESYKPHVKQALISFLTTSTNHLVGFVLDS 120

Query: 121 FCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNLSVP 180
           FC TMVDVANEFKVPSYVYYTS AAYLAFSLHLEQLYTQDNSSNEVIQQ KDS+VN SV 
Sbjct: 121 FCLTMVDVANEFKVPSYVYYTSSAAYLAFSLHLEQLYTQDNSSNEVIQQSKDSNVNFSVS 180

Query: 181 SLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTDSSL 240
           SLVNQVPSK IPSVFFINNFAVWFHEQAKRIRFDVKGVLINTF+ELESH +SSLSTDSSL
Sbjct: 181 SLVNQVPSKVIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFDELESHVISSLSTDSSL 240

Query: 241 QLPPLYSVGPVLHLNKNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKKDQVEEIARA 300
           QLPPLY VGP+LHLNKNTETMDD  VLKWLDDQPL SVVFLCFGSRGAF+KDQVEEIARA
Sbjct: 241 QLPPLYPVGPILHLNKNTETMDDRVVLKWLDDQPLQSVVFLCFGSRGAFQKDQVEEIARA 300

Query: 301 LERSRVRFIWSLRRP-GNVFQSSIDYTNFEDILPKGFLDRTQNIGRVISWAPQVEILGHP 360
           LERSRVRFIWSLRRP G+VFQSSIDYTNFEDILP+GFLDRT+NIGRVI WAPQVEILGHP
Sbjct: 301 LERSRVRFIWSLRRPSGDVFQSSIDYTNFEDILPEGFLDRTKNIGRVIKWAPQVEILGHP 360

Query: 361 ATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKISYCIELKE 420
             GGFVSHCGWNSTLESLW+G+PMATWPMYAEQQFNAF+LVVELGLAVEI I Y  +LKE
Sbjct: 361 TIGGFVSHCGWNSTLESLWYGIPMATWPMYAEQQFNAFELVVELGLAVEITIDYQNDLKE 420

Query: 421 QANP-IIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRKSVIEGGSSFISLGKFIDDVL 480
              P I+ AEEIE+GIRKLMD+N NEIRKKVKTKSEECRKSVIEGGSSFISLGKFIDDVL
Sbjct: 421 LDKPRILSAEEIEKGIRKLMDDNNNEIRKKVKTKSEECRKSVIEGGSSFISLGKFIDDVL 480

Query: 481 SNSTTGGN 487
            NS  G N
Sbjct: 481 INSPRGAN 487

BLAST of CsGy4G020640 vs. NCBI nr
Match: XP_023535691.1 (anthocyanidin 3-O-glucosyltransferase 2-like isoform X2 [Cucurbita pepo subsp. pepo] >XP_023535692.1 anthocyanidin 3-O-glucosyltransferase 2-like isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 628.6 bits (1620), Expect = 1.7e-176
Identity = 319/486 (65.64%), Postives = 384/486 (79.01%), Query Frame = 0

Query: 1   MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYIQSLSAQS 60
           M KFELVFIP+PG GHL S VEMA  L+ RD RL++TM+  K+P D KA+EYIQSLS +S
Sbjct: 1   MNKFELVFIPMPGMGHLVSTVEMATLLVTRDPRLSITMLGMKMPFDSKASEYIQSLS-ES 60

Query: 61  LTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTSTNHLAGFVLDS 120
           L+NN S++ IVLPELP  P +     L+V+L+SYKPHVK+A+ S L   TN LAGFVLD 
Sbjct: 61  LSNNPSLRLIVLPELP-APKDSKDLLLKVLLDSYKPHVKEAVSSLL---TNPLAGFVLDM 120

Query: 121 FCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNLSVP 180
           F +TMVDVA E  VPSYVYYTS AAYLAFSLHLE++Y Q NS+  V  Q K+ D +L V 
Sbjct: 121 FTTTMVDVAKELGVPSYVYYTSSAAYLAFSLHLEEIYRQKNSNEAVNPQFKNPDFDLRVS 180

Query: 181 SLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTDSSL 240
           SL++ +PSK IP +FF+   AVW +E+ KR+R ++KG+LINTFEELESH + SLS+DSSL
Sbjct: 181 SLIHPIPSKVIPGIFFMEKGAVWIYEETKRLRTEMKGILINTFEELESHVMCSLSSDSSL 240

Query: 241 QLPPLYSVGPVLHLNKNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKKDQVEEIARA 300
            LPPLYS+GP+LHLN N    D  DVLKWLDDQP SSVVFLCFGSRG+F+K QVEEIA  
Sbjct: 241 NLPPLYSIGPILHLNNN--KTDRADVLKWLDDQPPSSVVFLCFGSRGSFEKGQVEEIAEG 300

Query: 301 LERSRVRFIWSLRR--PGNVFQSSIDYTNFEDILPKGFLDRTQNIGRVISWAPQVEILGH 360
           LERS VRF+W+LR+  P  VFQ   DYT+F+DILP+GFLDRT  +GRVI WAPQVEILGH
Sbjct: 301 LERSGVRFVWTLRKPPPKEVFQDPTDYTDFKDILPEGFLDRTAEVGRVIGWAPQVEILGH 360

Query: 361 PATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKISYCIELK 420
           PATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNA+++VVELGLAVEI + Y  E  
Sbjct: 361 PATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAYEMVVELGLAVEITVEYRKEGA 420

Query: 421 EQANPIIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRKSVIEGGSSFISLGKFIDDVL 480
                ++  EEIE+GIRKLM+ + +E+RKKVK  SEE R+ V+EGGSS++S+GKFI+DVL
Sbjct: 421 SDEPRVVSGEEIEKGIRKLMEED-SEVRKKVKGVSEESRRCVMEGGSSYVSMGKFIEDVL 478

Query: 481 SNSTTG 485
           + S  G
Sbjct: 481 ATSPRG 478

BLAST of CsGy4G020640 vs. NCBI nr
Match: XP_023535689.1 (anthocyanidin 3-O-glucosyltransferase 2-like isoform X1 [Cucurbita pepo subsp. pepo] >XP_023535690.1 anthocyanidin 3-O-glucosyltransferase 2-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 628.6 bits (1620), Expect = 1.7e-176
Identity = 319/486 (65.64%), Postives = 384/486 (79.01%), Query Frame = 0

Query: 1   MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYIQSLSAQS 60
           M KFELVFIP+PG GHL S VEMA  L+ RD RL++TM+  K+P D KA+EYIQSLS +S
Sbjct: 63  MNKFELVFIPMPGMGHLVSTVEMATLLVTRDPRLSITMLGMKMPFDSKASEYIQSLS-ES 122

Query: 61  LTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTSTNHLAGFVLDS 120
           L+NN S++ IVLPELP  P +     L+V+L+SYKPHVK+A+ S L   TN LAGFVLD 
Sbjct: 123 LSNNPSLRLIVLPELP-APKDSKDLLLKVLLDSYKPHVKEAVSSLL---TNPLAGFVLDM 182

Query: 121 FCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNLSVP 180
           F +TMVDVA E  VPSYVYYTS AAYLAFSLHLE++Y Q NS+  V  Q K+ D +L V 
Sbjct: 183 FTTTMVDVAKELGVPSYVYYTSSAAYLAFSLHLEEIYRQKNSNEAVNPQFKNPDFDLRVS 242

Query: 181 SLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTDSSL 240
           SL++ +PSK IP +FF+   AVW +E+ KR+R ++KG+LINTFEELESH + SLS+DSSL
Sbjct: 243 SLIHPIPSKVIPGIFFMEKGAVWIYEETKRLRTEMKGILINTFEELESHVMCSLSSDSSL 302

Query: 241 QLPPLYSVGPVLHLNKNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKKDQVEEIARA 300
            LPPLYS+GP+LHLN N    D  DVLKWLDDQP SSVVFLCFGSRG+F+K QVEEIA  
Sbjct: 303 NLPPLYSIGPILHLNNN--KTDRADVLKWLDDQPPSSVVFLCFGSRGSFEKGQVEEIAEG 362

Query: 301 LERSRVRFIWSLRR--PGNVFQSSIDYTNFEDILPKGFLDRTQNIGRVISWAPQVEILGH 360
           LERS VRF+W+LR+  P  VFQ   DYT+F+DILP+GFLDRT  +GRVI WAPQVEILGH
Sbjct: 363 LERSGVRFVWTLRKPPPKEVFQDPTDYTDFKDILPEGFLDRTAEVGRVIGWAPQVEILGH 422

Query: 361 PATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKISYCIELK 420
           PATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNA+++VVELGLAVEI + Y  E  
Sbjct: 423 PATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAYEMVVELGLAVEITVEYRKEGA 482

Query: 421 EQANPIIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRKSVIEGGSSFISLGKFIDDVL 480
                ++  EEIE+GIRKLM+ + +E+RKKVK  SEE R+ V+EGGSS++S+GKFI+DVL
Sbjct: 483 SDEPRVVSGEEIEKGIRKLMEED-SEVRKKVKGVSEESRRCVMEGGSSYVSMGKFIEDVL 540

Query: 481 SNSTTG 485
           + S  G
Sbjct: 543 ATSPRG 540

BLAST of CsGy4G020640 vs. NCBI nr
Match: XP_022976770.1 (anthocyanidin 3-O-glucosyltransferase 2-like [Cucurbita maxima])

HSP 1 Score: 622.9 bits (1605), Expect = 9.5e-175
Identity = 317/487 (65.09%), Postives = 383/487 (78.64%), Query Frame = 0

Query: 1   MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYIQSLSAQS 60
           M KFELVFIP+PG GHL S VEMA  L+ RD RL++TM+  KLP D KA+EYIQSLS +S
Sbjct: 1   MNKFELVFIPMPGMGHLVSTVEMATLLVTRDPRLSITMLGMKLPFDSKASEYIQSLS-ES 60

Query: 61  LTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTSTNHLAGFVLDS 120
           L+NN S++ I LPELP +P +     L+V+L+SYKPHVK+A+ S L   TN LAGFVLD 
Sbjct: 61  LSNNPSLRLIGLPELP-VPKDSKDLLLKVLLDSYKPHVKEAVSSLL---TNPLAGFVLDM 120

Query: 121 FCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNLSVP 180
           F +TMVDVA E  +PSYVYYTS AAYLAFSLHLE++Y Q NS+  V  Q KD D +L V 
Sbjct: 121 FTTTMVDVAKELGIPSYVYYTSSAAYLAFSLHLEEIYRQKNSNEAVNPQFKDPDFDLRVS 180

Query: 181 SLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTDSSL 240
           SL++ +PSK IP +FF+   AVW +E+ KR+R ++KG+LINTFEELESH + SLS+DSS 
Sbjct: 181 SLIHPIPSKVIPGIFFMEKGAVWVYEETKRLRTEMKGILINTFEELESHVICSLSSDSSF 240

Query: 241 QLPPLYSVGPVLHLNKN-TETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKKDQVEEIAR 300
            LPPLYS+GP+LHLN N  E  D   VLKW+D+QP SSVVFLCFGSRG+F+K QVEEIA 
Sbjct: 241 NLPPLYSIGPILHLNNNKIEGTDRAGVLKWMDEQPTSSVVFLCFGSRGSFEKGQVEEIAD 300

Query: 301 ALERSRVRFIWSLRR--PGNVFQSSIDYTNFEDILPKGFLDRTQNIGRVISWAPQVEILG 360
            LERS VRF+W+LR+  P  VFQ   DYT+F+DILP+GFLDRT  +GRVI WAPQVEILG
Sbjct: 301 GLERSGVRFVWTLRKPPPKEVFQDPTDYTDFKDILPEGFLDRTAEVGRVIGWAPQVEILG 360

Query: 361 HPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKISYCIEL 420
           HPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNA+++VVELGLAVEI + Y  E 
Sbjct: 361 HPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAYEMVVELGLAVEITVEYRKEG 420

Query: 421 KEQANPIIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRKSVIEGGSSFISLGKFIDDV 480
                 ++  EEIE+GIRKLM+ + +E+RKKVK  SE  R+SV+EGGSS IS+GKFI+DV
Sbjct: 421 ASDEPRVVSGEEIEKGIRKLMEED-SEVRKKVKGVSEGSRRSVMEGGSSHISMGKFIEDV 480

Query: 481 LSNSTTG 485
           L++S  G
Sbjct: 481 LASSPEG 481

BLAST of CsGy4G020640 vs. TAIR10
Match: AT3G21760.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 406.8 bits (1044), Expect = 1.9e-113
Identity = 232/492 (47.15%), Postives = 311/492 (63.21%), Query Frame = 0

Query: 3   KFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLP---LDPKANEYIQSLSAQ 62
           K ELVFIP PG GHL  +VE+A   + RD  L++T+I            ++ YI SLS+ 
Sbjct: 2   KLELVFIPSPGDGHLRPLVEVAKLHVDRDDHLSITIIIIPQMHGFSSSNSSSYIASLSSD 61

Query: 63  SLTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLT-----TSTNHLA 122
           S     S   + +P+ PD  +    FF    ++++KP VK A +  LT      S + LA
Sbjct: 62  S-EERLSYNVLSVPDKPDSDDTKPHFF--DYIDNFKPQVK-ATVEKLTDPGPPDSPSRLA 121

Query: 123 GFVLDSFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSD 182
           GFV+D FC  M+DVANEF VPSY++YTS A +L   +H+E LY   +  N  +  LKDSD
Sbjct: 122 GFVVDMFCMMMIDVANEFGVPSYMFYTSNATFLGLQVHVEYLY---DVKNYDVSDLKDSD 181

Query: 183 -VNLSVPSLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSS 242
              L VP L   +P K  PSV     +      Q +R R + KG+L+NTF ELE  A+  
Sbjct: 182 TTELEVPCLTRPLPVKCFPSVLLTKEWLPVMFRQTRRFR-ETKGILVNTFAELEPQAMKF 241

Query: 243 LSTDSSLQLPPLYSVGPVLHLNKNTETMDD---GDVLKWLDDQPLSSVVFLCFGSRGAFK 302
            S   S  LP +Y+VGPV++L  N     D    ++L+WLD+QP  SVVFLCFGS G F+
Sbjct: 242 FSGVDS-PLPTVYTVGPVMNLKINGPNSSDDKQSEILRWLDEQPRKSVVFLCFGSMGGFR 301

Query: 303 KDQVEEIARALERSRVRFIWSLRR--PGNVFQSSIDYTNFEDILPKGFLDRTQNIGRVIS 362
           + Q +EIA ALERS  RF+WSLRR  P        ++TN E+ILP+GFL+RT  IG+++ 
Sbjct: 302 EGQAKEIAIALERSGHRFVWSLRRAQPKGSIGPPEEFTNLEEILPEGFLERTAEIGKIVG 361

Query: 363 WAPQVEILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVE 422
           WAPQ  IL +PA GGFVSHCGWNSTLESLW GVPMATWP+YAEQQ NAF++V ELGLAVE
Sbjct: 362 WAPQSAILANPAIGGFVSHCGWNSTLESLWFGVPMATWPLYAEQQVNAFEMVEELGLAVE 421

Query: 423 IKISYCIELKEQANPIIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRKSVIEGGSSFI 481
           ++ S+  +     + ++ AEEIERGIR LM+ + +++R +VK  SE+   ++++GGSS +
Sbjct: 422 VRNSFRGDFMAADDELMTAEEIERGIRCLMEQD-SDVRSRVKEMSEKSHVALMDGGSSHV 481

BLAST of CsGy4G020640 vs. TAIR10
Match: AT3G21790.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 391.7 bits (1005), Expect = 6.4e-109
Identity = 226/491 (46.03%), Postives = 323/491 (65.78%), Query Frame = 0

Query: 3   KFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPK--ANEYIQSLSAQS 62
           KFELVFIP PG GHL S VEMA  L+ R+ RL++++I      + +  A++YI +LSA S
Sbjct: 2   KFELVFIPYPGIGHLRSTVEMAKLLVDRETRLSISVIILPFISEGEVGASDYIAALSASS 61

Query: 63  LTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTSTN-----HLAG 122
              NN +++ V+  + D P       +E+ +++ +P V+  +   L   ++      +AG
Sbjct: 62  ---NNRLRYEVISAV-DQPTI-EMTTIEIHMKNQEPKVRSTVAKLLEDYSSKPDSPKIAG 121

Query: 123 FVLDSFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDV 182
           FVLD FC++MVDVANEF  PSY++YTS A  L+ + H+ Q+   +N  +       DS+ 
Sbjct: 122 FVLDMFCTSMVDVANEFGFPSYMFYTSSAGILSVTYHV-QMLCDENKYDVSENDYADSEA 181

Query: 183 NLSVPSLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLS 242
            L+ PSL    P K +P     N +   F  QA++ R ++KG+L+NT  ELE + L  LS
Sbjct: 182 VLNFPSLSRPYPVKCLPHALAANMWLPVFVNQARKFR-EMKGILVNTVAELEPYVLKFLS 241

Query: 243 TDSSLQLPPLYSVGPVLHL-NKNTETMDDG--DVLKWLDDQPLSSVVFLCFGSRGAFKKD 302
           +  +   PP+Y VGP+LHL N+  ++ D+   ++++WLD QP SSVVFLCFGS G F ++
Sbjct: 242 SSDT---PPVYPVGPLLHLENQRDDSKDEKRLEIIRWLDQQPPSSVVFLCFGSMGGFGEE 301

Query: 303 QVEEIARALERSRVRFIWSLRRPG-NVFQS-SIDYTNFEDILPKGFLDRTQNIGRVISWA 362
           QV EIA ALERS  RF+WSLRR   N+F+    ++TN E++LP+GF DRT++IG+VI WA
Sbjct: 302 QVREIAIALERSGHRFLWSLRRASPNIFKELPGEFTNLEEVLPEGFFDRTKDIGKVIGWA 361

Query: 363 PQVEILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIK 422
           PQV +L +PA GGFV+HCGWNSTLESLW GVP A WP+YAEQ+FNAF +V ELGLAVEI+
Sbjct: 362 PQVAVLANPAIGGFVTHCGWNSTLESLWFGVPTAAWPLYAEQKFNAFLMVEELGLAVEIR 421

Query: 423 ISYCIE-LKEQANPIIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRKSVIEGGSSFIS 481
             +  E L       + AEEIE+ I  LM+ + +++RK+VK  SE+C  ++++GGSS  +
Sbjct: 422 KYWRGEHLAGLPTATVTAEEIEKAIMCLMEQD-SDVRKRVKDMSEKCHVALMDGGSSRTA 481

BLAST of CsGy4G020640 vs. TAIR10
Match: AT3G21780.1 (UDP-glucosyl transferase 71B6)

HSP 1 Score: 387.9 bits (995), Expect = 9.3e-108
Identity = 223/486 (45.88%), Postives = 312/486 (64.20%), Query Frame = 0

Query: 3   KFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYIQSLSAQSLT 62
           K ELVFIP P   HL + VEMA  L+ ++  L++T+I   +    K    I      SLT
Sbjct: 2   KIELVFIPSPAISHLMATVEMAEQLVDKNDNLSITVII--ISFSSKNTSMI-----TSLT 61

Query: 63  NNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQA---LISFLTTSTNHLAGFVLD 122
           +NN +++ ++      P        +  ++S KP V+ A   L+         LAGFV+D
Sbjct: 62  SNNRLRYEIISGGDQQPTELKA--TDSHIQSLKPLVRDAVAKLVDSTLPDAPRLAGFVVD 121

Query: 123 SFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNLSV 182
            +C++M+DVANEF VPSY++YTS A +L   LH++ +Y  ++  +  + +L+DSDV L V
Sbjct: 122 MYCTSMIDVANEFGVPSYLFYTSNAGFLGLLLHIQFMYDAEDIYD--MSELEDSDVELVV 181

Query: 183 PSLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTDSS 242
           PSL +  P K +P +F    +  +F  QA+R R + KG+L+NT  +LE  AL+ LS  + 
Sbjct: 182 PSLTSPYPLKCLPYIFKSKEWLTFFVTQARRFR-ETKGILVNTVPDLEPQALTFLSNGN- 241

Query: 243 LQLPPLYSVGPVLHL-NKNTETMD--DGDVLKWLDDQPLSSVVFLCFGSRGAFKKDQVEE 302
             +P  Y VGP+LHL N N + +D    ++L+WLD+QP  SVVFLCFGS G F ++QV E
Sbjct: 242 --IPRAYPVGPLLHLKNVNCDYVDKKQSEILRWLDEQPPRSVVFLCFGSMGGFSEEQVRE 301

Query: 303 IARALERSRVRFIWSLRR--PGNVFQSSIDYTNFEDILPKGFLDRTQNIGRVISWAPQVE 362
            A AL+RS  RF+WSLRR  P  + +   ++TN E+ILP+GF DRT N G+VI WA QV 
Sbjct: 302 TALALDRSGHRFLWSLRRASPNILREPPGEFTNLEEILPEGFFDRTANRGKVIGWAEQVA 361

Query: 363 ILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKISYC 422
           IL  PA GGFVSH GWNSTLESLW GVPMA WP+YAEQ+FNAF++V ELGLAVEIK  + 
Sbjct: 362 ILAKPAIGGFVSHGGWNSTLESLWFGVPMAIWPLYAEQKFNAFEMVEELGLAVEIKKHWR 421

Query: 423 IELKEQANPIIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRKSVIEGGSSFISLGKFI 481
            +L    + I+ AEEIE+GI  LM+ + +++RK+V   SE+C  ++++GGSS  +L +FI
Sbjct: 422 GDLLLGRSEIVTAEEIEKGIICLMEQD-SDVRKRVNEISEKCHVALMDGGSSETALKRFI 471

BLAST of CsGy4G020640 vs. TAIR10
Match: AT4G15280.1 (UDP-glucosyl transferase 71B5)

HSP 1 Score: 381.7 bits (979), Expect = 6.7e-106
Identity = 213/482 (44.19%), Postives = 299/482 (62.03%), Query Frame = 0

Query: 3   KFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDP-KANEYIQSLSAQSL 62
           K ELVFIP+PG GHL   V++A  L+  ++RL++T+I      D   A+  I SL+  S 
Sbjct: 2   KIELVFIPLPGIGHLRPTVKLAKQLIGSENRLSITIIIIPSRFDAGDASACIASLTTLSQ 61

Query: 63  TNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTSTNHLAGFVLDSF 122
            +    + I + + P   ++ +    +V +E  K  V+ A+ + +   T  LAGFV+D F
Sbjct: 62  DDRLHYESISVAKQPP-TSDPDPVPAQVYIEKQKTKVRDAVAARIVDPTRKLAGFVVDMF 121

Query: 123 CSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNLSVPS 182
           CS+M+DVANEF VP Y+ YTS A +L   LH++Q+Y Q       + +L++S   L  PS
Sbjct: 122 CSSMIDVANEFGVPCYMVYTSNATFLGTMLHVQQMYDQKKYD---VSELENSVTELEFPS 181

Query: 183 LVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTDSSLQ 242
           L    P K +P +     +      QA+  R  +KG+L+NT  ELE HAL   + +    
Sbjct: 182 LTRPYPVKCLPHILTSKEWLPLSLAQARCFR-KMKGILVNTVAELEPHALKMFNINGD-D 241

Query: 243 LPPLYSVGPVLHL-NKNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKKDQVEEIARA 302
           LP +Y VGPVLHL N N +     ++L+WLD+QP  SVVFLCFGS G F ++Q  E A A
Sbjct: 242 LPQVYPVGPVLHLENGNDDDEKQSEILRWLDEQPSKSVVFLCFGSLGGFTEEQTRETAVA 301

Query: 303 LERSRVRFIWSLRR--PGNVFQSSIDYTNFEDILPKGFLDRTQNIGRVISWAPQVEILGH 362
           L+RS  RF+W LR   P        DYTN E++LP+GFL+RT + G+VI WAPQV +L  
Sbjct: 302 LDRSGQRFLWCLRHASPNIKTDRPRDYTNLEEVLPEGFLERTLDRGKVIGWAPQVAVLEK 361

Query: 363 PATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKISYCIELK 422
           PA GGFV+HCGWNS LESLW GVPM TWP+YAEQ+ NAF++V ELGLAVEI+     +L 
Sbjct: 362 PAIGGFVTHCGWNSILESLWFGVPMVTWPLYAEQKVNAFEMVEELGLAVEIRKYLKGDLF 421

Query: 423 EQANPIIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRKSVIEGGSSFISLGKFIDDVL 481
                 + AE+IER IR++M+ + +++R  VK  +E+C  ++++GGSS  +L KFI DV+
Sbjct: 422 AGEMETVTAEDIERAIRRVMEQD-SDVRNNVKEMAEKCHFALMDGGSSKAALEKFIQDVI 476

BLAST of CsGy4G020640 vs. TAIR10
Match: AT3G21750.1 (UDP-glucosyl transferase 71B1)

HSP 1 Score: 376.7 bits (966), Expect = 2.1e-104
Identity = 203/488 (41.60%), Postives = 305/488 (62.50%), Query Frame = 0

Query: 3   KFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYIQSLSAQSLT 62
           K ELVFIP PG GH+ +   +A  L+A D+RL+VT+I     +   A+  + +       
Sbjct: 2   KVELVFIPSPGVGHIRATTALAKLLVASDNRLSVTLIVIPSRVSDDASSSVYT------N 61

Query: 63  NNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFL-----TTSTNHLAGFV 122
           + + +++I+LP      +      L   ++S KP V+ A++S +     T S + LAG V
Sbjct: 62  SEDRLRYILLPARDQTTD------LVSYIDSQKPQVR-AVVSKVAGDVSTRSDSRLAGIV 121

Query: 123 LDSFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNL 182
           +D FC++M+D+A+EF + +Y++YTS A+YL    H++ LY +       + + KD+++  
Sbjct: 122 VDMFCTSMIDIADEFNLSAYIFYTSNASYLGLQFHVQSLYDEKELD---VSEFKDTEMKF 181

Query: 183 SVPSLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLS-T 242
            VP+L    P+K +PSV     +  +   +A+  R   KG+L+N+  ++E  ALS  S  
Sbjct: 182 DVPTLTQPFPAKCLPSVMLNKKWFPYVLGRARSFR-ATKGILVNSVADMEPQALSFFSGG 241

Query: 243 DSSLQLPPLYSVGPVLHLNKNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKKDQVEE 302
           + +  +PP+Y+VGP++ L  + +     ++L WL +QP  SVVFLCFGS G F ++Q  E
Sbjct: 242 NGNTNIPPVYAVGPIMDLESSGDEEKRKEILHWLKEQPTKSVVFLCFGSMGGFSEEQARE 301

Query: 303 IARALERSRVRFIWSLRRPGNVFQSSI----DYTNFEDILPKGFLDRTQNIGRVISWAPQ 362
           IA ALERS  RF+WSLRR   V   S     ++TN E+ILPKGFLDRT  IG++ISWAPQ
Sbjct: 302 IAVALERSGHRFLWSLRRASPVGNKSNPPPGEFTNLEEILPKGFLDRTVEIGKIISWAPQ 361

Query: 363 VEILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKIS 422
           V++L  PA G FV+HCGWNS LESLW GVPMA WP+YAEQQFNAF +V ELGLA E+K  
Sbjct: 362 VDVLNSPAIGAFVTHCGWNSILESLWFGVPMAAWPIYAEQQFNAFHMVDELGLAAEVKKE 421

Query: 423 YCIELKEQANPIIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRKSVIEGGSSFISLGK 481
           Y  +   +   I+ A+EIERGI+  M+ + +++RK+V    ++   ++++GGSS  +L K
Sbjct: 422 YRRDFLVEEPEIVTADEIERGIKCAMEQD-SKMRKRVMEMKDKLHVALVDGGSSNCALKK 471

BLAST of CsGy4G020640 vs. Swiss-Prot
Match: sp|Q66PF3|UFOG3_FRAAN (Putative UDP-glucose flavonoid 3-O-glucosyltransferase 3 OS=Fragaria ananassa OX=3747 GN=GT3 PE=2 SV=1)

HSP 1 Score: 444.9 bits (1143), Expect = 1.2e-123
Identity = 240/486 (49.38%), Postives = 321/486 (66.05%), Query Frame = 0

Query: 2   KKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANE-YIQSLSAQS 61
           K  ELV IP PG GHL S +E+A  L++RD +L +T++    P   K  + Y+QSL+  S
Sbjct: 3   KPAELVLIPSPGIGHLVSTLEIAKLLVSRDDKLFITVLIMHFPAVSKGTDAYVQSLADSS 62

Query: 62  LTNNNSIQFIVLPEL-PDIPNNGNRFFLEVVLESYKPHVKQALISFLTTSTNHLAGFVLD 121
              +  I FI LP    D      R  L   +ES +PHVK A+ +   + T  LAGFV+D
Sbjct: 63  SPISQRINFINLPHTNMDHTEGSVRNSLVGFVESQQPHVKDAVANLRDSKTTRLAGFVVD 122

Query: 122 SFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNLSV 181
            FC+TM++VAN+  VPSYV++TS AA L    HL++L  Q N       + KDSD  L +
Sbjct: 123 MFCTTMINVANQLGVPSYVFFTSGAATLGLLFHLQELRDQYNKD---CTEFKDSDAELII 182

Query: 182 PSLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTDSS 241
           PS  N +P+K +P    + + A  F    KR R + KG+L+NTF +LESHAL +LS+D+ 
Sbjct: 183 PSFFNPLPAKVLPGRMLVKDSAEPFLNVIKRFR-ETKGILVNTFTDLESHALHALSSDA- 242

Query: 242 LQLPPLYSVGPVLHLNKNTETMDD------GDVLKWLDDQPLSSVVFLCFGSRGAFKKDQ 301
            ++PP+Y VGP+L+LN N   +D        D+LKWLDDQP  SVVFLCFGS G+F + Q
Sbjct: 243 -EIPPVYPVGPLLNLNSNESRVDSDEVKKKNDILKWLDDQPPLSVVFLCFGSMGSFDESQ 302

Query: 302 VEEIARALERSRVRFIWSLRR--PGNVFQSSIDYTNFEDILPKGFLDRTQNIGRVISWAP 361
           V EIA ALE +  RF+WSLRR  P        DY +   +LP+GFLDRT  IG+VI WAP
Sbjct: 303 VREIANALEHAGHRFLWSLRRSPPTGKVAFPSDYDDHTGVLPEGFLDRTGGIGKVIGWAP 362

Query: 362 QVEILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKI 421
           QV +L HP+ GGFVSHCGWNSTLESLWHGVP+ATWP+YAEQQ NAF  V EL LAVEI +
Sbjct: 363 QVAVLAHPSVGGFVSHCGWNSTLESLWHGVPVATWPLYAEQQLNAFQPVKELELAVEIDM 422

Query: 422 SYCIELKEQANPIIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRKSVIEGGSSFISLG 478
           SY    + ++  ++ A+EIERGIR++M+ + ++IRK+VK  SE+ +K++++GGSS+ SLG
Sbjct: 423 SY----RSKSPVLVSAKEIERGIREVMELDSSDIRKRVKEMSEKGKKALMDGGSSYTSLG 478

BLAST of CsGy4G020640 vs. Swiss-Prot
Match: sp|Q2V6K0|UFOG6_FRAAN (UDP-glucose flavonoid 3-O-glucosyltransferase 6 OS=Fragaria ananassa OX=3747 GN=GT6 PE=1 SV=1)

HSP 1 Score: 431.0 bits (1107), Expect = 1.7e-119
Identity = 234/489 (47.85%), Postives = 324/489 (66.26%), Query Frame = 0

Query: 2   KKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANE-YIQSLSAQS 61
           K  EL+FIPIPG GH+ S VE+A  LL RD  L +T++  K P     ++ YI+SL+   
Sbjct: 3   KASELIFIPIPGIGHIVSTVEIAKLLLCRDDNLFITILIMKFPFTADGSDVYIKSLAVDP 62

Query: 62  LTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTT--STNHLAGFVL 121
                 I+F+ LP+          FF    ++S+K HVK A+   + T   T  +AGFV+
Sbjct: 63  SLKTQRIRFVNLPQEHFQGTGATGFF--TFIDSHKSHVKDAVTRLMETKSETTRIAGFVI 122

Query: 122 DSFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNLS 181
           D FC+ M+D+ANEF +PSYV+YTS AA L    HL+ L  ++N       + KDSD  L 
Sbjct: 123 DMFCTGMIDLANEFGLPSYVFYTSGAADLGLMFHLQALRDEENKD---CTEFKDSDAELV 182

Query: 182 VPSLVNQVP-SKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTD 241
           V S VN +P ++ +PSV F      +F   AKR R + KG+L+NTF ELE HA+ SLS+D
Sbjct: 183 VSSFVNPLPAARVLPSVVFEKEGGNFFLNFAKRYR-ETKGILVNTFLELEPHAIQSLSSD 242

Query: 242 SSLQLPPLYSVGPVLHLN------KNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKK 301
              ++ P+Y VGP+L++        + ++    D+L+WLDDQP SSVVFLCFGS G F +
Sbjct: 243 G--KILPVYPVGPILNVKSEGNQVSSEKSKQKSDILEWLDDQPPSSVVFLCFGSMGCFGE 302

Query: 302 DQVEEIARALERSRVRFIWSLRRPGNV---FQSSIDYTNFEDILPKGFLDRTQNIGRVIS 361
           DQV+EIA ALE+  +RF+WSLR+P      F S  DYT+++ +LP+GFLDRT ++G+VI 
Sbjct: 303 DQVKEIAHALEQGGIRFLWSLRQPSKEKIGFPS--DYTDYKAVLPEGFLDRTTDLGKVIG 362

Query: 362 WAPQVEILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVE 421
           WAPQ+ IL HPA GGFVSHCGWNSTLES+W+GVP+ATWP YAEQQ NAF+LV EL LAVE
Sbjct: 363 WAPQLAILAHPAVGGFVSHCGWNSTLESIWYGVPIATWPFYAEQQVNAFELVKELKLAVE 422

Query: 422 IKISYCIELKEQANPIIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRKSVIEGGSSFI 478
           I + Y    ++ +  I+  E IE+GI+++M+  ++E+RK+VK  S+  RK++ E GSS+ 
Sbjct: 423 IDMGY----RKDSGVIVSRENIEKGIKEVME-QESELRKRVKEMSQMSRKALEEDGSSYS 476

BLAST of CsGy4G020640 vs. Swiss-Prot
Match: sp|D3THI6|U7A15_MALDO (UDP-glycosyltransferase 71A15 OS=Malus domestica OX=3750 GN=UGT71A15 PE=1 SV=1)

HSP 1 Score: 425.2 bits (1092), Expect = 9.5e-118
Identity = 236/488 (48.36%), Postives = 331/488 (67.83%), Query Frame = 0

Query: 5   ELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYIQSLSAQSLTNN 64
           +LVF+P PG GH+ S VEMA  L ARD +L +T++  KLP       Y Q  +    + +
Sbjct: 6   QLVFVPAPGIGHIVSTVEMAKQLAARDDQLFITVLVMKLP-------YAQPFTNTDSSIS 65

Query: 65  NSIQFIVLPEL-PD----IPNNGNRFFLEVVLESYKPHVKQALISFL-------TTSTNH 124
           + I F+ LPE  PD    +PN G+  F  + +E++K HV+ A+I+ L       +TS   
Sbjct: 66  HRINFVNLPEAQPDKQDIVPNPGS--FFRMFVENHKSHVRDAVINVLPESDQSESTSKPR 125

Query: 125 LAGFVLDSFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKD 184
           LAGFVLD F ++++DVANEFKVPSY+++TS A+ LA   H + L  +       I +L  
Sbjct: 126 LAGFVLDMFSASLIDVANEFKVPSYLFFTSNASALALMSHFQSLRDEGGID---ITELTS 185

Query: 185 SDVNLSVPSLVNQVPSKTIP-SVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHAL 244
           S   L+VPS +N  P+  +P S+  + +     +  +K  +   KG+L+NTF ELESHAL
Sbjct: 186 STAELAVPSFINPYPAAVLPGSLLDMESTKSTLNHVSKYKQ--TKGILVNTFMELESHAL 245

Query: 245 SSLSTDSSLQLPPLYSVGPVLHLNKNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKK 304
             L  DS  ++PP+Y VGP+L+L K+++     D+L+WLDDQP  SVVFLCFGS G+F +
Sbjct: 246 HYL--DSGDKIPPVYPVGPLLNL-KSSDEDKASDILRWLDDQPPFSVVFLCFGSMGSFGE 305

Query: 305 DQVEEIARALERSRVRFIWSLRRPGNVFQSSI--DYTNFEDILPKGFLDRTQNIGRVISW 364
            QV+EIA ALE S  RF+WSLRRP    + ++  DY + + +LP+GFLDRT  +G+VI W
Sbjct: 306 AQVKEIACALEHSGHRFLWSLRRPPPQGKRAMPSDYEDLKTVLPEGFLDRTATVGKVIGW 365

Query: 365 APQVEILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEI 424
           APQ  ILGHPATGGFVSHCGWNSTLESLW+GVP+A WP+YAEQ  NAF LVVELGLAVEI
Sbjct: 366 APQAAILGHPATGGFVSHCGWNSTLESLWNGVPIAAWPLYAEQNLNAFQLVVELGLAVEI 425

Query: 425 KISYCIELKEQANPIIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRKSVIEGGSSFIS 478
           K+ Y    +  ++ ++ AE+IERGIR++M+ + +++RK+VK  SE+ +K++++GGSS+ S
Sbjct: 426 KMDY----RRDSDVVVSAEDIERGIRRVMELD-SDVRKRVKEMSEKSKKALVDGGSSYSS 471

BLAST of CsGy4G020640 vs. Swiss-Prot
Match: sp|D3UAG1|U7A16_PYRCO (UDP-glycosyltransferase 71A16 OS=Pyrus communis OX=23211 GN=UGT71A16 PE=1 SV=1)

HSP 1 Score: 424.9 bits (1091), Expect = 1.2e-117
Identity = 234/488 (47.95%), Postives = 323/488 (66.19%), Query Frame = 0

Query: 5   ELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYIQSLSAQSLTNN 64
           +LVF+P PG GH+ S VEMA  L+ARD +L +T++  KLP D        S+S       
Sbjct: 6   QLVFVPAPGIGHIVSTVEMAKQLVARDDQLFITVLVMKLPYDQPFTNTDSSIS------- 65

Query: 65  NSIQFIVLPEL-----PDIPNNGNRFFLEVVLESYKPHVKQALISFL-------TTSTNH 124
           + I F+ LPE        +PN G+  F  + +E++K HV+ A+I+ L       +TS   
Sbjct: 66  HRINFVNLPEAQLDKQDTVPNPGS--FFRMFVENHKTHVRDAVINLLPESDQSESTSKPR 125

Query: 125 LAGFVLDSFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKD 184
           LAGFVLD F ++++DVANEF+VPSYV++TS ++ LA   H + L  +       I +L  
Sbjct: 126 LAGFVLDMFSASLIDVANEFEVPSYVFFTSNSSTLALLSHFQSLRDEGGID---ITELTS 185

Query: 185 SDVNLSVPSLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALS 244
           S   L+VPS +N  P   +P  F              R +   KG+L+NTF ELESHAL 
Sbjct: 186 STAELAVPSFINPYPVAVLPGSFLDKESTKSTLNNVGRYK-QTKGILVNTFLELESHALH 245

Query: 245 SLSTDSSLQLPPLYSVGPVLHLNKNTETMDDG-DVLKWLDDQPLSSVVFLCFGSRGAFKK 304
            L  DS +++PP+Y VGP+L+L  + E  D G D+L+WLDDQP  SVVFLCFGS G+F  
Sbjct: 246 YL--DSGVKIPPVYPVGPLLNLKSSHE--DKGSDILRWLDDQPPLSVVFLCFGSMGSFGD 305

Query: 305 DQVEEIARALERSRVRFIWSLRRPGNVFQSSI--DYTNFEDILPKGFLDRTQNIGRVISW 364
            QV+EIA  LE S  RF+WSLR+P +  + ++  DY + + +LP+GFLDRT  +GRVI W
Sbjct: 306 AQVKEIACTLEHSGHRFLWSLRQPPSKGKRALPSDYADLKTVLPEGFLDRTATVGRVIGW 365

Query: 365 APQVEILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEI 424
           APQ  ILGHPA GGFVSHCGWNSTLES+W+GVP+A WPMYAEQ  NAF LVVELGLAVEI
Sbjct: 366 APQAAILGHPAIGGFVSHCGWNSTLESIWNGVPIAAWPMYAEQNMNAFQLVVELGLAVEI 425

Query: 425 KISYCIELKEQANPIIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRKSVIEGGSSFIS 478
           K+ Y    ++ ++ ++ AE+IERGIR++M+ + +++RK+VK  SE+ +K++++GGSS+ S
Sbjct: 426 KMDY----RKDSDVVVSAEDIERGIRQVMELD-SDVRKRVKEMSEKSKKALVDGGSSYSS 471

BLAST of CsGy4G020640 vs. Swiss-Prot
Match: sp|Q6VAB2|U71E1_STERE (UDP-glycosyltransferase 71E1 OS=Stevia rebaudiana OX=55670 GN=UGT71E1 PE=2 SV=1)

HSP 1 Score: 411.4 bits (1056), Expect = 1.4e-113
Identity = 233/491 (47.45%), Postives = 314/491 (63.95%), Query Frame = 0

Query: 1   MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPK----ANEYIQSL 60
           M   ELVFIP PG+GHL   VE+A  LL RD RL+VT+I   L L PK    A   + SL
Sbjct: 1   MSTSELVFIPSPGAGHLPPTVELAKLLLHRDQRLSVTIIVMNLWLGPKHNTEARPCVPSL 60

Query: 61  SAQSLTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTSTNHLAGF 120
               +  + S   ++ P            F+   +E +KP V+  +   + + +  LAGF
Sbjct: 61  RFVDIPCDESTMALISPNT----------FISAFVEHHKPRVRDIVRGIIESDSVRLAGF 120

Query: 121 VLDSFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVN 180
           VLD FC  M DVANEF VPSY Y+TS AA L    HL+  + +D+   +   +LK+SD  
Sbjct: 121 VLDMFCMPMSDVANEFGVPSYNYFTSGAATLGLMFHLQ--WKRDHEGYDA-TELKNSDTE 180

Query: 181 LSVPSLVNQVPSKTIPSVFF-INNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLS 240
           LSVPS VN VP+K +P V       +  F + A+RIR + KG+++N+ + +E HAL  LS
Sbjct: 181 LSVPSYVNPVPAKVLPEVVLDKEGGSKMFLDLAERIR-ESKGIIVNSCQAIERHALEYLS 240

Query: 241 TDSSLQLPPLYSVGPVLHLNKNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKKDQVE 300
           ++++  +PP++ VGP+L+L    +     ++++WL++QP SSVVFLCFGS G+F + QV+
Sbjct: 241 SNNN-GIPPVFPVGPILNLENKKDDAKTDEIMRWLNEQPESSVVFLCFGSMGSFNEKQVK 300

Query: 301 EIARALERSRVRFIWSLRR--PGNVFQSSIDYTNFEDILPKGFLDRTQNIGRVISWAPQV 360
           EIA A+ERS  RF+WSLRR  P    +   +Y N E++LP+GFL RT +IG+VI WAPQ+
Sbjct: 301 EIAVAIERSGHRFLWSLRRPTPKEKIEFPKEYENLEEVLPEGFLKRTSSIGKVIGWAPQM 360

Query: 361 EILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKISY 420
            +L HP+ GGFVSHCGWNSTLES+W GVPMA WP+YAEQ  NAF LVVELGLA EI++ Y
Sbjct: 361 AVLSHPSVGGFVSHCGWNSTLESMWCGVPMAAWPLYAEQTLNAFLLVVELGLAAEIRMDY 420

Query: 421 CIELKE--QANPIIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRKSVIEGGSSFISLG 480
             + K        +  EEIE GIRKLM +   EIR KVK   E+ R +V+EGGSS+ S+G
Sbjct: 421 RTDTKAGYDGGMEVTVEEIEDGIRKLMSD--GEIRNKVKDVKEKSRAAVVEGGSSYASIG 473

Query: 481 KFIDDVLSNST 483
           KFI+ V SN T
Sbjct: 481 KFIEHV-SNVT 473

BLAST of CsGy4G020640 vs. TrEMBL
Match: tr|A0A0A0L341|A0A0A0L341_CUCSA (Glycosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_4G620550 PE=3 SV=1)

HSP 1 Score: 967.6 bits (2500), Expect = 1.0e-278
Identity = 486/486 (100.00%), Postives = 486/486 (100.00%), Query Frame = 0

Query: 1   MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYIQSLSAQS 60
           MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYIQSLSAQS
Sbjct: 1   MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYIQSLSAQS 60

Query: 61  LTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTSTNHLAGFVLDS 120
           LTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTSTNHLAGFVLDS
Sbjct: 61  LTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTSTNHLAGFVLDS 120

Query: 121 FCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNLSVP 180
           FCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNLSVP
Sbjct: 121 FCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNLSVP 180

Query: 181 SLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTDSSL 240
           SLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTDSSL
Sbjct: 181 SLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTDSSL 240

Query: 241 QLPPLYSVGPVLHLNKNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKKDQVEEIARA 300
           QLPPLYSVGPVLHLNKNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKKDQVEEIARA
Sbjct: 241 QLPPLYSVGPVLHLNKNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKKDQVEEIARA 300

Query: 301 LERSRVRFIWSLRRPGNVFQSSIDYTNFEDILPKGFLDRTQNIGRVISWAPQVEILGHPA 360
           LERSRVRFIWSLRRPGNVFQSSIDYTNFEDILPKGFLDRTQNIGRVISWAPQVEILGHPA
Sbjct: 301 LERSRVRFIWSLRRPGNVFQSSIDYTNFEDILPKGFLDRTQNIGRVISWAPQVEILGHPA 360

Query: 361 TGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKISYCIELKEQ 420
           TGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKISYCIELKEQ
Sbjct: 361 TGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKISYCIELKEQ 420

Query: 421 ANPIIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRKSVIEGGSSFISLGKFIDDVLSN 480
           ANPIIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRKSVIEGGSSFISLGKFIDDVLSN
Sbjct: 421 ANPIIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRKSVIEGGSSFISLGKFIDDVLSN 480

Query: 481 STTGGN 487
           STTGGN
Sbjct: 481 STTGGN 486

BLAST of CsGy4G020640 vs. TrEMBL
Match: tr|A0A1S3CLY9|A0A1S3CLY9_CUCME (Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103502475 PE=3 SV=1)

HSP 1 Score: 836.3 bits (2159), Expect = 3.6e-239
Identity = 427/488 (87.50%), Postives = 447/488 (91.60%), Query Frame = 0

Query: 1   MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYIQSLSAQS 60
           MKKFELVFIPIPGSGHLASM EMAN+LLARDHRLAVTMIA KLPLD K NEYIQSL AQS
Sbjct: 1   MKKFELVFIPIPGSGHLASMFEMANSLLARDHRLAVTMIAIKLPLDAKVNEYIQSLYAQS 60

Query: 61  LTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTSTNHLAGFVLDS 120
           LT NNSI+FI+LPELP  PN+ N+ F EVVLESYKPHVKQALISFLTTSTNHL GFVLDS
Sbjct: 61  LT-NNSIKFIILPELPPPPNDENKIFFEVVLESYKPHVKQALISFLTTSTNHLVGFVLDS 120

Query: 121 FCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNLSVP 180
           FC TMVDVANEFKVPSYVYYTS AAYLAFSLHLEQLYTQDNSSNEVIQQ KDS+VN SV 
Sbjct: 121 FCLTMVDVANEFKVPSYVYYTSSAAYLAFSLHLEQLYTQDNSSNEVIQQSKDSNVNFSVS 180

Query: 181 SLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTDSSL 240
           SLVNQVPSK IPSVFFINNFAVWFHEQAKRIRFDVKGVLINTF+ELESH +SSLSTDSSL
Sbjct: 181 SLVNQVPSKVIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFDELESHVISSLSTDSSL 240

Query: 241 QLPPLYSVGPVLHLNKNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKKDQVEEIARA 300
           QLPPLY VGP+LHLNKNTETMDD  VLKWLDDQPL SVVFLCFGSRGAF+KDQVEEIARA
Sbjct: 241 QLPPLYPVGPILHLNKNTETMDDRVVLKWLDDQPLQSVVFLCFGSRGAFQKDQVEEIARA 300

Query: 301 LERSRVRFIWSLRRP-GNVFQSSIDYTNFEDILPKGFLDRTQNIGRVISWAPQVEILGHP 360
           LERSRVRFIWSLRRP G+VFQSSIDYTNFEDILP+GFLDRT+NIGRVI WAPQVEILGHP
Sbjct: 301 LERSRVRFIWSLRRPSGDVFQSSIDYTNFEDILPEGFLDRTKNIGRVIKWAPQVEILGHP 360

Query: 361 ATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKISYCIELKE 420
             GGFVSHCGWNSTLESLW+G+PMATWPMYAEQQFNAF+LVVELGLAVEI I Y  +LKE
Sbjct: 361 TIGGFVSHCGWNSTLESLWYGIPMATWPMYAEQQFNAFELVVELGLAVEITIDYQNDLKE 420

Query: 421 QANP-IIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRKSVIEGGSSFISLGKFIDDVL 480
              P I+ AEEIE+GIRKLMD+N NEIRKKVKTKSEECRKSVIEGGSSFISLGKFIDDVL
Sbjct: 421 LDKPRILSAEEIEKGIRKLMDDNNNEIRKKVKTKSEECRKSVIEGGSSFISLGKFIDDVL 480

Query: 481 SNSTTGGN 487
            NS  G N
Sbjct: 481 INSPRGAN 487

BLAST of CsGy4G020640 vs. TrEMBL
Match: tr|A0A1S4E4S9|A0A1S4E4S9_CUCME (Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103502509 PE=3 SV=1)

HSP 1 Score: 598.6 bits (1542), Expect = 1.3e-167
Identity = 309/383 (80.68%), Postives = 333/383 (86.95%), Query Frame = 0

Query: 102 LISFLTTSTNHLAGFVLDSFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDN 161
           LISFLTTSTNHL GFVLD F STMV+VANE +VPSYVY TS A +L+FSL+LEQLY Q+N
Sbjct: 47  LISFLTTSTNHLVGFVLDMFYSTMVEVANELEVPSYVYCTSGAGFLSFSLYLEQLYAQNN 106

Query: 162 SSNEVIQQLKDSDVNLSVPSLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLIN 221
           S NEVIQQLKD D +LSV SLVNQ PSK IPS+ FINN AVWFHEQ KR R DVKG+L+N
Sbjct: 107 SRNEVIQQLKDLD-DLSVSSLVNQGPSKVIPSILFINNNAVWFHEQVKRTRSDVKGILMN 166

Query: 222 TFEELESHALSSLSTDSSLQLPPLYSVGPVLHLNKNTETMDDGDVLKWLDDQPLSSVVFL 281
           T EELESH + SLSTDSSLQLPPLY VGP  HLNKNTETMD  DVLKWLDDQPLSSVVFL
Sbjct: 167 TLEELESHVICSLSTDSSLQLPPLYPVGP-XHLNKNTETMDHVDVLKWLDDQPLSSVVFL 226

Query: 282 CFGSRGAFKKDQVEEIARALERSRVRFIWSLRR--PGNVFQSSIDYTNFEDILPKGFLDR 341
           CFGSRGAF+KDQVEEIA+ LERSRV F+WSLRR  PG+V QSSIDYTNFEDILPK FLDR
Sbjct: 227 CFGSRGAFEKDQVEEIAQVLERSRVPFVWSLRRSSPGSVLQSSIDYTNFEDILPKEFLDR 286

Query: 342 TQNIGRVISWAPQVEILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDL 401
           T+N+GRVI+WAPQVEILGHPAT GFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAF+L
Sbjct: 287 TENVGRVINWAPQVEILGHPATCGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFEL 346

Query: 402 VVELGLAVEIKISYCIELKEQANP-IIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRK 461
           VVELGLA+ I I Y  E KE   P I+  EEIE+GIRKLMD N  EIRKKVKTKSEEC+K
Sbjct: 347 VVELGLAMTITIDYQNEYKELDKPRILSTEEIEKGIRKLMDENNIEIRKKVKTKSEECKK 406

Query: 462 SVIEGGSSFISLGKFIDDVLSNS 482
           SV+EGGSSFISLGKFIDDVL NS
Sbjct: 407 SVMEGGSSFISLGKFIDDVLMNS 427

BLAST of CsGy4G020640 vs. TrEMBL
Match: tr|K7NBW4|K7NBW4_SIRGR (Glycosyltransferase OS=Siraitia grosvenorii OX=190515 GN=UDPG7 PE=2 SV=1)

HSP 1 Score: 567.4 bits (1461), Expect = 3.1e-158
Identity = 293/497 (58.95%), Postives = 374/497 (75.25%), Query Frame = 0

Query: 1   MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYIQSLSAQS 60
           MKKFELVFIP+P  GHLA+MVEMAN L+ RD RL VT++  KLPL  K  EYIQSLSA  
Sbjct: 1   MKKFELVFIPLPVMGHLAAMVEMANILVTRDQRLTVTILVIKLPLYGKTAEYIQSLSASF 60

Query: 61  LTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALI----SFLTTSTNHLAGF 120
              + S++FI+LPE+     +   F L+  LESYKP +++A+I    S +   +  LAGF
Sbjct: 61  A--SESMRFIILPEVLLPEESEKEFMLKAFLESYKPIIREAIIDLTDSQMGPDSPRLAGF 120

Query: 121 VLDSFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVN 180
           VLD FC+TM+DVANEF VPSYV+ TS A +LA S HL++LY  +N+S EV++QL++S+  
Sbjct: 121 VLDMFCTTMIDVANEFGVPSYVFCTSNAGFLALSFHLQELY-DENNSKEVVKQLQNSNAE 180

Query: 181 LSVPSLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLST 240
           +++PS VN +P K IP +F  ++ A WFH+Q +R R  VKG+LINTF +LESH ++S+S 
Sbjct: 181 IALPSFVNPIPGKMIPDIFSNDDTASWFHDQVERYRSGVKGILINTFAKLESHVMNSMSR 240

Query: 241 DSSLQLPPLYSVGPVLHLNKNTETMDDG------DVLKWLDDQPLSSVVFLCFGSRGAFK 300
            SS + PPLYS+GP+LHL KN  T+  G      D+LKWLD+QP  SVVFLCFGS G+F 
Sbjct: 241 SSSSRAPPLYSIGPILHL-KNNNTVGPGGTLHCTDILKWLDNQPPVSVVFLCFGSMGSFD 300

Query: 301 KDQVEEIARALERSRVRFIWSLRR--PGNVFQSSIDYTNFEDILPKGFLDRTQNIGRVIS 360
           +DQV+EIA ALERS VRF+WSLR+  P + F++  +YT+ + +LP+GFL+RT  IGRVI 
Sbjct: 301 EDQVKEIAHALERSGVRFLWSLRQPPPKDKFEAPSEYTDIKYVLPEGFLERTAGIGRVIG 360

Query: 361 WAPQVEILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVE 420
           WAPQVEIL HPATGGFVSHCGWNSTLES+WHGVPMATWP+YAEQQF AF++VVELGLAV+
Sbjct: 361 WAPQVEILAHPATGGFVSHCGWNSTLESMWHGVPMATWPLYAEQQFTAFEMVVELGLAVD 420

Query: 421 IKISYCIELKEQANPIIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRKSVIEGGSSFI 480
           I + Y      + + ++ AEEI+ GIRKLM+    E+RKKVK KSEE RKS++EGGSSFI
Sbjct: 421 ITLDYQKHPHGERSRVVSAEEIQSGIRKLMEEG-GEMRKKVKAKSEESRKSLMEGGSSFI 480

Query: 481 SLGKFIDDVLSNSTTGG 486
           SLG+FIDDVL N   GG
Sbjct: 481 SLGRFIDDVLGNGPEGG 492

BLAST of CsGy4G020640 vs. TrEMBL
Match: tr|A0A1S3CLZ0|A0A1S3CLZ0_CUCME (Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103502477 PE=3 SV=1)

HSP 1 Score: 554.3 bits (1427), Expect = 2.7e-154
Identity = 287/494 (58.10%), Postives = 364/494 (73.68%), Query Frame = 0

Query: 1   MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYIQSLSAQS 60
           M KFELVFIP PG GHLAS VE+AN L +RD RL+VT++A KLP D K  E IQSLSA  
Sbjct: 1   MNKFELVFIPGPGIGHLASTVELANVLASRDDRLSVTVLAIKLPNDIKTTERIQSLSAS- 60

Query: 61  LTNNNSIQFIVLPELPDIPNNGNR---FFLEVVLESYKPHVKQALISFLTTSTNHLAGFV 120
                SI+FIVLPELP  PN  +      L+  LES+KPHV++ +++ LT  +N L GFV
Sbjct: 61  -FEGKSIRFIVLPELP-FPNQSSTPPPLMLQAFLESHKPHVRE-IVTNLTYDSNRLVGFV 120

Query: 121 LDSFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNL 180
           +D FC++M++VANEFKVP Y++YTS A +LAFS HL++LY Q+NS+ E   QL++S+V L
Sbjct: 121 IDMFCTSMINVANEFKVPCYLFYTSNAGFLAFSFHLQELYNQNNSTGE---QLQNSNVEL 180

Query: 181 SVPSLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTD 240
           ++PS +N +PSK IP   F  + AVWFH+  KR R  VKG+LINTF E+E   +  +S  
Sbjct: 181 ALPSFINPIPSKAIPPFLFDKDMAVWFHDNTKRFRSGVKGILINTFVEMEPQMIKWMSNG 240

Query: 241 SSLQLPPLYSVGPVLHL-----NKNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKKD 300
           SS ++P +Y+VGP+L L      +    ++  D+LKWLDDQP +SVVFLCFGS+G+F +D
Sbjct: 241 SS-KIPKVYTVGPILQLKSIGVTQCNNALNGADILKWLDDQPPASVVFLCFGSKGSFDED 300

Query: 301 QVEEIARALERSRVRFIWSLRR--PGNVFQSSIDYTNFEDILPKGFLDRTQNIGRVISWA 360
           QV EIARALERS VRFIWSLR+  P   F+   +Y +  D+LP+GFL+RT +IGRVI WA
Sbjct: 301 QVLEIARALERSEVRFIWSLRQPPPKGKFEEPSNYADINDVLPEGFLNRTADIGRVIGWA 360

Query: 361 PQVEILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIK 420
           PQ+EIL HPATGGF+SHCGWNSTLES+WHGVPMATWP+YAEQQFNAF++VVELGLAVE+ 
Sbjct: 361 PQIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELT 420

Query: 421 ISYCIELKEQANPIIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRKSVIEGGSSFISL 480
           + Y  +     + ++ AEEIE GIRKLM +  NEIRKKVK K EE RKS++ GGSSF SL
Sbjct: 421 LDYVKDFHIGRSRVVSAEEIESGIRKLMGDYGNEIRKKVKVKGEESRKSMMVGGSSFNSL 480

Query: 481 GKFIDDVLSNSTTG 485
             FIDD L+N   G
Sbjct: 481 DHFIDDALANLEEG 486

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004146066.11.6e-278100.00PREDICTED: anthocyanidin 3-O-glucosyltransferase 2 [Cucumis sativus] >KGN54992.1... [more]
XP_008464637.15.4e-23987.50PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis melo][more]
XP_023535691.11.7e-17665.64anthocyanidin 3-O-glucosyltransferase 2-like isoform X2 [Cucurbita pepo subsp. p... [more]
XP_023535689.11.7e-17665.64anthocyanidin 3-O-glucosyltransferase 2-like isoform X1 [Cucurbita pepo subsp. p... [more]
XP_022976770.19.5e-17565.09anthocyanidin 3-O-glucosyltransferase 2-like [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
AT3G21760.11.9e-11347.15UDP-Glycosyltransferase superfamily protein[more]
AT3G21790.16.4e-10946.03UDP-Glycosyltransferase superfamily protein[more]
AT3G21780.19.3e-10845.88UDP-glucosyl transferase 71B6[more]
AT4G15280.16.7e-10644.19UDP-glucosyl transferase 71B5[more]
AT3G21750.12.1e-10441.60UDP-glucosyl transferase 71B1[more]
Match NameE-valueIdentityDescription
sp|Q66PF3|UFOG3_FRAAN1.2e-12349.38Putative UDP-glucose flavonoid 3-O-glucosyltransferase 3 OS=Fragaria ananassa OX... [more]
sp|Q2V6K0|UFOG6_FRAAN1.7e-11947.85UDP-glucose flavonoid 3-O-glucosyltransferase 6 OS=Fragaria ananassa OX=3747 GN=... [more]
sp|D3THI6|U7A15_MALDO9.5e-11848.36UDP-glycosyltransferase 71A15 OS=Malus domestica OX=3750 GN=UGT71A15 PE=1 SV=1[more]
sp|D3UAG1|U7A16_PYRCO1.2e-11747.95UDP-glycosyltransferase 71A16 OS=Pyrus communis OX=23211 GN=UGT71A16 PE=1 SV=1[more]
sp|Q6VAB2|U71E1_STERE1.4e-11347.45UDP-glycosyltransferase 71E1 OS=Stevia rebaudiana OX=55670 GN=UGT71E1 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
tr|A0A0A0L341|A0A0A0L341_CUCSA1.0e-278100.00Glycosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_4G620550 PE=3 SV=1[more]
tr|A0A1S3CLY9|A0A1S3CLY9_CUCME3.6e-23987.50Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103502475 PE=3 SV=1[more]
tr|A0A1S4E4S9|A0A1S4E4S9_CUCME1.3e-16780.68Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103502509 PE=3 SV=1[more]
tr|K7NBW4|K7NBW4_SIRGR3.1e-15858.95Glycosyltransferase OS=Siraitia grosvenorii OX=190515 GN=UDPG7 PE=2 SV=1[more]
tr|A0A1S3CLZ0|A0A1S3CLZ0_CUCME2.7e-15458.10Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103502477 PE=3 SV=1[more]
The following terms have been associated with this gene:
Vocabulary: Biological Process
TermDefinition
GO:0008152metabolic process
Vocabulary: Molecular Function
TermDefinition
GO:0016758transferase activity, transferring hexosyl groups
Vocabulary: INTERPRO
TermDefinition
IPR035595UDP_glycos_trans_CS
IPR002213UDP_glucos_trans
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016758 transferase activity, transferring hexosyl groups

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsGy4G020640.1CsGy4G020640.1mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 427..447
NoneNo IPR availableGENE3DG3DSA:3.40.50.2000coord: 7..258
e-value: 1.1E-135
score: 455.0
coord: 459..467
e-value: 1.1E-135
score: 455.0
NoneNo IPR availableGENE3DG3DSA:3.40.50.2000coord: 259..458
e-value: 1.1E-135
score: 455.0
NoneNo IPR availablePANTHERPTHR11926:SF753UDP-GLYCOSYLTRANSFERASE 71B1-RELATEDcoord: 2..478
NoneNo IPR availablePANTHERPTHR11926GLUCOSYL/GLUCURONOSYL TRANSFERASEScoord: 2..478
NoneNo IPR availableSUPERFAMILYSSF53756UDP-Glycosyltransferase/glycogen phosphorylasecoord: 2..478
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePFAMPF00201UDPGTcoord: 10..409
e-value: 6.9E-25
score: 87.7
IPR035595UDP-glycosyltransferase family, conserved sitePROSITEPS00375UDPGTcoord: 349..392