CsGy1G017140.1 (mRNA) Cucumber (Gy14) v2

NameCsGy1G017140.1
TypemRNA
OrganismCucumis sativus (Cucumber (Gy14) v2)
DescriptionExostosin family protein
LocationChr1 : 14904971 .. 14907856 (-)
Sequence length1155
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGATCGAAGAGGGTTTGGCGGAGGCTCGAGCAGCGATCCGACTAGCGATCGTAACCCGAAACTACACGTCGGAAAAGGAGGAGAGTTTCATACCGAGAGGAAGGGTTTACAGAAACGCATACGCTTTCCATCAGTTAAGATTCCCACTGTCCTATTTAATTTCCTTTCCATTTTTGGCTCTCTACAAATTTCCATATGCTTTGTTTCTTTCTTAAGAAAATAATTCATTTATATTTATCAGGAAATAAATAATGATTATATATAACACATTTTAGGGAAAAATGAAAACGGATCACATTTTTAGTATTCCATTAAACGTTTCTCACCCCCAAATAAAAAACTCACGAGTTTTCCATCTATAATTTTTTATAAAATCAAAAAATTCATATTAATTAATTTGGTCAGAAATTGACACATTTTCTAAAAAAGCTTTAGCTTTTTTACTAAGTTACATATATATGGATATATACATACTATTTTGATTATGTAATTTGAGATTTCTTTTAAAAATGTTATTCTATTTATTCTTAATTACTAAAATTGAACAAATTCAATATTCTTGTAAAAGTTTTGATATTTTGATATATATTTTTTTAAAGCGAAATGAATTAAATGTTTGTTAAAGAGAAGATTAAATGTGTTGAATACTTTCCTAAAACAAAGTAGATGAAATAGACATAGTACTCAAAGCAATAAAACTTAAAAATTGTTCATTATATAGTCTACTTAATTAGCCTAATAATTACTCCACCAAATAATTGATGCATTAAAGTATAGTAAATGAAATATTTACAAATTCATAAAAATTACAAAATTTTCAATAAATATGTCATTTAAACATAATTAGAAATTAATAATTGGAGAAGTCATGGCTAGAGTATGAATAACAAAAAATAAAAAATTAAAATAATACTATGTGTAATAATTAGGAATAAAGGGGGCAGGGGCAGGGGCAGGGGGAGGGGGAGGCTGCCCAGCCGCCTTGGCACGGACTAAATCTAGGCTGCCCAAAATAGTCCAACGATTGTCTCTGTTGACATTGTTCCTTCGTTGTTTTTAGCTAAACAGTTGTCCATTCATCATTATTATTATTTTTATTACTCTACACTATTACCATTTTCATTCCATCAGTACTTAAATTCTGTATTTTGATTTTAAACCTTAGACTTAACTTTGCTAACAACTTCCTTTTAGCCTCTATCTTTATTCCCCTTTATATTTTGTAATTTGTTTTTTTTTTTAAAAAAAATTGATTTGTAATTACTAAAAGTTGGAAACATAGAATTGCATGATATGACGTTTTCCAGTTGGTCTGTTTTAATTATTGGACTATGAAAACGTTCATCTTGTTTTATGTAAATCTTCTTTTACTTTTCAAATAGTAATTATGTTGATCCATATTCCTCAATTCACCGTATAAACTCTTTAATATAATAATTTTATGATCAGAGATGTTCAGCAACTTTTAAGGGAGTGCAGGTTAACATTATTCTTTGGATAAAAATGAAAATAAATAGACATTGTGATACAACCACGATCAAATCAAACAATTAGAAGTATATAAACTAAAGCAATATCTACGTGTAAGATTAACAAACATTAGTGATCTTGAAATATACAATTCAATTATGATATTGTTTGTTACACTGAGGTATTTTCTAGACCATAGCATCGGATTCACACAAACAATAGTGATTAGTGTGTGAACCAATAGACAAATGTAGATGGTAATAATAATAATATGTATTGTTCATAAACTGTAGGAGTCATATTGAGATGAAGAAGAGGTTAAAAATATGGACATACAAAGAAGGAGAGCAGCCATTGGTGCACGATGGGCCGATGAAACACATATACTCAATTGAGGGCCATTTTATTGACGAAATGGACAGTGGAAAGAGCCCATTTTCGGCCCATGAACCAGAAGAGGCCCAAGTATTTTTCTTGCCTATAAGTATCGTTTATATTGTGGATTACATCTACAAGCCCATTACCACATACGCACGTGACCGTCTCGTTCGAATTTTCACGGATTATGTGAGGGTGGTAGCTAATAAGTACCCTTACTGGAACCGTACACGTGGAGCAGATCATTTCATGGTCTCCTGCCATGATTGGGTAACTTCCTAGTAACTCAAACCAAAACTGCAAAATTATAGTTAACAGAATGGACTATGAATTGGGAGTTTAACAAAATTAACATTTTGGAAAGCTTTCAGGCGCCGGAAGTAACAAAAGAAGATCCTAACCTCTTCAAATATTTCATCAGAGTTCTTTGCAATGCCAACACATCCGAAGGCTTCAATCCAATGCGAGATGCATCCTTGCCCGAGATTAACTTACCTCCAACTTTCCACCTCAATCTTCCTCGATTAGGCCAACCGCCACAGAACCGCTCAATTCTAGCTTTCTTCGCCGGCGGAGCACACGGATTCATCCGCCACATCCTAATGCAGCATTGGAAAGACAAAGACCATGAAATCCAAGTCCACGAGTACCTTCCTCCATCCCAAAACTACACCGAATTGATCGATCGAAGCAAATTCTGCCTCTGCCCTAGCGGATACGAAGTTGCAAGCCCTAGGTTAGTGGAAGCGATCCACGGCGGTTGTGTACCAGTGGTAATCTCTGATTATTACTCCTTGCCGTTCGATGATGTGCTGGATTGGAGCAAATTCTCGATGCGGATTCCGTCTGAGAGGATTCCGGAGATCAAGACGATCTTGAGAGGAGTTTCGATGAAGAAGTACTTGAAACTACAGCGAGGAGTGATGAAAGTGCAGAGACATTTTGAGATTCATCGGCCGGCAAAGGCGTTTGATATGTTCCATATGGTACTTCACTCTGTTTGGCTCAGACGACTCAATGTAAAGCTTACACATTGA

mRNA sequence

ATGATCGAAGAGGGTTTGGCGGAGGCTCGAGCAGCGATCCGACTAGCGATCGTAACCCGAAACTACACGTCGGAAAAGGAGGAGAGTTTCATACCGAGAGGAAGGGTTTACAGAAACGCATACGCTTTCCATCAGAGTCATATTGAGATGAAGAAGAGGTTAAAAATATGGACATACAAAGAAGGAGAGCAGCCATTGGTGCACGATGGGCCGATGAAACACATATACTCAATTGAGGGCCATTTTATTGACGAAATGGACAGTGGAAAGAGCCCATTTTCGGCCCATGAACCAGAAGAGGCCCAAGTATTTTTCTTGCCTATAAGTATCGTTTATATTGTGGATTACATCTACAAGCCCATTACCACATACGCACGTGACCGTCTCGTTCGAATTTTCACGGATTATGTGAGGGTGGTAGCTAATAAGTACCCTTACTGGAACCGTACACGTGGAGCAGATCATTTCATGGTCTCCTGCCATGATTGGGCGCCGGAAGTAACAAAAGAAGATCCTAACCTCTTCAAATATTTCATCAGAGTTCTTTGCAATGCCAACACATCCGAAGGCTTCAATCCAATGCGAGATGCATCCTTGCCCGAGATTAACTTACCTCCAACTTTCCACCTCAATCTTCCTCGATTAGGCCAACCGCCACAGAACCGCTCAATTCTAGCTTTCTTCGCCGGCGGAGCACACGGATTCATCCGCCACATCCTAATGCAGCATTGGAAAGACAAAGACCATGAAATCCAAGTCCACGAGTACCTTCCTCCATCCCAAAACTACACCGAATTGATCGATCGAAGCAAATTCTGCCTCTGCCCTAGCGGATACGAAGTTGCAAGCCCTAGGTTAGTGGAAGCGATCCACGGCGGTTGTGTACCAGTGGTAATCTCTGATTATTACTCCTTGCCGTTCGATGATGTGCTGGATTGGAGCAAATTCTCGATGCGGATTCCGTCTGAGAGGATTCCGGAGATCAAGACGATCTTGAGAGGAGTTTCGATGAAGAAGTACTTGAAACTACAGCGAGGAGTGATGAAAGTGCAGAGACATTTTGAGATTCATCGGCCGGCAAAGGCGTTTGATATGTTCCATATGGTACTTCACTCTGTTTGGCTCAGACGACTCAATGTAAAGCTTACACATTGA

Coding sequence (CDS)

ATGATCGAAGAGGGTTTGGCGGAGGCTCGAGCAGCGATCCGACTAGCGATCGTAACCCGAAACTACACGTCGGAAAAGGAGGAGAGTTTCATACCGAGAGGAAGGGTTTACAGAAACGCATACGCTTTCCATCAGAGTCATATTGAGATGAAGAAGAGGTTAAAAATATGGACATACAAAGAAGGAGAGCAGCCATTGGTGCACGATGGGCCGATGAAACACATATACTCAATTGAGGGCCATTTTATTGACGAAATGGACAGTGGAAAGAGCCCATTTTCGGCCCATGAACCAGAAGAGGCCCAAGTATTTTTCTTGCCTATAAGTATCGTTTATATTGTGGATTACATCTACAAGCCCATTACCACATACGCACGTGACCGTCTCGTTCGAATTTTCACGGATTATGTGAGGGTGGTAGCTAATAAGTACCCTTACTGGAACCGTACACGTGGAGCAGATCATTTCATGGTCTCCTGCCATGATTGGGCGCCGGAAGTAACAAAAGAAGATCCTAACCTCTTCAAATATTTCATCAGAGTTCTTTGCAATGCCAACACATCCGAAGGCTTCAATCCAATGCGAGATGCATCCTTGCCCGAGATTAACTTACCTCCAACTTTCCACCTCAATCTTCCTCGATTAGGCCAACCGCCACAGAACCGCTCAATTCTAGCTTTCTTCGCCGGCGGAGCACACGGATTCATCCGCCACATCCTAATGCAGCATTGGAAAGACAAAGACCATGAAATCCAAGTCCACGAGTACCTTCCTCCATCCCAAAACTACACCGAATTGATCGATCGAAGCAAATTCTGCCTCTGCCCTAGCGGATACGAAGTTGCAAGCCCTAGGTTAGTGGAAGCGATCCACGGCGGTTGTGTACCAGTGGTAATCTCTGATTATTACTCCTTGCCGTTCGATGATGTGCTGGATTGGAGCAAATTCTCGATGCGGATTCCGTCTGAGAGGATTCCGGAGATCAAGACGATCTTGAGAGGAGTTTCGATGAAGAAGTACTTGAAACTACAGCGAGGAGTGATGAAAGTGCAGAGACATTTTGAGATTCATCGGCCGGCAAAGGCGTTTGATATGTTCCATATGGTACTTCACTCTGTTTGGCTCAGACGACTCAATGTAAAGCTTACACATTGA

Protein sequence

MIEEGLAEARAAIRLAIVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKEGEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKPITTYARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIRVLCNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHILMQHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVKLTH
BLAST of CsGy1G017140.1 vs. NCBI nr
Match: XP_011655344.1 (PREDICTED: probable glycosyltransferase At5g20260 [Cucumis sativus] >KGN65113.1 hypothetical protein Csa_1G226410 [Cucumis sativus])

HSP 1 Score: 802.0 bits (2070), Expect = 9.0e-229
Identity = 384/384 (100.00%), Postives = 384/384 (100.00%), Query Frame = 0

Query: 1   MIEEGLAEARAAIRLAIVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYK 60
           MIEEGLAEARAAIRLAIVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYK
Sbjct: 95  MIEEGLAEARAAIRLAIVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYK 154

Query: 61  EGEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKP 120
           EGEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKP
Sbjct: 155 EGEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKP 214

Query: 121 ITTYARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIR 180
           ITTYARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIR
Sbjct: 215 ITTYARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIR 274

Query: 181 VLCNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHIL 240
           VLCNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHIL
Sbjct: 275 VLCNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHIL 334

Query: 241 MQHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVIS 300
           MQHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVIS
Sbjct: 335 MQHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVIS 394

Query: 301 DYYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPA 360
           DYYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPA
Sbjct: 395 DYYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPA 454

Query: 361 KAFDMFHMVLHSVWLRRLNVKLTH 385
           KAFDMFHMVLHSVWLRRLNVKLTH
Sbjct: 455 KAFDMFHMVLHSVWLRRLNVKLTH 478

BLAST of CsGy1G017140.1 vs. NCBI nr
Match: XP_008443936.1 (PREDICTED: probable glycosyltransferase At5g20260 [Cucumis melo])

HSP 1 Score: 785.0 bits (2026), Expect = 1.1e-223
Identity = 374/384 (97.40%), Postives = 379/384 (98.70%), Query Frame = 0

Query: 1   MIEEGLAEARAAIRLAIVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYK 60
           MIEEGLAEARAAIR AIVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYK
Sbjct: 97  MIEEGLAEARAAIRQAIVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYK 156

Query: 61  EGEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKP 120
           EGEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAH+PEEA VFFLPISIVYIVDYIYKP
Sbjct: 157 EGEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAHDPEEAHVFFLPISIVYIVDYIYKP 216

Query: 121 ITTYARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIR 180
           ITTYARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIR
Sbjct: 217 ITTYARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIR 276

Query: 181 VLCNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHIL 240
           VLCNANTSEGFNPMRDASLPEINLPPTFHLNLPR GQPPQNRSILAFFAGGAHGFIRH+L
Sbjct: 277 VLCNANTSEGFNPMRDASLPEINLPPTFHLNLPRSGQPPQNRSILAFFAGGAHGFIRHVL 336

Query: 241 MQHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVIS 300
           MQHWKDKD EIQVHEYLPP++NYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPV+IS
Sbjct: 337 MQHWKDKDDEIQVHEYLPPAKNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVIIS 396

Query: 301 DYYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPA 360
           DYYSLPFDDVLDWSKFSMRIPSERIPEIK ILRGVSMKKYLKLQRGVMKVQRHFEIHRPA
Sbjct: 397 DYYSLPFDDVLDWSKFSMRIPSERIPEIKKILRGVSMKKYLKLQRGVMKVQRHFEIHRPA 456

Query: 361 KAFDMFHMVLHSVWLRRLNVKLTH 385
           KAFDMFHMVLHSVWLRRLNVKLTH
Sbjct: 457 KAFDMFHMVLHSVWLRRLNVKLTH 480

BLAST of CsGy1G017140.1 vs. NCBI nr
Match: XP_022135540.1 (probable glycosyltransferase At5g20260 [Momordica charantia])

HSP 1 Score: 709.9 bits (1831), Expect = 4.7e-201
Identity = 336/383 (87.73%), Postives = 354/383 (92.43%), Query Frame = 0

Query: 2   IEEGLAEARAAIRLAIVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKE 61
           IEE LA ARAAIR AIV RNYTSE+ ESFIPRGRVYRNAYAFHQSHIEM KR K+W YKE
Sbjct: 87  IEEDLARARAAIREAIVKRNYTSERAESFIPRGRVYRNAYAFHQSHIEMVKRFKVWAYKE 146

Query: 62  GEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKPI 121
           GEQPLVHDGPMKHIYSIEGHFIDEMD GKSPFSA  P+EA VFFLPISIV+IVDYIYKPI
Sbjct: 147 GEQPLVHDGPMKHIYSIEGHFIDEMDGGKSPFSARHPDEAHVFFLPISIVFIVDYIYKPI 206

Query: 122 TTYARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIRV 181
           TTYARDRLVRIFTDYV VVA+KYPYWNR+RGADHFM SCHDWAPE TKEDPNLFKYFIRV
Sbjct: 207 TTYARDRLVRIFTDYVNVVADKYPYWNRSRGADHFMASCHDWAPETTKEDPNLFKYFIRV 266

Query: 182 LCNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHILM 241
           LCNANTSEGFNPMRDASLPEINLPP+F LNLPRLGQP + RSILAFFAGGAHGFIR IL+
Sbjct: 267 LCNANTSEGFNPMRDASLPEINLPPSFQLNLPRLGQPIEKRSILAFFAGGAHGFIRQILI 326

Query: 242 QHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISD 301
           +HWKDKD EIQVHEYLP  QNY ELI RS+FCLCPSGYEVASPRLVEAIHGGCVPV+ISD
Sbjct: 327 EHWKDKDDEIQVHEYLPRGQNYDELIGRSRFCLCPSGYEVASPRLVEAIHGGCVPVIISD 386

Query: 302 YYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAK 361
           YYSLPFDDVLDWSKFS+RIPS RIPEIKTIL+GVS  KY KLQRGVMKVQRHFE+HRPAK
Sbjct: 387 YYSLPFDDVLDWSKFSLRIPSGRIPEIKTILKGVSPVKYSKLQRGVMKVQRHFEVHRPAK 446

Query: 362 AFDMFHMVLHSVWLRRLNVKLTH 385
            FD+FHMVLHSVWLRRLN+KL+H
Sbjct: 447 PFDVFHMVLHSVWLRRLNIKLSH 469

BLAST of CsGy1G017140.1 vs. NCBI nr
Match: XP_022987712.1 (probable glycosyltransferase At5g20260 isoform X1 [Cucurbita maxima])

HSP 1 Score: 679.9 bits (1753), Expect = 5.2e-192
Identity = 318/383 (83.03%), Postives = 347/383 (90.60%), Query Frame = 0

Query: 2   IEEGLAEARAAIRLAIVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKE 61
           IEE LA ARAAIR AIV RNYTSEK ESFIPRGRVYRNAYAFHQSHIEM KR K+WTYKE
Sbjct: 79  IEEDLARARAAIREAIVRRNYTSEKVESFIPRGRVYRNAYAFHQSHIEMVKRFKVWTYKE 138

Query: 62  GEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKPI 121
           GEQPLVHDGPMK+IYSIEGHFIDEMDSGKSPFSA  P+EA VFFLP+SI YI++YIY PI
Sbjct: 139 GEQPLVHDGPMKNIYSIEGHFIDEMDSGKSPFSAQNPDEAHVFFLPVSIAYIIEYIYTPI 198

Query: 122 TTYARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIRV 181
           TTYARDRL+RIF DYVRVVA++YPYWNRTRGADHFM SCHDWAP++T+ DP+LFKY IRV
Sbjct: 199 TTYARDRLIRIFKDYVRVVADRYPYWNRTRGADHFMASCHDWAPDITQTDPDLFKYLIRV 258

Query: 182 LCNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHILM 241
           LCNANTSEGFNP+RDASLPEINLP  FHL+L R GQPP+NRSILAFFAGGAHGFIR +L 
Sbjct: 259 LCNANTSEGFNPVRDASLPEINLPANFHLDLSRSGQPPENRSILAFFAGGAHGFIRKMLF 318

Query: 242 QHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISD 301
           +HWKDKD EIQVHEYL   QNY E I RS+FCLCPSGYEVASPRLVEAI GGCVPV+ISD
Sbjct: 319 EHWKDKDDEIQVHEYLHKGQNYGEFISRSRFCLCPSGYEVASPRLVEAIQGGCVPVIISD 378

Query: 302 YYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAK 361
           YYSLPFDDVLDWSKFS+RIPS+RIPEIK IL+GVS  KYLKL +GVMKVQRHFE+HRPAK
Sbjct: 379 YYSLPFDDVLDWSKFSLRIPSKRIPEIKKILKGVSPAKYLKLHQGVMKVQRHFEVHRPAK 438

Query: 362 AFDMFHMVLHSVWLRRLNVKLTH 385
            FD+FHMVLHSVWLRRLN++ +H
Sbjct: 439 PFDVFHMVLHSVWLRRLNIRPSH 461

BLAST of CsGy1G017140.1 vs. NCBI nr
Match: XP_023515662.1 (probable glycosyltransferase At5g20260 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 675.2 bits (1741), Expect = 1.3e-190
Identity = 315/383 (82.25%), Postives = 344/383 (89.82%), Query Frame = 0

Query: 2   IEEGLAEARAAIRLAIVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKE 61
           IEE LA AR AI+ AIV RNYTSEK ESFIPRGRVYRNAYAFHQSHIEM KR K+WTYKE
Sbjct: 59  IEEDLARARVAIQEAIVRRNYTSEKVESFIPRGRVYRNAYAFHQSHIEMVKRFKVWTYKE 118

Query: 62  GEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKPI 121
           GEQPLVHDGPMK+IYSIEGHFIDEMDSGKSPFSA  P+EA VFFLP+SI YI++YIY PI
Sbjct: 119 GEQPLVHDGPMKNIYSIEGHFIDEMDSGKSPFSAQNPDEAHVFFLPVSIAYIIEYIYTPI 178

Query: 122 TTYARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIRV 181
           TTYARDRL+RIF DY  VVAN+YPYWNRTRGADHFM SCHDWAP++T+ DP+LFKY IRV
Sbjct: 179 TTYARDRLIRIFKDYATVVANRYPYWNRTRGADHFMASCHDWAPDITQADPDLFKYLIRV 238

Query: 182 LCNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHILM 241
           LCNANTSEGFNP+RDASLPEINLP  F LNL R GQPP+NRSILAFFAGGAHGFIR +L 
Sbjct: 239 LCNANTSEGFNPVRDASLPEINLPANFQLNLSRSGQPPENRSILAFFAGGAHGFIRKMLF 298

Query: 242 QHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISD 301
           +HWKDKD EIQVHEYL   QNY E I RS+FCLCPSGYEVASPRLVEAI GGCVPV+ISD
Sbjct: 299 EHWKDKDDEIQVHEYLHKGQNYGEFISRSRFCLCPSGYEVASPRLVEAIQGGCVPVIISD 358

Query: 302 YYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAK 361
           YYSLPFDDVLDWSKFS+RIPS+RIPEIK IL+G+S  KYLKLQ+GVMKVQRHFE+HRPAK
Sbjct: 359 YYSLPFDDVLDWSKFSLRIPSKRIPEIKKILKGISPAKYLKLQQGVMKVQRHFEVHRPAK 418

Query: 362 AFDMFHMVLHSVWLRRLNVKLTH 385
            FD+FHMVLHSVWLRRLN++ +H
Sbjct: 419 PFDVFHMVLHSVWLRRLNIRPSH 441

BLAST of CsGy1G017140.1 vs. TAIR10
Match: AT5G20260.1 (Exostosin family protein)

HSP 1 Score: 531.6 bits (1368), Expect = 4.1e-151
Identity = 242/383 (63.19%), Postives = 317/383 (82.77%), Query Frame = 0

Query: 1   MIEEGLAEARAAIRLAIVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYK 60
           +IEEGLA++R+AIR A+  + + S+KEE+F+PRG VYRNA+AFHQSHIEM+K+ K+W Y+
Sbjct: 76  IIEEGLAKSRSAIREAVRLKKFVSDKEETFVPRGAVYRNAFAFHQSHIEMEKKFKVWVYR 135

Query: 61  EGEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKP 120
           EGE PLVH GPM +IYSIEG F+DE+++G SPF+A+ PEEA  F LP+S+  IV Y+Y+P
Sbjct: 136 EGETPLVHMGPMNNIYSIEGQFMDEIETGMSPFAANNPEEAHAFLLPVSVANIVHYLYRP 195

Query: 121 ITTYARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIR 180
           + TY+R++L ++F DYV VVA+KYPYWNR+ GADHF VSCHDWAP+V+  +P L K  IR
Sbjct: 196 LVTYSREQLHKVFLDYVDVVAHKYPYWNRSLGADHFYVSCHDWAPDVSGSNPELMKNLIR 255

Query: 181 VLCNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPP-QNRSILAFFAGGAHGFIRHI 240
           VLCNANTSEGF P RD S+PEIN+P   HL  PRL +    +R ILAFFAGG+HG+IR I
Sbjct: 256 VLCNANTSEGFMPQRDVSIPEINIPGG-HLGPPRLSRSSGHDRPILAFFAGGSHGYIRRI 315

Query: 241 LMQHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVI 300
           L+QHWKDKD E+QVHEYL  +++Y +L+  ++FCLCPSGYEVASPR+V AI+ GCVPV+I
Sbjct: 316 LLQHWKDKDEEVQVHEYLAKNKDYFKLMATARFCLCPSGYEVASPRVVAAINLGCVPVII 375

Query: 301 SDYYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRP 360
           SD+Y+LPF DVLDW+KF++ +PS++IPEIKTIL+ +S ++Y  LQR V++VQRHF I+RP
Sbjct: 376 SDHYALPFSDVLDWTKFTIHVPSKKIPEIKTILKSISWRRYRVLQRRVLQVQRHFVINRP 435

Query: 361 AKAFDMFHMVLHSVWLRRLNVKL 383
           ++ FDM  M+LHSVWLRRLN++L
Sbjct: 436 SQPFDMLRMLLHSVWLRRLNLRL 457

BLAST of CsGy1G017140.1 vs. TAIR10
Match: AT5G11130.1 (Exostosin family protein)

HSP 1 Score: 501.5 bits (1290), Expect = 4.6e-142
Identity = 223/386 (57.77%), Postives = 302/386 (78.24%), Query Frame = 0

Query: 2   IEEGLAEARAAIRLA-----IVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKI 61
           IEEGLA ARAAIR A        R+ T+  +   +  G VY NA+ FHQSH EM+KR KI
Sbjct: 92  IEEGLAMARAAIRKAGEKNLRRDRDRTNNSDVGVVSNGSVYLNAFTFHQSHKEMEKRFKI 151

Query: 62  WTYKEGEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAHEPEEAQVFFLPISIVYIVDY 121
           WTY+EGE PL H GP+ +IY+IEG F+DE+++G S F A  PEEA VF++P+ IV I+ +
Sbjct: 152 WTYREGEAPLFHKGPLNNIYAIEGQFMDEIENGNSRFKAASPEEATVFYIPVGIVNIIRF 211

Query: 122 IYKPITTYARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFK 181
           +Y+P T+YARDRL  I  DY+ +++N+YPYWNR+RGADHF +SCHDWAP+V+  DP L+K
Sbjct: 212 VYRPYTSYARDRLQNIVKDYISLISNRYPYWNRSRGADHFFLSCHDWAPDVSAVDPELYK 271

Query: 182 YFIRVLCNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPPQNRSILAFFAGGAHGFI 241
           +FIR LCNAN+SEGF PMRD SLPEIN+P +  L     G+PPQNR +LAFFAGG+HG +
Sbjct: 272 HFIRALCNANSSEGFTPMRDVSLPEINIPHS-QLGFVHTGEPPQNRKLLAFFAGGSHGDV 331

Query: 242 RHILMQHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVP 301
           R IL QHWK+KD ++ V+E LP + NYT+++D++KFCLCPSG+EVASPR+VE+++ GCVP
Sbjct: 332 RKILFQHWKEKDKDVLVYENLPKTMNYTKMMDKAKFCLCPSGWEVASPRIVESLYSGCVP 391

Query: 302 VVISDYYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHFEI 361
           V+I+DYY LPF DVL+W  FS+ IP  ++P+IK IL  ++ ++YL +QR V++V++HF I
Sbjct: 392 VIIADYYVLPFSDVLNWKTFSVHIPISKMPDIKKILEAITEEEYLNMQRRVLEVRKHFVI 451

Query: 362 HRPAKAFDMFHMVLHSVWLRRLNVKL 383
           +RP+K +DM HM++HS+WLRRLNV++
Sbjct: 452 NRPSKPYDMLHMIMHSIWLRRLNVRI 476

BLAST of CsGy1G017140.1 vs. TAIR10
Match: AT3G42180.1 (Exostosin family protein)

HSP 1 Score: 488.8 bits (1257), Expect = 3.1e-138
Identity = 237/383 (61.88%), Postives = 291/383 (75.98%), Query Frame = 0

Query: 8   EARAAIRLAIVTRNYTSEKEE-SFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKEGEQPL 67
           +ARAAIR A+  +N TS +E  ++IP G++YRN++AFHQSHIEM K  K+W+YKEGEQPL
Sbjct: 87  KARAAIRRAVRFKNCTSNEEVITYIPTGQIYRNSFAFHQSHIEMMKTFKVWSYKEGEQPL 146

Query: 68  VHDGPMKHIYSIEGHFIDE----MDSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKPIT 127
           VHDGP+  IY IEG FIDE    M      F A  PEEA  FFLP S+  IV Y+Y+PIT
Sbjct: 147 VHDGPVNDIYGIEGQFIDELSYVMGGPSGRFRASRPEEAHAFFLPFSVANIVHYVYQPIT 206

Query: 128 T---YARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFI 187
           +   + R RL RIF DYV VVA+K+P+WN++ GADHFMVSCHDWAP+V    P  FK F+
Sbjct: 207 SPADFNRARLHRIFNDYVDVVAHKHPFWNQSNGADHFMVSCHDWAPDVPDSKPEFFKNFM 266

Query: 188 RVLCNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHI 247
           R LCNANTSEGF    D S+PEIN+P    L  P +GQ P+NR+ILAFFAG AHG+IR +
Sbjct: 267 RGLCNANTSEGFRRNIDFSIPEINIPKR-KLKPPFMGQNPENRTILAFFAGRAHGYIREV 326

Query: 248 LMQHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVI 307
           L  HWK KD ++QV+++L   QNY ELI  SKFCLCPSGYEVASPR VEAI+ GCVPVVI
Sbjct: 327 LFSHWKGKDKDVQVYDHLTKGQNYHELIGHSKFCLCPSGYEVASPREVEAIYSGCVPVVI 386

Query: 308 SDYYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRP 367
           SD YSLPF+DVLDWSKFS+ IP ++IP+IK IL+     KYL++ R VMKV+RHF ++RP
Sbjct: 387 SDNYSLPFNDVLDWSKFSVEIPVDKIPDIKKILQXXXHDKYLRMYRNVMKVRRHFVVNRP 446

Query: 368 AKAFDMFHMVLHSVWLRRLNVKL 383
           A+ FD+ HM+LHSVWLRRLN++L
Sbjct: 447 AQPFDVIHMILHSVWLRRLNIRL 468

BLAST of CsGy1G017140.1 vs. TAIR10
Match: AT5G33290.1 (xylogalacturonan deficient 1)

HSP 1 Score: 456.4 bits (1173), Expect = 1.7e-128
Identity = 223/388 (57.47%), Postives = 284/388 (73.20%), Query Frame = 0

Query: 2   IEEGLAEARAAIRLAIVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKE 61
           IE  LA+ARAAI+ A  T+NY S           +Y+N  AFHQSH EM  R K+WTY E
Sbjct: 122 IESDLAKARAAIKKAASTQNYVSS----------LYKNPAAFHQSHTEMMNRFKVWTYTE 181

Query: 62  GEQPLVHDGPMKHIYSIEGHFIDEM----DSGKSPFSAHEPEEAQVFFLPISIVYIVDYI 121
           GE PL HDGP+  IY IEG F+DEM       +S F A  PE A VFF+P S+  ++ ++
Sbjct: 182 GEVPLFHDGPVNDIYGIEGQFMDEMCVDGPKSRSRFRADRPENAHVFFIPFSVAKVIHFV 241

Query: 122 YKPITT---YARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNL 181
           YKPIT+   ++R RL R+  DYV VVA K+PYWNR++G DHFMVSCHDWAP+V   +P L
Sbjct: 242 YKPITSVEGFSRARLHRLIEDYVDVVATKHPYWNRSQGGDHFMVSCHDWAPDVIDGNPKL 301

Query: 182 FKYFIRVLCNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPPQNRSILAFFAGGAHG 241
           F+ FIR LCNANTSEGF P  D S+PEI LP    L    LG+ P+ RSILAFFAG +HG
Sbjct: 302 FEKFIRGLCNANTSEGFRPNVDVSIPEIYLPKG-KLGPSFLGKSPRVRSILAFFAGRSHG 361

Query: 242 FIRHILMQHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGC 301
            IR IL QHWK+ D+E+QV++ LPP ++YT+ +  SKFCLCPSG+EVASPR VEAI+ GC
Sbjct: 362 EIRKILFQHWKEMDNEVQVYDRLPPGKDYTKTMGMSKFCLCPSGWEVASPREVEAIYAGC 421

Query: 302 VPVVISDYYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHF 361
           VPV+ISD YSLPF DVL+W  FS++IP  RI EIKTIL+ VS+ +YLK+ + V++V++HF
Sbjct: 422 VPVIISDNYSLPFSDVLNWDSFSIQIPVSRIKEIKTILQSVSLVRYLKMYKRVLEVKQHF 481

Query: 362 EIHRPAKAFDMFHMVLHSVWLRRLNVKL 383
            ++RPAK +D+ HM+LHS+WLRRLN++L
Sbjct: 482 VLNRPAKPYDVMHMMLHSIWLRRLNLRL 498

BLAST of CsGy1G017140.1 vs. TAIR10
Match: AT5G03795.1 (Exostosin family protein)

HSP 1 Score: 404.1 bits (1037), Expect = 9.9e-113
Identity = 196/383 (51.17%), Postives = 274/383 (71.54%), Query Frame = 0

Query: 2   IEEGLAEARAAIRLAIVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKE 61
           IE  L +ARA+I+ A +        +  ++P G +Y NA  FH+S++EM+K+ KI+ YKE
Sbjct: 141 IEFKLQKARASIKAASMD---DPVDDPDYVPLGPMYWNAKVFHRSYLEMEKQFKIYVYKE 200

Query: 62  GEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKPI 121
           GE PL HDGP K IYS+EG FI E+++  + F  + P++A VF+LP S+V +V Y+Y+  
Sbjct: 201 GEPPLFHDGPCKSIYSMEGSFIYEIET-DTRFRTNNPDKAHVFYLPFSVVKMVRYVYE-- 260

Query: 122 TTYARD--RLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFI 181
              +RD   +     DY+ +V +KYPYWNR+ GADHF++SCHDW PE +   P+L    I
Sbjct: 261 -RNSRDFSPIRNTVKDYINLVGDKYPYWNRSIGADHFILSCHDWGPEASFSHPHLGHNSI 320

Query: 182 RVLCNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHI 241
           R LCNANTSE F P +D S+PEINL  T  L     G  P +R ILAFFAGG HG +R +
Sbjct: 321 RALCNANTSERFKPRKDVSIPEINL-RTGSLTGLVGGPSPSSRPILAFFAGGVHGPVRPV 380

Query: 242 LMQHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVI 301
           L+QHW++KD++I+VH+YLP   +Y++++  SKFC+CPSGYEVASPR+VEA++ GCVPV+I
Sbjct: 381 LLQHWENKDNDIRVHKYLPRGTSYSDMMRNSKFCICPSGYEVASPRIVEALYSGCVPVLI 440

Query: 302 SDYYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRP 361
           +  Y  PF DVL+W  FS+ +  E IP +KTIL  +S ++YL++ R V+KV+RHFE++ P
Sbjct: 441 NSGYVPPFSDVLNWRSFSVIVSVEDIPNLKTILTSISPRQYLRMYRRVLKVRRHFEVNSP 500

Query: 362 AKAFDMFHMVLHSVWLRRLNVKL 383
           AK FD+FHM+LHS+W+RRLNVK+
Sbjct: 501 AKRFDVFHMILHSIWVRRLNVKI 515

BLAST of CsGy1G017140.1 vs. Swiss-Prot
Match: sp|Q3E9A4|GLYT5_ARATH (Probable glycosyltransferase At5g20260 OS=Arabidopsis thaliana OX=3702 GN=At5g20260 PE=3 SV=3)

HSP 1 Score: 531.6 bits (1368), Expect = 7.4e-150
Identity = 242/383 (63.19%), Postives = 317/383 (82.77%), Query Frame = 0

Query: 1   MIEEGLAEARAAIRLAIVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYK 60
           +IEEGLA++R+AIR A+  + + S+KEE+F+PRG VYRNA+AFHQSHIEM+K+ K+W Y+
Sbjct: 84  IIEEGLAKSRSAIREAVRLKKFVSDKEETFVPRGAVYRNAFAFHQSHIEMEKKFKVWVYR 143

Query: 61  EGEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKP 120
           EGE PLVH GPM +IYSIEG F+DE+++G SPF+A+ PEEA  F LP+S+  IV Y+Y+P
Sbjct: 144 EGETPLVHMGPMNNIYSIEGQFMDEIETGMSPFAANNPEEAHAFLLPVSVANIVHYLYRP 203

Query: 121 ITTYARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIR 180
           + TY+R++L ++F DYV VVA+KYPYWNR+ GADHF VSCHDWAP+V+  +P L K  IR
Sbjct: 204 LVTYSREQLHKVFLDYVDVVAHKYPYWNRSLGADHFYVSCHDWAPDVSGSNPELMKNLIR 263

Query: 181 VLCNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPP-QNRSILAFFAGGAHGFIRHI 240
           VLCNANTSEGF P RD S+PEIN+P   HL  PRL +    +R ILAFFAGG+HG+IR I
Sbjct: 264 VLCNANTSEGFMPQRDVSIPEINIPGG-HLGPPRLSRSSGHDRPILAFFAGGSHGYIRRI 323

Query: 241 LMQHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVI 300
           L+QHWKDKD E+QVHEYL  +++Y +L+  ++FCLCPSGYEVASPR+V AI+ GCVPV+I
Sbjct: 324 LLQHWKDKDEEVQVHEYLAKNKDYFKLMATARFCLCPSGYEVASPRVVAAINLGCVPVII 383

Query: 301 SDYYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRP 360
           SD+Y+LPF DVLDW+KF++ +PS++IPEIKTIL+ +S ++Y  LQR V++VQRHF I+RP
Sbjct: 384 SDHYALPFSDVLDWTKFTIHVPSKKIPEIKTILKSISWRRYRVLQRRVLQVQRHFVINRP 443

Query: 361 AKAFDMFHMVLHSVWLRRLNVKL 383
           ++ FDM  M+LHSVWLRRLN++L
Sbjct: 444 SQPFDMLRMLLHSVWLRRLNLRL 465

BLAST of CsGy1G017140.1 vs. Swiss-Prot
Match: sp|Q9LFP3|GLYT4_ARATH (Probable glycosyltransferase At5g11130 OS=Arabidopsis thaliana OX=3702 GN=At5g11130/At5g11120 PE=3 SV=2)

HSP 1 Score: 501.5 bits (1290), Expect = 8.2e-141
Identity = 223/386 (57.77%), Postives = 302/386 (78.24%), Query Frame = 0

Query: 2   IEEGLAEARAAIRLA-----IVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKI 61
           IEEGLA ARAAIR A        R+ T+  +   +  G VY NA+ FHQSH EM+KR KI
Sbjct: 92  IEEGLAMARAAIRKAGEKNLRRDRDRTNNSDVGVVSNGSVYLNAFTFHQSHKEMEKRFKI 151

Query: 62  WTYKEGEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAHEPEEAQVFFLPISIVYIVDY 121
           WTY+EGE PL H GP+ +IY+IEG F+DE+++G S F A  PEEA VF++P+ IV I+ +
Sbjct: 152 WTYREGEAPLFHKGPLNNIYAIEGQFMDEIENGNSRFKAASPEEATVFYIPVGIVNIIRF 211

Query: 122 IYKPITTYARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFK 181
           +Y+P T+YARDRL  I  DY+ +++N+YPYWNR+RGADHF +SCHDWAP+V+  DP L+K
Sbjct: 212 VYRPYTSYARDRLQNIVKDYISLISNRYPYWNRSRGADHFFLSCHDWAPDVSAVDPELYK 271

Query: 182 YFIRVLCNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPPQNRSILAFFAGGAHGFI 241
           +FIR LCNAN+SEGF PMRD SLPEIN+P +  L     G+PPQNR +LAFFAGG+HG +
Sbjct: 272 HFIRALCNANSSEGFTPMRDVSLPEINIPHS-QLGFVHTGEPPQNRKLLAFFAGGSHGDV 331

Query: 242 RHILMQHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVP 301
           R IL QHWK+KD ++ V+E LP + NYT+++D++KFCLCPSG+EVASPR+VE+++ GCVP
Sbjct: 332 RKILFQHWKEKDKDVLVYENLPKTMNYTKMMDKAKFCLCPSGWEVASPRIVESLYSGCVP 391

Query: 302 VVISDYYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHFEI 361
           V+I+DYY LPF DVL+W  FS+ IP  ++P+IK IL  ++ ++YL +QR V++V++HF I
Sbjct: 392 VIIADYYVLPFSDVLNWKTFSVHIPISKMPDIKKILEAITEEEYLNMQRRVLEVRKHFVI 451

Query: 362 HRPAKAFDMFHMVLHSVWLRRLNVKL 383
           +RP+K +DM HM++HS+WLRRLNV++
Sbjct: 452 NRPSKPYDMLHMIMHSIWLRRLNVRI 476

BLAST of CsGy1G017140.1 vs. Swiss-Prot
Match: sp|Q3EAR7|GLYT2_ARATH (Probable glycosyltransferase At3g42180 OS=Arabidopsis thaliana OX=3702 GN=At3g42180 PE=2 SV=2)

HSP 1 Score: 488.8 bits (1257), Expect = 5.5e-137
Identity = 237/383 (61.88%), Postives = 291/383 (75.98%), Query Frame = 0

Query: 8   EARAAIRLAIVTRNYTSEKEE-SFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKEGEQPL 67
           +ARAAIR A+  +N TS +E  ++IP G++YRN++AFHQSHIEM K  K+W+YKEGEQPL
Sbjct: 87  KARAAIRRAVRFKNCTSNEEVITYIPTGQIYRNSFAFHQSHIEMMKTFKVWSYKEGEQPL 146

Query: 68  VHDGPMKHIYSIEGHFIDE----MDSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKPIT 127
           VHDGP+  IY IEG FIDE    M      F A  PEEA  FFLP S+  IV Y+Y+PIT
Sbjct: 147 VHDGPVNDIYGIEGQFIDELSYVMGGPSGRFRASRPEEAHAFFLPFSVANIVHYVYQPIT 206

Query: 128 T---YARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFI 187
           +   + R RL RIF DYV VVA+K+P+WN++ GADHFMVSCHDWAP+V    P  FK F+
Sbjct: 207 SPADFNRARLHRIFNDYVDVVAHKHPFWNQSNGADHFMVSCHDWAPDVPDSKPEFFKNFM 266

Query: 188 RVLCNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHI 247
           R LCNANTSEGF    D S+PEIN+P    L  P +GQ P+NR+ILAFFAG AHG+IR +
Sbjct: 267 RGLCNANTSEGFRRNIDFSIPEINIPKR-KLKPPFMGQNPENRTILAFFAGRAHGYIREV 326

Query: 248 LMQHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVI 307
           L  HWK KD ++QV+++L   QNY ELI  SKFCLCPSGYEVASPR VEAI+ GCVPVVI
Sbjct: 327 LFSHWKGKDKDVQVYDHLTKGQNYHELIGHSKFCLCPSGYEVASPREVEAIYSGCVPVVI 386

Query: 308 SDYYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRP 367
           SD YSLPF+DVLDWSKFS+ IP ++IP+IK IL+     KYL++ R VMKV+RHF ++RP
Sbjct: 387 SDNYSLPFNDVLDWSKFSVEIPVDKIPDIKKILQXXXHDKYLRMYRNVMKVRRHFVVNRP 446

Query: 368 AKAFDMFHMVLHSVWLRRLNVKL 383
           A+ FD+ HM+LHSVWLRRLN++L
Sbjct: 447 AQPFDVIHMILHSVWLRRLNIRL 468

BLAST of CsGy1G017140.1 vs. Swiss-Prot
Match: sp|Q94AA9|XGD1_ARATH (Xylogalacturonan beta-1,3-xylosyltransferase OS=Arabidopsis thaliana OX=3702 GN=XGD1 PE=1 SV=2)

HSP 1 Score: 456.4 bits (1173), Expect = 3.0e-127
Identity = 223/388 (57.47%), Postives = 284/388 (73.20%), Query Frame = 0

Query: 2   IEEGLAEARAAIRLAIVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKE 61
           IE  LA+ARAAI+ A  T+NY S           +Y+N  AFHQSH EM  R K+WTY E
Sbjct: 122 IESDLAKARAAIKKAASTQNYVSS----------LYKNPAAFHQSHTEMMNRFKVWTYTE 181

Query: 62  GEQPLVHDGPMKHIYSIEGHFIDEM----DSGKSPFSAHEPEEAQVFFLPISIVYIVDYI 121
           GE PL HDGP+  IY IEG F+DEM       +S F A  PE A VFF+P S+  ++ ++
Sbjct: 182 GEVPLFHDGPVNDIYGIEGQFMDEMCVDGPKSRSRFRADRPENAHVFFIPFSVAKVIHFV 241

Query: 122 YKPITT---YARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNL 181
           YKPIT+   ++R RL R+  DYV VVA K+PYWNR++G DHFMVSCHDWAP+V   +P L
Sbjct: 242 YKPITSVEGFSRARLHRLIEDYVDVVATKHPYWNRSQGGDHFMVSCHDWAPDVIDGNPKL 301

Query: 182 FKYFIRVLCNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPPQNRSILAFFAGGAHG 241
           F+ FIR LCNANTSEGF P  D S+PEI LP    L    LG+ P+ RSILAFFAG +HG
Sbjct: 302 FEKFIRGLCNANTSEGFRPNVDVSIPEIYLPKG-KLGPSFLGKSPRVRSILAFFAGRSHG 361

Query: 242 FIRHILMQHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGC 301
            IR IL QHWK+ D+E+QV++ LPP ++YT+ +  SKFCLCPSG+EVASPR VEAI+ GC
Sbjct: 362 EIRKILFQHWKEMDNEVQVYDRLPPGKDYTKTMGMSKFCLCPSGWEVASPREVEAIYAGC 421

Query: 302 VPVVISDYYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHF 361
           VPV+ISD YSLPF DVL+W  FS++IP  RI EIKTIL+ VS+ +YLK+ + V++V++HF
Sbjct: 422 VPVIISDNYSLPFSDVLNWDSFSIQIPVSRIKEIKTILQSVSLVRYLKMYKRVLEVKQHF 481

Query: 362 EIHRPAKAFDMFHMVLHSVWLRRLNVKL 383
            ++RPAK +D+ HM+LHS+WLRRLN++L
Sbjct: 482 VLNRPAKPYDVMHMMLHSIWLRRLNLRL 498

BLAST of CsGy1G017140.1 vs. Swiss-Prot
Match: sp|Q9FFN2|GLYT3_ARATH (Probable glycosyltransferase At5g03795 OS=Arabidopsis thaliana OX=3702 GN=At5g03795 PE=3 SV=2)

HSP 1 Score: 404.1 bits (1037), Expect = 1.8e-111
Identity = 196/383 (51.17%), Postives = 274/383 (71.54%), Query Frame = 0

Query: 2   IEEGLAEARAAIRLAIVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKE 61
           IE  L +ARA+I+ A +        +  ++P G +Y NA  FH+S++EM+K+ KI+ YKE
Sbjct: 141 IEFKLQKARASIKAASMD---DPVDDPDYVPLGPMYWNAKVFHRSYLEMEKQFKIYVYKE 200

Query: 62  GEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKPI 121
           GE PL HDGP K IYS+EG FI E+++  + F  + P++A VF+LP S+V +V Y+Y+  
Sbjct: 201 GEPPLFHDGPCKSIYSMEGSFIYEIET-DTRFRTNNPDKAHVFYLPFSVVKMVRYVYE-- 260

Query: 122 TTYARD--RLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFI 181
              +RD   +     DY+ +V +KYPYWNR+ GADHF++SCHDW PE +   P+L    I
Sbjct: 261 -RNSRDFSPIRNTVKDYINLVGDKYPYWNRSIGADHFILSCHDWGPEASFSHPHLGHNSI 320

Query: 182 RVLCNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHI 241
           R LCNANTSE F P +D S+PEINL  T  L     G  P +R ILAFFAGG HG +R +
Sbjct: 321 RALCNANTSERFKPRKDVSIPEINL-RTGSLTGLVGGPSPSSRPILAFFAGGVHGPVRPV 380

Query: 242 LMQHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVI 301
           L+QHW++KD++I+VH+YLP   +Y++++  SKFC+CPSGYEVASPR+VEA++ GCVPV+I
Sbjct: 381 LLQHWENKDNDIRVHKYLPRGTSYSDMMRNSKFCICPSGYEVASPRIVEALYSGCVPVLI 440

Query: 302 SDYYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRP 361
           +  Y  PF DVL+W  FS+ +  E IP +KTIL  +S ++YL++ R V+KV+RHFE++ P
Sbjct: 441 NSGYVPPFSDVLNWRSFSVIVSVEDIPNLKTILTSISPRQYLRMYRRVLKVRRHFEVNSP 500

Query: 362 AKAFDMFHMVLHSVWLRRLNVKL 383
           AK FD+FHM+LHS+W+RRLNVK+
Sbjct: 501 AKRFDVFHMILHSIWVRRLNVKI 515

BLAST of CsGy1G017140.1 vs. TrEMBL
Match: tr|A0A0A0LTL1|A0A0A0LTL1_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G226410 PE=4 SV=1)

HSP 1 Score: 802.0 bits (2070), Expect = 5.9e-229
Identity = 384/384 (100.00%), Postives = 384/384 (100.00%), Query Frame = 0

Query: 1   MIEEGLAEARAAIRLAIVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYK 60
           MIEEGLAEARAAIRLAIVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYK
Sbjct: 95  MIEEGLAEARAAIRLAIVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYK 154

Query: 61  EGEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKP 120
           EGEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKP
Sbjct: 155 EGEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKP 214

Query: 121 ITTYARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIR 180
           ITTYARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIR
Sbjct: 215 ITTYARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIR 274

Query: 181 VLCNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHIL 240
           VLCNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHIL
Sbjct: 275 VLCNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHIL 334

Query: 241 MQHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVIS 300
           MQHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVIS
Sbjct: 335 MQHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVIS 394

Query: 301 DYYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPA 360
           DYYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPA
Sbjct: 395 DYYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPA 454

Query: 361 KAFDMFHMVLHSVWLRRLNVKLTH 385
           KAFDMFHMVLHSVWLRRLNVKLTH
Sbjct: 455 KAFDMFHMVLHSVWLRRLNVKLTH 478

BLAST of CsGy1G017140.1 vs. TrEMBL
Match: tr|A0A1S3B8R4|A0A1S3B8R4_CUCME (probable glycosyltransferase At5g20260 OS=Cucumis melo OX=3656 GN=LOC103487410 PE=4 SV=1)

HSP 1 Score: 785.0 bits (2026), Expect = 7.5e-224
Identity = 374/384 (97.40%), Postives = 379/384 (98.70%), Query Frame = 0

Query: 1   MIEEGLAEARAAIRLAIVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYK 60
           MIEEGLAEARAAIR AIVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYK
Sbjct: 97  MIEEGLAEARAAIRQAIVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYK 156

Query: 61  EGEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKP 120
           EGEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAH+PEEA VFFLPISIVYIVDYIYKP
Sbjct: 157 EGEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAHDPEEAHVFFLPISIVYIVDYIYKP 216

Query: 121 ITTYARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIR 180
           ITTYARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIR
Sbjct: 217 ITTYARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIR 276

Query: 181 VLCNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHIL 240
           VLCNANTSEGFNPMRDASLPEINLPPTFHLNLPR GQPPQNRSILAFFAGGAHGFIRH+L
Sbjct: 277 VLCNANTSEGFNPMRDASLPEINLPPTFHLNLPRSGQPPQNRSILAFFAGGAHGFIRHVL 336

Query: 241 MQHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVIS 300
           MQHWKDKD EIQVHEYLPP++NYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPV+IS
Sbjct: 337 MQHWKDKDDEIQVHEYLPPAKNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVIIS 396

Query: 301 DYYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPA 360
           DYYSLPFDDVLDWSKFSMRIPSERIPEIK ILRGVSMKKYLKLQRGVMKVQRHFEIHRPA
Sbjct: 397 DYYSLPFDDVLDWSKFSMRIPSERIPEIKKILRGVSMKKYLKLQRGVMKVQRHFEIHRPA 456

Query: 361 KAFDMFHMVLHSVWLRRLNVKLTH 385
           KAFDMFHMVLHSVWLRRLNVKLTH
Sbjct: 457 KAFDMFHMVLHSVWLRRLNVKLTH 480

BLAST of CsGy1G017140.1 vs. TrEMBL
Match: tr|A0A2P4KFA0|A0A2P4KFA0_QUESU (Putative glycosyltransferase OS=Quercus suber OX=58331 GN=CFP56_18609 PE=4 SV=1)

HSP 1 Score: 587.4 bits (1513), Expect = 2.3e-164
Identity = 280/385 (72.73%), Postives = 326/385 (84.68%), Query Frame = 0

Query: 2   IEEGLAEARAAIRLAIVTR--NYTSEKEE-SFIPRGRVYRNAYAFHQSHIEMKKRLKIWT 61
           IE+ LA ARAAI  AI TR  NYTS+ ++ SFIPRG +YRNAYAFHQSH EM KR K+W 
Sbjct: 413 IEDDLARARAAIHKAIRTRKWNYTSDDDDGSFIPRGSIYRNAYAFHQSHKEMVKRFKLWA 472

Query: 62  YKEGEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAHEPEEAQVFFLPISIVYIVDYIY 121
           Y+EGEQPLVHDGP KHIYSIEGHFIDEM+SGKS F A  P+EA VFFLPIS+ YIV+YIY
Sbjct: 473 YREGEQPLVHDGPTKHIYSIEGHFIDEMESGKSTFMARHPDEAHVFFLPISVTYIVEYIY 532

Query: 122 KPITTYARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYF 181
           KP+T YARDRLVRI TDY+  VA++YPYWNR+ GADHF VSCHDW PEV+K+DP LFK F
Sbjct: 533 KPVTNYARDRLVRIVTDYIYTVADRYPYWNRSSGADHFFVSCHDWGPEVSKDDPKLFKNF 592

Query: 182 IRVLCNANTSEGFNPMRDASLPEINLPPTFHLNLPR-LGQPPQNRSILAFFAGGAHGFIR 241
           +RVLCNANTSEGF P RD SLPE NL P F L+ PR LG  P  RSILAFFAGGAHG IR
Sbjct: 593 MRVLCNANTSEGFQPRRDVSLPEFNLEP-FKLSPPRNLGVAPNKRSILAFFAGGAHGDIR 652

Query: 242 HILMQHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPV 301
           + L+++WKDKD E++VHEYLP +QNY++L+ +SKFCLCPSG+EVASPRLVEAI   CVPV
Sbjct: 653 NALLEYWKDKDDEVRVHEYLPKNQNYSKLMGQSKFCLCPSGFEVASPRLVEAIFAECVPV 712

Query: 302 VISDYYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIH 361
           +ISDYY LPF DVL+WSKFS+ IPS+RIPEIKTIL+G+S  KYLK+Q+ V KVQRHFE++
Sbjct: 713 IISDYYVLPFSDVLNWSKFSLHIPSKRIPEIKTILKGISNSKYLKMQKRVTKVQRHFELN 772

Query: 362 RPAKAFDMFHMVLHSVWLRRLNVKL 383
           RPAK FD+FHM+LHSVWLRRLN++L
Sbjct: 773 RPAKPFDVFHMLLHSVWLRRLNIRL 796

BLAST of CsGy1G017140.1 vs. TrEMBL
Match: tr|A0A2I4F501|A0A2I4F501_9ROSI (probable glycosyltransferase At5g20260 OS=Juglans regia OX=51240 GN=LOC108995588 PE=4 SV=1)

HSP 1 Score: 584.3 bits (1505), Expect = 1.9e-163
Identity = 279/385 (72.47%), Postives = 321/385 (83.38%), Query Frame = 0

Query: 2   IEEGLAEARAAIRLAIVTRNYTSEKEES---FIPRGRVYRNAYAFHQSHIEMKKRLKIWT 61
           IE  LA ARAAIR AI+TRN+TS  + S   FIPRG +YRNAYAFHQSHIEM KR K+WT
Sbjct: 131 IEADLARARAAIRKAILTRNFTSHDKGSVIGFIPRGCIYRNAYAFHQSHIEMVKRFKVWT 190

Query: 62  YKEGEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAHEPEEAQVFFLPISIVYIVDYIY 121
           YKEGEQPLVHDGP KHIYSIEGHFIDEMD G S F AH P+EA VFFLPIS+ YIV+YIY
Sbjct: 191 YKEGEQPLVHDGPTKHIYSIEGHFIDEMDGGMSTFMAHHPDEAHVFFLPISVTYIVEYIY 250

Query: 122 KPITTYARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYF 181
            PITTY RDRLVRI TDY+  V  KYPYWNR+ GADHF+VSCHDWAP+V+KE P L+K F
Sbjct: 251 LPITTYDRDRLVRIVTDYIYTVRKKYPYWNRSSGADHFLVSCHDWAPQVSKEKPELYKNF 310

Query: 182 IRVLCNANTSEGFNPMRDASLPEINLPPTFHLNLPR-LGQPPQNRSILAFFAGGAHGFIR 241
           IRVLCNANTSEGF P RD SLPE NL P F LN PR +G P   R+IL FFAG AHG IR
Sbjct: 311 IRVLCNANTSEGFEPKRDVSLPEFNLEP-FKLNPPRDIGLPTAKRTILGFFAGRAHGDIR 370

Query: 242 HILMQHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPV 301
           +IL  HWKDKD +I+V E+LP +QNY++L+ +SKFCLCPSGYEVASPRLVEAIH GCVPV
Sbjct: 371 NILFAHWKDKDEDIRVFEHLPENQNYSKLMGQSKFCLCPSGYEVASPRLVEAIHAGCVPV 430

Query: 302 VISDYYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIH 361
           ++SDYY LPF DVLDWSKFS++IP+ RIPEIKTIL+G+   +YLK+QR V KV+RHFE++
Sbjct: 431 IVSDYYVLPFSDVLDWSKFSLQIPTNRIPEIKTILKGIPNFQYLKMQRRVTKVRRHFEMN 490

Query: 362 RPAKAFDMFHMVLHSVWLRRLNVKL 383
           RPAK FD+FHM+LHSVWLRRL+++L
Sbjct: 491 RPAKPFDVFHMLLHSVWLRRLDIRL 514

BLAST of CsGy1G017140.1 vs. TrEMBL
Match: tr|A0A061GVS6|A0A061GVS6_THECC (Exostosin family protein OS=Theobroma cacao OX=3641 GN=TCM_038164 PE=4 SV=1)

HSP 1 Score: 582.4 bits (1500), Expect = 7.4e-163
Identity = 269/381 (70.60%), Postives = 323/381 (84.78%), Query Frame = 0

Query: 2   IEEGLAEARAAIRLAIVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKE 61
           +E  LA ARAAIR AI TRNYTS KEE FIPRG +YRN YAFHQSHIEM +R KIWTYKE
Sbjct: 89  VEADLASARAAIREAIRTRNYTSYKEEKFIPRGCMYRNEYAFHQSHIEMVERFKIWTYKE 148

Query: 62  GEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKPI 121
           GE+PLVH GPMKHIY+IEG FI+E++ GKSPF A  P+EA VFFLP+S+ YIV+YIY PI
Sbjct: 149 GERPLVHTGPMKHIYAIEGQFIEEIEGGKSPFKAQHPDEAHVFFLPVSVAYIVNYIYLPI 208

Query: 122 TTYARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIRV 181
           TTY+RDRLVRIFTDY++VVA KYPYW+RT+GADHFMVSCHDWAPEV  +DP L+K  IRV
Sbjct: 209 TTYSRDRLVRIFTDYIKVVAKKYPYWSRTKGADHFMVSCHDWAPEVAGQDPELYKNLIRV 268

Query: 182 LCNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHILM 241
           LCNAN+SEGF+P RD +LPE+NLPP    +  R  QPP  R+ILAFFAGGAHG IR IL+
Sbjct: 269 LCNANSSEGFHPKRDVALPELNLPPR-GFSPRRFAQPPDKRTILAFFAGGAHGNIRKILL 328

Query: 242 QHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISD 301
            HWKDKD+E+QVHEYL   Q+Y++L+ RSKFCLCPSG+EVASPR+VE+ + GCVPV+ISD
Sbjct: 329 HHWKDKDNEVQVHEYLSKGQDYSKLMGRSKFCLCPSGFEVASPRVVESFYAGCVPVIISD 388

Query: 302 YYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAK 361
            Y LPF DVLDWSKFS++IP E+IP+IKTIL+ +   KYL++QR V+K++RHFE++RPAK
Sbjct: 389 NYVLPFSDVLDWSKFSVQIPVEKIPQIKTILQSIPGNKYLEMQRRVLKLRRHFELNRPAK 448

Query: 362 AFDMFHMVLHSVWLRRLNVKL 383
            FD+ HMVLHS+WLRRLN++L
Sbjct: 449 PFDIIHMVLHSIWLRRLNLRL 468

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_011655344.19.0e-229100.00PREDICTED: probable glycosyltransferase At5g20260 [Cucumis sativus] >KGN65113.1 ... [more]
XP_008443936.11.1e-22397.40PREDICTED: probable glycosyltransferase At5g20260 [Cucumis melo][more]
XP_022135540.14.7e-20187.73probable glycosyltransferase At5g20260 [Momordica charantia][more]
XP_022987712.15.2e-19283.03probable glycosyltransferase At5g20260 isoform X1 [Cucurbita maxima][more]
XP_023515662.11.3e-19082.25probable glycosyltransferase At5g20260 isoform X1 [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
AT5G20260.14.1e-15163.19Exostosin family protein[more]
AT5G11130.14.6e-14257.77Exostosin family protein[more]
AT3G42180.13.1e-13861.88Exostosin family protein[more]
AT5G33290.11.7e-12857.47xylogalacturonan deficient 1[more]
AT5G03795.19.9e-11351.17Exostosin family protein[more]
Match NameE-valueIdentityDescription
sp|Q3E9A4|GLYT5_ARATH7.4e-15063.19Probable glycosyltransferase At5g20260 OS=Arabidopsis thaliana OX=3702 GN=At5g20... [more]
sp|Q9LFP3|GLYT4_ARATH8.2e-14157.77Probable glycosyltransferase At5g11130 OS=Arabidopsis thaliana OX=3702 GN=At5g11... [more]
sp|Q3EAR7|GLYT2_ARATH5.5e-13761.88Probable glycosyltransferase At3g42180 OS=Arabidopsis thaliana OX=3702 GN=At3g42... [more]
sp|Q94AA9|XGD1_ARATH3.0e-12757.47Xylogalacturonan beta-1,3-xylosyltransferase OS=Arabidopsis thaliana OX=3702 GN=... [more]
sp|Q9FFN2|GLYT3_ARATH1.8e-11151.17Probable glycosyltransferase At5g03795 OS=Arabidopsis thaliana OX=3702 GN=At5g03... [more]
Match NameE-valueIdentityDescription
tr|A0A0A0LTL1|A0A0A0LTL1_CUCSA5.9e-229100.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G226410 PE=4 SV=1[more]
tr|A0A1S3B8R4|A0A1S3B8R4_CUCME7.5e-22497.40probable glycosyltransferase At5g20260 OS=Cucumis melo OX=3656 GN=LOC103487410 P... [more]
tr|A0A2P4KFA0|A0A2P4KFA0_QUESU2.3e-16472.73Putative glycosyltransferase OS=Quercus suber OX=58331 GN=CFP56_18609 PE=4 SV=1[more]
tr|A0A2I4F501|A0A2I4F501_9ROSI1.9e-16372.47probable glycosyltransferase At5g20260 OS=Juglans regia OX=51240 GN=LOC108995588... [more]
tr|A0A061GVS6|A0A061GVS6_THECC7.4e-16370.60Exostosin family protein OS=Theobroma cacao OX=3641 GN=TCM_038164 PE=4 SV=1[more]
The following terms have been associated with this mRNA:
Vocabulary: Biological Process
TermDefinition
GO:0006486protein glycosylation
Vocabulary: Molecular Function
TermDefinition
GO:0016757transferase activity, transferring glycosyl groups
Vocabulary: INTERPRO
TermDefinition
IPR004263Exostosin
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006486 protein glycosylation
cellular_component GO:0005575 cellular_component
molecular_function GO:0016757 transferase activity, transferring glycosyl groups

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CsGy1G017140CsGy1G017140gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CsGy1G017140.1CsGy1G017140.1-proteinpolypeptide


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CsGy1G017140.1.CDS.3CsGy1G017140.1.CDS.3CDS
CsGy1G017140.1.CDS.2CsGy1G017140.1.CDS.2CDS
CsGy1G017140.1.CDS.1CsGy1G017140.1.CDS.1CDS


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CsGy1G017140.1.exon.3CsGy1G017140.1.exon.3exon
CsGy1G017140.1.exon.2CsGy1G017140.1.exon.2exon
CsGy1G017140.1.exon.1CsGy1G017140.1.exon.1exon


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR004263Exostosin-likePFAMPF03016Exostosincoord: 51..333
e-value: 5.6E-55
score: 186.6
NoneNo IPR availablePANTHERPTHR11062:SF106SUBFAMILY NOT NAMEDcoord: 2..384
NoneNo IPR availablePANTHERPTHR11062EXOSTOSIN HEPARAN SULFATE GLYCOSYLTRANSFERASE -RELATEDcoord: 2..384