CsGy6G035950 (gene) Cucumber (Gy14) v2

Name: CsGy6G035950
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v2)
Description: Putative beta-1,3-galactosyltransferase 20-like protein
Location: Chr6 : 31619242 .. 31624997 (+)
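
As a rough illustration, the location string of this record can be parsed to recover the chromosome, strand, and span of the feature. The helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Parse a string such as 'Chr6 : 31619242 .. 31624997 (+)'.

    Returns (chromosome, start, end, strand). Hypothetical helper,
    written only for the location format shown in this record.
    """
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Chr6 : 31619242 .. 31624997 (+)")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

Under a 0-based, half-open convention the span would instead be `end - start`, so the convention should be checked before comparing this number with the length of the gene sequence.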
   



The following sequences are available for this feature:

Gene sequence (with intron)

AAAAGTTTAGTGGAAGTTATTTTACAATTCAGTCCCTCAAAATGTTCTTATTTTCGTTTCATGGGTTACTTCAATCTAACATGATAGCTTCGTTCTCGGTCCTTTCCGCTTCGTTGATGTTTATGGGACCTTCCATTTGTGTATTCAATCTAAATTTCAGGTCACATTTCTCCACCCAGACTGTTTTTCCCGCATTTACTTGATCTTCGGTGATTCCTTGGTGTTGCCCAGCCCTGTTGATGGGTTTCATTTTTTTTCTTGTTTTCTCGATCACCATTTCCTACATAGGGTTTTTTTCGGGATAAGATTCAACCAAGATCCCTAAAAGAAACGACGAACCCCGGTGATTTTGGGGTTCTACAACAGTTTTTCGTGTTTCCGATTACAATGTGTTTGTTTGTTTGTTATATCTGAAATTTGGTTGTTCCTAAACCCTGTATTTCACATCCATTCGATCTTTTGGTTATTGGGGTATTTGGGTATGTGTATGAGATTTTTGTTGCCATATTGCATCGTGTTGATCCATGTTCGGATGTTGGGGATGAAGGCTTGGCGACTCGGATGAATTTGTATGTGTAGTTTTATGGTTTCTGTAGAGTACGTTTGAGGCTGTTGTTAGGCAATTTTATTTTTATTTGTAGATGAAGAAGGTTAAAACCGAACCTCCGGTTGCGAGGAGACTCAGGTTATCGCATCTTCTTCTCGTAATTGGAGTGTTGTATTTAGTTTTCATATCATTTAAGTTTCCACGTTTTTTGGAAATTGCTGCGACGTTGAGCGGGGATGAAAGTAATAATGGGTTGGATTCAAATGGAGTTGACAGTGAAGGAATGGATTTTAGCAAAGCGTCGTTGAGTTCTGTTTATAAGGATACATTTCATCGGAAACTGGAAGATAATCAGCATTTAGAAGCACCATTGACGCCTAAAAAAGAGCCACTCGAAGAGGTGAATAATGTTACTGGACCGATAAAGCCAATTAAGCATAAATATGGTCGGATAACCGGTAACATTTCGAGTCAGCTGAATCATACCAATGATTTTTCAATGCTTGAGACAATGGCAGATGAAGCTTGGACATTAGGCTCGATGGCTTGGGAAGAAGTAGATAAATTTGGGTTGAATGAGACTTCTGAAAGTTCTATACTCGAGGGAAAACCTGAGTCATGTCCTTCATGGATATCTACTGATGGGAAAAAGTTAATGGAGGGAGATGGACTCATGTTCCTTCCTTGTGGACTTGCTGCAGGTTCATCTATTACAATAATTGGAACCCCTCATCTTGCTCATCAGGAGTACGTGCCCCAACTTTTGAAGGTGGGAGGTGATCCTAAGGTCATGGTTTCACAGTTTATGGTTGAATTGCAAGGATTGAAATCGGTCGATGGTGAGGACCCACCAAAGATCCTTCACTTGAATCCACGGCTGAAAGGTGATTGGAGTAAACGGCCGGTCATTGAACACAATACATGTTATAGGATGCAGTGGGGAACGGCTCAAAGGTGTGATGGTTTGCCATCAAGTAGTGAGGACGAAATGCTTGGTGAGTTATCCCCTCTGTTCCTATTTTTCTCGTATCTTTCATTACCAGCACTCCAAACATAAATTTTAATCCTTCATGCATTGAAAAAAAATTTATTCAATGTGGATGCTTCTCCTAATAATTGGACATAGAGTTGAAGCATAATTTTTACCTTTTCTCTTACACTCATTATTATAAGACTGCTAGTCACATCGTTCCAACCACTCAAATCATTACTTTACCTCTTCATGTGCAGACTTAGTAGACTATTCCTGTTTTAAGAAAGCTATAGCAGAGCGGATGAAAGTTTATAACATTCTCCTACAGTTCTACTATAATTATTCTTTTGTCACTTTGTTTTAAACAAATAAAAAGGAAAACAATACCATTATGGTTAAGTTGTGTCTTTCTTTCCCTTCCAACACCTCAATTATTTCATTCTTCTAGGTGTGCCGTAGTACCAATGCCGTCTGACATTT
TGTTCCTTTCAATTTTTTTATTTGACAGTTGATGGAAATCATCGATGCGAAAAATGGTTGAGAAGTGATGTTACAGATTCGAAAGAATCGAAAACAACCTCATGGTTCAGGAGATTCATAGGGAGGGAGCAAAAGCCAGAAGTGACTTGGCCATTTCCCTTTATGGAGGGCAGATTGTTTATCTTAACACTTCGTGCTGGTGTTGATGGATACCATATAAATGTTGGTGGTCGGCATTTGACTTCTTTTGCCTATCGCCCTGTAAGTATATGGAGAAGTGGAGGATGTTGATCACTATTTCACTTTGGTTGATGTTTGTCTTCTGCTGCAGTTGATATTGATCATCCGTCCCACGTTTTAATATTAAATTTCTTATATCTTTGTTTGAATTTCATATAGGGATTTACGCTTGAAGATGCAACTGGATTAGCAGTTAAAGGAGATGTGGACATTCATTCTACATATGCTACAGCTCTTCCTACGTCTCATCCAAGCTTCTCTCCTCAACGAGTTCTTGAAATGTCAGAGAAATGGAAATCTCAGCCTTTGCCAAAGAGTTCCGTTTTTCTTTTTATTGGTGTTCTGTCTGCTACTAATCATTTTGCGGAGCGTATGGCCGTTAGGAAAACTTGGATGCAGTCTTCAGCTGTCATGTCATCAAATGTAGTTGTTCGCTTCTTTGTTGCACTGGTATGACTCTTGTGTTCATCTCTCCAAGGTATATTCATAGAAGTAGAGTGAAGAATAGTAGTTGTTGACTTGTTGTAATTGTTGATGTTTCTCATGCATACAGAAGATGAAGAATCCCGTAACCTTGATTAATGTATTCTGAAAATGTTCTATTTGGATGCGAACAAAATATGTGCTACGTTCCACTGACAATGGAATATGTTGACACATAAACGGGAAAACAACAAGAGAGCATGCAATGGTTAGCTATGCTGGTTCGCATTTTAAATAAATAAGAATCTAAGAACTAATGCAATGTTGTATGACAATCATACAGCTGTACTCGTATCAAGTCATATGATGCAGATCTGAAACAGTGATTCAATCCATAATTATCTCTTTTCCTTGATTTCCTATATATTTAATGAGCTGAAGACTTTTTTTTTCTGATTGAAAACAAACAAACCTCGTATTCATCAAAAGAAAGGAAACCACTGCCAAAAGACCTTATAGACGAGGGATCCCTTATTTGAAATGGAACTGTTTTCTCTTTCCTTCCAACAATGTTTTTGTACCCCTGGAATGATTCACTGTAACCAGTTTATTTATAGACTTACTTATTTCATTCCAATTGGTTCTATTTTTGGTATTTTGGTTCACTTCAAATCCATTCTCCTTTGGTATATGCAGTTATTGGCTGTTTATAAAACTAAAAGTCAAATTATATTTATTATAGAAATCTGATCAATTATATAAAACCATACCGAATTGAACTTTAAAGGCCCGTTGAAGCCTGTGAACGATGCCATTCTTGCCAAAAGTGATGTTGATTAGGAGGTGGCAGCCGCATATGATAATAAATGTAAAAGCACTATTGAGGAACTTAGTAGTTTCCTTACTGGGAAACCCTGAGAGGCTTTCCGTATATCTTTTATTTGGCTATGGAAGTATTTTCTTGGCTGCTATCCTTCTATTTGGCTAACAAAGGAGAATTTAGATTTTAGTGAAAGTCTTGGAGAAAGAAGACCTTTATATGTTTGCAGACTTTGTTATTTCCAGAGGGGGAAATACCTCATGAAATTCCAATGTAGCATATAGTTCTATCTTGGTTATCGACATGTAAGCTGAGTCACTACCACTGTAAGGGAAGACTGATTCGTGGTCAGTCCTCCCATCCACTTTGGCACTCGGGATTTCTTAGATCTTAATTTCATTTATGTTCTTCCGTGTTTATTTGATTATTTCCACGGTGGGTTTTTGTATTTCTTCTCATGTATAATCTCACAATGTAGAATCCGAGGAAGGAGGTCAATGCTGTGCTGAAAAAGGAA
GCTGCATATTTCGGTGATATTGTGATCCTGCCCTTCATGGACCGCTATGAGCTTGTCGTTCTCAAGACTATTGCCATATGTGAGTTTGGGGTGAGTTTGCGTTTCCCCTCTTCTGTTATCACTGTTTGGGATCTTCTTCATAGAACTTTGTACTATAAACTGTGAATAGCTTATGACCATAGTGTATGCTGTTTTTCAATCTCCCCTTGCAGGTAGTGAACTTGACAGCTTCATATATTATGAAATGTGACGACGATACCTTTGTGAGGGTGGAAACTGTTTTAAAACAGATCGAAGGCATTTCATCCAAGAAGTCCCTATACATGGGCAATCTCAACCTCTTGCATCGCCCTCTCAGACATGGAAAATGGGCAGTCACATATGAGGTAAATTATATACGTTTTAGCGTATAAAAGTTGTATACTCTTTGCTTTTATCCCACTTTGGTTACAACGGTAATCCACCTATTTTACTGGATAACTTTCATTCATCAATTAACAATGTTTAGCACAACATTGCTGCTTGTAAATATAATTGCCAATTGATATTATTACTTATATGCCCTTAACAAACTTGATTCTGTGTTTTTTTTATATTATTATTTTATAATTGCTATGAACTTGATTATCTATCCCAATCTTAAATACCCTCCCCTTCTCAAAAAAAAAAAAAAAAAAATTGCAATCTCAAAGAATTCCTTTCCATTCCTATGTCTCGTCACTAAACTACAGAAGTGGGCCCACCTAACTCCTTTCCTTTATTGTACATTTATTATTGGTGGCTTATCCCTTTGTTTCAATCTATTAAACAAAAAGCTTTGAACAATCCATTTTGTTTCCTTTCTTTACACTAACGATTTGTTTCCATGTACAAATATCAGGAATGGCCAGAAGAAGTCTATCCTCCATATGCCAATGGGCCAGGATATATCGTTTCCATTGACATTGCTAAATACATTGTCTCTCAACATGAAAACAAGAGCTTGAGGGTTAGTTTGAGAACATTAATCTTATTGTTTCTGTTTGTCTGTACCCCTCTTTTGTGTAATGATAGGATGTTTTGGTATGATGGCAGATATTCAAGATGGAGGATGTGAGCATGGGAATGTGGGTTGAACAGTTCAACAGTACTGTGGCGACAGTTCAATACTCTCACAACTGGAAATTTTGCCAATATGGATGTATGGAAGACTATTTTACAGCACACTATCAATCTCCAAGACAGATACTCTGCTTGTGGGATAAATTGGCCAGAGGACACGCTCATTGTTGCAACTTCAGGTAACAATTTTCCCCACTCTGTATTTCTAGCCATGTTTATTATTCAGTGTAGGCATCATGAATTCAAACTATGATGCATTATAATTGTTAATTTTTATTTTTTAAAAAAAGAAAATCAAGATCAGTTTTCCATGTATAGAATTATGATGTGGAATGGGCATTTTGATATGAAGTTAGCTTGATTCCTCCATTTATTATTTTACTTTTGCCTCGAGTGTTTCTTAATGGATGGATAGTCTACCTAACTCTTTTTTTCATAATTATTTTCTGAATACAAGCAAGTTTCTCAGGCCATTTAGGCCCAAAAATGGAAGTTTATAACTTTATTAGACAAAATAAACACGAATATTAAAATTAACTTTATAATAAAAGGACAAAAATGGAACTTGTTTCTATCTCTACATTGAACTATTTTCGTATGAGTCTAATAATGAGTTGCTTTGCTTGTACCGAAAAGAAAAAAAAAATCAATTCAA

mRNA sequence

AAAAGTTTAGTGGAAGTTATTTTACAATTCAGTCCCTCAAAATGTTCTTATTTTCGTTTCATGGGTTACTTCAATCTAACATGATAGCTTCGTTCTCGGTCCTTTCCGCTTCGTTGATGTTTATGGGACCTTCCATTTGTGTATTCAATCTAAATTTCAGGTCACATTTCTCCACCCAGACTGTTTTTCCCGCATTTACTTGATCTTCGGTGATTCCTTGGTGTTGCCCAGCCCTGTTGATGGGTTTCATTTTTTTTCTTGTTTTCTCGATCACCATTTCCTACATAGGGTTTTTTTCGGGATAAGATTCAACCAAGATCCCTAAAAGAAACGACGAACCCCGGTGATTTTGGGGTTCTACAACAGTTTTTCGTGTTTCCGATTACAATGTGTTTGTTTGTTTGTTATATCTGAAATTTGGTTGTTCCTAAACCCTGTATTTCACATCCATTCGATCTTTTGGTTATTGGGGTATTTGGGTATGTGTATGAGATTTTTGTTGCCATATTGCATCGTGTTGATCCATGTTCGGATGTTGGGGATGAAGGCTTGGCGACTCGGATGAATTTGTATGTGTAGTTTTATGGTTTCTGTAGAGTACGTTTGAGGCTGTTGTTAGGCAATTTTATTTTTATTTGTAGATGAAGAAGGTTAAAACCGAACCTCCGGTTGCGAGGAGACTCAGGTTATCGCATCTTCTTCTCGTAATTGGAGTGTTGTATTTAGTTTTCATATCATTTAAGTTTCCACGTTTTTTGGAAATTGCTGCGACGTTGAGCGGGGATGAAAGTAATAATGGGTTGGATTCAAATGGAGTTGACAGTGAAGGAATGGATTTTAGCAAAGCGTCGTTGAGTTCTGTTTATAAGGATACATTTCATCGGAAACTGGAAGATAATCAGCATTTAGAAGCACCATTGACGCCTAAAAAAGAGCCACTCGAAGAGGTGAATAATGTTACTGGACCGATAAAGCCAATTAAGCATAAATATGGTCGGATAACCGGTAACATTTCGAGTCAGCTGAATCATACCAATGATTTTTCAATGCTTGAGACAATGGCAGATGAAGCTTGGACATTAGGCTCGATGGCTTGGGAAGAAGTAGATAAATTTGGGTTGAATGAGACTTCTGAAAGTTCTATACTCGAGGGAAAACCTGAGTCATGTCCTTCATGGATATCTACTGATGGGAAAAAGTTAATGGAGGGAGATGGACTCATGTTCCTTCCTTGTGGACTTGCTGCAGGTTCATCTATTACAATAATTGGAACCCCTCATCTTGCTCATCAGGAGTACGTGCCCCAACTTTTGAAGGTGGGAGGTGATCCTAAGGTCATGGTTTCACAGTTTATGGTTGAATTGCAAGGATTGAAATCGGTCGATGGTGAGGACCCACCAAAGATCCTTCACTTGAATCCACGGCTGAAAGGTGATTGGAGTAAACGGCCGGTCATTGAACACAATACATGTTATAGGATGCAGTGGGGAACGGCTCAAAGGTGTGATGGTTTGCCATCAAGTAGTGAGGACGAAATGCTTGTTGATGGAAATCATCGATGCGAAAAATGGTTGAGAAGTGATGTTACAGATTCGAAAGAATCGAAAACAACCTCATGGTTCAGGAGATTCATAGGGAGGGAGCAAAAGCCAGAAGTGACTTGGCCATTTCCCTTTATGGAGGGCAGATTGTTTATCTTAACACTTCGTGCTGGTGTTGATGGATACCATATAAATGTTGGTGGTCGGCATTTGACTTCTTTTGCCTATCGCCCTGGATTTACGCTTGAAGATGCAACTGGATTAGCAGTTAAAGGAGATGTGGACATTCATTCTACATATGCTACAGCTCTTCCTACGTCTCATCCAAGCTTCTCTCCTCAACGAGTTCTTGAAATGTCAGAGAAATGGAAATCTCAGCCTTTGCCAAAGAGTTCCGTTTTTCTTTTTATTGGTGTTCTGTCTGCTACTAATCATTTTGCGGAGCGTATGGCCGTTAGG
AAAACTTGGATGCAGTCTTCAGCTGTCATGTCATCAAATGTAGTTGTTCGCTTCTTTGTTGCACTGAATCCGAGGAAGGAGGTCAATGCTGTGCTGAAAAAGGAAGCTGCATATTTCGGTGATATTGTGATCCTGCCCTTCATGGACCGCTATGAGCTTGTCGTTCTCAAGACTATTGCCATATGTGAGTTTGGGGTAGTGAACTTGACAGCTTCATATATTATGAAATGTGACGACGATACCTTTGTGAGGGTGGAAACTGTTTTAAAACAGATCGAAGGCATTTCATCCAAGAAGTCCCTATACATGGGCAATCTCAACCTCTTGCATCGCCCTCTCAGACATGGAAAATGGGCAGTCACATATGAGGAATGGCCAGAAGAAGTCTATCCTCCATATGCCAATGGGCCAGGATATATCGTTTCCATTGACATTGCTAAATACATTGTCTCTCAACATGAAAACAAGAGCTTGAGGATATTCAAGATGGAGGATGTGAGCATGGGAATGTGGGTTGAACAGTTCAACAGTACTGTGGCGACAGTTCAATACTCTCACAACTGGAAATTTTGCCAATATGGATGTATGGAAGACTATTTTACAGCACACTATCAATCTCCAAGACAGATACTCTGCTTGTGGGATAAATTGGCCAGAGGACACGCTCATTGTTGCAACTTCAGGTAACAATTTTCCCCACTCTGTATTTCTAGCCATGTTTATTATTCAGTGTAGGCATCATGAATTCAAACTATGATGCATTATAATTGTTAATTTTTATTTTTTAAAAAAAGAAAATCAAGATCAGTTTTCCATGTATAGAATTATGATGTGGAATGGGCATTTTGATATGAAGTTAGCTTGATTCCTCCATTTATTATTTTACTTTTGCCTCGAGTGTTTCTTAATGGATGGATAGTCTACCTAACTCTTTTTTTCATAATTATTTTCTGAATACAAGCAAGTTTCTCAGGCCATTTAGGCCCAAAAATGGAAGTTTATAACTTTATTAGACAAAATAAACACGAATATTAAAATTAACTTTATAATAAAAGGACAAAAATGGAACTTGTTTCTATCTCTACATTGAACTATTTTCGTATGAGTCTAATAATGAGTTGCTTTGCTTGTACCGAAAAGAAAAAAAAAATCAATTCAA

Coding sequence (CDS)

ATGAAGAAGGTTAAAACCGAACCTCCGGTTGCGAGGAGACTCAGGTTATCGCATCTTCTTCTCGTAATTGGAGTGTTGTATTTAGTTTTCATATCATTTAAGTTTCCACGTTTTTTGGAAATTGCTGCGACGTTGAGCGGGGATGAAAGTAATAATGGGTTGGATTCAAATGGAGTTGACAGTGAAGGAATGGATTTTAGCAAAGCGTCGTTGAGTTCTGTTTATAAGGATACATTTCATCGGAAACTGGAAGATAATCAGCATTTAGAAGCACCATTGACGCCTAAAAAAGAGCCACTCGAAGAGGTGAATAATGTTACTGGACCGATAAAGCCAATTAAGCATAAATATGGTCGGATAACCGGTAACATTTCGAGTCAGCTGAATCATACCAATGATTTTTCAATGCTTGAGACAATGGCAGATGAAGCTTGGACATTAGGCTCGATGGCTTGGGAAGAAGTAGATAAATTTGGGTTGAATGAGACTTCTGAAAGTTCTATACTCGAGGGAAAACCTGAGTCATGTCCTTCATGGATATCTACTGATGGGAAAAAGTTAATGGAGGGAGATGGACTCATGTTCCTTCCTTGTGGACTTGCTGCAGGTTCATCTATTACAATAATTGGAACCCCTCATCTTGCTCATCAGGAGTACGTGCCCCAACTTTTGAAGGTGGGAGGTGATCCTAAGGTCATGGTTTCACAGTTTATGGTTGAATTGCAAGGATTGAAATCGGTCGATGGTGAGGACCCACCAAAGATCCTTCACTTGAATCCACGGCTGAAAGGTGATTGGAGTAAACGGCCGGTCATTGAACACAATACATGTTATAGGATGCAGTGGGGAACGGCTCAAAGGTGTGATGGTTTGCCATCAAGTAGTGAGGACGAAATGCTTGTTGATGGAAATCATCGATGCGAAAAATGGTTGAGAAGTGATGTTACAGATTCGAAAGAATCGAAAACAACCTCATGGTTCAGGAGATTCATAGGGAGGGAGCAAAAGCCAGAAGTGACTTGGCCATTTCCCTTTATGGAGGGCAGATTGTTTATCTTAACACTTCGTGCTGGTGTTGATGGATACCATATAAATGTTGGTGGTCGGCATTTGACTTCTTTTGCCTATCGCCCTGGATTTACGCTTGAAGATGCAACTGGATTAGCAGTTAAAGGAGATGTGGACATTCATTCTACATATGCTACAGCTCTTCCTACGTCTCATCCAAGCTTCTCTCCTCAACGAGTTCTTGAAATGTCAGAGAAATGGAAATCTCAGCCTTTGCCAAAGAGTTCCGTTTTTCTTTTTATTGGTGTTCTGTCTGCTACTAATCATTTTGCGGAGCGTATGGCCGTTAGGAAAACTTGGATGCAGTCTTCAGCTGTCATGTCATCAAATGTAGTTGTTCGCTTCTTTGTTGCACTGAATCCGAGGAAGGAGGTCAATGCTGTGCTGAAAAAGGAAGCTGCATATTTCGGTGATATTGTGATCCTGCCCTTCATGGACCGCTATGAGCTTGTCGTTCTCAAGACTATTGCCATATGTGAGTTTGGGGTAGTGAACTTGACAGCTTCATATATTATGAAATGTGACGACGATACCTTTGTGAGGGTGGAAACTGTTTTAAAACAGATCGAAGGCATTTCATCCAAGAAGTCCCTATACATGGGCAATCTCAACCTCTTGCATCGCCCTCTCAGACATGGAAAATGGGCAGTCACATATGAGGAATGGCCAGAAGAAGTCTATCCTCCATATGCCAATGGGCCAGGATATATCGTTTCCATTGACATTGCTAAATACATTGTCTCTCAACATGAAAACAAGAGCTTGAGGATATTCAAGATGGAGGATGTGAGCATGGGAATGTGGGTTGAACAGTTCAACAGTACTGTGGCGACAGTTCAATACTCTCACAACTGGAAATTTTGCCAATATGGATGTATGGAAGACTATTTTACAGCACACTATCAATCTCCAAGACAGATACTCTGCTTGTG
GGATAAATTGGCCAGAGGACACGCTCATTGTTGCAACTTCAGGTAA

Protein sequence

MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGLDSNGVDSEGMDFSKASLSSVYKDTFHRKLEDNQHLEAPLTPKKEPLEEVNNVTGPIKPIKHKYGRITGNISSQLNHTNDFSMLETMADEAWTLGSMAWEEVDKFGLNETSESSILEGKPESCPSWISTDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPKVMVSQFMVELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSEDEMLVDGNHRCEKWLRSDVTDSKESKTTSWFRRFIGREQKPEVTWPFPFMEGRLFILTLRAGVDGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATALPTSHPSFSPQRVLEMSEKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVMSSNVVVRFFVALNPRKEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFVRVETVLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIAKYIVSQHENKSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQSPRQILCLWDKLARGHAHCCNFR

BLAST of CsGy6G035950 vs. NCBI nr
Match: XP_011658301.1 (PREDICTED: probable beta-1,3-galactosyltransferase 20 [Cucumis sativus] >KGN49434.1 hypothetical protein Csa_6G524710 [Cucumis sativus])

HSP 1 Score: 1363.6 bits (3528), Expect = 0.0e+00
Identity = 681/681 (100.00%), Positives = 681/681 (100.00%), Query Frame = 0

Query: 1   MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGLXXXXXX 60
           MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGLXXXXXX
Sbjct: 1   MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGLXXXXXX 60

Query: 61  XXXXXXXXXXLSSVYKDTFHRKLEDNQHLEAPLTPKKEPLEEVNNVTGPIKPIKHKYGRI 120
           XXXXXXXXXXLSSVYKDTFHRKLEDNQHLEAPLTPKKEPLEEVNNVTGPIKPIKHKYGRI
Sbjct: 61  XXXXXXXXXXLSSVYKDTFHRKLEDNQHLEAPLTPKKEPLEEVNNVTGPIKPIKHKYGRI 120

Query: 121 TGNISSQLNHTNDFSMLETMADEAWTLGSMAWEEVDKFGLNETSESSILEGKPESCPSWI 180
           TGNISSQLNHTNDFSMLETMADEAWTLGSMAWEEVDKFGLNETSESSILEGKPESCPSWI
Sbjct: 121 TGNISSQLNHTNDFSMLETMADEAWTLGSMAWEEVDKFGLNETSESSILEGKPESCPSWI 180

Query: 181 STDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPKVMVSQFMVE 240
           STDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPKVMVSQFMVE
Sbjct: 181 STDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPKVMVSQFMVE 240

Query: 241 LQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSEDEML 300
           LQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSEDEML
Sbjct: 241 LQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSEDEML 300

Query: 301 VDGNHRCEKWLRSDVTDSKESKTTSWFRRFIGREQKPEVTWPFPFMEGRLFILTLRAGVD 360
           VDGNHRCEKWLRSDVTDSKESKTTSWFRRFIGREQKPEVTWPFPFMEGRLFILTLRAGVD
Sbjct: 301 VDGNHRCEKWLRSDVTDSKESKTTSWFRRFIGREQKPEVTWPFPFMEGRLFILTLRAGVD 360

Query: 361 GYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATALPTSHPSFSPQRVLEMS 420
           GYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATALPTSHPSFSPQRVLEMS
Sbjct: 361 GYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATALPTSHPSFSPQRVLEMS 420

Query: 421 EKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVMSSNVVVRFFVALNPRKE 480
           EKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVMSSNVVVRFFVALNPRKE
Sbjct: 421 EKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVMSSNVVVRFFVALNPRKE 480

Query: 481 VNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFVRVET 540
           VNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFVRVET
Sbjct: 481 VNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFVRVET 540

Query: 541 VLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIAK 600
           VLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIAK
Sbjct: 541 VLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIAK 600

Query: 601 YIVSQHENKSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQSP 660
           YIVSQHENKSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQSP
Sbjct: 601 YIVSQHENKSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQSP 660

Query: 661 RQILCLWDKLARGHAHCCNFR 682
           RQILCLWDKLARGHAHCCNFR
Sbjct: 661 RQILCLWDKLARGHAHCCNFR 681
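
The HSP summary lines in these reports give raw counts next to the percentages, so the percentages can be recomputed as a sanity check. The sketch below is a minimal example under that assumption; `hsp_percentages` is a hypothetical helper, and the pattern is written only for the summary-line layout seen in this record (it also accepts the "Postives" spelling that some reports use).

```python
import re

# Matches e.g. "Identity = 665/681 (97.65%), Positives = 675/681 (99.12%)".
# "Posi?tives" tolerates the misspelling "Postives" found in some reports.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)

def hsp_percentages(line):
    """Return (percent identity, percent positives) recomputed from the counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return (round(100 * ident / ident_len, 2),
            round(100 * pos / pos_len, 2))

line = "Identity = 665/681 (97.65%), Positives = 675/681 (99.12%), Query Frame = 0"
print(hsp_percentages(line))
```

Both percentages use the alignment length as the denominator, which is why hits with gaps can report a length different from the query length.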

BLAST of CsGy6G035950 vs. NCBI nr
Match: XP_008439584.1 (PREDICTED: hydroxyproline O-galactosyltransferase GALT2 [Cucumis melo])

HSP 1 Score: 1337.8 bits (3461), Expect = 0.0e+00
Identity = 665/681 (97.65%), Positives = 675/681 (99.12%), Query Frame = 0

Query: 1   MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGLXXXXXX 60
           MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGLXXXXXX
Sbjct: 1   MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGLXXXXXX 60

Query: 61  XXXXXXXXXXLSSVYKDTFHRKLEDNQHLEAPLTPKKEPLEEVNNVTGPIKPIKHKYGRI 120
           XXXXXXXXXXLSSVYKDTFHRKLEDN+HLEAPLTPKKEPLEEVNNVTGPIKPI+HKYGRI
Sbjct: 61  XXXXXXXXXXLSSVYKDTFHRKLEDNEHLEAPLTPKKEPLEEVNNVTGPIKPIQHKYGRI 120

Query: 121 TGNISSQLNHTNDFSMLETMADEAWTLGSMAWEEVDKFGLNETSESSILEGKPESCPSWI 180
           TGNISS LNHTNDFSMLE MADEAWTLG MAWEE+DKFGLNET+ESSILEGKPESCPSWI
Sbjct: 121 TGNISSLLNHTNDFSMLEKMADEAWTLGLMAWEEIDKFGLNETAESSILEGKPESCPSWI 180

Query: 181 STDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPKVMVSQFMVE 240
           STDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDP VMVSQFMVE
Sbjct: 181 STDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPNVMVSQFMVE 240

Query: 241 LQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSEDEML 300
           LQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSS+DEML
Sbjct: 241 LQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEML 300

Query: 301 VDGNHRCEKWLRSDVTDSKESKTTSWFRRFIGREQKPEVTWPFPFMEGRLFILTLRAGVD 360
           VDGN RCEKWLRSDVTD+KESKTTSWF+RFIGREQKPEVTWPFPFMEGRLFILTLRAGVD
Sbjct: 301 VDGNRRCEKWLRSDVTDTKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGVD 360

Query: 361 GYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATALPTSHPSFSPQRVLEMS 420
           GYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYAT+LPTSHPSFSPQRVLEMS
Sbjct: 361 GYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPQRVLEMS 420

Query: 421 EKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVMSSNVVVRFFVALNPRKE 480
           EKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAV SSNVVVRFFVALNPRKE
Sbjct: 421 EKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVALNPRKE 480

Query: 481 VNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFVRVET 540
           VNAVLK+EAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFVRVET
Sbjct: 481 VNAVLKREAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFVRVET 540

Query: 541 VLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIAK 600
           VLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIAK
Sbjct: 541 VLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIAK 600

Query: 601 YIVSQHENKSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQSP 660
           YIVSQHEN+SLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQSP
Sbjct: 601 YIVSQHENRSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQSP 660

Query: 661 RQILCLWDKLARGHAHCCNFR 682
           RQILCLWDKLARGHAHCCNFR
Sbjct: 661 RQILCLWDKLARGHAHCCNFR 681

BLAST of CsGy6G035950 vs. NCBI nr
Match: XP_022146632.1 (hydroxyproline O-galactosyltransferase GALT2 [Momordica charantia])

HSP 1 Score: 1263.4 bits (3268), Expect = 0.0e+00
Identity = 609/682 (89.30%), Positives = 638/682 (93.55%), Query Frame = 0

Query: 1   MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGL-XXXXX 60
           MKK+KTEPP ARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESN GL      
Sbjct: 1   MKKLKTEPPGARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNVGLDGTIGG 60

Query: 61  XXXXXXXXXXXLSSVYKDTFHRKLEDNQHLEAPLTPKKEPLEEVNNVTGPIKPIKHKYGR 120
                      LSSVYKDTFHRKLEDNQ+ EAP+TPKKEPLE+VNNV+GPIKPIKHKYGR
Sbjct: 61  DSEGVDFSRASLSSVYKDTFHRKLEDNQNREAPVTPKKEPLEDVNNVSGPIKPIKHKYGR 120

Query: 121 ITGNISSQLNHTNDFSMLETMADEAWTLGSMAWEEVDKFGLNETSESSILEGKPESCPSW 180
           ITG I S  NHTNDFSMLE +ADEAWTLG  AWEEVDKFGL+ET+ESSILEGKPE+CPSW
Sbjct: 121 ITGKILSHQNHTNDFSMLEKLADEAWTLGLKAWEEVDKFGLDETAESSILEGKPETCPSW 180

Query: 181 ISTDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPKVMVSQFMV 240
           ISTDGKK++EGDG+MFLPCGLAAGSSITIIGTPH AHQEYVPQLLKVG DP VMVSQFMV
Sbjct: 181 ISTDGKKMLEGDGIMFLPCGLAAGSSITIIGTPHHAHQEYVPQLLKVGADPMVMVSQFMV 240

Query: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSEDEM 300
           ELQGLK+VDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDG+PSS++DEM
Sbjct: 241 ELQGLKAVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGVPSSNDDEM 300

Query: 301 LVDGNHRCEKWLRSDVTDSKESKTTSWFRRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 360
           LVDGN RCEKW+RSD+ DSKESKTTSWF+RFIGREQKPEVTWPFPFMEGRLFILTLRAGV
Sbjct: 301 LVDGNRRCEKWVRSDIIDSKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 360

Query: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATALPTSHPSFSPQRVLEM 420
           DGYHINVGGRHLTSF YR GFTLEDATGLAVKGDVDIHS YAT+LPT+HPSFSPQRVLEM
Sbjct: 361 DGYHINVGGRHLTSFPYRLGFTLEDATGLAVKGDVDIHSAYATSLPTTHPSFSPQRVLEM 420

Query: 421 SEKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVMSSNVVVRFFVALNPRK 480
           SEKWKSQPLPKSSV LFIGVLSATNHFAERMAVRKTWMQSSAV +SNVVVRFFVALNPRK
Sbjct: 421 SEKWKSQPLPKSSVRLFIGVLSATNHFAERMAVRKTWMQSSAVKASNVVVRFFVALNPRK 480

Query: 481 EVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFVRVE 540
           EVNAVLKKEA YFGDIVILPFMDRYELVVLKTIAICEFG VNLTASY+MKCDDDTFVRV+
Sbjct: 481 EVNAVLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGTVNLTASYVMKCDDDTFVRVD 540

Query: 541 TVLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIA 600
           TVLKQI G+SSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYI+S DIA
Sbjct: 541 TVLKQISGVSSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIISSDIA 600

Query: 601 KYIVSQHENKSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQS 660
           KYIVSQHEN+SLRIFKMEDVSMGMWVEQFN TVA VQYSHNWKFCQYGCMEDYFTAHYQS
Sbjct: 601 KYIVSQHENRSLRIFKMEDVSMGMWVEQFNGTVAAVQYSHNWKFCQYGCMEDYFTAHYQS 660

Query: 661 PRQILCLWDKLARGHAHCCNFR 682
           PRQI+CLWDKL +GHAHCCNFR
Sbjct: 661 PRQIICLWDKLGQGHAHCCNFR 682

BLAST of CsGy6G035950 vs. NCBI nr
Match: XP_023517655.1 (hydroxyproline O-galactosyltransferase GALT2-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1261.9 bits (3264), Expect = 0.0e+00
Identity = 606/682 (88.86%), Positives = 638/682 (93.55%), Query Frame = 0

Query: 1   MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGL-XXXXX 60
           MKK+KTEPPVARR RLSH LLVIG+LYLVFISFKFPRFLEIA TLSGDESN GL      
Sbjct: 1   MKKLKTEPPVARRFRLSHFLLVIGLLYLVFISFKFPRFLEIATTLSGDESNTGLDGAVGV 60

Query: 61  XXXXXXXXXXXLSSVYKDTFHRKLEDNQHLEAPLTPKKEPLEEVNNVTGPIKPIKHKYGR 120
                      L+SVYKDTFHRKLEDNQHLEAPL PK EPLEEVNNVTGPIKPI+HKYGR
Sbjct: 61  DGEGVDFSKPSLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGR 120

Query: 121 ITGNISSQLNHTNDFSMLETMADEAWTLGSMAWEEVDKFGLNETSESSILEGKPESCPSW 180
           ITG +S Q NHTNDFS+LE MADEAWTLG  AWEEVDKFGLNET+ESS+LEGKPESCPSW
Sbjct: 121 ITGKVSIQQNHTNDFSILERMADEAWTLGLKAWEEVDKFGLNETTESSMLEGKPESCPSW 180

Query: 181 ISTDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPKVMVSQFMV 240
           IST+GK+L+EGDG+MFLPCGLAAGSSITIIGTPH AH EYVPQLLK+GGDP V+VSQFMV
Sbjct: 181 ISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQFMV 240

Query: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSEDEM 300
           ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSS+DEM
Sbjct: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM 300

Query: 301 LVDGNHRCEKWLRSDVTDSKESKTTSWFRRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 360
           LVDGN RC+KWLRSD+ DSKE+KT+SWF+RFIGREQKPEVTWPFPF+EGRLFILTLRAGV
Sbjct: 301 LVDGNLRCDKWLRSDIVDSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGV 360

Query: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATALPTSHPSFSPQRVLEM 420
           DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYAT+LPTSHPSFSP RVLEM
Sbjct: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEM 420

Query: 421 SEKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVMSSNVVVRFFVALNPRK 480
           SEKWKSQPLPK SV+LFIGVLSATNHFAERMAVRKTWMQSSAV SSNVVVRFFVALNPR 
Sbjct: 421 SEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVALNPRN 480

Query: 481 EVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFVRVE 540
           EVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFG VNLTASY+MKCDDDTFVRVE
Sbjct: 481 EVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGAVNLTASYVMKCDDDTFVRVE 540

Query: 541 TVLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIA 600
           TV+KQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTY EWPEEVYPPYANGPGYI+SIDI 
Sbjct: 541 TVIKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDIV 600

Query: 601 KYIVSQHENKSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQS 660
           KYIV+QHE++SLRIFKMEDVSMGMWVEQFNSTVA VQYSH+WKFCQYGCMEDYFTAHYQS
Sbjct: 601 KYIVAQHESRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQS 660

Query: 661 PRQILCLWDKLARGHAHCCNFR 682
           PRQI+CLWDKL +G AHCCNFR
Sbjct: 661 PRQIICLWDKLTQGQAHCCNFR 682

BLAST of CsGy6G035950 vs. NCBI nr
Match: XP_022926607.1 (hydroxyproline O-galactosyltransferase GALT2-like [Cucurbita moschata])

HSP 1 Score: 1261.5 bits (3263), Expect = 0.0e+00
Identity = 607/682 (89.00%), Positives = 639/682 (93.70%), Query Frame = 0

Query: 1   MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGL-XXXXX 60
           MKK+KTEPPVARR RLSH LLVIG+LYLVFISFKFPRFL IA TLSGDESN GL      
Sbjct: 1   MKKLKTEPPVARRFRLSHFLLVIGLLYLVFISFKFPRFLGIATTLSGDESNIGLDGTVGV 60

Query: 61  XXXXXXXXXXXLSSVYKDTFHRKLEDNQHLEAPLTPKKEPLEEVNNVTGPIKPIKHKYGR 120
                      L+SVYKDTFHRKLEDNQHLEAPL PK EPLEEVNNVTGPIKPI+HKYGR
Sbjct: 61  DGEGVDFSKASLASVYKDTFHRKLEDNQHLEAPLMPKTEPLEEVNNVTGPIKPIQHKYGR 120

Query: 121 ITGNISSQLNHTNDFSMLETMADEAWTLGSMAWEEVDKFGLNETSESSILEGKPESCPSW 180
           ITG +S Q NHTNDFS+LE MADEAWTLGS AWEEVDKFGLNET+ESS+LEGKPESCPSW
Sbjct: 121 ITGKVSIQQNHTNDFSILERMADEAWTLGSKAWEEVDKFGLNETTESSMLEGKPESCPSW 180

Query: 181 ISTDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPKVMVSQFMV 240
           IST+GK+L+EGDG+MFLPCGLAAGSSITIIGTPH AH EYVPQLLK+GGDP V+VSQFMV
Sbjct: 181 ISTEGKELLEGDGIMFLPCGLAAGSSITIIGTPHHAHVEYVPQLLKLGGDPTVLVSQFMV 240

Query: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSEDEM 300
           ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSS+DEM
Sbjct: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM 300

Query: 301 LVDGNHRCEKWLRSDVTDSKESKTTSWFRRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 360
           LVDGN RC+KWLRSD+ DSKE+KT+SWF+RFIGREQKPEVTWPFPF+EGRLFILTLRAGV
Sbjct: 301 LVDGNLRCDKWLRSDIEDSKETKTSSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAGV 360

Query: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATALPTSHPSFSPQRVLEM 420
           DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYAT+LPTSHPSFSP RVLEM
Sbjct: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPHRVLEM 420

Query: 421 SEKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVMSSNVVVRFFVALNPRK 480
           SEKWKSQPLPK SV+LFIGVLSATNHFAERMAVRKTWMQSSAV SSNVVVRFFVALNPR 
Sbjct: 421 SEKWKSQPLPKKSVYLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVALNPRN 480

Query: 481 EVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFVRVE 540
           EVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFG VNLTASY+MKCDDDTFVRVE
Sbjct: 481 EVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGAVNLTASYVMKCDDDTFVRVE 540

Query: 541 TVLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIA 600
           TV+KQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTY EWPEEVYPPYANGPGYI+SIDIA
Sbjct: 541 TVIKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYLEWPEEVYPPYANGPGYIISIDIA 600

Query: 601 KYIVSQHENKSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQS 660
           KYIV+QHE++SLRIFKMEDVSMGMWVEQFNSTVA VQYSH+WKFCQYGCMEDYFTAHYQS
Sbjct: 601 KYIVAQHESRSLRIFKMEDVSMGMWVEQFNSTVAIVQYSHSWKFCQYGCMEDYFTAHYQS 660

Query: 661 PRQILCLWDKLARGHAHCCNFR 682
           PRQI+CLWDKL +G AHCCNFR
Sbjct: 661 PRQIICLWDKLTQGQAHCCNFR 682

BLAST of CsGy6G035950 vs. TAIR10
Match: AT4G21060.1 (Galactosyltransferase family protein)

HSP 1 Score: 910.2 bits (2351), Expect = 7.5e-265
Identity = 438/660 (66.36%), Positives = 521/660 (78.94%), Query Frame = 0

Query: 27  YLVFISFKFPRFLEIAATLSGDESNNGLXXXXXXXXXXXXXXXXLSSVYKDTFHRKLEDN 86
           YLVF++FKFP F+E+ A LSGD   +G                   S+  D  +RKLED 
Sbjct: 88  YLVFLAFKFPHFIEMVAMLSGDTGLDGALSDTSLDVSLS------GSLRNDMLNRKLEDE 147

Query: 87  QHLEAPLTPKKEPLEEVNNVTGPIKPIKHKYGRITGNISSQLNHTNDFSMLETMADEAWT 146
            H   P T +K   EE  N +  I+P+  +YGRI+G +  + N T   S  E MADEAW 
Sbjct: 148 DHQSGPSTTQKVSPEEKINGSKQIQPLLFRYGRISGEVMRRRNRTIHMSPFERMADEAWI 207

Query: 147 LGSMAWEEVDKFGLNETSES-SILEGKPESCPSWISTDGKKLMEGDGLMFLPCGLAAGSS 206
           LGS AWE+VDKF +++ +ES SI EGK ESCPS IS +G  L + + +M LPCGLAAGSS
Sbjct: 208 LGSKAWEDVDKFEVDKINESASIFEGKVESCPSQISMNGDDLNKANRIMLLPCGLAAGSS 267

Query: 207 ITIIGTPHLAHQEYVPQLLKVGGD-PKVMVSQFMVELQGLKSVDGEDPPKILHLNPRLKG 266
           ITI+GTP  AH+E VPQ  ++      V+VSQFMVELQGLK+ DGE PPKILHLNPR+KG
Sbjct: 268 ITILGTPQYAHKESVPQRSRLTRSYGMVLVSQFMVELQGLKTGDGEYPPKILHLNPRIKG 327

Query: 267 DWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSEDEMLVDGNHRCEKWLRSDV---TDSKES 326
           DW+ RPVIEHNTCYRMQWG AQRCDG PS  + ++LVDG  RCEKW ++D+    DSKES
Sbjct: 328 DWNHRPVIEHNTCYRMQWGVAQRCDGTPSKKDADVLVDGFRRCEKWTQNDIIDMVDSKES 387

Query: 327 KTTSWFRRFIGREQKPEVTWPFPFMEGRLFILTLRAGVDGYHINVGGRHLTSFAYRPGFT 386
           KTTSWF+RFIGREQKPEVTW FPF EG++F+LTLRAG+DG+HINVGGRH++SF YRPGFT
Sbjct: 388 KTTSWFKRFIGREQKPEVTWSFPFAEGKVFVLTLRAGIDGFHINVGGRHVSSFPYRPGFT 447

Query: 387 LEDATGLAVKGDVDIHSTYATALPTSHPSFSPQRVLEMSEKWKSQPLPKSSVFLFIGVLS 446
           +EDATGLAV GDVDIHS +AT+L TSHPSFSPQ+ +E S +WK+ PLP +   LF+GVLS
Sbjct: 448 IEDATGLAVTGDVDIHSIHATSLSTSHPSFSPQKAIEFSSEWKAPPLPGTPFRLFMGVLS 507

Query: 447 ATNHFAERMAVRKTWMQSSAVMSSNVVVRFFVALNPRKEVNAVLKKEAAYFGDIVILPFM 506
           ATNHF+ERMAVRKTWMQ  ++ SS+VV RFFVALNPRKEVNA+LKKEA YFGDIVILPFM
Sbjct: 508 ATNHFSERMAVRKTWMQHPSIKSSDVVARFFVALNPRKEVNAMLKKEAEYFGDIVILPFM 567

Query: 507 DRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFVRVETVLKQIEGISSKKSLYMGNLNL 566
           DRYELVVLKTIAICEFGV N+TA YIMKCDDDTF+RVE++LKQI+G+S +KSLYMGNLNL
Sbjct: 568 DRYELVVLKTIAICEFGVQNVTAPYIMKCDDDTFIRVESILKQIDGVSPEKSLYMGNLNL 627

Query: 567 LHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIAKYIVSQHENKSLRIFKMEDVSM 626
            HRPLR GKW VT+EEWPE VYPPYANGPGYI+S +IAKYIVSQ+    LR+FKMEDVSM
Sbjct: 628 RHRPLRTGKWTVTWEEWPEAVYPPYANGPGYIISSNIAKYIVSQNSRHKLRLFKMEDVSM 687

Query: 627 GMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQSPRQILCLWDKLARGHAHCCNFR 682
           G+WVEQFN+++  V+YSH+WKFCQYGC  +Y+TAHYQSP Q++CLWD L +G   CCNFR
Sbjct: 688 GLWVEQFNASMQPVEYSHSWKFCQYGCTLNYYTAHYQSPSQMMCLWDNLLKGRPQCCNFR 741

BLAST of CsGy6G035950 vs. TAIR10
Match: AT5G62620.1 (Galactosyltransferase family protein)

HSP 1 Score: 685.6 bits (1768), Expect = 3.0e-197
Identity = 348/683 (50.95%), Positives = 453/683 (66.33%), Query Frame = 0

Query: 15  RLSHLLLVIGVLYLVFISFKFPRFLEIA-ATLSGDESNNGLXXXXXXXXXXXXXXXXLSS 74
           R   +L+ +G+LY++ I+F+ P   +   ++LS D                      L+ 
Sbjct: 25  RSVQILMAVGLLYMLLITFEIPFVFKTGLSSLSQD---------------------PLTR 84

Query: 75  VYKDTFHRKLEDNQHLEAPLTPKKEPLEEVNNVTGPIKPIKHKYGRITGNISSQLNHTND 134
             K    R+L++ +   AP  P K  L + +    P + ++ +  RI  ++       N 
Sbjct: 85  PEKHNSQRELQERR---APTRPLKSLLYQESQSESPAQGLRRR-TRILSSLRFDPETFNP 144

Query: 135 FSM-----LETMADEAWTLGSMAWEEVDK----FGLNETSESSILEGKPESCPSWISTDG 194
            S      L   A  AW +G   WEE++       L +  +  I E    SC   +S  G
Sbjct: 145 SSKDGSVELHKSAKVAWEVGRKIWEELESGKTLKALEKEKKKKIEEHGTNSCSLSVSLTG 204

Query: 195 KKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQL-LKVGGDPKVMVSQFMVELQG 254
             L++   +M LPCGL  GS IT++G P  AH E  P++ +   GD  V VSQF +ELQG
Sbjct: 205 SDLLKRGNIMELPCGLTLGSHITVVGKPRAAHSEKDPKISMLKEGDEAVKVSQFKLELQG 264

Query: 255 LKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSEDEMLVDG 314
           LK+V+GE+PP+ILHLNPRLKGDWS +PVIE NTCYRMQWG+AQRC+G   S +DE  VDG
Sbjct: 265 LKAVEGEEPPRILHLNPRLKGDWSGKPVIEQNTCYRMQWGSAQRCEGW-RSRDDEETVDG 324

Query: 315 NHRCEKWLRSDVTDSKESKTTS----WFRRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 374
             +CEKW R D   SKE +++     W  R IGR +K  V WPFPF   +LF+LTL AG+
Sbjct: 325 QVKCEKWARDDSITSKEEESSKAASWWLSRLIGRSKKVTVEWPFPFTVDKLFVLTLSAGL 384

Query: 375 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATALPTSHPSFSPQRVLEM 434
           +GYH++V G+H+TSF YR GFTLEDATGL + GD+D+HS +A +LPTSHPSFSPQR LE+
Sbjct: 385 EGYHVSVDGKHVTSFPYRTGFTLEDATGLTINGDIDVHSVFAGSLPTSHPSFSPQRHLEL 444

Query: 435 SEKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVMSSNVVVRFFVALNPRK 494
           S  W++  LP   V +FIG+LSA NHFAERMAVR++WMQ   V SS VV RFFVAL+ RK
Sbjct: 445 SSNWQAPSLPDEQVDMFIGILSAGNHFAERMAVRRSWMQHKLVKSSKVVARFFVALHSRK 504

Query: 495 EVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFVRVE 554
           EVN  LKKEA +FGDIVI+P+MD Y+LVVLKT+AICE+G   L A +IMKCDDDTFV+V+
Sbjct: 505 EVNVELKKEAEFFGDIVIVPYMDSYDLVVLKTVAICEYGAHQLAAKFIMKCDDDTFVQVD 564

Query: 555 TVLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIA 614
            VL + +   + +SLY+GN+N  H+PLR GKW+VTYEEWPEE YPPYANGPGYI+S DI+
Sbjct: 565 AVLSEAKKTPTDRSLYIGNINYYHKPLRQGKWSVTYEEWPEEDYPPYANGPGYILSNDIS 624

Query: 615 KYIVSQHENKSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQS 674
           ++IV + E   LR+FKMEDVS+GMWVEQFN+    V Y H+ +FCQ+GC+E+Y TAHYQS
Sbjct: 625 RFIVKEFEKHKLRMFKMEDVSVGMWVEQFNNGTKPVDYIHSLRFCQFGCIENYLTAHYQS 681

Query: 675 PRQILCLWDKLA-RGHAHCCNFR 682
           PRQ++CLWDKL   G   CCN R
Sbjct: 685 PRQMICLWDKLVLTGKPQCCNMR 681

BLAST of CsGy6G035950 vs. TAIR10
Match: AT1G74800.1 (Galactosyltransferase family protein)

HSP 1 Score: 682.6 bits (1760), Expect = 2.6e-196
Identity = 318/547 (58.14%), Positives = 407/547 (74.41%), Query Frame = 0

Query: 137 LETMADEAWTLGSMAWEEVDKFGLNETSESSILEGKPESCPSWISTDGKKLMEGDG-LMF 196
           L   A EAW LG   W+E++   L +  E    + KP+SCP  +S  G + M  +  LM 
Sbjct: 136 LHKSAKEAWQLGRKLWKELESGRLEKLVEKP-EKNKPDSCPHSVSLTGSEFMNRENKLME 195

Query: 197 LPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPKVMVSQFMVELQGLKSVDGEDPPKI 256
           LPCGL  GS IT++G P  AH +         GD   +VSQF++ELQGLK+V+GEDPP+I
Sbjct: 196 LPCGLTLGSHITLVGRPRKAHPK--------EGDWSKLVSQFVIELQGLKTVEGEDPPRI 255

Query: 257 LHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSEDEMLVDGNHRCEKWLRSDV 316
           LH NPRLKGDWSK+PVIE N+CYRMQWG AQRC+G   S +DE  VD + +CEKW+R D 
Sbjct: 256 LHFNPRLKGDWSKKPVIEQNSCYRMQWGPAQRCEGW-KSRDDEETVDSHVKCEKWIRDDD 315

Query: 317 TDSKESKTTSWFRRFIGREQKPEVTWPFPFMEGRLFILTLRAGVDGYHINVGGRHLTSFA 376
             S+ S+   W  R IGR ++ +V WPFPF+E +LF+LTL AG++GYHINV G+H+TSF 
Sbjct: 316 NYSEGSRARWWLNRLIGRRKRVKVEWPFPFVEEKLFVLTLSAGLEGYHINVDGKHVTSFP 375

Query: 377 YRPGFTLEDATGLAVKGDVDIHSTYATALPTSHPSFSPQRVLEMSEKWKSQPLPKSSVFL 436
           YR GFTLEDATGL V GD+D+HS +  +LPTSHPSF+PQR LE+S++W++  +P   V +
Sbjct: 376 YRTGFTLEDATGLTVNGDIDVHSVFVASLPTSHPSFAPQRHLELSKRWQAPVVPDGPVEI 435

Query: 437 FIGVLSATNHFAERMAVRKTWMQSSAVMSSNVVVRFFVALNPRKEVNAVLKKEAAYFGDI 496
           FIG+LSA NHF+ERMAVRK+WMQ   + S+ VV RFFVAL+ RKEVN  LKKEA YFGDI
Sbjct: 436 FIGILSAGNHFSERMAVRKSWMQHVLITSAKVVARFFVALHGRKEVNVELKKEAEYFGDI 495

Query: 497 VILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFVRVETVLKQIEGISSKKSLY 556
           V++P+MD Y+LVVLKT+AICE G +  +A YIMKCDDDTFV++  V+ +++ +   +SLY
Sbjct: 496 VLVPYMDSYDLVVLKTVAICEHGALAFSAKYIMKCDDDTFVKLGAVINEVKKVPEGRSLY 555

Query: 557 MGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIAKYIVSQHENKSLRIFK 616
           +GN+N  H+PLR GKWAVTYEEWPEE YPPYANGPGY++S DIA++IV + E   LR+FK
Sbjct: 556 IGNMNYYHKPLRGGKWAVTYEEWPEEDYPPYANGPGYVLSSDIARFIVDKFERHKLRLFK 615

Query: 617 MEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQSPRQILCLWDKLAR-GH 676
           MEDVS+GMWVE F +T   V Y H+ +FCQ+GC+E+Y+TAHYQSPRQ++CLWDKL R   
Sbjct: 616 MEDVSVGMWVEHFKNTTNPVDYRHSLRFCQFGCVENYYTAHYQSPRQMICLWDKLLRQNK 672

Query: 677 AHCCNFR 682
             CCN R
Sbjct: 676 PECCNMR 672
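
The percentages on each HSP summary line are simple ratios over the alignment length, so they can be rechecked from the counts. The following is a minimal sketch, not part of the database output: the function names `parse_hsp_summary` and `recomputed_percent` are hypothetical, and the pattern assumes the summary line has the "Identity = a/n (x%), Positives = b/n (y%)" shape seen in this report (it also tolerates the "Postives" spelling).

```python
import re

# Pattern for an HSP summary line. "Posi?tives" accepts both the
# correct spelling and the "Postives" variant found in some reports.
HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Extract counts and reported percentages from one summary line."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, length, ident_pct, pos, pos_length, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(length),
        "positives_length": int(pos_length),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


def recomputed_percent(count, length):
    """Percentage of aligned columns, rounded to two decimals."""
    return round(100.0 * count / length, 2)


if __name__ == "__main__":
    line = "Identity = 318/547 (58.14%), Positives = 407/547 (74.41%), Query Frame = 0"
    hsp = parse_hsp_summary(line)
    print(recomputed_percent(hsp["identities"], hsp["length"]))
    print(recomputed_percent(hsp["positives"], hsp["length"]))
```

Comparing the recomputed values with `identity_pct` and `positives_pct` is a quick consistency check when copying hits out of the report by hand.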

BLAST of CsGy6G035950 vs. TAIR10
Match: AT1G27120.1 (Galactosyltransferase family protein)

HSP 1 Score: 659.1 bits (1699), Expect = 3.0e-189
Identity = 341/695 (49.06%), Positives = 443/695 (63.74%), Query Frame = 0

Query: 1   MKKVKTEPPVAR-RLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESN-----NGL 60
           MKK K +   ++ R  L   LLV+ + Y + +SF+ P      +    D+ +     + L
Sbjct: 1   MKKSKLDNSSSQIRFGLVQFLLVVLLFYFLCMSFEIPFIFRTGSGSGSDDVSSSSFADAL 60

Query: 61  XXXXXXXXXXXXXXXXLSSVYKDTFHRKLEDNQHLEAPLTPKKEPLEEVNNVTGPIKPIK 120
                           +    +   HR  +D   ++  L  +K  + E  +V+       
Sbjct: 61  PRPMVVGGGSREANWVVGEEEEADPHRHFKDPGRVQLRLPERK--MREFKSVSEIF---- 120

Query: 121 HKYGRITGNISSQLNHTNDFSMLETMADEAWTLGSMAWEEVDKFGLNETSESSILEGKPE 180
                +  +       +++FS+    A  A ++G   W+ +D  GL +  ++ + + + E
Sbjct: 121 -----VNESFFDNGGFSDEFSIFHKTAKHAISMGRKMWDGLDS-GLIKPDKAPV-KTRIE 180

Query: 181 SCPSWISTDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPKVMV 240
            CP  +S    + +    ++ LPCGL  GS IT++ TPH AH E         GD   MV
Sbjct: 181 KCPDMVSVSESEFVNRSRILVLPCGLTLGSHITVVATPHWAHVE-------KDGDKTAMV 240

Query: 241 SQFMVELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSS 300
           SQFM+ELQGLK+VDGEDPP+ILH NPR+KGDWS RPVIE NTCYRMQWG+  RCDG   S
Sbjct: 241 SQFMMELQGLKAVDGEDPPRILHFNPRIKGDWSGRPVIEQNTCYRMQWGSGLRCDG-RES 300

Query: 301 SEDEMLVDGNHRCEKWLRSDVTDSKESKTTSWFRRFIGREQKPEV-------TWPFPFME 360
           S+DE  VDG  +CE+W R                                   W +PF E
Sbjct: 301 SDDEEYVDGEVKCERWKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMITHDWDYPFAE 360

Query: 361 GRLFILTLRAGVDGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATALPTS 420
           G+LF+LTLRAG++GYHI+V GRH+TSF YR GF LEDATGLAVKG++D+HS YA +LP++
Sbjct: 361 GKLFVLTLRAGMEGYHISVNGRHITSFPYRTGFVLEDATGLAVKGNIDVHSVYAASLPST 420

Query: 421 HPSFSPQRVLEMSEKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVMSSNV 480
           +PSF+PQ+ LEM   WK+  LP+  V LFIG+LSA NHFAERMAVRK+WMQ   V SS V
Sbjct: 421 NPSFAPQKHLEMQRIWKAPSLPQKPVELFIGILSAGNHFAERMAVRKSWMQQKLVRSSKV 480

Query: 481 VVRFFVALNPRKEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYI 540
           V RFFVAL+ RKEVN  LKKEA YFGDIVI+P+MD Y+LVVLKT+AICE+GV  + A Y+
Sbjct: 481 VARFFVALHARKEVNVDLKKEAEYFGDIVIVPYMDHYDLVVLKTVAICEYGVNTVAAKYV 540

Query: 541 MKCDDDTFVRVETVLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYA 600
           MKCDDDTFVRV+ V+++ E +  ++SLY+GN+N  H+PLR GKWAVT+EEWPEE YPPYA
Sbjct: 541 MKCDDDTFVRVDAVIQEAEKVKGRESLYIGNINFNHKPLRTGKWAVTFEEWPEEYYPPYA 600

Query: 601 NGPGYIVSIDIAKYIVSQHENKSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYG 660
           NGPGYI+S D+AK+IV   E K LR+FKMEDVSMGMWVE+FN T   V   H+ KFCQ+G
Sbjct: 601 NGPGYILSYDVAKFIVDDFEQKRLRLFKMEDVSMGMWVEKFNET-RPVAVVHSLKFCQFG 660

Query: 661 CMEDYFTAHYQSPRQILCLWDKLAR-GHAHCCNFR 682
           C+EDYFTAHYQSPRQ++C+WDKL R G   CCN R
Sbjct: 661 CIEDYFTAHYQSPRQMICMWDKLQRLGKPQCCNMR 673

BLAST of CsGy6G035950 vs. TAIR10
Match: AT1G26810.1 (galactosyltransferase1)

HSP 1 Score: 328.2 bits (840), Expect = 1.2e-89
Identity = 206/567 (36.33%), Positives = 296/567 (52.20%), Query Frame = 0

Query: 134 FSMLETMADEAWTL---------GSMAWEE----VDKFGLNETSESSILEGKPESCPSWI 193
           ++ LE++ D A +L           + WE     V+   L + +E+   +GK E CP ++
Sbjct: 99  WNRLESLVDNAQSLVNGVDAIKEAGIVWESLVSAVEAKKLVDVNENQTRKGKEELCPQFL 158

Query: 194 STDGKKLMEGDGL-MFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPKVMVSQFMV 253
           S       +G  L + +PCGL  GSSIT+IG                   P  +V  F +
Sbjct: 159 SKMNATEADGSSLKLQIPCGLTQGSSITVIGI------------------PDGLVGSFRI 218

Query: 254 ELQGLKSVDGEDPPKILHLNPRLKGDWS-KRPVIEHNTCYRMQ-WGTAQRCDGLPSSSED 313
           +L G       DPP I+H N RL GD S + PVI  N+    Q WG  +RC   P    D
Sbjct: 219 DLTGQPLPGEPDPPIIVHYNVRLLGDKSTEDPVIVQNSWTASQDWGAEERC---PKFDPD 278

Query: 314 -EMLVDGNHRCEKWLRSDVTDSKESKTTSWFRRF--IGREQKPEVTWPFPFMEGRLFILT 373
               VD    C K +  ++  +  +   S   R   + RE      + FPF +G L + T
Sbjct: 279 MNKKVDDLDECNKMVGGEINRTSSTSLQSNTSRGVPVAREASKHEKY-FPFKQGFLSVAT 338

Query: 374 LRAGVDGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATALPTSHPSFSPQ 433
           LR G +G  + V G+H+TSFA+R        + + + GD  + S  A+ LPTS  S   +
Sbjct: 339 LRVGTEGMQMTVDGKHITSFAFRDTLEPWLVSEIRITGDFRLISILASGLPTSEES---E 398

Query: 434 RVLEMSEKWKSQPL-PKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVMSSNVVVRFFV 493
            V+++ E  KS  L P   + L IGV S  N+F  RMAVR+TWMQ   V S  V VRFFV
Sbjct: 399 HVVDL-EALKSPTLSPLRPLDLVIGVFSTANNFKRRMAVRRTWMQYDDVRSGRVAVRFFV 458

Query: 494 ALNPRKEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDD 553
            L+    VN  L  EA  +GD+ ++PF+D Y L+  KT+AIC FG    +A +IMK DDD
Sbjct: 459 GLHKSPLVNLELWNEARTYGDVQLMPFVDYYSLISWKTLAICIFGTEVDSAKFIMKTDDD 518

Query: 554 TFVRVETVLKQIEGISSKKSLYMGNLNLLHRPLRH--GKWAVTYEEWPEEVYPPYANGPG 613
            FVRV+ VL  +   ++ + L  G +N   +P+R+   KW ++YEEWPEE YPP+A+GPG
Sbjct: 519 AFVRVDEVLLSLSMTNNTRGLIYGLINSDSQPIRNPDSKWYISYEEWPEEKYPPWAHGPG 578

Query: 614 YIVSIDIAKYIVSQHENKSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMED 673
           YIVS DIA+ +    +  +L++FK+EDV+MG+W+ +         Y ++ +    GC + 
Sbjct: 579 YIVSRDIAESVGKLFKEGNLKMFKLEDVAMGIWIAELTKHGLEPHYENDGRIISDGCKDG 638

Query: 674 YFTAHYQSPRQILCLWDKLARGHAHCC 679
           Y  AHYQSP ++ CLW K        C
Sbjct: 639 YVVAHYQSPAEMTCLWRKYQETKRSLC 639

BLAST of CsGy6G035950 vs. Swiss-Prot
Match: sp|A7XDQ9|B3GTK_ARATH (Hydroxyproline O-galactosyltransferase GALT2 OS=Arabidopsis thaliana OX=3702 GN=GALT2 PE=1 SV=1)

HSP 1 Score: 924.9 bits (2389), Expect = 5.3e-268
Identity = 451/690 (65.36%), Positives = 538/690 (77.97%), Query Frame = 0

Query: 1   MKKVKTEP----PVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGLXX 60
           MK+VK+E       +RR +LSH LL I   YLVF++FKFP F+E+ A LSGD   +G   
Sbjct: 1   MKRVKSESFRGVYSSRRFKLSHFLLAIAGFYLVFLAFKFPHFIEMVAMLSGDTGLDGALS 60

Query: 61  XXXXXXXXXXXXXXLSSVYKDTFHRKLEDNQHLEAPLTPKKEPLEEVNNVTGPIKPIKHK 120
                           S+  D  +RKLED  H   P T +K   EE  N +  I+P+  +
Sbjct: 61  DTSLDVSLS------GSLRNDMLNRKLEDEDHQSGPSTTQKVSPEEKINGSKQIQPLLFR 120

Query: 121 YGRITGNISSQLNHTNDFSMLETMADEAWTLGSMAWEEVDKFGLNETSES-SILEGKPES 180
           YGRI+G +  + N T   S  E MADEAW LGS AWE+VDKF +++ +ES SI EGK ES
Sbjct: 121 YGRISGEVMRRRNRTIHMSPFERMADEAWILGSKAWEDVDKFEVDKINESASIFEGKVES 180

Query: 181 CPSWISTDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGD-PKVMV 240
           CPS IS +G  L + + +M LPCGLAAGSSITI+GTP  AH+E VPQ  ++      V+V
Sbjct: 181 CPSQISMNGDDLNKANRIMLLPCGLAAGSSITILGTPQYAHKESVPQRSRLTRSYGMVLV 240

Query: 241 SQFMVELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSS 300
           SQFMVELQGLK+ DGE PPKILHLNPR+KGDW+ RPVIEHNTCYRMQWG AQRCDG PS 
Sbjct: 241 SQFMVELQGLKTGDGEYPPKILHLNPRIKGDWNHRPVIEHNTCYRMQWGVAQRCDGTPSK 300

Query: 301 SEDEMLVDGNHRCEKWLRSDV---TDSKESKTTSWFRRFIGREQKPEVTWPFPFMEGRLF 360
            + ++LVDG  RCEKW ++D+    DSKESKTTSWF+RFIGREQKPEVTW FPF EG++F
Sbjct: 301 KDADVLVDGFRRCEKWTQNDIIDMVDSKESKTTSWFKRFIGREQKPEVTWSFPFAEGKVF 360

Query: 361 ILTLRAGVDGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATALPTSHPSF 420
           +LTLRAG+DG+HINVGGRH++SF YRPGFT+EDATGLAV GDVDIHS +AT+L TSHPSF
Sbjct: 361 VLTLRAGIDGFHINVGGRHVSSFPYRPGFTIEDATGLAVTGDVDIHSIHATSLSTSHPSF 420

Query: 421 SPQRVLEMSEKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVMSSNVVVRF 480
           SPQ+ +E S +WK+ PLP +   LF+GVLSATNHF+ERMAVRKTWMQ  ++ SS+VV RF
Sbjct: 421 SPQKAIEFSSEWKAPPLPGTPFRLFMGVLSATNHFSERMAVRKTWMQHPSIKSSDVVARF 480

Query: 481 FVALNPRKEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCD 540
           FVALNPRKEVNA+LKKEA YFGDIVILPFMDRYELVVLKTIAICEFGV N+TA YIMKCD
Sbjct: 481 FVALNPRKEVNAMLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGVQNVTAPYIMKCD 540

Query: 541 DDTFVRVETVLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPG 600
           DDTF+RVE++LKQI+G+S +KSLYMGNLNL HRPLR GKW VT+EEWPE VYPPYANGPG
Sbjct: 541 DDTFIRVESILKQIDGVSPEKSLYMGNLNLRHRPLRTGKWTVTWEEWPEAVYPPYANGPG 600

Query: 601 YIVSIDIAKYIVSQHENKSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMED 660
           YI+S +IAKYIVSQ+    LR+FKMEDVSMG+WVEQFN+++  V+YSH+WKFCQYGC  +
Sbjct: 601 YIISSNIAKYIVSQNSRHKLRLFKMEDVSMGLWVEQFNASMQPVEYSHSWKFCQYGCTLN 660

Query: 661 YFTAHYQSPRQILCLWDKLARGHAHCCNFR 682
           Y+TAHYQSP Q++CLWD L +G   CCNFR
Sbjct: 661 YYTAHYQSPSQMMCLWDNLLKGRPQCCNFR 684

BLAST of CsGy6G035950 vs. Swiss-Prot
Match: sp|Q9LV16|B3GTJ_ARATH (Hydroxyproline O-galactosyltransferase GALT6 OS=Arabidopsis thaliana OX=3702 GN=GALT6 PE=2 SV=2)

HSP 1 Score: 685.6 bits (1768), Expect = 5.4e-196
Identity = 348/683 (50.95%), Positives = 453/683 (66.33%), Query Frame = 0

Query: 15  RLSHLLLVIGVLYLVFISFKFPRFLEIA-ATLSGDESNNGLXXXXXXXXXXXXXXXXLSS 74
           R   +L+ +G+LY++ I+F+ P   +   ++LS D                      L+ 
Sbjct: 25  RSVQILMAVGLLYMLLITFEIPFVFKTGLSSLSQD---------------------PLTR 84

Query: 75  VYKDTFHRKLEDNQHLEAPLTPKKEPLEEVNNVTGPIKPIKHKYGRITGNISSQLNHTND 134
             K    R+L++ +   AP  P K  L + +    P + ++ +  RI  ++       N 
Sbjct: 85  PEKHNSQRELQERR---APTRPLKSLLYQESQSESPAQGLRRR-TRILSSLRFDPETFNP 144

Query: 135 FSM-----LETMADEAWTLGSMAWEEVDK----FGLNETSESSILEGKPESCPSWISTDG 194
            S      L   A  AW +G   WEE++       L +  +  I E    SC   +S  G
Sbjct: 145 SSKDGSVELHKSAKVAWEVGRKIWEELESGKTLKALEKEKKKKIEEHGTNSCSLSVSLTG 204

Query: 195 KKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQL-LKVGGDPKVMVSQFMVELQG 254
             L++   +M LPCGL  GS IT++G P  AH E  P++ +   GD  V VSQF +ELQG
Sbjct: 205 SDLLKRGNIMELPCGLTLGSHITVVGKPRAAHSEKDPKISMLKEGDEAVKVSQFKLELQG 264

Query: 255 LKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSEDEMLVDG 314
           LK+V+GE+PP+ILHLNPRLKGDWS +PVIE NTCYRMQWG+AQRC+G   S +DE  VDG
Sbjct: 265 LKAVEGEEPPRILHLNPRLKGDWSGKPVIEQNTCYRMQWGSAQRCEGW-RSRDDEETVDG 324

Query: 315 NHRCEKWLRSDVTDSKESKTTS----WFRRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 374
             +CEKW R D   SKE +++     W  R IGR +K  V WPFPF   +LF+LTL AG+
Sbjct: 325 QVKCEKWARDDSITSKEEESSKAASWWLSRLIGRSKKVTVEWPFPFTVDKLFVLTLSAGL 384

Query: 375 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATALPTSHPSFSPQRVLEM 434
           +GYH++V G+H+TSF YR GFTLEDATGL + GD+D+HS +A +LPTSHPSFSPQR LE+
Sbjct: 385 EGYHVSVDGKHVTSFPYRTGFTLEDATGLTINGDIDVHSVFAGSLPTSHPSFSPQRHLEL 444

Query: 435 SEKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVMSSNVVVRFFVALNPRK 494
           S  W++  LP   V +FIG+LSA NHFAERMAVR++WMQ   V SS VV RFFVAL+ RK
Sbjct: 445 SSNWQAPSLPDEQVDMFIGILSAGNHFAERMAVRRSWMQHKLVKSSKVVARFFVALHSRK 504

Query: 495 EVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFVRVE 554
           EVN  LKKEA +FGDIVI+P+MD Y+LVVLKT+AICE+G   L A +IMKCDDDTFV+V+
Sbjct: 505 EVNVELKKEAEFFGDIVIVPYMDSYDLVVLKTVAICEYGAHQLAAKFIMKCDDDTFVQVD 564

Query: 555 TVLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIA 614
            VL + +   + +SLY+GN+N  H+PLR GKW+VTYEEWPEE YPPYANGPGYI+S DI+
Sbjct: 565 AVLSEAKKTPTDRSLYIGNINYYHKPLRQGKWSVTYEEWPEEDYPPYANGPGYILSNDIS 624

Query: 615 KYIVSQHENKSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQS 674
           ++IV + E   LR+FKMEDVS+GMWVEQFN+    V Y H+ +FCQ+GC+E+Y TAHYQS
Sbjct: 625 RFIVKEFEKHKLRMFKMEDVSVGMWVEQFNNGTKPVDYIHSLRFCQFGCIENYLTAHYQS 681

Query: 675 PRQILCLWDKLA-RGHAHCCNFR 682
           PRQ++CLWDKL   G   CCN R
Sbjct: 685 PRQMICLWDKLVLTGKPQCCNMR 681

BLAST of CsGy6G035950 vs. Swiss-Prot
Match: sp|Q8RX55|B3GTI_ARATH (Hydroxyproline O-galactosyltransferase GALT5 OS=Arabidopsis thaliana OX=3702 GN=GALT5 PE=1 SV=1)

HSP 1 Score: 682.6 bits (1760), Expect = 4.6e-195
Identity = 318/547 (58.14%), Positives = 407/547 (74.41%), Query Frame = 0

Query: 137 LETMADEAWTLGSMAWEEVDKFGLNETSESSILEGKPESCPSWISTDGKKLMEGDG-LMF 196
           L   A EAW LG   W+E++   L +  E    + KP+SCP  +S  G + M  +  LM 
Sbjct: 136 LHKSAKEAWQLGRKLWKELESGRLEKLVEKP-EKNKPDSCPHSVSLTGSEFMNRENKLME 195

Query: 197 LPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPKVMVSQFMVELQGLKSVDGEDPPKI 256
           LPCGL  GS IT++G P  AH +         GD   +VSQF++ELQGLK+V+GEDPP+I
Sbjct: 196 LPCGLTLGSHITLVGRPRKAHPK--------EGDWSKLVSQFVIELQGLKTVEGEDPPRI 255

Query: 257 LHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSEDEMLVDGNHRCEKWLRSDV 316
           LH NPRLKGDWSK+PVIE N+CYRMQWG AQRC+G   S +DE  VD + +CEKW+R D 
Sbjct: 256 LHFNPRLKGDWSKKPVIEQNSCYRMQWGPAQRCEGW-KSRDDEETVDSHVKCEKWIRDDD 315

Query: 317 TDSKESKTTSWFRRFIGREQKPEVTWPFPFMEGRLFILTLRAGVDGYHINVGGRHLTSFA 376
             S+ S+   W  R IGR ++ +V WPFPF+E +LF+LTL AG++GYHINV G+H+TSF 
Sbjct: 316 NYSEGSRARWWLNRLIGRRKRVKVEWPFPFVEEKLFVLTLSAGLEGYHINVDGKHVTSFP 375

Query: 377 YRPGFTLEDATGLAVKGDVDIHSTYATALPTSHPSFSPQRVLEMSEKWKSQPLPKSSVFL 436
           YR GFTLEDATGL V GD+D+HS +  +LPTSHPSF+PQR LE+S++W++  +P   V +
Sbjct: 376 YRTGFTLEDATGLTVNGDIDVHSVFVASLPTSHPSFAPQRHLELSKRWQAPVVPDGPVEI 435

Query: 437 FIGVLSATNHFAERMAVRKTWMQSSAVMSSNVVVRFFVALNPRKEVNAVLKKEAAYFGDI 496
           FIG+LSA NHF+ERMAVRK+WMQ   + S+ VV RFFVAL+ RKEVN  LKKEA YFGDI
Sbjct: 436 FIGILSAGNHFSERMAVRKSWMQHVLITSAKVVARFFVALHGRKEVNVELKKEAEYFGDI 495

Query: 497 VILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFVRVETVLKQIEGISSKKSLY 556
           V++P+MD Y+LVVLKT+AICE G +  +A YIMKCDDDTFV++  V+ +++ +   +SLY
Sbjct: 496 VLVPYMDSYDLVVLKTVAICEHGALAFSAKYIMKCDDDTFVKLGAVINEVKKVPEGRSLY 555

Query: 557 MGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIAKYIVSQHENKSLRIFK 616
           +GN+N  H+PLR GKWAVTYEEWPEE YPPYANGPGY++S DIA++IV + E   LR+FK
Sbjct: 556 IGNMNYYHKPLRGGKWAVTYEEWPEEDYPPYANGPGYVLSSDIARFIVDKFERHKLRLFK 615

Query: 617 MEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQSPRQILCLWDKLAR-GH 676
           MEDVS+GMWVE F +T   V Y H+ +FCQ+GC+E+Y+TAHYQSPRQ++CLWDKL R   
Sbjct: 616 MEDVSVGMWVEHFKNTTNPVDYRHSLRFCQFGCVENYYTAHYQSPRQMICLWDKLLRQNK 672

Query: 677 AHCCNFR 682
             CCN R
Sbjct: 676 PECCNMR 672

BLAST of CsGy6G035950 vs. Swiss-Prot
Match: sp|Q8GXG6|B3GTH_ARATH (Hydroxyproline O-galactosyltransferase GALT4 OS=Arabidopsis thaliana OX=3702 GN=GALT4 PE=2 SV=2)

HSP 1 Score: 659.1 bits (1699), Expect = 5.5e-188
Identity = 341/695 (49.06%), Positives = 443/695 (63.74%), Query Frame = 0

Query: 1   MKKVKTEPPVAR-RLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESN-----NGL 60
           MKK K +   ++ R  L   LLV+ + Y + +SF+ P      +    D+ +     + L
Sbjct: 1   MKKSKLDNSSSQIRFGLVQFLLVVLLFYFLCMSFEIPFIFRTGSGSGSDDVSSSSFADAL 60

Query: 61  XXXXXXXXXXXXXXXXLSSVYKDTFHRKLEDNQHLEAPLTPKKEPLEEVNNVTGPIKPIK 120
                           +    +   HR  +D   ++  L  +K  + E  +V+       
Sbjct: 61  PRPMVVGGGSREANWVVGEEEEADPHRHFKDPGRVQLRLPERK--MREFKSVSEIF---- 120

Query: 121 HKYGRITGNISSQLNHTNDFSMLETMADEAWTLGSMAWEEVDKFGLNETSESSILEGKPE 180
                +  +       +++FS+    A  A ++G   W+ +D  GL +  ++ + + + E
Sbjct: 121 -----VNESFFDNGGFSDEFSIFHKTAKHAISMGRKMWDGLDS-GLIKPDKAPV-KTRIE 180

Query: 181 SCPSWISTDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPKVMV 240
            CP  +S    + +    ++ LPCGL  GS IT++ TPH AH E         GD   MV
Sbjct: 181 KCPDMVSVSESEFVNRSRILVLPCGLTLGSHITVVATPHWAHVE-------KDGDKTAMV 240

Query: 241 SQFMVELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSS 300
           SQFM+ELQGLK+VDGEDPP+ILH NPR+KGDWS RPVIE NTCYRMQWG+  RCDG   S
Sbjct: 241 SQFMMELQGLKAVDGEDPPRILHFNPRIKGDWSGRPVIEQNTCYRMQWGSGLRCDG-RES 300

Query: 301 SEDEMLVDGNHRCEKWLRSDVTDSKESKTTSWFRRFIGREQKPEV-------TWPFPFME 360
           S+DE  VDG  +CE+W R                                   W +PF E
Sbjct: 301 SDDEEYVDGEVKCERWKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMITHDWDYPFAE 360

Query: 361 GRLFILTLRAGVDGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATALPTS 420
           G+LF+LTLRAG++GYHI+V GRH+TSF YR GF LEDATGLAVKG++D+HS YA +LP++
Sbjct: 361 GKLFVLTLRAGMEGYHISVNGRHITSFPYRTGFVLEDATGLAVKGNIDVHSVYAASLPST 420

Query: 421 HPSFSPQRVLEMSEKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVMSSNV 480
           +PSF+PQ+ LEM   WK+  LP+  V LFIG+LSA NHFAERMAVRK+WMQ   V SS V
Sbjct: 421 NPSFAPQKHLEMQRIWKAPSLPQKPVELFIGILSAGNHFAERMAVRKSWMQQKLVRSSKV 480

Query: 481 VVRFFVALNPRKEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYI 540
           V RFFVAL+ RKEVN  LKKEA YFGDIVI+P+MD Y+LVVLKT+AICE+GV  + A Y+
Sbjct: 481 VARFFVALHARKEVNVDLKKEAEYFGDIVIVPYMDHYDLVVLKTVAICEYGVNTVAAKYV 540

Query: 541 MKCDDDTFVRVETVLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYA 600
           MKCDDDTFVRV+ V+++ E +  ++SLY+GN+N  H+PLR GKWAVT+EEWPEE YPPYA
Sbjct: 541 MKCDDDTFVRVDAVIQEAEKVKGRESLYIGNINFNHKPLRTGKWAVTFEEWPEEYYPPYA 600

Query: 601 NGPGYIVSIDIAKYIVSQHENKSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYG 660
           NGPGYI+S D+AK+IV   E K LR+FKMEDVSMGMWVE+FN T   V   H+ KFCQ+G
Sbjct: 601 NGPGYILSYDVAKFIVDDFEQKRLRLFKMEDVSMGMWVEKFNET-RPVAVVHSLKFCQFG 660

Query: 661 CMEDYFTAHYQSPRQILCLWDKLAR-GHAHCCNFR 682
           C+EDYFTAHYQSPRQ++C+WDKL R G   CCN R
Sbjct: 661 CIEDYFTAHYQSPRQMICMWDKLQRLGKPQCCNMR 673

BLAST of CsGy6G035950 vs. Swiss-Prot
Match: sp|Q8L7F9|B3GTF_ARATH (Beta-1,3-galactosyltransferase GALT1 OS=Arabidopsis thaliana OX=3702 GN=GALT1 PE=1 SV=1)

HSP 1 Score: 328.2 bits (840), Expect = 2.2e-88
Identity = 206/567 (36.33%), Positives = 296/567 (52.20%), Query Frame = 0

Query: 134 FSMLETMADEAWTL---------GSMAWEE----VDKFGLNETSESSILEGKPESCPSWI 193
           ++ LE++ D A +L           + WE     V+   L + +E+   +GK E CP ++
Sbjct: 99  WNRLESLVDNAQSLVNGVDAIKEAGIVWESLVSAVEAKKLVDVNENQTRKGKEELCPQFL 158

Query: 194 STDGKKLMEGDGL-MFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPKVMVSQFMV 253
           S       +G  L + +PCGL  GSSIT+IG                   P  +V  F +
Sbjct: 159 SKMNATEADGSSLKLQIPCGLTQGSSITVIGI------------------PDGLVGSFRI 218

Query: 254 ELQGLKSVDGEDPPKILHLNPRLKGDWS-KRPVIEHNTCYRMQ-WGTAQRCDGLPSSSED 313
           +L G       DPP I+H N RL GD S + PVI  N+    Q WG  +RC   P    D
Sbjct: 219 DLTGQPLPGEPDPPIIVHYNVRLLGDKSTEDPVIVQNSWTASQDWGAEERC---PKFDPD 278

Query: 314 -EMLVDGNHRCEKWLRSDVTDSKESKTTSWFRRF--IGREQKPEVTWPFPFMEGRLFILT 373
               VD    C K +  ++  +  +   S   R   + RE      + FPF +G L + T
Sbjct: 279 MNKKVDDLDECNKMVGGEINRTSSTSLQSNTSRGVPVAREASKHEKY-FPFKQGFLSVAT 338

Query: 374 LRAGVDGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATALPTSHPSFSPQ 433
           LR G +G  + V G+H+TSFA+R        + + + GD  + S  A+ LPTS  S   +
Sbjct: 339 LRVGTEGMQMTVDGKHITSFAFRDTLEPWLVSEIRITGDFRLISILASGLPTSEES---E 398

Query: 434 RVLEMSEKWKSQPL-PKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVMSSNVVVRFFV 493
            V+++ E  KS  L P   + L IGV S  N+F  RMAVR+TWMQ   V S  V VRFFV
Sbjct: 399 HVVDL-EALKSPTLSPLRPLDLVIGVFSTANNFKRRMAVRRTWMQYDDVRSGRVAVRFFV 458

Query: 494 ALNPRKEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDD 553
            L+    VN  L  EA  +GD+ ++PF+D Y L+  KT+AIC FG    +A +IMK DDD
Sbjct: 459 GLHKSPLVNLELWNEARTYGDVQLMPFVDYYSLISWKTLAICIFGTEVDSAKFIMKTDDD 518

Query: 554 TFVRVETVLKQIEGISSKKSLYMGNLNLLHRPLRH--GKWAVTYEEWPEEVYPPYANGPG 613
            FVRV+ VL  +   ++ + L  G +N   +P+R+   KW ++YEEWPEE YPP+A+GPG
Sbjct: 519 AFVRVDEVLLSLSMTNNTRGLIYGLINSDSQPIRNPDSKWYISYEEWPEEKYPPWAHGPG 578

Query: 614 YIVSIDIAKYIVSQHENKSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMED 673
           YIVS DIA+ +    +  +L++FK+EDV+MG+W+ +         Y ++ +    GC + 
Sbjct: 579 YIVSRDIAESVGKLFKEGNLKMFKLEDVAMGIWIAELTKHGLEPHYENDGRIISDGCKDG 638

Query: 674 YFTAHYQSPRQILCLWDKLARGHAHCC 679
           Y  AHYQSP ++ CLW K        C
Sbjct: 639 YVVAHYQSPAEMTCLWRKYQETKRSLC 639

BLAST of CsGy6G035950 vs. TrEMBL
Match: tr|A0A0A0KKS2|A0A0A0KKS2_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G524710 PE=4 SV=1)

HSP 1 Score: 1363.6 bits (3528), Expect = 0.0e+00
Identity = 681/681 (100.00%), Positives = 681/681 (100.00%), Query Frame = 0

Query: 1   MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGLXXXXXX 60
           MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGLXXXXXX
Sbjct: 1   MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGLXXXXXX 60

Query: 61  XXXXXXXXXXLSSVYKDTFHRKLEDNQHLEAPLTPKKEPLEEVNNVTGPIKPIKHKYGRI 120
           XXXXXXXXXXLSSVYKDTFHRKLEDNQHLEAPLTPKKEPLEEVNNVTGPIKPIKHKYGRI
Sbjct: 61  XXXXXXXXXXLSSVYKDTFHRKLEDNQHLEAPLTPKKEPLEEVNNVTGPIKPIKHKYGRI 120

Query: 121 TGNISSQLNHTNDFSMLETMADEAWTLGSMAWEEVDKFGLNETSESSILEGKPESCPSWI 180
           TGNISSQLNHTNDFSMLETMADEAWTLGSMAWEEVDKFGLNETSESSILEGKPESCPSWI
Sbjct: 121 TGNISSQLNHTNDFSMLETMADEAWTLGSMAWEEVDKFGLNETSESSILEGKPESCPSWI 180

Query: 181 STDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPKVMVSQFMVE 240
           STDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPKVMVSQFMVE
Sbjct: 181 STDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPKVMVSQFMVE 240

Query: 241 LQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSEDEML 300
           LQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSEDEML
Sbjct: 241 LQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSEDEML 300

Query: 301 VDGNHRCEKWLRSDVTDSKESKTTSWFRRFIGREQKPEVTWPFPFMEGRLFILTLRAGVD 360
           VDGNHRCEKWLRSDVTDSKESKTTSWFRRFIGREQKPEVTWPFPFMEGRLFILTLRAGVD
Sbjct: 301 VDGNHRCEKWLRSDVTDSKESKTTSWFRRFIGREQKPEVTWPFPFMEGRLFILTLRAGVD 360

Query: 361 GYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATALPTSHPSFSPQRVLEMS 420
           GYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATALPTSHPSFSPQRVLEMS
Sbjct: 361 GYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATALPTSHPSFSPQRVLEMS 420

Query: 421 EKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVMSSNVVVRFFVALNPRKE 480
           EKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVMSSNVVVRFFVALNPRKE
Sbjct: 421 EKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVMSSNVVVRFFVALNPRKE 480

Query: 481 VNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFVRVET 540
           VNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFVRVET
Sbjct: 481 VNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFVRVET 540

Query: 541 VLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIAK 600
           VLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIAK
Sbjct: 541 VLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIAK 600

Query: 601 YIVSQHENKSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQSP 660
           YIVSQHENKSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQSP
Sbjct: 601 YIVSQHENKSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQSP 660

Query: 661 RQILCLWDKLARGHAHCCNFR 682
           RQILCLWDKLARGHAHCCNFR
Sbjct: 661 RQILCLWDKLARGHAHCCNFR 681

BLAST of CsGy6G035950 vs. TrEMBL
Match: tr|A0A1S3AZQ8|A0A1S3AZQ8_CUCME (hydroxyproline O-galactosyltransferase GALT2 OS=Cucumis melo OX=3656 GN=LOC103484337 PE=4 SV=1)

HSP 1 Score: 1337.8 bits (3461), Expect = 0.0e+00
Identity = 665/681 (97.65%), Positives = 675/681 (99.12%), Query Frame = 0

Query: 1   MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGLXXXXXX 60
           MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGLXXXXXX
Sbjct: 1   MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGLXXXXXX 60

Query: 61  XXXXXXXXXXLSSVYKDTFHRKLEDNQHLEAPLTPKKEPLEEVNNVTGPIKPIKHKYGRI 120
           XXXXXXXXXXLSSVYKDTFHRKLEDN+HLEAPLTPKKEPLEEVNNVTGPIKPI+HKYGRI
Sbjct: 61  XXXXXXXXXXLSSVYKDTFHRKLEDNEHLEAPLTPKKEPLEEVNNVTGPIKPIQHKYGRI 120

Query: 121 TGNISSQLNHTNDFSMLETMADEAWTLGSMAWEEVDKFGLNETSESSILEGKPESCPSWI 180
           TGNISS LNHTNDFSMLE MADEAWTLG MAWEE+DKFGLNET+ESSILEGKPESCPSWI
Sbjct: 121 TGNISSLLNHTNDFSMLEKMADEAWTLGLMAWEEIDKFGLNETAESSILEGKPESCPSWI 180

Query: 181 STDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPKVMVSQFMVE 240
           STDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDP VMVSQFMVE
Sbjct: 181 STDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPNVMVSQFMVE 240

Query: 241 LQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSEDEML 300
           LQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSS+DEML
Sbjct: 241 LQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEML 300

Query: 301 VDGNHRCEKWLRSDVTDSKESKTTSWFRRFIGREQKPEVTWPFPFMEGRLFILTLRAGVD 360
           VDGN RCEKWLRSDVTD+KESKTTSWF+RFIGREQKPEVTWPFPFMEGRLFILTLRAGVD
Sbjct: 301 VDGNRRCEKWLRSDVTDTKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGVD 360

Query: 361 GYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATALPTSHPSFSPQRVLEMS 420
           GYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYAT+LPTSHPSFSPQRVLEMS
Sbjct: 361 GYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPQRVLEMS 420

Query: 421 EKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVMSSNVVVRFFVALNPRKE 480
           EKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAV SSNVVVRFFVALNPRKE
Sbjct: 421 EKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVALNPRKE 480

Query: 481 VNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFVRVET 540
           VNAVLK+EAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFVRVET
Sbjct: 481 VNAVLKREAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFVRVET 540

Query: 541 VLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIAK 600
           VLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIAK
Sbjct: 541 VLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIAK 600

Query: 601 YIVSQHENKSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQSP 660
           YIVSQHEN+SLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQSP
Sbjct: 601 YIVSQHENRSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQSP 660

Query: 661 RQILCLWDKLARGHAHCCNFR 682
           RQILCLWDKLARGHAHCCNFR
Sbjct: 661 RQILCLWDKLARGHAHCCNFR 681

BLAST of CsGy6G035950 vs. TrEMBL
Match: tr|A0A2I4GUV9|A0A2I4GUV9_9ROSI (hydroxyproline O-galactosyltransferase GALT2 OS=Juglans regia OX=51240 GN=LOC109011081 PE=4 SV=1)

HSP 1 Score: 1118.2 bits (2891), Expect = 0.0e+00
Identity = 537/683 (78.62%), Positives = 595/683 (87.12%), Query Frame = 0

Query: 1   MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGL-XXXXX 60
           MK+ K+EPP ARR +LSH LL I VLYLVFISFKFP FLEIAA LSGD+S  G       
Sbjct: 1   MKRPKSEPPGARRFKLSHFLLGIAVLYLVFISFKFPHFLEIAAMLSGDDSYVGTDGTMRG 60

Query: 61  XXXXXXXXXXXLSSVYKDTFHRKLEDNQHLEAPLTPKKEPLEEVNNVTGPIKPIKHKYGR 120
                       +SVYKD FHRKLEDNQ+ +AP  P +EPLEE  + + PIKP++H+YGR
Sbjct: 61  DSEDPDLSKPFFTSVYKDAFHRKLEDNQNQDAPFRPSQEPLEEKKSASRPIKPLQHRYGR 120

Query: 121 ITGNISSQLNHTNDFSMLETMADEAWTLGSMAWEEVDKFGLNETSESSILEGKPESCPSW 180
           ITG I  + N T D S+LE MADEAWTLG  AWEE+DK    ET ESSILEGKPESCPSW
Sbjct: 121 ITGEIMKRRNRTIDLSVLERMADEAWTLGLKAWEELDKVDEKETGESSILEGKPESCPSW 180

Query: 181 ISTDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKV-GGDPKVMVSQFM 240
           IS  G++L +GD LM LPCGLAAGSS+T++GTPH AHQEYVPQL K+ GG   VMVSQFM
Sbjct: 181 ISISGEEL-KGDRLMILPCGLAAGSSVTVVGTPHYAHQEYVPQLAKLRGGSAMVMVSQFM 240

Query: 241 VELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSEDE 300
           VELQGLK VDGE+PPKILHLNPRLKGDWSKRPVIEHNTCYRMQWG AQRCDGLPS+++D+
Sbjct: 241 VELQGLKVVDGEEPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGKAQRCDGLPSNNDDD 300

Query: 301 MLVDGNHRCEKWLRSDVTDSKESKTTSWFRRFIGREQKPEVTWPFPFMEGRLFILTLRAG 360
           MLVDG+ RCEKW+R+D+ DSKESKTTSWF+RFIGREQKPEVTWPFPF+EGRLFILTLRAG
Sbjct: 301 MLVDGHGRCEKWMRNDIVDSKESKTTSWFKRFIGREQKPEVTWPFPFVEGRLFILTLRAG 360

Query: 361 VDGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATALPTSHPSFSPQRVLE 420
           VDGYHI+VGGRH+TSF YR GFTLEDATGLA+KGDVD+HS YAT+LPTSHPSFSP RVLE
Sbjct: 361 VDGYHISVGGRHVTSFPYRTGFTLEDATGLAIKGDVDVHSVYATSLPTSHPSFSPHRVLE 420

Query: 421 MSEKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVMSSNVVVRFFVALNPR 480
            SEKWK  PLPKS V LF+GVLSA NHFAERMAVRKTWMQ+SA+ SS+VVVRFFVALNPR
Sbjct: 421 FSEKWKVNPLPKSKVPLFVGVLSAPNHFAERMAVRKTWMQTSAIKSSDVVVRFFVALNPR 480

Query: 481 KEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFVRV 540
           KEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGV N+TA+YIMKCDDDTFVRV
Sbjct: 481 KEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVQNVTAAYIMKCDDDTFVRV 540

Query: 541 ETVLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDI 600
           +TVLK+IEGISS KSLYMGNLNLLHRPLR GKWAVTYEEWPEEVYPPYANGPGY++SIDI
Sbjct: 541 DTVLKEIEGISSNKSLYMGNLNLLHRPLRSGKWAVTYEEWPEEVYPPYANGPGYVISIDI 600

Query: 601 AKYIVSQHENKSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQ 660
           AKYI+SQH N+SLR+FKMEDVSMGMWVEQFNS+ A VQYSH+WKFCQYGC+E+YFTAHYQ
Sbjct: 601 AKYIISQHGNRSLRLFKMEDVSMGMWVEQFNSSKAAVQYSHSWKFCQYGCLENYFTAHYQ 660

Query: 661 SPRQILCLWDKLARGHAHCCNFR 682
           SPRQ++CLW  LARG AHCCNFR
Sbjct: 661 SPRQMICLWGNLARGRAHCCNFR 682

BLAST of CsGy6G035950 vs. TrEMBL
Match: tr|M5WLY9|M5WLY9_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_4G086100 PE=4 SV=1)

HSP 1 Score: 1104.7 bits (2856), Expect = 0.0e+00
Identity = 532/685 (77.66%), Positives = 595/685 (86.86%), Query Frame = 0

Query: 1   MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGLXXXXXX 60
           MK++K EP VARR +L HLL  +  LYL+FIS KFP+FLEIA  +SGD+   GL      
Sbjct: 1   MKRLKIEPSVARRFKLQHLLFALAALYLIFISVKFPQFLEIAKAMSGDDGYVGLDLAKVQ 60

Query: 61  XXXXXXXXXXL-SSVYKDTFHRKLEDNQHLEAPLTPKKEPLEEVNNVTGPIKPIKHKYGR 120
                     L SSVYKDTFHRKLED Q  +AP+ P KEPLEE  + + PI+P++H+YGR
Sbjct: 61  DSQDGDLSKPLFSSVYKDTFHRKLED-QSQDAPVRPSKEPLEEKKSESKPIRPLQHRYGR 120

Query: 121 ITGNISSQLNHTNDFSMLETMADEAWTLGSMAWEEVDKFGLNETSESSILEGKPESCPSW 180
           ITG I  Q N TN+ S+LE MADEAWTLG  AWEEVDK    E  ESSI+EGKPESCPSW
Sbjct: 121 ITGEILRQRNRTNELSVLERMADEAWTLGLNAWEEVDKHDGKEIGESSIVEGKPESCPSW 180

Query: 181 ISTDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKV-GGDPKVMVSQFM 240
           +S  G++L  GD LMFLPCGLAAGSS+T++GT H AHQEYVPQL K+  GD  VMVSQFM
Sbjct: 181 LSMSGEELAMGDKLMFLPCGLAAGSSVTVVGTSHYAHQEYVPQLAKLRRGDGIVMVSQFM 240

Query: 241 VELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSEDE 300
           VELQGLKSVDGEDPPKILHLNPRLKGDWS RPVIEHNTCYRMQWG+AQRCDGLPS + ++
Sbjct: 241 VELQGLKSVDGEDPPKILHLNPRLKGDWSHRPVIEHNTCYRMQWGSAQRCDGLPSKNNED 300

Query: 301 MLVDGNHRCEKWLRSDVTDSKES--KTTSWFRRFIGREQKPEVTWPFPFMEGRLFILTLR 360
           MLVDG  RCEKW+R+D+ DSKES  KTTSWF+RFIGREQKPEVTWPFPF EGRLFILT+R
Sbjct: 301 MLVDGYGRCEKWMRNDMVDSKESKTKTTSWFKRFIGREQKPEVTWPFPFTEGRLFILTIR 360

Query: 361 AGVDGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATALPTSHPSFSPQRV 420
           AGVDG+HI+VGGRH+TSF YR GFTLEDATGLA+KGDVD+HS YAT+LP SHPSFSPQRV
Sbjct: 361 AGVDGFHISVGGRHVTSFPYRTGFTLEDATGLAIKGDVDVHSVYATSLPASHPSFSPQRV 420

Query: 421 LEMSEKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVMSSNVVVRFFVALN 480
           LEMSEKWK++PLPKS V LFIGVLSATNHFAERMAVRKTWMQSS + SS+VVVRFFVALN
Sbjct: 421 LEMSEKWKARPLPKSPVRLFIGVLSATNHFAERMAVRKTWMQSSVIKSSDVVVRFFVALN 480

Query: 481 PRKEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFV 540
           PRKEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTI+ICEFGV N+TA+YIMKCDDDTFV
Sbjct: 481 PRKEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTISICEFGVQNVTAAYIMKCDDDTFV 540

Query: 541 RVETVLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSI 600
           RV+TVLK+IEGISSKKSLYMGNLNLLHRPLR GKWAVTYEEWPEEVYPPYANGPGYI+SI
Sbjct: 541 RVDTVLKEIEGISSKKSLYMGNLNLLHRPLRSGKWAVTYEEWPEEVYPPYANGPGYIISI 600

Query: 601 DIAKYIVSQHENKSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAH 660
           DIAK+++SQH ++SLR+FKMEDVSMGMWVEQFNS++ATVQYSHNWKFCQYGCME+Y+TAH
Sbjct: 601 DIAKFVISQHGSRSLRLFKMEDVSMGMWVEQFNSSMATVQYSHNWKFCQYGCMENYYTAH 660

Query: 661 YQSPRQILCLWDKLARGHAHCCNFR 682
           YQSPRQ++CLWDKLARG   CCNFR
Sbjct: 661 YQSPRQMICLWDKLARGRVQCCNFR 684

BLAST of CsGy6G035950 vs. TrEMBL
Match: tr|A0A061GKH2|A0A061GKH2_THECC (Galactosyltransferase family protein isoform 1 OS=Theobroma cacao OX=3641 GN=TCM_029407 PE=4 SV=1)

HSP 1 Score: 1097.4 bits (2837), Expect = 0.0e+00
Identity = 528/683 (77.31%), Positives = 588/683 (86.09%), Query Frame = 0

Query: 1   MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGLXXXXXX 60
           MK+VK+E    RR +LSH LL IG LYL+FI+FKFP FLEIAA LSGD S + L      
Sbjct: 1   MKRVKSELSTGRRFKLSHFLLGIGGLYLIFIAFKFPHFLEIAAVLSGDGSYDELDGKVVG 60

Query: 61  XXXXXXXXXXL-SSVYKDTFHRKLEDNQHLEAPLTPKKEPLEEVNNVTGPIKPIKHKYGR 120
                     L +SVYKDTFHRKLEDN + +APL P KEPLEE      PIKP++H+YGR
Sbjct: 61  DVNDADLNKPLVNSVYKDTFHRKLEDNLNQDAPLRPSKEPLEEGKGRLQPIKPLQHRYGR 120

Query: 121 ITGNISSQLNHTNDFSMLETMADEAWTLGSMAWEEVDKFGLNETSESSILEGKPESCPSW 180
           ITG I  ++N T+D S+LE MADEAWTLG  AWEEVDKF   +  ++S+ +GKPESCPSW
Sbjct: 121 ITGEIMRRMNKTSDLSVLERMADEAWTLGLKAWEEVDKFDGKKIGQNSLFDGKPESCPSW 180

Query: 181 ISTDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVG-GDPKVMVSQFM 240
           +S  G+ L  GD LMFLPCGL AGSSIT++GTP  AHQE+VPQL ++  GD  VMVSQFM
Sbjct: 181 LSVSGEDLASGDRLMFLPCGLKAGSSITVVGTPRYAHQEFVPQLARLRLGDGLVMVSQFM 240

Query: 241 VELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSEDE 300
           VELQGLKSVDGEDPPKILHLNPRLKGDWS RPVIEHNTCYRMQWGTAQRCDGL S  +++
Sbjct: 241 VELQGLKSVDGEDPPKILHLNPRLKGDWSHRPVIEHNTCYRMQWGTAQRCDGLRSKDDED 300

Query: 301 MLVDGNHRCEKWLRSDVTDSKESKTTSWFRRFIGREQKPEVTWPFPFMEGRLFILTLRAG 360
           MLVDG+ RCEKW+R DV DSKESKTTSWF+RFIGREQKPEVTWPFPF EGRLFILTLRA 
Sbjct: 301 MLVDGHRRCEKWIRDDVADSKESKTTSWFKRFIGREQKPEVTWPFPFAEGRLFILTLRAA 360

Query: 361 VDGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATALPTSHPSFSPQRVLE 420
           VDGYHINVGGRH+TSF YR GF+LEDATGLA+KGDVD+HS YAT+LPTSHPSFSPQRVLE
Sbjct: 361 VDGYHINVGGRHVTSFPYRTGFSLEDATGLAIKGDVDVHSVYATSLPTSHPSFSPQRVLE 420

Query: 421 MSEKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVMSSNVVVRFFVALNPR 480
           MS KWK+ PLP+ S+ LFIGVLSATNHFAERMAVRKTWMQSSA+ SSNVVVRFFVALN R
Sbjct: 421 MSPKWKAYPLPRRSIQLFIGVLSATNHFAERMAVRKTWMQSSAIKSSNVVVRFFVALNTR 480

Query: 481 KEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFVRV 540
           KEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGV N++A+YIMKCDDDTFVRV
Sbjct: 481 KEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVQNVSAAYIMKCDDDTFVRV 540

Query: 541 ETVLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDI 600
           +TVLK+I+GIS KKSLYMGNLNLLHRPLR+GKWAVTYEEWPEEVYPPYANGPGYI+S DI
Sbjct: 541 DTVLKEIDGISPKKSLYMGNLNLLHRPLRNGKWAVTYEEWPEEVYPPYANGPGYIISSDI 600

Query: 601 AKYIVSQHENKSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQ 660
           AK+I+SQH N+ LR+FKMEDVSMGMWVEQFNS+  TVQYSHNWKFCQYGCM DY+TAHYQ
Sbjct: 601 AKFIISQHGNRKLRLFKMEDVSMGMWVEQFNSS-TTVQYSHNWKFCQYGCMVDYYTAHYQ 660

Query: 661 SPRQILCLWDKLARGHAHCCNFR 682
           SPRQ++CLWDKL+RG AHCCNFR
Sbjct: 661 SPRQMICLWDKLSRGRAHCCNFR 682
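
The percentages in an HSP summary line follow directly from the counts it reports. The sketch below parses the summary line of the Theobroma cacao hit above and recomputes both percentages. The helper names `parse_hsp_stats` and `percent` are hypothetical, chosen for illustration, and the regular expression assumes the summary line keeps the layout shown on this page.

```python
import re

# The "i" in "Positives" is optional so a misspelled "Postives" label also parses.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from one HSP summary line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, aligned, ident_pct, pos, _pos_aligned, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(aligned),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


def percent(count, total):
    """Percentage of aligned columns, rounded to two decimals as in the report."""
    return round(100.0 * count / total, 2)


line = "Identity = 528/683 (77.31%), Positives = 588/683 (86.09%), Query Frame = 0"
stats = parse_hsp_stats(line)
recomputed_identity = percent(stats["identical"], stats["aligned_length"])
recomputed_positives = percent(stats["positives"], stats["aligned_length"])
```

Both recomputed values should agree with the percentages printed in the summary line. A mismatch would suggest the line was damaged during extraction.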

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_011658301.1  0.0e+00   100.00    PREDICTED: probable beta-1,3-galactosyltransferase 20 [Cucumis sativus] >KGN4943...
XP_008439584.1  0.0e+00   97.65     PREDICTED: hydroxyproline O-galactosyltransferase GALT2 [Cucumis melo]
XP_022146632.1  0.0e+00   89.30     hydroxyproline O-galactosyltransferase GALT2 [Momordica charantia]
XP_023517655.1  0.0e+00   88.86     hydroxyproline O-galactosyltransferase GALT2-like [Cucurbita pepo subsp. pepo]
XP_022926607.1  0.0e+00   89.00     hydroxyproline O-galactosyltransferase GALT2-like [Cucurbita moschata]

Match Name   E-value   Identity  Description
AT4G21060.1  7.5e-265  66.36     Galactosyltransferase family protein
AT5G62620.1  3.0e-197  50.95     Galactosyltransferase family protein
AT1G74800.1  2.6e-196  58.14     Galactosyltransferase family protein
AT1G27120.1  3.0e-189  49.06     Galactosyltransferase family protein
AT1G26810.1  1.2e-89   36.33     galactosyltransferase1

Match Name             E-value   Identity  Description
sp|A7XDQ9|B3GTK_ARATH  5.3e-268  65.36     Hydroxyproline O-galactosyltransferase GALT2 OS=Arabidopsis thaliana OX=3702 GN=...
sp|Q9LV16|B3GTJ_ARATH  5.4e-196  50.95     Hydroxyproline O-galactosyltransferase GALT6 OS=Arabidopsis thaliana OX=3702 GN=...
sp|Q8RX55|B3GTI_ARATH  4.6e-195  58.14     Hydroxyproline O-galactosyltransferase GALT5 OS=Arabidopsis thaliana OX=3702 GN=...
sp|Q8GXG6|B3GTH_ARATH  5.5e-188  49.06     Hydroxyproline O-galactosyltransferase GALT4 OS=Arabidopsis thaliana OX=3702 GN=...
sp|Q8L7F9|B3GTF_ARATH  2.2e-88   36.33     Beta-1,3-galactosyltransferase GALT1 OS=Arabidopsis thaliana OX=3702 GN=GALT1 PE...

Match Name                      E-value  Identity  Description
tr|A0A0A0KKS2|A0A0A0KKS2_CUCSA  0.0e+00  100.00    Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G524710 PE=4 SV=1
tr|A0A1S3AZQ8|A0A1S3AZQ8_CUCME  0.0e+00  97.65     hydroxyproline O-galactosyltransferase GALT2 OS=Cucumis melo OX=3656 GN=LOC10348...
tr|A0A2I4GUV9|A0A2I4GUV9_9ROSI  0.0e+00  78.62     hydroxyproline O-galactosyltransferase GALT2 OS=Juglans regia OX=51240 GN=LOC109...
tr|M5WLY9|M5WLY9_PRUPE          0.0e+00  77.66     Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_4G086100 PE=4 SV=1
tr|A0A061GKH2|A0A061GKH2_THECC  0.0e+00  77.31     Galactosyltransferase family protein isoform 1 OS=Theobroma cacao OX=3641 GN=TCM...
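
E-value and percent identity do not always rank hits in the same order. The sketch below uses the five Arabidopsis gene-model hits (AT... identifiers) from the tables above and sorts them both ways. The tuple layout and the variable names are assumptions made for illustration only.

```python
# (match name, E-value, percent identity), copied from the Arabidopsis hit table.
hits = [
    ("AT4G21060.1", 7.5e-265, 66.36),
    ("AT5G62620.1", 3.0e-197, 50.95),
    ("AT1G74800.1", 2.6e-196, 58.14),
    ("AT1G27120.1", 3.0e-189, 49.06),
    ("AT1G26810.1", 1.2e-89, 36.33),
]

# Smaller E-value means a more significant hit.
by_evalue = [name for name, evalue, _ in sorted(hits, key=lambda h: h[1])]

# Higher identity means more identical aligned residues.
by_identity = [name for name, _, ident in sorted(hits, key=lambda h: h[2], reverse=True)]

# Hits whose rank differs between the two orderings.
rank_changes = [n for n in by_evalue if by_evalue.index(n) != by_identity.index(n)]
```

In this data AT5G62620.1 and AT1G74800.1 swap places: the former has the smaller E-value, the latter the higher identity. E-value also depends on alignment length and score, so the two measures need not agree.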
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0008378  galactosyltransferase activity
GO:0030246  carbohydrate binding
Vocabulary: Biological Process
Term        Definition
GO:0006486  protein glycosylation
Vocabulary: Cellular Component
Term        Definition
GO:0016020  membrane
Vocabulary: INTERPRO
Term       Definition
IPR013320  ConA-like_dom_sf
IPR002659  Glyco_trans_31
IPR001079  Galectin_CRD
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006486      protein glycosylation
cellular_component  GO:0005794      Golgi apparatus
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0016020      membrane
molecular_function  GO:0030246      carbohydrate binding
molecular_function  GO:0008378      galactosyltransferase activity

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CsGy6G035950.1  CsGy6G035950.1  mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR Term   IPR Description                                          Source       Source Term         Source Description                        Alignment
IPR001079  Galectin, carbohydrate recognition domain                SMART        SM00908             Gal_bind_lectin_2                         coord: 196..401; e-value: 2.0E-24; score: 97.2
IPR001079  Galectin, carbohydrate recognition domain                PFAM         PF00337             Gal-bind_lectin                           coord: 194..400; e-value: 5.7E-48; score: 161.8
IPR001079  Galectin, carbohydrate recognition domain                PROSITE      PS51304             GALECTIN                                  coord: 192..402; score: 30.649
IPR001079  Galectin, carbohydrate recognition domain                CDD          cd00070             GLECT                                     coord: 195..398; e-value: 1.48926E-21; score: 90.3859
None       No IPR available                                         GENE3D       G3DSA:2.60.120.200  -                                         coord: 193..302; e-value: 4.9E-11; score: 44.2
None       No IPR available                                         GENE3D       G3DSA:3.90.550.50   -                                         coord: 425..642; e-value: 2.3E-5; score: 25.4
None       No IPR available                                         GENE3D       G3DSA:2.60.120.200  -                                         coord: 316..399; e-value: 1.6E-9; score: 39.3
None       No IPR available                                         PANTHER      PTHR11214:SF218     SUBFAMILY NOT NAMED                       coord: 117..679
IPR002659  Glycosyl transferase, family 31                          PFAM         PF01762             Galactosyl_T                              coord: 448..628; e-value: 4.1E-33; score: 114.8
IPR002659  Glycosyl transferase, family 31                          PANTHER      PTHR11214           BETA-1,3-N-ACETYLGLUCOSAMINYLTRANSFERASE  coord: 117..679
IPR013320  Concanavalin A-like lectin/glucanase domain superfamily  SUPERFAMILY  SSF49899            Concanavalin A-like lectins/glucanases    coord: 343..399; coord: 195..290
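
The `coord:` entries above can be turned into domain spans for simple checks, such as span length or whether two predicted domains overlap. The sketch below does this for the two Pfam hits. It assumes the coordinates are 1-based, inclusive amino-acid positions on the protein, which this page does not state explicitly. The helper names `parse_coords`, `span_length` and `overlaps` are hypothetical.

```python
import re

COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Return every (start, end) pair found in an Alignment cell."""
    return [(int(start), int(end)) for start, end in COORD.findall(text)]


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def overlaps(a, b):
    """True if two inclusive spans share at least one position."""
    return a[0] <= b[1] and b[0] <= a[1]


# Alignment cells of the two Pfam hits listed in the InterPro table.
pfam_hits = {
    "PF00337": "coord: 194..400",  # Gal-bind_lectin
    "PF01762": "coord: 448..628",  # Galactosyl_T
}

spans = {acc: parse_coords(cell)[0] for acc, cell in pfam_hits.items()}
lengths = {acc: span_length(*span) for acc, span in spans.items()}
domains_overlap = overlaps(spans["PF00337"], spans["PF01762"])
```

Under that assumption, the galectin domain sits N-terminal to the galactosyltransferase domain and the two spans do not overlap. A cell with several `coord:` entries, such as the SUPERFAMILY row, yields one pair per entry.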