Moc01g33540 (gene) Bitter gourd (OHB3-1) v2

Overview
Name: Moc01g33540
Type: gene
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: hydroxyproline O-galactosyltransferase GALT2
Location: chr1: 23643055 .. 23648641 (+)
RNA-Seq Expression: Moc01g33540
Synteny: Moc01g33540
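
The Location field in the overview can be turned into a span length with a few lines of Python. This is a minimal sketch, not part of the database itself: the helper names `parse_location` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive (the GFF convention), which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Split a 'chrom: start .. end (strand)' string into its four parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1

chrom, start, end, strand = parse_location("chr1: 23643055 .. 23648641 (+)")
print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 5587 bp on the plus strand of chr1; with half-open coordinates the result would be one base shorter.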
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGAAGCTTAAAACCGAACCTCCGGGTGCGCGGAGGCTCAGGTTGTCGCATCTTCTTCTCGTAATTGGAGTGTTGTATTTAGTTTTCATATCATTCAAGTTTCCGCGTTTTTTGGAAATTGCTGCAACGTTGAGCGGGGATGAAAGTAATGTCGGGTTGGATGGGACAATCGGAGGGGACAGTGAAGGTGTGGATTTTAGCAGAGCATCGTTGAGTTCTGTTTACAAGGATACGTTTCATCGGAAATTGGAAGATAACCAAAACAGGGAAGCACCGGTGACGCCTAAAAAGGAGCCACTGGAAGATGTGAATAATGTGTCCGGACCCATCAAGCCAATTAAACATAAATACGGTCGGATAACTGGTAAAATTTTGAGTCATCAAAACCATACTAATGACTTTTCTATGCTCGAGAAATTGGCAGATGAAGCTTGGACATTAGGCTTGAAGGCTTGGGAAGAAGTGGATAAATTTGGTTTAGATGAGACTGCAGAAAGTTCTATACTCGAGGGAAAGCCCGAGACATGTCCTTCATGGATATCTACGGATGGGAAAAAGATGTTGGAGGGAGATGGAATCATGTTCCTTCCTTGTGGACTTGCTGCAGGTTCATCGATTACAATAATTGGAACCCCCCATCATGCTCACCAGGAGTATGTACCCCAACTTCTGAAGGTGGGAGCTGATCCTATGGTTATGGTTTCACAGTTCATGGTTGAATTGCAAGGGCTGAAAGCAGTCGATGGTGAGGATCCACCAAAGATCCTTCACCTGAATCCACGACTGAAAGGAGATTGGAGTAAACGGCCTGTCATTGAACATAATACATGTTATAGGATGCAGTGGGGAACGGCTCAAAGGTGTGATGGTGTGCCATCAAGTAATGATGATGAAATGCTTGGTGAGATTATCATCCCTTCGCCTTATTTATTTCTTGTGTTTTCACTAGCCCTCAATCATCATTGCGTGACACACAAAAATGGTGGGGGTGCACAAGAATGTAAAATTACTGTACAAGTTATTGTTCTTTTCGATGATAACGGAAGCAAGCAAATGAAGAAATAGTTTCATTTTGGTTTAGACGATGCTGGTAATGTTAAAAGAAGAGAATATTTTCAGCAAGTAAGATTAATTTTTTTTTGGTCATCTACTTATTGGTTAGAGTGTTAGCCCCTTCTTGATAGTCCAAATGTGCATTGGGTTTCTGGATTTATTGTCCTTGTTGAAGTCTAGTCAGGAGTCCTAATTTTGGCACTATATGGTGAAAGAGGTAAAAATGTACAATCTCATATTGATTTATTTATTTTTATGTGAAATTGAATCTATACACGATGGCATGTAAAAATGTACAATCTCTTATTGATTTATTATTTACTGTTAATTTGAATCTATACATGCTGGCAGGTAAAAATGTACAATCTCTTATTGATTTATTATTTTCTGTTAATTTGAATCTATACATGATGATTATGATGCTCACTTCGAAAAATTGTTTAATGTAGCAGCCCCTCCCAACAATTAGATGCATGGCCAAAAAATAATTTTTAATTTATCTCTCATGCATCCAACAATATAAAATTGTTTCTCATGTTATTTCAACCCCTTTAAATAATTGTTCTACCTTATCACATACAAAGTTAGTTTTCTAGTCCTGCTTTGATATTGCTATAGTAGAGTAGATGGAAGTTTATCAGATTCTCTATAATTAATTTTCGAGTCACCTTTATCCCCAATTCCTCACTTTTGTTCATTCTTCAAGCTGTGCTATAGTGCCATATACCATCTGATTTTTTTGCTCCCTTTGATTCTTTTAATTTGACAGTTGATGGAAATCGTCGATGTGAAAAATGGGTGAGGAGTGATATTATAGACTCAAAAGAATCGAAAACAACCTCATGGTTCAAGAGATTCATAGGGCGAGAGCAGAAGCCAGAAGTAACCTGGCCATTTCCTTTTATGGAGGGCAGATTGTTTATCCTGACACTACGGGCTGGTGT
TGATGGATACCATATTAATGTTGGTGGTCGGCACTTGACTTCTTTTCCCTATCGCCTCGTAAGTATCTGAAGAAGGGGGAAGATGCAGATAGCTGTTTCACTTTCTATTAATATTCATTTGTGGTTGATGTTTGTCTTACTGTTGCAGTTGGTACAGATTTTCTCTCCCATCTTGTTAATATTAATTTTCTTGTATCTTTTTTTGGCTTTCATTTAGGGGTTTACGCTTGAAGATGCAACTGGATTAGCAGTTAAAGGAGACGTGGACATTCATTCCGCATATGCTACATCTCTTCCTACTACTCATCCAAGCTTCTCTCCCCAACGAGTTCTTGAGATGTCAGAGAAGTGGAAATCTCAGCCTTTACCTAAGAGTTCTGTTCGTCTTTTTATTGGTGTTCTATCTGCCACTAATCACTTTGCGGAGCGAATGGCTGTTAGGAAAACTTGGATGCAATCTTCAGCTGTCAAGGCATCAAATGTAGTTGTTCGCTTCTTTGTTGCACTGGTATGGCTCTTGTATTCATCCTTTTACGGTACATATATTAAAGTAGATTAGAAAGAAGAATAGTAGTTTCTGTAATTGTTGGTCTTTTTAATGTCTCCACATGATGAAGAATCTCATAACTTTTTATTTAAATCTATCCTGGAAATATATTTTTTTTGGATGGAAACAAAAGCTATATGTGCTATGTTCCCCTGACAATGGAATAGGTTTACATATGAAAAGGAAAACAATAAGAAAGCACGCAAGGTTTAGTTACACTGATAAACTTTTTTTTAAAAGAAGCATTTGTGGTCTCTTCTTCCGCCCAACTTCTCCATTTAAAAGAAGCATTTGTGGACTTAAATGTTGTATTGTAATCTTGTGAGTTGGATGCTACAGCTTTACTAACATCACTTTTAAGGAGCTAAGTCAAGTAAAAAAAGGTTAGACATCAAGTTTTGTGATGCAGATTTAAAGCAACGATTTAATTTATAATTATATTTCTCCCAACTCCTATTGTCCTGTATATGTAATATGCTATAGACTTGTTTACTTACTGGAAATAGAGTTATTTTCTTTCATCTTCACCAATAATGTTTTTGTACCCCTGGAGTGATTCACTATGTCAATTCACTTATAGGTCTCTCTTATTTCATTCCAATTGATTTTTATTTTGGTTTTGCTTCAAATTCATTCTAGTTCAGTAGATAAGGTGTAATGACTGTTTATGAAGCCAAATCTAATTAAATAAAATTTTTGAAAACATTGATTCTCAGAAATTATATTTTTTATTGCAATATCCTTTTTGTTTTTATAAAAAAACTTTATTGCAATATCCTTTTTGTAGTGGTGCTTATGATTTTATGGTTTTAGAAGTTTTGTACTCCAGAACATTGTATTTGGGCTCTTTAAGAGATTGCTAGAACTGTTCCCTTTGAACAGTAACTCTTCTAAAGTTGAAGGAATTTGAACCTTGAACAGATTGATTAAATCCAGTCCCTTATGGCAGATATTCCTTTAATCTCGATATGTAATATAACTGTCCCAATCTACACACATGATATAGAGTTGGTTTACGTTTATAATACTCGTTGCCTGTGCGTATAGTCAAAAATATTTTTGTTAGTGCTTCCTAGGAAGATTCATATGCCACTCAAATGGATGAAGGAGCATCAGTAACTACTATACTACCCACACCTCATCTACCCTTGCAATCTCAAGTAATAGGAATGATCCTTTACCTTTGCTGTATTTTAACCTCACAGTCCCACAATGGCACAATGAAGCTTTGTGGAAATGCCATTCTTGTTGACCAGAAGGTGACAGCGGCTTATGATATTAAAAGTAGAAGCATTAATGGGGAACTTAGTGGTTTTCTTCCTGGGGAAGAGATGTTTGCCATGCATCTTTTCTTTGTTATGGAAGCATTCTCTTGGTTGCTATCCTATATGGCAAACAAAGGGAATTTAGATTTTAAGAAATGACTTCTATATGTTTGTAGATGACCTAATAC
TTTCCAAAGGGAAAATCCTCATTAAATTCAGAAGTAGCACGTAGTTATAATTCTATCTTATTCATTTGCCGAGTGGTTCACACTTTAATGGGAGATTATGTCTCAGTTGCAGTTCTCCCAAGTCCCATCCTCCTTGGCACTCAGGATTACTTAGCATATAGTCATCTTATTTTTCATTCATGTTATTTCGTTCTTATTTTCGTTATTGTTATTTTTTATTGCTCCTTATACGTATACTCACACATGTAGAATCCACGGAAGGAGGTCAATGCTGTGCTGAAGAAGGAAGCTGAATACTTTGGTGATATTGTGATCCTGCCCTTCATGGATCGCTATGAGCTTGTTGTTCTCAAGACTATTGCTATATGCGAGTTTGGGGTGAGTTCGCATTTTACTTCTTCTGTTGATTATTATTTGGATATTTTTCATAGGACTTCGTAATATAAAGTCCACTTTTTTCCTTCCAGACTGTGAATTTGACGGCTTCATATGTTATGAAATGTGATGATGACACCTTTGTGAGGGTGGACACTGTTTTAAAACAGATCAGCGGTGTTTCATCCAAGAAGTCTCTATACATGGGCAATCTCAACCTCTTGCATCGCCCTCTCCGACATGGAAAATGGGCAGTCACTTATGAGGTAAATATGTGGGCATCGAAAATTTGGCTATTCTGTGCTTGTTCCATTTGGTGCCATCCTTGAGTTACAAAGGCTAGCCCACCAGTTTTTCTGGATAACTTTTCATCAATCAGATCATGTTTAGCACAGTGTTGCTACTTGAAAATATAATTGGATGACTTTTCATCAATCTGATCGAAAGAACTTAGTATTATCATTTATCAACAAGAAGCTATAAACTTATGGAGGATCAGATAGCTCTTCCTCTTGATTATGCCAATTCATAAAAACATAACCCCCCCCCCCCCCCCCCCCCCCCAACACTTACATTCTCTAAACATTCCTTCTTGTCTCTTTAACTCCTATGTCTCATCATTTAAAAACCGAAGTGGGGCCCACTTATCTAACTCACAATCTAAACACCGCCTCGACTGACCCGTTTCCTCCGATATACACTTATTCTTGGTGTCCTGTCATTTTCTCTCAACTTTTTAACAAAAAGCTTTCAACCGTCTTTTTGTATCCTCCTTTATTATGCTAATGATTTATTCTCATGTATAATAACAGGAATGGCCAGAAGAAGTGTATCCTCCATATGCCAATGGGCCAGGATATATCATTTCTAGTGACATTGCTAAATACATTGTCTCCCAACATGAAAACAGGAGCCTGAGGGTCAGTTCGAAGACGAACCTTATTGTCTGTTTCTTTGTCTGCACCTTTTTTGTGCACTGAATGGGTTGTTTTGGTCTGGCAGATATTCAAGATGGAGGATGTGAGCATGGGAATGTGGGTTGAGCAGTTCAACGGTACCGTGGCCGCGGTCCAGTACTCTCACAACTGGAAATTCTGCCAGTATGGGTGTATGGAGGACTATTTTACGGCACATTATCAATCTCCAAGGCAGATAATCTGTCTGTGGGATAAGCTGGGGCAGGGTCACGCTCACTGTTGCAACTTCAGGTGA

mRNA sequence

ATGAAGAAGCTTAAAACCGAACCTCCGGGTGCGCGGAGGCTCAGGTTGTCGCATCTTCTTCTCGTAATTGGAGTGTTGTATTTAGTTTTCATATCATTCAAGTTTCCGCGTTTTTTGGAAATTGCTGCAACGTTGAGCGGGGATGAAAGTAATGTCGGGTTGGATGGGACAATCGGAGGGGACAGTGAAGGTGTGGATTTTAGCAGAGCATCGTTGAGTTCTGTTTACAAGGATACGTTTCATCGGAAATTGGAAGATAACCAAAACAGGGAAGCACCGGTGACGCCTAAAAAGGAGCCACTGGAAGATGTGAATAATGTGTCCGGACCCATCAAGCCAATTAAACATAAATACGGTCGGATAACTGGTAAAATTTTGAGTCATCAAAACCATACTAATGACTTTTCTATGCTCGAGAAATTGGCAGATGAAGCTTGGACATTAGGCTTGAAGGCTTGGGAAGAAGTGGATAAATTTGGTTTAGATGAGACTGCAGAAAGTTCTATACTCGAGGGAAAGCCCGAGACATGTCCTTCATGGATATCTACGGATGGGAAAAAGATGTTGGAGGGAGATGGAATCATGTTCCTTCCTTGTGGACTTGCTGCAGGTTCATCGATTACAATAATTGGAACCCCCCATCATGCTCACCAGGAGTATGTACCCCAACTTCTGAAGGTGGGAGCTGATCCTATGGTTATGGTTTCACAGTTCATGGTTGAATTGCAAGGGCTGAAAGCAGTCGATGGTGAGGATCCACCAAAGATCCTTCACCTGAATCCACGACTGAAAGGAGATTGGAGTAAACGGCCTGTCATTGAACATAATACATGTTATAGGATGCAGTGGGGAACGGCTCAAAGGTGTGATGGTGTGCCATCAAGTAATGATGATGAAATGCTTGTTGATGGAAATCGTCGATGTGAAAAATGGGTGAGGAGTGATATTATAGACTCAAAAGAATCGAAAACAACCTCATGGTTCAAGAGATTCATAGGGCGAGAGCAGAAGCCAGAAGTAACCTGGCCATTTCCTTTTATGGAGGGCAGATTGTTTATCCTGACACTACGGGCTGGTGTTGATGGATACCATATTAATGTTGGTGGTCGGCACTTGACTTCTTTTCCCTATCGCCTCGGGTTTACGCTTGAAGATGCAACTGGATTAGCAGTTAAAGGAGACGTGGACATTCATTCCGCATATGCTACATCTCTTCCTACTACTCATCCAAGCTTCTCTCCCCAACGAGTTCTTGAGATGTCAGAGAAGTGGAAATCTCAGCCTTTACCTAAGAGTTCTGTTCGTCTTTTTATTGGTGTTCTATCTGCCACTAATCACTTTGCGGAGCGAATGGCTGTTAGGAAAACTTGGATGCAATCTTCAGCTGTCAAGGCATCAAATGTAGTTGTTCGCTTCTTTGTTGCACTGAATCCACGGAAGGAGGTCAATGCTGTGCTGAAGAAGGAAGCTGAATACTTTGGTGATATTGTGATCCTGCCCTTCATGGATCGCTATGAGCTTGTTGTTCTCAAGACTATTGCTATATGCGAGTTTGGGACTGTGAATTTGACGGCTTCATATGTTATGAAATGTGATGATGACACCTTTGTGAGGGTGGACACTGTTTTAAAACAGATCAGCGGTGTTTCATCCAAGAAGTCTCTATACATGGGCAATCTCAACCTCTTGCATCGCCCTCTCCGACATGGAAAATGGGCAGTCACTTATGAGGAATGGCCAGAAGAAGTGTATCCTCCATATGCCAATGGGCCAGGATATATCATTTCTAGTGACATTGCTAAATACATTGTCTCCCAACATGAAAACAGGAGCCTGAGGATATTCAAGATGGAGGATGTGAGCATGGGAATGTGGGTTGAGCAGTTCAACGGTACCGTGGCCGCGGTCCAGTACTCTCACAACTGGAAATTCTGCCAGTATGGGTGTATGGAGGACTATTTTACGGCACATTATCAATCTCCAAGGCAGATAATCTGTCT
GTGGGATAAGCTGGGGCAGGGTCACGCTCACTGTTGCAACTTCAGGTGA

Coding sequence (CDS)

ATGAAGAAGCTTAAAACCGAACCTCCGGGTGCGCGGAGGCTCAGGTTGTCGCATCTTCTTCTCGTAATTGGAGTGTTGTATTTAGTTTTCATATCATTCAAGTTTCCGCGTTTTTTGGAAATTGCTGCAACGTTGAGCGGGGATGAAAGTAATGTCGGGTTGGATGGGACAATCGGAGGGGACAGTGAAGGTGTGGATTTTAGCAGAGCATCGTTGAGTTCTGTTTACAAGGATACGTTTCATCGGAAATTGGAAGATAACCAAAACAGGGAAGCACCGGTGACGCCTAAAAAGGAGCCACTGGAAGATGTGAATAATGTGTCCGGACCCATCAAGCCAATTAAACATAAATACGGTCGGATAACTGGTAAAATTTTGAGTCATCAAAACCATACTAATGACTTTTCTATGCTCGAGAAATTGGCAGATGAAGCTTGGACATTAGGCTTGAAGGCTTGGGAAGAAGTGGATAAATTTGGTTTAGATGAGACTGCAGAAAGTTCTATACTCGAGGGAAAGCCCGAGACATGTCCTTCATGGATATCTACGGATGGGAAAAAGATGTTGGAGGGAGATGGAATCATGTTCCTTCCTTGTGGACTTGCTGCAGGTTCATCGATTACAATAATTGGAACCCCCCATCATGCTCACCAGGAGTATGTACCCCAACTTCTGAAGGTGGGAGCTGATCCTATGGTTATGGTTTCACAGTTCATGGTTGAATTGCAAGGGCTGAAAGCAGTCGATGGTGAGGATCCACCAAAGATCCTTCACCTGAATCCACGACTGAAAGGAGATTGGAGTAAACGGCCTGTCATTGAACATAATACATGTTATAGGATGCAGTGGGGAACGGCTCAAAGGTGTGATGGTGTGCCATCAAGTAATGATGATGAAATGCTTGTTGATGGAAATCGTCGATGTGAAAAATGGGTGAGGAGTGATATTATAGACTCAAAAGAATCGAAAACAACCTCATGGTTCAAGAGATTCATAGGGCGAGAGCAGAAGCCAGAAGTAACCTGGCCATTTCCTTTTATGGAGGGCAGATTGTTTATCCTGACACTACGGGCTGGTGTTGATGGATACCATATTAATGTTGGTGGTCGGCACTTGACTTCTTTTCCCTATCGCCTCGGGTTTACGCTTGAAGATGCAACTGGATTAGCAGTTAAAGGAGACGTGGACATTCATTCCGCATATGCTACATCTCTTCCTACTACTCATCCAAGCTTCTCTCCCCAACGAGTTCTTGAGATGTCAGAGAAGTGGAAATCTCAGCCTTTACCTAAGAGTTCTGTTCGTCTTTTTATTGGTGTTCTATCTGCCACTAATCACTTTGCGGAGCGAATGGCTGTTAGGAAAACTTGGATGCAATCTTCAGCTGTCAAGGCATCAAATGTAGTTGTTCGCTTCTTTGTTGCACTGAATCCACGGAAGGAGGTCAATGCTGTGCTGAAGAAGGAAGCTGAATACTTTGGTGATATTGTGATCCTGCCCTTCATGGATCGCTATGAGCTTGTTGTTCTCAAGACTATTGCTATATGCGAGTTTGGGACTGTGAATTTGACGGCTTCATATGTTATGAAATGTGATGATGACACCTTTGTGAGGGTGGACACTGTTTTAAAACAGATCAGCGGTGTTTCATCCAAGAAGTCTCTATACATGGGCAATCTCAACCTCTTGCATCGCCCTCTCCGACATGGAAAATGGGCAGTCACTTATGAGGAATGGCCAGAAGAAGTGTATCCTCCATATGCCAATGGGCCAGGATATATCATTTCTAGTGACATTGCTAAATACATTGTCTCCCAACATGAAAACAGGAGCCTGAGGATATTCAAGATGGAGGATGTGAGCATGGGAATGTGGGTTGAGCAGTTCAACGGTACCGTGGCCGCGGTCCAGTACTCTCACAACTGGAAATTCTGCCAGTATGGGTGTATGGAGGACTATTTTACGGCACATTATCAATCTCCAAGGCAGATAATCTGTCT
GTGGGATAAGCTGGGGCAGGGTCACGCTCACTGTTGCAACTTCAGGTGA

Protein sequence

MKKLKTEPPGARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNVGLDGTIGGDSEGVDFSRASLSSVYKDTFHRKLEDNQNREAPVTPKKEPLEDVNNVSGPIKPIKHKYGRITGKILSHQNHTNDFSMLEKLADEAWTLGLKAWEEVDKFGLDETAESSILEGKPETCPSWISTDGKKMLEGDGIMFLPCGLAAGSSITIIGTPHHAHQEYVPQLLKVGADPMVMVSQFMVELQGLKAVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGVPSSNDDEMLVDGNRRCEKWVRSDIIDSKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGVDGYHINVGGRHLTSFPYRLGFTLEDATGLAVKGDVDIHSAYATSLPTTHPSFSPQRVLEMSEKWKSQPLPKSSVRLFIGVLSATNHFAERMAVRKTWMQSSAVKASNVVVRFFVALNPRKEVNAVLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGTVNLTASYVMKCDDDTFVRVDTVLKQISGVSSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIISSDIAKYIVSQHENRSLRIFKMEDVSMGMWVEQFNGTVAAVQYSHNWKFCQYGCMEDYFTAHYQSPRQIICLWDKLGQGHAHCCNFR
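
The relationship between the CDS and the protein sequence above can be checked by translating codons with the standard genetic code. The sketch below is illustrative only: the `translate` helper is hypothetical, it assumes the standard nuclear code (NCBI translation table 1), and it is applied to the first 30 and last 15 bases of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGAAGAAGCTTAAAACCGAACCTCCGGGT"  # first 30 bases of the CDS
cds_end = "TGCAACTTCAGGTGA"                   # last 15 bases of the CDS
print(translate(cds_start))  # MKKLKTEPPG
print(translate(cds_end))    # CNFR
```

Both fragments agree with the corresponding ends of the protein sequence (MKKLKTEPPG... and ...CCNFR), with the terminal TGA acting as the stop codon.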
Homology
BLAST of Moc01g33540 vs. NCBI nr
Match: XP_022146632.1 (hydroxyproline O-galactosyltransferase GALT2 [Momordica charantia])

HSP 1 Score: 1407.1 bits (3641), Expect = 0.0e+00
Identity = 682/682 (100.00%), Positives = 682/682 (100.00%), Query Frame = 0

Query: 1   MKKLKTEPPGARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNVGLDGTIGG 60
           MKKLKTEPPGARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNVGLDGTIGG
Sbjct: 1   MKKLKTEPPGARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNVGLDGTIGG 60

Query: 61  DSEGVDFSRASLSSVYKDTFHRKLEDNQNREAPVTPKKEPLEDVNNVSGPIKPIKHKYGR 120
           DSEGVDFSRASLSSVYKDTFHRKLEDNQNREAPVTPKKEPLEDVNNVSGPIKPIKHKYGR
Sbjct: 61  DSEGVDFSRASLSSVYKDTFHRKLEDNQNREAPVTPKKEPLEDVNNVSGPIKPIKHKYGR 120

Query: 121 ITGKILSHQNHTNDFSMLEKLADEAWTLGLKAWEEVDKFGLDETAESSILEGKPETCPSW 180
           ITGKILSHQNHTNDFSMLEKLADEAWTLGLKAWEEVDKFGLDETAESSILEGKPETCPSW
Sbjct: 121 ITGKILSHQNHTNDFSMLEKLADEAWTLGLKAWEEVDKFGLDETAESSILEGKPETCPSW 180

Query: 181 ISTDGKKMLEGDGIMFLPCGLAAGSSITIIGTPHHAHQEYVPQLLKVGADPMVMVSQFMV 240
           ISTDGKKMLEGDGIMFLPCGLAAGSSITIIGTPHHAHQEYVPQLLKVGADPMVMVSQFMV
Sbjct: 181 ISTDGKKMLEGDGIMFLPCGLAAGSSITIIGTPHHAHQEYVPQLLKVGADPMVMVSQFMV 240

Query: 241 ELQGLKAVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGVPSSNDDEM 300
           ELQGLKAVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGVPSSNDDEM
Sbjct: 241 ELQGLKAVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGVPSSNDDEM 300

Query: 301 LVDGNRRCEKWVRSDIIDSKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 360
           LVDGNRRCEKWVRSDIIDSKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV
Sbjct: 301 LVDGNRRCEKWVRSDIIDSKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 360

Query: 361 DGYHINVGGRHLTSFPYRLGFTLEDATGLAVKGDVDIHSAYATSLPTTHPSFSPQRVLEM 420
           DGYHINVGGRHLTSFPYRLGFTLEDATGLAVKGDVDIHSAYATSLPTTHPSFSPQRVLEM
Sbjct: 361 DGYHINVGGRHLTSFPYRLGFTLEDATGLAVKGDVDIHSAYATSLPTTHPSFSPQRVLEM 420

Query: 421 SEKWKSQPLPKSSVRLFIGVLSATNHFAERMAVRKTWMQSSAVKASNVVVRFFVALNPRK 480
           SEKWKSQPLPKSSVRLFIGVLSATNHFAERMAVRKTWMQSSAVKASNVVVRFFVALNPRK
Sbjct: 421 SEKWKSQPLPKSSVRLFIGVLSATNHFAERMAVRKTWMQSSAVKASNVVVRFFVALNPRK 480

Query: 481 EVNAVLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGTVNLTASYVMKCDDDTFVRVD 540
           EVNAVLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGTVNLTASYVMKCDDDTFVRVD
Sbjct: 481 EVNAVLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGTVNLTASYVMKCDDDTFVRVD 540

Query: 541 TVLKQISGVSSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIISSDIA 600
           TVLKQISGVSSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIISSDIA
Sbjct: 541 TVLKQISGVSSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIISSDIA 600

Query: 601 KYIVSQHENRSLRIFKMEDVSMGMWVEQFNGTVAAVQYSHNWKFCQYGCMEDYFTAHYQS 660
           KYIVSQHENRSLRIFKMEDVSMGMWVEQFNGTVAAVQYSHNWKFCQYGCMEDYFTAHYQS
Sbjct: 601 KYIVSQHENRSLRIFKMEDVSMGMWVEQFNGTVAAVQYSHNWKFCQYGCMEDYFTAHYQS 660

Query: 661 PRQIICLWDKLGQGHAHCCNFR 683
           PRQIICLWDKLGQGHAHCCNFR
Sbjct: 661 PRQIICLWDKLGQGHAHCCNFR 682

BLAST of Moc01g33540 vs. NCBI nr
Match: XP_038882369.1 (hydroxyproline O-galactosyltransferase GALT2 [Benincasa hispida])

HSP 1 Score: 1306.6 bits (3380), Expect = 0.0e+00
Identity = 629/682 (92.23%), Positives = 656/682 (96.19%), Query Frame = 0

Query: 1   MKKLKTEPPGARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNVGLDGTIGG 60
           MKKLKTEPP ARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDE+N+GLD   G 
Sbjct: 1   MKKLKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDENNIGLDSN-GV 60

Query: 61  DSEGVDFSRASLSSVYKDTFHRKLEDNQNREAPVTPKKEPLEDVNNVSGPIKPIKHKYGR 120
           DSEGVDFS+ASLSSVYKDTFHRKLEDNQ+ EAP+TPKKEPLE VNNV+GPIKPI+HKYGR
Sbjct: 61  DSEGVDFSKASLSSVYKDTFHRKLEDNQHLEAPLTPKKEPLEAVNNVTGPIKPIQHKYGR 120

Query: 121 ITGKILSHQNHTNDFSMLEKLADEAWTLGLKAWEEVDKFGLDETAESSILEGKPETCPSW 180
           ITGK+ + QNHTNDFSMLEK+ADEAWTLGLKAWEEVDKFGL+ETAESSILEGKPE+CPSW
Sbjct: 121 ITGKVSNQQNHTNDFSMLEKMADEAWTLGLKAWEEVDKFGLNETAESSILEGKPESCPSW 180

Query: 181 ISTDGKKMLEGDGIMFLPCGLAAGSSITIIGTPHHAHQEYVPQLLKVGADPMVMVSQFMV 240
           ISTDGKK+LEGDGIMFLPCGLAAGSSITIIGTPH AH+EYVPQLLKVG DP VMVSQFMV
Sbjct: 181 ISTDGKKLLEGDGIMFLPCGLAAGSSITIIGTPHLAHKEYVPQLLKVGDDPNVMVSQFMV 240

Query: 241 ELQGLKAVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGVPSSNDDEM 300
           ELQGLK+VDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDG+PSS+DDEM
Sbjct: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM 300

Query: 301 LVDGNRRCEKWVRSDIIDSKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 360
           LVDGNRRCEKW RSDI DSKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV
Sbjct: 301 LVDGNRRCEKWSRSDITDSKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 360

Query: 361 DGYHINVGGRHLTSFPYRLGFTLEDATGLAVKGDVDIHSAYATSLPTTHPSFSPQRVLEM 420
           DGYHINVGGRHLTSF YR GFTLEDATGLAVKGDVDIHS YATSLPT+HPSFSPQRVLEM
Sbjct: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPQRVLEM 420

Query: 421 SEKWKSQPLPKSSVRLFIGVLSATNHFAERMAVRKTWMQSSAVKASNVVVRFFVALNPRK 480
           SEKWKSQPLPK S+ LFIGVLSATNHFAERMAVRKTWMQSSAVK+SNVVVRFFVALNPRK
Sbjct: 421 SEKWKSQPLPKRSIFLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVALNPRK 480

Query: 481 EVNAVLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGTVNLTASYVMKCDDDTFVRVD 540
           EVNAVLKKEA YFGDIVILPFMDRYELVVLKTIAICEFG VNLTASY+MKCDDDTFVR++
Sbjct: 481 EVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGAVNLTASYIMKCDDDTFVRME 540

Query: 541 TVLKQISGVSSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIISSDIA 600
           TVLKQI G+SSK+SLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIIS DIA
Sbjct: 541 TVLKQIEGISSKRSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIISIDIA 600

Query: 601 KYIVSQHENRSLRIFKMEDVSMGMWVEQFNGTVAAVQYSHNWKFCQYGCMEDYFTAHYQS 660
           KYIVSQHEN+SLRIFKMEDVSMGMWVEQFN TVA VQYSHNWKFCQYGCMEDYFTAHYQS
Sbjct: 601 KYIVSQHENKSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQS 660

Query: 661 PRQIICLWDKLGQGHAHCCNFR 683
           PRQIICLWDKL +GHAHCCNFR
Sbjct: 661 PRQIICLWDKLARGHAHCCNFR 681
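
The identity counts reported for each hit come from comparing the aligned Query and Sbjct rows column by column. As a rough sketch of that bookkeeping (the function name `count_columns` is hypothetical, and this is not how BLAST itself is implemented), the first 60-column block of the Benincasa hispida alignment can be tallied like this:

```python
def count_columns(query_row, sbjct_row):
    """Count identical columns and gap columns in one aligned block."""
    if len(query_row) != len(sbjct_row):
        raise ValueError("aligned rows must have equal length")
    identities = 0
    gaps = 0
    for q, s in zip(query_row, sbjct_row):
        if q == "-" or s == "-":
            gaps += 1          # a gap in either row is never an identity
        elif q == s:
            identities += 1
    return identities, gaps

# First alignment block of the XP_038882369.1 hit, copied from this page.
query = "MKKLKTEPPGARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNVGLDGTIGG"
sbjct = "MKKLKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDENNIGLDSN-GV"

identities, gaps = count_columns(query, sbjct)
print(identities, gaps, len(query))  # 53 1 60
```

Summing the identity counts over every block of a hit should reproduce the numerator of its Identity line; positives would additionally require a substitution matrix, which is outside the scope of this sketch.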

BLAST of Moc01g33540 vs. NCBI nr
Match: XP_008439584.1 (PREDICTED: hydroxyproline O-galactosyltransferase GALT2 [Cucumis melo] >TYK13311.1 hydroxyproline O-galactosyltransferase GALT2 [Cucumis melo var. makuwa])

HSP 1 Score: 1300.4 bits (3364), Expect = 0.0e+00
Identity = 624/682 (91.50%), Positives = 655/682 (96.04%), Query Frame = 0

Query: 1   MKKLKTEPPGARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNVGLDGTIGG 60
           MKK+KTEPP ARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESN GLD   G 
Sbjct: 1   MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGLDSN-GV 60

Query: 61  DSEGVDFSRASLSSVYKDTFHRKLEDNQNREAPVTPKKEPLEDVNNVSGPIKPIKHKYGR 120
           DSEG+DFS+ASLSSVYKDTFHRKLEDN++ EAP+TPKKEPLE+VNNV+GPIKPI+HKYGR
Sbjct: 61  DSEGMDFSKASLSSVYKDTFHRKLEDNEHLEAPLTPKKEPLEEVNNVTGPIKPIQHKYGR 120

Query: 121 ITGKILSHQNHTNDFSMLEKLADEAWTLGLKAWEEVDKFGLDETAESSILEGKPETCPSW 180
           ITG I S  NHTNDFSMLEK+ADEAWTLGL AWEE+DKFGL+ETAESSILEGKPE+CPSW
Sbjct: 121 ITGNISSLLNHTNDFSMLEKMADEAWTLGLMAWEEIDKFGLNETAESSILEGKPESCPSW 180

Query: 181 ISTDGKKMLEGDGIMFLPCGLAAGSSITIIGTPHHAHQEYVPQLLKVGADPMVMVSQFMV 240
           ISTDGKK++EGDG+MFLPCGLAAGSSITIIGTPH AHQEYVPQLLKVG DP VMVSQFMV
Sbjct: 181 ISTDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPNVMVSQFMV 240

Query: 241 ELQGLKAVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGVPSSNDDEM 300
           ELQGLK+VDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDG+PSS+DDEM
Sbjct: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM 300

Query: 301 LVDGNRRCEKWVRSDIIDSKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 360
           LVDGNRRCEKW+RSD+ D+KESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV
Sbjct: 301 LVDGNRRCEKWLRSDVTDTKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 360

Query: 361 DGYHINVGGRHLTSFPYRLGFTLEDATGLAVKGDVDIHSAYATSLPTTHPSFSPQRVLEM 420
           DGYHINVGGRHLTSF YR GFTLEDATGLAVKGDVDIHS YATSLPT+HPSFSPQRVLEM
Sbjct: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPQRVLEM 420

Query: 421 SEKWKSQPLPKSSVRLFIGVLSATNHFAERMAVRKTWMQSSAVKASNVVVRFFVALNPRK 480
           SEKWKSQPLPKSSV LFIGVLSATNHFAERMAVRKTWMQSSAVK+SNVVVRFFVALNPRK
Sbjct: 421 SEKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVALNPRK 480

Query: 481 EVNAVLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGTVNLTASYVMKCDDDTFVRVD 540
           EVNAVLK+EA YFGDIVILPFMDRYELVVLKTIAICEFG VNLTASY+MKCDDDTFVRV+
Sbjct: 481 EVNAVLKREAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFVRVE 540

Query: 541 TVLKQISGVSSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIISSDIA 600
           TVLKQI G+SSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYI+S DIA
Sbjct: 541 TVLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIA 600

Query: 601 KYIVSQHENRSLRIFKMEDVSMGMWVEQFNGTVAAVQYSHNWKFCQYGCMEDYFTAHYQS 660
           KYIVSQHENRSLRIFKMEDVSMGMWVEQFN TVA VQYSHNWKFCQYGCMEDYFTAHYQS
Sbjct: 601 KYIVSQHENRSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQS 660

Query: 661 PRQIICLWDKLGQGHAHCCNFR 683
           PRQI+CLWDKL +GHAHCCNFR
Sbjct: 661 PRQILCLWDKLARGHAHCCNFR 681
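
The percentages on each hit's Identity line are simply the stated fractions expressed to two decimal places. The sketch below re-derives them for the Cucumis melo hit. The `parse_hsp_stats` helper is hypothetical, and its pattern deliberately accepts both the spelling "Positives" and the "Postives" variant that appears in some database exports.

```python
import re

HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+).*?Pos\w*tives = (\d+)/(\d+)"
)

def parse_hsp_stats(line):
    """Return (identity %, positives %) recomputed from the raw fractions."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no identity statistics in: {line!r}")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return 100 * ident / ident_len, 100 * pos / pos_len

line = "Identity = 624/682 (91.50%), Positives = 655/682 (96.04%), Query Frame = 0"
identity_pct, positives_pct = parse_hsp_stats(line)
print(f"{identity_pct:.2f} {positives_pct:.2f}")  # 91.50 96.04
```

Recomputing the values this way is a cheap consistency check when alignment summaries are copied between files or tables.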

BLAST of Moc01g33540 vs. NCBI nr
Match: KAA0052514.1 (hydroxyproline O-galactosyltransferase GALT2 [Cucumis melo var. makuwa])

HSP 1 Score: 1299.3 bits (3361), Expect = 0.0e+00
Identity = 623/682 (91.35%), Positives = 655/682 (96.04%), Query Frame = 0

Query: 1   MKKLKTEPPGARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNVGLDGTIGG 60
           MKK+KTEPP ARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESN GLD   G 
Sbjct: 1   MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGLDSN-GV 60

Query: 61  DSEGVDFSRASLSSVYKDTFHRKLEDNQNREAPVTPKKEPLEDVNNVSGPIKPIKHKYGR 120
           DSEG+DFS+ASLSSVYKDTFHRKLEDN++ EAP+TPKKEPLE+VNNV+GPIKPI+HKYGR
Sbjct: 61  DSEGMDFSKASLSSVYKDTFHRKLEDNEHLEAPLTPKKEPLEEVNNVTGPIKPIQHKYGR 120

Query: 121 ITGKILSHQNHTNDFSMLEKLADEAWTLGLKAWEEVDKFGLDETAESSILEGKPETCPSW 180
           ITG I S  NHTNDFSMLEK+ADEAWTLGL AWEE+DKFGL+ETAESSILEGKPE+CPSW
Sbjct: 121 ITGNISSLLNHTNDFSMLEKMADEAWTLGLMAWEEIDKFGLNETAESSILEGKPESCPSW 180

Query: 181 ISTDGKKMLEGDGIMFLPCGLAAGSSITIIGTPHHAHQEYVPQLLKVGADPMVMVSQFMV 240
           ISTDGKK++EGDG+MFLPCGLAAGSSITIIGTPH AHQEYVPQLLKVG DP VMVSQFMV
Sbjct: 181 ISTDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPNVMVSQFMV 240

Query: 241 ELQGLKAVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGVPSSNDDEM 300
           ELQGLK+VDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDG+PSS+DDEM
Sbjct: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM 300

Query: 301 LVDGNRRCEKWVRSDIIDSKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 360
           LVDGNRRCEKW+RSD+ D+KESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV
Sbjct: 301 LVDGNRRCEKWLRSDVTDTKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 360

Query: 361 DGYHINVGGRHLTSFPYRLGFTLEDATGLAVKGDVDIHSAYATSLPTTHPSFSPQRVLEM 420
           DGYHINVGGRHLTSF YR GFTLEDATGLAVKGDVDIHS YATSLPT+HPSFSPQRVLEM
Sbjct: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPQRVLEM 420

Query: 421 SEKWKSQPLPKSSVRLFIGVLSATNHFAERMAVRKTWMQSSAVKASNVVVRFFVALNPRK 480
           SEKWKSQPLPKSSV LFIGVLSATNHFAERMAVRKTWMQSSAVK+SNVVVRFFVALNPRK
Sbjct: 421 SEKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVALNPRK 480

Query: 481 EVNAVLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGTVNLTASYVMKCDDDTFVRVD 540
           EVNAVLK+EA YFGDIVILPFMDRYELVVLKTIAICEFG +NLTASY+MKCDDDTFVRV+
Sbjct: 481 EVNAVLKREAAYFGDIVILPFMDRYELVVLKTIAICEFGVMNLTASYIMKCDDDTFVRVE 540

Query: 541 TVLKQISGVSSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIISSDIA 600
           TVLKQI G+SSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYI+S DIA
Sbjct: 541 TVLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIA 600

Query: 601 KYIVSQHENRSLRIFKMEDVSMGMWVEQFNGTVAAVQYSHNWKFCQYGCMEDYFTAHYQS 660
           KYIVSQHENRSLRIFKMEDVSMGMWVEQFN TVA VQYSHNWKFCQYGCMEDYFTAHYQS
Sbjct: 601 KYIVSQHENRSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQS 660

Query: 661 PRQIICLWDKLGQGHAHCCNFR 683
           PRQI+CLWDKL +GHAHCCNFR
Sbjct: 661 PRQILCLWDKLARGHAHCCNFR 681

BLAST of Moc01g33540 vs. NCBI nr
Match: XP_011658301.1 (hydroxyproline O-galactosyltransferase GALT2 [Cucumis sativus] >KGN49434.1 hypothetical protein Csa_003395 [Cucumis sativus])

HSP 1 Score: 1292.3 bits (3343), Expect = 0.0e+00
Identity = 620/682 (90.91%), Positives = 651/682 (95.45%), Query Frame = 0

Query: 1   MKKLKTEPPGARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNVGLDGTIGG 60
           MKK+KTEPP ARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESN GLD   G 
Sbjct: 1   MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGLDSN-GV 60

Query: 61  DSEGVDFSRASLSSVYKDTFHRKLEDNQNREAPVTPKKEPLEDVNNVSGPIKPIKHKYGR 120
           DSEG+DFS+ASLSSVYKDTFHRKLEDNQ+ EAP+TPKKEPLE+VNNV+GPIKPIKHKYGR
Sbjct: 61  DSEGMDFSKASLSSVYKDTFHRKLEDNQHLEAPLTPKKEPLEEVNNVTGPIKPIKHKYGR 120

Query: 121 ITGKILSHQNHTNDFSMLEKLADEAWTLGLKAWEEVDKFGLDETAESSILEGKPETCPSW 180
           ITG I S  NHTNDFSMLE +ADEAWTLG  AWEEVDKFGL+ET+ESSILEGKPE+CPSW
Sbjct: 121 ITGNISSQLNHTNDFSMLETMADEAWTLGSMAWEEVDKFGLNETSESSILEGKPESCPSW 180

Query: 181 ISTDGKKMLEGDGIMFLPCGLAAGSSITIIGTPHHAHQEYVPQLLKVGADPMVMVSQFMV 240
           ISTDGKK++EGDG+MFLPCGLAAGSSITIIGTPH AHQEYVPQLLKVG DP VMVSQFMV
Sbjct: 181 ISTDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPKVMVSQFMV 240

Query: 241 ELQGLKAVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGVPSSNDDEM 300
           ELQGLK+VDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDG+PSS++DEM
Sbjct: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSEDEM 300

Query: 301 LVDGNRRCEKWVRSDIIDSKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 360
           LVDGN RCEKW+RSD+ DSKESKTTSWF+RFIGREQKPEVTWPFPFMEGRLFILTLRAGV
Sbjct: 301 LVDGNHRCEKWLRSDVTDSKESKTTSWFRRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 360

Query: 361 DGYHINVGGRHLTSFPYRLGFTLEDATGLAVKGDVDIHSAYATSLPTTHPSFSPQRVLEM 420
           DGYHINVGGRHLTSF YR GFTLEDATGLAVKGDVDIHS YAT+LPT+HPSFSPQRVLEM
Sbjct: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATALPTSHPSFSPQRVLEM 420

Query: 421 SEKWKSQPLPKSSVRLFIGVLSATNHFAERMAVRKTWMQSSAVKASNVVVRFFVALNPRK 480
           SEKWKSQPLPKSSV LFIGVLSATNHFAERMAVRKTWMQSSAV +SNVVVRFFVALNPRK
Sbjct: 421 SEKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVMSSNVVVRFFVALNPRK 480

Query: 481 EVNAVLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGTVNLTASYVMKCDDDTFVRVD 540
           EVNAVLKKEA YFGDIVILPFMDRYELVVLKTIAICEFG VNLTASY+MKCDDDTFVRV+
Sbjct: 481 EVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFVRVE 540

Query: 541 TVLKQISGVSSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIISSDIA 600
           TVLKQI G+SSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYI+S DIA
Sbjct: 541 TVLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIA 600

Query: 601 KYIVSQHENRSLRIFKMEDVSMGMWVEQFNGTVAAVQYSHNWKFCQYGCMEDYFTAHYQS 660
           KYIVSQHEN+SLRIFKMEDVSMGMWVEQFN TVA VQYSHNWKFCQYGCMEDYFTAHYQS
Sbjct: 601 KYIVSQHENKSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQS 660

Query: 661 PRQIICLWDKLGQGHAHCCNFR 683
           PRQI+CLWDKL +GHAHCCNFR
Sbjct: 661 PRQILCLWDKLARGHAHCCNFR 681

BLAST of Moc01g33540 vs. ExPASy Swiss-Prot
Match: A7XDQ9 (Hydroxyproline O-galactosyltransferase GALT2 OS=Arabidopsis thaliana OX=3702 GN=GALT2 PE=1 SV=1)

HSP 1 Score: 949.9 bits (2454), Expect = 1.6e-275
Identity = 462/691 (66.86%), Positives = 551/691 (79.74%), Query Frame = 0

Query: 1   MKKLKTEP----PGARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNVGLDG 60
           MK++K+E       +RR +LSH LL I   YLVF++FKFP F+E+ A LSGD    GLDG
Sbjct: 1   MKRVKSESFRGVYSSRRFKLSHFLLAIAGFYLVFLAFKFPHFIEMVAMLSGD---TGLDG 60

Query: 61  TIGGDSEGVDFSRASLSSVYKDTFHRKLEDNQNREAPVTPKKEPLEDVNNVSGPIKPIKH 120
            +   S  V  S     S+  D  +RKLED  ++  P T +K   E+  N S  I+P+  
Sbjct: 61  ALSDTSLDVSLS----GSLRNDMLNRKLEDEDHQSGPSTTQKVSPEEKINGSKQIQPLLF 120

Query: 121 KYGRITGKILSHQNHTNDFSMLEKLADEAWTLGLKAWEEVDKFGLDETAES-SILEGKPE 180
           +YGRI+G+++  +N T   S  E++ADEAW LG KAWE+VDKF +D+  ES SI EGK E
Sbjct: 121 RYGRISGEVMRRRNRTIHMSPFERMADEAWILGSKAWEDVDKFEVDKINESASIFEGKVE 180

Query: 181 TCPSWISTDGKKMLEGDGIMFLPCGLAAGSSITIIGTPHHAHQEYVPQLLKVGAD-PMVM 240
           +CPS IS +G  + + + IM LPCGLAAGSSITI+GTP +AH+E VPQ  ++     MV+
Sbjct: 181 SCPSQISMNGDDLNKANRIMLLPCGLAAGSSITILGTPQYAHKESVPQRSRLTRSYGMVL 240

Query: 241 VSQFMVELQGLKAVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGVPS 300
           VSQFMVELQGLK  DGE PPKILHLNPR+KGDW+ RPVIEHNTCYRMQWG AQRCDG PS
Sbjct: 241 VSQFMVELQGLKTGDGEYPPKILHLNPRIKGDWNHRPVIEHNTCYRMQWGVAQRCDGTPS 300

Query: 301 SNDDEMLVDGNRRCEKWVRSDII---DSKESKTTSWFKRFIGREQKPEVTWPFPFMEGRL 360
             D ++LVDG RRCEKW ++DII   DSKESKTTSWFKRFIGREQKPEVTW FPF EG++
Sbjct: 301 KKDADVLVDGFRRCEKWTQNDIIDMVDSKESKTTSWFKRFIGREQKPEVTWSFPFAEGKV 360

Query: 361 FILTLRAGVDGYHINVGGRHLTSFPYRLGFTLEDATGLAVKGDVDIHSAYATSLPTTHPS 420
           F+LTLRAG+DG+HINVGGRH++SFPYR GFT+EDATGLAV GDVDIHS +ATSL T+HPS
Sbjct: 361 FVLTLRAGIDGFHINVGGRHVSSFPYRPGFTIEDATGLAVTGDVDIHSIHATSLSTSHPS 420

Query: 421 FSPQRVLEMSEKWKSQPLPKSSVRLFIGVLSATNHFAERMAVRKTWMQSSAVKASNVVVR 480
           FSPQ+ +E S +WK+ PLP +  RLF+GVLSATNHF+ERMAVRKTWMQ  ++K+S+VV R
Sbjct: 421 FSPQKAIEFSSEWKAPPLPGTPFRLFMGVLSATNHFSERMAVRKTWMQHPSIKSSDVVAR 480

Query: 481 FFVALNPRKEVNAVLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGTVNLTASYVMKC 540
           FFVALNPRKEVNA+LKKEAEYFGDIVILPFMDRYELVVLKTIAICEFG  N+TA Y+MKC
Sbjct: 481 FFVALNPRKEVNAMLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGVQNVTAPYIMKC 540

Query: 541 DDDTFVRVDTVLKQISGVSSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGP 600
           DDDTF+RV+++LKQI GVS +KSLYMGNLNL HRPLR GKW VT+EEWPE VYPPYANGP
Sbjct: 541 DDDTFIRVESILKQIDGVSPEKSLYMGNLNLRHRPLRTGKWTVTWEEWPEAVYPPYANGP 600

Query: 601 GYIISSDIAKYIVSQHENRSLRIFKMEDVSMGMWVEQFNGTVAAVQYSHNWKFCQYGCME 660
           GYIISS+IAKYIVSQ+    LR+FKMEDVSMG+WVEQFN ++  V+YSH+WKFCQYGC  
Sbjct: 601 GYIISSNIAKYIVSQNSRHKLRLFKMEDVSMGLWVEQFNASMQPVEYSHSWKFCQYGCTL 660

Query: 661 DYFTAHYQSPRQIICLWDKLGQGHAHCCNFR 683
           +Y+TAHYQSP Q++CLWD L +G   CCNFR
Sbjct: 661 NYYTAHYQSPSQMMCLWDNLLKGRPQCCNFR 684

BLAST of Moc01g33540 vs. ExPASy Swiss-Prot
Match: Q9LV16 (Hydroxyproline O-galactosyltransferase GALT6 OS=Arabidopsis thaliana OX=3702 GN=GALT6 PE=2 SV=2)

HSP 1 Score: 708.0 bits (1826), Expect = 1.0e-202
Identity = 356/684 (52.05%), Positives = 452/684 (66.08%), Query Frame = 0

Query: 15  RLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNVGLDGTIGGDSEGVDFSRASLSS 74
           R   +L+ +G+LY++ I+F+ P   +                           S  S   
Sbjct: 25  RSVQILMAVGLLYMLLITFEIPFVFK------------------------TGLSSLSQDP 84

Query: 75  VYKDTFHRKLEDNQNREAPVTPKKEPLEDVNNVSGPIKPIKHKYGRITGKILSHQNHTND 134
           + +   H    + Q R AP  P K  L   +    P + ++ +  RI   +       N 
Sbjct: 85  LTRPEKHNSQRELQERRAPTRPLKSLLYQESQSESPAQGLRRR-TRILSSLRFDPETFNP 144

Query: 135 FSM-----LEKLADEAWTLGLKAWEEVDK----FGLDETAESSILEGKPETCPSWISTDG 194
            S      L K A  AW +G K WEE++       L++  +  I E    +C   +S  G
Sbjct: 145 SSKDGSVELHKSAKVAWEVGRKIWEELESGKTLKALEKEKKKKIEEHGTNSCSLSVSLTG 204

Query: 195 KKMLEGDGIMFLPCGLAAGSSITIIGTPHHAHQEYVPQ--LLKVGADPMVMVSQFMVELQ 254
             +L+   IM LPCGL  GS IT++G P  AH E  P+  +LK G D  V VSQF +ELQ
Sbjct: 205 SDLLKRGNIMELPCGLTLGSHITVVGKPRAAHSEKDPKISMLKEG-DEAVKVSQFKLELQ 264

Query: 255 GLKAVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGVPSSNDDEMLVD 314
           GLKAV+GE+PP+ILHLNPRLKGDWS +PVIE NTCYRMQWG+AQRC+G   S DDE  VD
Sbjct: 265 GLKAVEGEEPPRILHLNPRLKGDWSGKPVIEQNTCYRMQWGSAQRCEGW-RSRDDEETVD 324

Query: 315 GNRRCEKWVRSDIIDSKESKTTS----WFKRFIGREQKPEVTWPFPFMEGRLFILTLRAG 374
           G  +CEKW R D I SKE +++     W  R IGR +K  V WPFPF   +LF+LTL AG
Sbjct: 325 GQVKCEKWARDDSITSKEEESSKAASWWLSRLIGRSKKVTVEWPFPFTVDKLFVLTLSAG 384

Query: 375 VDGYHINVGGRHLTSFPYRLGFTLEDATGLAVKGDVDIHSAYATSLPTTHPSFSPQRVLE 434
           ++GYH++V G+H+TSFPYR GFTLEDATGL + GD+D+HS +A SLPT+HPSFSPQR LE
Sbjct: 385 LEGYHVSVDGKHVTSFPYRTGFTLEDATGLTINGDIDVHSVFAGSLPTSHPSFSPQRHLE 444

Query: 435 MSEKWKSQPLPKSSVRLFIGVLSATNHFAERMAVRKTWMQSSAVKASNVVVRFFVALNPR 494
           +S  W++  LP   V +FIG+LSA NHFAERMAVR++WMQ   VK+S VV RFFVAL+ R
Sbjct: 445 LSSNWQAPSLPDEQVDMFIGILSAGNHFAERMAVRRSWMQHKLVKSSKVVARFFVALHSR 504

Query: 495 KEVNAVLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGTVNLTASYVMKCDDDTFVRV 554
           KEVN  LKKEAE+FGDIVI+P+MD Y+LVVLKT+AICE+G   L A ++MKCDDDTFV+V
Sbjct: 505 KEVNVELKKEAEFFGDIVIVPYMDSYDLVVLKTVAICEYGAHQLAAKFIMKCDDDTFVQV 564

Query: 555 DTVLKQISGVSSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIISSDI 614
           D VL +     + +SLY+GN+N  H+PLR GKW+VTYEEWPEE YPPYANGPGYI+S+DI
Sbjct: 565 DAVLSEAKKTPTDRSLYIGNINYYHKPLRQGKWSVTYEEWPEEDYPPYANGPGYILSNDI 624

Query: 615 AKYIVSQHENRSLRIFKMEDVSMGMWVEQFNGTVAAVQYSHNWKFCQYGCMEDYFTAHYQ 674
           +++IV + E   LR+FKMEDVS+GMWVEQFN     V Y H+ +FCQ+GC+E+Y TAHYQ
Sbjct: 625 SRFIVKEFEKHKLRMFKMEDVSVGMWVEQFNNGTKPVDYIHSLRFCQFGCIENYLTAHYQ 681

Query: 675 SPRQIICLWDKLG-QGHAHCCNFR 683
           SPRQ+ICLWDKL   G   CCN R
Sbjct: 685 SPRQMICLWDKLVLTGKPQCCNMR 681

BLAST of Moc01g33540 vs. ExPASy Swiss-Prot
Match: Q8RX55 (Hydroxyproline O-galactosyltransferase GALT5 OS=Arabidopsis thaliana OX=3702 GN=GALT5 PE=1 SV=1)

HSP 1 Score: 696.0 bits (1795), Expect = 4.1e-199
Identity = 354/687 (51.53%), Positives = 460/687 (66.96%), Query Frame = 0

Query: 15  RLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNVGLDGTIGGDSEGVDFSRASLSS 74
           R   +++ IG LYLV +S + P   +     S   S+V LD               +LS 
Sbjct: 25  RSVRVIMAIGFLYLVIVSVEIPLVFK-----SWSSSSVPLD---------------ALSR 84

Query: 75  VYKDTFHRKLEDNQNREAPVTPKKEPLEDVN-NVSGPI----------KPIKHKYGRITG 134
           +       KL + Q  +  + P   PLE V+  VS P           K  +H  G ++ 
Sbjct: 85  L------EKLNNEQEPQVEIIP-NPPLEPVSYPVSNPTIVTRTDLVQNKVREHHRGVLSS 144

Query: 135 KILSHQNHTNDFSM------LEKLADEAWTLGLKAWEEVDKFGLDETAESSILEGKPETC 194
             L   + T D S       L K A EAW LG K W+E++   L++  E    + KP++C
Sbjct: 145 --LRFDSETFDPSSKDGSVELHKSAKEAWQLGRKLWKELESGRLEKLVEKP-EKNKPDSC 204

Query: 195 PSWISTDGKKMLEGDG-IMFLPCGLAAGSSITIIGTPHHAHQEYVPQLLKVGADPMVMVS 254
           P  +S  G + +  +  +M LPCGL  GS IT++G P  AH +          D   +VS
Sbjct: 205 PHSVSLTGSEFMNRENKLMELPCGLTLGSHITLVGRPRKAHPK--------EGDWSKLVS 264

Query: 255 QFMVELQGLKAVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGVPSSN 314
           QF++ELQGLK V+GEDPP+ILH NPRLKGDWSK+PVIE N+CYRMQWG AQRC+G   S 
Sbjct: 265 QFVIELQGLKTVEGEDPPRILHFNPRLKGDWSKKPVIEQNSCYRMQWGPAQRCEGW-KSR 324

Query: 315 DDEMLVDGNRRCEKWVRSDIIDSKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTL 374
           DDE  VD + +CEKW+R D   S+ S+   W  R IGR ++ +V WPFPF+E +LF+LTL
Sbjct: 325 DDEETVDSHVKCEKWIRDDDNYSEGSRARWWLNRLIGRRKRVKVEWPFPFVEEKLFVLTL 384

Query: 375 RAGVDGYHINVGGRHLTSFPYRLGFTLEDATGLAVKGDVDIHSAYATSLPTTHPSFSPQR 434
            AG++GYHINV G+H+TSFPYR GFTLEDATGL V GD+D+HS +  SLPT+HPSF+PQR
Sbjct: 385 SAGLEGYHINVDGKHVTSFPYRTGFTLEDATGLTVNGDIDVHSVFVASLPTSHPSFAPQR 444

Query: 435 VLEMSEKWKSQPLPKSSVRLFIGVLSATNHFAERMAVRKTWMQSSAVKASNVVVRFFVAL 494
            LE+S++W++  +P   V +FIG+LSA NHF+ERMAVRK+WMQ   + ++ VV RFFVAL
Sbjct: 445 HLELSKRWQAPVVPDGPVEIFIGILSAGNHFSERMAVRKSWMQHVLITSAKVVARFFVAL 504

Query: 495 NPRKEVNAVLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGTVNLTASYVMKCDDDTF 554
           + RKEVN  LKKEAEYFGDIV++P+MD Y+LVVLKT+AICE G +  +A Y+MKCDDDTF
Sbjct: 505 HGRKEVNVELKKEAEYFGDIVLVPYMDSYDLVVLKTVAICEHGALAFSAKYIMKCDDDTF 564

Query: 555 VRVDTVLKQISGVSSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIIS 614
           V++  V+ ++  V   +SLY+GN+N  H+PLR GKWAVTYEEWPEE YPPYANGPGY++S
Sbjct: 565 VKLGAVINEVKKVPEGRSLYIGNMNYYHKPLRGGKWAVTYEEWPEEDYPPYANGPGYVLS 624

Query: 615 SDIAKYIVSQHENRSLRIFKMEDVSMGMWVEQFNGTVAAVQYSHNWKFCQYGCMEDYFTA 674
           SDIA++IV + E   LR+FKMEDVS+GMWVE F  T   V Y H+ +FCQ+GC+E+Y+TA
Sbjct: 625 SDIARFIVDKFERHKLRLFKMEDVSVGMWVEHFKNTTNPVDYRHSLRFCQFGCVENYYTA 672

Query: 675 HYQSPRQIICLWDK-LGQGHAHCCNFR 683
           HYQSPRQ+ICLWDK L Q    CCN R
Sbjct: 685 HYQSPRQMICLWDKLLRQNKPECCNMR 672
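
The percentages on each HSP summary line are the match counts divided by the alignment length. The sketch below is only an illustration, not part of the database output: it parses one summary line and recomputes both percentages from the counts. The function name `parse_hsp_summary` and the dictionary keys are hypothetical, and the pattern assumes the summary line layout shown on this page (it accepts either spelling of the positives label).

```python
import re

# Pattern for the HSP summary line as printed on this page (assumed layout).
# "Pos(?:i)?tives" accepts both "Positives" and the "Postives" spelling.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos(?:i)?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return the counts and recomputed percentages from one summary line."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, aln_len, ident_pct, positive, pos_len, pos_pct = match.groups()
    identical, aln_len = int(identical), int(aln_len)
    positive, pos_len = int(positive), int(pos_len)
    return {
        "identical": identical,
        "positive": positive,
        "alignment_length": aln_len,
        # Percentages as reported in the line
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
        # Percentages recomputed from the raw counts
        "identity_pct": round(100.0 * identical / aln_len, 2),
        "positive_pct": round(100.0 * positive / pos_len, 2),
    }


if __name__ == "__main__":
    # Summary line of the Q8RX55 (GALT5) hit above
    line = "Identity = 354/687 (51.53%), Positives = 460/687 (66.96%), Query Frame = 0"
    print(parse_hsp_summary(line))
```

For the GALT5 hit, 354 identical positions over 687 aligned columns gives 51.53%, and 460 positive positions gives 66.96%, in agreement with the reported values.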

BLAST of Moc01g33540 vs. ExPASy Swiss-Prot
Match: Q8GXG6 (Hydroxyproline O-galactosyltransferase GALT4 OS=Arabidopsis thaliana OX=3702 GN=GALT4 PE=2 SV=2)

HSP 1 Score: 695.3 bits (1793), Expect = 7.0e-199
Identity = 362/698 (51.86%), Positives = 463/698 (66.33%), Query Frame = 0

Query: 1   MKKLKTEPPGAR-RLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDE--SNVGLDG- 60
           MKK K +   ++ R  L   LLV+ + Y + +SF+ P      +    D+  S+   D  
Sbjct: 1   MKKSKLDNSSSQIRFGLVQFLLVVLLFYFLCMSFEIPFIFRTGSGSGSDDVSSSSFADAL 60

Query: 61  ----TIGGDSEGVDFSRASLSSVYKDTFHRKLEDNQNREAPVTPKKEPLEDVNNVSGPIK 120
                +GG S   ++    +    +   HR  +D    +  +  +K  + +  +VS    
Sbjct: 61  PRPMVVGGGSREANW---VVGEEEEADPHRHFKDPGRVQLRLPERK--MREFKSVSEIF- 120

Query: 121 PIKHKYGRITGKILSHQNHTNDFSMLEKLADEAWTLGLKAWEEVDKFGLDETAESSILEG 180
                   +      +   +++FS+  K A  A ++G K W+ +D  GL +  ++ + + 
Sbjct: 121 --------VNESFFDNGGFSDEFSIFHKTAKHAISMGRKMWDGLDS-GLIKPDKAPV-KT 180

Query: 181 KPETCPSWISTDGKKMLEGDGIMFLPCGLAAGSSITIIGTPHHAHQEYVPQLLKVGADPM 240
           + E CP  +S    + +    I+ LPCGL  GS IT++ TPH AH E          D  
Sbjct: 181 RIEKCPDMVSVSESEFVNRSRILVLPCGLTLGSHITVVATPHWAHVE-------KDGDKT 240

Query: 241 VMVSQFMVELQGLKAVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGV 300
            MVSQFM+ELQGLKAVDGEDPP+ILH NPR+KGDWS RPVIE NTCYRMQWG+  RCDG 
Sbjct: 241 AMVSQFMMELQGLKAVDGEDPPRILHFNPRIKGDWSGRPVIEQNTCYRMQWGSGLRCDGR 300

Query: 301 PSSNDDEMLVDGNRRCEKWVRSDI------IDSKESKTTSWFKRFIGREQKPEV-TWPFP 360
            SS DDE  VDG  +CE+W R D        D  ESK T W  R +GR +K     W +P
Sbjct: 301 ESS-DDEEYVDGEVKCERWKRDDDDGGNNGDDFDESKKTWWLNRLMGRRKKMITHDWDYP 360

Query: 361 FMEGRLFILTLRAGVDGYHINVGGRHLTSFPYRLGFTLEDATGLAVKGDVDIHSAYATSL 420
           F EG+LF+LTLRAG++GYHI+V GRH+TSFPYR GF LEDATGLAVKG++D+HS YA SL
Sbjct: 361 FAEGKLFVLTLRAGMEGYHISVNGRHITSFPYRTGFVLEDATGLAVKGNIDVHSVYAASL 420

Query: 421 PTTHPSFSPQRVLEMSEKWKSQPLPKSSVRLFIGVLSATNHFAERMAVRKTWMQSSAVKA 480
           P+T+PSF+PQ+ LEM   WK+  LP+  V LFIG+LSA NHFAERMAVRK+WMQ   V++
Sbjct: 421 PSTNPSFAPQKHLEMQRIWKAPSLPQKPVELFIGILSAGNHFAERMAVRKSWMQQKLVRS 480

Query: 481 SNVVVRFFVALNPRKEVNAVLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGTVNLTA 540
           S VV RFFVAL+ RKEVN  LKKEAEYFGDIVI+P+MD Y+LVVLKT+AICE+G   + A
Sbjct: 481 SKVVARFFVALHARKEVNVDLKKEAEYFGDIVIVPYMDHYDLVVLKTVAICEYGVNTVAA 540

Query: 541 SYVMKCDDDTFVRVDTVLKQISGVSSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYP 600
            YVMKCDDDTFVRVD V+++   V  ++SLY+GN+N  H+PLR GKWAVT+EEWPEE YP
Sbjct: 541 KYVMKCDDDTFVRVDAVIQEAEKVKGRESLYIGNINFNHKPLRTGKWAVTFEEWPEEYYP 600

Query: 601 PYANGPGYIISSDIAKYIVSQHENRSLRIFKMEDVSMGMWVEQFNGTVAAVQYSHNWKFC 660
           PYANGPGYI+S D+AK+IV   E + LR+FKMEDVSMGMWVE+FN T   V   H+ KFC
Sbjct: 601 PYANGPGYILSYDVAKFIVDDFEQKRLRLFKMEDVSMGMWVEKFNET-RPVAVVHSLKFC 660

Query: 661 QYGCMEDYFTAHYQSPRQIICLWDKLGQ-GHAHCCNFR 683
           Q+GC+EDYFTAHYQSPRQ+IC+WDKL + G   CCN R
Sbjct: 661 QFGCIEDYFTAHYQSPRQMICMWDKLQRLGKPQCCNMR 673

BLAST of Moc01g33540 vs. ExPASy Swiss-Prot
Match: Q8L7F9 (Beta-1,3-galactosyltransferase GALT1 OS=Arabidopsis thaliana OX=3702 GN=GALT1 PE=1 SV=1)

HSP 1 Score: 332.8 bits (852), Expect = 9.1e-90
Identity = 211/569 (37.08%), Positives = 303/569 (53.25%), Query Frame = 0

Query: 135 FSMLEKLADEAWTL--GLKA-------WEE----VDKFGLDETAESSILEGKPETCPSWI 194
           ++ LE L D A +L  G+ A       WE     V+   L +  E+   +GK E CP ++
Sbjct: 99  WNRLESLVDNAQSLVNGVDAIKEAGIVWESLVSAVEAKKLVDVNENQTRKGKEELCPQFL 158

Query: 195 STDGKKMLEGDGI-MFLPCGLAAGSSITIIGTPHHAHQEYVPQLLKVGADPMVMVSQFMV 254
           S       +G  + + +PCGL  GSSIT+IG P                    +V  F +
Sbjct: 159 SKMNATEADGSSLKLQIPCGLTQGSSITVIGIPDG------------------LVGSFRI 218

Query: 255 ELQGLKAVDGEDPPKILHLNPRLKGDWS-KRPVIEHNTCYRMQ-WGTAQRCDGVPSSNDD 314
           +L G       DPP I+H N RL GD S + PVI  N+    Q WG  +RC   P  + D
Sbjct: 219 DLTGQPLPGEPDPPIIVHYNVRLLGDKSTEDPVIVQNSWTASQDWGAEERC---PKFDPD 278

Query: 315 -EMLVDGNRRCEKWVRSDIIDSKESKTTSWFKRF--IGREQKPEVTWPFPFMEGRLFILT 374
               VD    C K V  +I  +  +   S   R   + RE      + FPF +G L + T
Sbjct: 279 MNKKVDDLDECNKMVGGEINRTSSTSLQSNTSRGVPVAREASKHEKY-FPFKQGFLSVAT 338

Query: 375 LRAGVDGYHINVGGRHLTSFPYRLGFTLED--ATGLAVKGDVDIHSAYATSLPTTHPSFS 434
           LR G +G  + V G+H+TSF +R   TLE    + + + GD  + S  A+ LPT+  S  
Sbjct: 339 LRVGTEGMQMTVDGKHITSFAFR--DTLEPWLVSEIRITGDFRLISILASGLPTSEES-- 398

Query: 435 PQRVLEMSEKWKSQPL-PKSSVRLFIGVLSATNHFAERMAVRKTWMQSSAVKASNVVVRF 494
            + V+++ E  KS  L P   + L IGV S  N+F  RMAVR+TWMQ   V++  V VRF
Sbjct: 399 -EHVVDL-EALKSPTLSPLRPLDLVIGVFSTANNFKRRMAVRRTWMQYDDVRSGRVAVRF 458

Query: 495 FVALNPRKEVNAVLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGTVNLTASYVMKCD 554
           FV L+    VN  L  EA  +GD+ ++PF+D Y L+  KT+AIC FGT   +A ++MK D
Sbjct: 459 FVGLHKSPLVNLELWNEARTYGDVQLMPFVDYYSLISWKTLAICIFGTEVDSAKFIMKTD 518

Query: 555 DDTFVRVDTVLKQISGVSSKKSLYMGNLNLLHRPLRH--GKWAVTYEEWPEEVYPPYANG 614
           DD FVRVD VL  +S  ++ + L  G +N   +P+R+   KW ++YEEWPEE YPP+A+G
Sbjct: 519 DDAFVRVDEVLLSLSMTNNTRGLIYGLINSDSQPIRNPDSKWYISYEEWPEEKYPPWAHG 578

Query: 615 PGYIISSDIAKYIVSQHENRSLRIFKMEDVSMGMWVEQFNGTVAAVQYSHNWKFCQYGCM 674
           PGYI+S DIA+ +    +  +L++FK+EDV+MG+W+ +         Y ++ +    GC 
Sbjct: 579 PGYIVSRDIAESVGKLFKEGNLKMFKLEDVAMGIWIAELTKHGLEPHYENDGRIISDGCK 638

Query: 675 EDYFTAHYQSPRQIICLWDKLGQGHAHCC 680
           + Y  AHYQSP ++ CLW K  +     C
Sbjct: 639 DGYVVAHYQSPAEMTCLWRKYQETKRSLC 639

BLAST of Moc01g33540 vs. ExPASy TrEMBL
Match: A0A6J1D042 (hydroxyproline O-galactosyltransferase GALT2 OS=Momordica charantia OX=3673 GN=LOC111015786 PE=3 SV=1)

HSP 1 Score: 1407.1 bits (3641), Expect = 0.0e+00
Identity = 682/682 (100.00%), Positives = 682/682 (100.00%), Query Frame = 0

Query: 1   MKKLKTEPPGARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNVGLDGTIGG 60
           MKKLKTEPPGARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNVGLDGTIGG
Sbjct: 1   MKKLKTEPPGARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNVGLDGTIGG 60

Query: 61  DSEGVDFSRASLSSVYKDTFHRKLEDNQNREAPVTPKKEPLEDVNNVSGPIKPIKHKYGR 120
           DSEGVDFSRASLSSVYKDTFHRKLEDNQNREAPVTPKKEPLEDVNNVSGPIKPIKHKYGR
Sbjct: 61  DSEGVDFSRASLSSVYKDTFHRKLEDNQNREAPVTPKKEPLEDVNNVSGPIKPIKHKYGR 120

Query: 121 ITGKILSHQNHTNDFSMLEKLADEAWTLGLKAWEEVDKFGLDETAESSILEGKPETCPSW 180
           ITGKILSHQNHTNDFSMLEKLADEAWTLGLKAWEEVDKFGLDETAESSILEGKPETCPSW
Sbjct: 121 ITGKILSHQNHTNDFSMLEKLADEAWTLGLKAWEEVDKFGLDETAESSILEGKPETCPSW 180

Query: 181 ISTDGKKMLEGDGIMFLPCGLAAGSSITIIGTPHHAHQEYVPQLLKVGADPMVMVSQFMV 240
           ISTDGKKMLEGDGIMFLPCGLAAGSSITIIGTPHHAHQEYVPQLLKVGADPMVMVSQFMV
Sbjct: 181 ISTDGKKMLEGDGIMFLPCGLAAGSSITIIGTPHHAHQEYVPQLLKVGADPMVMVSQFMV 240

Query: 241 ELQGLKAVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGVPSSNDDEM 300
           ELQGLKAVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGVPSSNDDEM
Sbjct: 241 ELQGLKAVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGVPSSNDDEM 300

Query: 301 LVDGNRRCEKWVRSDIIDSKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 360
           LVDGNRRCEKWVRSDIIDSKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV
Sbjct: 301 LVDGNRRCEKWVRSDIIDSKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 360

Query: 361 DGYHINVGGRHLTSFPYRLGFTLEDATGLAVKGDVDIHSAYATSLPTTHPSFSPQRVLEM 420
           DGYHINVGGRHLTSFPYRLGFTLEDATGLAVKGDVDIHSAYATSLPTTHPSFSPQRVLEM
Sbjct: 361 DGYHINVGGRHLTSFPYRLGFTLEDATGLAVKGDVDIHSAYATSLPTTHPSFSPQRVLEM 420

Query: 421 SEKWKSQPLPKSSVRLFIGVLSATNHFAERMAVRKTWMQSSAVKASNVVVRFFVALNPRK 480
           SEKWKSQPLPKSSVRLFIGVLSATNHFAERMAVRKTWMQSSAVKASNVVVRFFVALNPRK
Sbjct: 421 SEKWKSQPLPKSSVRLFIGVLSATNHFAERMAVRKTWMQSSAVKASNVVVRFFVALNPRK 480

Query: 481 EVNAVLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGTVNLTASYVMKCDDDTFVRVD 540
           EVNAVLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGTVNLTASYVMKCDDDTFVRVD
Sbjct: 481 EVNAVLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGTVNLTASYVMKCDDDTFVRVD 540

Query: 541 TVLKQISGVSSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIISSDIA 600
           TVLKQISGVSSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIISSDIA
Sbjct: 541 TVLKQISGVSSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIISSDIA 600

Query: 601 KYIVSQHENRSLRIFKMEDVSMGMWVEQFNGTVAAVQYSHNWKFCQYGCMEDYFTAHYQS 660
           KYIVSQHENRSLRIFKMEDVSMGMWVEQFNGTVAAVQYSHNWKFCQYGCMEDYFTAHYQS
Sbjct: 601 KYIVSQHENRSLRIFKMEDVSMGMWVEQFNGTVAAVQYSHNWKFCQYGCMEDYFTAHYQS 660

Query: 661 PRQIICLWDKLGQGHAHCCNFR 683
           PRQIICLWDKLGQGHAHCCNFR
Sbjct: 661 PRQIICLWDKLGQGHAHCCNFR 682

BLAST of Moc01g33540 vs. ExPASy TrEMBL
Match: A0A5D3CNA3 (Hydroxyproline O-galactosyltransferase GALT2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G009340 PE=3 SV=1)

HSP 1 Score: 1300.4 bits (3364), Expect = 0.0e+00
Identity = 624/682 (91.50%), Positives = 655/682 (96.04%), Query Frame = 0

Query: 1   MKKLKTEPPGARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNVGLDGTIGG 60
           MKK+KTEPP ARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESN GLD   G 
Sbjct: 1   MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGLDSN-GV 60

Query: 61  DSEGVDFSRASLSSVYKDTFHRKLEDNQNREAPVTPKKEPLEDVNNVSGPIKPIKHKYGR 120
           DSEG+DFS+ASLSSVYKDTFHRKLEDN++ EAP+TPKKEPLE+VNNV+GPIKPI+HKYGR
Sbjct: 61  DSEGMDFSKASLSSVYKDTFHRKLEDNEHLEAPLTPKKEPLEEVNNVTGPIKPIQHKYGR 120

Query: 121 ITGKILSHQNHTNDFSMLEKLADEAWTLGLKAWEEVDKFGLDETAESSILEGKPETCPSW 180
           ITG I S  NHTNDFSMLEK+ADEAWTLGL AWEE+DKFGL+ETAESSILEGKPE+CPSW
Sbjct: 121 ITGNISSLLNHTNDFSMLEKMADEAWTLGLMAWEEIDKFGLNETAESSILEGKPESCPSW 180

Query: 181 ISTDGKKMLEGDGIMFLPCGLAAGSSITIIGTPHHAHQEYVPQLLKVGADPMVMVSQFMV 240
           ISTDGKK++EGDG+MFLPCGLAAGSSITIIGTPH AHQEYVPQLLKVG DP VMVSQFMV
Sbjct: 181 ISTDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPNVMVSQFMV 240

Query: 241 ELQGLKAVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGVPSSNDDEM 300
           ELQGLK+VDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDG+PSS+DDEM
Sbjct: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM 300

Query: 301 LVDGNRRCEKWVRSDIIDSKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 360
           LVDGNRRCEKW+RSD+ D+KESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV
Sbjct: 301 LVDGNRRCEKWLRSDVTDTKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 360

Query: 361 DGYHINVGGRHLTSFPYRLGFTLEDATGLAVKGDVDIHSAYATSLPTTHPSFSPQRVLEM 420
           DGYHINVGGRHLTSF YR GFTLEDATGLAVKGDVDIHS YATSLPT+HPSFSPQRVLEM
Sbjct: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPQRVLEM 420

Query: 421 SEKWKSQPLPKSSVRLFIGVLSATNHFAERMAVRKTWMQSSAVKASNVVVRFFVALNPRK 480
           SEKWKSQPLPKSSV LFIGVLSATNHFAERMAVRKTWMQSSAVK+SNVVVRFFVALNPRK
Sbjct: 421 SEKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVALNPRK 480

Query: 481 EVNAVLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGTVNLTASYVMKCDDDTFVRVD 540
           EVNAVLK+EA YFGDIVILPFMDRYELVVLKTIAICEFG VNLTASY+MKCDDDTFVRV+
Sbjct: 481 EVNAVLKREAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFVRVE 540

Query: 541 TVLKQISGVSSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIISSDIA 600
           TVLKQI G+SSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYI+S DIA
Sbjct: 541 TVLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIA 600

Query: 601 KYIVSQHENRSLRIFKMEDVSMGMWVEQFNGTVAAVQYSHNWKFCQYGCMEDYFTAHYQS 660
           KYIVSQHENRSLRIFKMEDVSMGMWVEQFN TVA VQYSHNWKFCQYGCMEDYFTAHYQS
Sbjct: 601 KYIVSQHENRSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQS 660

Query: 661 PRQIICLWDKLGQGHAHCCNFR 683
           PRQI+CLWDKL +GHAHCCNFR
Sbjct: 661 PRQILCLWDKLARGHAHCCNFR 681

BLAST of Moc01g33540 vs. ExPASy TrEMBL
Match: A0A1S3AZQ8 (hydroxyproline O-galactosyltransferase GALT2 OS=Cucumis melo OX=3656 GN=LOC103484337 PE=3 SV=1)

HSP 1 Score: 1300.4 bits (3364), Expect = 0.0e+00
Identity = 624/682 (91.50%), Positives = 655/682 (96.04%), Query Frame = 0

Query: 1   MKKLKTEPPGARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNVGLDGTIGG 60
           MKK+KTEPP ARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESN GLD   G 
Sbjct: 1   MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGLDSN-GV 60

Query: 61  DSEGVDFSRASLSSVYKDTFHRKLEDNQNREAPVTPKKEPLEDVNNVSGPIKPIKHKYGR 120
           DSEG+DFS+ASLSSVYKDTFHRKLEDN++ EAP+TPKKEPLE+VNNV+GPIKPI+HKYGR
Sbjct: 61  DSEGMDFSKASLSSVYKDTFHRKLEDNEHLEAPLTPKKEPLEEVNNVTGPIKPIQHKYGR 120

Query: 121 ITGKILSHQNHTNDFSMLEKLADEAWTLGLKAWEEVDKFGLDETAESSILEGKPETCPSW 180
           ITG I S  NHTNDFSMLEK+ADEAWTLGL AWEE+DKFGL+ETAESSILEGKPE+CPSW
Sbjct: 121 ITGNISSLLNHTNDFSMLEKMADEAWTLGLMAWEEIDKFGLNETAESSILEGKPESCPSW 180

Query: 181 ISTDGKKMLEGDGIMFLPCGLAAGSSITIIGTPHHAHQEYVPQLLKVGADPMVMVSQFMV 240
           ISTDGKK++EGDG+MFLPCGLAAGSSITIIGTPH AHQEYVPQLLKVG DP VMVSQFMV
Sbjct: 181 ISTDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPNVMVSQFMV 240

Query: 241 ELQGLKAVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGVPSSNDDEM 300
           ELQGLK+VDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDG+PSS+DDEM
Sbjct: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM 300

Query: 301 LVDGNRRCEKWVRSDIIDSKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 360
           LVDGNRRCEKW+RSD+ D+KESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV
Sbjct: 301 LVDGNRRCEKWLRSDVTDTKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 360

Query: 361 DGYHINVGGRHLTSFPYRLGFTLEDATGLAVKGDVDIHSAYATSLPTTHPSFSPQRVLEM 420
           DGYHINVGGRHLTSF YR GFTLEDATGLAVKGDVDIHS YATSLPT+HPSFSPQRVLEM
Sbjct: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPQRVLEM 420

Query: 421 SEKWKSQPLPKSSVRLFIGVLSATNHFAERMAVRKTWMQSSAVKASNVVVRFFVALNPRK 480
           SEKWKSQPLPKSSV LFIGVLSATNHFAERMAVRKTWMQSSAVK+SNVVVRFFVALNPRK
Sbjct: 421 SEKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVALNPRK 480

Query: 481 EVNAVLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGTVNLTASYVMKCDDDTFVRVD 540
           EVNAVLK+EA YFGDIVILPFMDRYELVVLKTIAICEFG VNLTASY+MKCDDDTFVRV+
Sbjct: 481 EVNAVLKREAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFVRVE 540

Query: 541 TVLKQISGVSSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIISSDIA 600
           TVLKQI G+SSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYI+S DIA
Sbjct: 541 TVLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIA 600

Query: 601 KYIVSQHENRSLRIFKMEDVSMGMWVEQFNGTVAAVQYSHNWKFCQYGCMEDYFTAHYQS 660
           KYIVSQHENRSLRIFKMEDVSMGMWVEQFN TVA VQYSHNWKFCQYGCMEDYFTAHYQS
Sbjct: 601 KYIVSQHENRSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQS 660

Query: 661 PRQIICLWDKLGQGHAHCCNFR 683
           PRQI+CLWDKL +GHAHCCNFR
Sbjct: 661 PRQILCLWDKLARGHAHCCNFR 681

BLAST of Moc01g33540 vs. ExPASy TrEMBL
Match: A0A5A7UAW9 (Hydroxyproline O-galactosyltransferase GALT2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold120G001140 PE=3 SV=1)

HSP 1 Score: 1299.3 bits (3361), Expect = 0.0e+00
Identity = 623/682 (91.35%), Positives = 655/682 (96.04%), Query Frame = 0

Query: 1   MKKLKTEPPGARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNVGLDGTIGG 60
           MKK+KTEPP ARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESN GLD   G 
Sbjct: 1   MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGLDSN-GV 60

Query: 61  DSEGVDFSRASLSSVYKDTFHRKLEDNQNREAPVTPKKEPLEDVNNVSGPIKPIKHKYGR 120
           DSEG+DFS+ASLSSVYKDTFHRKLEDN++ EAP+TPKKEPLE+VNNV+GPIKPI+HKYGR
Sbjct: 61  DSEGMDFSKASLSSVYKDTFHRKLEDNEHLEAPLTPKKEPLEEVNNVTGPIKPIQHKYGR 120

Query: 121 ITGKILSHQNHTNDFSMLEKLADEAWTLGLKAWEEVDKFGLDETAESSILEGKPETCPSW 180
           ITG I S  NHTNDFSMLEK+ADEAWTLGL AWEE+DKFGL+ETAESSILEGKPE+CPSW
Sbjct: 121 ITGNISSLLNHTNDFSMLEKMADEAWTLGLMAWEEIDKFGLNETAESSILEGKPESCPSW 180

Query: 181 ISTDGKKMLEGDGIMFLPCGLAAGSSITIIGTPHHAHQEYVPQLLKVGADPMVMVSQFMV 240
           ISTDGKK++EGDG+MFLPCGLAAGSSITIIGTPH AHQEYVPQLLKVG DP VMVSQFMV
Sbjct: 181 ISTDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPNVMVSQFMV 240

Query: 241 ELQGLKAVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGVPSSNDDEM 300
           ELQGLK+VDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDG+PSS+DDEM
Sbjct: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSDDEM 300

Query: 301 LVDGNRRCEKWVRSDIIDSKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 360
           LVDGNRRCEKW+RSD+ D+KESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV
Sbjct: 301 LVDGNRRCEKWLRSDVTDTKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 360

Query: 361 DGYHINVGGRHLTSFPYRLGFTLEDATGLAVKGDVDIHSAYATSLPTTHPSFSPQRVLEM 420
           DGYHINVGGRHLTSF YR GFTLEDATGLAVKGDVDIHS YATSLPT+HPSFSPQRVLEM
Sbjct: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATSLPTSHPSFSPQRVLEM 420

Query: 421 SEKWKSQPLPKSSVRLFIGVLSATNHFAERMAVRKTWMQSSAVKASNVVVRFFVALNPRK 480
           SEKWKSQPLPKSSV LFIGVLSATNHFAERMAVRKTWMQSSAVK+SNVVVRFFVALNPRK
Sbjct: 421 SEKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVKSSNVVVRFFVALNPRK 480

Query: 481 EVNAVLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGTVNLTASYVMKCDDDTFVRVD 540
           EVNAVLK+EA YFGDIVILPFMDRYELVVLKTIAICEFG +NLTASY+MKCDDDTFVRV+
Sbjct: 481 EVNAVLKREAAYFGDIVILPFMDRYELVVLKTIAICEFGVMNLTASYIMKCDDDTFVRVE 540

Query: 541 TVLKQISGVSSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIISSDIA 600
           TVLKQI G+SSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYI+S DIA
Sbjct: 541 TVLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIA 600

Query: 601 KYIVSQHENRSLRIFKMEDVSMGMWVEQFNGTVAAVQYSHNWKFCQYGCMEDYFTAHYQS 660
           KYIVSQHENRSLRIFKMEDVSMGMWVEQFN TVA VQYSHNWKFCQYGCMEDYFTAHYQS
Sbjct: 601 KYIVSQHENRSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQS 660

Query: 661 PRQIICLWDKLGQGHAHCCNFR 683
           PRQI+CLWDKL +GHAHCCNFR
Sbjct: 661 PRQILCLWDKLARGHAHCCNFR 681

BLAST of Moc01g33540 vs. ExPASy TrEMBL
Match: A0A0A0KKS2 (Galectin domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G524710 PE=3 SV=1)

HSP 1 Score: 1292.3 bits (3343), Expect = 0.0e+00
Identity = 620/682 (90.91%), Positives = 651/682 (95.45%), Query Frame = 0

Query: 1   MKKLKTEPPGARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNVGLDGTIGG 60
           MKK+KTEPP ARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESN GLD   G 
Sbjct: 1   MKKVKTEPPVARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNNGLDSN-GV 60

Query: 61  DSEGVDFSRASLSSVYKDTFHRKLEDNQNREAPVTPKKEPLEDVNNVSGPIKPIKHKYGR 120
           DSEG+DFS+ASLSSVYKDTFHRKLEDNQ+ EAP+TPKKEPLE+VNNV+GPIKPIKHKYGR
Sbjct: 61  DSEGMDFSKASLSSVYKDTFHRKLEDNQHLEAPLTPKKEPLEEVNNVTGPIKPIKHKYGR 120

Query: 121 ITGKILSHQNHTNDFSMLEKLADEAWTLGLKAWEEVDKFGLDETAESSILEGKPETCPSW 180
           ITG I S  NHTNDFSMLE +ADEAWTLG  AWEEVDKFGL+ET+ESSILEGKPE+CPSW
Sbjct: 121 ITGNISSQLNHTNDFSMLETMADEAWTLGSMAWEEVDKFGLNETSESSILEGKPESCPSW 180

Query: 181 ISTDGKKMLEGDGIMFLPCGLAAGSSITIIGTPHHAHQEYVPQLLKVGADPMVMVSQFMV 240
           ISTDGKK++EGDG+MFLPCGLAAGSSITIIGTPH AHQEYVPQLLKVG DP VMVSQFMV
Sbjct: 181 ISTDGKKLMEGDGLMFLPCGLAAGSSITIIGTPHLAHQEYVPQLLKVGGDPKVMVSQFMV 240

Query: 241 ELQGLKAVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGVPSSNDDEM 300
           ELQGLK+VDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDG+PSS++DEM
Sbjct: 241 ELQGLKSVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGLPSSSEDEM 300

Query: 301 LVDGNRRCEKWVRSDIIDSKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 360
           LVDGN RCEKW+RSD+ DSKESKTTSWF+RFIGREQKPEVTWPFPFMEGRLFILTLRAGV
Sbjct: 301 LVDGNHRCEKWLRSDVTDSKESKTTSWFRRFIGREQKPEVTWPFPFMEGRLFILTLRAGV 360

Query: 361 DGYHINVGGRHLTSFPYRLGFTLEDATGLAVKGDVDIHSAYATSLPTTHPSFSPQRVLEM 420
           DGYHINVGGRHLTSF YR GFTLEDATGLAVKGDVDIHS YAT+LPT+HPSFSPQRVLEM
Sbjct: 361 DGYHINVGGRHLTSFAYRPGFTLEDATGLAVKGDVDIHSTYATALPTSHPSFSPQRVLEM 420

Query: 421 SEKWKSQPLPKSSVRLFIGVLSATNHFAERMAVRKTWMQSSAVKASNVVVRFFVALNPRK 480
           SEKWKSQPLPKSSV LFIGVLSATNHFAERMAVRKTWMQSSAV +SNVVVRFFVALNPRK
Sbjct: 421 SEKWKSQPLPKSSVFLFIGVLSATNHFAERMAVRKTWMQSSAVMSSNVVVRFFVALNPRK 480

Query: 481 EVNAVLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGTVNLTASYVMKCDDDTFVRVD 540
           EVNAVLKKEA YFGDIVILPFMDRYELVVLKTIAICEFG VNLTASY+MKCDDDTFVRV+
Sbjct: 481 EVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVVNLTASYIMKCDDDTFVRVE 540

Query: 541 TVLKQISGVSSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIISSDIA 600
           TVLKQI G+SSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYI+S DIA
Sbjct: 541 TVLKQIEGISSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIVSIDIA 600

Query: 601 KYIVSQHENRSLRIFKMEDVSMGMWVEQFNGTVAAVQYSHNWKFCQYGCMEDYFTAHYQS 660
           KYIVSQHEN+SLRIFKMEDVSMGMWVEQFN TVA VQYSHNWKFCQYGCMEDYFTAHYQS
Sbjct: 601 KYIVSQHENKSLRIFKMEDVSMGMWVEQFNSTVATVQYSHNWKFCQYGCMEDYFTAHYQS 660

Query: 661 PRQIICLWDKLGQGHAHCCNFR 683
           PRQI+CLWDKL +GHAHCCNFR
Sbjct: 661 PRQILCLWDKLARGHAHCCNFR 681

BLAST of Moc01g33540 vs. TAIR 10
Match: AT4G21060.2 (Galactosyltransferase family protein)

HSP 1 Score: 949.9 bits (2454), Expect = 1.1e-276
Identity = 462/691 (66.86%), Positives = 551/691 (79.74%), Query Frame = 0

Query: 1   MKKLKTEP----PGARRLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNVGLDG 60
           MK++K+E       +RR +LSH LL I   YLVF++FKFP F+E+ A LSGD    GLDG
Sbjct: 1   MKRVKSESFRGVYSSRRFKLSHFLLAIAGFYLVFLAFKFPHFIEMVAMLSGD---TGLDG 60

Query: 61  TIGGDSEGVDFSRASLSSVYKDTFHRKLEDNQNREAPVTPKKEPLEDVNNVSGPIKPIKH 120
            +   S  V  S     S+  D  +RKLED  ++  P T +K   E+  N S  I+P+  
Sbjct: 61  ALSDTSLDVSLS----GSLRNDMLNRKLEDEDHQSGPSTTQKVSPEEKINGSKQIQPLLF 120

Query: 121 KYGRITGKILSHQNHTNDFSMLEKLADEAWTLGLKAWEEVDKFGLDETAES-SILEGKPE 180
           +YGRI+G+++  +N T   S  E++ADEAW LG KAWE+VDKF +D+  ES SI EGK E
Sbjct: 121 RYGRISGEVMRRRNRTIHMSPFERMADEAWILGSKAWEDVDKFEVDKINESASIFEGKVE 180

Query: 181 TCPSWISTDGKKMLEGDGIMFLPCGLAAGSSITIIGTPHHAHQEYVPQLLKVGAD-PMVM 240
           +CPS IS +G  + + + IM LPCGLAAGSSITI+GTP +AH+E VPQ  ++     MV+
Sbjct: 181 SCPSQISMNGDDLNKANRIMLLPCGLAAGSSITILGTPQYAHKESVPQRSRLTRSYGMVL 240

Query: 241 VSQFMVELQGLKAVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGVPS 300
           VSQFMVELQGLK  DGE PPKILHLNPR+KGDW+ RPVIEHNTCYRMQWG AQRCDG PS
Sbjct: 241 VSQFMVELQGLKTGDGEYPPKILHLNPRIKGDWNHRPVIEHNTCYRMQWGVAQRCDGTPS 300

Query: 301 SNDDEMLVDGNRRCEKWVRSDII---DSKESKTTSWFKRFIGREQKPEVTWPFPFMEGRL 360
             D ++LVDG RRCEKW ++DII   DSKESKTTSWFKRFIGREQKPEVTW FPF EG++
Sbjct: 301 KKDADVLVDGFRRCEKWTQNDIIDMVDSKESKTTSWFKRFIGREQKPEVTWSFPFAEGKV 360

Query: 361 FILTLRAGVDGYHINVGGRHLTSFPYRLGFTLEDATGLAVKGDVDIHSAYATSLPTTHPS 420
           F+LTLRAG+DG+HINVGGRH++SFPYR GFT+EDATGLAV GDVDIHS +ATSL T+HPS
Sbjct: 361 FVLTLRAGIDGFHINVGGRHVSSFPYRPGFTIEDATGLAVTGDVDIHSIHATSLSTSHPS 420

Query: 421 FSPQRVLEMSEKWKSQPLPKSSVRLFIGVLSATNHFAERMAVRKTWMQSSAVKASNVVVR 480
           FSPQ+ +E S +WK+ PLP +  RLF+GVLSATNHF+ERMAVRKTWMQ  ++K+S+VV R
Sbjct: 421 FSPQKAIEFSSEWKAPPLPGTPFRLFMGVLSATNHFSERMAVRKTWMQHPSIKSSDVVAR 480

Query: 481 FFVALNPRKEVNAVLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGTVNLTASYVMKC 540
           FFVALNPRKEVNA+LKKEAEYFGDIVILPFMDRYELVVLKTIAICEFG  N+TA Y+MKC
Sbjct: 481 FFVALNPRKEVNAMLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGVQNVTAPYIMKC 540

Query: 541 DDDTFVRVDTVLKQISGVSSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGP 600
           DDDTF+RV+++LKQI GVS +KSLYMGNLNL HRPLR GKW VT+EEWPE VYPPYANGP
Sbjct: 541 DDDTFIRVESILKQIDGVSPEKSLYMGNLNLRHRPLRTGKWTVTWEEWPEAVYPPYANGP 600

Query: 601 GYIISSDIAKYIVSQHENRSLRIFKMEDVSMGMWVEQFNGTVAAVQYSHNWKFCQYGCME 660
           GYIISS+IAKYIVSQ+    LR+FKMEDVSMG+WVEQFN ++  V+YSH+WKFCQYGC  
Sbjct: 601 GYIISSNIAKYIVSQNSRHKLRLFKMEDVSMGLWVEQFNASMQPVEYSHSWKFCQYGCTL 660

Query: 661 DYFTAHYQSPRQIICLWDKLGQGHAHCCNFR 683
           +Y+TAHYQSP Q++CLWD L +G   CCNFR
Sbjct: 661 NYYTAHYQSPSQMMCLWDNLLKGRPQCCNFR 684

BLAST of Moc01g33540 vs. TAIR 10
Match: AT4G21060.1 (Galactosyltransferase family protein)

HSP 1 Score: 934.1 bits (2413), Expect = 6.4e-272
Identity = 450/661 (68.08%), Positives = 534/661 (80.79%), Query Frame = 0

Query: 27  YLVFISFKFPRFLEIAATLSGDESNVGLDGTIGGDSEGVDFSRASLSSVYKDTFHRKLED 86
           YLVF++FKFP F+E+ A LSGD    GLDG +   S  V  S     S+  D  +RKLED
Sbjct: 88  YLVFLAFKFPHFIEMVAMLSGD---TGLDGALSDTSLDVSLS----GSLRNDMLNRKLED 147

Query: 87  NQNREAPVTPKKEPLEDVNNVSGPIKPIKHKYGRITGKILSHQNHTNDFSMLEKLADEAW 146
             ++  P T +K   E+  N S  I+P+  +YGRI+G+++  +N T   S  E++ADEAW
Sbjct: 148 EDHQSGPSTTQKVSPEEKINGSKQIQPLLFRYGRISGEVMRRRNRTIHMSPFERMADEAW 207

Query: 147 TLGLKAWEEVDKFGLDETAES-SILEGKPETCPSWISTDGKKMLEGDGIMFLPCGLAAGS 206
            LG KAWE+VDKF +D+  ES SI EGK E+CPS IS +G  + + + IM LPCGLAAGS
Sbjct: 208 ILGSKAWEDVDKFEVDKINESASIFEGKVESCPSQISMNGDDLNKANRIMLLPCGLAAGS 267

Query: 207 SITIIGTPHHAHQEYVPQLLKVGAD-PMVMVSQFMVELQGLKAVDGEDPPKILHLNPRLK 266
           SITI+GTP +AH+E VPQ  ++     MV+VSQFMVELQGLK  DGE PPKILHLNPR+K
Sbjct: 268 SITILGTPQYAHKESVPQRSRLTRSYGMVLVSQFMVELQGLKTGDGEYPPKILHLNPRIK 327

Query: 267 GDWSKRPVIEHNTCYRMQWGTAQRCDGVPSSNDDEMLVDGNRRCEKWVRSDII---DSKE 326
           GDW+ RPVIEHNTCYRMQWG AQRCDG PS  D ++LVDG RRCEKW ++DII   DSKE
Sbjct: 328 GDWNHRPVIEHNTCYRMQWGVAQRCDGTPSKKDADVLVDGFRRCEKWTQNDIIDMVDSKE 387

Query: 327 SKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTLRAGVDGYHINVGGRHLTSFPYRLGF 386
           SKTTSWFKRFIGREQKPEVTW FPF EG++F+LTLRAG+DG+HINVGGRH++SFPYR GF
Sbjct: 388 SKTTSWFKRFIGREQKPEVTWSFPFAEGKVFVLTLRAGIDGFHINVGGRHVSSFPYRPGF 447

Query: 387 TLEDATGLAVKGDVDIHSAYATSLPTTHPSFSPQRVLEMSEKWKSQPLPKSSVRLFIGVL 446
           T+EDATGLAV GDVDIHS +ATSL T+HPSFSPQ+ +E S +WK+ PLP +  RLF+GVL
Sbjct: 448 TIEDATGLAVTGDVDIHSIHATSLSTSHPSFSPQKAIEFSSEWKAPPLPGTPFRLFMGVL 507

Query: 447 SATNHFAERMAVRKTWMQSSAVKASNVVVRFFVALNPRKEVNAVLKKEAEYFGDIVILPF 506
           SATNHF+ERMAVRKTWMQ  ++K+S+VV RFFVALNPRKEVNA+LKKEAEYFGDIVILPF
Sbjct: 508 SATNHFSERMAVRKTWMQHPSIKSSDVVARFFVALNPRKEVNAMLKKEAEYFGDIVILPF 567

Query: 507 MDRYELVVLKTIAICEFGTVNLTASYVMKCDDDTFVRVDTVLKQISGVSSKKSLYMGNLN 566
           MDRYELVVLKTIAICEFG  N+TA Y+MKCDDDTF+RV+++LKQI GVS +KSLYMGNLN
Sbjct: 568 MDRYELVVLKTIAICEFGVQNVTAPYIMKCDDDTFIRVESILKQIDGVSPEKSLYMGNLN 627

Query: 567 LLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIISSDIAKYIVSQHENRSLRIFKMEDVS 626
           L HRPLR GKW VT+EEWPE VYPPYANGPGYIISS+IAKYIVSQ+    LR+FKMEDVS
Sbjct: 628 LRHRPLRTGKWTVTWEEWPEAVYPPYANGPGYIISSNIAKYIVSQNSRHKLRLFKMEDVS 687

Query: 627 MGMWVEQFNGTVAAVQYSHNWKFCQYGCMEDYFTAHYQSPRQIICLWDKLGQGHAHCCNF 683
           MG+WVEQFN ++  V+YSH+WKFCQYGC  +Y+TAHYQSP Q++CLWD L +G   CCNF
Sbjct: 688 MGLWVEQFNASMQPVEYSHSWKFCQYGCTLNYYTAHYQSPSQMMCLWDNLLKGRPQCCNF 741

BLAST of Moc01g33540 vs. TAIR 10
Match: AT5G62620.1 (Galactosyltransferase family protein)

HSP 1 Score: 708.0 bits (1826), Expect = 7.4e-204
Identity = 356/684 (52.05%), Positives = 452/684 (66.08%), Query Frame = 0

Query: 15  RLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNVGLDGTIGGDSEGVDFSRASLSS 74
           R   +L+ +G+LY++ I+F+ P   +                           S  S   
Sbjct: 25  RSVQILMAVGLLYMLLITFEIPFVFK------------------------TGLSSLSQDP 84

Query: 75  VYKDTFHRKLEDNQNREAPVTPKKEPLEDVNNVSGPIKPIKHKYGRITGKILSHQNHTND 134
           + +   H    + Q R AP  P K  L   +    P + ++ +  RI   +       N 
Sbjct: 85  LTRPEKHNSQRELQERRAPTRPLKSLLYQESQSESPAQGLRRR-TRILSSLRFDPETFNP 144

Query: 135 FSM-----LEKLADEAWTLGLKAWEEVDK----FGLDETAESSILEGKPETCPSWISTDG 194
            S      L K A  AW +G K WEE++       L++  +  I E    +C   +S  G
Sbjct: 145 SSKDGSVELHKSAKVAWEVGRKIWEELESGKTLKALEKEKKKKIEEHGTNSCSLSVSLTG 204

Query: 195 KKMLEGDGIMFLPCGLAAGSSITIIGTPHHAHQEYVPQ--LLKVGADPMVMVSQFMVELQ 254
             +L+   IM LPCGL  GS IT++G P  AH E  P+  +LK G D  V VSQF +ELQ
Sbjct: 205 SDLLKRGNIMELPCGLTLGSHITVVGKPRAAHSEKDPKISMLKEG-DEAVKVSQFKLELQ 264

Query: 255 GLKAVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGVPSSNDDEMLVD 314
           GLKAV+GE+PP+ILHLNPRLKGDWS +PVIE NTCYRMQWG+AQRC+G   S DDE  VD
Sbjct: 265 GLKAVEGEEPPRILHLNPRLKGDWSGKPVIEQNTCYRMQWGSAQRCEGW-RSRDDEETVD 324

Query: 315 GNRRCEKWVRSDIIDSKESKTTS----WFKRFIGREQKPEVTWPFPFMEGRLFILTLRAG 374
           G  +CEKW R D I SKE +++     W  R IGR +K  V WPFPF   +LF+LTL AG
Sbjct: 325 GQVKCEKWARDDSITSKEEESSKAASWWLSRLIGRSKKVTVEWPFPFTVDKLFVLTLSAG 384

Query: 375 VDGYHINVGGRHLTSFPYRLGFTLEDATGLAVKGDVDIHSAYATSLPTTHPSFSPQRVLE 434
           ++GYH++V G+H+TSFPYR GFTLEDATGL + GD+D+HS +A SLPT+HPSFSPQR LE
Sbjct: 385 LEGYHVSVDGKHVTSFPYRTGFTLEDATGLTINGDIDVHSVFAGSLPTSHPSFSPQRHLE 444

Query: 435 MSEKWKSQPLPKSSVRLFIGVLSATNHFAERMAVRKTWMQSSAVKASNVVVRFFVALNPR 494
           +S  W++  LP   V +FIG+LSA NHFAERMAVR++WMQ   VK+S VV RFFVAL+ R
Sbjct: 445 LSSNWQAPSLPDEQVDMFIGILSAGNHFAERMAVRRSWMQHKLVKSSKVVARFFVALHSR 504

Query: 495 KEVNAVLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGTVNLTASYVMKCDDDTFVRV 554
           KEVN  LKKEAE+FGDIVI+P+MD Y+LVVLKT+AICE+G   L A ++MKCDDDTFV+V
Sbjct: 505 KEVNVELKKEAEFFGDIVIVPYMDSYDLVVLKTVAICEYGAHQLAAKFIMKCDDDTFVQV 564

Query: 555 DTVLKQISGVSSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIISSDI 614
           D VL +     + +SLY+GN+N  H+PLR GKW+VTYEEWPEE YPPYANGPGYI+S+DI
Sbjct: 565 DAVLSEAKKTPTDRSLYIGNINYYHKPLRQGKWSVTYEEWPEEDYPPYANGPGYILSNDI 624

Query: 615 AKYIVSQHENRSLRIFKMEDVSMGMWVEQFNGTVAAVQYSHNWKFCQYGCMEDYFTAHYQ 674
           +++IV + E   LR+FKMEDVS+GMWVEQFN     V Y H+ +FCQ+GC+E+Y TAHYQ
Sbjct: 625 SRFIVKEFEKHKLRMFKMEDVSVGMWVEQFNNGTKPVDYIHSLRFCQFGCIENYLTAHYQ 681

Query: 675 SPRQIICLWDKLG-QGHAHCCNFR 683
           SPRQ+ICLWDKL   G   CCN R
Sbjct: 685 SPRQMICLWDKLVLTGKPQCCNMR 681

BLAST of Moc01g33540 vs. TAIR 10
Match: AT1G74800.1 (Galactosyltransferase family protein)

HSP 1 Score: 696.0 bits (1795), Expect = 2.9e-200
Identity = 354/687 (51.53%), Positives = 460/687 (66.96%), Query Frame = 0

Query: 15  RLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDESNVGLDGTIGGDSEGVDFSRASLSS 74
           R   +++ IG LYLV +S + P   +     S   S+V LD               +LS 
Sbjct: 25  RSVRVIMAIGFLYLVIVSVEIPLVFK-----SWSSSSVPLD---------------ALSR 84

Query: 75  VYKDTFHRKLEDNQNREAPVTPKKEPLEDVN-NVSGPI----------KPIKHKYGRITG 134
           +       KL + Q  +  + P   PLE V+  VS P           K  +H  G ++ 
Sbjct: 85  L------EKLNNEQEPQVEIIP-NPPLEPVSYPVSNPTIVTRTDLVQNKVREHHRGVLSS 144

Query: 135 KILSHQNHTNDFSM------LEKLADEAWTLGLKAWEEVDKFGLDETAESSILEGKPETC 194
             L   + T D S       L K A EAW LG K W+E++   L++  E    + KP++C
Sbjct: 145 --LRFDSETFDPSSKDGSVELHKSAKEAWQLGRKLWKELESGRLEKLVEKP-EKNKPDSC 204

Query: 195 PSWISTDGKKMLEGDG-IMFLPCGLAAGSSITIIGTPHHAHQEYVPQLLKVGADPMVMVS 254
           P  +S  G + +  +  +M LPCGL  GS IT++G P  AH +          D   +VS
Sbjct: 205 PHSVSLTGSEFMNRENKLMELPCGLTLGSHITLVGRPRKAHPK--------EGDWSKLVS 264

Query: 255 QFMVELQGLKAVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGVPSSN 314
           QF++ELQGLK V+GEDPP+ILH NPRLKGDWSK+PVIE N+CYRMQWG AQRC+G   S 
Sbjct: 265 QFVIELQGLKTVEGEDPPRILHFNPRLKGDWSKKPVIEQNSCYRMQWGPAQRCEGW-KSR 324

Query: 315 DDEMLVDGNRRCEKWVRSDIIDSKESKTTSWFKRFIGREQKPEVTWPFPFMEGRLFILTL 374
           DDE  VD + +CEKW+R D   S+ S+   W  R IGR ++ +V WPFPF+E +LF+LTL
Sbjct: 325 DDEETVDSHVKCEKWIRDDDNYSEGSRARWWLNRLIGRRKRVKVEWPFPFVEEKLFVLTL 384

Query: 375 RAGVDGYHINVGGRHLTSFPYRLGFTLEDATGLAVKGDVDIHSAYATSLPTTHPSFSPQR 434
            AG++GYHINV G+H+TSFPYR GFTLEDATGL V GD+D+HS +  SLPT+HPSF+PQR
Sbjct: 385 SAGLEGYHINVDGKHVTSFPYRTGFTLEDATGLTVNGDIDVHSVFVASLPTSHPSFAPQR 444

Query: 435 VLEMSEKWKSQPLPKSSVRLFIGVLSATNHFAERMAVRKTWMQSSAVKASNVVVRFFVAL 494
            LE+S++W++  +P   V +FIG+LSA NHF+ERMAVRK+WMQ   + ++ VV RFFVAL
Sbjct: 445 HLELSKRWQAPVVPDGPVEIFIGILSAGNHFSERMAVRKSWMQHVLITSAKVVARFFVAL 504

Query: 495 NPRKEVNAVLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGTVNLTASYVMKCDDDTF 554
           + RKEVN  LKKEAEYFGDIV++P+MD Y+LVVLKT+AICE G +  +A Y+MKCDDDTF
Sbjct: 505 HGRKEVNVELKKEAEYFGDIVLVPYMDSYDLVVLKTVAICEHGALAFSAKYIMKCDDDTF 564

Query: 555 VRVDTVLKQISGVSSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYPPYANGPGYIIS 614
           V++  V+ ++  V   +SLY+GN+N  H+PLR GKWAVTYEEWPEE YPPYANGPGY++S
Sbjct: 565 VKLGAVINEVKKVPEGRSLYIGNMNYYHKPLRGGKWAVTYEEWPEEDYPPYANGPGYVLS 624

Query: 615 SDIAKYIVSQHENRSLRIFKMEDVSMGMWVEQFNGTVAAVQYSHNWKFCQYGCMEDYFTA 674
           SDIA++IV + E   LR+FKMEDVS+GMWVE F  T   V Y H+ +FCQ+GC+E+Y+TA
Sbjct: 625 SDIARFIVDKFERHKLRLFKMEDVSVGMWVEHFKNTTNPVDYRHSLRFCQFGCVENYYTA 672

Query: 675 HYQSPRQIICLWDK-LGQGHAHCCNFR 683
           HYQSPRQ+ICLWDK L Q    CCN R
Sbjct: 685 HYQSPRQMICLWDKLLRQNKPECCNMR 672

BLAST of Moc01g33540 vs. TAIR 10
Match: AT1G27120.1 (Galactosyltransferase family protein )

HSP 1 Score: 695.3 bits (1793), Expect = 5.0e-200
Identity = 362/698 (51.86%), Positives = 463/698 (66.33%), Query Frame = 0

Query: 1   MKKLKTEPPGAR-RLRLSHLLLVIGVLYLVFISFKFPRFLEIAATLSGDE--SNVGLDG- 60
           MKK K +   ++ R  L   LLV+ + Y + +SF+ P      +    D+  S+   D  
Sbjct: 1   MKKSKLDNSSSQIRFGLVQFLLVVLLFYFLCMSFEIPFIFRTGSGSGSDDVSSSSFADAL 60

Query: 61  ----TIGGDSEGVDFSRASLSSVYKDTFHRKLEDNQNREAPVTPKKEPLEDVNNVSGPIK 120
                +GG S   ++    +    +   HR  +D    +  +  +K  + +  +VS    
Sbjct: 61  PRPMVVGGGSREANW---VVGEEEEADPHRHFKDPGRVQLRLPERK--MREFKSVSEIF- 120

Query: 121 PIKHKYGRITGKILSHQNHTNDFSMLEKLADEAWTLGLKAWEEVDKFGLDETAESSILEG 180
                   +      +   +++FS+  K A  A ++G K W+ +D  GL +  ++ + + 
Sbjct: 121 --------VNESFFDNGGFSDEFSIFHKTAKHAISMGRKMWDGLDS-GLIKPDKAPV-KT 180

Query: 181 KPETCPSWISTDGKKMLEGDGIMFLPCGLAAGSSITIIGTPHHAHQEYVPQLLKVGADPM 240
           + E CP  +S    + +    I+ LPCGL  GS IT++ TPH AH E          D  
Sbjct: 181 RIEKCPDMVSVSESEFVNRSRILVLPCGLTLGSHITVVATPHWAHVE-------KDGDKT 240

Query: 241 VMVSQFMVELQGLKAVDGEDPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGTAQRCDGV 300
            MVSQFM+ELQGLKAVDGEDPP+ILH NPR+KGDWS RPVIE NTCYRMQWG+  RCDG 
Sbjct: 241 AMVSQFMMELQGLKAVDGEDPPRILHFNPRIKGDWSGRPVIEQNTCYRMQWGSGLRCDGR 300

Query: 301 PSSNDDEMLVDGNRRCEKWVRSDI------IDSKESKTTSWFKRFIGREQKPEV-TWPFP 360
            SS DDE  VDG  +CE+W R D        D  ESK T W  R +GR +K     W +P
Sbjct: 301 ESS-DDEEYVDGEVKCERWKRDDDDGGNNGDDFDESKKTWWLNRLMGRRKKMITHDWDYP 360

Query: 361 FMEGRLFILTLRAGVDGYHINVGGRHLTSFPYRLGFTLEDATGLAVKGDVDIHSAYATSL 420
           F EG+LF+LTLRAG++GYHI+V GRH+TSFPYR GF LEDATGLAVKG++D+HS YA SL
Sbjct: 361 FAEGKLFVLTLRAGMEGYHISVNGRHITSFPYRTGFVLEDATGLAVKGNIDVHSVYAASL 420

Query: 421 PTTHPSFSPQRVLEMSEKWKSQPLPKSSVRLFIGVLSATNHFAERMAVRKTWMQSSAVKA 480
           P+T+PSF+PQ+ LEM   WK+  LP+  V LFIG+LSA NHFAERMAVRK+WMQ   V++
Sbjct: 421 PSTNPSFAPQKHLEMQRIWKAPSLPQKPVELFIGILSAGNHFAERMAVRKSWMQQKLVRS 480

Query: 481 SNVVVRFFVALNPRKEVNAVLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGTVNLTA 540
           S VV RFFVAL+ RKEVN  LKKEAEYFGDIVI+P+MD Y+LVVLKT+AICE+G   + A
Sbjct: 481 SKVVARFFVALHARKEVNVDLKKEAEYFGDIVIVPYMDHYDLVVLKTVAICEYGVNTVAA 540

Query: 541 SYVMKCDDDTFVRVDTVLKQISGVSSKKSLYMGNLNLLHRPLRHGKWAVTYEEWPEEVYP 600
            YVMKCDDDTFVRVD V+++   V  ++SLY+GN+N  H+PLR GKWAVT+EEWPEE YP
Sbjct: 541 KYVMKCDDDTFVRVDAVIQEAEKVKGRESLYIGNINFNHKPLRTGKWAVTFEEWPEEYYP 600

Query: 601 PYANGPGYIISSDIAKYIVSQHENRSLRIFKMEDVSMGMWVEQFNGTVAAVQYSHNWKFC 660
           PYANGPGYI+S D+AK+IV   E + LR+FKMEDVSMGMWVE+FN T   V   H+ KFC
Sbjct: 601 PYANGPGYILSYDVAKFIVDDFEQKRLRLFKMEDVSMGMWVEKFNET-RPVAVVHSLKFC 660

Query: 661 QYGCMEDYFTAHYQSPRQIICLWDKLGQ-GHAHCCNFR 683
           Q+GC+EDYFTAHYQSPRQ+IC+WDKL + G   CCN R
Sbjct: 661 QFGCIEDYFTAHYQSPRQMICMWDKLQRLGKPQCCNMR 673

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_022146632.1 | 0.0e+00 | 100.00 | hydroxyproline O-galactosyltransferase GALT2 [Momordica charantia]
XP_038882369.1 | 0.0e+00 | 92.23 | hydroxyproline O-galactosyltransferase GALT2 [Benincasa hispida]
XP_008439584.1 | 0.0e+00 | 91.50 | PREDICTED: hydroxyproline O-galactosyltransferase GALT2 [Cucumis melo] >TYK13311...
KAA0052514.1 | 0.0e+00 | 91.35 | hydroxyproline O-galactosyltransferase GALT2 [Cucumis melo var. makuwa]
XP_011658301.1 | 0.0e+00 | 90.91 | hydroxyproline O-galactosyltransferase GALT2 [Cucumis sativus] >KGN49434.1 hypot...

Match Name | E-value | Identity (%) | Description
A7XDQ9 | 1.6e-275 | 66.86 | Hydroxyproline O-galactosyltransferase GALT2 OS=Arabidopsis thaliana OX=3702 GN=...
Q9LV16 | 1.0e-202 | 52.05 | Hydroxyproline O-galactosyltransferase GALT6 OS=Arabidopsis thaliana OX=3702 GN=...
Q8RX55 | 4.1e-199 | 51.53 | Hydroxyproline O-galactosyltransferase GALT5 OS=Arabidopsis thaliana OX=3702 GN=...
Q8GXG6 | 7.0e-199 | 51.86 | Hydroxyproline O-galactosyltransferase GALT4 OS=Arabidopsis thaliana OX=3702 GN=...
Q8L7F9 | 9.1e-90 | 37.08 | Beta-1,3-galactosyltransferase GALT1 OS=Arabidopsis thaliana OX=3702 GN=GALT1 PE...

Match Name | E-value | Identity (%) | Description
A0A6J1D042 | 0.0e+00 | 100.00 | hydroxyproline O-galactosyltransferase GALT2 OS=Momordica charantia OX=3673 GN=L...
A0A5D3CNA3 | 0.0e+00 | 91.50 | Hydroxyproline O-galactosyltransferase GALT2 OS=Cucumis melo var. makuwa OX=1194...
A0A1S3AZQ8 | 0.0e+00 | 91.50 | hydroxyproline O-galactosyltransferase GALT2 OS=Cucumis melo OX=3656 GN=LOC10348...
A0A5A7UAW9 | 0.0e+00 | 91.35 | Hydroxyproline O-galactosyltransferase GALT2 OS=Cucumis melo var. makuwa OX=1194...
A0A0A0KKS2 | 0.0e+00 | 90.91 | Galectin domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G524710 PE...

Match Name | E-value | Identity (%) | Description
AT4G21060.2 | 1.1e-276 | 66.86 | Galactosyltransferase family protein
AT4G21060.1 | 6.4e-272 | 68.08 | Galactosyltransferase family protein
AT5G62620.1 | 7.4e-204 | 52.05 | Galactosyltransferase family protein
AT1G74800.1 | 2.9e-200 | 51.53 | Galactosyltransferase family protein
AT1G27120.1 | 5.0e-200 | 51.86 | Galactosyltransferase family protein
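
Hit lists like these are often narrowed by E-value and percent identity before further analysis. The Python sketch below filters the five TAIR 10 hits listed above. The function `select_hits` and both thresholds are hypothetical choices for illustration and are not recommended by the source. E-values as small as 1.1e-276 still parse as ordinary floats, and a reported `0.0e+00` parses to 0.0.

```python
# (match name, E-value as printed, percent identity) for the TAIR 10 hits.
tair_hits = [
    ("AT4G21060.2", "1.1e-276", 66.86),
    ("AT4G21060.1", "6.4e-272", 68.08),
    ("AT5G62620.1", "7.4e-204", 52.05),
    ("AT1G74800.1", "2.9e-200", 51.53),
    ("AT1G27120.1", "5.0e-200", 51.86),
]


def select_hits(hits, max_evalue=1e-50, min_identity=60.0):
    """Keep hit names whose E-value and identity pass both thresholds.

    The default thresholds are illustrative assumptions only.
    """
    return [
        name
        for name, evalue, identity in hits
        if float(evalue) <= max_evalue and identity >= min_identity
    ]


print(select_hits(tair_hits))  # prints ['AT4G21060.2', 'AT4G21060.1']
```

With these example thresholds only the two AT4G21060 transcripts remain, consistent with their being the closest Arabidopsis matches in the table.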
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001079 | Galectin, carbohydrate recognition domain | SMART | SM00908 | Gal_bind_lectin_2 | coord: 197..402; e-value: 1.4E-26; score: 104.3
IPR001079 | Galectin, carbohydrate recognition domain | PFAM | PF00337 | Gal-bind_lectin | coord: 194..400; e-value: 1.2E-48; score: 164.0
IPR001079 | Galectin, carbohydrate recognition domain | PROSITE | PS51304 | GALECTIN | coord: 193..403; score: 31.256947
IPR001079 | Galectin, carbohydrate recognition domain | CDD | cd00070 | GLECT | coord: 196..399; e-value: 1.73067E-23; score: 94.2379
IPR002659 | Glycosyl transferase, family 31 | PFAM | PF01762 | Galactosyl_T | coord: 449..629; e-value: 1.2E-32; score: 113.3
IPR002659 | Glycosyl transferase, family 31 | PANTHER | PTHR11214 | BETA-1,3-N-ACETYLGLUCOSAMINYLTRANSFERASE | coord: 8..682
None | No IPR available | GENE3D | 3.90.550.50 | - | coord: 425..668; e-value: 3.7E-22; score: 80.8
None | No IPR available | GENE3D | 2.60.120.200 | - | coord: 194..302; e-value: 2.8E-12; score: 48.2
None | No IPR available | GENE3D | 2.60.120.200 | - | coord: 312..400; e-value: 4.6E-12; score: 47.5
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 85..106
None | No IPR available | PANTHER | PTHR11214:SF212 | HYDROXYPROLINE O-GALACTOSYLTRANSFERASE GALT2 | coord: 8..682
IPR013320 | Concanavalin A-like lectin/glucanase domain superfamily | SUPERFAMILY | 49899 | Concanavalin A-like lectins/glucanases | coord: 195..401
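
The `coord` values give the span of each domain match on the protein. The Python sketch below parses two of them and checks whether the Pfam galectin match (PF00337, 194..400) and the Pfam glycosyl transferase family 31 match (PF01762, 449..629) overlap. It assumes the coordinates are 1-based and inclusive, which the page does not state explicitly. The helper names are hypothetical.

```python
def parse_coord(text):
    """Parse 'coord: 194..400' into an inclusive (start, end) pair."""
    start, end = text.replace("coord:", "").strip().split("..")
    return int(start), int(end)


def span_length(span):
    """Length of an inclusive span, assuming 1-based coordinates."""
    return span[1] - span[0] + 1


def overlap(a, b):
    """Number of residues shared by two inclusive spans (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


galectin = parse_coord("coord: 194..400")   # PF00337, Gal-bind_lectin
glyco_t31 = parse_coord("coord: 449..629")  # PF01762, Galactosyl_T

print(span_length(galectin), span_length(glyco_t31), overlap(galectin, glyco_t31))
# prints 207 181 0
```

Under that assumption the two Pfam matches do not overlap: the galectin match ends at residue 400 and the glycosyl transferase match begins at residue 449.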

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Moc01g33540.1 | Moc01g33540.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006486 | protein glycosylation
cellular_component | GO:0000139 | Golgi membrane
cellular_component | GO:0016021 | integral component of membrane
cellular_component | GO:0016020 | membrane
molecular_function | GO:0030246 | carbohydrate binding
molecular_function | GO:0008378 | galactosyltransferase activity
molecular_function | GO:0016758 | hexosyltransferase activity