Bhi04G000009 (gene) Wax gourd

Name: Bhi04G000009
Type: gene
Organism: Benincasa hispida (Wax gourd)
Description: Hexosyltransferase
Location: chr4 : 366961 .. 373306 (+)
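The location string can be split into chromosome, coordinates, and strand to get the genomic span of the feature. The sketch below is illustrative only: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(text):
    """Parse 'chrom : start .. end (strand)' into (chrom, start, end, strand)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("chr4 : 366961 .. 373306 (+)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the feature spans 6346 bp on the forward strand of chr4.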
The following sequences are available for this feature:

Gene sequence (with intron)

CACAAGTTTCGTGACACATCGATGAACTCTTAATGTGACGTTTTCTTTAAAAATTATGAATTTTGAGTGGAAATTTGAACCCAGGACCCGAGAGCGACCTATAAGGGTAGAGAGGAAGCGGTTAGGCAAAGAAAGAAATCATTGAATGGCTGGAGGTAGGAACGGGCCAAGCCCATTTGTGTGATTGTGAGGACCTTCGAAACTGAATCTACTGGCAGCCAGCAGGCGAGATCTCAACGACTCTGTCTTCTTCTTCCTTTTGCAGTGAGTTTCCCATTACTGATTTCCTTCTTTCTTTCTGTTTCCCTTTTATGCCATTTTTGTCTCTGTAATTCATACATTGATTCTCACAACACCAACAATTTCTGCGTTTGAAGTCAAATCCCTTTCTCTGTCTCCATCTCGCTCTTGCTCCTCAAGATTGACGAGATGGAGAGTTTGCCCACCACCGCCAAACCCGAAAGGCGTCCTCGATCCAAGCCTCTTCACGCCTCCAAACCCTCCATTCTTCTGGCATTCTTCTCTTGTCTTGCGTGGCTCTATGTTGCTGGCAGGTATCTTCTTCTTTTTCTTCTATTTCGTAACAATGTTGGACGGCAGAATCTGAATCCGGGATATTAAGCCACAATCATACGGCACTTTCATTCTTGGGTCGTTTAGGCAGAATTGGATCTTCTTGTTGCATCCGATTATTTTTTCAATTGCAATTTTTGCATGATCTTATGATCAACAGGAAATGCATTCCCCGGGTGAGGTAGAGAGAGGGAGAAAATTGATTGTTAGGATTGCGGTATTCTATTGCATTTCACTTGACTTTTGCTTATAAATTCGTATATTCTGCTTATTGTTGGTCTGGTTTCTTCAGGCTCTGGCAAGATGCAGAAAATAGGAAAATACTTGCTAGTCTTTTACAGAAGAACGCATCCCAGGTATATTAGTTATGAGTACTTTCTCAATTCTGGGAGAAATATACTGGAATTGCTGCCAATTTTTGCATCTCTTTTAATTGAGTCGAAATTTTGAACGAGGACCGAACGTGTATTAGTTAATAGCTAGTATCCGATATTTCTATTTCTACATGTAATCGTTTTCCTCCACATCCAGCCATTAGTTTGCCATGTATCTGTAGATTTTGTATTCATTTTTTTAGCATGGAACCGTTAATCCATTAGAATTCTGCCATCGTGTGCAGAGACCCGTGATTTTGAGCGTTGAAGATAAGCTGCAAGTCCTGGGTTGCAAGTATGTGCCTAATTCTCCTTTCTTGATTCTCATATGAAACCTGATCTTTGTTTTATAATTTTTTTTAAACGATCTCCTACGTCGTTATTGTGAGAGTTAGACTTCTGTTTCCTTTTCATTCTTTATATAATTTGTTTTCACTTACTTAGTTACTAATGTAAATACCAATCTTAATGTTTTACATATTATTGAATTGATAATTGTATATAATTTTATTAAGAGATTTGGAGAGAAGGATTGCGGAAGTTGAGATGGATTTAACATTAGCAAAGAGTCAAGGGTACCTGAAGAATCAACTGCGACAAAGTGGATCTTCTTCTGACCCTGGTCGTAAGCTCCTTGCGGTTATCGGTGTTTATACAGGATTTGGAAGTCGACTAAGGCGGAATGTATTCAGAGGATCTTGGATGCCAAAAGGTCAGTCAAGTTTTAAATATTATTTGATTGGGGGTGTAAACAACTTAGAACTTCATTTCAGATGACATTTGTATTACCTTATTGTACTTAACTCTTCCAGTGGTCTCAATCTTTTGTTCTCATTTGTTTTGCTTTCCAAATCTAGAATGATCAAGGTTCTATCAAGTATCAGAAAATTTGAGCTAACAAAGCTCCATTTGAGTATTCAGCTCAACTCAAATAAACAGTAATTATAATTCTAATATTCTCTAGGAATAATCCATATATTATACTTAAATTATACAAGATCTAGCATGCCCATGGTACCATATATTATACTTAAATTATTCAGCTCAACTCAA
ATAAACAGTAATTATAATTCTAATATTCTCTAGGAATAATCCATATATTATACTTAAATTATACAAGATCTAGCATGCCCATGGTACCAAATGGTCTGTGACTGGTTTTGTTGATATTGACATTTTTCCTCGGAAATGAAAGACTTCCTCCTAGGGGCCTGTAATAGGTGGGCAGAGAGAATGAGATCCCCTTCCAATTCAACCTGGTTTATTAAACAGGATTAGGGGAGAAAAAAGAGGGGAAGGGCCATGTTTCACATAAAATAAATCTTGAGAGTGATGCTTGATTTCAACAAAAGAAACTTATCATATTTGCGATGCAGAAACTTTTTCTTTTAAAGATATACTAAAGTATTAATAGGGGAATGTTTTCTCTGCTAGAATATTGAATCTTGTAAAATCACTTGTAAATGTTTCAATATTTGGAGAAATATTGACTGGTTTAATTTCTGGAATGTGAGCGTGAACATGTAATTAATGACTTTTGGCATTTATAGGTGATGCATTGAAAAAGTTGGAGGAAAGAGGGGTGGTCATACGCTTTGTTATTGGTCGGAGGTTTTCCTTTCTTCTTTCAACTCTATTTTCATATCAATGTTCTTCACTTCCATTTTGATGACATTGTGTTTTTCATCTCTTTGTTGTTATGTGTCATAGCTATTTTGTGTACTTGTCCTGTCATAACTTTATTGATTTGGCATTGGCAGCGCAAATCGAGGTGATAGCTTAGACCGCAATATTGACAAGGAAAATGATTCAACCAAGGACTTCTTGATACTTGTATGTTATCGAAGTTCTAGCCTTACGTATATCACCTGTTCCAATCACTTAGAATTAGCAAAGACTTTCAACTCAACTTTTTGGTTCATGGTTCTCTTGTGCTTGCATGTTCTCTTGTTTCCATTGAGGACTAGAATCTAGACACTATCGTCCATAGATGGTTTGCCTAGGAGTATATGTGTTGTTATCCTACAGAATATAGAATTATTAATTTATATCACGCCACGAATTGTTTGGCAATTCTCTCAACATATGTTCCCTATATTTTGTAGGAAGGTCATGAAGAAGCTGACGAAGAGTTACCTAAAAAGGCCAAGTTCTTTTTCAGTACAGCAGTTCAAAATTGGGATGCAGAATTTTATGTGAAAGTCGATGACAACATTGACCTTGATCTTGGTTGATCATCTAATCAACAACCTCTATTTCTAAGAAATTTATTTTTCATGGCTTGATGATTAATTGAAACTATTTTTTGGCCAAGTAGAGGGTTTAATTGGGCTTTTTGAACATCGTCGTGGTCAAGATGGTACTTACGTTGGATGCATGAAATCTGGAGATGTGATTGCTGAAGAGTATGAACTCTATTCCATTTTTATTATAAAATGCATGTGGATGTATAAATTCTATTCTCCAACTTCTACTTTTTAAATTGATCGAAGAAATTTTTTTCACCTACTTAAGTTTCATGTTTTTTATAGTAAAGATTCTTCATTCTTTGTTTTCGCAAGGTCAAGATTTTATGGTACTCGTGGTTTTGTTGATTAAACCTCTCCTCTCATCTGTTGAAATGCTATGTTTTTTTTCCCAGAGGAAAGCAGTGGTATGAACCTGAATGGTGGAAGTTTGGAGATGAGAAATCGTATGTGTAATCTAGCTTCAGGATGGAGACATGAAGAAATTTTGTAATTATGTTCTTCAAATATATTGAATACTACCTGGACTGCAGGTGTAAAACGTGACCAATTTTTGCATCCTTTAGGTACTTCCGGCATGCATCTGGTGCGCTTATTATCCTCTCCAAGAATTTGGCTCAGTATATTAATATTAACAGGTAATTTCAATAACTCGATTCCAGAAGATGGTTTACTCAATCAAAGGAATATCCCATGCTCGGAAGTCCAGGATTCCAATCTTAGTTTATTCCTAGACGATTTAATGCTAAATGAATATTCTTTTTTATCTCAAATCATCTTTTTCCGGTAGAATTATCATCAACAAAT
TGACAGCCTTCTTTTAACTTTCTGATTTTCCTTGTGAAAACCAGGAAATAAACTCTCACGTGCAAAATTATACTCCTTTAAGACAAATTAGCCCCATCAATTTGAATGTGGTATTCTGCTATATAATTATAAAAGATTCTTATTATTGAAATTTCTCATGAATTTTTCAGATTAACTCCTAGAATCTATTTCTTGCTTATGTCGTGTATTATAATTATCCTGCATCTGAAAGCTTTATTAAATGTTTGTGTCATTTGCAGTGCATCTTTGAAGACTTATGCACATGACGATATATCAGTGGGGTCATGGATGATTGGTCTCCAGGCTACTCATATAGATGATAATCGACTGTGCTGCAGTAGTGTTAGACAAGGTGAATGTGCCTCCATTCCATTATTGTGGTGGTTGTTTTTTCTCATATTTCTACCACATTGTTTAGAATACTACATTTAAAACTTGATGAGAGCGGATCAAGGTGCTCTGGTGCATATTTTTCTACTCATCATCCACCCAAAATCTTTGCCTTTAGTTTCTGGTCATTTTTCATGACTACCCTACGGGCACATTGGAAAATGGAAAACTCAACTAGAAACGGATATATGATAATAACTTCATTTACTTTTTATAAATTAGTTTTGGACTAAATTTGCAAATTTGTTTCTAATGTTTAAAGCATTTTTTAAAAAATCTGACCTTGTTCTTCTAAAACATTTTTTCTAATGTTTCAGTTCATTTCAATCAAATCCTTCTGTAAAGACGAGGTTAGAGTTACGCGGATATAGAATGAAAAATTGATGAAGAGAATCTCCGTTTTATGTAATTTGCTGCTCACTGCTCAAAAACTCATTTTTATCTTTCTATTAATCTGAAGTTCCTCCTTAGATCTCCTCTTTTTTACTCCCTTCCTCTCCTTGCCTGGGTCTCGCTCGATCCATTATTCTCTGCTTCTCTCCCACTCAAGGATCTTACTCATTCCATAAAAGTCTCTCGTTTACTCAATCTGTTGCTTATCTTTCTTTGGTGAAGTAGCAATTAGGGAAAATAAGAATAGTATGAGTTAAGTGGCTGGTCAAATGCAACATCATACGTCTATCGGTTTGCGTTTGTATTATTCTTTTTTAATGGAAGGGTTGAAATGAAATAATGATACAATAACAGAACAGGATACTAGAAGTTTATAAGAATAGAGACAAAATCATAACACGTTTATAAACCTTGGGCACTAGTTGTTTATGAAGTTTACTTCACCAGTTGGAATTATTAATTGCCCAGTTCACAGCAAGAAGGTAATTTTTGGTTTATGATTATCAACACTTATGCCACTAATGGACATGTTCTTTTTTTCAGACAAGGTGTGTTCCGTGGTTTAATTGATCCAGAATAACCGAGAGCGTTAGTTTATCTGGACATTCAGCTAATTTTGCGGAGCTTAGACAATTCTTTTGCTACAGTTTAATGTGCCTGAGTTTATTTACCCTCCAGCTGAGCTGATGCGGGGGAATCTCGACGTGTTCACTAGATTCCTCTTCTCGGATGCTGAGGTACACAGCAATTTTTTTTTTTTTTTTCTTGGGATAATAGTGTCAGGTTGACTGAACTGAATGCCCTCTTGTTAATATTAGAAAGTAAAAAATTAGTTTGATACGTGTGATGGATATATTGAAATATCACAGTTCTTTTGTCCCTTTTCTTTTCTTACTCATAAACATTTCTTTCTTCTTTCTGAAGAGAAGAGTTCTGTCTTCAAATAAAATGTTATATTGGCATGATGCTACTGTTACGTTTACAATCATTACTTCAATTCATAGGACTGGGTTTGCCATGCGAGATTGAGTGGAATGTGGCCTATATGGAACCAACTAGTTAGTTCGAGTCCAAGAGATGAAGTTTTCTGTTTCAAGTATCAAACTTTTAAGAATGTTATCACTTTATGGAAAACATAAGCTGTTCATAATTTTCTTGTCCGAGTCCAAGAGATCATTGTTTCTTGTTCTTTCTTC
TCTTATGAAAAAAAAAAATGTTCGGCAATCATTTTTGGTTTCTTGTTCTCTACAATTTTTATTGAAATTGTAAAAAACACCTTTGAACTTTGTAATATGTTTAAAAAATATTCTTATACTTTTAAAAGTTGCAATATTATCCTTAAACTTTTATAAGCA

mRNA sequence

CACAAGTTTCGTGACACATCGATGAACTCTTAATGTGACGTTTTCTTTAAAAATTATGAATTTTGAGTGGAAATTTGAACCCAGGACCCGAGAGCGACCTATAAGGGTAGAGAGGAAGCGGTTAGGCAAAGAAAGAAATCATTGAATGGCTGGAGGTAGGAACGGGCCAAGCCCATTTGTGTGATTGTGAGGACCTTCGAAACTGAATCTACTGGCAGCCAGCAGGCGAGATCTCAACGACTCTGTCTTCTTCTTCCTTTTGCATCAAATCCCTTTCTCTGTCTCCATCTCGCTCTTGCTCCTCAAGATTGACGAGATGGAGAGTTTGCCCACCACCGCCAAACCCGAAAGGCGTCCTCGATCCAAGCCTCTTCACGCCTCCAAACCCTCCATTCTTCTGGCATTCTTCTCTTGTCTTGCGTGGCTCTATGTTGCTGGCAGGCTCTGGCAAGATGCAGAAAATAGGAAAATACTTGCTAGTCTTTTACAGAAGAACGCATCCCAGAGACCCGTGATTTTGAGCGTTGAAGATAAGCTGCAAGTCCTGGGTTGCAAAGATTTGGAGAGAAGGATTGCGGAAGTTGAGATGGATTTAACATTAGCAAAGAGTCAAGGGTACCTGAAGAATCAACTGCGACAAAGTGGATCTTCTTCTGACCCTGGTCGTAAGCTCCTTGCGGTTATCGGTGTTTATACAGGATTTGGAAGTCGACTAAGGCGGAATGTATTCAGAGGATCTTGGATGCCAAAAGGTGATGCATTGAAAAAGTTGGAGGAAAGAGGGGTGGTCATACGCTTTGTTATTGGTCGGAGCGCAAATCGAGGTGATAGCTTAGACCGCAATATTGACAAGGAAAATGATTCAACCAAGGACTTCTTGATACTTGAAGGTCATGAAGAAGCTGACGAAGAGTTACCTAAAAAGGCCAAGTTCTTTTTCAGTACAGCAGTTCAAAATTGGGATGCAGAATTTTATGTGAAAGTCGATGACAACATTGACCTTGATCTTGAGGGTTTAATTGGGCTTTTTGAACATCGTCGTGGTCAAGATGGTACTTACGTTGGATGCATGAAATCTGGAGATGTGATTGCTGAAGAAGGAAAGCAGTGGTATGAACCTGAATGGTGGAAGTTTGGAGATGAGAAATCGTACTTCCGGCATGCATCTGGTGCGCTTATTATCCTCTCCAAGAATTTGGCTCAGTATATTAATATTAACAGTGCATCTTTGAAGACTTATGCACATGACGATATATCAGTGGGGTCATGGATGATTGGTCTCCAGGCTACTCATATAGATGATAATCGACTGTGCTGCAGTAGTGTTAGACAAGACAAGGTGTGTTCCGTGGTTTAATTGATCCAGAATAACCGAGAGCGTTAGTTTATCTGGACATTCAGCTAATTTTGCGGAGCTTAGACAATTCTTTTGCTACAGTTTAATGTGCCTGAGTTTATTTACCCTCCAGCTGAGCTGATGCGGGGGAATCTCGACGTGTTCACTAGATTCCTCTTCTCGGATGCTGAGGACTGGGTTTGCCATGCGAGATTGAGTGGAATGTGGCCTATATGGAACCAACTAGTTAGTTCGAGTCCAAGAGATGAAGTTTTCTGTTTCAAGTATCAAACTTTTAAGAATGTTATCACTTTATGGAAAACATAAGCTGTTCATAATTTTCTTGTCCGAGTCCAAGAGATCATTGTTTCTTGTTCTTTCTTCTCTTATGAAAAAAAAAAATGTTCGGCAATCATTTTTGGTTTCTTGTTCTCTACAATTTTTATTGAAATTGTAAAAAACACCTTTGAACTTTGTAATATGTTTAAAAAATATTCTTATACTTTTAAAAGTTGCAATATTATCCTTAAACTTTTATAAGCA

Coding sequence (CDS)

ATGGAGAGTTTGCCCACCACCGCCAAACCCGAAAGGCGTCCTCGATCCAAGCCTCTTCACGCCTCCAAACCCTCCATTCTTCTGGCATTCTTCTCTTGTCTTGCGTGGCTCTATGTTGCTGGCAGGCTCTGGCAAGATGCAGAAAATAGGAAAATACTTGCTAGTCTTTTACAGAAGAACGCATCCCAGAGACCCGTGATTTTGAGCGTTGAAGATAAGCTGCAAGTCCTGGGTTGCAAAGATTTGGAGAGAAGGATTGCGGAAGTTGAGATGGATTTAACATTAGCAAAGAGTCAAGGGTACCTGAAGAATCAACTGCGACAAAGTGGATCTTCTTCTGACCCTGGTCGTAAGCTCCTTGCGGTTATCGGTGTTTATACAGGATTTGGAAGTCGACTAAGGCGGAATGTATTCAGAGGATCTTGGATGCCAAAAGGTGATGCATTGAAAAAGTTGGAGGAAAGAGGGGTGGTCATACGCTTTGTTATTGGTCGGAGCGCAAATCGAGGTGATAGCTTAGACCGCAATATTGACAAGGAAAATGATTCAACCAAGGACTTCTTGATACTTGAAGGTCATGAAGAAGCTGACGAAGAGTTACCTAAAAAGGCCAAGTTCTTTTTCAGTACAGCAGTTCAAAATTGGGATGCAGAATTTTATGTGAAAGTCGATGACAACATTGACCTTGATCTTGAGGGTTTAATTGGGCTTTTTGAACATCGTCGTGGTCAAGATGGTACTTACGTTGGATGCATGAAATCTGGAGATGTGATTGCTGAAGAAGGAAAGCAGTGGTATGAACCTGAATGGTGGAAGTTTGGAGATGAGAAATCGTACTTCCGGCATGCATCTGGTGCGCTTATTATCCTCTCCAAGAATTTGGCTCAGTATATTAATATTAACAGTGCATCTTTGAAGACTTATGCACATGACGATATATCAGTGGGGTCATGGATGATTGGTCTCCAGGCTACTCATATAGATGATAATCGACTGTGCTGCAGTAGTGTTAGACAAGACAAGGTGTGTTCCGTGGTTTAA

Protein sequence

MESLPTTAKPERRPRSKPLHASKPSILLAFFSCLAWLYVAGRLWQDAENRKILASLLQKNASQRPVILSVEDKLQVLGCKDLERRIAEVEMDLTLAKSQGYLKNQLRQSGSSSDPGRKLLAVIGVYTGFGSRLRRNVFRGSWMPKGDALKKLEERGVVIRFVIGRSANRGDSLDRNIDKENDSTKDFLILEGHEEADEELPKKAKFFFSTAVQNWDAEFYVKVDDNIDLDLEGLIGLFEHRRGQDGTYVGCMKSGDVIAEEGKQWYEPEWWKFGDEKSYFRHASGALIILSKNLAQYININSASLKTYAHDDISVGSWMIGLQATHIDDNRLCCSSVRQDKVCSVV
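The protein sequence is the translation of the coding sequence under the standard genetic code. As a minimal sketch (the function name `translate` is hypothetical, and the inputs are only the first 36 and last 18 nucleotides of the CDS above, not the full sequence), the correspondence can be checked like this:

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame DNA sequence; stop codons are shown as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# Fragments copied from the CDS in this record (start and end only).
cds_start = "ATGGAGAGTTTGCCCACCACCGCCAAACCCGAAAGG"
cds_end = "GTGTGTTCCGTGGTTTAA"

print(translate(cds_start))  # first 12 residues
print(translate(cds_end))    # last 5 residues followed by the stop codon
```

The two fragments translate to `MESLPTTAKPER` and `VCSVV*`, matching the first and last residues of the protein sequence listed above.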
BLAST of Bhi04G000009 vs. Swiss-Prot
Match: sp|Q5XEZ1|B3GT9_ARATH (Hydroxyproline O-galactosyltransferase HPGT3 OS=Arabidopsis thaliana OX=3702 GN=HPGT3 PE=2 SV=1)

HSP 1 Score: 544.7 bits (1402), Expect = 7.6e-154
Identity = 268/349 (76.79%), Positives = 306/349 (87.68%), Query Frame = 0

Query: 1   MESLPTT--AKPERRPRSKPL--HASKPSILLAFFSCLAWLYVAGRLWQDAENRKILASL 60
           MESLPTT  +K ERR RS      +SKPS+++AFFSC+AWLYVAGRLWQDAENR +L ++
Sbjct: 1   MESLPTTVPSKSERRARSSKFSQSSSKPSVIMAFFSCVAWLYVAGRLWQDAENRVVLNNI 60

Query: 61  LQKNASQRPVILSVEDKLQVLGCKDLERRIAEVEMDLTLAKSQGYLKNQLRQSGSSSDPG 120
           L+K+  Q+P +L+V+DKL VLGCKDLERRI E EM+LTLAKSQGYLKN   +SGSSS  G
Sbjct: 61  LKKSYDQKPKVLTVDDKLMVLGCKDLERRIVETEMELTLAKSQGYLKN--LKSGSSS--G 120

Query: 121 RKLLAVIGVYTGFGSRLRRNVFRGSWMPKGDALKKLEERGVVIRFVIGRSANRGDSLDRN 180
           +KLLAVIGVY+GFGS LRRN FRGS+MP+GDAL+KLEERG+VIRFVIGRS NRGDSLDR 
Sbjct: 121 KKLLAVIGVYSGFGSHLRRNTFRGSYMPQGDALRKLEERGIVIRFVIGRSPNRGDSLDRK 180

Query: 181 IDKENDSTKDFLILEGHEEADEELPKKAKFFFSTAVQNWDAEFYVKVDDNIDLDLEGLIG 240
           ID+EN + KDFLILE HEEA EEL KK KFFFS AVQNWDAEFY+KVDDNIDLDLEGLIG
Sbjct: 181 IDEENQARKDFLILENHEEAQEELAKKVKFFFSAAVQNWDAEFYIKVDDNIDLDLEGLIG 240

Query: 241 LFEHRRGQDGTYVGCMKSGDVIAEEGKQWYEPEWWKFGDEKSYFRHASGALIILSKNLAQ 300
           L E RRGQD  Y+GCMKSG+V+AEEG +WYEPEWWKFGDEKSYFRHA+G+L+ILSK LAQ
Sbjct: 241 LLESRRGQDAAYIGCMKSGEVVAEEGGKWYEPEWWKFGDEKSYFRHAAGSLLILSKTLAQ 300

Query: 301 YININSASLKTYAHDDISVGSWMIGLQATHIDDNRLCCSSVRQDKVCSV 346
           Y+NINS SLKTYA DD S+GSWMIG+QAT+IDDNRLCCSS+RQDKVCSV
Sbjct: 301 YVNINSGSLKTYAFDDTSIGSWMIGVQATYIDDNRLCCSSIRQDKVCSV 345
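The percentages on each HSP summary line are the identical (or positively scoring) positions divided by the alignment length, gaps included. A small sketch for recomputing them from a summary line follows; the helper name `hsp_percentages` is hypothetical, and the pattern accepts both the "Positives" spelling and the "Postives" variant some report generators emit.

```python
import re

# Matches e.g. "Identity = 268/349 (76.79%), Positives = 306/349 (87.68%)".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = (int(g) for g in m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

line = "Identity = 268/349 (76.79%), Positives = 306/349 (87.68%), Query Frame = 0"
print(hsp_percentages(line))
```

For the HPGT3 hit this reproduces the reported 76.79% identity and 87.68% positives over the 349 aligned columns.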

BLAST of Bhi04G000009 vs. Swiss-Prot
Match: sp|Q94A05|B3GTA_ARATH (Hydroxyproline O-galactosyltransferase HPGT2 OS=Arabidopsis thaliana OX=3702 GN=HPGT2 PE=1 SV=1)

HSP 1 Score: 535.4 bits (1378), Expect = 4.6e-151
Identity = 264/348 (75.86%), Positives = 296/348 (85.06%), Query Frame = 0

Query: 1   MESLPTT--AKPERRPR-SKPLHASKPSILLAFFSCLAWLYVAGRLWQDAENRKILASLL 60
           MESLPTT   K +RR R SK  + SKPS++LAFFSCLAWLYVAGRLWQDA+ R  L ++L
Sbjct: 1   MESLPTTVSGKSDRRGRFSKSQNTSKPSLILAFFSCLAWLYVAGRLWQDAQYRAALNTVL 60

Query: 61  QKNASQRPVILSVEDKLQVLGCKDLERRIAEVEMDLTLAKSQGYLKNQLRQSGSSSDPGR 120
           + N  QRP +L+VEDKL VLGCKDLERRI E EM+L  AKSQGYLK Q     S S  G+
Sbjct: 61  KMNYDQRPKVLTVEDKLVVLGCKDLERRIVETEMELAQAKSQGYLKKQ----KSVSSSGK 120

Query: 121 KLLAVIGVYTGFGSRLRRNVFRGSWMPKGDALKKLEERGVVIRFVIGRSANRGDSLDRNI 180
           K+LAVIGVYTGFGS L+RN FRGSWMP+ DALKKLEERGVVIRFVIGRSANRGDSLDR I
Sbjct: 121 KMLAVIGVYTGFGSHLKRNKFRGSWMPRDDALKKLEERGVVIRFVIGRSANRGDSLDRKI 180

Query: 181 DKENDSTKDFLILEGHEEADEELPKKAKFFFSTAVQNWDAEFYVKVDDNIDLDLEGLIGL 240
           D+EN +TKDFLILE HEEA EELPKK KFF+S AVQNWDAEFYVKVDDN+DLDLEG+I L
Sbjct: 181 DEENRATKDFLILENHEEAQEELPKKVKFFYSAAVQNWDAEFYVKVDDNVDLDLEGMIAL 240

Query: 241 FEHRRGQDGTYVGCMKSGDVIAEEGKQWYEPEWWKFGDEKSYFRHASGALIILSKNLAQY 300
            E RR QDG Y+GCMKSGDVI EEG QWYEPEWWKFGD+KSYFRHA+G+L+ILSKNLAQY
Sbjct: 241 LESRRSQDGAYIGCMKSGDVITEEGSQWYEPEWWKFGDDKSYFRHATGSLVILSKNLAQY 300

Query: 301 ININSASLKTYAHDDISVGSWMIGLQATHIDDNRLCCSSVRQDKVCSV 346
           +NINS  LKTYA DD ++GSWMIG+QAT+IDDNRLCCSS RQ+KVCS+
Sbjct: 301 VNINSGLLKTYAFDDTTIGSWMIGVQATYIDDNRLCCSSTRQEKVCSM 344

BLAST of Bhi04G000009 vs. Swiss-Prot
Match: sp|Q94F27|B3GTB_ARATH (Hydroxyproline O-galactosyltransferase HPGT1 OS=Arabidopsis thaliana OX=3702 GN=HPTG1 PE=1 SV=1)

HSP 1 Score: 299.7 bits (766), Expect = 4.3e-80
Identity = 144/337 (42.73%), Positives = 221/337 (65.58%), Query Frame = 0

Query: 12  RRPRSKPLHASKPSILLAF-FSCLAWLYVAGRLWQDAENRKILASLLQKNASQRPVILSV 71
           R+  S  L +S+ S LL F F+  A  YVAGRLWQ+++ R  L + L +   Q    +SV
Sbjct: 3   RKGSSIRLSSSRISTLLLFMFATFASFYVAGRLWQESQTRVHLINELDRVTGQGKSAISV 62

Query: 72  EDKLQVLGCKDLERRIAEVEMDLTLAKSQGYLKNQLRQSGSSSDPGRKLLAVIGVYTGFG 131
           +D L+++ C++ ++ +A +EM+L+ A+ +G++    + +   ++  ++ L VIG+ T  G
Sbjct: 63  DDTLKIIACREQKKTLAALEMELSSARQEGFVSKSPKLA-DGTETKKRPLVVIGIMTSLG 122

Query: 132 SRLRRNVFRGSWMPKGDALKKLE-ERGVVIRFVIGRSANRGDSLDRNIDKENDSTKDFLI 191
           ++ +R+  R +WM  G +LKKLE E+GV+ RFVIGRSAN+GDS+D++ID EN  T DF+I
Sbjct: 123 NKKKRDAVRQAWMGTGASLKKLESEKGVIARFVIGRSANKGDSMDKSIDTENSQTDDFII 182

Query: 192 LEGHEEADEELPKKAKFFFSTAVQNWDAEFYVKVDDNIDLDLEGLIGLFEHRRGQDGTYV 251
           L+   EA EE  KK K FF+ A   WDA+FY K  DNI ++++ L             Y+
Sbjct: 183 LDDVVEAPEEASKKVKLFFAYAADRWDAQFYAKAIDNIYVNIDALGTTLAAHLENPRAYI 242

Query: 252 GCMKSGDVIAEEGKQWYEPEWWKFGDEKSYFRHASGALIILSKNLAQYININSASLKTYA 311
           GCMKSG+V +E   +WYEPEWWKFGD+K+YFRHA G + +++  LA++++IN   L +YA
Sbjct: 243 GCMKSGEVFSEPNHKWYEPEWWKFGDKKAYFRHAYGEMYVITHALARFVSINRDILHSYA 302

Query: 312 HDDISVGSWMIGLQATHIDDNRLCCSSVRQDKVCSVV 347
           HDD+S GSW +GL   H+D+ + CCS+   + +C+ V
Sbjct: 303 HDDVSTGSWFVGLDVKHVDEGKFCCSAWSSEAICAGV 338

BLAST of Bhi04G000009 vs. Swiss-Prot
Match: sp|Q9ZV71|B3GT3_ARATH (Probable beta-1,3-galactosyltransferase 3 OS=Arabidopsis thaliana OX=3702 GN=B3GALT3 PE=2 SV=1)

HSP 1 Score: 236.1 bits (601), Expect = 5.8e-61
Identity = 145/360 (40.28%), Positives = 208/360 (57.78%), Query Frame = 0

Query: 4   LPTTAKPERRPRSKPLHASKPSILLAFFSCLAWLYVAGRLW--------------QDAEN 63
           + T  K E  P S+ L + K + LL F S    +    R+W               +AE 
Sbjct: 1   MSTKIKGELFP-SRSLVSKKWTFLLCFGSFCFGILFTDRMWIIPESKDMPRPSVSTEAER 60

Query: 64  RKILA------SLLQKNASQRPVILSVEDKLQVLGCKDLERRIAEVEMDLTLAKS-QGYL 123
            K+++      +L QK  ++ P  L  E        + L++ I+ +EM+L  A+S Q  L
Sbjct: 61  LKLISEGCDPKTLYQKEVNRDPQALFGEVSKTHNAIQTLDKTISSLEMELAAARSAQESL 120

Query: 124 KNQLRQSGSSSD---PG-RKLLAVIGVYTGFGSRLRRNVFRGSWMPKGDALKKL-EERGV 183
            N    S        PG R+ L V+G+ T F SR RR+  R +WMP G+  KKL EE+G+
Sbjct: 121 VNGAPISNDMEKKQLPGKRRYLMVVGINTAFSSRKRRDSVRTTWMPSGEKRKKLEEEKGI 180

Query: 184 VIRFVIGRSANRGDSLDRNIDKENDSTKDFLILEGHEEADEELPKKAKFFFSTAVQNWDA 243
           +IRFVIG SA  G  LDR+I+ E+    DFL L+ H E   EL  K K +FSTAV  WDA
Sbjct: 181 IIRFVIGHSATAGGILDRSIEAEDKKHGDFLRLD-HVEGYLELSGKTKTYFSTAVSKWDA 240

Query: 244 EFYVKVDDNIDLDLEGLIGLFEHRRGQDGTYVGCMKSGDVIAEEGKQWYEPEWWKFGDE- 303
           EFYVKVDD++ +++  L       R +   Y+GCMKSG V++++G +++EPE+WKFG+  
Sbjct: 241 EFYVKVDDDVHVNIATLGETLVRHRKKHRVYLGCMKSGPVLSQKGVRYHEPEYWKFGENG 300

Query: 304 KSYFRHASGALIILSKNLAQYININSASLKTYAHDDISVGSWMIGLQATHIDDNRLCCSS 337
             YFRHA+G L  +S++LA YI++N   L  YA++D+++G+W IGL  THIDD RLCC +
Sbjct: 301 NKYFRHATGQLYAISRDLASYISLNQHVLHKYANEDVTLGAWFIGLDVTHIDDRRLCCGT 358

BLAST of Bhi04G000009 vs. Swiss-Prot
Match: sp|A8MRC7|B3GT2_ARATH (Probable beta-1,3-galactosyltransferase 2 OS=Arabidopsis thaliana OX=3702 GN=B3GALT2 PE=2 SV=1)

HSP 1 Score: 232.6 bits (592), Expect = 6.4e-60
Identity = 136/359 (37.88%), Positives = 205/359 (57.10%), Query Frame = 0

Query: 7   TAKPERRPRSKPLHASKPSILLAFFSCLAWLYVAGRLWQ-------------DAENRKIL 66
           +AK +    S+   + K +ILL   S    ++   R+W              +AE  K++
Sbjct: 2   SAKIKGEYSSRSFVSRKWTILLCLGSFCVGMFFTNRMWNIPESKGMSHPSVTEAERLKLV 61

Query: 67  A------SLLQKNASQRPVILSVEDKLQVLGCKDLERRIAEVEMDLTLAKSQGYLKNQLR 126
           +      +L QK   + P  L  E     +  + L++ I+ +EM+L  A+S   ++  L+
Sbjct: 62  SEGCNPKALYQKEVKRDPQALFGEVANTHIALQTLDKTISSLEMELAAARS---VQESLQ 121

Query: 127 QSGSSSD--------PGRKLLAVIGVYTGFGSRLRRNVFRGSWMPKGDALKKL-EERGVV 186
                SD          R+ L V+G+ T F SR RR+  R +WMP+G+  K+L EE+G++
Sbjct: 122 NGAPLSDDMGKKQPQEQRRFLMVVGINTAFSSRKRRDSIRATWMPQGEKRKRLEEEKGII 181

Query: 187 IRFVIGRSANRGDSLDRNIDKENDSTKDFLILEGHEEADEELPKKAKFFFSTAVQNWDAE 246
           IRFVIG SA  G  LDR I+ E+    DFL L+ H E   EL  K K +FSTA   WDA+
Sbjct: 182 IRFVIGHSATTGGILDRAIEAEDRKHGDFLRLD-HVEGYLELSGKTKTYFSTAFSMWDAD 241

Query: 247 FYVKVDDNIDLDLEGLIGLFEHRRGQDGTYVGCMKSGDVIAEEGKQWYEPEWWKFGDE-K 306
           FYVKVDD++ +++  L       R +   Y+GCMKSG V++++G +++EPE+WKFG+   
Sbjct: 242 FYVKVDDDVHVNIATLGETLVRHRKKPRVYIGCMKSGPVLSQKGVRYHEPEYWKFGENGN 301

Query: 307 SYFRHASGALIILSKNLAQYININSASLKTYAHDDISVGSWMIGLQATHIDDNRLCCSS 337
            YFRHA+G L  +S++LA YI+IN   L  YA++D+S+G+W IG+   HIDD RLCC +
Sbjct: 302 KYFRHATGQLYAISRDLASYISINQHVLHKYANEDVSLGAWFIGIDVKHIDDRRLCCGT 356

BLAST of Bhi04G000009 vs. TAIR10
Match: AT2G25300.1 (Galactosyltransferase family protein)

HSP 1 Score: 544.7 bits (1402), Expect = 4.2e-155
Identity = 268/349 (76.79%), Positives = 306/349 (87.68%), Query Frame = 0

Query: 1   MESLPTT--AKPERRPRSKPL--HASKPSILLAFFSCLAWLYVAGRLWQDAENRKILASL 60
           MESLPTT  +K ERR RS      +SKPS+++AFFSC+AWLYVAGRLWQDAENR +L ++
Sbjct: 1   MESLPTTVPSKSERRARSSKFSQSSSKPSVIMAFFSCVAWLYVAGRLWQDAENRVVLNNI 60

Query: 61  LQKNASQRPVILSVEDKLQVLGCKDLERRIAEVEMDLTLAKSQGYLKNQLRQSGSSSDPG 120
           L+K+  Q+P +L+V+DKL VLGCKDLERRI E EM+LTLAKSQGYLKN   +SGSSS  G
Sbjct: 61  LKKSYDQKPKVLTVDDKLMVLGCKDLERRIVETEMELTLAKSQGYLKN--LKSGSSS--G 120

Query: 121 RKLLAVIGVYTGFGSRLRRNVFRGSWMPKGDALKKLEERGVVIRFVIGRSANRGDSLDRN 180
           +KLLAVIGVY+GFGS LRRN FRGS+MP+GDAL+KLEERG+VIRFVIGRS NRGDSLDR 
Sbjct: 121 KKLLAVIGVYSGFGSHLRRNTFRGSYMPQGDALRKLEERGIVIRFVIGRSPNRGDSLDRK 180

Query: 181 IDKENDSTKDFLILEGHEEADEELPKKAKFFFSTAVQNWDAEFYVKVDDNIDLDLEGLIG 240
           ID+EN + KDFLILE HEEA EEL KK KFFFS AVQNWDAEFY+KVDDNIDLDLEGLIG
Sbjct: 181 IDEENQARKDFLILENHEEAQEELAKKVKFFFSAAVQNWDAEFYIKVDDNIDLDLEGLIG 240

Query: 241 LFEHRRGQDGTYVGCMKSGDVIAEEGKQWYEPEWWKFGDEKSYFRHASGALIILSKNLAQ 300
           L E RRGQD  Y+GCMKSG+V+AEEG +WYEPEWWKFGDEKSYFRHA+G+L+ILSK LAQ
Sbjct: 241 LLESRRGQDAAYIGCMKSGEVVAEEGGKWYEPEWWKFGDEKSYFRHAAGSLLILSKTLAQ 300

Query: 301 YININSASLKTYAHDDISVGSWMIGLQATHIDDNRLCCSSVRQDKVCSV 346
           Y+NINS SLKTYA DD S+GSWMIG+QAT+IDDNRLCCSS+RQDKVCSV
Sbjct: 301 YVNINSGSLKTYAFDDTSIGSWMIGVQATYIDDNRLCCSSIRQDKVCSV 345
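This alignment has the same bit score (544.7) as the Swiss-Prot hit to the same Arabidopsis protein, but a different E-value. That is expected if the E-value follows the usual Karlin-Altschul form, E = m·n·2^(−S'), where S' is the bit score and m·n is the effective search space, because the two databases differ in size. The sketch below is a back-of-the-envelope check under that assumption; the function name `search_space` is hypothetical, the inputs are the rounded values printed in this record, and edge-effect corrections are ignored.

```python
import math

def search_space(evalue, bit_score):
    """Effective search space m*n implied by E = m*n * 2**(-bit_score)."""
    return evalue * 2.0 ** bit_score

# Same HSP (bit score 544.7) reported against two databases.
swissprot = search_space(7.6e-154, 544.7)
tair10 = search_space(4.2e-155, 544.7)

# With equal bit scores the ratio of search spaces equals the ratio of E-values.
ratio = swissprot / tair10
print(f"Swiss-Prot search space is about {ratio:.1f}x that of TAIR10")
```

Read this way, the Swiss-Prot search was roughly 18 times larger than the TAIR10 search, which is why the same alignment is reported with a less extreme E-value there.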

BLAST of Bhi04G000009 vs. TAIR10
Match: AT4G32120.1 (Galactosyltransferase family protein)

HSP 1 Score: 535.4 bits (1378), Expect = 2.6e-152
Identity = 264/348 (75.86%), Positives = 296/348 (85.06%), Query Frame = 0

Query: 1   MESLPTT--AKPERRPR-SKPLHASKPSILLAFFSCLAWLYVAGRLWQDAENRKILASLL 60
           MESLPTT   K +RR R SK  + SKPS++LAFFSCLAWLYVAGRLWQDA+ R  L ++L
Sbjct: 1   MESLPTTVSGKSDRRGRFSKSQNTSKPSLILAFFSCLAWLYVAGRLWQDAQYRAALNTVL 60

Query: 61  QKNASQRPVILSVEDKLQVLGCKDLERRIAEVEMDLTLAKSQGYLKNQLRQSGSSSDPGR 120
           + N  QRP +L+VEDKL VLGCKDLERRI E EM+L  AKSQGYLK Q     S S  G+
Sbjct: 61  KMNYDQRPKVLTVEDKLVVLGCKDLERRIVETEMELAQAKSQGYLKKQ----KSVSSSGK 120

Query: 121 KLLAVIGVYTGFGSRLRRNVFRGSWMPKGDALKKLEERGVVIRFVIGRSANRGDSLDRNI 180
           K+LAVIGVYTGFGS L+RN FRGSWMP+ DALKKLEERGVVIRFVIGRSANRGDSLDR I
Sbjct: 121 KMLAVIGVYTGFGSHLKRNKFRGSWMPRDDALKKLEERGVVIRFVIGRSANRGDSLDRKI 180

Query: 181 DKENDSTKDFLILEGHEEADEELPKKAKFFFSTAVQNWDAEFYVKVDDNIDLDLEGLIGL 240
           D+EN +TKDFLILE HEEA EELPKK KFF+S AVQNWDAEFYVKVDDN+DLDLEG+I L
Sbjct: 181 DEENRATKDFLILENHEEAQEELPKKVKFFYSAAVQNWDAEFYVKVDDNVDLDLEGMIAL 240

Query: 241 FEHRRGQDGTYVGCMKSGDVIAEEGKQWYEPEWWKFGDEKSYFRHASGALIILSKNLAQY 300
            E RR QDG Y+GCMKSGDVI EEG QWYEPEWWKFGD+KSYFRHA+G+L+ILSKNLAQY
Sbjct: 241 LESRRSQDGAYIGCMKSGDVITEEGSQWYEPEWWKFGDDKSYFRHATGSLVILSKNLAQY 300

Query: 301 ININSASLKTYAHDDISVGSWMIGLQATHIDDNRLCCSSVRQDKVCSV 346
           +NINS  LKTYA DD ++GSWMIG+QAT+IDDNRLCCSS RQ+KVCS+
Sbjct: 301 VNINSGLLKTYAFDDTTIGSWMIGVQATYIDDNRLCCSSTRQEKVCSM 344

BLAST of Bhi04G000009 vs. TAIR10
Match: AT5G53340.1 (Galactosyltransferase family protein)

HSP 1 Score: 299.7 bits (766), Expect = 2.4e-81
Identity = 144/337 (42.73%), Positives = 221/337 (65.58%), Query Frame = 0

Query: 12  RRPRSKPLHASKPSILLAF-FSCLAWLYVAGRLWQDAENRKILASLLQKNASQRPVILSV 71
           R+  S  L +S+ S LL F F+  A  YVAGRLWQ+++ R  L + L +   Q    +SV
Sbjct: 3   RKGSSIRLSSSRISTLLLFMFATFASFYVAGRLWQESQTRVHLINELDRVTGQGKSAISV 62

Query: 72  EDKLQVLGCKDLERRIAEVEMDLTLAKSQGYLKNQLRQSGSSSDPGRKLLAVIGVYTGFG 131
           +D L+++ C++ ++ +A +EM+L+ A+ +G++    + +   ++  ++ L VIG+ T  G
Sbjct: 63  DDTLKIIACREQKKTLAALEMELSSARQEGFVSKSPKLA-DGTETKKRPLVVIGIMTSLG 122

Query: 132 SRLRRNVFRGSWMPKGDALKKLE-ERGVVIRFVIGRSANRGDSLDRNIDKENDSTKDFLI 191
           ++ +R+  R +WM  G +LKKLE E+GV+ RFVIGRSAN+GDS+D++ID EN  T DF+I
Sbjct: 123 NKKKRDAVRQAWMGTGASLKKLESEKGVIARFVIGRSANKGDSMDKSIDTENSQTDDFII 182

Query: 192 LEGHEEADEELPKKAKFFFSTAVQNWDAEFYVKVDDNIDLDLEGLIGLFEHRRGQDGTYV 251
           L+   EA EE  KK K FF+ A   WDA+FY K  DNI ++++ L             Y+
Sbjct: 183 LDDVVEAPEEASKKVKLFFAYAADRWDAQFYAKAIDNIYVNIDALGTTLAAHLENPRAYI 242

Query: 252 GCMKSGDVIAEEGKQWYEPEWWKFGDEKSYFRHASGALIILSKNLAQYININSASLKTYA 311
           GCMKSG+V +E   +WYEPEWWKFGD+K+YFRHA G + +++  LA++++IN   L +YA
Sbjct: 243 GCMKSGEVFSEPNHKWYEPEWWKFGDKKAYFRHAYGEMYVITHALARFVSINRDILHSYA 302

Query: 312 HDDISVGSWMIGLQATHIDDNRLCCSSVRQDKVCSVV 347
           HDD+S GSW +GL   H+D+ + CCS+   + +C+ V
Sbjct: 303 HDDVSTGSWFVGLDVKHVDEGKFCCSAWSSEAICAGV 338

BLAST of Bhi04G000009 vs. TAIR10
Match: AT2G32430.1 (Galactosyltransferase family protein)

HSP 1 Score: 236.1 bits (601), Expect = 3.2e-62
Identity = 145/360 (40.28%), Positives = 208/360 (57.78%), Query Frame = 0

Query: 4   LPTTAKPERRPRSKPLHASKPSILLAFFSCLAWLYVAGRLW--------------QDAEN 63
           + T  K E  P S+ L + K + LL F S    +    R+W               +AE 
Sbjct: 1   MSTKIKGELFP-SRSLVSKKWTFLLCFGSFCFGILFTDRMWIIPESKDMPRPSVSTEAER 60

Query: 64  RKILA------SLLQKNASQRPVILSVEDKLQVLGCKDLERRIAEVEMDLTLAKS-QGYL 123
            K+++      +L QK  ++ P  L  E        + L++ I+ +EM+L  A+S Q  L
Sbjct: 61  LKLISEGCDPKTLYQKEVNRDPQALFGEVSKTHNAIQTLDKTISSLEMELAAARSAQESL 120

Query: 124 KNQLRQSGSSSD---PG-RKLLAVIGVYTGFGSRLRRNVFRGSWMPKGDALKKL-EERGV 183
            N    S        PG R+ L V+G+ T F SR RR+  R +WMP G+  KKL EE+G+
Sbjct: 121 VNGAPISNDMEKKQLPGKRRYLMVVGINTAFSSRKRRDSVRTTWMPSGEKRKKLEEEKGI 180

Query: 184 VIRFVIGRSANRGDSLDRNIDKENDSTKDFLILEGHEEADEELPKKAKFFFSTAVQNWDA 243
           +IRFVIG SA  G  LDR+I+ E+    DFL L+ H E   EL  K K +FSTAV  WDA
Sbjct: 181 IIRFVIGHSATAGGILDRSIEAEDKKHGDFLRLD-HVEGYLELSGKTKTYFSTAVSKWDA 240

Query: 244 EFYVKVDDNIDLDLEGLIGLFEHRRGQDGTYVGCMKSGDVIAEEGKQWYEPEWWKFGDE- 303
           EFYVKVDD++ +++  L       R +   Y+GCMKSG V++++G +++EPE+WKFG+  
Sbjct: 241 EFYVKVDDDVHVNIATLGETLVRHRKKHRVYLGCMKSGPVLSQKGVRYHEPEYWKFGENG 300

Query: 304 KSYFRHASGALIILSKNLAQYININSASLKTYAHDDISVGSWMIGLQATHIDDNRLCCSS 337
             YFRHA+G L  +S++LA YI++N   L  YA++D+++G+W IGL  THIDD RLCC +
Sbjct: 301 NKYFRHATGQLYAISRDLASYISLNQHVLHKYANEDVTLGAWFIGLDVTHIDDRRLCCGT 358

BLAST of Bhi04G000009 vs. TAIR10
Match: AT1G05170.2 (Galactosyltransferase family protein)

HSP 1 Score: 232.6 bits (592), Expect = 3.6e-61
Identity = 136/359 (37.88%), Positives = 205/359 (57.10%), Query Frame = 0

Query: 7   TAKPERRPRSKPLHASKPSILLAFFSCLAWLYVAGRLWQ-------------DAENRKIL 66
           +AK +    S+   + K +ILL   S    ++   R+W              +AE  K++
Sbjct: 2   SAKIKGEYSSRSFVSRKWTILLCLGSFCVGMFFTNRMWNIPESKGMSHPSVTEAERLKLV 61

Query: 67  A------SLLQKNASQRPVILSVEDKLQVLGCKDLERRIAEVEMDLTLAKSQGYLKNQLR 126
           +      +L QK   + P  L  E     +  + L++ I+ +EM+L  A+S   ++  L+
Sbjct: 62  SEGCNPKALYQKEVKRDPQALFGEVANTHIALQTLDKTISSLEMELAAARS---VQESLQ 121

Query: 127 QSGSSSD--------PGRKLLAVIGVYTGFGSRLRRNVFRGSWMPKGDALKKL-EERGVV 186
                SD          R+ L V+G+ T F SR RR+  R +WMP+G+  K+L EE+G++
Sbjct: 122 NGAPLSDDMGKKQPQEQRRFLMVVGINTAFSSRKRRDSIRATWMPQGEKRKRLEEEKGII 181

Query: 187 IRFVIGRSANRGDSLDRNIDKENDSTKDFLILEGHEEADEELPKKAKFFFSTAVQNWDAE 246
           IRFVIG SA  G  LDR I+ E+    DFL L+ H E   EL  K K +FSTA   WDA+
Sbjct: 182 IRFVIGHSATTGGILDRAIEAEDRKHGDFLRLD-HVEGYLELSGKTKTYFSTAFSMWDAD 241

Query: 247 FYVKVDDNIDLDLEGLIGLFEHRRGQDGTYVGCMKSGDVIAEEGKQWYEPEWWKFGDE-K 306
           FYVKVDD++ +++  L       R +   Y+GCMKSG V++++G +++EPE+WKFG+   
Sbjct: 242 FYVKVDDDVHVNIATLGETLVRHRKKPRVYIGCMKSGPVLSQKGVRYHEPEYWKFGENGN 301

Query: 307 SYFRHASGALIILSKNLAQYININSASLKTYAHDDISVGSWMIGLQATHIDDNRLCCSS 337
            YFRHA+G L  +S++LA YI+IN   L  YA++D+S+G+W IG+   HIDD RLCC +
Sbjct: 302 KYFRHATGQLYAISRDLASYISINQHVLHKYANEDVSLGAWFIGIDVKHIDDRRLCCGT 356

BLAST of Bhi04G000009 vs. TrEMBL
Match: tr|A0A1S3BCY2|A0A1S3BCY2_CUCME (Hexosyltransferase OS=Cucumis melo OX=3656 GN=LOC103488325 PE=3 SV=1)

HSP 1 Score: 679.9 bits (1753), Expect = 3.1e-192
Identity = 334/346 (96.53%), Positives = 342/346 (98.84%), Query Frame = 0

Query: 1   MESLPTTAKPERRPRSKPLHASKPSILLAFFSCLAWLYVAGRLWQDAENRKILASLLQKN 60
           MESLPTT+KPERRPRSKPLHASKPSILLAF SCLAWLYVAGRLWQDAENRK+L++LLQKN
Sbjct: 1   MESLPTTSKPERRPRSKPLHASKPSILLAFLSCLAWLYVAGRLWQDAENRKLLSTLLQKN 60

Query: 61  ASQRPVILSVEDKLQVLGCKDLERRIAEVEMDLTLAKSQGYLKNQLRQSGSSSDPGRKLL 120
           ASQRPVILSVEDKLQVLGCKDLERRI EVEMDLTLAKSQGYLKNQLRQSGSSS+PGRKLL
Sbjct: 61  ASQRPVILSVEDKLQVLGCKDLERRIVEVEMDLTLAKSQGYLKNQLRQSGSSSNPGRKLL 120

Query: 121 AVIGVYTGFGSRLRRNVFRGSWMPKGDALKKLEERGVVIRFVIGRSANRGDSLDRNIDKE 180
           AVIGVYTGFGSRLRRNVFRGSWMPKGDALKKLEERGV+IRFVIGRSANRGDSLDRNIDKE
Sbjct: 121 AVIGVYTGFGSRLRRNVFRGSWMPKGDALKKLEERGVIIRFVIGRSANRGDSLDRNIDKE 180

Query: 181 NDSTKDFLILEGHEEADEELPKKAKFFFSTAVQNWDAEFYVKVDDNIDLDLEGLIGLFEH 240
           N STKDFLILEGHEEADEELPKKAKFFFSTAVQNWDAEFYVKVDD+IDLDLEGLIGL EH
Sbjct: 181 NHSTKDFLILEGHEEADEELPKKAKFFFSTAVQNWDAEFYVKVDDHIDLDLEGLIGLLEH 240

Query: 241 RRGQDGTYVGCMKSGDVIAEEGKQWYEPEWWKFGDEKSYFRHASGALIILSKNLAQYINI 300
           RRGQDGTYVGCMKSGDVIAEEGKQWYEPEWWKFGDEKSYFRHASGALIILSKNLAQYINI
Sbjct: 241 RRGQDGTYVGCMKSGDVIAEEGKQWYEPEWWKFGDEKSYFRHASGALIILSKNLAQYINI 300

Query: 301 NSASLKTYAHDDISVGSWMIGLQATHIDDNRLCCSSVRQDKVCSVV 347
           NSASLKTYAHDDISVGSWMIGLQATHIDDNRLCCSS+RQDKVCSVV
Sbjct: 301 NSASLKTYAHDDISVGSWMIGLQATHIDDNRLCCSSIRQDKVCSVV 346

BLAST of Bhi04G000009 vs. TrEMBL
Match: tr|A0A0A0LSK0|A0A0A0LSK0_CUCSA (Hexosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_2G382400 PE=3 SV=1)

HSP 1 Score: 676.8 bits (1745), Expect = 2.6e-191
Identity = 332/346 (95.95%), Positives = 340/346 (98.27%), Query Frame = 0

Query: 1   MESLPTTAKPERRPRSKPLHASKPSILLAFFSCLAWLYVAGRLWQDAENRKILASLLQKN 60
           MESLPTT+KPERRPRSKP+HASKPSILLAF SCLAWLYVAGRLWQDAENRK+L +LLQKN
Sbjct: 1   MESLPTTSKPERRPRSKPIHASKPSILLAFLSCLAWLYVAGRLWQDAENRKLLTTLLQKN 60

Query: 61  ASQRPVILSVEDKLQVLGCKDLERRIAEVEMDLTLAKSQGYLKNQLRQSGSSSDPGRKLL 120
           ASQRPVILSVEDKLQVLGCKDLERRI EVEMDLTLAKSQGYLKNQLRQSGSSSDPGRKLL
Sbjct: 61  ASQRPVILSVEDKLQVLGCKDLERRIVEVEMDLTLAKSQGYLKNQLRQSGSSSDPGRKLL 120

Query: 121 AVIGVYTGFGSRLRRNVFRGSWMPKGDALKKLEERGVVIRFVIGRSANRGDSLDRNIDKE 180
           AVIGVYTGFGSRLRRNVFRGSWMPKGDALKKLEERGV+IRFVIGRSANRGDSLDRNIDKE
Sbjct: 121 AVIGVYTGFGSRLRRNVFRGSWMPKGDALKKLEERGVIIRFVIGRSANRGDSLDRNIDKE 180

Query: 181 NDSTKDFLILEGHEEADEELPKKAKFFFSTAVQNWDAEFYVKVDDNIDLDLEGLIGLFEH 240
           N STKDFLILEGHEEADEELPKKAKFFFSTAVQNWDA+FYVKVDDNIDLDLEGLIGL EH
Sbjct: 181 NLSTKDFLILEGHEEADEELPKKAKFFFSTAVQNWDAQFYVKVDDNIDLDLEGLIGLLEH 240

Query: 241 RRGQDGTYVGCMKSGDVIAEEGKQWYEPEWWKFGDEKSYFRHASGALIILSKNLAQYINI 300
           RRGQD TYVGCMKSGDVIA+EGKQWYEPEWWKFGDEKSYFRHASGALIILSKNLAQYINI
Sbjct: 241 RRGQDSTYVGCMKSGDVIADEGKQWYEPEWWKFGDEKSYFRHASGALIILSKNLAQYINI 300

Query: 301 NSASLKTYAHDDISVGSWMIGLQATHIDDNRLCCSSVRQDKVCSVV 347
           NSASLKTYAHDDISVGSWMIGLQATHIDDNRLCCSS+RQDKVCSVV
Sbjct: 301 NSASLKTYAHDDISVGSWMIGLQATHIDDNRLCCSSIRQDKVCSVV 346

BLAST of Bhi04G000009 vs. TrEMBL
Match: tr|A0A2I4FX81|A0A2I4FX81_9ROSI (Hexosyltransferase OS=Juglans regia OX=51240 GN=LOC109002804 PE=3 SV=1)

HSP 1 Score: 607.4 bits (1565), Expect = 1.9e-170
Identity = 294/345 (85.22%), Positives = 326/345 (94.49%), Query Frame = 0

Query: 1   MESLPTTAKPERRPRSKPLHASKPSILLAFFSCLAWLYVAGRLWQDAENRKILASLLQKN 60
           MESLPTT K ERR RSKPL +SKPS+++AFFSCLAWLYVAGRLW+DAENRK+LA+LL KN
Sbjct: 1   MESLPTTMKSERRSRSKPLQSSKPSLVMAFFSCLAWLYVAGRLWEDAENRKLLANLLYKN 60

Query: 61  ASQRPVILSVEDKLQVLGCKDLERRIAEVEMDLTLAKSQGYLKNQLRQSGSSSDPGRKLL 120
           A QRP +L+VEDKL VLGC+DLERRI E EMDLTLAKSQGYLK++L+QSGSSS  G+KLL
Sbjct: 61  ALQRPKVLTVEDKLMVLGCRDLERRIVEAEMDLTLAKSQGYLKDKLQQSGSSS--GQKLL 120

Query: 121 AVIGVYTGFGSRLRRNVFRGSWMPKGDALKKLEERGVVIRFVIGRSANRGDSLDRNIDKE 180
           AVIGVYTGFGSRL+RNVFRGSWMPKGDAL+KLEERGVVIRFVIGRSANRGDSLDRNID+E
Sbjct: 121 AVIGVYTGFGSRLKRNVFRGSWMPKGDALRKLEERGVVIRFVIGRSANRGDSLDRNIDEE 180

Query: 181 NDSTKDFLILEGHEEADEELPKKAKFFFSTAVQNWDAEFYVKVDDNIDLDLEGLIGLFEH 240
             STKDFLILEGHEEA EELPKKAKFFFSTAVQNWDAEFYVKVDD+IDLDLEGLIGL + 
Sbjct: 181 YRSTKDFLILEGHEEAQEELPKKAKFFFSTAVQNWDAEFYVKVDDSIDLDLEGLIGLLDR 240

Query: 241 RRGQDGTYVGCMKSGDVIAEEGKQWYEPEWWKFGDEKSYFRHASGALIILSKNLAQYINI 300
           RRGQDG Y+GCMKSGDVI++EGK WYEP+WWKFGDEKSYFRHASG+L+ILSKNLAQYINI
Sbjct: 241 RRGQDGAYIGCMKSGDVISDEGKSWYEPDWWKFGDEKSYFRHASGSLLILSKNLAQYINI 300

Query: 301 NSASLKTYAHDDISVGSWMIGLQATHIDDNRLCCSSVRQDKVCSV 346
           NSASLK+YAHDD+SVGSWM+GLQAT+ID+NRLCCSS+RQDKVCS+
Sbjct: 301 NSASLKSYAHDDVSVGSWMMGLQATYIDENRLCCSSIRQDKVCSL 343

BLAST of Bhi04G000009 vs. TrEMBL
Match: tr|A0A2P4N206|A0A2P4N206_QUESU (Hexosyltransferase OS=Quercus suber OX=58331 GN=CFP56_13742 PE=3 SV=1)

HSP 1 Score: 598.2 bits (1541), Expect = 1.2e-167
Identity = 293/345 (84.93%), Positives = 317/345 (91.88%), Query Frame = 0

Query: 1   MESLPTTAKPERRPRSKPLHASKPSILLAFFSCLAWLYVAGRLWQDAENRKILASLLQKN 60
           MESLPTT K ERR RSKPL  SKPS+L+AFFSCLAWLYVAGRLWQDAENRK+L +LL KN
Sbjct: 1   MESLPTTMKSERRWRSKPLQTSKPSLLMAFFSCLAWLYVAGRLWQDAENRKVLTNLLYKN 60

Query: 61  ASQRPVILSVEDKLQVLGCKDLERRIAEVEMDLTLAKSQGYLKNQLRQSGSSSDPGRKLL 120
           + QRP IL+VEDKL VLGC+DLERRI E EM+LTLAKSQGYL  QL+QSGSSS  G+KLL
Sbjct: 61  SLQRPKILTVEDKLSVLGCRDLERRIVEAEMELTLAKSQGYLNKQLQQSGSSS--GKKLL 120

Query: 121 AVIGVYTGFGSRLRRNVFRGSWMPKGDALKKLEERGVVIRFVIGRSANRGDSLDRNIDKE 180
           AVIGVYTGFGSRL+RNVFRGSWMPKGDAL+KLEERGVVIRFVIGRSANRGDSLDRNI++E
Sbjct: 121 AVIGVYTGFGSRLKRNVFRGSWMPKGDALRKLEERGVVIRFVIGRSANRGDSLDRNINEE 180

Query: 181 NDSTKDFLILEGHEEADEELPKKAKFFFSTAVQNWDAEFYVKVDDNIDLDLEGLIGLFEH 240
           N STKDFLILEGHEEA EELPKKAKFF STAVQ WDA+F+VKVDDNIDLDLE LIGL E 
Sbjct: 181 NRSTKDFLILEGHEEAQEELPKKAKFFLSTAVQKWDADFFVKVDDNIDLDLEALIGLLER 240

Query: 241 RRGQDGTYVGCMKSGDVIAEEGKQWYEPEWWKFGDEKSYFRHASGALIILSKNLAQYINI 300
           RRGQDG Y+GCMKSGDVI+EEGK WYEP+WWKFGDEKSYFRHA  ALIILSKNLAQY+NI
Sbjct: 241 RRGQDGAYIGCMKSGDVISEEGKPWYEPDWWKFGDEKSYFRHAGTALIILSKNLAQYVNI 300

Query: 301 NSASLKTYAHDDISVGSWMIGLQATHIDDNRLCCSSVRQDKVCSV 346
           NSASLKTYAHDD SVGSWM+GLQAT+IDDNRLCCSS+RQDKVCS+
Sbjct: 301 NSASLKTYAHDDTSVGSWMMGLQATYIDDNRLCCSSIRQDKVCSL 343

BLAST of Bhi04G000009 vs. TrEMBL
Match: tr|A0A2P5DDF1|A0A2P5DDF1_PARAD (Hexosyltransferase OS=Parasponia andersonii OX=3476 GN=PanWU01x14_074650 PE=3 SV=1)

HSP 1 Score: 598.2 bits (1541), Expect = 1.2e-167
Identity = 293/346 (84.68%), Positives = 318/346 (91.91%), Query Frame = 0

Query: 1   MESLPTTAK-PERRPRSKPLHASKPSILLAFFSCLAWLYVAGRLWQDAENRKILASLLQK 60
           MESLPTT K  ERR RSKPL  SKPS+L+A FSC+AWLYVAGRLWQDAENR +LA LL+K
Sbjct: 1   MESLPTTTKSSERRWRSKPLQTSKPSLLMALFSCMAWLYVAGRLWQDAENRTLLAGLLKK 60

Query: 61  NASQRPVILSVEDKLQVLGCKDLERRIAEVEMDLTLAKSQGYLKNQLRQSGSSSDPGRKL 120
           NA QRP ILSVEDKL VLGCKDLERRI E EMDLTLAKSQGYLKN L+ SGSSS    ++
Sbjct: 61  NAGQRPKILSVEDKLAVLGCKDLERRIVEAEMDLTLAKSQGYLKNHLQGSGSSSS---QI 120

Query: 121 LAVIGVYTGFGSRLRRNVFRGSWMPKGDALKKLEERGVVIRFVIGRSANRGDSLDRNIDK 180
           LAVIGVYTGFGSRL+RNVFRGSWMP+GDALKKLEERGVVIRFVIGRSANRGDSLDRNID+
Sbjct: 121 LAVIGVYTGFGSRLKRNVFRGSWMPRGDALKKLEERGVVIRFVIGRSANRGDSLDRNIDE 180

Query: 181 ENDSTKDFLILEGHEEADEELPKKAKFFFSTAVQNWDAEFYVKVDDNIDLDLEGLIGLFE 240
           EN STKDF ILEGHEEA EELPKKAKFFFSTAVQNW+AEFYVKVDDNIDLDLEGLIGL E
Sbjct: 181 ENRSTKDFFILEGHEEAQEELPKKAKFFFSTAVQNWNAEFYVKVDDNIDLDLEGLIGLLE 240

Query: 241 HRRGQDGTYVGCMKSGDVIAEEGKQWYEPEWWKFGDEKSYFRHASGALIILSKNLAQYIN 300
           HRRGQ+ +Y+GCMKSGDV+AEEGK WYEPEWWKFGDEKSYFRHASG+L+ILSKNLAQY+ 
Sbjct: 241 HRRGQNSSYIGCMKSGDVVAEEGKPWYEPEWWKFGDEKSYFRHASGSLLILSKNLAQYVY 300

Query: 301 INSASLKTYAHDDISVGSWMIGLQATHIDDNRLCCSSVRQDKVCSV 346
           +NSASLKTYAHDDIS+GSWM+GLQAT+IDDNR CCSS+RQDKVCS+
Sbjct: 301 VNSASLKTYAHDDISIGSWMMGLQATYIDDNRFCCSSIRQDKVCSL 343

BLAST of Bhi04G000009 vs. NCBI nr
Match: XP_008445237.1 (PREDICTED: hydroxyproline O-galactosyltransferase HPGT3-like [Cucumis melo])

HSP 1 Score: 679.9 bits (1753), Expect = 4.6e-192
Identity = 334/346 (96.53%), Positives = 342/346 (98.84%), Query Frame = 0

Query: 1   MESLPTTAKPERRPRSKPLHASKPSILLAFFSCLAWLYVAGRLWQDAENRKILASLLQKN 60
           MESLPTT+KPERRPRSKPLHASKPSILLAF SCLAWLYVAGRLWQDAENRK+L++LLQKN
Sbjct: 1   MESLPTTSKPERRPRSKPLHASKPSILLAFLSCLAWLYVAGRLWQDAENRKLLSTLLQKN 60

Query: 61  ASQRPVILSVEDKLQVLGCKDLERRIAEVEMDLTLAKSQGYLKNQLRQSGSSSDPGRKLL 120
           ASQRPVILSVEDKLQVLGCKDLERRI EVEMDLTLAKSQGYLKNQLRQSGSSS+PGRKLL
Sbjct: 61  ASQRPVILSVEDKLQVLGCKDLERRIVEVEMDLTLAKSQGYLKNQLRQSGSSSNPGRKLL 120

Query: 121 AVIGVYTGFGSRLRRNVFRGSWMPKGDALKKLEERGVVIRFVIGRSANRGDSLDRNIDKE 180
           AVIGVYTGFGSRLRRNVFRGSWMPKGDALKKLEERGV+IRFVIGRSANRGDSLDRNIDKE
Sbjct: 121 AVIGVYTGFGSRLRRNVFRGSWMPKGDALKKLEERGVIIRFVIGRSANRGDSLDRNIDKE 180

Query: 181 NDSTKDFLILEGHEEADEELPKKAKFFFSTAVQNWDAEFYVKVDDNIDLDLEGLIGLFEH 240
           N STKDFLILEGHEEADEELPKKAKFFFSTAVQNWDAEFYVKVDD+IDLDLEGLIGL EH
Sbjct: 181 NHSTKDFLILEGHEEADEELPKKAKFFFSTAVQNWDAEFYVKVDDHIDLDLEGLIGLLEH 240

Query: 241 RRGQDGTYVGCMKSGDVIAEEGKQWYEPEWWKFGDEKSYFRHASGALIILSKNLAQYINI 300
           RRGQDGTYVGCMKSGDVIAEEGKQWYEPEWWKFGDEKSYFRHASGALIILSKNLAQYINI
Sbjct: 241 RRGQDGTYVGCMKSGDVIAEEGKQWYEPEWWKFGDEKSYFRHASGALIILSKNLAQYINI 300

Query: 301 NSASLKTYAHDDISVGSWMIGLQATHIDDNRLCCSSVRQDKVCSVV 347
           NSASLKTYAHDDISVGSWMIGLQATHIDDNRLCCSS+RQDKVCSVV
Sbjct: 301 NSASLKTYAHDDISVGSWMIGLQATHIDDNRLCCSSIRQDKVCSVV 346

BLAST of Bhi04G000009 vs. NCBI nr
Match: XP_004138719.1 (PREDICTED: probable beta-1,3-galactosyltransferase 9 [Cucumis sativus] >KGN62966.1 hypothetical protein Csa_2G382400 [Cucumis sativus])

HSP 1 Score: 676.8 bits (1745), Expect = 3.9e-191
Identity = 332/346 (95.95%), Positives = 340/346 (98.27%), Query Frame = 0

Query: 1   MESLPTTAKPERRPRSKPLHASKPSILLAFFSCLAWLYVAGRLWQDAENRKILASLLQKN 60
           MESLPTT+KPERRPRSKP+HASKPSILLAF SCLAWLYVAGRLWQDAENRK+L +LLQKN
Sbjct: 1   MESLPTTSKPERRPRSKPIHASKPSILLAFLSCLAWLYVAGRLWQDAENRKLLTTLLQKN 60

Query: 61  ASQRPVILSVEDKLQVLGCKDLERRIAEVEMDLTLAKSQGYLKNQLRQSGSSSDPGRKLL 120
           ASQRPVILSVEDKLQVLGCKDLERRI EVEMDLTLAKSQGYLKNQLRQSGSSSDPGRKLL
Sbjct: 61  ASQRPVILSVEDKLQVLGCKDLERRIVEVEMDLTLAKSQGYLKNQLRQSGSSSDPGRKLL 120

Query: 121 AVIGVYTGFGSRLRRNVFRGSWMPKGDALKKLEERGVVIRFVIGRSANRGDSLDRNIDKE 180
           AVIGVYTGFGSRLRRNVFRGSWMPKGDALKKLEERGV+IRFVIGRSANRGDSLDRNIDKE
Sbjct: 121 AVIGVYTGFGSRLRRNVFRGSWMPKGDALKKLEERGVIIRFVIGRSANRGDSLDRNIDKE 180

Query: 181 NDSTKDFLILEGHEEADEELPKKAKFFFSTAVQNWDAEFYVKVDDNIDLDLEGLIGLFEH 240
           N STKDFLILEGHEEADEELPKKAKFFFSTAVQNWDA+FYVKVDDNIDLDLEGLIGL EH
Sbjct: 181 NLSTKDFLILEGHEEADEELPKKAKFFFSTAVQNWDAQFYVKVDDNIDLDLEGLIGLLEH 240

Query: 241 RRGQDGTYVGCMKSGDVIAEEGKQWYEPEWWKFGDEKSYFRHASGALIILSKNLAQYINI 300
           RRGQD TYVGCMKSGDVIA+EGKQWYEPEWWKFGDEKSYFRHASGALIILSKNLAQYINI
Sbjct: 241 RRGQDSTYVGCMKSGDVIADEGKQWYEPEWWKFGDEKSYFRHASGALIILSKNLAQYINI 300

Query: 301 NSASLKTYAHDDISVGSWMIGLQATHIDDNRLCCSSVRQDKVCSVV 347
           NSASLKTYAHDDISVGSWMIGLQATHIDDNRLCCSS+RQDKVCSVV
Sbjct: 301 NSASLKTYAHDDISVGSWMIGLQATHIDDNRLCCSSIRQDKVCSVV 346

BLAST of Bhi04G000009 vs. NCBI nr
Match: XP_022962170.1 (hydroxyproline O-galactosyltransferase HPGT3-like [Cucurbita moschata] >XP_022962171.1 hydroxyproline O-galactosyltransferase HPGT3-like [Cucurbita moschata])

HSP 1 Score: 650.6 bits (1677), Expect = 3.0e-183
Identity = 317/346 (91.62%), Positives = 333/346 (96.24%), Query Frame = 0

Query: 1   MESLPTTAKPERRPRSKPLHASKPSILLAFFSCLAWLYVAGRLWQDAENRKILASLLQKN 60
           MESLPTT KPERR RSKPL+ SKPSI +AFFSCLAWLYVAGRLWQDAENRK+LASLLQKN
Sbjct: 1   MESLPTTVKPERRARSKPLYTSKPSIFMAFFSCLAWLYVAGRLWQDAENRKLLASLLQKN 60

Query: 61  ASQRPVILSVEDKLQVLGCKDLERRIAEVEMDLTLAKSQGYLKNQLRQSGSSSDPGRKLL 120
           ASQRP+ILSVEDKLQVLGCKDLERRI EVEMDLTLAKSQGYLKNQLRQSGSSS+P RKLL
Sbjct: 61  ASQRPLILSVEDKLQVLGCKDLERRIVEVEMDLTLAKSQGYLKNQLRQSGSSSEPSRKLL 120

Query: 121 AVIGVYTGFGSRLRRNVFRGSWMPKGDALKKLEERGVVIRFVIGRSANRGDSLDRNIDKE 180
           AVIGV+TGFGSRLRRN FRGSWMPKGDALKKLEERGVVIRFVIGRSANRGD+LDRNIDKE
Sbjct: 121 AVIGVHTGFGSRLRRNAFRGSWMPKGDALKKLEERGVVIRFVIGRSANRGDTLDRNIDKE 180

Query: 181 NDSTKDFLILEGHEEADEELPKKAKFFFSTAVQNWDAEFYVKVDDNIDLDLEGLIGLFEH 240
           N STKDFLILEGHEEAD+ELPKKAKFFF TAVQNWDAEFYVKVDDNIDLDLEGLI L EH
Sbjct: 181 NHSTKDFLILEGHEEADDELPKKAKFFFITAVQNWDAEFYVKVDDNIDLDLEGLIELLEH 240

Query: 241 RRGQDGTYVGCMKSGDVIAEEGKQWYEPEWWKFGDEKSYFRHASGALIILSKNLAQYINI 300
           RRGQDGTY+GCMKSGDVIAEEGK WYEPEWWKFGD+KSYF+HASG+LIILSKNLAQYI I
Sbjct: 241 RRGQDGTYIGCMKSGDVIAEEGKDWYEPEWWKFGDKKSYFQHASGSLIILSKNLAQYIYI 300

Query: 301 NSASLKTYAHDDISVGSWMIGLQATHIDDNRLCCSSVRQDKVCSVV 347
           NSASLKTYAHDD+S+GSWMIGLQATHIDDNRLCCSS+RQDKVCS+V
Sbjct: 301 NSASLKTYAHDDVSMGSWMIGLQATHIDDNRLCCSSIRQDKVCSMV 346

BLAST of Bhi04G000009 vs. NCBI nr
Match: XP_022132214.1 (hydroxyproline O-galactosyltransferase HPGT3-like [Momordica charantia])

HSP 1 Score: 649.4 bits (1674), Expect = 6.7e-183
Identity = 318/345 (92.17%), Positives = 331/345 (95.94%), Query Frame = 0

Query: 1   MESLPTTAKPERRPRSKPLHASKPSILLAFFSCLAWLYVAGRLWQDAENRKILASLLQKN 60
           MESLPTT KPERR RSK L  S PS+L+AFFSCLAWLYVAGRLWQDAENRK+LA+LLQKN
Sbjct: 1   MESLPTTVKPERRGRSKTLQTSNPSMLMAFFSCLAWLYVAGRLWQDAENRKLLANLLQKN 60

Query: 61  ASQRPVILSVEDKLQVLGCKDLERRIAEVEMDLTLAKSQGYLKNQLRQSGSSSDPGRKLL 120
           A+QRP+ILSVEDKLQVLGCKDLERRI EVEMDLTLAKSQGYLKNQLRQSGSSS PG KLL
Sbjct: 61  AAQRPMILSVEDKLQVLGCKDLERRIVEVEMDLTLAKSQGYLKNQLRQSGSSSGPGHKLL 120

Query: 121 AVIGVYTGFGSRLRRNVFRGSWMPKGDALKKLEERGVVIRFVIGRSANRGDSLDRNIDKE 180
           AVIGVYTGFGSRL+RNVFRGSWMPKGDALKKLEERGVVIRFVIGRSANRGDSLDRNIDKE
Sbjct: 121 AVIGVYTGFGSRLKRNVFRGSWMPKGDALKKLEERGVVIRFVIGRSANRGDSLDRNIDKE 180

Query: 181 NDSTKDFLILEGHEEADEELPKKAKFFFSTAVQNWDAEFYVKVDDNIDLDLEGLIGLFEH 240
           N STKDFLILEGHEEADEE PKKAKFFFSTAVQNWDAEFYVKVDDNIDLDLEGLIGL E 
Sbjct: 181 NHSTKDFLILEGHEEADEEFPKKAKFFFSTAVQNWDAEFYVKVDDNIDLDLEGLIGLLER 240

Query: 241 RRGQDGTYVGCMKSGDVIAEEGKQWYEPEWWKFGDEKSYFRHASGALIILSKNLAQYINI 300
           RRGQDGTY+GCMKSGDVIA+EGKQWYEPEWWKFGDEKSYFRHASG+LIILSKNLAQY+NI
Sbjct: 241 RRGQDGTYIGCMKSGDVIADEGKQWYEPEWWKFGDEKSYFRHASGSLIILSKNLAQYVNI 300

Query: 301 NSASLKTYAHDDISVGSWMIGLQATHIDDNRLCCSSVRQDKVCSV 346
           NSASLKTYAHDDISVGSWMIGLQATHIDD+RLCCSS RQ+KVCSV
Sbjct: 301 NSASLKTYAHDDISVGSWMIGLQATHIDDSRLCCSSSRQEKVCSV 345

BLAST of Bhi04G000009 vs. NCBI nr
Match: XP_023546298.1 (hydroxyproline O-galactosyltransferase HPGT3-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 648.3 bits (1671), Expect = 1.5e-182
Identity = 316/346 (91.33%), Positives = 333/346 (96.24%), Query Frame = 0

Query: 1   MESLPTTAKPERRPRSKPLHASKPSILLAFFSCLAWLYVAGRLWQDAENRKILASLLQKN 60
           MESLPTTAKPERR RSKPL+ SKPSI +AFFSCLAWLYVAGRLWQDAENRK+LASLLQKN
Sbjct: 1   MESLPTTAKPERRARSKPLYTSKPSIFMAFFSCLAWLYVAGRLWQDAENRKLLASLLQKN 60

Query: 61  ASQRPVILSVEDKLQVLGCKDLERRIAEVEMDLTLAKSQGYLKNQLRQSGSSSDPGRKLL 120
           ASQRP+ILSVEDKLQVLGCKDLERRI EVEMDLTLAKSQGYLKNQLRQSGSSS+P RKLL
Sbjct: 61  ASQRPLILSVEDKLQVLGCKDLERRIVEVEMDLTLAKSQGYLKNQLRQSGSSSEPSRKLL 120

Query: 121 AVIGVYTGFGSRLRRNVFRGSWMPKGDALKKLEERGVVIRFVIGRSANRGDSLDRNIDKE 180
           AVIGV+TGFGS+LRRN FRGSWMPKGDALKKLEERGVVIRFVIGRSANRGD+LDRNIDKE
Sbjct: 121 AVIGVHTGFGSQLRRNAFRGSWMPKGDALKKLEERGVVIRFVIGRSANRGDTLDRNIDKE 180

Query: 181 NDSTKDFLILEGHEEADEELPKKAKFFFSTAVQNWDAEFYVKVDDNIDLDLEGLIGLFEH 240
           N STKDFLILEGHEEAD+ELPKKAKFFF TAVQNWDAEFYVKVDDNIDLDLEGLI L EH
Sbjct: 181 NQSTKDFLILEGHEEADDELPKKAKFFFITAVQNWDAEFYVKVDDNIDLDLEGLIELLEH 240

Query: 241 RRGQDGTYVGCMKSGDVIAEEGKQWYEPEWWKFGDEKSYFRHASGALIILSKNLAQYINI 300
           RRGQDGTY+GCMKSGDVIAEEGK WYEPEWWKFGD+KSYF+HASG+LIILSKNLAQYI I
Sbjct: 241 RRGQDGTYIGCMKSGDVIAEEGKDWYEPEWWKFGDKKSYFQHASGSLIILSKNLAQYIYI 300

Query: 301 NSASLKTYAHDDISVGSWMIGLQATHIDDNRLCCSSVRQDKVCSVV 347
           NSASLKTYAHDD+S+GSWMIGLQATHID+NRLCC SVRQDKVCS+V
Sbjct: 301 NSASLKTYAHDDVSMGSWMIGLQATHIDENRLCCGSVRQDKVCSMV 346
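
The percentages in each HSP summary line are the matched column count divided by the alignment length. A minimal Python sketch of that arithmetic follows. The helper name `parse_hsp_summary` and the regular expression are illustrative assumptions written for the line layout shown in this report; they are not part of any BLAST tool.

```python
import re

# Assumed pattern for the summary line layout used in this report.
# "Posi?tives" also accepts the "Postives" spelling that some report
# renderers emit.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Extract counts from one summary line and recompute the percentages."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, length, _, positive, pos_length, _ = match.groups()
    identical, length = int(identical), int(length)
    positive, pos_length = int(positive), int(pos_length)
    return {
        "identical": identical,
        "positive": positive,
        "alignment_length": length,
        # Percent identity = identical columns / alignment length.
        "identity_pct": round(100.0 * identical / length, 2),
        # Percent positives = conservative matches / alignment length.
        "positives_pct": round(100.0 * positive / pos_length, 2),
    }


# Summary line taken from the Quercus suber hit above.
summary = parse_hsp_summary(
    "Identity = 293/345 (84.93%), Positives = 317/345 (91.88%), Query Frame = 0"
)
print(summary["identity_pct"], summary["positives_pct"])
```

Recomputing the values from the raw counts is a cheap consistency check when these reports are scraped from a web page.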

The following BLAST results are available for this feature:
Match Name             E-value   Identity  Description
sp|Q5XEZ1|B3GT9_ARATH  7.6e-154  76.79     Hydroxyproline O-galactosyltransferase HPGT3 OS=Arabidopsis thaliana OX=3702 GN=...
sp|Q94A05|B3GTA_ARATH  4.6e-151  75.86     Hydroxyproline O-galactosyltransferase HPGT2 OS=Arabidopsis thaliana OX=3702 GN=...
sp|Q94F27|B3GTB_ARATH  4.3e-80   42.73     Hydroxyproline O-galactosyltransferase HPGT1 OS=Arabidopsis thaliana OX=3702 GN=...
sp|Q9ZV71|B3GT3_ARATH  5.8e-61   40.28     Probable beta-1,3-galactosyltransferase 3 OS=Arabidopsis thaliana OX=3702 GN=B3G...
sp|A8MRC7|B3GT2_ARATH  6.4e-60   37.88     Probable beta-1,3-galactosyltransferase 2 OS=Arabidopsis thaliana OX=3702 GN=B3G...

Match Name   E-value   Identity  Description
AT2G25300.1  4.2e-155  76.79     Galactosyltransferase family protein
AT4G32120.1  2.6e-152  75.86     Galactosyltransferase family protein
AT5G53340.1  2.4e-81   42.73     Galactosyltransferase family protein
AT2G32430.1  3.2e-62   40.28     Galactosyltransferase family protein
AT1G05170.2  3.6e-61   37.88     Galactosyltransferase family protein

Match Name                      E-value   Identity  Description
tr|A0A1S3BCY2|A0A1S3BCY2_CUCME  3.1e-192  96.53     Hexosyltransferase OS=Cucumis melo OX=3656 GN=LOC103488325 PE=3 SV=1
tr|A0A0A0LSK0|A0A0A0LSK0_CUCSA  2.6e-191  95.95     Hexosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_2G382400 PE=3 SV=1
tr|A0A2I4FX81|A0A2I4FX81_9ROSI  1.9e-170  85.22     Hexosyltransferase OS=Juglans regia OX=51240 GN=LOC109002804 PE=3 SV=1
tr|A0A2P4N206|A0A2P4N206_QUESU  1.2e-167  84.93     Hexosyltransferase OS=Quercus suber OX=58331 GN=CFP56_13742 PE=3 SV=1
tr|A0A2P5DDF1|A0A2P5DDF1_PARAD  1.2e-167  84.68     Hexosyltransferase OS=Parasponia andersonii OX=3476 GN=PanWU01x14_074650 PE=3 SV...

Match Name      E-value   Identity  Description
XP_008445237.1  4.6e-192  96.53     PREDICTED: hydroxyproline O-galactosyltransferase HPGT3-like [Cucumis melo]
XP_004138719.1  3.9e-191  95.95     PREDICTED: probable beta-1,3-galactosyltransferase 9 [Cucumis sativus] >KGN62966...
XP_022962170.1  3.0e-183  91.62     hydroxyproline O-galactosyltransferase HPGT3-like [Cucurbita moschata] >XP_02296...
XP_022132214.1  6.7e-183  92.17     hydroxyproline O-galactosyltransferase HPGT3-like [Momordica charantia]
XP_023546298.1  1.5e-182  91.33     hydroxyproline O-galactosyltransferase HPGT3-like isoform X1 [Cucurbita pepo sub...

The following terms have been associated with this gene:
Vocabulary: Biological Process
Term        Definition
GO:0006486  protein glycosylation
Vocabulary: Molecular Function
Term        Definition
GO:0008378  galactosyltransferase activity
Vocabulary: Cellular Component
Term        Definition
GO:0016020  membrane
Vocabulary: INTERPRO
Term       Definition
IPR002659  Glyco_trans_31
IPR025298  DUF4094
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006486 protein glycosylation
biological_process GO:0030206 chondroitin sulfate biosynthetic process
cellular_component GO:0000139 Golgi membrane
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0005794 Golgi apparatus
cellular_component GO:0016020 membrane
molecular_function GO:0008378 galactosyltransferase activity
molecular_function GO:0047220 galactosylxylosylprotein 3-beta-galactosyltransferase activity
molecular_function GO:0016740 transferase activity
molecular_function GO:0016757 transferase activity, transferring glycosyl groups
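
Each row of the GO assignment list above has three fields: category, accession, and term name. The following Python sketch groups the rows by category. The function name `group_go_terms` is a hypothetical helper, and the sketch assumes single-space separation with the term name as the remainder of the line.

```python
from collections import defaultdict

# Rows copied from the GO assignment list above.
GO_LINES = """\
biological_process GO:0006486 protein glycosylation
biological_process GO:0030206 chondroitin sulfate biosynthetic process
cellular_component GO:0000139 Golgi membrane
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0005794 Golgi apparatus
cellular_component GO:0016020 membrane
molecular_function GO:0008378 galactosyltransferase activity
molecular_function GO:0047220 galactosylxylosylprotein 3-beta-galactosyltransferase activity
molecular_function GO:0016740 transferase activity
molecular_function GO:0016757 transferase activity, transferring glycosyl groups
""".splitlines()


def group_go_terms(lines):
    """Return {category: [(accession, term_name), ...]} for the given rows."""
    grouped = defaultdict(list)
    for line in lines:
        # Split on the first two spaces only; the term name may contain spaces.
        category, accession, name = line.split(" ", 2)
        grouped[category].append((accession, name))
    return dict(grouped)


go_by_category = group_go_terms(GO_LINES)
for category, terms in go_by_category.items():
    print(category, len(terms))
```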

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Bhi04M000009  Bhi04M000009  mRNA


Analysis Name: InterPro Annotations of wax gourd
Date Performed: 2019-11-17
IPR Term   IPR Description                     Source   Source Term      Source Description                                    Alignment
None       No IPR available                    COILS    Coil             Coil                                                  coord: 79..99
None       No IPR available                    PANTHER  PTHR11214:SF124  HYDROXYPROLINE O-GALACTOSYLTRANSFERASE HPGT2-RELATED  coord: 1..343
IPR025298  Domain of unknown function DUF4094  PFAM     PF13334          DUF4094                                               coord: 22..99; e-value: 5.1E-9; score: 36.3
IPR002659  Glycosyl transferase, family 31     PFAM     PF01762          Galactosyl_T                                          coord: 134..328; e-value: 7.9E-31; score: 107.3
IPR002659  Glycosyl transferase, family 31     PANTHER  PTHR11214        BETA-1,3-N-ACETYLGLUCOSAMINYLTRANSFERASE              coord: 1..343