CmaCh04G003200 (gene) Cucurbita maxima (Rimu)

NameCmaCh04G003200
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu))
DescriptionBeta-1,3-galactosyltransferase-like protein
LocationCma_Chr04 : 1583712 .. 1587500 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GAAAGTTGGGCACCACCATCACCAGGGCCGAGTGCAGTGCTGGGATGAGGTGTTCATTCCCACCACCACTTAAGCCTCTTTTTCTTTTTTCTTTTTTCTTTTTTAAATTTCGAATCTAGTTTTAATTCGTTCCCGTAATCGGATGCAATAACAACGAAGGTAATGAATGGATTAAACCACGTTTGTGACAGGTTGATAATGCAAAGCACGCAGAAAATCAAGCCATGTGAGCTCTAACACGCTGTCGTTTTCATCTTCTTTCCTTTTTGTTCTCTCTTTTTCTGTCTCTGTGTTGTTTTCGAGCTCAGTTTTCCAGAGAGCTTCAATTCATTTCCATTTACTGGAACTAGGGTTAGGGTTCGACGGAATTCTTGATTGTAGAAACGAGGTGTTCAAGTCTGGGGGGAAATCTGTTTGCATAAAATCCCCTACGTTTTTTTTGTTCTTCTTTCGTCTGATTTCTTGTTTTTTTTCCCTATACTTGACCTGTTTGTTGGTGGGTGGTGGTGGTTTTGTTCTACATTGCGTTGGGGGCTAGAAGGGGAGGATGAAAAGGGGGAAATTGGATACAATGGTATCACGAAACCGAATTAGGTTGCTTCAATTTCTTATGGGGTTGGTGTTTTTCTATCTGCTTTTCATGAGTTTTGAAATACCGCTGGTGTATCGAACCGGATATGGGTCGGTGCCTGATGATGGAACATTTGGATTCACCAGCGACACTTTGCCGAGGCCGTTTCTGCTTGAAAGTGAAGAGGAAATGGCTGATAAAGACGCCCCTCGTCGACCCTCTGATGATACCTTTCTGGTTTCTCATGGCTCGCCGCATCGGACACCCGAGAGGCGAATGCGTGAGTTCAAGAAAGTTTCGGGTTTAGTCTTCGACGAAAGCACATTTGATCGTAATGCTAGTAAGGGGGAGTTCTCGGAGCTTCATAAAGCGGTTAAACATGCTTGGGTAGTGGGGAAAAAGCTTTGGGGGGACTTAGAGTCCGGAAAAATTGTTCTCCAACCCAAAACGAAGACAGAGAATCAGTCGGAGACTTGTCCACATTCGATTACGCTTTCTGGATCCGAATTTGAGGCACAGAGTCGGATTCTGGAGCTCCCCTGCGGCTTGACGCTCTGGTCGCATATCACAGTGGTGGGGACGCCTCGTTGGGCTCACTTGGAAGATGATCCCAAGATTTCAATCTTGAGAGAAGGGGATGATTCAGTGATGGTTTCACAGTTTATGATGGAGTTGCAAGGGCTGAAGACGGTGGATGGTGAAGACCCACCAAGAATTCTTCATTTCAATCCAAGGTTGAAGGGAGATTGGAGTGGCAAGCCTGTTATCGAACAGAACACTTGTTATAGGATGCAATGGGGCACAGCGCTGAGATGTGAGGGATGGAAAGCCAGGGCAGATGAAGAAACAGGTAACTTTTGTTTCACGGTCTTTGTATTGTCCATTGATTAGAATTCTGTATGAACTGAACTAGCTTTGTTTTGATAGAATGCATTGAATCCTGAATTTCTTCAAGTGAGATGTTACCTAACTTACCATTGTGATTTCTTGAAGTCGACGGGCAGGTAAAATGTGAGAAATGGATTCGTGACGACGACAGCCATTCCGAAGAATCGAAGGTAATATGGTGGTTAAATAGACTAATAGGACGCACGAAAAAGGTGGCGATCGATTGGCCATATCCTTTCGCGGAGGGCAGGCTATTTGTTCTAACTGTGAGTGCTGGGTTGGAAGGTTACCATATCAATGTTGATGGAAGGCATGTCACTTCTTTTCCATATCGCACTGTAAGTACTCGTTCTTCTATAAATCTAAATTGGTACTCTGCGTCGATAGCTTAGATTGATGGCGATGACAGGGGTTTGTTCTGGAGGATGCCACTGGGTTGTCTGTAAATGGCGATATTGACGTGCACTCCATTTTTGCTGCTTCCTTACCTGCTACACATCCTAGCTTTGCACCACAGAAGCATATTGAGATGTTGACACAATGGAAAGCCCCTTCACTTCCCAAGAAAAATGTGGAGCTTTTCATTGGCGTTCTTTCTGCTGGTAATCATTTTGCGGAGCGAATGGCTGTTAGGAAGTCTTGGATGCAACATAGATTAATCAGATCTTCAATAGTTGTCGCTAGGTTCTTCGTGGCAATGGTAAGGAACGGCGTGCTTCTCAGAAACGTTCCATCATTTATATCATAATAGTGACAACAAATTCTATGTTACATTCTTACAGCACGGAAGAAAGGAAGTAAATATCGAGTTGAAGAAAGAGGCCGAGTATTTTGGAGATATTGTAATAGTTCCTTTCATGGATAACTATGATCTCGTTGTACTGAAGACGATTGCAATATGTGAATATGGGGTGAGTTGTGATAAATTGACACAAACCTTTTGAGTTGGGTAGTCTTTGTCGTTGGAAATATGATTCCACGCTGATCTTGCAGGTTCGCACGGTGGCTGCAAACTATATCATGAAGTGTGACGATGATACATTTGTTAGAGTGGATGCAGTGATTGATGAAGCTCACAAAGTCCAATCTGATGGTGGGAGCCTTTATGTTGGAAACATGAACTTTCACCATAAACCTCTTCGTCATGGAAAATGGGCAGTGACTTACGAGGTATGACCATAAGAATGCGACTCGTGACATAGTTCTGTAATCCTGTTTTTCATCGACCTTCTACAGCTGAAGGGATTTGAACTTACTATATAATGCATCCTTTGGTGACCATAGTAAATGTCTTGTGGCTTTGACTGTTTATGTTAATCCGTTTTTGCTGCATTTCATAACATGCTTTTGTATGCACTTGATTACGATAAGTTCACGGTCGGTTAAGAGCTATCCATCCTGAACAACCAAAAGTTGCTTGCTGAATGGATCTTTATAATCTATATATCAATCCTCCGGTGGCCATTGTTTCATTAGCCTGTTAGTCGTTCTTTTTCGCATCATTCTCTAAATACCATCATCTCAACCACATGGGGGGCCTATAAATTGGTTCATAATTGCGTATTTAGCTATGTTTGTTTCGATCTGTTCAGGAATGGCCAGAAGAAGATTACCCAACGTACGCAAATGGGCCGGGTTACATTCTGTCATCGGACATTGCAGAGTATATTGTATCTGAGTTTGAGAAGCACAGATTAAGGGTAAGTTTCATTCAAACTCCTCCCCTTCCCTTCCCTTCCCTTCCCTATAAACACGATCGATCTTATGAAATGCACAATAAAACAGTTGTTCAAGATGGAAGATGTGAGCATGGGAATGTGGGTGGAGCAGTTCAACCGTACGAAACCAGTTTTATACCATCACAGTCTGAGGTTTTGCCAGTTTGGGTGCATTGAAGATTACTTAACTTCTCATTACCAATCTCCTAGACAGATGGTGTGCTTGTGGGAGAAGTTGATGCATCAAAGAAGGCCACAGTGCTGCAACATGAGATGATTCATATTTATCGGAAGAAGAAGAAGAAGAAGAAGAAGATGCAAGTTTTGGTTCAAATCTGTAAATACTATAATAGAAGTGGGATGAATCTAATCTGGAAGCTTCCTTCCTTTTCTATAAATTCATTCATTTTTTGGGTTTCTTCCTTTTGGTTTTACTTCCAGTTTATGTTATATCCCCATTTATGCTATAAATGGATTTAACATCTCAATTATCAATCTTGTTGTTGGGCTCCACGCAATCAATTCTAGAAAAGAGAAGCCTATTTCCATCTCAATCTTTAAAGCTAAAACAAATTGATTTGGTATGTATTTTGGTAGTTGCGTCACTAAAAG

mRNA sequence

GAAAGTTGGGCACCACCATCACCAGGGCCGAGTGCAGTGCTGGGATGAGGTGTTCATTCCCACCACCACTTAAGCCTCTTTTTCTTTTTTCTTTTTTCTTTTTTAAATTTCGAATCTAGTTTTAATTCGTTCCCGTAATCGGATGCAATAACAACGAAGGTAATGAATGGATTAAACCACGTTTGTGACAGGTTGATAATGCAAAGCACGCAGAAAATCAAGCCATGTGAGCTCTAACACGCTGTCGTTTTCATCTTCTTTCCTTTTTGTTCTCTCTTTTTCTGTCTCTGTGTTGTTTTCGAGCTCAGTTTTCCAGAGAGCTTCAATTCATTTCCATTTACTGGAACTAGGGTTAGGGTTCGACGGAATTCTTGATTGTAGAAACGAGGTGTTCAAGTCTGGGGGGAAATCTGTTTGCATAAAATCCCCTACGTTTTTTTTGTTCTTCTTTCGTCTGATTTCTTGTTTTTTTTCCCTATACTTGACCTGTTTGTTGGTGGGTGGTGGTGGTTTTGTTCTACATTGCGTTGGGGGCTAGAAGGGGAGGATGAAAAGGGGGAAATTGGATACAATGGTATCACGAAACCGAATTAGGTTGCTTCAATTTCTTATGGGGTTGGTGTTTTTCTATCTGCTTTTCATGAGTTTTGAAATACCGCTGGTGTATCGAACCGGATATGGGTCGGTGCCTGATGATGGAACATTTGGATTCACCAGCGACACTTTGCCGAGGCCGTTTCTGCTTGAAAGTGAAGAGGAAATGGCTGATAAAGACGCCCCTCGTCGACCCTCTGATGATACCTTTCTGGTTTCTCATGGCTCGCCGCATCGGACACCCGAGAGGCGAATGCGTGAGTTCAAGAAAGTTTCGGGTTTAGTCTTCGACGAAAGCACATTTGATCGTAATGCTAGTAAGGGGGAGTTCTCGGAGCTTCATAAAGCGGTTAAACATGCTTGGGTAGTGGGGAAAAAGCTTTGGGGGGACTTAGAGTCCGGAAAAATTGTTCTCCAACCCAAAACGAAGACAGAGAATCAGTCGGAGACTTGTCCACATTCGATTACGCTTTCTGGATCCGAATTTGAGGCACAGAGTCGGATTCTGGAGCTCCCCTGCGGCTTGACGCTCTGGTCGCATATCACAGTGGTGGGGACGCCTCGTTGGGCTCACTTGGAAGATGATCCCAAGATTTCAATCTTGAGAGAAGGGGATGATTCAGTGATGGTTTCACAGTTTATGATGGAGTTGCAAGGGCTGAAGACGGTGGATGGTGAAGACCCACCAAGAATTCTTCATTTCAATCCAAGGTTGAAGGGAGATTGGAGTGGCAAGCCTGTTATCGAACAGAACACTTGTTATAGGATGCAATGGGGCACAGCGCTGAGATGTGAGGGATGGAAAGCCAGGGCAGATGAAGAAACAGTCGACGGGCAGGTAAAATGTGAGAAATGGATTCGTGACGACGACAGCCATTCCGAAGAATCGAAGGTAATATGGTGGTTAAATAGACTAATAGGACGCACGAAAAAGGTGGCGATCGATTGGCCATATCCTTTCGCGGAGGGCAGGCTATTTGTTCTAACTGTGAGTGCTGGGTTGGAAGGTTACCATATCAATGTTGATGGAAGGCATGTCACTTCTTTTCCATATCGCACTGGGTTTGTTCTGGAGGATGCCACTGGGTTGTCTGTAAATGGCGATATTGACGTGCACTCCATTTTTGCTGCTTCCTTACCTGCTACACATCCTAGCTTTGCACCACAGAAGCATATTGAGATGTTGACACAATGGAAAGCCCCTTCACTTCCCAAGAAAAATGTGGAGCTTTTCATTGGCGTTCTTTCTGCTGGTAATCATTTTGCGGAGCGAATGGCTGTTAGGAAGTCTTGGATGCAACATAGATTAATCAGATCTTCAATAGTTGTCGCTAGGTTCTTCGTGGCAATGCACGGAAGAAAGGAAGTAAATATCGAGTTGAAGAAAGAGGCCGAGTATTTTGGAGATATTGTAATAGTTCCTTTCATGGATAACTATGATCTCGTTGTACTGAAGACGATTGCAATATGTGAATATGGGGTTCGCACGGTGGCTGCAAACTATATCATGAAGTGTGACGATGATACATTTGTTAGAGTGGATGCAGTGATTGATGAAGCTCACAAAGTCCAATCTGATGGTGGGAGCCTTTATGTTGGAAACATGAACTTTCACCATAAACCTCTTCGTCATGGAAAATGGGCAGTGACTTACGAGGAATGGCCAGAAGAAGATTACCCAACGTACGCAAATGGGCCGGGTTACATTCTGTCATCGGACATTGCAGAGTATATTGTATCTGAGTTTGAGAAGCACAGATTAAGGTTGTTCAAGATGGAAGATGTGAGCATGGGAATGTGGGTGGAGCAGTTCAACCGTACGAAACCAGTTTTATACCATCACAGTCTGAGGTTTTGCCAGTTTGGGTGCATTGAAGATTACTTAACTTCTCATTACCAATCTCCTAGACAGATGGTGTGCTTGTGGGAGAAGTTGATGCATCAAAGAAGGCCACAGTGCTGCAACATGAGATGATTCATATTTATCGGAAGAAGAAGAAGAAGAAGAAGAAGATGCAAGTTTTGGTTCAAATCTGTAAATACTATAATAGAAGTGGGATGAATCTAATCTGGAAGCTTCCTTCCTTTTCTATAAATTCATTCATTTTTTGGGTTTCTTCCTTTTGGTTTTACTTCCAGTTTATGTTATATCCCCATTTATGCTATAAATGGATTTAACATCTCAATTATCAATCTTGTTGTTGGGCTCCACGCAATCAATTCTAGAAAAGAGAAGCCTATTTCCATCTCAATCTTTAAAGCTAAAACAAATTGATTTGGTATGTATTTTGGTAGTTGCGTCACTAAAAG

Coding sequence (CDS)

ATGAAAAGGGGGAAATTGGATACAATGGTATCACGAAACCGAATTAGGTTGCTTCAATTTCTTATGGGGTTGGTGTTTTTCTATCTGCTTTTCATGAGTTTTGAAATACCGCTGGTGTATCGAACCGGATATGGGTCGGTGCCTGATGATGGAACATTTGGATTCACCAGCGACACTTTGCCGAGGCCGTTTCTGCTTGAAAGTGAAGAGGAAATGGCTGATAAAGACGCCCCTCGTCGACCCTCTGATGATACCTTTCTGGTTTCTCATGGCTCGCCGCATCGGACACCCGAGAGGCGAATGCGTGAGTTCAAGAAAGTTTCGGGTTTAGTCTTCGACGAAAGCACATTTGATCGTAATGCTAGTAAGGGGGAGTTCTCGGAGCTTCATAAAGCGGTTAAACATGCTTGGGTAGTGGGGAAAAAGCTTTGGGGGGACTTAGAGTCCGGAAAAATTGTTCTCCAACCCAAAACGAAGACAGAGAATCAGTCGGAGACTTGTCCACATTCGATTACGCTTTCTGGATCCGAATTTGAGGCACAGAGTCGGATTCTGGAGCTCCCCTGCGGCTTGACGCTCTGGTCGCATATCACAGTGGTGGGGACGCCTCGTTGGGCTCACTTGGAAGATGATCCCAAGATTTCAATCTTGAGAGAAGGGGATGATTCAGTGATGGTTTCACAGTTTATGATGGAGTTGCAAGGGCTGAAGACGGTGGATGGTGAAGACCCACCAAGAATTCTTCATTTCAATCCAAGGTTGAAGGGAGATTGGAGTGGCAAGCCTGTTATCGAACAGAACACTTGTTATAGGATGCAATGGGGCACAGCGCTGAGATGTGAGGGATGGAAAGCCAGGGCAGATGAAGAAACAGTCGACGGGCAGGTAAAATGTGAGAAATGGATTCGTGACGACGACAGCCATTCCGAAGAATCGAAGGTAATATGGTGGTTAAATAGACTAATAGGACGCACGAAAAAGGTGGCGATCGATTGGCCATATCCTTTCGCGGAGGGCAGGCTATTTGTTCTAACTGTGAGTGCTGGGTTGGAAGGTTACCATATCAATGTTGATGGAAGGCATGTCACTTCTTTTCCATATCGCACTGGGTTTGTTCTGGAGGATGCCACTGGGTTGTCTGTAAATGGCGATATTGACGTGCACTCCATTTTTGCTGCTTCCTTACCTGCTACACATCCTAGCTTTGCACCACAGAAGCATATTGAGATGTTGACACAATGGAAAGCCCCTTCACTTCCCAAGAAAAATGTGGAGCTTTTCATTGGCGTTCTTTCTGCTGGTAATCATTTTGCGGAGCGAATGGCTGTTAGGAAGTCTTGGATGCAACATAGATTAATCAGATCTTCAATAGTTGTCGCTAGGTTCTTCGTGGCAATGCACGGAAGAAAGGAAGTAAATATCGAGTTGAAGAAAGAGGCCGAGTATTTTGGAGATATTGTAATAGTTCCTTTCATGGATAACTATGATCTCGTTGTACTGAAGACGATTGCAATATGTGAATATGGGGTTCGCACGGTGGCTGCAAACTATATCATGAAGTGTGACGATGATACATTTGTTAGAGTGGATGCAGTGATTGATGAAGCTCACAAAGTCCAATCTGATGGTGGGAGCCTTTATGTTGGAAACATGAACTTTCACCATAAACCTCTTCGTCATGGAAAATGGGCAGTGACTTACGAGGAATGGCCAGAAGAAGATTACCCAACGTACGCAAATGGGCCGGGTTACATTCTGTCATCGGACATTGCAGAGTATATTGTATCTGAGTTTGAGAAGCACAGATTAAGGTTGTTCAAGATGGAAGATGTGAGCATGGGAATGTGGGTGGAGCAGTTCAACCGTACGAAACCAGTTTTATACCATCACAGTCTGAGGTTTTGCCAGTTTGGGTGCATTGAAGATTACTTAACTTCTCATTACCAATCTCCTAGACAGATGGTGTGCTTGTGGGAGAAGTTGATGCATCAAAGAAGGCCACAGTGCTGCAACATGAGATGA

Protein sequence

MKRGKLDTMVSRNRIRLLQFLMGLVFFYLLFMSFEIPLVYRTGYGSVPDDGTFGFTSDTLPRPFLLESEEEMADKDAPRRPSDDTFLVSHGSPHRTPERRMREFKKVSGLVFDESTFDRNASKGEFSELHKAVKHAWVVGKKLWGDLESGKIVLQPKTKTENQSETCPHSITLSGSEFEAQSRILELPCGLTLWSHITVVGTPRWAHLEDDPKISILREGDDSVMVSQFMMELQGLKTVDGEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGTALRCEGWKARADEETVDGQVKCEKWIRDDDSHSEESKVIWWLNRLIGRTKKVAIDWPYPFAEGRLFVLTVSAGLEGYHINVDGRHVTSFPYRTGFVLEDATGLSVNGDIDVHSIFAASLPATHPSFAPQKHIEMLTQWKAPSLPKKNVELFIGVLSAGNHFAERMAVRKSWMQHRLIRSSIVVARFFVAMHGRKEVNIELKKEAEYFGDIVIVPFMDNYDLVVLKTIAICEYGVRTVAANYIMKCDDDTFVRVDAVIDEAHKVQSDGGSLYVGNMNFHHKPLRHGKWAVTYEEWPEEDYPTYANGPGYILSSDIAEYIVSEFEKHRLRLFKMEDVSMGMWVEQFNRTKPVLYHHSLRFCQFGCIEDYLTSHYQSPRQMVCLWEKLMHQRRPQCCNMR
BLAST of CmaCh04G003200 vs. Swiss-Prot
Match: B3GTH_ARATH (Hydroxyproline O-galactosyltransferase GALT4 OS=Arabidopsis thaliana GN=GALT4 PE=2 SV=2)

HSP 1 Score: 916.8 bits (2368), Expect = 1.4e-265
Identity = 452/690 (65.51%), Postives = 534/690 (77.39%), Query Frame = 1

Query: 1   MKRGKLDTMVSRNRIRLLQFLMGLVFFYLLFMSFEIPLVYRTGYGSVPDDGTFGFTSDTL 60
           MK+ KLD   S+ R  L+QFL+ ++ FY L MSFEIP ++RTG GS  DD +    +D L
Sbjct: 1   MKKSKLDNSSSQIRFGLVQFLLVVLLFYFLCMSFEIPFIFRTGSGSGSDDVSSSSFADAL 60

Query: 61  PRPFLLES----------EEEMADKDAPRRPSDDTFLVSHGSPHRTPERRMREFKKVSGL 120
           PRP ++            EEE AD   P R   D   V      R PER+MREFK VS +
Sbjct: 61  PRPMVVGGGSREANWVVGEEEEAD---PHRHFKDPGRVQL----RLPERKMREFKSVSEI 120

Query: 121 VFDESTFDRNASKGEFSELHKAVKHAWVVGKKLWGDLESGKIVLQPKTKTENQSETCPHS 180
             +ES FD      EFS  HK  KHA  +G+K+W  L+SG ++   K   + + E CP  
Sbjct: 121 FVNESFFDNGGFSDEFSIFHKTAKHAISMGRKMWDGLDSG-LIKPDKAPVKTRIEKCPDM 180

Query: 181 ITLSGSEFEAQSRILELPCGLTLWSHITVVGTPRWAHLEDDPKISILREGDDSVMVSQFM 240
           +++S SEF  +SRIL LPCGLTL SHITVV TP WAH+E D        GD + MVSQFM
Sbjct: 181 VSVSESEFVNRSRILVLPCGLTLGSHITVVATPHWAHVEKD--------GDKTAMVSQFM 240

Query: 241 MELQGLKTVDGEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGTALRCEGWKARADEE 300
           MELQGLK VDGEDPPRILHFNPR+KGDWSG+PVIEQNTCYRMQWG+ LRC+G ++  DEE
Sbjct: 241 MELQGLKAVDGEDPPRILHFNPRIKGDWSGRPVIEQNTCYRMQWGSGLRCDGRESSDDEE 300

Query: 301 TVDGQVKCEKWIRDDDSHS------EESKVIWWLNRLIGRTKK-VAIDWPYPFAEGRLFV 360
            VDG+VKCE+W RDDD         +ESK  WWLNRL+GR KK +  DW YPFAEG+LFV
Sbjct: 301 YVDGEVKCERWKRDDDDGGNNGDDFDESKKTWWLNRLMGRRKKMITHDWDYPFAEGKLFV 360

Query: 361 LTVSAGLEGYHINVDGRHVTSFPYRTGFVLEDATGLSVNGDIDVHSIFAASLPATHPSFA 420
           LT+ AG+EGYHI+V+GRH+TSFPYRTGFVLEDATGL+V G+IDVHS++AASLP+T+PSFA
Sbjct: 361 LTLRAGMEGYHISVNGRHITSFPYRTGFVLEDATGLAVKGNIDVHSVYAASLPSTNPSFA 420

Query: 421 PQKHIEMLTQWKAPSLPKKNVELFIGVLSAGNHFAERMAVRKSWMQHRLIRSSIVVARFF 480
           PQKH+EM   WKAPSLP+K VELFIG+LSAGNHFAERMAVRKSWMQ +L+RSS VVARFF
Sbjct: 421 PQKHLEMQRIWKAPSLPQKPVELFIGILSAGNHFAERMAVRKSWMQQKLVRSSKVVARFF 480

Query: 481 VAMHGRKEVNIELKKEAEYFGDIVIVPFMDNYDLVVLKTIAICEYGVRTVAANYIMKCDD 540
           VA+H RKEVN++LKKEAEYFGDIVIVP+MD+YDLVVLKT+AICEYGV TVAA Y+MKCDD
Sbjct: 481 VALHARKEVNVDLKKEAEYFGDIVIVPYMDHYDLVVLKTVAICEYGVNTVAAKYVMKCDD 540

Query: 541 DTFVRVDAVIDEAHKVQSDGGSLYVGNMNFHHKPLRHGKWAVTYEEWPEEDYPTYANGPG 600
           DTFVRVDAVI EA KV+    SLY+GN+NF+HKPLR GKWAVT+EEWPEE YP YANGPG
Sbjct: 541 DTFVRVDAVIQEAEKVKG-RESLYIGNINFNHKPLRTGKWAVTFEEWPEEYYPPYANGPG 600

Query: 601 YILSSDIAEYIVSEFEKHRLRLFKMEDVSMGMWVEQFNRTKPVLYHHSLRFCQFGCIEDY 660
           YILS D+A++IV +FE+ RLRLFKMEDVSMGMWVE+FN T+PV   HSL+FCQFGCIEDY
Sbjct: 601 YILSYDVAKFIVDDFEQKRLRLFKMEDVSMGMWVEKFNETRPVAVVHSLKFCQFGCIEDY 660

Query: 661 LTSHYQSPRQMVCLWEKLMHQRRPQCCNMR 674
            T+HYQSPRQM+C+W+KL    +PQCCNMR
Sbjct: 661 FTAHYQSPRQMICMWDKLQRLGKPQCCNMR 673

BLAST of CmaCh04G003200 vs. Swiss-Prot
Match: B3GTJ_ARATH (Hydroxyproline O-galactosyltransferase GALT6 OS=Arabidopsis thaliana GN=GALT6 PE=2 SV=2)

HSP 1 Score: 904.0 bits (2335), Expect = 9.5e-262
Identity = 432/680 (63.53%), Postives = 539/680 (79.26%), Query Frame = 1

Query: 5   KLDTMVSRNRIRLLQFLMGLVFFYLLFMSFEIPLVYRTGYGSVPDDGTFGFTSDTLPRPF 64
           K D  VS ++ R +Q LM +   Y+L ++FEIP V++TG  S+        + D L RP 
Sbjct: 14  KFDIFVSLSKQRSVQILMAVGLLYMLLITFEIPFVFKTGLSSL--------SQDPLTRPE 73

Query: 65  LLESEEEMADKDAPRRPSDDTFLVSHGSPHRTPERRMREFKKV-SGLVFDESTFDRNASK 124
              S+ E+ ++ AP RP     L+   S   +P + +R   ++ S L FD  TF+ ++  
Sbjct: 74  KHNSQRELQERRAPTRPLKS--LLYQESQSESPAQGLRRRTRILSSLRFDPETFNPSSKD 133

Query: 125 GEFSELHKAVKHAWVVGKKLWGDLESGKIVL-----QPKTKTENQSETCPHSITLSGSEF 184
           G   ELHK+ K AW VG+K+W +LESGK +      + K   E+ + +C  S++L+GS+ 
Sbjct: 134 GSV-ELHKSAKVAWEVGRKIWEELESGKTLKALEKEKKKKIEEHGTNSCSLSVSLTGSDL 193

Query: 185 EAQSRILELPCGLTLWSHITVVGTPRWAHLEDDPKISILREGDDSVMVSQFMMELQGLKT 244
             +  I+ELPCGLTL SHITVVG PR AH E DPKIS+L+EGD++V VSQF +ELQGLK 
Sbjct: 194 LKRGNIMELPCGLTLGSHITVVGKPRAAHSEKDPKISMLKEGDEAVKVSQFKLELQGLKA 253

Query: 245 VDGEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGTALRCEGWKARADEETVDGQVKC 304
           V+GE+PPRILH NPRLKGDWSGKPVIEQNTCYRMQWG+A RCEGW++R DEETVDGQVKC
Sbjct: 254 VEGEEPPRILHLNPRLKGDWSGKPVIEQNTCYRMQWGSAQRCEGWRSRDDEETVDGQVKC 313

Query: 305 EKWIRDDDSHSEESK----VIWWLNRLIGRTKKVAIDWPYPFAEGRLFVLTVSAGLEGYH 364
           EKW RDD   S+E +      WWL+RLIGR+KKV ++WP+PF   +LFVLT+SAGLEGYH
Sbjct: 314 EKWARDDSITSKEEESSKAASWWLSRLIGRSKKVTVEWPFPFTVDKLFVLTLSAGLEGYH 373

Query: 365 INVDGRHVTSFPYRTGFVLEDATGLSVNGDIDVHSIFAASLPATHPSFAPQKHIEMLTQW 424
           ++VDG+HVTSFPYRTGF LEDATGL++NGDIDVHS+FA SLP +HPSF+PQ+H+E+ + W
Sbjct: 374 VSVDGKHVTSFPYRTGFTLEDATGLTINGDIDVHSVFAGSLPTSHPSFSPQRHLELSSNW 433

Query: 425 KAPSLPKKNVELFIGVLSAGNHFAERMAVRKSWMQHRLIRSSIVVARFFVAMHGRKEVNI 484
           +APSLP + V++FIG+LSAGNHFAERMAVR+SWMQH+L++SS VVARFFVA+H RKEVN+
Sbjct: 434 QAPSLPDEQVDMFIGILSAGNHFAERMAVRRSWMQHKLVKSSKVVARFFVALHSRKEVNV 493

Query: 485 ELKKEAEYFGDIVIVPFMDNYDLVVLKTIAICEYGVRTVAANYIMKCDDDTFVRVDAVID 544
           ELKKEAE+FGDIVIVP+MD+YDLVVLKT+AICEYG   +AA +IMKCDDDTFV+VDAV+ 
Sbjct: 494 ELKKEAEFFGDIVIVPYMDSYDLVVLKTVAICEYGAHQLAAKFIMKCDDDTFVQVDAVLS 553

Query: 545 EAHKVQSDGGSLYVGNMNFHHKPLRHGKWAVTYEEWPEEDYPTYANGPGYILSSDIAEYI 604
           EA K  +D  SLY+GN+N++HKPLR GKW+VTYEEWPEEDYP YANGPGYILS+DI+ +I
Sbjct: 554 EAKKTPTD-RSLYIGNINYYHKPLRQGKWSVTYEEWPEEDYPPYANGPGYILSNDISRFI 613

Query: 605 VSEFEKHRLRLFKMEDVSMGMWVEQFNR-TKPVLYHHSLRFCQFGCIEDYLTSHYQSPRQ 664
           V EFEKH+LR+FKMEDVS+GMWVEQFN  TKPV Y HSLRFCQFGCIE+YLT+HYQSPRQ
Sbjct: 614 VKEFEKHKLRMFKMEDVSVGMWVEQFNNGTKPVDYIHSLRFCQFGCIENYLTAHYQSPRQ 673

Query: 665 MVCLWEKLMHQRRPQCCNMR 674
           M+CLW+KL+   +PQCCNMR
Sbjct: 674 MICLWDKLVLTGKPQCCNMR 681

BLAST of CmaCh04G003200 vs. Swiss-Prot
Match: B3GTI_ARATH (Hydroxyproline O-galactosyltransferase GALT5 OS=Arabidopsis thaliana GN=GALT5 PE=1 SV=1)

HSP 1 Score: 879.4 bits (2271), Expect = 2.5e-254
Identity = 432/681 (63.44%), Postives = 536/681 (78.71%), Query Frame = 1

Query: 2   KRGKLDTMVSRNRIRLLQFLMGLVFFYLLFMSFEIPLVYRT-GYGSVPDDGTFGFTSDTL 61
           K  K+D   S  + R ++ +M + F YL+ +S EIPLV+++    SVP         D L
Sbjct: 11  KIDKIDLFSSLWKQRSVRVIMAIGFLYLVIVSVEIPLVFKSWSSSSVP--------LDAL 70

Query: 62  PRPFLLESEEEMADKDAPRRPSDD-TFLVSHGS-PHRTP--ERRMREFKK--VSGLVFDE 121
            R   L +E+E   +  P  P +  ++ VS+ +   RT   + ++RE  +  +S L FD 
Sbjct: 71  SRLEKLNNEQEPQVEIIPNPPLEPVSYPVSNPTIVTRTDLVQNKVREHHRGVLSSLRFDS 130

Query: 122 STFDRNASKGEFSELHKAVKHAWVVGKKLWGDLESGKIVLQPKTKTENQSETCPHSITLS 181
            TFD ++  G   ELHK+ K AW +G+KLW +LESG++    +   +N+ ++CPHS++L+
Sbjct: 131 ETFDPSSKDGSV-ELHKSAKEAWQLGRKLWKELESGRLEKLVEKPEKNKPDSCPHSVSLT 190

Query: 182 GSEF-EAQSRILELPCGLTLWSHITVVGTPRWAHLEDDPKISILREGDDSVMVSQFMMEL 241
           GSEF   +++++ELPCGLTL SHIT+VG PR AH    PK     EGD S +VSQF++EL
Sbjct: 191 GSEFMNRENKLMELPCGLTLGSHITLVGRPRKAH----PK-----EGDWSKLVSQFVIEL 250

Query: 242 QGLKTVDGEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGTALRCEGWKARADEETVD 301
           QGLKTV+GEDPPRILHFNPRLKGDWS KPVIEQN+CYRMQWG A RCEGWK+R DEETVD
Sbjct: 251 QGLKTVEGEDPPRILHFNPRLKGDWSKKPVIEQNSCYRMQWGPAQRCEGWKSRDDEETVD 310

Query: 302 GQVKCEKWIRDDDSHSEESKVIWWLNRLIGRTKKVAIDWPYPFAEGRLFVLTVSAGLEGY 361
             VKCEKWIRDDD++SE S+  WWLNRLIGR K+V ++WP+PF E +LFVLT+SAGLEGY
Sbjct: 311 SHVKCEKWIRDDDNYSEGSRARWWLNRLIGRRKRVKVEWPFPFVEEKLFVLTLSAGLEGY 370

Query: 362 HINVDGRHVTSFPYRTGFVLEDATGLSVNGDIDVHSIFAASLPATHPSFAPQKHIEMLTQ 421
           HINVDG+HVTSFPYRTGF LEDATGL+VNGDIDVHS+F ASLP +HPSFAPQ+H+E+  +
Sbjct: 371 HINVDGKHVTSFPYRTGFTLEDATGLTVNGDIDVHSVFVASLPTSHPSFAPQRHLELSKR 430

Query: 422 WKAPSLPKKNVELFIGVLSAGNHFAERMAVRKSWMQHRLIRSSIVVARFFVAMHGRKEVN 481
           W+AP +P   VE+FIG+LSAGNHF+ERMAVRKSWMQH LI S+ VVARFFVA+HGRKEVN
Sbjct: 431 WQAPVVPDGPVEIFIGILSAGNHFSERMAVRKSWMQHVLITSAKVVARFFVALHGRKEVN 490

Query: 482 IELKKEAEYFGDIVIVPFMDNYDLVVLKTIAICEYGVRTVAANYIMKCDDDTFVRVDAVI 541
           +ELKKEAEYFGDIV+VP+MD+YDLVVLKT+AICE+G    +A YIMKCDDDTFV++ AVI
Sbjct: 491 VELKKEAEYFGDIVLVPYMDSYDLVVLKTVAICEHGALAFSAKYIMKCDDDTFVKLGAVI 550

Query: 542 DEAHKVQSDGGSLYVGNMNFHHKPLRHGKWAVTYEEWPEEDYPTYANGPGYILSSDIAEY 601
           +E  KV  +G SLY+GNMN++HKPLR GKWAVTYEEWPEEDYP YANGPGY+LSSDIA +
Sbjct: 551 NEVKKV-PEGRSLYIGNMNYYHKPLRGGKWAVTYEEWPEEDYPPYANGPGYVLSSDIARF 610

Query: 602 IVSEFEKHRLRLFKMEDVSMGMWVEQF-NRTKPVLYHHSLRFCQFGCIEDYLTSHYQSPR 661
           IV +FE+H+LRLFKMEDVS+GMWVE F N T PV Y HSLRFCQFGC+E+Y T+HYQSPR
Sbjct: 611 IVDKFERHKLRLFKMEDVSVGMWVEHFKNTTNPVDYRHSLRFCQFGCVENYYTAHYQSPR 670

Query: 662 QMVCLWEKLMHQRRPQCCNMR 674
           QM+CLW+KL+ Q +P+CCNMR
Sbjct: 671 QMICLWDKLLRQNKPECCNMR 672

BLAST of CmaCh04G003200 vs. Swiss-Prot
Match: B3GTK_ARATH (Hydroxyproline O-galactosyltransferase GALT2 OS=Arabidopsis thaliana GN=GALT2 PE=1 SV=1)

HSP 1 Score: 704.1 bits (1816), Expect = 1.4e-201
Identity = 358/694 (51.59%), Postives = 472/694 (68.01%), Query Frame = 1

Query: 1   MKRGKLDT---MVSRNRIRLLQFLMGLVFFYLLFMSFEIPLVYRTGYGSVPDDGTFGFTS 60
           MKR K ++   + S  R +L  FL+ +  FYL+F++F+ P           D G  G  S
Sbjct: 1   MKRVKSESFRGVYSSRRFKLSHFLLAIAGFYLVFLAFKFPHFIEMVAMLSGDTGLDGALS 60

Query: 61  DTLPRPFLLES------EEEMADKDAPRRPSDDTFLVSHGSPHRTPERRMREFKKVSGLV 120
           DT     L  S        ++ D+D    PS    +        +PE ++   K++  L+
Sbjct: 61  DTSLDVSLSGSLRNDMLNRKLEDEDHQSGPSTTQKV--------SPEEKINGSKQIQPLL 120

Query: 121 FDESTFDRNASKGEFSELH-----KAVKHAWVVGKKLWGDLESGKI--VLQPKTKTENQS 180
           F          +     +H     +    AW++G K W D++  ++  + +  +  E + 
Sbjct: 121 FRYGRISGEVMRRRNRTIHMSPFERMADEAWILGSKAWEDVDKFEVDKINESASIFEGKV 180

Query: 181 ETCPHSITLSGSEFEAQSRILELPCGLTLWSHITVVGTPRWAHLEDDPKISILREGDDSV 240
           E+CP  I+++G +    +RI+ LPCGL   S IT++GTP++AH E  P+ S L      V
Sbjct: 181 ESCPSQISMNGDDLNKANRIMLLPCGLAAGSSITILGTPQYAHKESVPQRSRLTRSYGMV 240

Query: 241 MVSQFMMELQGLKTVDGEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGTALRCEGWK 300
           +VSQFM+ELQGLKT DGE PP+ILH NPR+KGDW+ +PVIE NTCYRMQWG A RC+G  
Sbjct: 241 LVSQFMVELQGLKTGDGEYPPKILHLNPRIKGDWNHRPVIEHNTCYRMQWGVAQRCDGTP 300

Query: 301 ARADEET-VDGQVKCEKWIRDDDSH---SEESKVIWWLNRLIGRTKKVAIDWPYPFAEGR 360
           ++ D +  VDG  +CEKW ++D      S+ESK   W  R IGR +K  + W +PFAEG+
Sbjct: 301 SKKDADVLVDGFRRCEKWTQNDIIDMVDSKESKTTSWFKRFIGREQKPEVTWSFPFAEGK 360

Query: 361 LFVLTVSAGLEGYHINVDGRHVTSFPYRTGFVLEDATGLSVNGDIDVHSIFAASLPATHP 420
           +FVLT+ AG++G+HINV GRHV+SFPYR GF +EDATGL+V GD+D+HSI A SL  +HP
Sbjct: 361 VFVLTLRAGIDGFHINVGGRHVSSFPYRPGFTIEDATGLAVTGDVDIHSIHATSLSTSHP 420

Query: 421 SFAPQKHIEMLTQWKAPSLPKKNVELFIGVLSAGNHFAERMAVRKSWMQHRLIRSSIVVA 480
           SF+PQK IE  ++WKAP LP     LF+GVLSA NHF+ERMAVRK+WMQH  I+SS VVA
Sbjct: 421 SFSPQKAIEFSSEWKAPPLPGTPFRLFMGVLSATNHFSERMAVRKTWMQHPSIKSSDVVA 480

Query: 481 RFFVAMHGRKEVNIELKKEAEYFGDIVIVPFMDNYDLVVLKTIAICEYGVRTVAANYIMK 540
           RFFVA++ RKEVN  LKKEAEYFGDIVI+PFMD Y+LVVLKTIAICE+GV+ V A YIMK
Sbjct: 481 RFFVALNPRKEVNAMLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGVQNVTAPYIMK 540

Query: 541 CDDDTFVRVDAVIDEAHKVQSDGGSLYVGNMNFHHKPLRHGKWAVTYEEWPEEDYPTYAN 600
           CDDDTF+RV++++ +   V S   SLY+GN+N  H+PLR GKW VT+EEWPE  YP YAN
Sbjct: 541 CDDDTFIRVESILKQIDGV-SPEKSLYMGNLNLRHRPLRTGKWTVTWEEWPEAVYPPYAN 600

Query: 601 GPGYILSSDIAEYIVSEFEKHRLRLFKMEDVSMGMWVEQFNRT-KPVLYHHSLRFCQFGC 660
           GPGYI+SS+IA+YIVS+  +H+LRLFKMEDVSMG+WVEQFN + +PV Y HS +FCQ+GC
Sbjct: 601 GPGYIISSNIAKYIVSQNSRHKLRLFKMEDVSMGLWVEQFNASMQPVEYSHSWKFCQYGC 660

Query: 661 IEDYLTSHYQSPRQMVCLWEKLMHQRRPQCCNMR 674
             +Y T+HYQSP QM+CLW+ L+ + RPQCCN R
Sbjct: 661 TLNYYTAHYQSPSQMMCLWDNLL-KGRPQCCNFR 684

BLAST of CmaCh04G003200 vs. Swiss-Prot
Match: B3GTF_ARATH (Beta-1,3-galactosyltransferase GALT1 OS=Arabidopsis thaliana GN=GALT1 PE=1 SV=1)

HSP 1 Score: 342.4 bits (877), Expect = 1.1e-92
Identity = 203/550 (36.91%), Postives = 310/550 (56.36%), Query Frame = 1

Query: 132 AVKHAWVVGKKLWGDLESGKIVLQPKTKT-ENQSETCPHSIT-LSGSEFEAQSRILELPC 191
           A+K A +V + L   +E+ K+V   + +T + + E CP  ++ ++ +E +  S  L++PC
Sbjct: 118 AIKEAGIVWESLVSAVEAKKLVDVNENQTRKGKEELCPQFLSKMNATEADGSSLKLQIPC 177

Query: 192 GLTLWSHITVVGTPRWAHLEDDPKISILREGDDSVMVSQFMMELQGLKTVDGEDPPRILH 251
           GLT  S ITV+G P               +G    +V  F ++L G       DPP I+H
Sbjct: 178 GLTQGSSITVIGIP---------------DG----LVGSFRIDLTGQPLPGEPDPPIIVH 237

Query: 252 FNPRLKGDWSGK-PVIEQNTCYRMQ-WGTALRCEGWKARADEETVDGQVKCEKWIRDDDS 311
           +N RL GD S + PVI QN+    Q WG   RC  +    +++ VD   +C K +  + +
Sbjct: 238 YNVRLLGDKSTEDPVIVQNSWTASQDWGAEERCPKFDPDMNKK-VDDLDECNKMVGGEIN 297

Query: 312 HSEESKVIWWLNRLIGRTKKVAIDWPY-PFAEGRLFVLTVSAGLEGYHINVDGRHVTSFP 371
            +  + +    +R +   ++ +    Y PF +G L V T+  G EG  + VDG+H+TSF 
Sbjct: 298 RTSSTSLQSNTSRGVPVAREASKHEKYFPFKQGFLSVATLRVGTEGMQMTVDGKHITSFA 357

Query: 372 YRTGFVLEDATGLSVNGDIDVHSIFAASLPATHPSFAPQKHIEMLTQWKAPSL-PKKNVE 431
           +R        + + + GD  + SI A+ LP +  S    +H+  L   K+P+L P + ++
Sbjct: 358 FRDTLEPWLVSEIRITGDFRLISILASGLPTSEES----EHVVDLEALKSPTLSPLRPLD 417

Query: 432 LFIGVLSAGNHFAERMAVRKSWMQHRLIRSSIVVARFFVAMHGRKEVNIELKKEAEYFGD 491
           L IGV S  N+F  RMAVR++WMQ+  +RS  V  RFFV +H    VN+EL  EA  +GD
Sbjct: 418 LVIGVFSTANNFKRRMAVRRTWMQYDDVRSGRVAVRFFVGLHKSPLVNLELWNEARTYGD 477

Query: 492 IVIVPFMDNYDLVVLKTIAICEYGVRTVAANYIMKCDDDTFVRVDAVIDEAHKVQSDGGS 551
           + ++PF+D Y L+  KT+AIC +G    +A +IMK DDD FVRVD V+       +  G 
Sbjct: 478 VQLMPFVDYYSLISWKTLAICIFGTEVDSAKFIMKTDDDAFVRVDEVLLSLSMTNNTRGL 537

Query: 552 LYVGNMNFHHKPLRH--GKWAVTYEEWPEEDYPTYANGPGYILSSDIAEYIVSEFEKHRL 611
           +Y G +N   +P+R+   KW ++YEEWPEE YP +A+GPGYI+S DIAE +   F++  L
Sbjct: 538 IY-GLINSDSQPIRNPDSKWYISYEEWPEEKYPPWAHGPGYIVSRDIAESVGKLFKEGNL 597

Query: 612 RLFKMEDVSMGMWVEQFNRTKPVL---YHHSLRFCQFGCIEDYLTSHYQSPRQMVCLWEK 671
           ++FK+EDV+MG+W+ +   TK  L   Y +  R    GC + Y+ +HYQSP +M CLW K
Sbjct: 598 KMFKLEDVAMGIWIAEL--TKHGLEPHYENDGRIISDGCKDGYVVAHYQSPAEMTCLWRK 640

BLAST of CmaCh04G003200 vs. TrEMBL
Match: A0A0A0KQG2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G604080 PE=4 SV=1)

HSP 1 Score: 1274.6 bits (3297), Expect = 0.0e+00
Identity = 604/673 (89.75%), Postives = 633/673 (94.06%), Query Frame = 1

Query: 1   MKRGKLDTMVSRNRIRLLQFLMGLVFFYLLFMSFEIPLVYRTGYGSVPDDGTFGFTSDTL 60
           MKRGK D MVS NRIRLLQ LMGLVF YLLFMSFEIPLVYRTGYGSV  DGTFGFTSD L
Sbjct: 1   MKRGKFDVMVSINRIRLLQILMGLVFLYLLFMSFEIPLVYRTGYGSVSGDGTFGFTSDAL 60

Query: 61  PRPFLLESEEEMADKDAPRRPSDDTFLVSHGSPHRTPERRMREFKKVSGLVFDESTFDRN 120
           PRPFLLESEEEM DK APRRPSDD F +SHGSPHRTPERRMREF+KVSGLVFDESTFDRN
Sbjct: 61  PRPFLLESEEEMTDKGAPRRPSDDPFRISHGSPHRTPERRMREFRKVSGLVFDESTFDRN 120

Query: 121 ASKGEFSELHKAVKHAWVVGKKLWGDLESGKIVLQPKTKTENQSETCPHSITLSGSEFEA 180
           A+KGEFSEL KA KHAWVVGKKLW +LESGKI L+PK K ENQSE+CPHSITLSGSEF+A
Sbjct: 121 ATKGEFSELQKAAKHAWVVGKKLWEELESGKIELKPKAKMENQSESCPHSITLSGSEFQA 180

Query: 181 QSRILELPCGLTLWSHITVVGTPRWAHLEDDPKISILREGDDSVMVSQFMMELQGLKTVD 240
           Q RI+ELPCGLTLWSHITVVGTP WAH E+DPKISIL+EGDDSV+VSQFMMELQGLKTVD
Sbjct: 181 QGRIMELPCGLTLWSHITVVGTPHWAHSEEDPKISILKEGDDSVLVSQFMMELQGLKTVD 240

Query: 241 GEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGTALRCEGWKARADEETVDGQVKCEK 300
           GEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGTALRCEGWK+RADEETVDGQVKCEK
Sbjct: 241 GEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGTALRCEGWKSRADEETVDGQVKCEK 300

Query: 301 WIRDDDSHSEESKVIWWLNRLIGRTKKVAIDWPYPFAEGRLFVLTVSAGLEGYHINVDGR 360
           WIRDDDS SEESKVIWWLNRLIGRTKKV IDWPYPF EGRLFVLTVSAGLEGYHINVDGR
Sbjct: 301 WIRDDDSRSEESKVIWWLNRLIGRTKKVMIDWPYPFVEGRLFVLTVSAGLEGYHINVDGR 360

Query: 361 HVTSFPYRTGFVLEDATGLSVNGDIDVHSIFAASLPATHPSFAPQKHIEMLTQWKAPSLP 420
           HVTSFPYRTGFVLEDATGLSVNGDIDVHS+FAASLP  HPSFAPQKH+EMLTQWKAP +P
Sbjct: 361 HVTSFPYRTGFVLEDATGLSVNGDIDVHSLFAASLPTAHPSFAPQKHMEMLTQWKAPPIP 420

Query: 421 KKNVELFIGVLSAGNHFAERMAVRKSWMQHRLIRSSIVVARFFVAMHGRKEVNIELKKEA 480
           K NVELFIG+LSAGNHFAERMAVRKSWMQHRLIRSS+ VARFFVAMHGRKEVN ELKKEA
Sbjct: 421 KSNVELFIGILSAGNHFAERMAVRKSWMQHRLIRSSLAVARFFVAMHGRKEVNTELKKEA 480

Query: 481 EYFGDIVIVPFMDNYDLVVLKTIAICEYGVRTVAANYIMKCDDDTFVRVDAVIDEAHKVQ 540
           EYFGDIVIVP+MDNYDLVVLKTIAICEYG RTVAA YIMKCDDDTFVRVDAV+ EAHKVQ
Sbjct: 481 EYFGDIVIVPYMDNYDLVVLKTIAICEYGARTVAAKYIMKCDDDTFVRVDAVLSEAHKVQ 540

Query: 541 SDGGSLYVGNMNFHHKPLRHGKWAVTYEEWPEEDYPTYANGPGYILSSDIAEYIVSEFEK 600
           + G SLYVGNMN+HHKPLRHGKWAVTYEEWPEEDYP YANGPGYILSSDIAEYIVSEFEK
Sbjct: 541 A-GRSLYVGNMNYHHKPLRHGKWAVTYEEWPEEDYPAYANGPGYILSSDIAEYIVSEFEK 600

Query: 601 HRLRLFKMEDVSMGMWVEQFNRTKPVLYHHSLRFCQFGCIEDYLTSHYQSPRQMVCLWEK 660
           H+LRLFKMEDVSMGMWVEQFN +KPV + HSLRFCQFGCIEDYLT+HYQSPRQM+CLW+K
Sbjct: 601 HKLRLFKMEDVSMGMWVEQFNSSKPVKFLHSLRFCQFGCIEDYLTAHYQSPRQMMCLWDK 660

Query: 661 LMHQRRPQCCNMR 674
           LM Q++PQCCNMR
Sbjct: 661 LMQQKKPQCCNMR 672

BLAST of CmaCh04G003200 vs. TrEMBL
Match: W9R193_9ROSA (Putative beta-1,3-galactosyltransferase 19 OS=Morus notabilis GN=L484_021051 PE=4 SV=1)

HSP 1 Score: 1124.4 bits (2907), Expect = 0.0e+00
Identity = 528/673 (78.45%), Postives = 590/673 (87.67%), Query Frame = 1

Query: 1   MKRGKLDTMVSRNRIRLLQFLMGLVFFYLLFMSFEIPLVYRTGYGSVPDDGTFGFTSDTL 60
           MKRGKLD+++S +R+RLLQ LM LVFF +LFMSFEIPLV RTG G+  D+  + F SD L
Sbjct: 47  MKRGKLDSLMSPSRLRLLQILMALVFFCMLFMSFEIPLVLRTGLGASGDE-MYSFISDAL 106

Query: 61  PRPFLLESEEEMADKDAPRRPSDDTFLVSHGSPHRTPERRMREFKKVSGLVFDESTFDRN 120
           PRP  LESEE+ ADKDAP RP+D+   V  GSPHRTP    REFKKVSGL F+ + FD +
Sbjct: 107 PRPLALESEEDFADKDAPSRPADNPLRVFGGSPHRTP---TREFKKVSGLAFNGTVFDAH 166

Query: 121 ASKGEFSELHKAVKHAWVVGKKLWGDLESGKIVLQPKTKTENQSETCPHSITLSGSEFEA 180
             +G  SELH A KHAW VG+KLW +LESGKI   P  K EN+SE CPHSI LSGS+F A
Sbjct: 167 VGEGNSSELHMAAKHAWAVGRKLWNELESGKIQNNPIVKPENRSEQCPHSIALSGSDFRA 226

Query: 181 QSRILELPCGLTLWSHITVVGTPRWAHLEDDPKISILREGDDSVMVSQFMMELQGLKTVD 240
           ++R+L LPCGLTLWSHITVVGTPRWAH E DPKI++L+EGD+SVMVSQFMMELQGLKTVD
Sbjct: 227 RNRVLVLPCGLTLWSHITVVGTPRWAHQEYDPKIAVLKEGDESVMVSQFMMELQGLKTVD 286

Query: 241 GEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGTALRCEGWKARADEETVDGQVKCEK 300
           GEDPPRILHFNPRLKGDWSGKPVIE+NTCYRMQWG+ALRCEGWK+RADEET+DGQVKCEK
Sbjct: 287 GEDPPRILHFNPRLKGDWSGKPVIEENTCYRMQWGSALRCEGWKSRADEETIDGQVKCEK 346

Query: 301 WIRDDDSHSEESKVIWWLNRLIGRTKKVAIDWPYPFAEGRLFVLTVSAGLEGYHINVDGR 360
           WIRDDD+HSEESK +WWLNRLIGRTKKV IDWPYPFAEGRLFVLTVSAGLEGYH+NVDGR
Sbjct: 347 WIRDDDNHSEESKALWWLNRLIGRTKKVTIDWPYPFAEGRLFVLTVSAGLEGYHVNVDGR 406

Query: 361 HVTSFPYRTGFVLEDATGLSVNGDIDVHSIFAASLPATHPSFAPQKHIEMLTQWKAPSLP 420
           HVTSFPYRTGFVLEDATGL VNGD+DVHS+FAASLP +HPSFAPQ H+EM  +WKAP L 
Sbjct: 407 HVTSFPYRTGFVLEDATGLFVNGDVDVHSVFAASLPTSHPSFAPQLHLEMSARWKAPPLS 466

Query: 421 KKNVELFIGVLSAGNHFAERMAVRKSWMQHRLIRSSIVVARFFVAMHGRKEVNIELKKEA 480
               ELFIG+LSAGNHFAERMAVRKSWMQH+LI+SS  VARFFVA+HGRKEVN+ELKKEA
Sbjct: 467 NDRAELFIGILSAGNHFAERMAVRKSWMQHKLIKSSHAVARFFVALHGRKEVNVELKKEA 526

Query: 481 EYFGDIVIVPFMDNYDLVVLKTIAICEYGVRTVAANYIMKCDDDTFVRVDAVIDEAHKVQ 540
           +YFGDIVIVP+MDNYDLVVLKTIAICEYG RTVAA +IMKCDDDTFVRVD V+ EAHKV 
Sbjct: 527 DYFGDIVIVPYMDNYDLVVLKTIAICEYGHRTVAAKHIMKCDDDTFVRVDTVLKEAHKVG 586

Query: 541 SDGGSLYVGNMNFHHKPLRHGKWAVTYEEWPEEDYPTYANGPGYILSSDIAEYIVSEFEK 600
            D  SLY+GN+N+HHKPLR+GKWAVTYEEWPEEDYP YANGPGYI+SSDIAE+I+SEFEK
Sbjct: 587 ED-KSLYIGNINYHHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIISSDIAEFIISEFEK 646

Query: 601 HRLRLFKMEDVSMGMWVEQFNRTKPVLYHHSLRFCQFGCIEDYLTSHYQSPRQMVCLWEK 660
           H+LRLFKMEDVSMGMWVEQFN +KPV Y HS+RFCQFGCI+DY T+HYQSPRQM+C+W K
Sbjct: 647 HKLRLFKMEDVSMGMWVEQFNSSKPVQYVHSVRFCQFGCIDDYYTAHYQSPRQMMCMWGK 706

Query: 661 LMHQRRPQCCNMR 674
           L    RPQCCNMR
Sbjct: 707 LQQHGRPQCCNMR 714

BLAST of CmaCh04G003200 vs. TrEMBL
Match: M5Y3K0_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002487mg PE=4 SV=1)

HSP 1 Score: 1103.6 bits (2853), Expect = 0.0e+00
Identity = 512/673 (76.08%), Postives = 589/673 (87.52%), Query Frame = 1

Query: 1   MKRGKLDTMVSRNRIRLLQFLMGLVFFYLLFMSFEIPLVYRTGYGSVPDDGTFGFTSDTL 60
           MKRGK+D+M+  +R+ ++Q L+G VF YLLF++FEIP V + G+GS   D +     D L
Sbjct: 1   MKRGKVDSMLPPSRLGMVQILIGAVFVYLLFITFEIPHVLKHGFGSSGSDDSL----DAL 60

Query: 61  PRPFLLESEEEMADKDAPRRPSDDTFLVSHGSPHRTPERRMREFKKVSGLVFDESTFDRN 120
           P  F+LESEEEM + DAP RP+++ F  S GSP RTP+RR RE KKVSGLVF ++ FD N
Sbjct: 61  PITFMLESEEEMGESDAPSRPTENPFRDSEGSPSRTPQRRTREAKKVSGLVFKDTLFDAN 120

Query: 121 ASKGEFSELHKAVKHAWVVGKKLWGDLESGKIVLQPKTKTENQSETCPHSITLSGSEFEA 180
            S+ + SELHKA ++AW  GKKLW +LESGK+    K K+EN+SE CPHS+ LSGSEFEA
Sbjct: 121 VSRDQVSELHKAARNAWTAGKKLWAELESGKLEFGLKNKSENRSEPCPHSLILSGSEFEA 180

Query: 181 QSRILELPCGLTLWSHITVVGTPRWAHLEDDPKISILREGDDSVMVSQFMMELQGLKTVD 240
           + R++ LPCG+TLWSHITVVGTP+WAH E DPKIS+L+EGD++VMVSQFMMELQGLK V+
Sbjct: 181 RKRVMVLPCGMTLWSHITVVGTPKWAHSEYDPKISMLKEGDEAVMVSQFMMELQGLKIVE 240

Query: 241 GEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGTALRCEGWKARADEETVDGQVKCEK 300
           GEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWG+ALRCEGWK+RADE+TVDGQVKCEK
Sbjct: 241 GEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGSALRCEGWKSRADEDTVDGQVKCEK 300

Query: 301 WIRDDDSHSEESKVIWWLNRLIGRTKKVAIDWPYPFAEGRLFVLTVSAGLEGYHINVDGR 360
           WIRDDD HSEESK  WWLNRLIGRTKKV IDWPYPFAEG+LFVLTVSAGLEGYHINVDGR
Sbjct: 301 WIRDDDDHSEESKATWWLNRLIGRTKKVTIDWPYPFAEGKLFVLTVSAGLEGYHINVDGR 360

Query: 361 HVTSFPYRTGFVLEDATGLSVNGDIDVHSIFAASLPATHPSFAPQKHIEMLTQWKAPSLP 420
           H+TSFPYRTGF LEDATGLSVNGDIDVHS+ AASLP +HPSFAP  H+EM+T+WKAPSLP
Sbjct: 361 HLTSFPYRTGFALEDATGLSVNGDIDVHSVLAASLPTSHPSFAPSMHLEMVTRWKAPSLP 420

Query: 421 KKNVELFIGVLSAGNHFAERMAVRKSWMQHRLIRSSIVVARFFVAMHGRKEVNIELKKEA 480
             +VELFIG+LSAGNHFAERMAVRKSWMQH+LI+SS VVARFFVA+HGR EVN+EL KE 
Sbjct: 421 YGHVELFIGILSAGNHFAERMAVRKSWMQHKLIKSSRVVARFFVALHGRNEVNMELMKEV 480

Query: 481 EYFGDIVIVPFMDNYDLVVLKTIAICEYGVRTVAANYIMKCDDDTFVRVDAVIDEAHKVQ 540
            YFGDIVIVP+MDNYDLVVLKT+AICEYG+RTV A YIMKCDDDTFVR+DAV+ EA KV 
Sbjct: 481 GYFGDIVIVPYMDNYDLVVLKTVAICEYGIRTVPAKYIMKCDDDTFVRLDAVLKEARKVH 540

Query: 541 SDGGSLYVGNMNFHHKPLRHGKWAVTYEEWPEEDYPTYANGPGYILSSDIAEYIVSEFEK 600
               SLY+GNMN+HHKPLRHGKWAVTYEEWPEEDYP+YANGPGY+LSSDIA++IVS+FEK
Sbjct: 541 GH-RSLYIGNMNYHHKPLRHGKWAVTYEEWPEEDYPSYANGPGYVLSSDIAKFIVSDFEK 600

Query: 601 HRLRLFKMEDVSMGMWVEQFNRTKPVLYHHSLRFCQFGCIEDYLTSHYQSPRQMVCLWEK 660
           H+LRLFKMEDVSMGMWVEQFN +KPV Y HSL+FCQFGCI+DY T+HYQSPRQM+C+W+K
Sbjct: 601 HKLRLFKMEDVSMGMWVEQFNNSKPVEYVHSLKFCQFGCIDDYYTAHYQSPRQMICMWDK 660

Query: 661 LMHQRRPQCCNMR 674
           L HQ +PQCCNMR
Sbjct: 661 LQHQGKPQCCNMR 668

BLAST of CmaCh04G003200 vs. TrEMBL
Match: A0A061E5K7_THECC (Galactosyltransferase family protein isoform 1 OS=Theobroma cacao GN=TCM_010069 PE=4 SV=1)

HSP 1 Score: 1099.3 bits (2842), Expect = 0.0e+00
Identity = 523/677 (77.25%), Postives = 600/677 (88.63%), Query Frame = 1

Query: 1   MKRGKLDTMVSRNRIRLLQFLMGLVFFYLLFMSFEIPLVYRTGYGSVPDDGTFGFTSDTL 60
           MKR KLD++VS +R+RL+QFLMG++F YLLFMSFEIP V++TGYGS    G+ GF +DTL
Sbjct: 1   MKRAKLDSLVSPSRLRLVQFLMGVLFLYLLFMSFEIPHVFKTGYGS----GSGGFFTDTL 60

Query: 61  PRPFLLESEEEMADKDAPRRPSDDTFLVSHGSPHRTPERRMREFKKVSGLVFDESTFDRN 120
           PRP  LESEE+  DK AP RP++D   V      RTPER+MREFKKVSGL+F+ES+FD N
Sbjct: 61  PRPLFLESEEDFTDKSAPARPANDPDPVRQPG-SRTPERKMREFKKVSGLLFNESSFDSN 120

Query: 121 ASKGEFSELHKAVKHAWVVGKKLWGDLESG--KIVLQP--KTKTENQSETCPHSITLSGS 180
            SK EFS LHK  +HA+VVGKKLW DL+SG  K   +P  + +  N++E+CPHSI+LSGS
Sbjct: 121 DSKDEFSVLHKTARHAFVVGKKLWDDLQSGQNKSDSEPGQQNQGRNRTESCPHSISLSGS 180

Query: 181 EFEAQSRILELPCGLTLWSHITVVGTPRWAHLEDDPKISILREGDDSVMVSQFMMELQGL 240
           EF ++ RIL LPCGLTL SHITVVG P W+H E DPKI++L+EGD+SVMVSQFMMELQGL
Sbjct: 181 EFMSRGRILVLPCGLTLGSHITVVGLPHWSHAEYDPKIAVLKEGDESVMVSQFMMELQGL 240

Query: 241 KTVDGEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGTALRCEGWKARADEETVDGQV 300
           KTVDGEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWG+ALRCEGWK+RADEETVDGQV
Sbjct: 241 KTVDGEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGSALRCEGWKSRADEETVDGQV 300

Query: 301 KCEKWIRDDDSHSEESKVIWWLNRLIGRTKKVAIDWPYPFAEGRLFVLTVSAGLEGYHIN 360
           KCEKWIRDDD+  EESK  WWLNRLIGR KKV ++WPYPFAEG+LFVLT+SAGLEGYH+N
Sbjct: 301 KCEKWIRDDDNGLEESKATWWLNRLIGRKKKVVLEWPYPFAEGKLFVLTLSAGLEGYHLN 360

Query: 361 VDGRHVTSFPYRTGFVLEDATGLSVNGDIDVHSIFAASLPATHPSFAPQKHIEMLTQWKA 420
           VDGRHVTSFPYRTGFVLEDATGLS+NGD+DVHS+FAASLP +HPSFAPQKH+E L++WKA
Sbjct: 361 VDGRHVTSFPYRTGFVLEDATGLSLNGDLDVHSVFAASLPTSHPSFAPQKHLERLSKWKA 420

Query: 421 PSLPKKNVELFIGVLSAGNHFAERMAVRKSWMQHRLIRSSIVVARFFVAMHGRKEVNIEL 480
           P LP  NVELFIG+LSAGNHFAERMAVRKSWMQH+LIRSS VVARFFVA++GRKEVN+EL
Sbjct: 421 PPLPDGNVELFIGILSAGNHFAERMAVRKSWMQHKLIRSSKVVARFFVALNGRKEVNVEL 480

Query: 481 KKEAEYFGDIVIVPFMDNYDLVVLKTIAICEYGVRTVAANYIMKCDDDTFVRVDAVIDEA 540
           KKEAEYFGDIVIVP+MDNYDLVVLKT+AICEYGVRTVAA YIMKCDDDTFV VDAVI EA
Sbjct: 481 KKEAEYFGDIVIVPYMDNYDLVVLKTVAICEYGVRTVAAKYIMKCDDDTFVGVDAVIKEA 540

Query: 541 HKVQSDGGSLYVGNMNFHHKPLRHGKWAVTYEEWPEEDYPTYANGPGYILSSDIAEYIVS 600
            KV     SLY+GNMN++HKPLR+GKWAVTYEEWPEEDYP YANGPGYI+SSDIA++IV+
Sbjct: 541 KKVGDK--SLYIGNMNYYHKPLRNGKWAVTYEEWPEEDYPPYANGPGYIVSSDIAQFIVA 600

Query: 601 EFEKHRLRLFKMEDVSMGMWVEQFNRTKPVLYHHSLRFCQFGCIEDYLTSHYQSPRQMVC 660
           EFEKH+LRLFKMEDVSMGMWVE+FN +KPV Y HSL+FCQFGCI+DY T+HYQSPRQM+C
Sbjct: 601 EFEKHKLRLFKMEDVSMGMWVEKFNSSKPVEYQHSLKFCQFGCIDDYYTAHYQSPRQMLC 660

Query: 661 LWEKLMHQRRPQCCNMR 674
           +W+KL++Q +PQCCNMR
Sbjct: 661 MWDKLLNQGKPQCCNMR 670

BLAST of CmaCh04G003200 vs. TrEMBL
Match: F6HPH6_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0026g01240 PE=4 SV=1)

HSP 1 Score: 1081.6 bits (2796), Expect = 0.0e+00
Identity = 515/677 (76.07%), Postives = 579/677 (85.52%), Query Frame = 1

Query: 1   MKRGKLDTMVSRNRIRLLQFLMGLVFFYLLFMSFEIPLVYRTGYGSVPDDGTFGFTSDTL 60
           MKRGK DT+V  +R++  + L GL+F YL+FMSFEIPLV RTG+GS+P DG  GF  D  
Sbjct: 1   MKRGKFDTLVPTSRLKSFKILAGLLFLYLIFMSFEIPLVLRTGFGSLPGDGFNGFLGDAF 60

Query: 61  PRPFLLESEEEMADKDAPRRPSDDTFLVSHG----SPHRTPERRMREFKKVSGLVFDEST 120
            + F+LESE++MA+KDAP RPS   F VS G    S  R P RRMRE+KKVSGL F    
Sbjct: 61  SQQFMLESEQDMAEKDAPSRPS---FRVSKGLSQSSRFRAPARRMREYKKVSGLAFHGGL 120

Query: 121 FDRNASKGEFSELHKAVKHAWVVGKKLWGDLESGKIVLQPKTKTENQSETCPHSITLSGS 180
            +   SK  +SELHK+ KHAW VGK LW  L+SG+I ++ K K +NQSE+CPHSI LSGS
Sbjct: 121 LN---SKDGYSELHKSAKHAWEVGKTLWEKLDSGEIQVESKRKAQNQSESCPHSIALSGS 180

Query: 181 EFEAQSRILELPCGLTLWSHITVVGTPRWAHLEDDPKISILREGDDSVMVSQFMMELQGL 240
           EF+ +++I+ LPCGLTL SHITVVG P WAH E DPKI++L++ D SVMVSQFMMELQGL
Sbjct: 181 EFQDRNKIMVLPCGLTLGSHITVVGKPHWAHAEYDPKIALLKDEDQSVMVSQFMMELQGL 240

Query: 241 KTVDGEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGTALRCEGWKARADEETVDGQV 300
           KTVDGEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWG+ALRCEGWK+RADEETVDGQV
Sbjct: 241 KTVDGEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGSALRCEGWKSRADEETVDGQV 300

Query: 301 KCEKWIRDDDSHSEESKVIWWLNRLIGRTKKVAIDWPYPFAEGRLFVLTVSAGLEGYHIN 360
           KCEKWIRDDDSHSEESK  WWLNRLIGRTKKVAIDWPYPFAE +LFVLTVSAGLEGYH+N
Sbjct: 301 KCEKWIRDDDSHSEESKATWWLNRLIGRTKKVAIDWPYPFAEEKLFVLTVSAGLEGYHVN 360

Query: 361 VDGRHVTSFPYRTGFVLEDATGLSVNGDIDVHSIFAASLPATHPSFAPQKHIEMLTQWKA 420
           VDGRHVTSFPYRTGFVLEDATGL VNGDIDVHS+FAASLPA+HPSFAPQ H+E L +W+A
Sbjct: 361 VDGRHVTSFPYRTGFVLEDATGLFVNGDIDVHSVFAASLPASHPSFAPQLHLEKLPKWQA 420

Query: 421 PSLPKKNVELFIGVLSAGNHFAERMAVRKSWMQHRLIRSSIVVARFFVAMHGRKEVNIEL 480
             LP   VELFIG+LSAGNHFAERMAVRKSWMQH L++SS VVARFF+A+HGRKE+N+EL
Sbjct: 421 SPLPDGPVELFIGILSAGNHFAERMAVRKSWMQHNLVKSSKVVARFFIALHGRKEINVEL 480

Query: 481 KKEAEYFGDIVIVPFMDNYDLVVLKTIAICEYGVRTVAANYIMKCDDDTFVRVDAVIDEA 540
           KKEAEYFGD VIVP+MDNYDLVVLKT+AICEYG RT AA YIMKCDDDTFVRVDAVI EA
Sbjct: 481 KKEAEYFGDTVIVPYMDNYDLVVLKTVAICEYGARTAAAKYIMKCDDDTFVRVDAVIKEA 540

Query: 541 HKVQSDGGSLYVGNMNFHHKPLRHGKWAVTYEEWPEEDYPTYANGPGYILSSDIAEYIVS 600
            KV  D  SLYVGNMN++HKPLR+GKWAVTYEEWPEEDYP YANGPGYI+S DIAE+IVS
Sbjct: 541 RKVHED-NSLYVGNMNYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSYDIAEFIVS 600

Query: 601 EFEKHRLRLFKMEDVSMGMWVEQFNRTKPVLYHHSLRFCQFGCIEDYLTSHYQSPRQMVC 660
           EFEKH+LRLFKMEDVSMGMWVEQFN + PV Y HS++FCQFGCIEDY T+HYQSPRQM+C
Sbjct: 601 EFEKHKLRLFKMEDVSMGMWVEQFNSSMPVQYLHSVKFCQFGCIEDYYTAHYQSPRQMIC 660

Query: 661 LWEKLMHQRRPQCCNMR 674
           +WEKL  Q +  CCNMR
Sbjct: 661 MWEKLQQQGKAHCCNMR 670

BLAST of CmaCh04G003200 vs. TAIR10
Match: AT1G27120.1 (AT1G27120.1 Galactosyltransferase family protein)

HSP 1 Score: 916.8 bits (2368), Expect = 8.0e-267
Identity = 452/690 (65.51%), Postives = 534/690 (77.39%), Query Frame = 1

Query: 1   MKRGKLDTMVSRNRIRLLQFLMGLVFFYLLFMSFEIPLVYRTGYGSVPDDGTFGFTSDTL 60
           MK+ KLD   S+ R  L+QFL+ ++ FY L MSFEIP ++RTG GS  DD +    +D L
Sbjct: 1   MKKSKLDNSSSQIRFGLVQFLLVVLLFYFLCMSFEIPFIFRTGSGSGSDDVSSSSFADAL 60

Query: 61  PRPFLLES----------EEEMADKDAPRRPSDDTFLVSHGSPHRTPERRMREFKKVSGL 120
           PRP ++            EEE AD   P R   D   V      R PER+MREFK VS +
Sbjct: 61  PRPMVVGGGSREANWVVGEEEEAD---PHRHFKDPGRVQL----RLPERKMREFKSVSEI 120

Query: 121 VFDESTFDRNASKGEFSELHKAVKHAWVVGKKLWGDLESGKIVLQPKTKTENQSETCPHS 180
             +ES FD      EFS  HK  KHA  +G+K+W  L+SG ++   K   + + E CP  
Sbjct: 121 FVNESFFDNGGFSDEFSIFHKTAKHAISMGRKMWDGLDSG-LIKPDKAPVKTRIEKCPDM 180

Query: 181 ITLSGSEFEAQSRILELPCGLTLWSHITVVGTPRWAHLEDDPKISILREGDDSVMVSQFM 240
           +++S SEF  +SRIL LPCGLTL SHITVV TP WAH+E D        GD + MVSQFM
Sbjct: 181 VSVSESEFVNRSRILVLPCGLTLGSHITVVATPHWAHVEKD--------GDKTAMVSQFM 240

Query: 241 MELQGLKTVDGEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGTALRCEGWKARADEE 300
           MELQGLK VDGEDPPRILHFNPR+KGDWSG+PVIEQNTCYRMQWG+ LRC+G ++  DEE
Sbjct: 241 MELQGLKAVDGEDPPRILHFNPRIKGDWSGRPVIEQNTCYRMQWGSGLRCDGRESSDDEE 300

Query: 301 TVDGQVKCEKWIRDDDSHS------EESKVIWWLNRLIGRTKK-VAIDWPYPFAEGRLFV 360
            VDG+VKCE+W RDDD         +ESK  WWLNRL+GR KK +  DW YPFAEG+LFV
Sbjct: 301 YVDGEVKCERWKRDDDDGGNNGDDFDESKKTWWLNRLMGRRKKMITHDWDYPFAEGKLFV 360

Query: 361 LTVSAGLEGYHINVDGRHVTSFPYRTGFVLEDATGLSVNGDIDVHSIFAASLPATHPSFA 420
           LT+ AG+EGYHI+V+GRH+TSFPYRTGFVLEDATGL+V G+IDVHS++AASLP+T+PSFA
Sbjct: 361 LTLRAGMEGYHISVNGRHITSFPYRTGFVLEDATGLAVKGNIDVHSVYAASLPSTNPSFA 420

Query: 421 PQKHIEMLTQWKAPSLPKKNVELFIGVLSAGNHFAERMAVRKSWMQHRLIRSSIVVARFF 480
           PQKH+EM   WKAPSLP+K VELFIG+LSAGNHFAERMAVRKSWMQ +L+RSS VVARFF
Sbjct: 421 PQKHLEMQRIWKAPSLPQKPVELFIGILSAGNHFAERMAVRKSWMQQKLVRSSKVVARFF 480

Query: 481 VAMHGRKEVNIELKKEAEYFGDIVIVPFMDNYDLVVLKTIAICEYGVRTVAANYIMKCDD 540
           VA+H RKEVN++LKKEAEYFGDIVIVP+MD+YDLVVLKT+AICEYGV TVAA Y+MKCDD
Sbjct: 481 VALHARKEVNVDLKKEAEYFGDIVIVPYMDHYDLVVLKTVAICEYGVNTVAAKYVMKCDD 540

Query: 541 DTFVRVDAVIDEAHKVQSDGGSLYVGNMNFHHKPLRHGKWAVTYEEWPEEDYPTYANGPG 600
           DTFVRVDAVI EA KV+    SLY+GN+NF+HKPLR GKWAVT+EEWPEE YP YANGPG
Sbjct: 541 DTFVRVDAVIQEAEKVKG-RESLYIGNINFNHKPLRTGKWAVTFEEWPEEYYPPYANGPG 600

Query: 601 YILSSDIAEYIVSEFEKHRLRLFKMEDVSMGMWVEQFNRTKPVLYHHSLRFCQFGCIEDY 660
           YILS D+A++IV +FE+ RLRLFKMEDVSMGMWVE+FN T+PV   HSL+FCQFGCIEDY
Sbjct: 601 YILSYDVAKFIVDDFEQKRLRLFKMEDVSMGMWVEKFNETRPVAVVHSLKFCQFGCIEDY 660

Query: 661 LTSHYQSPRQMVCLWEKLMHQRRPQCCNMR 674
            T+HYQSPRQM+C+W+KL    +PQCCNMR
Sbjct: 661 FTAHYQSPRQMICMWDKLQRLGKPQCCNMR 673

BLAST of CmaCh04G003200 vs. TAIR10
Match: AT5G62620.1 (AT5G62620.1 Galactosyltransferase family protein)

HSP 1 Score: 904.0 bits (2335), Expect = 5.3e-263
Identity = 432/680 (63.53%), Postives = 539/680 (79.26%), Query Frame = 1

Query: 5   KLDTMVSRNRIRLLQFLMGLVFFYLLFMSFEIPLVYRTGYGSVPDDGTFGFTSDTLPRPF 64
           K D  VS ++ R +Q LM +   Y+L ++FEIP V++TG  S+        + D L RP 
Sbjct: 14  KFDIFVSLSKQRSVQILMAVGLLYMLLITFEIPFVFKTGLSSL--------SQDPLTRPE 73

Query: 65  LLESEEEMADKDAPRRPSDDTFLVSHGSPHRTPERRMREFKKV-SGLVFDESTFDRNASK 124
              S+ E+ ++ AP RP     L+   S   +P + +R   ++ S L FD  TF+ ++  
Sbjct: 74  KHNSQRELQERRAPTRPLKS--LLYQESQSESPAQGLRRRTRILSSLRFDPETFNPSSKD 133

Query: 125 GEFSELHKAVKHAWVVGKKLWGDLESGKIVL-----QPKTKTENQSETCPHSITLSGSEF 184
           G   ELHK+ K AW VG+K+W +LESGK +      + K   E+ + +C  S++L+GS+ 
Sbjct: 134 GSV-ELHKSAKVAWEVGRKIWEELESGKTLKALEKEKKKKIEEHGTNSCSLSVSLTGSDL 193

Query: 185 EAQSRILELPCGLTLWSHITVVGTPRWAHLEDDPKISILREGDDSVMVSQFMMELQGLKT 244
             +  I+ELPCGLTL SHITVVG PR AH E DPKIS+L+EGD++V VSQF +ELQGLK 
Sbjct: 194 LKRGNIMELPCGLTLGSHITVVGKPRAAHSEKDPKISMLKEGDEAVKVSQFKLELQGLKA 253

Query: 245 VDGEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGTALRCEGWKARADEETVDGQVKC 304
           V+GE+PPRILH NPRLKGDWSGKPVIEQNTCYRMQWG+A RCEGW++R DEETVDGQVKC
Sbjct: 254 VEGEEPPRILHLNPRLKGDWSGKPVIEQNTCYRMQWGSAQRCEGWRSRDDEETVDGQVKC 313

Query: 305 EKWIRDDDSHSEESK----VIWWLNRLIGRTKKVAIDWPYPFAEGRLFVLTVSAGLEGYH 364
           EKW RDD   S+E +      WWL+RLIGR+KKV ++WP+PF   +LFVLT+SAGLEGYH
Sbjct: 314 EKWARDDSITSKEEESSKAASWWLSRLIGRSKKVTVEWPFPFTVDKLFVLTLSAGLEGYH 373

Query: 365 INVDGRHVTSFPYRTGFVLEDATGLSVNGDIDVHSIFAASLPATHPSFAPQKHIEMLTQW 424
           ++VDG+HVTSFPYRTGF LEDATGL++NGDIDVHS+FA SLP +HPSF+PQ+H+E+ + W
Sbjct: 374 VSVDGKHVTSFPYRTGFTLEDATGLTINGDIDVHSVFAGSLPTSHPSFSPQRHLELSSNW 433

Query: 425 KAPSLPKKNVELFIGVLSAGNHFAERMAVRKSWMQHRLIRSSIVVARFFVAMHGRKEVNI 484
           +APSLP + V++FIG+LSAGNHFAERMAVR+SWMQH+L++SS VVARFFVA+H RKEVN+
Sbjct: 434 QAPSLPDEQVDMFIGILSAGNHFAERMAVRRSWMQHKLVKSSKVVARFFVALHSRKEVNV 493

Query: 485 ELKKEAEYFGDIVIVPFMDNYDLVVLKTIAICEYGVRTVAANYIMKCDDDTFVRVDAVID 544
           ELKKEAE+FGDIVIVP+MD+YDLVVLKT+AICEYG   +AA +IMKCDDDTFV+VDAV+ 
Sbjct: 494 ELKKEAEFFGDIVIVPYMDSYDLVVLKTVAICEYGAHQLAAKFIMKCDDDTFVQVDAVLS 553

Query: 545 EAHKVQSDGGSLYVGNMNFHHKPLRHGKWAVTYEEWPEEDYPTYANGPGYILSSDIAEYI 604
           EA K  +D  SLY+GN+N++HKPLR GKW+VTYEEWPEEDYP YANGPGYILS+DI+ +I
Sbjct: 554 EAKKTPTD-RSLYIGNINYYHKPLRQGKWSVTYEEWPEEDYPPYANGPGYILSNDISRFI 613

Query: 605 VSEFEKHRLRLFKMEDVSMGMWVEQFNR-TKPVLYHHSLRFCQFGCIEDYLTSHYQSPRQ 664
           V EFEKH+LR+FKMEDVS+GMWVEQFN  TKPV Y HSLRFCQFGCIE+YLT+HYQSPRQ
Sbjct: 614 VKEFEKHKLRMFKMEDVSVGMWVEQFNNGTKPVDYIHSLRFCQFGCIENYLTAHYQSPRQ 673

Query: 665 MVCLWEKLMHQRRPQCCNMR 674
           M+CLW+KL+   +PQCCNMR
Sbjct: 674 MICLWDKLVLTGKPQCCNMR 681

BLAST of CmaCh04G003200 vs. TAIR10
Match: AT1G74800.1 (AT1G74800.1 Galactosyltransferase family protein)

HSP 1 Score: 879.4 bits (2271), Expect = 1.4e-255
Identity = 432/681 (63.44%), Postives = 536/681 (78.71%), Query Frame = 1

Query: 2   KRGKLDTMVSRNRIRLLQFLMGLVFFYLLFMSFEIPLVYRT-GYGSVPDDGTFGFTSDTL 61
           K  K+D   S  + R ++ +M + F YL+ +S EIPLV+++    SVP         D L
Sbjct: 11  KIDKIDLFSSLWKQRSVRVIMAIGFLYLVIVSVEIPLVFKSWSSSSVP--------LDAL 70

Query: 62  PRPFLLESEEEMADKDAPRRPSDD-TFLVSHGS-PHRTP--ERRMREFKK--VSGLVFDE 121
            R   L +E+E   +  P  P +  ++ VS+ +   RT   + ++RE  +  +S L FD 
Sbjct: 71  SRLEKLNNEQEPQVEIIPNPPLEPVSYPVSNPTIVTRTDLVQNKVREHHRGVLSSLRFDS 130

Query: 122 STFDRNASKGEFSELHKAVKHAWVVGKKLWGDLESGKIVLQPKTKTENQSETCPHSITLS 181
            TFD ++  G   ELHK+ K AW +G+KLW +LESG++    +   +N+ ++CPHS++L+
Sbjct: 131 ETFDPSSKDGSV-ELHKSAKEAWQLGRKLWKELESGRLEKLVEKPEKNKPDSCPHSVSLT 190

Query: 182 GSEF-EAQSRILELPCGLTLWSHITVVGTPRWAHLEDDPKISILREGDDSVMVSQFMMEL 241
           GSEF   +++++ELPCGLTL SHIT+VG PR AH    PK     EGD S +VSQF++EL
Sbjct: 191 GSEFMNRENKLMELPCGLTLGSHITLVGRPRKAH----PK-----EGDWSKLVSQFVIEL 250

Query: 242 QGLKTVDGEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGTALRCEGWKARADEETVD 301
           QGLKTV+GEDPPRILHFNPRLKGDWS KPVIEQN+CYRMQWG A RCEGWK+R DEETVD
Sbjct: 251 QGLKTVEGEDPPRILHFNPRLKGDWSKKPVIEQNSCYRMQWGPAQRCEGWKSRDDEETVD 310

Query: 302 GQVKCEKWIRDDDSHSEESKVIWWLNRLIGRTKKVAIDWPYPFAEGRLFVLTVSAGLEGY 361
             VKCEKWIRDDD++SE S+  WWLNRLIGR K+V ++WP+PF E +LFVLT+SAGLEGY
Sbjct: 311 SHVKCEKWIRDDDNYSEGSRARWWLNRLIGRRKRVKVEWPFPFVEEKLFVLTLSAGLEGY 370

Query: 362 HINVDGRHVTSFPYRTGFVLEDATGLSVNGDIDVHSIFAASLPATHPSFAPQKHIEMLTQ 421
           HINVDG+HVTSFPYRTGF LEDATGL+VNGDIDVHS+F ASLP +HPSFAPQ+H+E+  +
Sbjct: 371 HINVDGKHVTSFPYRTGFTLEDATGLTVNGDIDVHSVFVASLPTSHPSFAPQRHLELSKR 430

Query: 422 WKAPSLPKKNVELFIGVLSAGNHFAERMAVRKSWMQHRLIRSSIVVARFFVAMHGRKEVN 481
           W+AP +P   VE+FIG+LSAGNHF+ERMAVRKSWMQH LI S+ VVARFFVA+HGRKEVN
Sbjct: 431 WQAPVVPDGPVEIFIGILSAGNHFSERMAVRKSWMQHVLITSAKVVARFFVALHGRKEVN 490

Query: 482 IELKKEAEYFGDIVIVPFMDNYDLVVLKTIAICEYGVRTVAANYIMKCDDDTFVRVDAVI 541
           +ELKKEAEYFGDIV+VP+MD+YDLVVLKT+AICE+G    +A YIMKCDDDTFV++ AVI
Sbjct: 491 VELKKEAEYFGDIVLVPYMDSYDLVVLKTVAICEHGALAFSAKYIMKCDDDTFVKLGAVI 550

Query: 542 DEAHKVQSDGGSLYVGNMNFHHKPLRHGKWAVTYEEWPEEDYPTYANGPGYILSSDIAEY 601
           +E  KV  +G SLY+GNMN++HKPLR GKWAVTYEEWPEEDYP YANGPGY+LSSDIA +
Sbjct: 551 NEVKKV-PEGRSLYIGNMNYYHKPLRGGKWAVTYEEWPEEDYPPYANGPGYVLSSDIARF 610

Query: 602 IVSEFEKHRLRLFKMEDVSMGMWVEQF-NRTKPVLYHHSLRFCQFGCIEDYLTSHYQSPR 661
           IV +FE+H+LRLFKMEDVS+GMWVE F N T PV Y HSLRFCQFGC+E+Y T+HYQSPR
Sbjct: 611 IVDKFERHKLRLFKMEDVSVGMWVEHFKNTTNPVDYRHSLRFCQFGCVENYYTAHYQSPR 670

Query: 662 QMVCLWEKLMHQRRPQCCNMR 674
           QM+CLW+KL+ Q +P+CCNMR
Sbjct: 671 QMICLWDKLLRQNKPECCNMR 672

BLAST of CmaCh04G003200 vs. TAIR10
Match: AT4G21060.1 (AT4G21060.1 Galactosyltransferase family protein)

HSP 1 Score: 695.3 bits (1793), Expect = 3.8e-200
Identity = 353/676 (52.22%), Postives = 462/676 (68.34%), Query Frame = 1

Query: 16  RLLQFLMGLVFFYLLFMSFEIPLVYRTGYGSVPDDGTFGFTSDTLPRPFLLES------E 75
           R+L F  G   FYL+F++F+ P           D G  G  SDT     L  S       
Sbjct: 77  RILLFT-GFSGFYLVFLAFKFPHFIEMVAMLSGDTGLDGALSDTSLDVSLSGSLRNDMLN 136

Query: 76  EEMADKDAPRRPSDDTFLVSHGSPHRTPERRMREFKKVSGLVFDESTFDRNASKGEFSEL 135
            ++ D+D    PS    +        +PE ++   K++  L+F          +     +
Sbjct: 137 RKLEDEDHQSGPSTTQKV--------SPEEKINGSKQIQPLLFRYGRISGEVMRRRNRTI 196

Query: 136 H-----KAVKHAWVVGKKLWGDLESGKI--VLQPKTKTENQSETCPHSITLSGSEFEAQS 195
           H     +    AW++G K W D++  ++  + +  +  E + E+CP  I+++G +    +
Sbjct: 197 HMSPFERMADEAWILGSKAWEDVDKFEVDKINESASIFEGKVESCPSQISMNGDDLNKAN 256

Query: 196 RILELPCGLTLWSHITVVGTPRWAHLEDDPKISILREGDDSVMVSQFMMELQGLKTVDGE 255
           RI+ LPCGL   S IT++GTP++AH E  P+ S L      V+VSQFM+ELQGLKT DGE
Sbjct: 257 RIMLLPCGLAAGSSITILGTPQYAHKESVPQRSRLTRSYGMVLVSQFMVELQGLKTGDGE 316

Query: 256 DPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGTALRCEGWKARADEET-VDGQVKCEKW 315
            PP+ILH NPR+KGDW+ +PVIE NTCYRMQWG A RC+G  ++ D +  VDG  +CEKW
Sbjct: 317 YPPKILHLNPRIKGDWNHRPVIEHNTCYRMQWGVAQRCDGTPSKKDADVLVDGFRRCEKW 376

Query: 316 IRDDDSH---SEESKVIWWLNRLIGRTKKVAIDWPYPFAEGRLFVLTVSAGLEGYHINVD 375
            ++D      S+ESK   W  R IGR +K  + W +PFAEG++FVLT+ AG++G+HINV 
Sbjct: 377 TQNDIIDMVDSKESKTTSWFKRFIGREQKPEVTWSFPFAEGKVFVLTLRAGIDGFHINVG 436

Query: 376 GRHVTSFPYRTGFVLEDATGLSVNGDIDVHSIFAASLPATHPSFAPQKHIEMLTQWKAPS 435
           GRHV+SFPYR GF +EDATGL+V GD+D+HSI A SL  +HPSF+PQK IE  ++WKAP 
Sbjct: 437 GRHVSSFPYRPGFTIEDATGLAVTGDVDIHSIHATSLSTSHPSFSPQKAIEFSSEWKAPP 496

Query: 436 LPKKNVELFIGVLSAGNHFAERMAVRKSWMQHRLIRSSIVVARFFVAMHGRKEVNIELKK 495
           LP     LF+GVLSA NHF+ERMAVRK+WMQH  I+SS VVARFFVA++ RKEVN  LKK
Sbjct: 497 LPGTPFRLFMGVLSATNHFSERMAVRKTWMQHPSIKSSDVVARFFVALNPRKEVNAMLKK 556

Query: 496 EAEYFGDIVIVPFMDNYDLVVLKTIAICEYGVRTVAANYIMKCDDDTFVRVDAVIDEAHK 555
           EAEYFGDIVI+PFMD Y+LVVLKTIAICE+GV+ V A YIMKCDDDTF+RV++++ +   
Sbjct: 557 EAEYFGDIVILPFMDRYELVVLKTIAICEFGVQNVTAPYIMKCDDDTFIRVESILKQIDG 616

Query: 556 VQSDGGSLYVGNMNFHHKPLRHGKWAVTYEEWPEEDYPTYANGPGYILSSDIAEYIVSEF 615
           V S   SLY+GN+N  H+PLR GKW VT+EEWPE  YP YANGPGYI+SS+IA+YIVS+ 
Sbjct: 617 V-SPEKSLYMGNLNLRHRPLRTGKWTVTWEEWPEAVYPPYANGPGYIISSNIAKYIVSQN 676

Query: 616 EKHRLRLFKMEDVSMGMWVEQFNRT-KPVLYHHSLRFCQFGCIEDYLTSHYQSPRQMVCL 674
            +H+LRLFKMEDVSMG+WVEQFN + +PV Y HS +FCQ+GC  +Y T+HYQSP QM+CL
Sbjct: 677 SRHKLRLFKMEDVSMGLWVEQFNASMQPVEYSHSWKFCQYGCTLNYYTAHYQSPSQMMCL 736

BLAST of CmaCh04G003200 vs. TAIR10
Match: AT1G26810.1 (AT1G26810.1 galactosyltransferase1)

HSP 1 Score: 342.4 bits (877), Expect = 6.2e-94
Identity = 203/550 (36.91%), Postives = 310/550 (56.36%), Query Frame = 1

Query: 132 AVKHAWVVGKKLWGDLESGKIVLQPKTKT-ENQSETCPHSIT-LSGSEFEAQSRILELPC 191
           A+K A +V + L   +E+ K+V   + +T + + E CP  ++ ++ +E +  S  L++PC
Sbjct: 118 AIKEAGIVWESLVSAVEAKKLVDVNENQTRKGKEELCPQFLSKMNATEADGSSLKLQIPC 177

Query: 192 GLTLWSHITVVGTPRWAHLEDDPKISILREGDDSVMVSQFMMELQGLKTVDGEDPPRILH 251
           GLT  S ITV+G P               +G    +V  F ++L G       DPP I+H
Sbjct: 178 GLTQGSSITVIGIP---------------DG----LVGSFRIDLTGQPLPGEPDPPIIVH 237

Query: 252 FNPRLKGDWSGK-PVIEQNTCYRMQ-WGTALRCEGWKARADEETVDGQVKCEKWIRDDDS 311
           +N RL GD S + PVI QN+    Q WG   RC  +    +++ VD   +C K +  + +
Sbjct: 238 YNVRLLGDKSTEDPVIVQNSWTASQDWGAEERCPKFDPDMNKK-VDDLDECNKMVGGEIN 297

Query: 312 HSEESKVIWWLNRLIGRTKKVAIDWPY-PFAEGRLFVLTVSAGLEGYHINVDGRHVTSFP 371
            +  + +    +R +   ++ +    Y PF +G L V T+  G EG  + VDG+H+TSF 
Sbjct: 298 RTSSTSLQSNTSRGVPVAREASKHEKYFPFKQGFLSVATLRVGTEGMQMTVDGKHITSFA 357

Query: 372 YRTGFVLEDATGLSVNGDIDVHSIFAASLPATHPSFAPQKHIEMLTQWKAPSL-PKKNVE 431
           +R        + + + GD  + SI A+ LP +  S    +H+  L   K+P+L P + ++
Sbjct: 358 FRDTLEPWLVSEIRITGDFRLISILASGLPTSEES----EHVVDLEALKSPTLSPLRPLD 417

Query: 432 LFIGVLSAGNHFAERMAVRKSWMQHRLIRSSIVVARFFVAMHGRKEVNIELKKEAEYFGD 491
           L IGV S  N+F  RMAVR++WMQ+  +RS  V  RFFV +H    VN+EL  EA  +GD
Sbjct: 418 LVIGVFSTANNFKRRMAVRRTWMQYDDVRSGRVAVRFFVGLHKSPLVNLELWNEARTYGD 477

Query: 492 IVIVPFMDNYDLVVLKTIAICEYGVRTVAANYIMKCDDDTFVRVDAVIDEAHKVQSDGGS 551
           + ++PF+D Y L+  KT+AIC +G    +A +IMK DDD FVRVD V+       +  G 
Sbjct: 478 VQLMPFVDYYSLISWKTLAICIFGTEVDSAKFIMKTDDDAFVRVDEVLLSLSMTNNTRGL 537

Query: 552 LYVGNMNFHHKPLRH--GKWAVTYEEWPEEDYPTYANGPGYILSSDIAEYIVSEFEKHRL 611
           +Y G +N   +P+R+   KW ++YEEWPEE YP +A+GPGYI+S DIAE +   F++  L
Sbjct: 538 IY-GLINSDSQPIRNPDSKWYISYEEWPEEKYPPWAHGPGYIVSRDIAESVGKLFKEGNL 597

Query: 612 RLFKMEDVSMGMWVEQFNRTKPVL---YHHSLRFCQFGCIEDYLTSHYQSPRQMVCLWEK 671
           ++FK+EDV+MG+W+ +   TK  L   Y +  R    GC + Y+ +HYQSP +M CLW K
Sbjct: 598 KMFKLEDVAMGIWIAEL--TKHGLEPHYENDGRIISDGCKDGYVVAHYQSPAEMTCLWRK 640

BLAST of CmaCh04G003200 vs. NCBI nr
Match: gi|659090947|ref|XP_008446287.1| (PREDICTED: probable beta-1,3-galactosyltransferase 19 [Cucumis melo])

HSP 1 Score: 1280.8 bits (3313), Expect = 0.0e+00
Identity = 610/673 (90.64%), Postives = 634/673 (94.21%), Query Frame = 1

Query: 1   MKRGKLDTMVSRNRIRLLQFLMGLVFFYLLFMSFEIPLVYRTGYGSVPDDGTFGFTSDTL 60
           MKRGK D MVSRNRIRLLQ LMGLVF YLLFMSFEIPLVYRTG+GSV  DGT GFTSD L
Sbjct: 1   MKRGKFDVMVSRNRIRLLQILMGLVFLYLLFMSFEIPLVYRTGFGSVSGDGTLGFTSDAL 60

Query: 61  PRPFLLESEEEMADKDAPRRPSDDTFLVSHGSPHRTPERRMREFKKVSGLVFDESTFDRN 120
           PRPFLLESEEEM DKDAPRRPSDD F +SHGSPHRTPERRMREF+KVSGLVFDESTFDRN
Sbjct: 61  PRPFLLESEEEMGDKDAPRRPSDDPFRISHGSPHRTPERRMREFRKVSGLVFDESTFDRN 120

Query: 121 ASKGEFSELHKAVKHAWVVGKKLWGDLESGKIVLQPKTKTENQSETCPHSITLSGSEFEA 180
           ASKGEFSEL KA KHAWVVGKKLW +LESGKI L+PK KTENQSE+CPHSITLSGSEFEA
Sbjct: 121 ASKGEFSELQKAAKHAWVVGKKLWEELESGKIELKPKAKTENQSESCPHSITLSGSEFEA 180

Query: 181 QSRILELPCGLTLWSHITVVGTPRWAHLEDDPKISILREGDDSVMVSQFMMELQGLKTVD 240
           Q RI+ELPCGLTLWSHITVVGTPRWAH E DPKISIL+EGDDSVMVSQFMMELQGLKTVD
Sbjct: 181 QGRIMELPCGLTLWSHITVVGTPRWAHSEQDPKISILKEGDDSVMVSQFMMELQGLKTVD 240

Query: 241 GEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGTALRCEGWKARADEETVDGQVKCEK 300
           GEDPPRILHFNPRLKGDWS KPVIEQNTCYRMQWGTALRCEGWK+RADEETVD QVKCEK
Sbjct: 241 GEDPPRILHFNPRLKGDWSAKPVIEQNTCYRMQWGTALRCEGWKSRADEETVDEQVKCEK 300

Query: 301 WIRDDDSHSEESKVIWWLNRLIGRTKKVAIDWPYPFAEGRLFVLTVSAGLEGYHINVDGR 360
           WIRDDDS SEESKVIWWLNRLIGRTKKV IDWPYPF EGRLFVLTVSAGLEGYHINVDGR
Sbjct: 301 WIRDDDSRSEESKVIWWLNRLIGRTKKVMIDWPYPFVEGRLFVLTVSAGLEGYHINVDGR 360

Query: 361 HVTSFPYRTGFVLEDATGLSVNGDIDVHSIFAASLPATHPSFAPQKHIEMLTQWKAPSLP 420
           H+TSFPYRTGFVLEDATGLSVNGDIDVHS+FAASLP  HPSFAPQKH+EMLTQWKAP +P
Sbjct: 361 HITSFPYRTGFVLEDATGLSVNGDIDVHSLFAASLPTAHPSFAPQKHMEMLTQWKAPPIP 420

Query: 421 KKNVELFIGVLSAGNHFAERMAVRKSWMQHRLIRSSIVVARFFVAMHGRKEVNIELKKEA 480
           K NVELFIG+LSAGNHFAERMAVRKSWMQHRLIRSS+ VARFFVAMHGRKEVN ELKKEA
Sbjct: 421 KTNVELFIGILSAGNHFAERMAVRKSWMQHRLIRSSLAVARFFVAMHGRKEVNSELKKEA 480

Query: 481 EYFGDIVIVPFMDNYDLVVLKTIAICEYGVRTVAANYIMKCDDDTFVRVDAVIDEAHKVQ 540
           EYFGDIVIVP+MDNYDLVVLKTIAICEYGVRTVAA YIMKCDDDTFVRVDAVI EAHKVQ
Sbjct: 481 EYFGDIVIVPYMDNYDLVVLKTIAICEYGVRTVAAKYIMKCDDDTFVRVDAVIGEAHKVQ 540

Query: 541 SDGGSLYVGNMNFHHKPLRHGKWAVTYEEWPEEDYPTYANGPGYILSSDIAEYIVSEFEK 600
           S G SLYVGNMN+HHKPLRHGKWAVTYEEWPEEDYP YANGPGYILSSDIAEYIVSEFEK
Sbjct: 541 S-GRSLYVGNMNYHHKPLRHGKWAVTYEEWPEEDYPAYANGPGYILSSDIAEYIVSEFEK 600

Query: 601 HRLRLFKMEDVSMGMWVEQFNRTKPVLYHHSLRFCQFGCIEDYLTSHYQSPRQMVCLWEK 660
           H+LRLFKMEDVSMGMWVEQFN +KPV + HSLRFCQFGCIEDYLT+HYQSPRQM+CLW+K
Sbjct: 601 HKLRLFKMEDVSMGMWVEQFNSSKPVEFLHSLRFCQFGCIEDYLTAHYQSPRQMMCLWDK 660

Query: 661 LMHQRRPQCCNMR 674
           LM QR+PQCCNMR
Sbjct: 661 LMQQRKPQCCNMR 672

BLAST of CmaCh04G003200 vs. NCBI nr
Match: gi|449434851|ref|XP_004135209.1| (PREDICTED: probable beta-1,3-galactosyltransferase 19 [Cucumis sativus])

HSP 1 Score: 1274.6 bits (3297), Expect = 0.0e+00
Identity = 604/673 (89.75%), Postives = 633/673 (94.06%), Query Frame = 1

Query: 1   MKRGKLDTMVSRNRIRLLQFLMGLVFFYLLFMSFEIPLVYRTGYGSVPDDGTFGFTSDTL 60
           MKRGK D MVS NRIRLLQ LMGLVF YLLFMSFEIPLVYRTGYGSV  DGTFGFTSD L
Sbjct: 1   MKRGKFDVMVSINRIRLLQILMGLVFLYLLFMSFEIPLVYRTGYGSVSGDGTFGFTSDAL 60

Query: 61  PRPFLLESEEEMADKDAPRRPSDDTFLVSHGSPHRTPERRMREFKKVSGLVFDESTFDRN 120
           PRPFLLESEEEM DK APRRPSDD F +SHGSPHRTPERRMREF+KVSGLVFDESTFDRN
Sbjct: 61  PRPFLLESEEEMTDKGAPRRPSDDPFRISHGSPHRTPERRMREFRKVSGLVFDESTFDRN 120

Query: 121 ASKGEFSELHKAVKHAWVVGKKLWGDLESGKIVLQPKTKTENQSETCPHSITLSGSEFEA 180
           A+KGEFSEL KA KHAWVVGKKLW +LESGKI L+PK K ENQSE+CPHSITLSGSEF+A
Sbjct: 121 ATKGEFSELQKAAKHAWVVGKKLWEELESGKIELKPKAKMENQSESCPHSITLSGSEFQA 180

Query: 181 QSRILELPCGLTLWSHITVVGTPRWAHLEDDPKISILREGDDSVMVSQFMMELQGLKTVD 240
           Q RI+ELPCGLTLWSHITVVGTP WAH E+DPKISIL+EGDDSV+VSQFMMELQGLKTVD
Sbjct: 181 QGRIMELPCGLTLWSHITVVGTPHWAHSEEDPKISILKEGDDSVLVSQFMMELQGLKTVD 240

Query: 241 GEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGTALRCEGWKARADEETVDGQVKCEK 300
           GEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGTALRCEGWK+RADEETVDGQVKCEK
Sbjct: 241 GEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGTALRCEGWKSRADEETVDGQVKCEK 300

Query: 301 WIRDDDSHSEESKVIWWLNRLIGRTKKVAIDWPYPFAEGRLFVLTVSAGLEGYHINVDGR 360
           WIRDDDS SEESKVIWWLNRLIGRTKKV IDWPYPF EGRLFVLTVSAGLEGYHINVDGR
Sbjct: 301 WIRDDDSRSEESKVIWWLNRLIGRTKKVMIDWPYPFVEGRLFVLTVSAGLEGYHINVDGR 360

Query: 361 HVTSFPYRTGFVLEDATGLSVNGDIDVHSIFAASLPATHPSFAPQKHIEMLTQWKAPSLP 420
           HVTSFPYRTGFVLEDATGLSVNGDIDVHS+FAASLP  HPSFAPQKH+EMLTQWKAP +P
Sbjct: 361 HVTSFPYRTGFVLEDATGLSVNGDIDVHSLFAASLPTAHPSFAPQKHMEMLTQWKAPPIP 420

Query: 421 KKNVELFIGVLSAGNHFAERMAVRKSWMQHRLIRSSIVVARFFVAMHGRKEVNIELKKEA 480
           K NVELFIG+LSAGNHFAERMAVRKSWMQHRLIRSS+ VARFFVAMHGRKEVN ELKKEA
Sbjct: 421 KSNVELFIGILSAGNHFAERMAVRKSWMQHRLIRSSLAVARFFVAMHGRKEVNTELKKEA 480

Query: 481 EYFGDIVIVPFMDNYDLVVLKTIAICEYGVRTVAANYIMKCDDDTFVRVDAVIDEAHKVQ 540
           EYFGDIVIVP+MDNYDLVVLKTIAICEYG RTVAA YIMKCDDDTFVRVDAV+ EAHKVQ
Sbjct: 481 EYFGDIVIVPYMDNYDLVVLKTIAICEYGARTVAAKYIMKCDDDTFVRVDAVLSEAHKVQ 540

Query: 541 SDGGSLYVGNMNFHHKPLRHGKWAVTYEEWPEEDYPTYANGPGYILSSDIAEYIVSEFEK 600
           + G SLYVGNMN+HHKPLRHGKWAVTYEEWPEEDYP YANGPGYILSSDIAEYIVSEFEK
Sbjct: 541 A-GRSLYVGNMNYHHKPLRHGKWAVTYEEWPEEDYPAYANGPGYILSSDIAEYIVSEFEK 600

Query: 601 HRLRLFKMEDVSMGMWVEQFNRTKPVLYHHSLRFCQFGCIEDYLTSHYQSPRQMVCLWEK 660
           H+LRLFKMEDVSMGMWVEQFN +KPV + HSLRFCQFGCIEDYLT+HYQSPRQM+CLW+K
Sbjct: 601 HKLRLFKMEDVSMGMWVEQFNSSKPVKFLHSLRFCQFGCIEDYLTAHYQSPRQMMCLWDK 660

Query: 661 LMHQRRPQCCNMR 674
           LM Q++PQCCNMR
Sbjct: 661 LMQQKKPQCCNMR 672

BLAST of CmaCh04G003200 vs. NCBI nr
Match: gi|703098149|ref|XP_010096305.1| (putative beta-1,3-galactosyltransferase 19 [Morus notabilis])

HSP 1 Score: 1124.4 bits (2907), Expect = 0.0e+00
Identity = 528/673 (78.45%), Postives = 590/673 (87.67%), Query Frame = 1

Query: 1   MKRGKLDTMVSRNRIRLLQFLMGLVFFYLLFMSFEIPLVYRTGYGSVPDDGTFGFTSDTL 60
           MKRGKLD+++S +R+RLLQ LM LVFF +LFMSFEIPLV RTG G+  D+  + F SD L
Sbjct: 47  MKRGKLDSLMSPSRLRLLQILMALVFFCMLFMSFEIPLVLRTGLGASGDE-MYSFISDAL 106

Query: 61  PRPFLLESEEEMADKDAPRRPSDDTFLVSHGSPHRTPERRMREFKKVSGLVFDESTFDRN 120
           PRP  LESEE+ ADKDAP RP+D+   V  GSPHRTP    REFKKVSGL F+ + FD +
Sbjct: 107 PRPLALESEEDFADKDAPSRPADNPLRVFGGSPHRTP---TREFKKVSGLAFNGTVFDAH 166

Query: 121 ASKGEFSELHKAVKHAWVVGKKLWGDLESGKIVLQPKTKTENQSETCPHSITLSGSEFEA 180
             +G  SELH A KHAW VG+KLW +LESGKI   P  K EN+SE CPHSI LSGS+F A
Sbjct: 167 VGEGNSSELHMAAKHAWAVGRKLWNELESGKIQNNPIVKPENRSEQCPHSIALSGSDFRA 226

Query: 181 QSRILELPCGLTLWSHITVVGTPRWAHLEDDPKISILREGDDSVMVSQFMMELQGLKTVD 240
           ++R+L LPCGLTLWSHITVVGTPRWAH E DPKI++L+EGD+SVMVSQFMMELQGLKTVD
Sbjct: 227 RNRVLVLPCGLTLWSHITVVGTPRWAHQEYDPKIAVLKEGDESVMVSQFMMELQGLKTVD 286

Query: 241 GEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGTALRCEGWKARADEETVDGQVKCEK 300
           GEDPPRILHFNPRLKGDWSGKPVIE+NTCYRMQWG+ALRCEGWK+RADEET+DGQVKCEK
Sbjct: 287 GEDPPRILHFNPRLKGDWSGKPVIEENTCYRMQWGSALRCEGWKSRADEETIDGQVKCEK 346

Query: 301 WIRDDDSHSEESKVIWWLNRLIGRTKKVAIDWPYPFAEGRLFVLTVSAGLEGYHINVDGR 360
           WIRDDD+HSEESK +WWLNRLIGRTKKV IDWPYPFAEGRLFVLTVSAGLEGYH+NVDGR
Sbjct: 347 WIRDDDNHSEESKALWWLNRLIGRTKKVTIDWPYPFAEGRLFVLTVSAGLEGYHVNVDGR 406

Query: 361 HVTSFPYRTGFVLEDATGLSVNGDIDVHSIFAASLPATHPSFAPQKHIEMLTQWKAPSLP 420
           HVTSFPYRTGFVLEDATGL VNGD+DVHS+FAASLP +HPSFAPQ H+EM  +WKAP L 
Sbjct: 407 HVTSFPYRTGFVLEDATGLFVNGDVDVHSVFAASLPTSHPSFAPQLHLEMSARWKAPPLS 466

Query: 421 KKNVELFIGVLSAGNHFAERMAVRKSWMQHRLIRSSIVVARFFVAMHGRKEVNIELKKEA 480
               ELFIG+LSAGNHFAERMAVRKSWMQH+LI+SS  VARFFVA+HGRKEVN+ELKKEA
Sbjct: 467 NDRAELFIGILSAGNHFAERMAVRKSWMQHKLIKSSHAVARFFVALHGRKEVNVELKKEA 526

Query: 481 EYFGDIVIVPFMDNYDLVVLKTIAICEYGVRTVAANYIMKCDDDTFVRVDAVIDEAHKVQ 540
           +YFGDIVIVP+MDNYDLVVLKTIAICEYG RTVAA +IMKCDDDTFVRVD V+ EAHKV 
Sbjct: 527 DYFGDIVIVPYMDNYDLVVLKTIAICEYGHRTVAAKHIMKCDDDTFVRVDTVLKEAHKVG 586

Query: 541 SDGGSLYVGNMNFHHKPLRHGKWAVTYEEWPEEDYPTYANGPGYILSSDIAEYIVSEFEK 600
            D  SLY+GN+N+HHKPLR+GKWAVTYEEWPEEDYP YANGPGYI+SSDIAE+I+SEFEK
Sbjct: 587 ED-KSLYIGNINYHHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIISSDIAEFIISEFEK 646

Query: 601 HRLRLFKMEDVSMGMWVEQFNRTKPVLYHHSLRFCQFGCIEDYLTSHYQSPRQMVCLWEK 660
           H+LRLFKMEDVSMGMWVEQFN +KPV Y HS+RFCQFGCI+DY T+HYQSPRQM+C+W K
Sbjct: 647 HKLRLFKMEDVSMGMWVEQFNSSKPVQYVHSVRFCQFGCIDDYYTAHYQSPRQMMCMWGK 706

Query: 661 LMHQRRPQCCNMR 674
           L    RPQCCNMR
Sbjct: 707 LQQHGRPQCCNMR 714

BLAST of CmaCh04G003200 vs. NCBI nr
Match: gi|645229969|ref|XP_008221709.1| (PREDICTED: probable beta-1,3-galactosyltransferase 19 [Prunus mume])

HSP 1 Score: 1108.6 bits (2866), Expect = 0.0e+00
Identity = 514/673 (76.37%), Postives = 589/673 (87.52%), Query Frame = 1

Query: 1   MKRGKLDTMVSRNRIRLLQFLMGLVFFYLLFMSFEIPLVYRTGYGSVPDDGTFGFTSDTL 60
           MKRGK+D+M+  +R+ ++Q L+G VF YLLF++FEIP V + G+GS   D +     D L
Sbjct: 1   MKRGKVDSMLPPSRLGMVQILIGAVFVYLLFITFEIPHVLKYGFGSSGSDDSL----DAL 60

Query: 61  PRPFLLESEEEMADKDAPRRPSDDTFLVSHGSPHRTPERRMREFKKVSGLVFDESTFDRN 120
           PR F+LESEEEM ++DAP RP++D F  S GSP RTP+RR RE KKVSGLVF ++ FD N
Sbjct: 61  PRTFMLESEEEMGERDAPSRPTEDPFRDSGGSPSRTPQRRTREVKKVSGLVFKDTLFDTN 120

Query: 121 ASKGEFSELHKAVKHAWVVGKKLWGDLESGKIVLQPKTKTENQSETCPHSITLSGSEFEA 180
            S+ + SELHKA K+AW  GKKLW +LESGK+    K K+EN+SE CPHS+ LSGSEFEA
Sbjct: 121 VSRDQVSELHKAAKNAWTAGKKLWAELESGKLEFGLKNKSENRSEPCPHSLILSGSEFEA 180

Query: 181 QSRILELPCGLTLWSHITVVGTPRWAHLEDDPKISILREGDDSVMVSQFMMELQGLKTVD 240
           + R++ LPCG+TLWSHITVVGTP+WAH E DPKIS+L+EGD++VMVSQFMMELQGLK V+
Sbjct: 181 RKRVMVLPCGMTLWSHITVVGTPKWAHSEYDPKISMLKEGDEAVMVSQFMMELQGLKNVE 240

Query: 241 GEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGTALRCEGWKARADEETVDGQVKCEK 300
           GEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWG+ALRCEGWK+RADE+TVDGQVKCEK
Sbjct: 241 GEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGSALRCEGWKSRADEDTVDGQVKCEK 300

Query: 301 WIRDDDSHSEESKVIWWLNRLIGRTKKVAIDWPYPFAEGRLFVLTVSAGLEGYHINVDGR 360
           WIRDDD HSEESK  WWLNRLIGRTKKV IDWPYPFAEG+LFVLTVSAGLEGYHINVDGR
Sbjct: 301 WIRDDDDHSEESKATWWLNRLIGRTKKVTIDWPYPFAEGKLFVLTVSAGLEGYHINVDGR 360

Query: 361 HVTSFPYRTGFVLEDATGLSVNGDIDVHSIFAASLPATHPSFAPQKHIEMLTQWKAPSLP 420
           H+TSFPYRTGF LEDATGLSVNGDIDVHS+ AASLP +HPSFAP  H+EM+T+WK PSLP
Sbjct: 361 HLTSFPYRTGFALEDATGLSVNGDIDVHSVLAASLPTSHPSFAPSMHLEMVTRWKVPSLP 420

Query: 421 KKNVELFIGVLSAGNHFAERMAVRKSWMQHRLIRSSIVVARFFVAMHGRKEVNIELKKEA 480
             +VELFIG+LSAGNHFAERMAVRKSWMQH+LI+SS VVARFFVA+HGR EVN+EL KE 
Sbjct: 421 YGHVELFIGILSAGNHFAERMAVRKSWMQHKLIKSSRVVARFFVALHGRNEVNMELMKEV 480

Query: 481 EYFGDIVIVPFMDNYDLVVLKTIAICEYGVRTVAANYIMKCDDDTFVRVDAVIDEAHKVQ 540
            YFGDIVIVP+MDNYDLVVLKT+AICEYG+RTV A YIMKCDDDTFVRVDAV+ E  KV 
Sbjct: 481 GYFGDIVIVPYMDNYDLVVLKTVAICEYGIRTVPAKYIMKCDDDTFVRVDAVLKEVRKVH 540

Query: 541 SDGGSLYVGNMNFHHKPLRHGKWAVTYEEWPEEDYPTYANGPGYILSSDIAEYIVSEFEK 600
               SLY+GNMN+HHKPLRHGKWAVTYEEWPEEDYP+YANGPGY+LSSDIA++IVS+FEK
Sbjct: 541 GH-RSLYIGNMNYHHKPLRHGKWAVTYEEWPEEDYPSYANGPGYVLSSDIAKFIVSDFEK 600

Query: 601 HRLRLFKMEDVSMGMWVEQFNRTKPVLYHHSLRFCQFGCIEDYLTSHYQSPRQMVCLWEK 660
           H+LRLFKMEDVSMGMWVEQFN +KPV Y HSL+FCQFGCI+DY T+HYQSPRQM+C+W+K
Sbjct: 601 HKLRLFKMEDVSMGMWVEQFNNSKPVEYVHSLKFCQFGCIDDYYTAHYQSPRQMICMWDK 660

Query: 661 LMHQRRPQCCNMR 674
           L HQ +PQCCNMR
Sbjct: 661 LQHQGKPQCCNMR 668

BLAST of CmaCh04G003200 vs. NCBI nr
Match: gi|596274467|ref|XP_007225156.1| (hypothetical protein PRUPE_ppa002487mg [Prunus persica])

HSP 1 Score: 1103.6 bits (2853), Expect = 0.0e+00
Identity = 512/673 (76.08%), Postives = 589/673 (87.52%), Query Frame = 1

Query: 1   MKRGKLDTMVSRNRIRLLQFLMGLVFFYLLFMSFEIPLVYRTGYGSVPDDGTFGFTSDTL 60
           MKRGK+D+M+  +R+ ++Q L+G VF YLLF++FEIP V + G+GS   D +     D L
Sbjct: 1   MKRGKVDSMLPPSRLGMVQILIGAVFVYLLFITFEIPHVLKHGFGSSGSDDSL----DAL 60

Query: 61  PRPFLLESEEEMADKDAPRRPSDDTFLVSHGSPHRTPERRMREFKKVSGLVFDESTFDRN 120
           P  F+LESEEEM + DAP RP+++ F  S GSP RTP+RR RE KKVSGLVF ++ FD N
Sbjct: 61  PITFMLESEEEMGESDAPSRPTENPFRDSEGSPSRTPQRRTREAKKVSGLVFKDTLFDAN 120

Query: 121 ASKGEFSELHKAVKHAWVVGKKLWGDLESGKIVLQPKTKTENQSETCPHSITLSGSEFEA 180
            S+ + SELHKA ++AW  GKKLW +LESGK+    K K+EN+SE CPHS+ LSGSEFEA
Sbjct: 121 VSRDQVSELHKAARNAWTAGKKLWAELESGKLEFGLKNKSENRSEPCPHSLILSGSEFEA 180

Query: 181 QSRILELPCGLTLWSHITVVGTPRWAHLEDDPKISILREGDDSVMVSQFMMELQGLKTVD 240
           + R++ LPCG+TLWSHITVVGTP+WAH E DPKIS+L+EGD++VMVSQFMMELQGLK V+
Sbjct: 181 RKRVMVLPCGMTLWSHITVVGTPKWAHSEYDPKISMLKEGDEAVMVSQFMMELQGLKIVE 240

Query: 241 GEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGTALRCEGWKARADEETVDGQVKCEK 300
           GEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWG+ALRCEGWK+RADE+TVDGQVKCEK
Sbjct: 241 GEDPPRILHFNPRLKGDWSGKPVIEQNTCYRMQWGSALRCEGWKSRADEDTVDGQVKCEK 300

Query: 301 WIRDDDSHSEESKVIWWLNRLIGRTKKVAIDWPYPFAEGRLFVLTVSAGLEGYHINVDGR 360
           WIRDDD HSEESK  WWLNRLIGRTKKV IDWPYPFAEG+LFVLTVSAGLEGYHINVDGR
Sbjct: 301 WIRDDDDHSEESKATWWLNRLIGRTKKVTIDWPYPFAEGKLFVLTVSAGLEGYHINVDGR 360

Query: 361 HVTSFPYRTGFVLEDATGLSVNGDIDVHSIFAASLPATHPSFAPQKHIEMLTQWKAPSLP 420
           H+TSFPYRTGF LEDATGLSVNGDIDVHS+ AASLP +HPSFAP  H+EM+T+WKAPSLP
Sbjct: 361 HLTSFPYRTGFALEDATGLSVNGDIDVHSVLAASLPTSHPSFAPSMHLEMVTRWKAPSLP 420

Query: 421 KKNVELFIGVLSAGNHFAERMAVRKSWMQHRLIRSSIVVARFFVAMHGRKEVNIELKKEA 480
             +VELFIG+LSAGNHFAERMAVRKSWMQH+LI+SS VVARFFVA+HGR EVN+EL KE 
Sbjct: 421 YGHVELFIGILSAGNHFAERMAVRKSWMQHKLIKSSRVVARFFVALHGRNEVNMELMKEV 480

Query: 481 EYFGDIVIVPFMDNYDLVVLKTIAICEYGVRTVAANYIMKCDDDTFVRVDAVIDEAHKVQ 540
            YFGDIVIVP+MDNYDLVVLKT+AICEYG+RTV A YIMKCDDDTFVR+DAV+ EA KV 
Sbjct: 481 GYFGDIVIVPYMDNYDLVVLKTVAICEYGIRTVPAKYIMKCDDDTFVRLDAVLKEARKVH 540

Query: 541 SDGGSLYVGNMNFHHKPLRHGKWAVTYEEWPEEDYPTYANGPGYILSSDIAEYIVSEFEK 600
               SLY+GNMN+HHKPLRHGKWAVTYEEWPEEDYP+YANGPGY+LSSDIA++IVS+FEK
Sbjct: 541 GH-RSLYIGNMNYHHKPLRHGKWAVTYEEWPEEDYPSYANGPGYVLSSDIAKFIVSDFEK 600

Query: 601 HRLRLFKMEDVSMGMWVEQFNRTKPVLYHHSLRFCQFGCIEDYLTSHYQSPRQMVCLWEK 660
           H+LRLFKMEDVSMGMWVEQFN +KPV Y HSL+FCQFGCI+DY T+HYQSPRQM+C+W+K
Sbjct: 601 HKLRLFKMEDVSMGMWVEQFNNSKPVEYVHSLKFCQFGCIDDYYTAHYQSPRQMICMWDK 660

Query: 661 LMHQRRPQCCNMR 674
           L HQ +PQCCNMR
Sbjct: 661 LQHQGKPQCCNMR 668

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
B3GTH_ARATH1.4e-26565.51Hydroxyproline O-galactosyltransferase GALT4 OS=Arabidopsis thaliana GN=GALT4 PE... [more]
B3GTJ_ARATH9.5e-26263.53Hydroxyproline O-galactosyltransferase GALT6 OS=Arabidopsis thaliana GN=GALT6 PE... [more]
B3GTI_ARATH2.5e-25463.44Hydroxyproline O-galactosyltransferase GALT5 OS=Arabidopsis thaliana GN=GALT5 PE... [more]
B3GTK_ARATH1.4e-20151.59Hydroxyproline O-galactosyltransferase GALT2 OS=Arabidopsis thaliana GN=GALT2 PE... [more]
B3GTF_ARATH1.1e-9236.91Beta-1,3-galactosyltransferase GALT1 OS=Arabidopsis thaliana GN=GALT1 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KQG2_CUCSA0.0e+0089.75Uncharacterized protein OS=Cucumis sativus GN=Csa_5G604080 PE=4 SV=1[more]
W9R193_9ROSA0.0e+0078.45Putative beta-1,3-galactosyltransferase 19 OS=Morus notabilis GN=L484_021051 PE=... [more]
M5Y3K0_PRUPE0.0e+0076.08Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002487mg PE=4 SV=1[more]
A0A061E5K7_THECC0.0e+0077.25Galactosyltransferase family protein isoform 1 OS=Theobroma cacao GN=TCM_010069 ... [more]
F6HPH6_VITVI0.0e+0076.07Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0026g01240 PE=4 SV=... [more]
Match NameE-valueIdentityDescription
AT1G27120.18.0e-26765.51 Galactosyltransferase family protein[more]
AT5G62620.15.3e-26363.53 Galactosyltransferase family protein[more]
AT1G74800.11.4e-25563.44 Galactosyltransferase family protein[more]
AT4G21060.13.8e-20052.22 Galactosyltransferase family protein[more]
AT1G26810.16.2e-9436.91 galactosyltransferase1[more]
Match NameE-valueIdentityDescription
gi|659090947|ref|XP_008446287.1|0.0e+0090.64PREDICTED: probable beta-1,3-galactosyltransferase 19 [Cucumis melo][more]
gi|449434851|ref|XP_004135209.1|0.0e+0089.75PREDICTED: probable beta-1,3-galactosyltransferase 19 [Cucumis sativus][more]
gi|703098149|ref|XP_010096305.1|0.0e+0078.45putative beta-1,3-galactosyltransferase 19 [Morus notabilis][more]
gi|645229969|ref|XP_008221709.1|0.0e+0076.37PREDICTED: probable beta-1,3-galactosyltransferase 19 [Prunus mume][more]
gi|596274467|ref|XP_007225156.1|0.0e+0076.08hypothetical protein PRUPE_ppa002487mg [Prunus persica][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001079Galectin_CRD
IPR002659Glyco_trans_31
IPR013320ConA-like_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0030246carbohydrate binding
GO:0008378galactosyltransferase activity
Vocabulary: Biological Process
TermDefinition
GO:0006486protein glycosylation
Vocabulary: Cellular Component
TermDefinition
GO:0016020membrane
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0030206 chondroitin sulfate biosynthetic process
biological_process GO:0006486 protein glycosylation
cellular_component GO:0005794 Golgi apparatus
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0016020 membrane
molecular_function GO:0030246 carbohydrate binding
molecular_function GO:0047220 galactosylxylosylprotein 3-beta-galactosyltransferase activity
molecular_function GO:0008378 galactosyltransferase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh04G003200.1CmaCh04G003200.1mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001079Galectin, carbohydrate recognition domainPFAMPF00337Gal-bind_lectincoord: 183..391
score: 2.4
IPR001079Galectin, carbohydrate recognition domainSMARTSM00908Gal_bind_lectin_2coord: 187..392
score: 1.9
IPR001079Galectin, carbohydrate recognition domainPROFILEPS51304GALECTINcoord: 183..393
score: 28
IPR002659Glycosyl transferase, family 31PANTHERPTHR11214BETA-1,3-N-ACETYLGLUCOSAMINYLTRANSFERASEcoord: 256..673
score:
IPR002659Glycosyl transferase, family 31PFAMPF01762Galactosyl_Tcoord: 439..620
score: 4.4
IPR013320Concanavalin A-like lectin/glucanase domainGENE3DG3DSA:2.60.120.200coord: 186..282
score: 7.7E-25coord: 334..390
score: 7.7
IPR013320Concanavalin A-like lectin/glucanase domainunknownSSF49899Concanavalin A-like lectins/glucanasescoord: 185..282
score: 5.85E-22coord: 334..391
score: 5.85
NoneNo IPR availablePANTHERPTHR11214:SF103BETA-1,3-GALACTOSYLTRANSFERASE 17-RELATEDcoord: 256..673
score: