CSPI01G08100 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI01G08100
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: hydroxyproline O-galactosyltransferase GALT6
Location: Chr1: 5108756 .. 5112267 (-)
RNA-Seq Expression: CSPI01G08100
Synteny: CSPI01G08100
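The Location field above packs chromosome, coordinates, and strand into one string. The following is a minimal sketch of how it could be parsed, assuming the coordinates are 1-based and inclusive (the record does not state the convention). The names `Location`, `parse_location`, and `reverse_complement` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Genomic interval as written in the Location field (hypothetical helper)."""
    chrom: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'Chr1: 5108756 .. 5112267 (-)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = m.groups()
    return Location(chrom, int(start), int(end), strand)


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.translate(str.maketrans("ACGTacgt", "TGCAtgca"))[::-1]


loc = parse_location("Chr1: 5108756 .. 5112267 (-)")
print(loc.chrom, loc.start, loc.end, loc.strand, loc.length)
```

For a feature on the minus strand, a sequence sliced from the reference chromosome would need `reverse_complement` applied before it reads in transcript orientation.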
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CTCTCATTTTCAAATTCTCATTGGAAGATTCGATCTATTTTCTGTTTCCGTTGGAAAGTCATTGCCTTTCAAAAGAAAAAAGAAAACCCATTATTCTATTCTATTCTATTCTATTGATTCTTCAAGTTCTCTGTTTCCTGGAAAGTGTTTTTGGTGGTTCGTCTGAGTTCTTTTTCTTTCAACTCTTTTCATTCTTAAGCCCCTTTTGAGACTTGGGTCTCTTTCGATTTTTAGAATCAAGAGCAATCTCTATCTTGAATCGTGTATCTGATCATCTGGGGTTTTTTAAAACCAGAAAAGAAAGAAGAAGAAGAAGGATTAACGATTGAAGGGTTTTGATTATCTACTCATTTTGAAGTCTTCTTTTAGGGAAAATTCTTTTTCTTTACTTTTCGGGAGTTGGGGTTGGGGCTTCTTTGTTAATCCCCTCCTTCATAGTAGAGATGAAGAGAGGGAAATTGGAGAAAGTTGATATGATCGTTTCTTTTACTAGACAGAGATCGATTCAAATACTTTTAATCATTGGGGTTCTGTATCTTCTTCTGGTTTCTTTAGAAATTCCTCTTGTTTTCAGAGCTGGGTCCAGCGTTGTTTCTCAAGACTCGTTGTCTCGGCCTTCTCCGCTTGAGAGCGAGGAAGATTTGGAAGAAAGAGAAGCCCCATCTCGTCCTTTGGAAAATATATCGAGAAATTCGTTACAGCCGACTCCTAGTCGACTGAATCAGTTCAATAAAATCATCTCCGGGTTGGCCTTGGAAACGGAGGCTTTTGAATCGAGGAGTGAAGATGCGGTTTCTGAGTTTTATAGGTCTGCAAAAATTGCTTCTGAGGTTGGGAAGAAATTCTGGGATGAGCTTGAATCTGGAAAGAGTCAACATTTGGAGAAGAAGAAGGCTGAAAAAGGATCAAATTCCTCTTGCCCACATTCGATTTCTTTATCTGGGAATGATTTTTTGGCTCATGGTGGAGTTATGATGCTGCCCTGTGGACTCACATTGGGATCACATATAACCCTTGTGGGTAAGCCAAGAGTGGCACAACCAGAGTCCGATCCTCAGATAACTATGGTGAAAAATGGTGAAGAGTCGGTTATGGTTTCACAGTTCATAATGGAGTTGCAGGGCTTGAACACTGTGGAGGGTGAAGACCCTCCTAGGATTTTGCATTTCAATCCAAGGTTGAAGGGGGATTGGAGTGGAAAGCCAGTGATTGAACTGAACACATGTTACAGGATGCAATGGGGCTCGGCGCATCGCTGCGAAGGATGGAAATCGAAGGCCAACGAAGATACAGGTAAATGACCGAAGAATTCAATCATCCTTCGTAGTTTTATAGCTGGCCAGAGCATTGGATTTTATTGACTTCATATGAGTAGAACATGCTTCTTTTTATCCATCTTTGAATGCTGATGGATTGTGATCTTTTTGGCTTCCGTTTCATCTTTTGTTAACTTTTTAGTCTATAACTCAATAGAGTATTGAAATGTTCTGCTTAGCTGAGAATGAGAGCATTATGAACATCATTCTTAGAGAACGCTGAATCATTATGTAATACTGTGATGGCTGCAGTAATTTGACTAATTCTTAAGATTTCTACAGTTGACGGCCAGGTGAAGTGCGAAAAGTGGATCAGAGATGACGAAGGGAACTCAGAGCGGTCAAAGGCGACATGGTGGTTAAACCGACTCATTGGTCGAACAAAAAGAATGGATATTGACTGGCCTTATCCTTTTGCTGAAGACAAGCTCTTTGTCCTAACACTTAGTGCAGGATTTGAAGGTTACCATGTGAATGTTGATGGGAAGCATATTGTTTCTTTCCCTTATCGCACTGTGAGTATATATACTTTAGAAATGATAGATGCCTTTGCCAAAAGTTTTTCTTTCCTTCTCTAATGGTCAACATCGTAGGGATTTGCTCTCGAAGATGCCACTGGATTATCTGTCATTGGGGACATTGATGTCCAGTCCGTGTTGGCTGCTTCTTTGCCACAATCACATC
CTAGTTTTGCTCCTCAGCAGCACCTTGAGATGTCGAGGAGATGGCAAGCACCGCCTCTTCCTGATGGTGAGATAGATCTTTTTATTGGTATCCTTTCTGCTGGCAACCACTTTGCTGAACGGATGGCAGTAAGGAAGTCGTGGATGCGACATAAACTCATAAGGTCGTCAAAGATTGTAGCTCGATTTTTTGTAGCACTTGTAAGTGATAATAAAAACAGTTCTTTTTTCATTCAAGTCCAGGTTATTCTGCATTCTTATATGTGTATATAACCATTCTGAGTTCATACTTATGCAGCATGCAAGAAAGGAAGTAAATGTTGAGTTGAAGAAAGAAGCTGAGTTTTTTGGGGATATAGTTATAGTGCCTTACATGGATAACTACGACCTTGTGGTTTTGAAAACTGTGGCCATCTGTGAACATGGGGTGAGTTTCCGAACTAATATGATGCTGCTAATGTATTGATTCGAGTATACCAAGACAAGTTGTCTTATGATATATCATTAATCTTCGAGTTCTATCTTTTACATAGGTTCATGCTGTGTCTGCAAAATACATTATGAAATGTGATGACGATACGTTTGTAAAAGTGGATTCAATCATGAATGAAATTAAGAGCGTATCGGGCACCGGGAGCGTGTACATTGGCAACATAAATTACTACCACAAGCCCCTGCGCTACGGAAAATGGGCTGTCACGTATGAGGTAAGTTCCCATATATCTCGACTTTGCATTACATATCTTGGTTCCTTCTCTAATGTTGGGTTGGTGGATTGTTTCATTCAGGAATGGCCAGAAGAAGATTATCCGCCTTACGCAAATGGACCGGGGTATATAGTGTCATCTGATATTGCTCAGTTTGTCATATCAAACTTCGAGAGACGTAAATTAAGGGTACGTATACATTCCCTTTGAACATCAGTTAAAATTCAAATGATTTTGTCTTAAAACTTGAGAGAACAAGAACTAAAAGAGAGAACCGTTGTGTTTTTGACAGTTATTTAAGATGGAAGATGTGAGTATGGGGATGTGGGTGGAGCAATTCAACAGCTCAAAAGCAGTGAAATATGTGCATAGTTTCAAATATTGTCAATTTGGATGCATAGAAGAATATTCCACAGCCCATTATCAATCTCCAAGACAAATGATTTGCCTTTGGAACAAATTACTAAGGCAAGCGAAGCCTGAGTGTTGCAATATGAGATGATGAAGTTCTTGATAAAGAAAAAAAAACCTTGCTGCAGTCAGTCAATGCCTATTTTTTGACTTGCCTGTACATGAATCTCCTGCCTTGTATGCCTTGTATATTATCTCAAGCATTGGTTTTAGGCAATTTTTAGTTCATGTAGAGAAGAAAAAGAAGTTCACTCCAAATATCAATTCATTATTTCTTGAGCTCTATCTTGTGCTTTTTGGTTTGGGTTTTGCTTCTTGGGTGGGCTCTTTTTGGGGCTGGTGGGGTGGCAGTTCATGGGCCTTGGAATTGTGGATCACAAAGCCCATTTGAGTTT

mRNA sequence

CTCTCATTTTCAAATTCTCATTGGAAGATTCGATCTATTTTCTGTTTCCGTTGGAAAGTCATTGCCTTTCAAAAGAAAAAAGAAAACCCATTATTCTATTCTATTCTATTCTATTGATTCTTCAAGTTCTCTGTTTCCTGGAAAGTGTTTTTGGTGGTTCGTCTGAGTTCTTTTTCTTTCAACTCTTTTCATTCTTAAGCCCCTTTTGAGACTTGGGTCTCTTTCGATTTTTAGAATCAAGAGCAATCTCTATCTTGAATCGTGTATCTGATCATCTGGGGTTTTTTAAAACCAGAAAAGAAAGAAGAAGAAGAAGGATTAACGATTGAAGGGTTTTGATTATCTACTCATTTTGAAGTCTTCTTTTAGGGAAAATTCTTTTTCTTTACTTTTCGGGAGTTGGGGTTGGGGCTTCTTTGTTAATCCCCTCCTTCATAGTAGAGATGAAGAGAGGGAAATTGGAGAAAGTTGATATGATCGTTTCTTTTACTAGACAGAGATCGATTCAAATACTTTTAATCATTGGGGTTCTGTATCTTCTTCTGGTTTCTTTAGAAATTCCTCTTGTTTTCAGAGCTGGGTCCAGCGTTGTTTCTCAAGACTCGTTGTCTCGGCCTTCTCCGCTTGAGAGCGAGGAAGATTTGGAAGAAAGAGAAGCCCCATCTCGTCCTTTGGAAAATATATCGAGAAATTCGTTACAGCCGACTCCTAGTCGACTGAATCAGTTCAATAAAATCATCTCCGGGTTGGCCTTGGAAACGGAGGCTTTTGAATCGAGGAGTGAAGATGCGGTTTCTGAGTTTTATAGGTCTGCAAAAATTGCTTCTGAGGTTGGGAAGAAATTCTGGGATGAGCTTGAATCTGGAAAGAGTCAACATTTGGAGAAGAAGAAGGCTGAAAAAGGATCAAATTCCTCTTGCCCACATTCGATTTCTTTATCTGGGAATGATTTTTTGGCTCATGGTGGAGTTATGATGCTGCCCTGTGGACTCACATTGGGATCACATATAACCCTTGTGGGTAAGCCAAGAGTGGCACAACCAGAGTCCGATCCTCAGATAACTATGGTGAAAAATGGTGAAGAGTCGGTTATGGTTTCACAGTTCATAATGGAGTTGCAGGGCTTGAACACTGTGGAGGGTGAAGACCCTCCTAGGATTTTGCATTTCAATCCAAGGTTGAAGGGGGATTGGAGTGGAAAGCCAGTGATTGAACTGAACACATGTTACAGGATGCAATGGGGCTCGGCGCATCGCTGCGAAGGATGGAAATCGAAGGCCAACGAAGATACAGTTGACGGCCAGGTGAAGTGCGAAAAGTGGATCAGAGATGACGAAGGGAACTCAGAGCGGTCAAAGGCGACATGGTGGTTAAACCGACTCATTGGTCGAACAAAAAGAATGGATATTGACTGGCCTTATCCTTTTGCTGAAGACAAGCTCTTTGTCCTAACACTTAGTGCAGGATTTGAAGGTTACCATGTGAATGTTGATGGGAAGCATATTGTTTCTTTCCCTTATCGCACTGGATTTGCTCTCGAAGATGCCACTGGATTATCTGTCATTGGGGACATTGATGTCCAGTCCGTGTTGGCTGCTTCTTTGCCACAATCACATCCTAGTTTTGCTCCTCAGCAGCACCTTGAGATGTCGAGGAGATGGCAAGCACCGCCTCTTCCTGATGGTGAGATAGATCTTTTTATTGGTATCCTTTCTGCTGGCAACCACTTTGCTGAACGGATGGCAGTAAGGAAGTCGTGGATGCGACATAAACTCATAAGGTCGTCAAAGATTGTAGCTCGATTTTTTGTAGCACTTCATGCAAGAAAGGAAGTAAATGTTGAGTTGAAGAAAGAAGCTGAGTTTTTTGGGGATATAGTTATAGTGCCTTACATGGATAACTACGACCTTGTGGTTTTGAAAACTGTGGCCATCTGTGAACATGGGGTTCATGCTGTGTCTGCAAAATACATTATGAAATGTGATGACGATACGTTTGTA
AAAGTGGATTCAATCATGAATGAAATTAAGAGCGTATCGGGCACCGGGAGCGTGTACATTGGCAACATAAATTACTACCACAAGCCCCTGCGCTACGGAAAATGGGCTGTCACGTATGAGGAATGGCCAGAAGAAGATTATCCGCCTTACGCAAATGGACCGGGGTATATAGTGTCATCTGATATTGCTCAGTTTGTCATATCAAACTTCGAGAGACGTAAATTAAGGTTATTTAAGATGGAAGATGTGAGTATGGGGATGTGGGTGGAGCAATTCAACAGCTCAAAAGCAGTGAAATATGTGCATAGTTTCAAATATTGTCAATTTGGATGCATAGAAGAATATTCCACAGCCCATTATCAATCTCCAAGACAAATGATTTGCCTTTGGAACAAATTACTAAGGCAAGCGAAGCCTGAGTGTTGCAATATGAGATGATGAAGTTCTTGATAAAGAAAAAAAAACCTTGCTGCAGTCAGTCAATGCCTATTTTTTGACTTGCCTGTACATGAATCTCCTGCCTTGTATGCCTTGTATATTATCTCAAGCATTGGTTTTAGGCAATTTTTAGTTCATGTAGAGAAGAAAAAGAAGTTCACTCCAAATATCAATTCATTATTTCTTGAGCTCTATCTTGTGCTTTTTGGTTTGGGTTTTGCTTCTTGGGTGGGCTCTTTTTGGGGCTGGTGGGGTGGCAGTTCATGGGCCTTGGAATTGTGGATCACAAAGCCCATTTGAGTTT

Coding sequence (CDS)

ATGAAGAGAGGGAAATTGGAGAAAGTTGATATGATCGTTTCTTTTACTAGACAGAGATCGATTCAAATACTTTTAATCATTGGGGTTCTGTATCTTCTTCTGGTTTCTTTAGAAATTCCTCTTGTTTTCAGAGCTGGGTCCAGCGTTGTTTCTCAAGACTCGTTGTCTCGGCCTTCTCCGCTTGAGAGCGAGGAAGATTTGGAAGAAAGAGAAGCCCCATCTCGTCCTTTGGAAAATATATCGAGAAATTCGTTACAGCCGACTCCTAGTCGACTGAATCAGTTCAATAAAATCATCTCCGGGTTGGCCTTGGAAACGGAGGCTTTTGAATCGAGGAGTGAAGATGCGGTTTCTGAGTTTTATAGGTCTGCAAAAATTGCTTCTGAGGTTGGGAAGAAATTCTGGGATGAGCTTGAATCTGGAAAGAGTCAACATTTGGAGAAGAAGAAGGCTGAAAAAGGATCAAATTCCTCTTGCCCACATTCGATTTCTTTATCTGGGAATGATTTTTTGGCTCATGGTGGAGTTATGATGCTGCCCTGTGGACTCACATTGGGATCACATATAACCCTTGTGGGTAAGCCAAGAGTGGCACAACCAGAGTCCGATCCTCAGATAACTATGGTGAAAAATGGTGAAGAGTCGGTTATGGTTTCACAGTTCATAATGGAGTTGCAGGGCTTGAACACTGTGGAGGGTGAAGACCCTCCTAGGATTTTGCATTTCAATCCAAGGTTGAAGGGGGATTGGAGTGGAAAGCCAGTGATTGAACTGAACACATGTTACAGGATGCAATGGGGCTCGGCGCATCGCTGCGAAGGATGGAAATCGAAGGCCAACGAAGATACAGTTGACGGCCAGGTGAAGTGCGAAAAGTGGATCAGAGATGACGAAGGGAACTCAGAGCGGTCAAAGGCGACATGGTGGTTAAACCGACTCATTGGTCGAACAAAAAGAATGGATATTGACTGGCCTTATCCTTTTGCTGAAGACAAGCTCTTTGTCCTAACACTTAGTGCAGGATTTGAAGGTTACCATGTGAATGTTGATGGGAAGCATATTGTTTCTTTCCCTTATCGCACTGGATTTGCTCTCGAAGATGCCACTGGATTATCTGTCATTGGGGACATTGATGTCCAGTCCGTGTTGGCTGCTTCTTTGCCACAATCACATCCTAGTTTTGCTCCTCAGCAGCACCTTGAGATGTCGAGGAGATGGCAAGCACCGCCTCTTCCTGATGGTGAGATAGATCTTTTTATTGGTATCCTTTCTGCTGGCAACCACTTTGCTGAACGGATGGCAGTAAGGAAGTCGTGGATGCGACATAAACTCATAAGGTCGTCAAAGATTGTAGCTCGATTTTTTGTAGCACTTCATGCAAGAAAGGAAGTAAATGTTGAGTTGAAGAAAGAAGCTGAGTTTTTTGGGGATATAGTTATAGTGCCTTACATGGATAACTACGACCTTGTGGTTTTGAAAACTGTGGCCATCTGTGAACATGGGGTTCATGCTGTGTCTGCAAAATACATTATGAAATGTGATGACGATACGTTTGTAAAAGTGGATTCAATCATGAATGAAATTAAGAGCGTATCGGGCACCGGGAGCGTGTACATTGGCAACATAAATTACTACCACAAGCCCCTGCGCTACGGAAAATGGGCTGTCACGTATGAGGAATGGCCAGAAGAAGATTATCCGCCTTACGCAAATGGACCGGGGTATATAGTGTCATCTGATATTGCTCAGTTTGTCATATCAAACTTCGAGAGACGTAAATTAAGGTTATTTAAGATGGAAGATGTGAGTATGGGGATGTGGGTGGAGCAATTCAACAGCTCAAAAGCAGTGAAATATGTGCATAGTTTCAAATATTGTCAATTTGGATGCATAGAAGAATATTCCACAGCCCATTATCAATCTCCAAGACAAATGATTTGCCTTTGGAACAAATTACTAAGGCAAGCGAAGCCTGAGTGTTGCAATATGAGATGA
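The coding sequence above and the protein sequence that follows it are related by the standard genetic code. As a hedged illustration, the sketch below translates the first and last few codons of the CDS; it assumes the standard nuclear codon table, and the names `CODON_TABLE` and `translate` are hypothetical. The two fragments are copied from the start and end of the CDS shown above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; unknown codons become 'X'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


cds_start = "ATGAAGAGAGGGAAATTGGAGAAAGTTGATATG"  # first 11 codons of the CDS
cds_end = "CCTGAGTGTTGCAATATGAGATGA"             # last 8 codons of the CDS

print(translate(cds_start))  # MKRGKLEKVDM
print(translate(cds_end))    # PECCNMR*
```

The trailing `*` marks the TGA stop codon, matching the `*` at the end of the listed protein sequence.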

Protein sequence

MKRGKLEKVDMIVSFTRQRSIQILLIIGVLYLLLVSLEIPLVFRAGSSVVSQDSLSRPSPLESEEDLEEREAPSRPLENISRNSLQPTPSRLNQFNKIISGLALETEAFESRSEDAVSEFYRSAKIASEVGKKFWDELESGKSQHLEKKKAEKGSNSSCPHSISLSGNDFLAHGGVMMLPCGLTLGSHITLVGKPRVAQPESDPQITMVKNGEESVMVSQFIMELQGLNTVEGEDPPRILHFNPRLKGDWSGKPVIELNTCYRMQWGSAHRCEGWKSKANEDTVDGQVKCEKWIRDDEGNSERSKATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDGKHIVSFPYRTGFALEDATGLSVIGDIDVQSVLAASLPQSHPSFAPQQHLEMSRRWQAPPLPDGEIDLFIGILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHARKEVNVELKKEAEFFGDIVIVPYMDNYDLVVLKTVAICEHGVHAVSAKYIMKCDDDTFVKVDSIMNEIKSVSGTGSVYIGNINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIAQFVISNFERRKLRLFKMEDVSMGMWVEQFNSSKAVKYVHSFKYCQFGCIEEYSTAHYQSPRQMICLWNKLLRQAKPECCNMR*
Homology
BLAST of CSPI01G08100 vs. ExPASy Swiss-Prot
Match: Q9LV16 (Hydroxyproline O-galactosyltransferase GALT6 OS=Arabidopsis thaliana OX=3702 GN=GALT6 PE=2 SV=2)

HSP 1 Score: 936.8 bits (2420), Expect = 1.3e-271
Identity = 447/674 (66.32%), Positives = 556/674 (82.49%), Query Frame = 0

Query: 2   KRGKLEKVDMIVSFTRQRSIQILLIIGVLYLLLVSLEIPLVFRAGSSVVSQDSLSRPSPL 61
           K  +LEK D+ VS ++QRS+QIL+ +G+LY+LL++ EIP VF+ G S +SQD L+RP   
Sbjct: 8   KLERLEKFDIFVSLSKQRSVQILMAVGLLYMLLITFEIPFVFKTGLSSLSQDPLTRPEKH 67

Query: 62  ESEEDLEEREAPSRPLEN-ISRNSLQPTPSR-LNQFNKIISGLALETEAFESRSEDAVSE 121
            S+ +L+ER AP+RPL++ + + S   +P++ L +  +I+S L  + E F   S+D   E
Sbjct: 68  NSQRELQERRAPTRPLKSLLYQESQSESPAQGLRRRTRILSSLRFDPETFNPSSKDGSVE 127

Query: 122 FYRSAKIASEVGKKFWDELESGKS----QHLEKKKAEKGSNSSCPHSISLSGNDFLAHGG 181
            ++SAK+A EVG+K W+ELESGK+    +  +KKK E+   +SC  S+SL+G+D L  G 
Sbjct: 128 LHKSAKVAWEVGRKIWEELESGKTLKALEKEKKKKIEEHGTNSCSLSVSLTGSDLLKRGN 187

Query: 182 VMMLPCGLTLGSHITLVGKPRVAQPESDPQITMVKNGEESVMVSQFIMELQGLNTVEGED 241
           +M LPCGLTLGSHIT+VGKPR A  E DP+I+M+K G+E+V VSQF +ELQGL  VEGE+
Sbjct: 188 IMELPCGLTLGSHITVVGKPRAAHSEKDPKISMLKEGDEAVKVSQFKLELQGLKAVEGEE 247

Query: 242 PPRILHFNPRLKGDWSGKPVIELNTCYRMQWGSAHRCEGWKSKANEDTVDGQVKCEKWIR 301
           PPRILH NPRLKGDWSGKPVIE NTCYRMQWGSA RCEGW+S+ +E+TVDGQVKCEKW R
Sbjct: 248 PPRILHLNPRLKGDWSGKPVIEQNTCYRMQWGSAQRCEGWRSRDDEETVDGQVKCEKWAR 307

Query: 302 DDEGNS---ERSK-ATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDG 361
           DD   S   E SK A+WWL+RLIGR+K++ ++WP+PF  DKLFVLTLSAG EGYHV+VDG
Sbjct: 308 DDSITSKEEESSKAASWWLSRLIGRSKKVTVEWPFPFTVDKLFVLTLSAGLEGYHVSVDG 367

Query: 362 KHIVSFPYRTGFALEDATGLSVIGDIDVQSVLAASLPQSHPSFAPQQHLEMSRRWQAPPL 421
           KH+ SFPYRTGF LEDATGL++ GDIDV SV A SLP SHPSF+PQ+HLE+S  WQAP L
Sbjct: 368 KHVTSFPYRTGFTLEDATGLTINGDIDVHSVFAGSLPTSHPSFSPQRHLELSSNWQAPSL 427

Query: 422 PDGEIDLFIGILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHARKEVNVELKKE 481
           PD ++D+FIGILSAGNHFAERMAVR+SWM+HKL++SSK+VARFFVALH+RKEVNVELKKE
Sbjct: 428 PDEQVDMFIGILSAGNHFAERMAVRRSWMQHKLVKSSKVVARFFVALHSRKEVNVELKKE 487

Query: 482 AEFFGDIVIVPYMDNYDLVVLKTVAICEHGVHAVSAKYIMKCDDDTFVKVDSIMNEIKSV 541
           AEFFGDIVIVPYMD+YDLVVLKTVAICE+G H ++AK+IMKCDDDTFV+VD++++E K  
Sbjct: 488 AEFFGDIVIVPYMDSYDLVVLKTVAICEYGAHQLAAKFIMKCDDDTFVQVDAVLSEAKKT 547

Query: 542 SGTGSVYIGNINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIAQFVISNFER 601
               S+YIGNINYYHKPLR GKW+VTYEEWPEEDYPPYANGPGYI+S+DI++F++  FE+
Sbjct: 548 PTDRSLYIGNINYYHKPLRQGKWSVTYEEWPEEDYPPYANGPGYILSNDISRFIVKEFEK 607

Query: 602 RKLRLFKMEDVSMGMWVEQFNS-SKAVKYVHSFKYCQFGCIEEYSTAHYQSPRQMICLWN 661
            KLR+FKMEDVS+GMWVEQFN+ +K V Y+HS ++CQFGCIE Y TAHYQSPRQMICLW+
Sbjct: 608 HKLRMFKMEDVSVGMWVEQFNNGTKPVDYIHSLRFCQFGCIENYLTAHYQSPRQMICLWD 667

Query: 662 KLLRQAKPECCNMR 665
           KL+   KP+CCNMR
Sbjct: 668 KLVLTGKPQCCNMR 681
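The HSP summary lines in the block above (score, E-value, identity, and positives) follow a regular layout, so they can be read programmatically. The sketch below is one possible parser under that assumption; `parse_hsp` and the two regular expressions are hypothetical and only cover the layout shown on this page. The percentages are recomputed from the raw counts rather than taken from the text.

```python
import re

# Patterns for the two summary lines of an HSP as printed on this page.
HSP_SCORE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
# Accept either spelling of the positives label.
HSP_COUNTS = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")


def parse_hsp(score_line: str, stats_line: str) -> dict:
    """Extract score, E-value, and identity/positive fractions from an HSP."""
    m = HSP_SCORE.search(score_line)
    if m is None:
        raise ValueError("score line not recognised")
    result = {
        "bit_score": float(m.group(1)),
        "raw_score": int(m.group(2)),
        "evalue": float(m.group(3)),
    }
    for label, num, den in HSP_COUNTS.findall(stats_line):
        key = "identity" if label == "Identity" else "positives"
        result[key + "_pct"] = round(100 * int(num) / int(den), 2)
        result["align_len"] = int(den)
    return result


hsp = parse_hsp(
    "HSP 1 Score: 936.8 bits (2420), Expect = 1.3e-271",
    "Identity = 447/674 (66.32%), Positives = 556/674 (82.49%), Query Frame = 0",
)
print(hsp)
```

Recomputing 447/674 and 556/674 reproduces the 66.32% and 82.49% printed for the GALT6 match.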

BLAST of CSPI01G08100 vs. ExPASy Swiss-Prot
Match: Q8RX55 (Hydroxyproline O-galactosyltransferase GALT5 OS=Arabidopsis thaliana OX=3702 GN=GALT5 PE=1 SV=1)

HSP 1 Score: 902.9 bits (2332), Expect = 2.2e-261
Identity = 439/672 (65.33%), Positives = 539/672 (80.21%), Query Frame = 0

Query: 5   KLEKVDMIVSFTRQRSIQILLIIGVLYLLLVSLEIPLVFRA-GSSVVSQDSLSRPSPLES 64
           K++K+D+  S  +QRS+++++ IG LYL++VS+EIPLVF++  SS V  D+LSR   L +
Sbjct: 11  KIDKIDLFSSLWKQRSVRVIMAIGFLYLVIVSVEIPLVFKSWSSSSVPLDALSRLEKLNN 70

Query: 65  EEDLEEREAPSRPLENISRNSLQPT--------PSRLNQFNK-IISGLALETEAFESRSE 124
           E++ +    P+ PLE +S     PT         +++ + ++ ++S L  ++E F+  S+
Sbjct: 71  EQEPQVEIIPNPPLEPVSYPVSNPTIVTRTDLVQNKVREHHRGVLSSLRFDSETFDPSSK 130

Query: 125 DAVSEFYRSAKIASEVGKKFWDELESGKSQHLEKKKAEKGSNSSCPHSISLSGNDFL-AH 184
           D   E ++SAK A ++G+K W ELESG+ + L  +K EK    SCPHS+SL+G++F+   
Sbjct: 131 DGSVELHKSAKEAWQLGRKLWKELESGRLEKL-VEKPEKNKPDSCPHSVSLTGSEFMNRE 190

Query: 185 GGVMMLPCGLTLGSHITLVGKPRVAQPESDPQITMVKNGEESVMVSQFIMELQGLNTVEG 244
             +M LPCGLTLGSHITLVG+PR A P         K G+ S +VSQF++ELQGL TVEG
Sbjct: 191 NKLMELPCGLTLGSHITLVGRPRKAHP---------KEGDWSKLVSQFVIELQGLKTVEG 250

Query: 245 EDPPRILHFNPRLKGDWSGKPVIELNTCYRMQWGSAHRCEGWKSKANEDTVDGQVKCEKW 304
           EDPPRILHFNPRLKGDWS KPVIE N+CYRMQWG A RCEGWKS+ +E+TVD  VKCEKW
Sbjct: 251 EDPPRILHFNPRLKGDWSKKPVIEQNSCYRMQWGPAQRCEGWKSRDDEETVDSHVKCEKW 310

Query: 305 IRDDEGNSERSKATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDGKH 364
           IRDD+  SE S+A WWLNRLIGR KR+ ++WP+PF E+KLFVLTLSAG EGYH+NVDGKH
Sbjct: 311 IRDDDNYSEGSRARWWLNRLIGRRKRVKVEWPFPFVEEKLFVLTLSAGLEGYHINVDGKH 370

Query: 365 IVSFPYRTGFALEDATGLSVIGDIDVQSVLAASLPQSHPSFAPQQHLEMSRRWQAPPLPD 424
           + SFPYRTGF LEDATGL+V GDIDV SV  ASLP SHPSFAPQ+HLE+S+RWQAP +PD
Sbjct: 371 VTSFPYRTGFTLEDATGLTVNGDIDVHSVFVASLPTSHPSFAPQRHLELSKRWQAPVVPD 430

Query: 425 GEIDLFIGILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHARKEVNVELKKEAE 484
           G +++FIGILSAGNHF+ERMAVRKSWM+H LI S+K+VARFFVALH RKEVNVELKKEAE
Sbjct: 431 GPVEIFIGILSAGNHFSERMAVRKSWMQHVLITSAKVVARFFVALHGRKEVNVELKKEAE 490

Query: 485 FFGDIVIVPYMDNYDLVVLKTVAICEHGVHAVSAKYIMKCDDDTFVKVDSIMNEIKSVSG 544
           +FGDIV+VPYMD+YDLVVLKTVAICEHG  A SAKYIMKCDDDTFVK+ +++NE+K V  
Sbjct: 491 YFGDIVLVPYMDSYDLVVLKTVAICEHGALAFSAKYIMKCDDDTFVKLGAVINEVKKVPE 550

Query: 545 TGSVYIGNINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIAQFVISNFERRK 604
             S+YIGN+NYYHKPLR GKWAVTYEEWPEEDYPPYANGPGY++SSDIA+F++  FER K
Sbjct: 551 GRSLYIGNMNYYHKPLRGGKWAVTYEEWPEEDYPPYANGPGYVLSSDIARFIVDKFERHK 610

Query: 605 LRLFKMEDVSMGMWVEQF-NSSKAVKYVHSFKYCQFGCIEEYSTAHYQSPRQMICLWNKL 664
           LRLFKMEDVS+GMWVE F N++  V Y HS ++CQFGC+E Y TAHYQSPRQMICLW+KL
Sbjct: 611 LRLFKMEDVSVGMWVEHFKNTTNPVDYRHSLRFCQFGCVENYYTAHYQSPRQMICLWDKL 670

BLAST of CSPI01G08100 vs. ExPASy Swiss-Prot
Match: Q8GXG6 (Hydroxyproline O-galactosyltransferase GALT4 OS=Arabidopsis thaliana OX=3702 GN=GALT4 PE=2 SV=2)

HSP 1 Score: 795.0 bits (2052), Expect = 6.3e-229
Identity = 406/683 (59.44%), Positives = 508/683 (74.38%), Query Frame = 0

Query: 5   KLEKVDMIVSFTRQRSIQILLIIGVLYLLLVSLEIPLVFRAGSSVVSQ--------DSLS 64
           K  K+D   S  R   +Q LL++ + Y L +S EIP +FR GS   S         D+L 
Sbjct: 2   KKSKLDNSSSQIRFGLVQFLLVVLLFYFLCMSFEIPFIFRTGSGSGSDDVSSSSFADALP 61

Query: 65  RPSPL--ESEE-----DLEEREAPSRPLENISRNSLQPTPSRLNQFNKIISGLALETEAF 124
           RP  +   S E       EE   P R  ++  R  L+    ++ +F K +S + +    F
Sbjct: 62  RPMVVGGGSREANWVVGEEEEADPHRHFKDPGRVQLRLPERKMREF-KSVSEIFVNESFF 121

Query: 125 ESRS-EDAVSEFYRSAKIASEVGKKFWDELESGKSQHLEKKKAEKGSNSSCPHSISLSGN 184
           ++    D  S F+++AK A  +G+K WD L+SG  +    K   K     CP  +S+S +
Sbjct: 122 DNGGFSDEFSIFHKTAKHAISMGRKMWDGLDSGLIK--PDKAPVKTRIEKCPDMVSVSES 181

Query: 185 DFLAHGGVMMLPCGLTLGSHITLVGKPRVAQPESDPQITMVKNGEESVMVSQFIMELQGL 244
           +F+    +++LPCGLTLGSHIT+V  P  A  E        K+G+++ MVSQF+MELQGL
Sbjct: 182 EFVNRSRILVLPCGLTLGSHITVVATPHWAHVE--------KDGDKTAMVSQFMMELQGL 241

Query: 245 NTVEGEDPPRILHFNPRLKGDWSGKPVIELNTCYRMQWGSAHRCEGWKSKANEDTVDGQV 304
             V+GEDPPRILHFNPR+KGDWSG+PVIE NTCYRMQWGS  RC+G +S  +E+ VDG+V
Sbjct: 242 KAVDGEDPPRILHFNPRIKGDWSGRPVIEQNTCYRMQWGSGLRCDGRESSDDEEYVDGEV 301

Query: 305 KCEKWIRDDE--GNS----ERSKATWWLNRLIGRTKRM-DIDWPYPFAEDKLFVLTLSAG 364
           KCE+W RDD+  GN+    + SK TWWLNRL+GR K+M   DW YPFAE KLFVLTL AG
Sbjct: 302 KCERWKRDDDDGGNNGDDFDESKKTWWLNRLMGRRKKMITHDWDYPFAEGKLFVLTLRAG 361

Query: 365 FEGYHVNVDGKHIVSFPYRTGFALEDATGLSVIGDIDVQSVLAASLPQSHPSFAPQQHLE 424
            EGYH++V+G+HI SFPYRTGF LEDATGL+V G+IDV SV AASLP ++PSFAPQ+HLE
Sbjct: 362 MEGYHISVNGRHITSFPYRTGFVLEDATGLAVKGNIDVHSVYAASLPSTNPSFAPQKHLE 421

Query: 425 MSRRWQAPPLPDGEIDLFIGILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHAR 484
           M R W+AP LP   ++LFIGILSAGNHFAERMAVRKSWM+ KL+RSSK+VARFFVALHAR
Sbjct: 422 MQRIWKAPSLPQKPVELFIGILSAGNHFAERMAVRKSWMQQKLVRSSKVVARFFVALHAR 481

Query: 485 KEVNVELKKEAEFFGDIVIVPYMDNYDLVVLKTVAICEHGVHAVSAKYIMKCDDDTFVKV 544
           KEVNV+LKKEAE+FGDIVIVPYMD+YDLVVLKTVAICE+GV+ V+AKY+MKCDDDTFV+V
Sbjct: 482 KEVNVDLKKEAEYFGDIVIVPYMDHYDLVVLKTVAICEYGVNTVAAKYVMKCDDDTFVRV 541

Query: 545 DSIMNEIKSVSGTGSVYIGNINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDI 604
           D+++ E + V G  S+YIGNIN+ HKPLR GKWAVT+EEWPEE YPPYANGPGYI+S D+
Sbjct: 542 DAVIQEAEKVKGRESLYIGNINFNHKPLRTGKWAVTFEEWPEEYYPPYANGPGYILSYDV 601

Query: 605 AQFVISNFERRKLRLFKMEDVSMGMWVEQFNSSKAVKYVHSFKYCQFGCIEEYSTAHYQS 664
           A+F++ +FE+++LRLFKMEDVSMGMWVE+FN ++ V  VHS K+CQFGCIE+Y TAHYQS
Sbjct: 602 AKFIVDDFEQKRLRLFKMEDVSMGMWVEKFNETRPVAVVHSLKFCQFGCIEDYFTAHYQS 661

BLAST of CSPI01G08100 vs. ExPASy Swiss-Prot
Match: A7XDQ9 (Hydroxyproline O-galactosyltransferase GALT2 OS=Arabidopsis thaliana OX=3702 GN=GALT2 PE=1 SV=1)

HSP 1 Score: 659.4 bits (1700), Expect = 4.2e-188
Identity = 344/685 (50.22%), Positives = 457/685 (66.72%), Query Frame = 0

Query: 1   MKRGKLEKVDMIVSFTRQRSIQILLIIGVLYLLLVSLEIP----LVFRAGSSVVSQDSLS 60
           MKR K E    + S  R +    LL I   YL+ ++ + P    +V           +LS
Sbjct: 1   MKRVKSESFRGVYSSRRFKLSHFLLAIAGFYLVFLAFKFPHFIEMVAMLSGDTGLDGALS 60

Query: 61  RPSPLESEEDLEEREAPSRPLENISRNSLQPTPSRLNQFNKIISGLALET---------- 120
             S   S       +  +R LE+    S   T  +++   KI     ++           
Sbjct: 61  DTSLDVSLSGSLRNDMLNRKLEDEDHQSGPSTTQKVSPEEKINGSKQIQPLLFRYGRISG 120

Query: 121 EAFESRSEDA-VSEFYRSAKIASEVGKKFWDELESGKSQHL-EKKKAEKGSNSSCPHSIS 180
           E    R+    +S F R A  A  +G K W++++  +   + E     +G   SCP  IS
Sbjct: 121 EVMRRRNRTIHMSPFERMADEAWILGSKAWEDVDKFEVDKINESASIFEGKVESCPSQIS 180

Query: 181 LSGNDFLAHGGVMMLPCGLTLGSHITLVGKPRVAQPESDPQITMVKNGEESVMVSQFIME 240
           ++G+D      +M+LPCGL  GS IT++G P+ A  ES PQ + +      V+VSQF++E
Sbjct: 181 MNGDDLNKANRIMLLPCGLAAGSSITILGTPQYAHKESVPQRSRLTRSYGMVLVSQFMVE 240

Query: 241 LQGLNTVEGEDPPRILHFNPRLKGDWSGKPVIELNTCYRMQWGSAHRCEGWKSKANEDT- 300
           LQGL T +GE PP+ILH NPR+KGDW+ +PVIE NTCYRMQWG A RC+G  SK + D  
Sbjct: 241 LQGLKTGDGEYPPKILHLNPRIKGDWNHRPVIEHNTCYRMQWGVAQRCDGTPSKKDADVL 300

Query: 301 VDGQVKCEKWIRD---DEGNSERSKATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSA 360
           VDG  +CEKW ++   D  +S+ SK T W  R IGR ++ ++ W +PFAE K+FVLTL A
Sbjct: 301 VDGFRRCEKWTQNDIIDMVDSKESKTTSWFKRFIGREQKPEVTWSFPFAEGKVFVLTLRA 360

Query: 361 GFEGYHVNVDGKHIVSFPYRTGFALEDATGLSVIGDIDVQSVLAASLPQSHPSFAPQQHL 420
           G +G+H+NV G+H+ SFPYR GF +EDATGL+V GD+D+ S+ A SL  SHPSF+PQ+ +
Sbjct: 361 GIDGFHINVGGRHVSSFPYRPGFTIEDATGLAVTGDVDIHSIHATSLSTSHPSFSPQKAI 420

Query: 421 EMSRRWQAPPLPDGEIDLFIGILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHA 480
           E S  W+APPLP     LF+G+LSA NHF+ERMAVRK+WM+H  I+SS +VARFFVAL+ 
Sbjct: 421 EFSSEWKAPPLPGTPFRLFMGVLSATNHFSERMAVRKTWMQHPSIKSSDVVARFFVALNP 480

Query: 481 RKEVNVELKKEAEFFGDIVIVPYMDNYDLVVLKTVAICEHGVHAVSAKYIMKCDDDTFVK 540
           RKEVN  LKKEAE+FGDIVI+P+MD Y+LVVLKT+AICE GV  V+A YIMKCDDDTF++
Sbjct: 481 RKEVNAMLKKEAEYFGDIVILPFMDRYELVVLKTIAICEFGVQNVTAPYIMKCDDDTFIR 540

Query: 541 VDSIMNEIKSVSGTGSVYIGNINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSD 600
           V+SI+ +I  VS   S+Y+GN+N  H+PLR GKW VT+EEWPE  YPPYANGPGYI+SS+
Sbjct: 541 VESILKQIDGVSPEKSLYMGNLNLRHRPLRTGKWTVTWEEWPEAVYPPYANGPGYIISSN 600

Query: 601 IAQFVISNFERRKLRLFKMEDVSMGMWVEQFNSS-KAVKYVHSFKYCQFGCIEEYSTAHY 660
           IA++++S   R KLRLFKMEDVSMG+WVEQFN+S + V+Y HS+K+CQ+GC   Y TAHY
Sbjct: 601 IAKYIVSQNSRHKLRLFKMEDVSMGLWVEQFNASMQPVEYSHSWKFCQYGCTLNYYTAHY 660

Query: 661 QSPRQMICLWNKLLRQAKPECCNMR 665
           QSP QM+CLW+ LL+  +P+CCN R
Sbjct: 661 QSPSQMMCLWDNLLK-GRPQCCNFR 684

BLAST of CSPI01G08100 vs. ExPASy Swiss-Prot
Match: Q8L7F9 (Beta-1,3-galactosyltransferase GALT1 OS=Arabidopsis thaliana OX=3702 GN=GALT1 PE=1 SV=1)

HSP 1 Score: 318.5 bits (815), Expect = 1.7e-85
Identity = 193/545 (35.41%), Positives = 284/545 (52.11%), Query Frame = 0

Query: 125 KIASEVGKKFWDELESGKSQHLEKKKAEKGSNSSCPHSIS-LSGNDFLAHGGVMMLPCGL 184
           K A  V +     +E+ K   + + +  KG    CP  +S ++  +       + +PCGL
Sbjct: 120 KEAGIVWESLVSAVEAKKLVDVNENQTRKGKEELCPQFLSKMNATEADGSSLKLQIPCGL 179

Query: 185 TLGSHITLVGKPRVAQPESDPQITMVKNGEESVMVSQFIMELQGLNTVEGEDPPRILHFN 244
           T GS IT++G P                     +V  F ++L G       DPP I+H+N
Sbjct: 180 TQGSSITVIGIP-------------------DGLVGSFRIDLTGQPLPGEPDPPIIVHYN 239

Query: 245 PRLKGDWSGK-PVIELNTCYRMQ-WGSAHRCEGWKSKANEDTVDGQVKCEKWIRDDEGNS 304
            RL GD S + PVI  N+    Q WG+  RC  +    N+  VD   +C K +  +   +
Sbjct: 240 VRLLGDKSTEDPVIVQNSWTASQDWGAEERCPKFDPDMNK-KVDDLDECNKMVGGEINRT 299

Query: 305 ERSKATWWLNRLIGRTKRMDIDWPY-PFAEDKLFVLTLSAGFEGYHVNVDGKHIVSFPYR 364
             +      +R +   +       Y PF +  L V TL  G EG  + VDGKHI SF +R
Sbjct: 300 SSTSLQSNTSRGVPVAREASKHEKYFPFKQGFLSVATLRVGTEGMQMTVDGKHITSFAFR 359

Query: 365 TGFALEDATGLSVIGDIDVQSVLAASLPQSHPSFAPQQHLEMSRRWQAPPL-PDGEIDLF 424
                   + + + GD  + S+LA+ LP S  S    +H+      ++P L P   +DL 
Sbjct: 360 DTLEPWLVSEIRITGDFRLISILASGLPTSEES----EHVVDLEALKSPTLSPLRPLDLV 419

Query: 425 IGILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHARKEVNVELKKEAEFFGDIV 484
           IG+ S  N+F  RMAVR++WM++  +RS ++  RFFV LH    VN+EL  EA  +GD+ 
Sbjct: 420 IGVFSTANNFKRRMAVRRTWMQYDDVRSGRVAVRFFVGLHKSPLVNLELWNEARTYGDVQ 479

Query: 485 IVPYMDNYDLVVLKTVAICEHGVHAVSAKYIMKCDDDTFVKVDSIMNEIKSVSGTGSVYI 544
           ++P++D Y L+  KT+AIC  G    SAK+IMK DDD FV+VD ++  +   + T  +  
Sbjct: 480 LMPFVDYYSLISWKTLAICIFGTEVDSAKFIMKTDDDAFVRVDEVLLSLSMTNNTRGLIY 539

Query: 545 GNINYYHKPLRY--GKWAVTYEEWPEEDYPPYANGPGYIVSSDIAQFVISNFERRKLRLF 604
           G IN   +P+R    KW ++YEEWPEE YPP+A+GPGYIVS DIA+ V   F+   L++F
Sbjct: 540 GLINSDSQPIRNPDSKWYISYEEWPEEKYPPWAHGPGYIVSRDIAESVGKLFKEGNLKMF 599

Query: 605 KMEDVSMGMWVEQFNS-SKAVKYVHSFKYCQFGCIEEYSTAHYQSPRQMICLWNKLLRQA 662
           K+EDV+MG+W+ +         Y +  +    GC + Y  AHYQSP +M CLW K     
Sbjct: 600 KLEDVAMGIWIAELTKHGLEPHYENDGRIISDGCKDGYVVAHYQSPAEMTCLWRKYQETK 640
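The five Swiss-Prot matches above can be compared side by side. The sketch below tabulates the accession, gene name, bit score, and identity counts copied from the HSP summaries on this page and ranks them by bit score; the `Hit` class and `rank_hits` function are hypothetical names introduced only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hit:
    """One Swiss-Prot match, with values copied from the HSP summaries."""
    accession: str
    gene: str
    bit_score: float
    identical: int
    align_len: int

    @property
    def identity_pct(self) -> float:
        return round(100 * self.identical / self.align_len, 2)


HITS = [
    Hit("Q8L7F9", "GALT1", 318.5, 193, 545),
    Hit("A7XDQ9", "GALT2", 659.4, 344, 685),
    Hit("Q8GXG6", "GALT4", 795.0, 406, 683),
    Hit("Q8RX55", "GALT5", 902.9, 439, 672),
    Hit("Q9LV16", "GALT6", 936.8, 447, 674),
]


def rank_hits(hits: list[Hit]) -> list[Hit]:
    """Return hits sorted from highest to lowest bit score."""
    return sorted(hits, key=lambda h: h.bit_score, reverse=True)


ranked = rank_hits(HITS)
for h in ranked:
    print(f"{h.accession}\t{h.gene}\t{h.bit_score:7.1f}\t{h.identity_pct:6.2f}%")
```

Ranked this way, the Arabidopsis GALT6 entry (Q9LV16) scores highest, which is consistent with the Description field of this gene.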

BLAST of CSPI01G08100 vs. ExPASy TrEMBL
Match: A0A0A0LQS4 (Galectin domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G045680 PE=3 SV=1)

HSP 1 Score: 1339.3 bits (3465), Expect = 0.0e+00
Identity = 663/664 (99.85%), Positives = 664/664 (100.00%), Query Frame = 0

Query: 1   MKRGKLEKVDMIVSFTRQRSIQILLIIGVLYLLLVSLEIPLVFRAGSSVVSQDSLSRPSP 60
           MKRGKLEKVDMIVSFTRQRSIQILLIIGVLYLLLVSLEIPLVFRAGSSVVSQDSLSRPSP
Sbjct: 1   MKRGKLEKVDMIVSFTRQRSIQILLIIGVLYLLLVSLEIPLVFRAGSSVVSQDSLSRPSP 60

Query: 61  LESEEDLEEREAPSRPLENISRNSLQPTPSRLNQFNKIISGLALETEAFESRSEDAVSEF 120
           LESEEDLEEREAPSRPLENISRNSLQPTPSRLNQFNKIISGLALETEAFESRSEDA+SEF
Sbjct: 61  LESEEDLEEREAPSRPLENISRNSLQPTPSRLNQFNKIISGLALETEAFESRSEDAISEF 120

Query: 121 YRSAKIASEVGKKFWDELESGKSQHLEKKKAEKGSNSSCPHSISLSGNDFLAHGGVMMLP 180
           YRSAKIASEVGKKFWDELESGKSQHLEKKKAEKGSNSSCPHSISLSGNDFLAHGGVMMLP
Sbjct: 121 YRSAKIASEVGKKFWDELESGKSQHLEKKKAEKGSNSSCPHSISLSGNDFLAHGGVMMLP 180

Query: 181 CGLTLGSHITLVGKPRVAQPESDPQITMVKNGEESVMVSQFIMELQGLNTVEGEDPPRIL 240
           CGLTLGSHITLVGKPRVAQPESDPQITMVKNGEESVMVSQFIMELQGLNTVEGEDPPRIL
Sbjct: 181 CGLTLGSHITLVGKPRVAQPESDPQITMVKNGEESVMVSQFIMELQGLNTVEGEDPPRIL 240

Query: 241 HFNPRLKGDWSGKPVIELNTCYRMQWGSAHRCEGWKSKANEDTVDGQVKCEKWIRDDEGN 300
           HFNPRLKGDWSGKPVIELNTCYRMQWGSAHRCEGWKSKANEDTVDGQVKCEKWIRDDEGN
Sbjct: 241 HFNPRLKGDWSGKPVIELNTCYRMQWGSAHRCEGWKSKANEDTVDGQVKCEKWIRDDEGN 300

Query: 301 SERSKATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDGKHIVSFPYR 360
           SERSKATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDGKHIVSFPYR
Sbjct: 301 SERSKATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDGKHIVSFPYR 360

Query: 361 TGFALEDATGLSVIGDIDVQSVLAASLPQSHPSFAPQQHLEMSRRWQAPPLPDGEIDLFI 420
           TGFALEDATGLSVIGDIDVQSVLAASLPQSHPSFAPQQHLEMSRRWQAPPLPDGEIDLFI
Sbjct: 361 TGFALEDATGLSVIGDIDVQSVLAASLPQSHPSFAPQQHLEMSRRWQAPPLPDGEIDLFI 420

Query: 421 GILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHARKEVNVELKKEAEFFGDIVI 480
           GILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHARKEVNVELKKEAEFFGDIVI
Sbjct: 421 GILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHARKEVNVELKKEAEFFGDIVI 480

Query: 481 VPYMDNYDLVVLKTVAICEHGVHAVSAKYIMKCDDDTFVKVDSIMNEIKSVSGTGSVYIG 540
           VPYMDNYDLVVLKTVAICEHGVHAVSAKYIMKCDDDTFVKVDSIMNEIKSVSGTGSVYIG
Sbjct: 481 VPYMDNYDLVVLKTVAICEHGVHAVSAKYIMKCDDDTFVKVDSIMNEIKSVSGTGSVYIG 540

Query: 541 NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIAQFVISNFERRKLRLFKME 600
           NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIAQFVISNFERRKLRLFKME
Sbjct: 541 NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIAQFVISNFERRKLRLFKME 600

Query: 601 DVSMGMWVEQFNSSKAVKYVHSFKYCQFGCIEEYSTAHYQSPRQMICLWNKLLRQAKPEC 660
           DVSMGMWVEQFNSSKAVKYVHSFKYCQFGCIEEYSTAHYQSPRQMICLWNKLLRQAKPEC
Sbjct: 601 DVSMGMWVEQFNSSKAVKYVHSFKYCQFGCIEEYSTAHYQSPRQMICLWNKLLRQAKPEC 660

Query: 661 CNMR 665
           CNMR
Sbjct: 661 CNMR 664

BLAST of CSPI01G08100 vs. ExPASy TrEMBL
Match: A0A5D3BK65 (Hydroxyproline O-galactosyltransferase GALT6 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G003260 PE=3 SV=1)

HSP 1 Score: 1301.6 bits (3367), Expect = 0.0e+00
Identity = 642/664 (96.69%), Positives = 651/664 (98.04%), Query Frame = 0

Query: 1   MKRGKLEKVDMIVSFTRQRSIQILLIIGVLYLLLVSLEIPLVFRAGSSVVSQDSLSRPSP 60
           MKRGKL+KVD+IVSFTRQRSIQILL+IGVLYLLLVSLEIPLVFRAGSSVVSQDSLSRPSP
Sbjct: 1   MKRGKLDKVDIIVSFTRQRSIQILLLIGVLYLLLVSLEIPLVFRAGSSVVSQDSLSRPSP 60

Query: 61  LESEEDLEEREAPSRPLENISRNSLQPTPSRLNQFNKIISGLALETEAFESRSEDAVSEF 120
           LESEEDLEEREAPSRPLENISRNSLQPTPSRLNQFNKIIS LALETEAFESRS+DAVSEF
Sbjct: 61  LESEEDLEEREAPSRPLENISRNSLQPTPSRLNQFNKIISSLALETEAFESRSDDAVSEF 120

Query: 121 YRSAKIASEVGKKFWDELESGKSQHLEKKKAEKGSNSSCPHSISLSGNDFLAHGGVMMLP 180
           YRSAKIASEVGKKFWDELESGKSQHLEKKKAEKGSNSSCPHSISLSG DFLAHG VMMLP
Sbjct: 121 YRSAKIASEVGKKFWDELESGKSQHLEKKKAEKGSNSSCPHSISLSGKDFLAHGRVMMLP 180

Query: 181 CGLTLGSHITLVGKPRVAQPESDPQITMVKNGEESVMVSQFIMELQGLNTVEGEDPPRIL 240
           CGLTLGSHITLVGKPRVAQPE DPQITMVKNGEESVMVSQFIMELQGLN VEGEDPPRI 
Sbjct: 181 CGLTLGSHITLVGKPRVAQPEYDPQITMVKNGEESVMVSQFIMELQGLNAVEGEDPPRIF 240

Query: 241 HFNPRLKGDWSGKPVIELNTCYRMQWGSAHRCEGWKSKANEDTVDGQVKCEKWIRDDEGN 300
           HFNPRLKGDWSGKPVIE+NTCYRMQWGSAHRCEGWKSKANED VDGQVKCEKWIRDDEGN
Sbjct: 241 HFNPRLKGDWSGKPVIEMNTCYRMQWGSAHRCEGWKSKANEDAVDGQVKCEKWIRDDEGN 300

Query: 301 SERSKATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDGKHIVSFPYR 360
            E+SKATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDGKHIVSFPYR
Sbjct: 301 EEQSKATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDGKHIVSFPYR 360

Query: 361 TGFALEDATGLSVIGDIDVQSVLAASLPQSHPSFAPQQHLEMSRRWQAPPLPDGEIDLFI 420
           TGFALEDATGLSVIGDIDVQSVLAASLP+SHPSFAPQQHLEMS RWQAPPLPDGE+DLFI
Sbjct: 361 TGFALEDATGLSVIGDIDVQSVLAASLPRSHPSFAPQQHLEMSTRWQAPPLPDGEVDLFI 420

Query: 421 GILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHARKEVNVELKKEAEFFGDIVI 480
           GILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHARKEVNVELKKEAEFFGDIVI
Sbjct: 421 GILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHARKEVNVELKKEAEFFGDIVI 480

Query: 481 VPYMDNYDLVVLKTVAICEHGVHAVSAKYIMKCDDDTFVKVDSIMNEIKSVSGTGSVYIG 540
           VPYMDNYDLVVLKTVAICEHGVHAV AKYIMKCDDDTFVKVDSIMNEIK VSGTGSVYIG
Sbjct: 481 VPYMDNYDLVVLKTVAICEHGVHAVPAKYIMKCDDDTFVKVDSIMNEIKRVSGTGSVYIG 540

Query: 541 NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIAQFVISNFERRKLRLFKME 600
           NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIAQFV S+FERRKLRLFKME
Sbjct: 541 NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIAQFVTSDFERRKLRLFKME 600

Query: 601 DVSMGMWVEQFNSSKAVKYVHSFKYCQFGCIEEYSTAHYQSPRQMICLWNKLLRQAKPEC 660
           DVSMGMWVEQFNSSKAVKYVHSFKYCQFGCIEEY TAHYQSPRQMICLWNKLLRQAKPEC
Sbjct: 601 DVSMGMWVEQFNSSKAVKYVHSFKYCQFGCIEEYYTAHYQSPRQMICLWNKLLRQAKPEC 660

Query: 661 CNMR 665
           CNMR
Sbjct: 661 CNMR 664

BLAST of CSPI01G08100 vs. ExPASy TrEMBL
Match: A0A1S3AVE3 (hydroxyproline O-galactosyltransferase GALT6 OS=Cucumis melo OX=3656 GN=LOC103483099 PE=3 SV=1)

HSP 1 Score: 1301.6 bits (3367), Expect = 0.0e+00
Identity = 642/664 (96.69%), Positives = 651/664 (98.04%), Query Frame = 0

Query: 1   MKRGKLEKVDMIVSFTRQRSIQILLIIGVLYLLLVSLEIPLVFRAGSSVVSQDSLSRPSP 60
           MKRGKL+KVD+IVSFTRQRSIQILL+IGVLYLLLVSLEIPLVFRAGSSVVSQDSLSRPSP
Sbjct: 1   MKRGKLDKVDIIVSFTRQRSIQILLLIGVLYLLLVSLEIPLVFRAGSSVVSQDSLSRPSP 60

Query: 61  LESEEDLEEREAPSRPLENISRNSLQPTPSRLNQFNKIISGLALETEAFESRSEDAVSEF 120
           LESEEDLEEREAPSRPLENISRNSLQPTPSRLNQFNKIIS LALETEAFESRS+DAVSEF
Sbjct: 61  LESEEDLEEREAPSRPLENISRNSLQPTPSRLNQFNKIISSLALETEAFESRSDDAVSEF 120

Query: 121 YRSAKIASEVGKKFWDELESGKSQHLEKKKAEKGSNSSCPHSISLSGNDFLAHGGVMMLP 180
           YRSAKIASEVGKKFWDELESGKSQHLEKKKAEKGSNSSCPHSISLSG DFLAHG VMMLP
Sbjct: 121 YRSAKIASEVGKKFWDELESGKSQHLEKKKAEKGSNSSCPHSISLSGKDFLAHGRVMMLP 180

Query: 181 CGLTLGSHITLVGKPRVAQPESDPQITMVKNGEESVMVSQFIMELQGLNTVEGEDPPRIL 240
           CGLTLGSHITLVGKPRVAQPE DPQITMVKNGEESVMVSQFIMELQGLN VEGEDPPRI 
Sbjct: 181 CGLTLGSHITLVGKPRVAQPEYDPQITMVKNGEESVMVSQFIMELQGLNAVEGEDPPRIF 240

Query: 241 HFNPRLKGDWSGKPVIELNTCYRMQWGSAHRCEGWKSKANEDTVDGQVKCEKWIRDDEGN 300
           HFNPRLKGDWSGKPVIE+NTCYRMQWGSAHRCEGWKSKANED VDGQVKCEKWIRDDEGN
Sbjct: 241 HFNPRLKGDWSGKPVIEMNTCYRMQWGSAHRCEGWKSKANEDAVDGQVKCEKWIRDDEGN 300

Query: 301 SERSKATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDGKHIVSFPYR 360
            E+SKATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDGKHIVSFPYR
Sbjct: 301 EEQSKATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDGKHIVSFPYR 360

Query: 361 TGFALEDATGLSVIGDIDVQSVLAASLPQSHPSFAPQQHLEMSRRWQAPPLPDGEIDLFI 420
           TGFALEDATGLSVIGDIDVQSVLAASLP+SHPSFAPQQHLEMS RWQAPPLPDGE+DLFI
Sbjct: 361 TGFALEDATGLSVIGDIDVQSVLAASLPRSHPSFAPQQHLEMSTRWQAPPLPDGEVDLFI 420

Query: 421 GILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHARKEVNVELKKEAEFFGDIVI 480
           GILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHARKEVNVELKKEAEFFGDIVI
Sbjct: 421 GILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHARKEVNVELKKEAEFFGDIVI 480

Query: 481 VPYMDNYDLVVLKTVAICEHGVHAVSAKYIMKCDDDTFVKVDSIMNEIKSVSGTGSVYIG 540
           VPYMDNYDLVVLKTVAICEHGVHAV AKYIMKCDDDTFVKVDSIMNEIK VSGTGSVYIG
Sbjct: 481 VPYMDNYDLVVLKTVAICEHGVHAVPAKYIMKCDDDTFVKVDSIMNEIKRVSGTGSVYIG 540

Query: 541 NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIAQFVISNFERRKLRLFKME 600
           NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIAQFV S+FERRKLRLFKME
Sbjct: 541 NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIAQFVTSDFERRKLRLFKME 600

Query: 601 DVSMGMWVEQFNSSKAVKYVHSFKYCQFGCIEEYSTAHYQSPRQMICLWNKLLRQAKPEC 660
           DVSMGMWVEQFNSSKAVKYVHSFKYCQFGCIEEY TAHYQSPRQMICLWNKLLRQAKPEC
Sbjct: 601 DVSMGMWVEQFNSSKAVKYVHSFKYCQFGCIEEYYTAHYQSPRQMICLWNKLLRQAKPEC 660

Query: 661 CNMR 665
           CNMR
Sbjct: 661 CNMR 664

BLAST of CSPI01G08100 vs. ExPASy TrEMBL
Match: A0A6J1GRU4 (hydroxyproline O-galactosyltransferase GALT6-like OS=Cucurbita moschata OX=3662 GN=LOC111456938 PE=3 SV=1)

HSP 1 Score: 1238.4 bits (3203), Expect = 0.0e+00
Identity = 609/664 (91.72%), Positives = 633/664 (95.33%), Query Frame = 0

Query: 1   MKRGKLEKVDMIVSFTRQRSIQILLIIGVLYLLLVSLEIPLVFRAGSSVVSQDSLSRPSP 60
           MKR K EKVDMIVS TRQRSIQILL IG LYLLLVSLEIPLVFR GS VVS DSLSRP P
Sbjct: 1   MKRPKSEKVDMIVSLTRQRSIQILLFIGFLYLLLVSLEIPLVFRVGSGVVSPDSLSRPPP 60

Query: 61  LESEEDLEEREAPSRPLENISRNSLQPTPSRLNQFNKIISGLALETEAFESRSEDAVSEF 120
           LESEEDLEEREAPSRPLENISRNSLQPTPSRL QFNKIISGLALETEAFES  EDAVSEF
Sbjct: 61  LESEEDLEEREAPSRPLENISRNSLQPTPSRLTQFNKIISGLALETEAFESGLEDAVSEF 120

Query: 121 YRSAKIASEVGKKFWDELESGKSQHLEKKKAEKGSNSSCPHSISLSGNDFLAHGGVMMLP 180
           YRSAKIASEVGKKFWDELESGK  H+ KKKAEK SNSSCPHSISLSG +FLAHGGVM+LP
Sbjct: 121 YRSAKIASEVGKKFWDELESGKIHHIAKKKAEKESNSSCPHSISLSGVEFLAHGGVMLLP 180

Query: 181 CGLTLGSHITLVGKPRVAQPESDPQITMVKNGEESVMVSQFIMELQGLNTVEGEDPPRIL 240
           CGLTLGSHITLVGKPRVA PE DPQITMV+NGEESVMVSQFI+ELQGLNTVEGEDPPRIL
Sbjct: 181 CGLTLGSHITLVGKPRVAHPEYDPQITMVRNGEESVMVSQFILELQGLNTVEGEDPPRIL 240

Query: 241 HFNPRLKGDWSGKPVIELNTCYRMQWGSAHRCEGWKSKANEDTVDGQVKCEKWIRDDEGN 300
           HFNPRLKGDWSG+PVIELNTCYRMQWGSA RCEGWKSKANE+TVDGQVKCEKWIRDDEG+
Sbjct: 241 HFNPRLKGDWSGRPVIELNTCYRMQWGSAVRCEGWKSKANEETVDGQVKCEKWIRDDEGH 300

Query: 301 SERSKATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDGKHIVSFPYR 360
           SE+SKATWWLNRLIGRTKRMDIDWP+PFAEDKLFVLTLSAGFEGYHV VDGKH++SFPYR
Sbjct: 301 SEQSKATWWLNRLIGRTKRMDIDWPFPFAEDKLFVLTLSAGFEGYHVTVDGKHVISFPYR 360

Query: 361 TGFALEDATGLSVIGDIDVQSVLAASLPQSHPSFAPQQHLEMSRRWQAPPLPDGEIDLFI 420
           TGFALEDATGLS IGDIDVQSVLAASLP+SHPSFAPQQHLE SRRWQAP LPDGE+DLFI
Sbjct: 361 TGFALEDATGLSAIGDIDVQSVLAASLPRSHPSFAPQQHLEKSRRWQAPLLPDGEVDLFI 420

Query: 421 GILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHARKEVNVELKKEAEFFGDIVI 480
           GILSAGNHFAERMAVRKSWM+HKLI+SS+IVARFFVALHARKEVNVELKKEAEFFGDIVI
Sbjct: 421 GILSAGNHFAERMAVRKSWMQHKLIKSSRIVARFFVALHARKEVNVELKKEAEFFGDIVI 480

Query: 481 VPYMDNYDLVVLKTVAICEHGVHAVSAKYIMKCDDDTFVKVDSIMNEIKSVSGTGSVYIG 540
           VPYMDNYDLVVLKTVAICE+GV AVSAKYIMKCDDDTFVKVDS+MNE++SV+  GSVYIG
Sbjct: 481 VPYMDNYDLVVLKTVAICEYGVRAVSAKYIMKCDDDTFVKVDSVMNEVRSVARAGSVYIG 540

Query: 541 NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIAQFVISNFERRKLRLFKME 600
           NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIA+FVISNFERRKLRLFKME
Sbjct: 541 NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIARFVISNFERRKLRLFKME 600

Query: 601 DVSMGMWVEQFNSSKAVKYVHSFKYCQFGCIEEYSTAHYQSPRQMICLWNKLLRQAKPEC 660
           DVSMGMWVEQFNSSK VKYVHSFKYCQFGCIEEY TAHYQSPRQMICLWNKL R+ KPEC
Sbjct: 601 DVSMGMWVEQFNSSKTVKYVHSFKYCQFGCIEEYHTAHYQSPRQMICLWNKLQRRVKPEC 660

Query: 661 CNMR 665
           CNMR
Sbjct: 661 CNMR 664

BLAST of CSPI01G08100 vs. ExPASy TrEMBL
Match: A0A6J1K1N1 (hydroxyproline O-galactosyltransferase GALT6-like OS=Cucurbita maxima OX=3661 GN=LOC111490256 PE=3 SV=1)

HSP 1 Score: 1236.9 bits (3199), Expect = 0.0e+00
Identity = 606/664 (91.27%), Positives = 634/664 (95.48%), Query Frame = 0

Query: 1   MKRGKLEKVDMIVSFTRQRSIQILLIIGVLYLLLVSLEIPLVFRAGSSVVSQDSLSRPSP 60
           MKR K EKVDMIVS TRQRSIQILL IG LYLLLVSLEIPLVFR GS VVS DSLSRP P
Sbjct: 1   MKRPKSEKVDMIVSLTRQRSIQILLFIGFLYLLLVSLEIPLVFRVGSGVVSPDSLSRPPP 60

Query: 61  LESEEDLEEREAPSRPLENISRNSLQPTPSRLNQFNKIISGLALETEAFESRSEDAVSEF 120
           LESEEDLEEREAPSRPLENISRNSLQPTPSRL QFNKIISGLALETEAFES  EDA+SEF
Sbjct: 61  LESEEDLEEREAPSRPLENISRNSLQPTPSRLTQFNKIISGLALETEAFESGLEDAISEF 120

Query: 121 YRSAKIASEVGKKFWDELESGKSQHLEKKKAEKGSNSSCPHSISLSGNDFLAHGGVMMLP 180
           YRSAKIASEVGKKFWDELESGK  H+ KKKAEK SNSSCPHSISLSG +FLAHGGVM+LP
Sbjct: 121 YRSAKIASEVGKKFWDELESGKIHHIAKKKAEKESNSSCPHSISLSGAEFLAHGGVMLLP 180

Query: 181 CGLTLGSHITLVGKPRVAQPESDPQITMVKNGEESVMVSQFIMELQGLNTVEGEDPPRIL 240
           CGLTLGSHITLVGKPRVA PE DPQITMV+NGEESVMVSQFI+ELQGLNTVEGEDPPRIL
Sbjct: 181 CGLTLGSHITLVGKPRVAHPEYDPQITMVRNGEESVMVSQFILELQGLNTVEGEDPPRIL 240

Query: 241 HFNPRLKGDWSGKPVIELNTCYRMQWGSAHRCEGWKSKANEDTVDGQVKCEKWIRDDEGN 300
           HFNPRLKGDWSG+PVIELNTCYRMQWGSA RCEGWKSKANE+TVDGQVKCEKW+RDDEG+
Sbjct: 241 HFNPRLKGDWSGRPVIELNTCYRMQWGSAVRCEGWKSKANEETVDGQVKCEKWMRDDEGH 300

Query: 301 SERSKATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDGKHIVSFPYR 360
           SE+SKATWWLNRLIGRTKRMDIDWP+PFAEDKLFVLTLSAGFEGYHV VDGKH++SFPYR
Sbjct: 301 SEQSKATWWLNRLIGRTKRMDIDWPFPFAEDKLFVLTLSAGFEGYHVTVDGKHVISFPYR 360

Query: 361 TGFALEDATGLSVIGDIDVQSVLAASLPQSHPSFAPQQHLEMSRRWQAPPLPDGEIDLFI 420
           TGFALEDATGLS IGDIDVQSVLAASLP+SHPSFAPQQHLE SRRWQAP LPDGE+DLFI
Sbjct: 361 TGFALEDATGLSAIGDIDVQSVLAASLPRSHPSFAPQQHLEKSRRWQAPLLPDGEVDLFI 420

Query: 421 GILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHARKEVNVELKKEAEFFGDIVI 480
           GILSAGNHFAERMAVRKSWM+HKLI+SS+IVARFFVALHARKEVN+ELKKEAEFFGDIVI
Sbjct: 421 GILSAGNHFAERMAVRKSWMQHKLIKSSRIVARFFVALHARKEVNIELKKEAEFFGDIVI 480

Query: 481 VPYMDNYDLVVLKTVAICEHGVHAVSAKYIMKCDDDTFVKVDSIMNEIKSVSGTGSVYIG 540
           VPYMDNYDLVVLKTVAICE+GV AVSAKYIMKCDDDTFVKVDS+MNE++SV+  GSVYIG
Sbjct: 481 VPYMDNYDLVVLKTVAICEYGVRAVSAKYIMKCDDDTFVKVDSVMNEVRSVARAGSVYIG 540

Query: 541 NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIAQFVISNFERRKLRLFKME 600
           NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIA+FVISNFERRKLRLFKME
Sbjct: 541 NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIARFVISNFERRKLRLFKME 600

Query: 601 DVSMGMWVEQFNSSKAVKYVHSFKYCQFGCIEEYSTAHYQSPRQMICLWNKLLRQAKPEC 660
           DVSMGMWVEQFNSSKAVKYVHSFK+CQFGCIEEY TAHYQSPRQMICLWNKL R+ KPEC
Sbjct: 601 DVSMGMWVEQFNSSKAVKYVHSFKFCQFGCIEEYHTAHYQSPRQMICLWNKLQRRVKPEC 660

Query: 661 CNMR 665
           CNMR
Sbjct: 661 CNMR 664
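
The percentages on each statistics line follow directly from the counts printed next to them (for example, 606 identical positions over an alignment length of 664). A minimal Python sketch of that arithmetic is given below. The function names `parse_hsp_stats` and `percent` are hypothetical helpers introduced for illustration and are not part of any BLAST tool; the pattern accepts either spelling of the Positives label.

```python
import re

# Matches the statistics line printed under each "HSP 1 Score" header.
# "Posi?tives" accepts the label with or without its second "i".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return the counts and reported percentages from one statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, length, ident_pct, pos, _pos_length, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(length),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


def percent(count, total):
    """Percentage rounded to two decimals, matching the report's precision."""
    return round(100 * count / total, 2)


if __name__ == "__main__":
    line = "Identity = 606/664 (91.27%), Positives = 634/664 (95.48%), Query Frame = 0"
    stats = parse_hsp_stats(line)
    # Print the reported value next to the value recomputed from the counts.
    print(stats["identity_pct"], percent(stats["identities"], stats["alignment_length"]))
    print(stats["positives_pct"], percent(stats["positives"], stats["alignment_length"]))
```

Recomputing the percentage from the counts is a cheap consistency check when these lines are scraped from a page rather than read from structured BLAST output.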

BLAST of CSPI01G08100 vs. NCBI nr
Match: XP_004152450.1 (hydroxyproline O-galactosyltransferase GALT6 [Cucumis sativus] >KGN64270.1 hypothetical protein Csa_013915 [Cucumis sativus])

HSP 1 Score: 1339.3 bits (3465), Expect = 0.0e+00
Identity = 663/664 (99.85%), Positives = 664/664 (100.00%), Query Frame = 0

Query: 1   MKRGKLEKVDMIVSFTRQRSIQILLIIGVLYLLLVSLEIPLVFRAGSSVVSQDSLSRPSP 60
           MKRGKLEKVDMIVSFTRQRSIQILLIIGVLYLLLVSLEIPLVFRAGSSVVSQDSLSRPSP
Sbjct: 1   MKRGKLEKVDMIVSFTRQRSIQILLIIGVLYLLLVSLEIPLVFRAGSSVVSQDSLSRPSP 60

Query: 61  LESEEDLEEREAPSRPLENISRNSLQPTPSRLNQFNKIISGLALETEAFESRSEDAVSEF 120
           LESEEDLEEREAPSRPLENISRNSLQPTPSRLNQFNKIISGLALETEAFESRSEDA+SEF
Sbjct: 61  LESEEDLEEREAPSRPLENISRNSLQPTPSRLNQFNKIISGLALETEAFESRSEDAISEF 120

Query: 121 YRSAKIASEVGKKFWDELESGKSQHLEKKKAEKGSNSSCPHSISLSGNDFLAHGGVMMLP 180
           YRSAKIASEVGKKFWDELESGKSQHLEKKKAEKGSNSSCPHSISLSGNDFLAHGGVMMLP
Sbjct: 121 YRSAKIASEVGKKFWDELESGKSQHLEKKKAEKGSNSSCPHSISLSGNDFLAHGGVMMLP 180

Query: 181 CGLTLGSHITLVGKPRVAQPESDPQITMVKNGEESVMVSQFIMELQGLNTVEGEDPPRIL 240
           CGLTLGSHITLVGKPRVAQPESDPQITMVKNGEESVMVSQFIMELQGLNTVEGEDPPRIL
Sbjct: 181 CGLTLGSHITLVGKPRVAQPESDPQITMVKNGEESVMVSQFIMELQGLNTVEGEDPPRIL 240

Query: 241 HFNPRLKGDWSGKPVIELNTCYRMQWGSAHRCEGWKSKANEDTVDGQVKCEKWIRDDEGN 300
           HFNPRLKGDWSGKPVIELNTCYRMQWGSAHRCEGWKSKANEDTVDGQVKCEKWIRDDEGN
Sbjct: 241 HFNPRLKGDWSGKPVIELNTCYRMQWGSAHRCEGWKSKANEDTVDGQVKCEKWIRDDEGN 300

Query: 301 SERSKATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDGKHIVSFPYR 360
           SERSKATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDGKHIVSFPYR
Sbjct: 301 SERSKATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDGKHIVSFPYR 360

Query: 361 TGFALEDATGLSVIGDIDVQSVLAASLPQSHPSFAPQQHLEMSRRWQAPPLPDGEIDLFI 420
           TGFALEDATGLSVIGDIDVQSVLAASLPQSHPSFAPQQHLEMSRRWQAPPLPDGEIDLFI
Sbjct: 361 TGFALEDATGLSVIGDIDVQSVLAASLPQSHPSFAPQQHLEMSRRWQAPPLPDGEIDLFI 420

Query: 421 GILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHARKEVNVELKKEAEFFGDIVI 480
           GILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHARKEVNVELKKEAEFFGDIVI
Sbjct: 421 GILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHARKEVNVELKKEAEFFGDIVI 480

Query: 481 VPYMDNYDLVVLKTVAICEHGVHAVSAKYIMKCDDDTFVKVDSIMNEIKSVSGTGSVYIG 540
           VPYMDNYDLVVLKTVAICEHGVHAVSAKYIMKCDDDTFVKVDSIMNEIKSVSGTGSVYIG
Sbjct: 481 VPYMDNYDLVVLKTVAICEHGVHAVSAKYIMKCDDDTFVKVDSIMNEIKSVSGTGSVYIG 540

Query: 541 NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIAQFVISNFERRKLRLFKME 600
           NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIAQFVISNFERRKLRLFKME
Sbjct: 541 NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIAQFVISNFERRKLRLFKME 600

Query: 601 DVSMGMWVEQFNSSKAVKYVHSFKYCQFGCIEEYSTAHYQSPRQMICLWNKLLRQAKPEC 660
           DVSMGMWVEQFNSSKAVKYVHSFKYCQFGCIEEYSTAHYQSPRQMICLWNKLLRQAKPEC
Sbjct: 601 DVSMGMWVEQFNSSKAVKYVHSFKYCQFGCIEEYSTAHYQSPRQMICLWNKLLRQAKPEC 660

Query: 661 CNMR 665
           CNMR
Sbjct: 661 CNMR 664

BLAST of CSPI01G08100 vs. NCBI nr
Match: XP_008437765.1 (PREDICTED: hydroxyproline O-galactosyltransferase GALT6 [Cucumis melo] >TYJ99121.1 hydroxyproline O-galactosyltransferase GALT6 [Cucumis melo var. makuwa])

HSP 1 Score: 1301.6 bits (3367), Expect = 0.0e+00
Identity = 642/664 (96.69%), Positives = 651/664 (98.04%), Query Frame = 0

Query: 1   MKRGKLEKVDMIVSFTRQRSIQILLIIGVLYLLLVSLEIPLVFRAGSSVVSQDSLSRPSP 60
           MKRGKL+KVD+IVSFTRQRSIQILL+IGVLYLLLVSLEIPLVFRAGSSVVSQDSLSRPSP
Sbjct: 1   MKRGKLDKVDIIVSFTRQRSIQILLLIGVLYLLLVSLEIPLVFRAGSSVVSQDSLSRPSP 60

Query: 61  LESEEDLEEREAPSRPLENISRNSLQPTPSRLNQFNKIISGLALETEAFESRSEDAVSEF 120
           LESEEDLEEREAPSRPLENISRNSLQPTPSRLNQFNKIIS LALETEAFESRS+DAVSEF
Sbjct: 61  LESEEDLEEREAPSRPLENISRNSLQPTPSRLNQFNKIISSLALETEAFESRSDDAVSEF 120

Query: 121 YRSAKIASEVGKKFWDELESGKSQHLEKKKAEKGSNSSCPHSISLSGNDFLAHGGVMMLP 180
           YRSAKIASEVGKKFWDELESGKSQHLEKKKAEKGSNSSCPHSISLSG DFLAHG VMMLP
Sbjct: 121 YRSAKIASEVGKKFWDELESGKSQHLEKKKAEKGSNSSCPHSISLSGKDFLAHGRVMMLP 180

Query: 181 CGLTLGSHITLVGKPRVAQPESDPQITMVKNGEESVMVSQFIMELQGLNTVEGEDPPRIL 240
           CGLTLGSHITLVGKPRVAQPE DPQITMVKNGEESVMVSQFIMELQGLN VEGEDPPRI 
Sbjct: 181 CGLTLGSHITLVGKPRVAQPEYDPQITMVKNGEESVMVSQFIMELQGLNAVEGEDPPRIF 240

Query: 241 HFNPRLKGDWSGKPVIELNTCYRMQWGSAHRCEGWKSKANEDTVDGQVKCEKWIRDDEGN 300
           HFNPRLKGDWSGKPVIE+NTCYRMQWGSAHRCEGWKSKANED VDGQVKCEKWIRDDEGN
Sbjct: 241 HFNPRLKGDWSGKPVIEMNTCYRMQWGSAHRCEGWKSKANEDAVDGQVKCEKWIRDDEGN 300

Query: 301 SERSKATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDGKHIVSFPYR 360
            E+SKATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDGKHIVSFPYR
Sbjct: 301 EEQSKATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDGKHIVSFPYR 360

Query: 361 TGFALEDATGLSVIGDIDVQSVLAASLPQSHPSFAPQQHLEMSRRWQAPPLPDGEIDLFI 420
           TGFALEDATGLSVIGDIDVQSVLAASLP+SHPSFAPQQHLEMS RWQAPPLPDGE+DLFI
Sbjct: 361 TGFALEDATGLSVIGDIDVQSVLAASLPRSHPSFAPQQHLEMSTRWQAPPLPDGEVDLFI 420

Query: 421 GILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHARKEVNVELKKEAEFFGDIVI 480
           GILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHARKEVNVELKKEAEFFGDIVI
Sbjct: 421 GILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHARKEVNVELKKEAEFFGDIVI 480

Query: 481 VPYMDNYDLVVLKTVAICEHGVHAVSAKYIMKCDDDTFVKVDSIMNEIKSVSGTGSVYIG 540
           VPYMDNYDLVVLKTVAICEHGVHAV AKYIMKCDDDTFVKVDSIMNEIK VSGTGSVYIG
Sbjct: 481 VPYMDNYDLVVLKTVAICEHGVHAVPAKYIMKCDDDTFVKVDSIMNEIKRVSGTGSVYIG 540

Query: 541 NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIAQFVISNFERRKLRLFKME 600
           NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIAQFV S+FERRKLRLFKME
Sbjct: 541 NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIAQFVTSDFERRKLRLFKME 600

Query: 601 DVSMGMWVEQFNSSKAVKYVHSFKYCQFGCIEEYSTAHYQSPRQMICLWNKLLRQAKPEC 660
           DVSMGMWVEQFNSSKAVKYVHSFKYCQFGCIEEY TAHYQSPRQMICLWNKLLRQAKPEC
Sbjct: 601 DVSMGMWVEQFNSSKAVKYVHSFKYCQFGCIEEYYTAHYQSPRQMICLWNKLLRQAKPEC 660

Query: 661 CNMR 665
           CNMR
Sbjct: 661 CNMR 664

BLAST of CSPI01G08100 vs. NCBI nr
Match: XP_038895105.1 (hydroxyproline O-galactosyltransferase GALT6 [Benincasa hispida])

HSP 1 Score: 1263.8 bits (3269), Expect = 0.0e+00
Identity = 624/664 (93.98%), Positives = 641/664 (96.54%), Query Frame = 0

Query: 1   MKRGKLEKVDMIVSFTRQRSIQILLIIGVLYLLLVSLEIPLVFRAGSSVVSQDSLSRPSP 60
           MKR KLEKVDMI+S TRQRSIQILL IGVLYLLLVSLEIPLVF  GS VVSQDSLSRPSP
Sbjct: 1   MKRVKLEKVDMIISLTRQRSIQILLFIGVLYLLLVSLEIPLVFGVGSGVVSQDSLSRPSP 60

Query: 61  LESEEDLEEREAPSRPLENISRNSLQPTPSRLNQFNKIISGLALETEAFESRSEDAVSEF 120
           LESEEDLEEREAPSRPLENISRNSLQPT SRLNQFNKI SGLALETEAFESRS+DAVSEF
Sbjct: 61  LESEEDLEEREAPSRPLENISRNSLQPTSSRLNQFNKITSGLALETEAFESRSDDAVSEF 120

Query: 121 YRSAKIASEVGKKFWDELESGKSQHLEKKKAEKGSNSSCPHSISLSGNDFLAHGGVMMLP 180
           YRSAK ASEVGKKFWDELESGK+QH+ KKKAEKGSNSSC HSISLSG+DFLAHGGVMMLP
Sbjct: 121 YRSAKTASEVGKKFWDELESGKNQHMGKKKAEKGSNSSCAHSISLSGSDFLAHGGVMMLP 180

Query: 181 CGLTLGSHITLVGKPRVAQPESDPQITMVKNGEESVMVSQFIMELQGLNTVEGEDPPRIL 240
           CGLTLGSHITLVGKPRVAQP  DPQITMV+N EESVMVSQFIMELQGLNTVE EDPPRIL
Sbjct: 181 CGLTLGSHITLVGKPRVAQPAYDPQITMVRNVEESVMVSQFIMELQGLNTVEDEDPPRIL 240

Query: 241 HFNPRLKGDWSGKPVIELNTCYRMQWGSAHRCEGWKSKANEDTVDGQVKCEKWIRDDEGN 300
           HFNPRLKGDWSG+PVIELNTCYRMQWGSA RCEGWKSKANEDTVDGQVKCEKWIRDDEG+
Sbjct: 241 HFNPRLKGDWSGRPVIELNTCYRMQWGSALRCEGWKSKANEDTVDGQVKCEKWIRDDEGH 300

Query: 301 SERSKATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDGKHIVSFPYR 360
           SE+SKATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDGKH++SFPYR
Sbjct: 301 SEQSKATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDGKHVISFPYR 360

Query: 361 TGFALEDATGLSVIGDIDVQSVLAASLPQSHPSFAPQQHLEMSRRWQAPPLPDGEIDLFI 420
           TGFALEDATGLSVIGDIDVQSVLAASLP+SHPSFAPQQHLEMSR WQAPPLPDGE+DLFI
Sbjct: 361 TGFALEDATGLSVIGDIDVQSVLAASLPRSHPSFAPQQHLEMSRIWQAPPLPDGEVDLFI 420

Query: 421 GILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHARKEVNVELKKEAEFFGDIVI 480
           GILSAGNHFAERMAVRKSWMRHKLI+SSKIVARFFVALHARKEVNVELKKEAEFFGDIVI
Sbjct: 421 GILSAGNHFAERMAVRKSWMRHKLIKSSKIVARFFVALHARKEVNVELKKEAEFFGDIVI 480

Query: 481 VPYMDNYDLVVLKTVAICEHGVHAVSAKYIMKCDDDTFVKVDSIMNEIKSVSGTGSVYIG 540
           VPYMDNYDLVVLKTVAICE+GVHAVSAKYIMKCDDDTFVKVDSIMNEIK V+G GSVYIG
Sbjct: 481 VPYMDNYDLVVLKTVAICEYGVHAVSAKYIMKCDDDTFVKVDSIMNEIKKVAGMGSVYIG 540

Query: 541 NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIAQFVISNFERRKLRLFKME 600
           NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIA FVISNF+RRKLRLFKME
Sbjct: 541 NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIAHFVISNFKRRKLRLFKME 600

Query: 601 DVSMGMWVEQFNSSKAVKYVHSFKYCQFGCIEEYSTAHYQSPRQMICLWNKLLRQAKPEC 660
           DVSMGMWVEQFNSSKAVKYVHSFKYCQFGCIEEY TAHYQSPRQMICLWNKL RQ KPEC
Sbjct: 601 DVSMGMWVEQFNSSKAVKYVHSFKYCQFGCIEEYYTAHYQSPRQMICLWNKLQRQVKPEC 660

Query: 661 CNMR 665
           CNMR
Sbjct: 661 CNMR 664

BLAST of CSPI01G08100 vs. NCBI nr
Match: KAG7012214.1 (Hydroxyproline O-galactosyltransferase GALT6 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1238.4 bits (3203), Expect = 0.0e+00
Identity = 608/664 (91.57%), Positives = 633/664 (95.33%), Query Frame = 0

Query: 1   MKRGKLEKVDMIVSFTRQRSIQILLIIGVLYLLLVSLEIPLVFRAGSSVVSQDSLSRPSP 60
           MKR K EKVDMIVS TRQRSIQILL IG LYLLLVSLEIPLVFR GS VVS DSLSRP P
Sbjct: 1   MKRPKSEKVDMIVSLTRQRSIQILLFIGFLYLLLVSLEIPLVFRVGSGVVSPDSLSRPPP 60

Query: 61  LESEEDLEEREAPSRPLENISRNSLQPTPSRLNQFNKIISGLALETEAFESRSEDAVSEF 120
           LESEEDLEEREAPSRPLENISRNSLQPTPSRL QFNKIISGLALETEAFES  EDAVSEF
Sbjct: 61  LESEEDLEEREAPSRPLENISRNSLQPTPSRLTQFNKIISGLALETEAFESGLEDAVSEF 120

Query: 121 YRSAKIASEVGKKFWDELESGKSQHLEKKKAEKGSNSSCPHSISLSGNDFLAHGGVMMLP 180
           YRSAKIASEVGKKFWDELESGK  H+ KKKAEK SNSSCPHSISLSG +FLAHGGVM+LP
Sbjct: 121 YRSAKIASEVGKKFWDELESGKIHHIAKKKAEKESNSSCPHSISLSGGEFLAHGGVMLLP 180

Query: 181 CGLTLGSHITLVGKPRVAQPESDPQITMVKNGEESVMVSQFIMELQGLNTVEGEDPPRIL 240
           CGLTLGSHITLVGKPRVA PE DPQITMV+NGEESVMVSQFI+ELQGLNTVEGEDPPRIL
Sbjct: 181 CGLTLGSHITLVGKPRVAHPEYDPQITMVRNGEESVMVSQFILELQGLNTVEGEDPPRIL 240

Query: 241 HFNPRLKGDWSGKPVIELNTCYRMQWGSAHRCEGWKSKANEDTVDGQVKCEKWIRDDEGN 300
           HFNPRLKGDWSG+PVIELNTCYRMQWGSA RCEGWKSKANE+TVDGQVKCEKWIRDDEG+
Sbjct: 241 HFNPRLKGDWSGRPVIELNTCYRMQWGSAVRCEGWKSKANEETVDGQVKCEKWIRDDEGH 300

Query: 301 SERSKATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDGKHIVSFPYR 360
           SE+SKATWWLNRLIGRTKRMDIDWP+PFAEDKLFVLTLSAGFEGYHV VDGKH++SFPYR
Sbjct: 301 SEQSKATWWLNRLIGRTKRMDIDWPFPFAEDKLFVLTLSAGFEGYHVTVDGKHVISFPYR 360

Query: 361 TGFALEDATGLSVIGDIDVQSVLAASLPQSHPSFAPQQHLEMSRRWQAPPLPDGEIDLFI 420
           TGFALEDATGLS IGDIDVQSVLAASLP+SHPSFAPQQHLE SRRWQAP LPDGE+DLFI
Sbjct: 361 TGFALEDATGLSAIGDIDVQSVLAASLPRSHPSFAPQQHLEKSRRWQAPLLPDGEVDLFI 420

Query: 421 GILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHARKEVNVELKKEAEFFGDIVI 480
           GILSAGNHFAERMAVRKSWM+HKLI+SS+IVARFFVALHARKEVNVELKKEAEFFGDIVI
Sbjct: 421 GILSAGNHFAERMAVRKSWMQHKLIKSSRIVARFFVALHARKEVNVELKKEAEFFGDIVI 480

Query: 481 VPYMDNYDLVVLKTVAICEHGVHAVSAKYIMKCDDDTFVKVDSIMNEIKSVSGTGSVYIG 540
           VPYMDNYDLVVLKTVAICE+GV AVSAKYIMKCDDDTFVKVDS+MNE++SV+  GSVYIG
Sbjct: 481 VPYMDNYDLVVLKTVAICEYGVRAVSAKYIMKCDDDTFVKVDSVMNEVRSVARAGSVYIG 540

Query: 541 NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIAQFVISNFERRKLRLFKME 600
           NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIA+FVISNFERRKLRLFKME
Sbjct: 541 NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIARFVISNFERRKLRLFKME 600

Query: 601 DVSMGMWVEQFNSSKAVKYVHSFKYCQFGCIEEYSTAHYQSPRQMICLWNKLLRQAKPEC 660
           DVSMGMWVEQFNSSK VKYVHSFKYCQFGCIE+Y TAHYQSPRQMICLWNKL R+ KPEC
Sbjct: 601 DVSMGMWVEQFNSSKTVKYVHSFKYCQFGCIEDYHTAHYQSPRQMICLWNKLQRRVKPEC 660

Query: 661 CNMR 665
           CNMR
Sbjct: 661 CNMR 664

BLAST of CSPI01G08100 vs. NCBI nr
Match: XP_022954781.1 (hydroxyproline O-galactosyltransferase GALT6-like [Cucurbita moschata])

HSP 1 Score: 1238.4 bits (3203), Expect = 0.0e+00
Identity = 609/664 (91.72%), Positives = 633/664 (95.33%), Query Frame = 0

Query: 1   MKRGKLEKVDMIVSFTRQRSIQILLIIGVLYLLLVSLEIPLVFRAGSSVVSQDSLSRPSP 60
           MKR K EKVDMIVS TRQRSIQILL IG LYLLLVSLEIPLVFR GS VVS DSLSRP P
Sbjct: 1   MKRPKSEKVDMIVSLTRQRSIQILLFIGFLYLLLVSLEIPLVFRVGSGVVSPDSLSRPPP 60

Query: 61  LESEEDLEEREAPSRPLENISRNSLQPTPSRLNQFNKIISGLALETEAFESRSEDAVSEF 120
           LESEEDLEEREAPSRPLENISRNSLQPTPSRL QFNKIISGLALETEAFES  EDAVSEF
Sbjct: 61  LESEEDLEEREAPSRPLENISRNSLQPTPSRLTQFNKIISGLALETEAFESGLEDAVSEF 120

Query: 121 YRSAKIASEVGKKFWDELESGKSQHLEKKKAEKGSNSSCPHSISLSGNDFLAHGGVMMLP 180
           YRSAKIASEVGKKFWDELESGK  H+ KKKAEK SNSSCPHSISLSG +FLAHGGVM+LP
Sbjct: 121 YRSAKIASEVGKKFWDELESGKIHHIAKKKAEKESNSSCPHSISLSGVEFLAHGGVMLLP 180

Query: 181 CGLTLGSHITLVGKPRVAQPESDPQITMVKNGEESVMVSQFIMELQGLNTVEGEDPPRIL 240
           CGLTLGSHITLVGKPRVA PE DPQITMV+NGEESVMVSQFI+ELQGLNTVEGEDPPRIL
Sbjct: 181 CGLTLGSHITLVGKPRVAHPEYDPQITMVRNGEESVMVSQFILELQGLNTVEGEDPPRIL 240

Query: 241 HFNPRLKGDWSGKPVIELNTCYRMQWGSAHRCEGWKSKANEDTVDGQVKCEKWIRDDEGN 300
           HFNPRLKGDWSG+PVIELNTCYRMQWGSA RCEGWKSKANE+TVDGQVKCEKWIRDDEG+
Sbjct: 241 HFNPRLKGDWSGRPVIELNTCYRMQWGSAVRCEGWKSKANEETVDGQVKCEKWIRDDEGH 300

Query: 301 SERSKATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDGKHIVSFPYR 360
           SE+SKATWWLNRLIGRTKRMDIDWP+PFAEDKLFVLTLSAGFEGYHV VDGKH++SFPYR
Sbjct: 301 SEQSKATWWLNRLIGRTKRMDIDWPFPFAEDKLFVLTLSAGFEGYHVTVDGKHVISFPYR 360

Query: 361 TGFALEDATGLSVIGDIDVQSVLAASLPQSHPSFAPQQHLEMSRRWQAPPLPDGEIDLFI 420
           TGFALEDATGLS IGDIDVQSVLAASLP+SHPSFAPQQHLE SRRWQAP LPDGE+DLFI
Sbjct: 361 TGFALEDATGLSAIGDIDVQSVLAASLPRSHPSFAPQQHLEKSRRWQAPLLPDGEVDLFI 420

Query: 421 GILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHARKEVNVELKKEAEFFGDIVI 480
           GILSAGNHFAERMAVRKSWM+HKLI+SS+IVARFFVALHARKEVNVELKKEAEFFGDIVI
Sbjct: 421 GILSAGNHFAERMAVRKSWMQHKLIKSSRIVARFFVALHARKEVNVELKKEAEFFGDIVI 480

Query: 481 VPYMDNYDLVVLKTVAICEHGVHAVSAKYIMKCDDDTFVKVDSIMNEIKSVSGTGSVYIG 540
           VPYMDNYDLVVLKTVAICE+GV AVSAKYIMKCDDDTFVKVDS+MNE++SV+  GSVYIG
Sbjct: 481 VPYMDNYDLVVLKTVAICEYGVRAVSAKYIMKCDDDTFVKVDSVMNEVRSVARAGSVYIG 540

Query: 541 NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIAQFVISNFERRKLRLFKME 600
           NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIA+FVISNFERRKLRLFKME
Sbjct: 541 NINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIARFVISNFERRKLRLFKME 600

Query: 601 DVSMGMWVEQFNSSKAVKYVHSFKYCQFGCIEEYSTAHYQSPRQMICLWNKLLRQAKPEC 660
           DVSMGMWVEQFNSSK VKYVHSFKYCQFGCIEEY TAHYQSPRQMICLWNKL R+ KPEC
Sbjct: 601 DVSMGMWVEQFNSSKTVKYVHSFKYCQFGCIEEYHTAHYQSPRQMICLWNKLQRRVKPEC 660

Query: 661 CNMR 665
           CNMR
Sbjct: 661 CNMR 664

BLAST of CSPI01G08100 vs. TAIR 10
Match: AT5G62620.1 (Galactosyltransferase family protein)

HSP 1 Score: 936.8 bits (2420), Expect = 9.6e-273
Identity = 447/674 (66.32%), Positives = 556/674 (82.49%), Query Frame = 0

Query: 2   KRGKLEKVDMIVSFTRQRSIQILLIIGVLYLLLVSLEIPLVFRAGSSVVSQDSLSRPSPL 61
           K  +LEK D+ VS ++QRS+QIL+ +G+LY+LL++ EIP VF+ G S +SQD L+RP   
Sbjct: 8   KLERLEKFDIFVSLSKQRSVQILMAVGLLYMLLITFEIPFVFKTGLSSLSQDPLTRPEKH 67

Query: 62  ESEEDLEEREAPSRPLEN-ISRNSLQPTPSR-LNQFNKIISGLALETEAFESRSEDAVSE 121
            S+ +L+ER AP+RPL++ + + S   +P++ L +  +I+S L  + E F   S+D   E
Sbjct: 68  NSQRELQERRAPTRPLKSLLYQESQSESPAQGLRRRTRILSSLRFDPETFNPSSKDGSVE 127

Query: 122 FYRSAKIASEVGKKFWDELESGKS----QHLEKKKAEKGSNSSCPHSISLSGNDFLAHGG 181
            ++SAK+A EVG+K W+ELESGK+    +  +KKK E+   +SC  S+SL+G+D L  G 
Sbjct: 128 LHKSAKVAWEVGRKIWEELESGKTLKALEKEKKKKIEEHGTNSCSLSVSLTGSDLLKRGN 187

Query: 182 VMMLPCGLTLGSHITLVGKPRVAQPESDPQITMVKNGEESVMVSQFIMELQGLNTVEGED 241
           +M LPCGLTLGSHIT+VGKPR A  E DP+I+M+K G+E+V VSQF +ELQGL  VEGE+
Sbjct: 188 IMELPCGLTLGSHITVVGKPRAAHSEKDPKISMLKEGDEAVKVSQFKLELQGLKAVEGEE 247

Query: 242 PPRILHFNPRLKGDWSGKPVIELNTCYRMQWGSAHRCEGWKSKANEDTVDGQVKCEKWIR 301
           PPRILH NPRLKGDWSGKPVIE NTCYRMQWGSA RCEGW+S+ +E+TVDGQVKCEKW R
Sbjct: 248 PPRILHLNPRLKGDWSGKPVIEQNTCYRMQWGSAQRCEGWRSRDDEETVDGQVKCEKWAR 307

Query: 302 DDEGNS---ERSK-ATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDG 361
           DD   S   E SK A+WWL+RLIGR+K++ ++WP+PF  DKLFVLTLSAG EGYHV+VDG
Sbjct: 308 DDSITSKEEESSKAASWWLSRLIGRSKKVTVEWPFPFTVDKLFVLTLSAGLEGYHVSVDG 367

Query: 362 KHIVSFPYRTGFALEDATGLSVIGDIDVQSVLAASLPQSHPSFAPQQHLEMSRRWQAPPL 421
           KH+ SFPYRTGF LEDATGL++ GDIDV SV A SLP SHPSF+PQ+HLE+S  WQAP L
Sbjct: 368 KHVTSFPYRTGFTLEDATGLTINGDIDVHSVFAGSLPTSHPSFSPQRHLELSSNWQAPSL 427

Query: 422 PDGEIDLFIGILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHARKEVNVELKKE 481
           PD ++D+FIGILSAGNHFAERMAVR+SWM+HKL++SSK+VARFFVALH+RKEVNVELKKE
Sbjct: 428 PDEQVDMFIGILSAGNHFAERMAVRRSWMQHKLVKSSKVVARFFVALHSRKEVNVELKKE 487

Query: 482 AEFFGDIVIVPYMDNYDLVVLKTVAICEHGVHAVSAKYIMKCDDDTFVKVDSIMNEIKSV 541
           AEFFGDIVIVPYMD+YDLVVLKTVAICE+G H ++AK+IMKCDDDTFV+VD++++E K  
Sbjct: 488 AEFFGDIVIVPYMDSYDLVVLKTVAICEYGAHQLAAKFIMKCDDDTFVQVDAVLSEAKKT 547

Query: 542 SGTGSVYIGNINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIAQFVISNFER 601
               S+YIGNINYYHKPLR GKW+VTYEEWPEEDYPPYANGPGYI+S+DI++F++  FE+
Sbjct: 548 PTDRSLYIGNINYYHKPLRQGKWSVTYEEWPEEDYPPYANGPGYILSNDISRFIVKEFEK 607

Query: 602 RKLRLFKMEDVSMGMWVEQFNS-SKAVKYVHSFKYCQFGCIEEYSTAHYQSPRQMICLWN 661
            KLR+FKMEDVS+GMWVEQFN+ +K V Y+HS ++CQFGCIE Y TAHYQSPRQMICLW+
Sbjct: 608 HKLRMFKMEDVSVGMWVEQFNNGTKPVDYIHSLRFCQFGCIENYLTAHYQSPRQMICLWD 667

Query: 662 KLLRQAKPECCNMR 665
           KL+   KP+CCNMR
Sbjct: 668 KLVLTGKPQCCNMR 681

BLAST of CSPI01G08100 vs. TAIR 10
Match: AT1G74800.1 (Galactosyltransferase family protein)

HSP 1 Score: 902.9 bits (2332), Expect = 1.5e-262
Identity = 439/672 (65.33%), Positives = 539/672 (80.21%), Query Frame = 0

Query: 5   KLEKVDMIVSFTRQRSIQILLIIGVLYLLLVSLEIPLVFRA-GSSVVSQDSLSRPSPLES 64
           K++K+D+  S  +QRS+++++ IG LYL++VS+EIPLVF++  SS V  D+LSR   L +
Sbjct: 11  KIDKIDLFSSLWKQRSVRVIMAIGFLYLVIVSVEIPLVFKSWSSSSVPLDALSRLEKLNN 70

Query: 65  EEDLEEREAPSRPLENISRNSLQPT--------PSRLNQFNK-IISGLALETEAFESRSE 124
           E++ +    P+ PLE +S     PT         +++ + ++ ++S L  ++E F+  S+
Sbjct: 71  EQEPQVEIIPNPPLEPVSYPVSNPTIVTRTDLVQNKVREHHRGVLSSLRFDSETFDPSSK 130

Query: 125 DAVSEFYRSAKIASEVGKKFWDELESGKSQHLEKKKAEKGSNSSCPHSISLSGNDFL-AH 184
           D   E ++SAK A ++G+K W ELESG+ + L  +K EK    SCPHS+SL+G++F+   
Sbjct: 131 DGSVELHKSAKEAWQLGRKLWKELESGRLEKL-VEKPEKNKPDSCPHSVSLTGSEFMNRE 190

Query: 185 GGVMMLPCGLTLGSHITLVGKPRVAQPESDPQITMVKNGEESVMVSQFIMELQGLNTVEG 244
             +M LPCGLTLGSHITLVG+PR A P         K G+ S +VSQF++ELQGL TVEG
Sbjct: 191 NKLMELPCGLTLGSHITLVGRPRKAHP---------KEGDWSKLVSQFVIELQGLKTVEG 250

Query: 245 EDPPRILHFNPRLKGDWSGKPVIELNTCYRMQWGSAHRCEGWKSKANEDTVDGQVKCEKW 304
           EDPPRILHFNPRLKGDWS KPVIE N+CYRMQWG A RCEGWKS+ +E+TVD  VKCEKW
Sbjct: 251 EDPPRILHFNPRLKGDWSKKPVIEQNSCYRMQWGPAQRCEGWKSRDDEETVDSHVKCEKW 310

Query: 305 IRDDEGNSERSKATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDGKH 364
           IRDD+  SE S+A WWLNRLIGR KR+ ++WP+PF E+KLFVLTLSAG EGYH+NVDGKH
Sbjct: 311 IRDDDNYSEGSRARWWLNRLIGRRKRVKVEWPFPFVEEKLFVLTLSAGLEGYHINVDGKH 370

Query: 365 IVSFPYRTGFALEDATGLSVIGDIDVQSVLAASLPQSHPSFAPQQHLEMSRRWQAPPLPD 424
           + SFPYRTGF LEDATGL+V GDIDV SV  ASLP SHPSFAPQ+HLE+S+RWQAP +PD
Sbjct: 371 VTSFPYRTGFTLEDATGLTVNGDIDVHSVFVASLPTSHPSFAPQRHLELSKRWQAPVVPD 430

Query: 425 GEIDLFIGILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHARKEVNVELKKEAE 484
           G +++FIGILSAGNHF+ERMAVRKSWM+H LI S+K+VARFFVALH RKEVNVELKKEAE
Sbjct: 431 GPVEIFIGILSAGNHFSERMAVRKSWMQHVLITSAKVVARFFVALHGRKEVNVELKKEAE 490

Query: 485 FFGDIVIVPYMDNYDLVVLKTVAICEHGVHAVSAKYIMKCDDDTFVKVDSIMNEIKSVSG 544
           +FGDIV+VPYMD+YDLVVLKTVAICEHG  A SAKYIMKCDDDTFVK+ +++NE+K V  
Sbjct: 491 YFGDIVLVPYMDSYDLVVLKTVAICEHGALAFSAKYIMKCDDDTFVKLGAVINEVKKVPE 550

Query: 545 TGSVYIGNINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIAQFVISNFERRK 604
             S+YIGN+NYYHKPLR GKWAVTYEEWPEEDYPPYANGPGY++SSDIA+F++  FER K
Sbjct: 551 GRSLYIGNMNYYHKPLRGGKWAVTYEEWPEEDYPPYANGPGYVLSSDIARFIVDKFERHK 610

Query: 605 LRLFKMEDVSMGMWVEQF-NSSKAVKYVHSFKYCQFGCIEEYSTAHYQSPRQMICLWNKL 664
           LRLFKMEDVS+GMWVE F N++  V Y HS ++CQFGC+E Y TAHYQSPRQMICLW+KL
Sbjct: 611 LRLFKMEDVSVGMWVEHFKNTTNPVDYRHSLRFCQFGCVENYYTAHYQSPRQMICLWDKL 670

BLAST of CSPI01G08100 vs. TAIR 10
Match: AT1G27120.1 (Galactosyltransferase family protein)

HSP 1 Score: 795.0 bits (2052), Expect = 4.5e-230
Identity = 406/683 (59.44%), Positives = 508/683 (74.38%), Query Frame = 0

Query: 5   KLEKVDMIVSFTRQRSIQILLIIGVLYLLLVSLEIPLVFRAGSSVVSQ--------DSLS 64
           K  K+D   S  R   +Q LL++ + Y L +S EIP +FR GS   S         D+L 
Sbjct: 2   KKSKLDNSSSQIRFGLVQFLLVVLLFYFLCMSFEIPFIFRTGSGSGSDDVSSSSFADALP 61

Query: 65  RPSPL--ESEE-----DLEEREAPSRPLENISRNSLQPTPSRLNQFNKIISGLALETEAF 124
           RP  +   S E       EE   P R  ++  R  L+    ++ +F K +S + +    F
Sbjct: 62  RPMVVGGGSREANWVVGEEEEADPHRHFKDPGRVQLRLPERKMREF-KSVSEIFVNESFF 121

Query: 125 ESRS-EDAVSEFYRSAKIASEVGKKFWDELESGKSQHLEKKKAEKGSNSSCPHSISLSGN 184
           ++    D  S F+++AK A  +G+K WD L+SG  +    K   K     CP  +S+S +
Sbjct: 122 DNGGFSDEFSIFHKTAKHAISMGRKMWDGLDSGLIK--PDKAPVKTRIEKCPDMVSVSES 181

Query: 185 DFLAHGGVMMLPCGLTLGSHITLVGKPRVAQPESDPQITMVKNGEESVMVSQFIMELQGL 244
           +F+    +++LPCGLTLGSHIT+V  P  A  E        K+G+++ MVSQF+MELQGL
Sbjct: 182 EFVNRSRILVLPCGLTLGSHITVVATPHWAHVE--------KDGDKTAMVSQFMMELQGL 241

Query: 245 NTVEGEDPPRILHFNPRLKGDWSGKPVIELNTCYRMQWGSAHRCEGWKSKANEDTVDGQV 304
             V+GEDPPRILHFNPR+KGDWSG+PVIE NTCYRMQWGS  RC+G +S  +E+ VDG+V
Sbjct: 242 KAVDGEDPPRILHFNPRIKGDWSGRPVIEQNTCYRMQWGSGLRCDGRESSDDEEYVDGEV 301

Query: 305 KCEKWIRDDE--GNS----ERSKATWWLNRLIGRTKRM-DIDWPYPFAEDKLFVLTLSAG 364
           KCE+W RDD+  GN+    + SK TWWLNRL+GR K+M   DW YPFAE KLFVLTL AG
Sbjct: 302 KCERWKRDDDDGGNNGDDFDESKKTWWLNRLMGRRKKMITHDWDYPFAEGKLFVLTLRAG 361

Query: 365 FEGYHVNVDGKHIVSFPYRTGFALEDATGLSVIGDIDVQSVLAASLPQSHPSFAPQQHLE 424
            EGYH++V+G+HI SFPYRTGF LEDATGL+V G+IDV SV AASLP ++PSFAPQ+HLE
Sbjct: 362 MEGYHISVNGRHITSFPYRTGFVLEDATGLAVKGNIDVHSVYAASLPSTNPSFAPQKHLE 421

Query: 425 MSRRWQAPPLPDGEIDLFIGILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHAR 484
           M R W+AP LP   ++LFIGILSAGNHFAERMAVRKSWM+ KL+RSSK+VARFFVALHAR
Sbjct: 422 MQRIWKAPSLPQKPVELFIGILSAGNHFAERMAVRKSWMQQKLVRSSKVVARFFVALHAR 481

Query: 485 KEVNVELKKEAEFFGDIVIVPYMDNYDLVVLKTVAICEHGVHAVSAKYIMKCDDDTFVKV 544
           KEVNV+LKKEAE+FGDIVIVPYMD+YDLVVLKTVAICE+GV+ V+AKY+MKCDDDTFV+V
Sbjct: 482 KEVNVDLKKEAEYFGDIVIVPYMDHYDLVVLKTVAICEYGVNTVAAKYVMKCDDDTFVRV 541

Query: 545 DSIMNEIKSVSGTGSVYIGNINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDI 604
           D+++ E + V G  S+YIGNIN+ HKPLR GKWAVT+EEWPEE YPPYANGPGYI+S D+
Sbjct: 542 DAVIQEAEKVKGRESLYIGNINFNHKPLRTGKWAVTFEEWPEEYYPPYANGPGYILSYDV 601

Query: 605 AQFVISNFERRKLRLFKMEDVSMGMWVEQFNSSKAVKYVHSFKYCQFGCIEEYSTAHYQS 664
           A+F++ +FE+++LRLFKMEDVSMGMWVE+FN ++ V  VHS K+CQFGCIE+Y TAHYQS
Sbjct: 602 AKFIVDDFEQKRLRLFKMEDVSMGMWVEKFNETRPVAVVHSLKFCQFGCIEDYFTAHYQS 661

BLAST of CSPI01G08100 vs. TAIR 10
Match: AT5G62620.2 (Galactosyltransferase family protein)

HSP 1 Score: 749.2 bits (1933), Expect = 2.8e-216
Identity = 366/563 (65.01%), Positives = 457/563 (81.17%), Query Frame = 0

Query: 2   KRGKLEKVDMIVSFTRQRSIQILLIIGVLYLLLVSLEIPLVFRAGSSVVSQDSLSRPSPL 61
           K  +LEK D+ VS ++QRS+QIL+ +G+LY+LL++ EIP VF+ G S +SQD L+RP   
Sbjct: 8   KLERLEKFDIFVSLSKQRSVQILMAVGLLYMLLITFEIPFVFKTGLSSLSQDPLTRPEKH 67

Query: 62  ESEEDLEEREAPSRPLEN-ISRNSLQPTPSR-LNQFNKIISGLALETEAFESRSEDAVSE 121
            S+ +L+ER AP+RPL++ + + S   +P++ L +  +I+S L  + E F   S+D   E
Sbjct: 68  NSQRELQERRAPTRPLKSLLYQESQSESPAQGLRRRTRILSSLRFDPETFNPSSKDGSVE 127

Query: 122 FYRSAKIASEVGKKFWDELESGKS----QHLEKKKAEKGSNSSCPHSISLSGNDFLAHGG 181
            ++SAK+A EVG+K W+ELESGK+    +  +KKK E+   +SC  S+SL+G+D L  G 
Sbjct: 128 LHKSAKVAWEVGRKIWEELESGKTLKALEKEKKKKIEEHGTNSCSLSVSLTGSDLLKRGN 187

Query: 182 VMMLPCGLTLGSHITLVGKPRVAQPESDPQITMVKNGEESVMVSQFIMELQGLNTVEGED 241
           +M LPCGLTLGSHIT+VGKPR A  E DP+I+M+K G+E+V VSQF +ELQGL  VEGE+
Sbjct: 188 IMELPCGLTLGSHITVVGKPRAAHSEKDPKISMLKEGDEAVKVSQFKLELQGLKAVEGEE 247

Query: 242 PPRILHFNPRLKGDWSGKPVIELNTCYRMQWGSAHRCEGWKSKANEDTVDGQVKCEKWIR 301
           PPRILH NPRLKGDWSGKPVIE NTCYRMQWGSA RCEGW+S+ +E+TVDGQVKCEKW R
Sbjct: 248 PPRILHLNPRLKGDWSGKPVIEQNTCYRMQWGSAQRCEGWRSRDDEETVDGQVKCEKWAR 307

Query: 302 DDEGNS---ERSK-ATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDG 361
           DD   S   E SK A+WWL+RLIGR+K++ ++WP+PF  DKLFVLTLSAG EGYHV+VDG
Sbjct: 308 DDSITSKEEESSKAASWWLSRLIGRSKKVTVEWPFPFTVDKLFVLTLSAGLEGYHVSVDG 367

Query: 362 KHIVSFPYRTGFALEDATGLSVIGDIDVQSVLAASLPQSHPSFAPQQHLEMSRRWQAPPL 421
           KH+ SFPYRTGF LEDATGL++ GDIDV SV A SLP SHPSF+PQ+HLE+S  WQAP L
Sbjct: 368 KHVTSFPYRTGFTLEDATGLTINGDIDVHSVFAGSLPTSHPSFSPQRHLELSSNWQAPSL 427

Query: 422 PDGEIDLFIGILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHARKEVNVELKKE 481
           PD ++D+FIGILSAGNHFAERMAVR+SWM+HKL++SSK+VARFFVALH+RKEVNVELKKE
Sbjct: 428 PDEQVDMFIGILSAGNHFAERMAVRRSWMQHKLVKSSKVVARFFVALHSRKEVNVELKKE 487

Query: 482 AEFFGDIVIVPYMDNYDLVVLKTVAICEHGVHAVSAKYIMKCDDDTFVKVDSIMNEIKSV 541
           AEFFGDIVIVPYMD+YDLVVLKTVAICE+G H ++AK+IMKCDDDTFV+VD++++E K  
Sbjct: 488 AEFFGDIVIVPYMDSYDLVVLKTVAICEYGAHQLAAKFIMKCDDDTFVQVDAVLSEAKKT 547

Query: 542 SGTGSVYIGNINYYHKPLRYGKW 555
               S+YIGNINYYHKPLR GKW
Sbjct: 548 PTDRSLYIGNINYYHKPLRQGKW 570

BLAST of CSPI01G08100 vs. TAIR 10
Match: AT4G21060.1 (Galactosyltransferase family protein)

HSP 1 Score: 664.5 bits (1713), Expect = 9.2e-191
Identity = 317/554 (57.22%), Positives = 413/554 (74.55%), Query Frame = 0

Query: 117 VSEFYRSAKIASEVGKKFWDELESGKSQHL-EKKKAEKGSNSSCPHSISLSGNDFLAHGG 176
           +S F R A  A  +G K W++++  +   + E     +G   SCP  IS++G+D      
Sbjct: 189 MSPFERMADEAWILGSKAWEDVDKFEVDKINESASIFEGKVESCPSQISMNGDDLNKANR 248

Query: 177 VMMLPCGLTLGSHITLVGKPRVAQPESDPQITMVKNGEESVMVSQFIMELQGLNTVEGED 236
           +M+LPCGL  GS IT++G P+ A  ES PQ + +      V+VSQF++ELQGL T +GE 
Sbjct: 249 IMLLPCGLAAGSSITILGTPQYAHKESVPQRSRLTRSYGMVLVSQFMVELQGLKTGDGEY 308

Query: 237 PPRILHFNPRLKGDWSGKPVIELNTCYRMQWGSAHRCEGWKSKANEDT-VDGQVKCEKWI 296
           PP+ILH NPR+KGDW+ +PVIE NTCYRMQWG A RC+G  SK + D  VDG  +CEKW 
Sbjct: 309 PPKILHLNPRIKGDWNHRPVIEHNTCYRMQWGVAQRCDGTPSKKDADVLVDGFRRCEKWT 368

Query: 297 RD---DEGNSERSKATWWLNRLIGRTKRMDIDWPYPFAEDKLFVLTLSAGFEGYHVNVDG 356
           ++   D  +S+ SK T W  R IGR ++ ++ W +PFAE K+FVLTL AG +G+H+NV G
Sbjct: 369 QNDIIDMVDSKESKTTSWFKRFIGREQKPEVTWSFPFAEGKVFVLTLRAGIDGFHINVGG 428

Query: 357 KHIVSFPYRTGFALEDATGLSVIGDIDVQSVLAASLPQSHPSFAPQQHLEMSRRWQAPPL 416
           +H+ SFPYR GF +EDATGL+V GD+D+ S+ A SL  SHPSF+PQ+ +E S  W+APPL
Sbjct: 429 RHVSSFPYRPGFTIEDATGLAVTGDVDIHSIHATSLSTSHPSFSPQKAIEFSSEWKAPPL 488

Query: 417 PDGEIDLFIGILSAGNHFAERMAVRKSWMRHKLIRSSKIVARFFVALHARKEVNVELKKE 476
           P     LF+G+LSA NHF+ERMAVRK+WM+H  I+SS +VARFFVAL+ RKEVN  LKKE
Sbjct: 489 PGTPFRLFMGVLSATNHFSERMAVRKTWMQHPSIKSSDVVARFFVALNPRKEVNAMLKKE 548

Query: 477 AEFFGDIVIVPYMDNYDLVVLKTVAICEHGVHAVSAKYIMKCDDDTFVKVDSIMNEIKSV 536
           AE+FGDIVI+P+MD Y+LVVLKT+AICE GV  V+A YIMKCDDDTF++V+SI+ +I  V
Sbjct: 549 AEYFGDIVILPFMDRYELVVLKTIAICEFGVQNVTAPYIMKCDDDTFIRVESILKQIDGV 608

Query: 537 SGTGSVYIGNINYYHKPLRYGKWAVTYEEWPEEDYPPYANGPGYIVSSDIAQFVISNFER 596
           S   S+Y+GN+N  H+PLR GKW VT+EEWPE  YPPYANGPGYI+SS+IA++++S   R
Sbjct: 609 SPEKSLYMGNLNLRHRPLRTGKWTVTWEEWPEAVYPPYANGPGYIISSNIAKYIVSQNSR 668

Query: 597 RKLRLFKMEDVSMGMWVEQFNSS-KAVKYVHSFKYCQFGCIEEYSTAHYQSPRQMICLWN 656
            KLRLFKMEDVSMG+WVEQFN+S + V+Y HS+K+CQ+GC   Y TAHYQSP QM+CLW+
Sbjct: 669 HKLRLFKMEDVSMGLWVEQFNASMQPVEYSHSWKFCQYGCTLNYYTAHYQSPSQMMCLWD 728

Query: 657 KLLRQAKPECCNMR 665
            LL+  +P+CCN R
Sbjct: 729 NLLK-GRPQCCNFR 741

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
Q9LV16	1.3e-271	66.32	Hydroxyproline O-galactosyltransferase GALT6 OS=Arabidopsis thaliana OX=3702 GN=...
Q8RX55	2.2e-261	65.33	Hydroxyproline O-galactosyltransferase GALT5 OS=Arabidopsis thaliana OX=3702 GN=...
Q8GXG6	6.3e-229	59.44	Hydroxyproline O-galactosyltransferase GALT4 OS=Arabidopsis thaliana OX=3702 GN=...
A7XDQ9	4.2e-188	50.22	Hydroxyproline O-galactosyltransferase GALT2 OS=Arabidopsis thaliana OX=3702 GN=...
Q8L7F9	1.7e-85	35.41	Beta-1,3-galactosyltransferase GALT1 OS=Arabidopsis thaliana OX=3702 GN=GALT1 PE...
Match Name	E-value	Identity	Description
A0A0A0LQS4	0.0e+00	99.85	Galectin domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G045680 PE...
A0A5D3BK65	0.0e+00	96.69	Hydroxyproline O-galactosyltransferase GALT6 OS=Cucumis melo var. makuwa OX=1194...
A0A1S3AVE3	0.0e+00	96.69	hydroxyproline O-galactosyltransferase GALT6 OS=Cucumis melo OX=3656 GN=LOC10348...
A0A6J1GRU4	0.0e+00	91.72	hydroxyproline O-galactosyltransferase GALT6-like OS=Cucurbita moschata OX=3662 ...
A0A6J1K1N1	0.0e+00	91.27	hydroxyproline O-galactosyltransferase GALT6-like OS=Cucurbita maxima OX=3661 GN...
Match Name	E-value	Identity	Description
XP_004152450.1	0.0e+00	99.85	hydroxyproline O-galactosyltransferase GALT6 [Cucumis sativus] >KGN64270.1 hypot...
XP_008437765.1	0.0e+00	96.69	PREDICTED: hydroxyproline O-galactosyltransferase GALT6 [Cucumis melo] >TYJ99121...
XP_038895105.1	0.0e+00	93.98	hydroxyproline O-galactosyltransferase GALT6 [Benincasa hispida]
KAG7012214.1	0.0e+00	91.57	Hydroxyproline O-galactosyltransferase GALT6 [Cucurbita argyrosperma subsp. argy...
XP_022954781.1	0.0e+00	91.72	hydroxyproline O-galactosyltransferase GALT6-like [Cucurbita moschata]
Match Name	E-value	Identity	Description
AT5G62620.1	9.6e-273	66.32	Galactosyltransferase family protein
AT1G74800.1	1.5e-262	65.33	Galactosyltransferase family protein
AT1G27120.1	4.5e-230	59.44	Galactosyltransferase family protein
AT5G62620.2	2.8e-216	65.01	Galactosyltransferase family protein
AT4G21060.1	9.2e-191	57.22	Galactosyltransferase family protein
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR001079	Galectin, carbohydrate recognition domain	SMART	SM00908	Gal_bind_lectin_2	coord: 179..384; e-value: 1.0E-22; score: 91.5
IPR001079	Galectin, carbohydrate recognition domain	PFAM	PF00337	Gal-bind_lectin	coord: 177..383; e-value: 7.5E-45; score: 151.7
IPR001079	Galectin, carbohydrate recognition domain	PROSITE	PS51304	GALECTIN	coord: 175..385; score: 28.934305
IPR001079	Galectin, carbohydrate recognition domain	CDD	cd00070	GLECT	coord: 179..382; e-value: 8.4456E-19; score: 80.7559
None	No IPR available	GENE3D	2.60.120.200		coord: 176..383; e-value: 6.6E-23; score: 82.7
None	No IPR available	GENE3D	3.90.550.50		coord: 415..656; e-value: 1.3E-18; score: 69.2
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 59..76
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 52..88
None	No IPR available	PANTHER	PTHR11214:SF271	HYDROXYPROLINE O-GALACTOSYLTRANSFERASE GALT5	coord: 10..664
IPR002659	Glycosyl transferase, family 31	PFAM	PF01762	Galactosyl_T	coord: 431..611; e-value: 2.4E-32; score: 112.3
IPR002659	Glycosyl transferase, family 31	PANTHER	PTHR11214	BETA-1,3-N-ACETYLGLUCOSAMINYLTRANSFERASE	coord: 10..664
IPR013320	Concanavalin A-like lectin/glucanase domain superfamily	SUPERFAMILY	49899	Concanavalin A-like lectins/glucanases	coord: 177..383

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CSPI01G08100.1	CSPI01G08100.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006486	protein glycosylation
cellular_component	GO:0000139	Golgi membrane
cellular_component	GO:0016021	integral component of membrane
cellular_component	GO:0016020	membrane
molecular_function	GO:0030246	carbohydrate binding
molecular_function	GO:0008378	galactosyltransferase activity
molecular_function	GO:0016758	hexosyltransferase activity