CSPI01G16600 (gene) Wild cucumber (PI 183967)

Name: CSPI01G16600
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Exostosin family protein
Location: Chr1 : 12170377 .. 12174275 (+)
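The location string can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page); the `Location` type and `parse_location` helper are hypothetical names introduced for illustration only.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a parsed feature location."""
    chrom: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a string such as 'Chr1 : 12170377 .. 12174275 (+)'.

    Whitespace around ':' and '..' is optional, so compact forms also parse.
    """
    pattern = r"\s*(\S+?)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return Location(chrom, int(start), int(end), strand)


loc = parse_location("Chr1 : 12170377 .. 12174275 (+)")
# Assumption: coordinates are 1-based and inclusive, hence the +1.
span = loc.end - loc.start + 1
print(loc.chrom, loc.start, loc.end, loc.strand, span)
```

Under the inclusive-coordinate assumption the feature spans 3899 bp on the plus strand of Chr1.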
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
CTTAATTTCTTCTTCTTATTCTTCTTCTTCTCAAAGGTTTAATGTTCTTTTTTAAATTATTGAGTCCTTTTTAATTAGTTAATTTTAGCTTCCATTGTTTTTAATGGCTTCCTCCTTGGAGTTTCCTCATAAACTCTCTTTCTTTTTACTTCTACCTTTCTTCCTCCTTCTTCTTCTCCTTCTCTGCTTCTTCCCACCAAATGATCAAATCAACCCTTTCTCATCCATATTATCCAAAAATCTTTTCCCTTTCCATTCATCCAAACAACCCCAGCCACCATTGTCGCCACCCCAATCCACCCTTCAATTTCCTCCAACCACTGCCACTGCCACTGCCCCCTCACAGCCGCAAGATTACTCCTCCACCCGGAAGGTTGGTTTGGATGGTTTCTTATTTGTTTGGTAACAATTTGGTTTTGCTTTTTTCTTTATAAAAAAAAAGTTGTATGCCTTTCTTTCCTCCTCAATTTTTCAAATTATGTTAATTTTCTCCTTTCTTAATCAAAACACTTAAAATTTTTACTTAAATAGTTTTTCTGAATCTTTTTCTTACAGTTTTGAAAACACGAGTAGAAAATTAATGAAAAAATTCATACAAACTTATTGATCAAAGTATTAACTTGTGTTTTCTAATTAGTTTTTAAAGACAAATTTAGTTTAATTTTTAAAAGAAAATACTAATCTTTTTAATCAAACGCAGTCGTCTCGATCCTTTCATCACAGATAGAATTTTTTTTTTATATAAGGTGATGAATTAGGATAGGTTTAAGATAATATTTTACATTATACGATACTGTTGCATGCAGAAGAAGAGTGAAATGATCGAAGAGGGTTTGTCGGAGGCTCGAGCAGCGATCCGACTAGCGATCGTAACCCGAAACTACACGTCGGAAAAGGAGGAGAGTTTCATACCGAGAGGAAGGGTTTACAGAAACGCATACGCTTTCCATCAGTTAAGATTCCCACTGTCCTATTTAATTTCCTTTCCATTTTTGGCTCTCTACAAATTTCCATATGCTTTGTTTCTTTCTTAAGAAAATAATTCATTTATATTTATCAGGAAATAAATAATGATTATATATAACACATTTTAGGGAAAAATGAAAACGGATCACATTTTTAGTATTCCATTAAACGTTTCTCACCCCCAAATAAAAAACTCACGAGTTTTCCATCTATAATTTTTTATAAAATCAAAAAATTCATATTAATTAATTTGGTCAGAAATTGACACATTTTCTAAAAAAGCTTTAACTTTTTTACTAAGTTACATATATATGGATATATACATACTATTTTGATTATGTAATTTGAGATTTCTTTTAAAAAATGTTATTCTATTTATTCTTAATTACTAAAATTGAACAAATTCAATATTCTTGTAAAAGTTTTGATATTTTGATATATATTTTTTAAAGCGAAATGAATTAAATGTTTGTTAAAGAGAAGATTAAATGTGTTGAATACTTTCCTAAAACAAAGTAGATGAAATAGACATAGTACTCGAAGCAATAAAACTTAAAAATTGTTCATTATATAGTCTACTTAATTAGCCTAATAATTACTCCACCAAATAATTGATGCATTAAAGTATAGTAAATGAAATATTTACAAATTCATAAAAATTACAAAATTTTCAATAAATATGTCATTTAAACATAATTAGAAATTAATAATTGGAGAAGTCATGGCTAGAGTATGAATAACAAAAAAAATTAAAAATTAAAATAATACTATGTGTAATAATTAGGAATAAAGGGGGCAGGGGGCAGGNNNNNNNNGGGGGGGAGGCTGCCCAGCCGCCTTGGGCACGGACTAAATCTAGGCTGCCCAAAATAGTCCAACGATTGTCTCTGTTGACATTGTTCCTTCGTTGTTTTTAGCTAAACAGTTGTCCATTCATCATTATTATTTTTTTTATTACTCTACACTATTACCATTTTCATTCCATCAGTACTTAAATTCTGTATTTTGATTTTAAACCTTAGACTTAACTTTGC
TAACAACTTCCTTTTAGCCTCTATCTTTATTCCCCTTTATATTTTGTAATTTGTTTTTTTTTAAAAAAAAAATGATTTGTAATTACTAAAAGTTGGAAACATAGAATTGCATGATATGACGTTTTCCAGTTGGTCTGTTTTAATTATTGGACTATGAAAACGTTCATCTTGTTTTATGTAAATCTTCTTTTACTTTTCAAATAGTAATTATGTTGATCCATATTCCTCAATTCACCGTATAAACTCTTTAATATAATAATTTTATGACCAGAGATGTTCAGCAACTTTTAAGGGAGTGCAGGTTAACATTATTCTTTGGACAAAAATGAAAATAAATAGGCATTGTGATACAACAACCACCAAATCAAACAATTAAAAGTATAGAAACTAAAGTAAAATTTACGTGTAAGATTAACAAACATTAGTGATCTTGAAATATACAATTCAATTATGATATTGTTTGTTACACTGAGGTATTTTCTAGACCATAGCATCGGATTCACACAAACAATAGTGATTAGTGTGTGAACCAATAGACAAATGTAGATGGTAATAGTAATAATATGTATTGTTCATAAACTGTAGGAGTCATATTGAGATGAAGAAGAGGTTAAAAATATGGACATACAAAGAAGGAGAGCAGCCATTGGTGCACGATGGGCCGATGAAACACATATACTCAATTGAGGGCCATTTTATTGACGAAATGGACAGTGGAAAGAGCCCATTTTCGGCCCATGAACCAGAAGAGGCCCAAGTATTTTTCTTGCCTATAAGTATCGTTTATATTGTGGATTACATCTACAAGCCCATTACCACATACGCACGTGACCGTCTCGTTCGAATTTTCACGGATTATGTGAGGGTGGTAGCTAATAAGTACCCTTACTGGAACCGTACACGTGGAGCAGATCATTTCATGGTCTCCTGCCATGATTGGGTAACTTCCTAGTAACTCAAACCAAAACTGCAAAATTATAGTTAACAGAATGGACTATGAATTGGGAGTTTAACAAAATTAACATTTTGGAAAGCTTTCAGGCGCCGGAAGTAACAAAAGAAGATCCTAACCTCTTCAAATATTTCATCAGAGTTCTTTGCAATGCCAACACATCCGAAGGCTTCAATCCAATGCGAGATGCATCCTTGCCCGAGATTAACTTACCTCCAACTTTCCACCTCAATCTTCCTCGATTAGGCCAACCGCCACAGAACCGCTCAATTCTAGCTTTCTTCGCCGGCGGAGCACACGGATTCATCCGCCACATCCTAATGCAGCATTGGAAAGACAAAGACCATGAAATCCAAGTCCACGAGTACCTTCCTCCATCCCAAAACTACACCGAATTGATCGATCGAAGCAAATTCTGCCTCTGCCCTAGCGGATACGAAGTTGCAAGCCCTAGGTTAGTGGAAGCGATCCACGGCGGTTGTGTACCAGTGGTAATCTCTGATTATTACTCCTTGCCGTTCGATGATGTGCTGGATTGGAGCAAATTCTCGATGCGGATTCCGTCTGAGAGGATTCCGGAGATCAAGACGATCTTGAGAGGAGTTTCGATGAAGAAGTACTTGAAACTACAGCGAGGAGTGATGAAAGTGCAGAGACATTTTGAGATTCATCGGCCGGCGAAGGCGTTTGATATGTTCCATATGGTACTTCACTCTGTTTGGCTCAGACGACTCAATGTAAAGCTTACACATTGATTGGGTAATGGTTTTCCTTCGTGGAATTATACCAATACATATACACAGAGAAAAATAATTATTCAAAATTAAACAATTAAAAGAAAAGGGAAATATGTGAATTACCATAGTGCCCCCCAGAAATGACCCTGCCAATGCAAAATTATGTGGTATGGTTGTCTTTGCCAATACTATTGAAGACTATAATAACTA

mRNA sequence

ATGGCTTCCTCCTTGGAGTTTCCTCATAAACTCTCTTTCTTTTTACTTCTACCTTTCTTCCTCCTTCTTCTTCTCCTTCTCTGCTTCTTCCCACCAAATGATCAAATCAACCCTTTCTCATCCATATTATCCAAAAATCTTTTCCCTTTCCATTCATCCAAACAACCCCAGCCACCATTGTCGCCACCCCAATCCACCCTTCAATTTCCTCCAACCACTGCCACTGCCACTGCCCCCTCACAGCCGCAAGATTACTCCTCCACCCGGAAGAAGAAGAGTGAAATGATCGAAGAGGGTTTGTCGGAGGCTCGAGCAGCGATCCGACTAGCGATCGTAACCCGAAACTACACGTCGGAAAAGGAGGAGAGTTTCATACCGAGAGGAAGGGTTTACAGAAACGCATACGCTTTCCATCAGAGTCATATTGAGATGAAGAAGAGGTTAAAAATATGGACATACAAAGAAGGAGAGCAGCCATTGGTGCACGATGGGCCGATGAAACACATATACTCAATTGAGGGCCATTTTATTGACGAAATGGACAGTGGAAAGAGCCCATTTTCGGCCCATGAACCAGAAGAGGCCCAAGTATTTTTCTTGCCTATAAGTATCGTTTATATTGTGGATTACATCTACAAGCCCATTACCACATACGCACGTGACCGTCTCGTTCGAATTTTCACGGATTATGTGAGGGTGGTAGCTAATAAGTACCCTTACTGGAACCGTACACGTGGAGCAGATCATTTCATGGTCTCCTGCCATGATTGGGCGCCGGAAGTAACAAAAGAAGATCCTAACCTCTTCAAATATTTCATCAGAGTTCTTTGCAATGCCAACACATCCGAAGGCTTCAATCCAATGCGAGATGCATCCTTGCCCGAGATTAACTTACCTCCAACTTTCCACCTCAATCTTCCTCGATTAGGCCAACCGCCACAGAACCGCTCAATTCTAGCTTTCTTCGCCGGCGGAGCACACGGATTCATCCGCCACATCCTAATGCAGCATTGGAAAGACAAAGACCATGAAATCCAAGTCCACGAGTACCTTCCTCCATCCCAAAACTACACCGAATTGATCGATCGAAGCAAATTCTGCCTCTGCCCTAGCGGATACGAAGTTGCAAGCCCTAGGTTAGTGGAAGCGATCCACGGCGGTTGTGTACCAGTGGTAATCTCTGATTATTACTCCTTGCCGTTCGATGATGTGCTGGATTGGAGCAAATTCTCGATGCGGATTCCGTCTGAGAGGATTCCGGAGATCAAGACGATCTTGAGAGGAGTTTCGATGAAGAAGTACTTGAAACTACAGCGAGGAGTGATGAAAGTGCAGAGACATTTTGAGATTCATCGGCCGGCGAAGGCGTTTGATATGTTCCATATGGTACTTCACTCTGTTTGGCTCAGACGACTCAATGTAAAGCTTACACATTGA

Coding sequence (CDS)

ATGGCTTCCTCCTTGGAGTTTCCTCATAAACTCTCTTTCTTTTTACTTCTACCTTTCTTCCTCCTTCTTCTTCTCCTTCTCTGCTTCTTCCCACCAAATGATCAAATCAACCCTTTCTCATCCATATTATCCAAAAATCTTTTCCCTTTCCATTCATCCAAACAACCCCAGCCACCATTGTCGCCACCCCAATCCACCCTTCAATTTCCTCCAACCACTGCCACTGCCACTGCCCCCTCACAGCCGCAAGATTACTCCTCCACCCGGAAGAAGAAGAGTGAAATGATCGAAGAGGGTTTGTCGGAGGCTCGAGCAGCGATCCGACTAGCGATCGTAACCCGAAACTACACGTCGGAAAAGGAGGAGAGTTTCATACCGAGAGGAAGGGTTTACAGAAACGCATACGCTTTCCATCAGAGTCATATTGAGATGAAGAAGAGGTTAAAAATATGGACATACAAAGAAGGAGAGCAGCCATTGGTGCACGATGGGCCGATGAAACACATATACTCAATTGAGGGCCATTTTATTGACGAAATGGACAGTGGAAAGAGCCCATTTTCGGCCCATGAACCAGAAGAGGCCCAAGTATTTTTCTTGCCTATAAGTATCGTTTATATTGTGGATTACATCTACAAGCCCATTACCACATACGCACGTGACCGTCTCGTTCGAATTTTCACGGATTATGTGAGGGTGGTAGCTAATAAGTACCCTTACTGGAACCGTACACGTGGAGCAGATCATTTCATGGTCTCCTGCCATGATTGGGCGCCGGAAGTAACAAAAGAAGATCCTAACCTCTTCAAATATTTCATCAGAGTTCTTTGCAATGCCAACACATCCGAAGGCTTCAATCCAATGCGAGATGCATCCTTGCCCGAGATTAACTTACCTCCAACTTTCCACCTCAATCTTCCTCGATTAGGCCAACCGCCACAGAACCGCTCAATTCTAGCTTTCTTCGCCGGCGGAGCACACGGATTCATCCGCCACATCCTAATGCAGCATTGGAAAGACAAAGACCATGAAATCCAAGTCCACGAGTACCTTCCTCCATCCCAAAACTACACCGAATTGATCGATCGAAGCAAATTCTGCCTCTGCCCTAGCGGATACGAAGTTGCAAGCCCTAGGTTAGTGGAAGCGATCCACGGCGGTTGTGTACCAGTGGTAATCTCTGATTATTACTCCTTGCCGTTCGATGATGTGCTGGATTGGAGCAAATTCTCGATGCGGATTCCGTCTGAGAGGATTCCGGAGATCAAGACGATCTTGAGAGGAGTTTCGATGAAGAAGTACTTGAAACTACAGCGAGGAGTGATGAAAGTGCAGAGACATTTTGAGATTCATCGGCCGGCGAAGGCGTTTGATATGTTCCATATGGTACTTCACTCTGTTTGGCTCAGACGACTCAATGTAAAGCTTACACATTGA
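As a quick sanity check, the start of the coding sequence can be translated and compared with the query protein shown in the BLAST alignments. The sketch below assumes the standard genetic code (NCBI translation table 1) and uses only the first 33 nucleotides of the CDS; the `translate` helper is a hypothetical name, not part of any tool used to build this page.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Codons containing ambiguous bases (for example N) become 'X'.
    A trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nucleotides of the CDS above (11 codons).
cds_prefix = "ATGGCTTCCTCCTTGGAGTTTCCTCATAAACTC"
print(translate(cds_prefix))  # MASSLEFPHKL
```

The result matches the first residues of the query in the TrEMBL self-hit alignment (`MASSLEFPHKL...`).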
BLAST of CSPI01G16600 vs. Swiss-Prot
Match: GLYT5_ARATH (Probable glycosyltransferase At5g20260 OS=Arabidopsis thaliana GN=At5g20260 PE=3 SV=3)

HSP 1 Score: 539.7 bits (1389), Expect = 3.3e-152
Identity = 262/462 (56.71%), Positives = 352/462 (76.19%), Query Frame = 1

Query: 16  LLPFFLLLLLLLCFFPPNDQINPFSSILSKNLFPFHSSKQPQPPLSPPQSTLQFPPTTAT 75
           L+P  LLLL+LL F+  +   N  S+ LS  +     +  P P LS     ++F   ++ 
Sbjct: 13  LVPTLLLLLVLLVFYQHHSSPNLNSNALSSFVDATSLAPSPSPSLS-----MEFSVASSN 72

Query: 76  ATAPSQPQDYSSTRKKKSEMIEEGLSEARAAIRLAIVTRNYTSEKEESFIPRGRVYRNAY 135
            +  S P +    +  K  +IEEGL+++R+AIR A+  + + S+KEE+F+PRG VYRNA+
Sbjct: 73  LSTISSPPE---NKGNKRNIIEEGLAKSRSAIREAVRLKKFVSDKEETFVPRGAVYRNAF 132

Query: 136 AFHQSHIEMKKRLKIWTYKEGEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAHEPEEA 195
           AFHQSHIEM+K+ K+W Y+EGE PLVH GPM +IYSIEG F+DE+++G SPF+A+ PEEA
Sbjct: 133 AFHQSHIEMEKKFKVWVYREGETPLVHMGPMNNIYSIEGQFMDEIETGMSPFAANNPEEA 192

Query: 196 QVFFLPISIVYIVDYIYKPITTYARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCH 255
             F LP+S+  IV Y+Y+P+ TY+R++L ++F DYV VVA+KYPYWNR+ GADHF VSCH
Sbjct: 193 HAFLLPVSVANIVHYLYRPLVTYSREQLHKVFLDYVDVVAHKYPYWNRSLGADHFYVSCH 252

Query: 256 DWAPEVTKEDPNLFKYFIRVLCNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPP-Q 315
           DWAP+V+  +P L K  IRVLCNANTSEGF P RD S+PEIN+P   HL  PRL +    
Sbjct: 253 DWAPDVSGSNPELMKNLIRVLCNANTSEGFMPQRDVSIPEINIPGG-HLGPPRLSRSSGH 312

Query: 316 NRSILAFFAGGAHGFIRHILMQHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYE 375
           +R ILAFFAGG+HG+IR IL+QHWKDKD E+QVHEYL  +++Y +L+  ++FCLCPSGYE
Sbjct: 313 DRPILAFFAGGSHGYIRRILLQHWKDKDEEVQVHEYLAKNKDYFKLMATARFCLCPSGYE 372

Query: 376 VASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKY 435
           VASPR+V AI+ GCVPV+ISD+Y+LPF DVLDW+KF++ +PS++IPEIKTIL+ +S ++Y
Sbjct: 373 VASPRVVAAINLGCVPVIISDHYALPFSDVLDWTKFTIHVPSKKIPEIKTILKSISWRRY 432

Query: 436 LKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVKL 477
             LQR V++VQRHF I+RP++ FDM  M+LHSVWLRRLN++L
Sbjct: 433 RVLQRRVLQVQRHFVINRPSQPFDMLRMLLHSVWLRRLNLRL 465
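The percentages in each HSP summary line follow directly from the match counts and the alignment length. The sketch below recomputes them for the summary line of the hit above; the `check_stats` helper and its regular expression are illustrative assumptions, not part of BLAST itself.

```python
import re
from typing import Dict, Tuple

# Matches fragments such as "Identity = 262/462 (56.71%)".
STAT = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def check_stats(line: str) -> Dict[str, Tuple[int, int, float]]:
    """Return {label: (matches, alignment_length, percent)} for one line.

    Raises ValueError if a reported percentage disagrees with the counts.
    """
    stats = {}
    for label, num, den, reported in STAT.findall(line):
        recomputed = round(100 * int(num) / int(den), 2)
        if abs(recomputed - float(reported)) > 0.01:
            raise ValueError(f"{label}: {reported}% does not match {num}/{den}")
        stats[label] = (int(num), int(den), recomputed)
    return stats


line = "Identity = 262/462 (56.71%), Positives = 352/462 (76.19%), Query Frame = 1"
stats = check_stats(line)
print(stats)
```

Here 262 identical positions out of 462 aligned columns gives 56.71%, and 352 conservative-or-identical positions gives 76.19%.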

BLAST of CSPI01G16600 vs. Swiss-Prot
Match: GLYT2_ARATH (Probable glycosyltransferase At3g42180 OS=Arabidopsis thaliana GN=At3g42180 PE=2 SV=2)

HSP 1 Score: 498.8 bits (1283), Expect = 6.5e-140
Identity = 267/478 (55.86%), Positives = 331/478 (69.25%), Query Frame = 1

Query: 14  FLLLPFFLLLLLLLCFFP----PNDQINP---FSSILSKNLFPFHSSKQPQPPLSPPQST 73
           F LL F L+L+LLL F      PN++  P   FSS+   +L    ++ Q     S   S+
Sbjct: 9   FCLLGFPLILILLLSFLLFSSFPNNESPPQQFFSSLTMSSLLVHTNALQS----SSSSSS 68

Query: 74  LQFPPTTATATAPSQPQDYSSTRKKKSEMIEEGLSEARAAIRLAIVTRNYTSEKEE-SFI 133
           L  PP T               R+   E  EE L +ARAAIR A+  +N TS +E  ++I
Sbjct: 69  LYSPPITVK-------------RRSNLEKREEELRKARAAIRRAVRFKNCTSNEEVITYI 128

Query: 134 PRGRVYRNAYAFHQSHIEMKKRLKIWTYKEGEQPLVHDGPMKHIYSIEGHFIDE----MD 193
           P G++YRN++AFHQSHIEM K  K+W+YKEGEQPLVHDGP+  IY IEG FIDE    M 
Sbjct: 129 PTGQIYRNSFAFHQSHIEMMKTFKVWSYKEGEQPLVHDGPVNDIYGIEGQFIDELSYVMG 188

Query: 194 SGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKPITTYA---RDRLVRIFTDYVRVVANKY 253
                F A  PEEA  FFLP S+  IV Y+Y+PIT+ A   R RL RIF DYV VVA+K+
Sbjct: 189 GPSGRFRASRPEEAHAFFLPFSVANIVHYVYQPITSPADFNRARLHRIFNDYVDVVAHKH 248

Query: 254 PYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIRVLCNANTSEGFNPMRDASLPEINL 313
           P+WN++ GADHFMVSCHDWAP+V    P  FK F+R LCNANTSEGF    D S+PEIN+
Sbjct: 249 PFWNQSNGADHFMVSCHDWAPDVPDSKPEFFKNFMRGLCNANTSEGFRRNIDFSIPEINI 308

Query: 314 PPTFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHILMQHWKDKDHEIQVHEYLPPSQNYT 373
           P    L  P +GQ P+NR+ILAFFAG AHG+IR +L  HWK KD ++QV+++L   QNY 
Sbjct: 309 PKR-KLKPPFMGQNPENRTILAFFAGRAHGYIREVLFSHWKGKDKDVQVYDHLTKGQNYH 368

Query: 374 ELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWSKFSMRIPSER 433
           ELI  SKFCLCPSGYEVASPR VEAI+ GCVPVVISD YSLPF+DVLDWSKFS+ IP ++
Sbjct: 369 ELIGHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLPFNDVLDWSKFSVEIPVDK 428

Query: 434 IPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVKL 477
           IP+IK IL+ +   KYL++ R VMKV+RHF ++RPA+ FD+ HM+LHSVWLRRLN++L
Sbjct: 429 IPDIKKILQEIPHDKYLRMYRNVMKVRRHFVVNRPAQPFDVIHMILHSVWLRRLNIRL 468

BLAST of CSPI01G16600 vs. Swiss-Prot
Match: GLYT4_ARATH (Probable glycosyltransferase At5g11130 OS=Arabidopsis thaliana GN=At5g11120/At5g11130 PE=3 SV=2)

HSP 1 Score: 493.8 bits (1270), Expect = 2.1e-138
Identity = 241/476 (50.63%), Positives = 339/476 (71.22%), Query Frame = 1

Query: 14  FLLLPFFLLLLLLLCFFPPNDQINPFSSILSKNLFPFHSSKQPQPPLSPPQSTLQFPPTT 73
           FLL P    L+++L F+  N     FSS++  +      S  PQ   S    + +  P  
Sbjct: 10  FLLYPS---LVIILFFYSINHHNQIFSSVVDDDP-SCRLSSSPQAVFS----SFRIFPFR 69

Query: 74  ATATAPSQPQDYSSTRK--------KKSEMIEEGLSEARAAIRLAIVT-----RNYTSEK 133
           ++++  +   + +ST +        +  E IEEGL+ ARAAIR A        R+ T+  
Sbjct: 70  SSSSCLNITSNNNSTSEVVVVEEVDEAVERIEEGLAMARAAIRKAGEKNLRRDRDRTNNS 129

Query: 134 EESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKEGEQPLVHDGPMKHIYSIEGHFIDEM 193
           +   +  G VY NA+ FHQSH EM+KR KIWTY+EGE PL H GP+ +IY+IEG F+DE+
Sbjct: 130 DVGVVSNGSVYLNAFTFHQSHKEMEKRFKIWTYREGEAPLFHKGPLNNIYAIEGQFMDEI 189

Query: 194 DSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKPITTYARDRLVRIFTDYVRVVANKYPY 253
           ++G S F A  PEEA VF++P+ IV I+ ++Y+P T+YARDRL  I  DY+ +++N+YPY
Sbjct: 190 ENGNSRFKAASPEEATVFYIPVGIVNIIRFVYRPYTSYARDRLQNIVKDYISLISNRYPY 249

Query: 254 WNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIRVLCNANTSEGFNPMRDASLPEINLPP 313
           WNR+RGADHF +SCHDWAP+V+  DP L+K+FIR LCNAN+SEGF PMRD SLPEIN+P 
Sbjct: 250 WNRSRGADHFFLSCHDWAPDVSAVDPELYKHFIRALCNANSSEGFTPMRDVSLPEINIPH 309

Query: 314 TFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHILMQHWKDKDHEIQVHEYLPPSQNYTEL 373
           +  L     G+PPQNR +LAFFAGG+HG +R IL QHWK+KD ++ V+E LP + NYT++
Sbjct: 310 S-QLGFVHTGEPPQNRKLLAFFAGGSHGDVRKILFQHWKEKDKDVLVYENLPKTMNYTKM 369

Query: 374 IDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWSKFSMRIPSERIP 433
           +D++KFCLCPSG+EVASPR+VE+++ GCVPV+I+DYY LPF DVL+W  FS+ IP  ++P
Sbjct: 370 MDKAKFCLCPSGWEVASPRIVESLYSGCVPVIIADYYVLPFSDVLNWKTFSVHIPISKMP 429

Query: 434 EIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVKL 477
           +IK IL  ++ ++YL +QR V++V++HF I+RP+K +DM HM++HS+WLRRLNV++
Sbjct: 430 DIKKILEAITEEEYLNMQRRVLEVRKHFVINRPSKPYDMLHMIMHSIWLRRLNVRI 476

BLAST of CSPI01G16600 vs. Swiss-Prot
Match: XGD1_ARATH (Xylogalacturonan beta-1,3-xylosyltransferase OS=Arabidopsis thaliana GN=XGD1 PE=1 SV=2)

HSP 1 Score: 453.8 bits (1166), Expect = 2.4e-126
Identity = 232/440 (52.73%), Positives = 302/440 (68.64%), Query Frame = 1

Query: 58  PPLSPPQSTLQFPPTTATATAPSQPQDYSSTRKKKS--------------EMIEEGLSEA 117
           PPLSP   +       A++++ S   D+ +  K  S              + IE  L++A
Sbjct: 70  PPLSPLGQSNTTNTILASSSSSSSFSDHQNQNKSPSPTSKKIVIRKRSGLDKIESDLAKA 129

Query: 118 RAAIRLAIVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKEGEQPLVHD 177
           RAAI+ A  T+NY S           +Y+N  AFHQSH EM  R K+WTY EGE PL HD
Sbjct: 130 RAAIKKAASTQNYVSS----------LYKNPAAFHQSHTEMMNRFKVWTYTEGEVPLFHD 189

Query: 178 GPMKHIYSIEGHFIDEM----DSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKPITT-- 237
           GP+  IY IEG F+DEM       +S F A  PE A VFF+P S+  ++ ++YKPIT+  
Sbjct: 190 GPVNDIYGIEGQFMDEMCVDGPKSRSRFRADRPENAHVFFIPFSVAKVIHFVYKPITSVE 249

Query: 238 -YARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIRVL 297
            ++R RL R+  DYV VVA K+PYWNR++G DHFMVSCHDWAP+V   +P LF+ FIR L
Sbjct: 250 GFSRARLHRLIEDYVDVVATKHPYWNRSQGGDHFMVSCHDWAPDVIDGNPKLFEKFIRGL 309

Query: 298 CNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHILMQ 357
           CNANTSEGF P  D S+PEI LP    L    LG+ P+ RSILAFFAG +HG IR IL Q
Sbjct: 310 CNANTSEGFRPNVDVSIPEIYLPKG-KLGPSFLGKSPRVRSILAFFAGRSHGEIRKILFQ 369

Query: 358 HWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISDY 417
           HWK+ D+E+QV++ LPP ++YT+ +  SKFCLCPSG+EVASPR VEAI+ GCVPV+ISD 
Sbjct: 370 HWKEMDNEVQVYDRLPPGKDYTKTMGMSKFCLCPSGWEVASPREVEAIYAGCVPVIISDN 429

Query: 418 YSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAKA 477
           YSLPF DVL+W  FS++IP  RI EIKTIL+ VS+ +YLK+ + V++V++HF ++RPAK 
Sbjct: 430 YSLPFSDVLNWDSFSIQIPVSRIKEIKTILQSVSLVRYLKMYKRVLEVKQHFVLNRPAKP 489

BLAST of CSPI01G16600 vs. Swiss-Prot
Match: GLYT3_ARATH (Probable glycosyltransferase At5g03795 OS=Arabidopsis thaliana GN=At5g03795 PE=3 SV=2)

HSP 1 Score: 405.2 bits (1040), Expect = 9.8e-112
Identity = 204/418 (48.80%), Positives = 286/418 (68.42%), Query Frame = 1

Query: 66  TLQFPPTTATATAPSQPQDYSSTRKKKS-----EMIEEGLSEARAAIRLAIVTRNYTSEK 125
           T+Q      TAT+ +     S   KK+      E IE  L +ARA+I+ A +        
Sbjct: 106 TIQLNMINVTATSNNVSSTASLEPKKRRVLSNLEKIEFKLQKARASIKAASMD---DPVD 165

Query: 126 EESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKEGEQPLVHDGPMKHIYSIEGHFIDEM 185
           +  ++P G +Y NA  FH+S++EM+K+ KI+ YKEGE PL HDGP K IYS+EG FI E+
Sbjct: 166 DPDYVPLGPMYWNAKVFHRSYLEMEKQFKIYVYKEGEPPLFHDGPCKSIYSMEGSFIYEI 225

Query: 186 DSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKPITTYARD--RLVRIFTDYVRVVANKY 245
           ++  + F  + P++A VF+LP S+V +V Y+Y+     +RD   +     DY+ +V +KY
Sbjct: 226 ETD-TRFRTNNPDKAHVFYLPFSVVKMVRYVYE---RNSRDFSPIRNTVKDYINLVGDKY 285

Query: 246 PYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIRVLCNANTSEGFNPMRDASLPEINL 305
           PYWNR+ GADHF++SCHDW PE +   P+L    IR LCNANTSE F P +D S+PEINL
Sbjct: 286 PYWNRSIGADHFILSCHDWGPEASFSHPHLGHNSIRALCNANTSERFKPRKDVSIPEINL 345

Query: 306 PPTFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHILMQHWKDKDHEIQVHEYLPPSQNYT 365
                  L   G  P +R ILAFFAGG HG +R +L+QHW++KD++I+VH+YLP   +Y+
Sbjct: 346 RTGSLTGLVG-GPSPSSRPILAFFAGGVHGPVRPVLLQHWENKDNDIRVHKYLPRGTSYS 405

Query: 366 ELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWSKFSMRIPSER 425
           +++  SKFC+CPSGYEVASPR+VEA++ GCVPV+I+  Y  PF DVL+W  FS+ +  E 
Sbjct: 406 DMMRNSKFCICPSGYEVASPRIVEALYSGCVPVLINSGYVPPFSDVLNWRSFSVIVSVED 465

Query: 426 IPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVKL 477
           IP +KTIL  +S ++YL++ R V+KV+RHFE++ PAK FD+FHM+LHS+W+RRLNVK+
Sbjct: 466 IPNLKTILTSISPRQYLRMYRRVLKVRRHFEVNSPAKRFDVFHMILHSIWVRRLNVKI 515

BLAST of CSPI01G16600 vs. TrEMBL
Match: A0A0A0LTL1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G226410 PE=4 SV=1)

HSP 1 Score: 976.9 bits (2524), Expect = 9.1e-282
Identity = 477/478 (99.79%), Positives = 478/478 (100.00%), Query Frame = 1

Query: 1   MASSLEFPHKLSFFLLLPFFLLLLLLLCFFPPNDQINPFSSILSKNLFPFHSSKQPQPPL 60
           MASSLEFPHKLSFFLLLPFFLLLLLLLCFFPPNDQINPFSSILSKNLFPFHSSKQPQPPL
Sbjct: 1   MASSLEFPHKLSFFLLLPFFLLLLLLLCFFPPNDQINPFSSILSKNLFPFHSSKQPQPPL 60

Query: 61  SPPQSTLQFPPTTATATAPSQPQDYSSTRKKKSEMIEEGLSEARAAIRLAIVTRNYTSEK 120
           SPPQSTLQFPPTTATATAPSQPQDYSSTRKKKSEMIEEGL+EARAAIRLAIVTRNYTSEK
Sbjct: 61  SPPQSTLQFPPTTATATAPSQPQDYSSTRKKKSEMIEEGLAEARAAIRLAIVTRNYTSEK 120

Query: 121 EESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKEGEQPLVHDGPMKHIYSIEGHFIDEM 180
           EESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKEGEQPLVHDGPMKHIYSIEGHFIDEM
Sbjct: 121 EESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKEGEQPLVHDGPMKHIYSIEGHFIDEM 180

Query: 181 DSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKPITTYARDRLVRIFTDYVRVVANKYPY 240
           DSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKPITTYARDRLVRIFTDYVRVVANKYPY
Sbjct: 181 DSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKPITTYARDRLVRIFTDYVRVVANKYPY 240

Query: 241 WNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIRVLCNANTSEGFNPMRDASLPEINLPP 300
           WNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIRVLCNANTSEGFNPMRDASLPEINLPP
Sbjct: 241 WNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIRVLCNANTSEGFNPMRDASLPEINLPP 300

Query: 301 TFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHILMQHWKDKDHEIQVHEYLPPSQNYTEL 360
           TFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHILMQHWKDKDHEIQVHEYLPPSQNYTEL
Sbjct: 301 TFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHILMQHWKDKDHEIQVHEYLPPSQNYTEL 360

Query: 361 IDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWSKFSMRIPSERIP 420
           IDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWSKFSMRIPSERIP
Sbjct: 361 IDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWSKFSMRIPSERIP 420

Query: 421 EIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVKLTH 479
           EIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVKLTH
Sbjct: 421 EIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVKLTH 478

BLAST of CSPI01G16600 vs. TrEMBL
Match: A0A061GVS6_THECC (Exostosin family protein OS=Theobroma cacao GN=TCM_038164 PE=4 SV=1)

HSP 1 Score: 582.4 bits (1500), Expect = 5.0e-163
Identity = 295/476 (61.97%), Positives = 359/476 (75.42%), Query Frame = 1

Query: 15  LLLPFFLLLLLLLCFFPPNDQINPFSSILSKNLFPF--------------HSSKQPQPPL 74
           LL P F+ L+LL C  P          +  KNLF F              HS+KQ    L
Sbjct: 11  LLFPAFIPLILL-CLSP----------MYQKNLFIFFPSFSITFTYQNSNHSTKQLLAEL 70

Query: 75  SPPQSTLQFPPTTATATAPSQPQDYSSTRKKKSEMIEEGLSEARAAIRLAIVTRNYTSEK 134
           S         P+ + +T PS        +K +SE +E  L+ ARAAIR AI TRNYTS K
Sbjct: 71  S-----FNISPSPSPST-PSYNAVSCIRKKGRSERVEADLASARAAIREAIRTRNYTSYK 130

Query: 135 EESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKEGEQPLVHDGPMKHIYSIEGHFIDEM 194
           EE FIPRG +YRN YAFHQSHIEM +R KIWTYKEGE+PLVH GPMKHIY+IEG FI+E+
Sbjct: 131 EEKFIPRGCMYRNEYAFHQSHIEMVERFKIWTYKEGERPLVHTGPMKHIYAIEGQFIEEI 190

Query: 195 DSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKPITTYARDRLVRIFTDYVRVVANKYPY 254
           + GKSPF A  P+EA VFFLP+S+ YIV+YIY PITTY+RDRLVRIFTDY++VVA KYPY
Sbjct: 191 EGGKSPFKAQHPDEAHVFFLPVSVAYIVNYIYLPITTYSRDRLVRIFTDYIKVVAKKYPY 250

Query: 255 WNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIRVLCNANTSEGFNPMRDASLPEINLPP 314
           W+RT+GADHFMVSCHDWAPEV  +DP L+K  IRVLCNAN+SEGF+P RD +LPE+NLPP
Sbjct: 251 WSRTKGADHFMVSCHDWAPEVAGQDPELYKNLIRVLCNANSSEGFHPKRDVALPELNLPP 310

Query: 315 TFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHILMQHWKDKDHEIQVHEYLPPSQNYTEL 374
               +  R  QPP  R+ILAFFAGGAHG IR IL+ HWKDKD+E+QVHEYL   Q+Y++L
Sbjct: 311 R-GFSPRRFAQPPDKRTILAFFAGGAHGNIRKILLHHWKDKDNEVQVHEYLSKGQDYSKL 370

Query: 375 IDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWSKFSMRIPSERIP 434
           + RSKFCLCPSG+EVASPR+VE+ + GCVPV+ISD Y LPF DVLDWSKFS++IP E+IP
Sbjct: 371 MGRSKFCLCPSGFEVASPRVVESFYAGCVPVIISDNYVLPFSDVLDWSKFSVQIPVEKIP 430

Query: 435 EIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVKL 477
           +IKTIL+ +   KYL++QR V+K++RHFE++RPAK FD+ HMVLHS+WLRRLN++L
Sbjct: 431 QIKTILQSIPGNKYLEMQRRVLKLRRHFELNRPAKPFDIIHMVLHSIWLRRLNLRL 468

BLAST of CSPI01G16600 vs. TrEMBL
Match: U5G659_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0006s06330g PE=4 SV=1)

HSP 1 Score: 570.1 bits (1468), Expect = 2.6e-159
Identity = 297/490 (60.61%), Positives = 362/490 (73.88%), Query Frame = 1

Query: 7   FPHKLSFFLLLPFFLLLLLLLCFFPPNDQINPFSSILSKNL---FPFHSSKQP--QPPLS 66
           F   + F LL   FLLLL+       N+Q     +I+S  L   FP   + Q    PPL+
Sbjct: 4   FSCPIHFILLSTIFLLLLIYNSPLFKNNQ-----TIISPPLTPIFPLFKNNQTIISPPLT 63

Query: 67  P-------PQSTLQFPPTTA---TATAPSQPQDYSSTRKKKS--EMIEEGLSEARAAIRL 126
           P         +T Q  P      T+T  +      S +KKKS  E IE  L  AR AI+ 
Sbjct: 64  PIFNQHKSNSNTTQVLPQVGSPLTSTNIALNNSIVSHKKKKSGIERIEADLVNARVAIQE 123

Query: 127 AIVTRNYT-SEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKEGEQPLVHDGPMKH 186
           AI  +NYT +EKE++FIPRG +YRNAYAFHQS+ EM KR KIW Y+EGE P+VH+GPMKH
Sbjct: 124 AIRRKNYTLTEKEDAFIPRGSMYRNAYAFHQSYSEMVKRFKIWVYREGETPMVHNGPMKH 183

Query: 187 IYSIEGHFIDEMDSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKPITTYARDRLVRIFT 246
           IYSIEG FIDEM+SGKSPF A   +EA  FFLPIS+ YIV+++Y PITTY R+RLVRIF 
Sbjct: 184 IYSIEGQFIDEMESGKSPFLARNHDEAHAFFLPISVAYIVEFVYLPITTYHRERLVRIFK 243

Query: 247 DYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIRVLCNANTSEGFNPM 306
           DYV VVANKYPYWNR+RG DHFMVSCHDWAP+V+++DP L+K  IRV+CNANTSEGF P 
Sbjct: 244 DYVTVVANKYPYWNRSRGGDHFMVSCHDWAPQVSRDDPELYKNLIRVMCNANTSEGFRPR 303

Query: 307 RDASLPEINLPPTFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHILMQHWKDKDHEIQVH 366
           RDA+LPE+N PP   L     G  P  R I AFFAGGAHG IR IL++HWK+KD EIQVH
Sbjct: 304 RDATLPELNCPP-LKLTPACRGLAPHERKIFAFFAGGAHGDIRKILLRHWKEKDDEIQVH 363

Query: 367 EYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWS 426
           EYLP  Q+Y EL+ +SKFCLCPSG+EVASPR+ E+I+ GCVPV+ISD+Y+LPF DVLDWS
Sbjct: 364 EYLPKDQDYMELMGQSKFCLCPSGFEVASPRVAESIYSGCVPVIISDHYNLPFSDVLDWS 423

Query: 427 KFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVW 479
           +FS++IP E+IPEIKTILRG+S  +YLK+Q+GVMKVQRHF ++RPAK +D+ HMVLHSVW
Sbjct: 424 QFSVQIPVEKIPEIKTILRGISYDEYLKMQKGVMKVQRHFVLNRPAKPYDVLHMVLHSVW 483

BLAST of CSPI01G16600 vs. TrEMBL
Match: A0A0A0LJ06_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G008045 PE=4 SV=1)

HSP 1 Score: 569.7 bits (1467), Expect = 3.4e-159
Identity = 284/474 (59.92%), Positives = 359/474 (75.74%), Query Frame = 1

Query: 13  FFLLLPFFLLLLLLLCFF--PPNDQINPFSSI--LSKNLFPFHSSKQPQPPLSPPQSTLQ 72
           + LLLP  LLLL+ L FF  PP   ++  +    L+ + FP +S ++   P+        
Sbjct: 9   YCLLLPASLLLLVFLQFFSVPPLLDLSQATEAFPLASSFFPINSMREGNKPMKAI----- 68

Query: 73  FPPTTATATAPSQPQDYSSTRKKKS-EMIEEGLSEARAAIRLAIVTRNYTSEKEESFIPR 132
                           +   +KK S +MIE  L+EARA+IR A++ +N+TSEK+E++IPR
Sbjct: 69  ----------------FIKKKKKTSLKMIEASLAEARASIRKAVLWKNFTSEKKETYIPR 128

Query: 133 GRVYRNAYAFHQSHIEMKKRLKIWTYKEGEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPF 192
           G +YRN YAFHQSHIEM KR K+W+Y+EGEQPL HDGP+  IY+IEG FIDE+D  KSPF
Sbjct: 129 GPIYRNPYAFHQSHIEMVKRFKVWSYREGEQPLFHDGPLNSIYAIEGQFIDELDCSKSPF 188

Query: 193 SAHEPEEAQVFFLPISIVYIVDYIYKPITT---YARDRLVRIFTDYVRVVANKYPYWNRT 252
            A  P+EA VF LP+SI  I+ +IY+PIT+   Y RDR+ R+ TDY+RVVAN+YPYWNR+
Sbjct: 189 RASHPDEAHVFLLPLSITNIIHFIYRPITSPADYNRDRMHRVTTDYIRVVANRYPYWNRS 248

Query: 253 RGADHFMVSCHDWAPEVTKEDPNLFKYFIRVLCNANTSEGFNPMRDASLPEINLPPTFHL 312
            GADHF+VSCHDWAPE++  +P LFK FIRV+CNAN +EGF P  D  LPEIN+ P   L
Sbjct: 249 NGADHFVVSCHDWAPEISDANPQLFKNFIRVVCNANITEGFRPNIDIPLPEINIHPGT-L 308

Query: 313 NLPRLGQPPQNRSILAFFAGGAHGFIRHILMQHWKDKDHEIQVHEYLPPSQNYTELIDRS 372
             P LGQPP+ R ILAFFAGGAHG+IR IL++HWK+KD+E+QVHEYLP +QNYT+LI  S
Sbjct: 309 GPPDLGQPPERRPILAFFAGGAHGYIRKILIKHWKEKDNEVQVHEYLPKTQNYTKLIGES 368

Query: 373 KFCLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWSKFSMRIPSERIPEIKT 432
           KFCLCPSGYEVASPR+VEAI+GGCVPV+ISD YSLPF DVLDWS+FS++IP +RIPEIKT
Sbjct: 369 KFCLCPSGYEVASPRVVEAIYGGCVPVIISDNYSLPFSDVLDWSRFSVQIPVQRIPEIKT 428

Query: 433 ILRGVSMKKYLKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVKLTH 479
           IL+ +S +KYLKL +GV+KV+RHF+I+RPAK FD+ HM+LHS+WLRRLN  L H
Sbjct: 429 ILKAISEEKYLKLYKGVIKVKRHFKINRPAKPFDVIHMLLHSLWLRRLNFGLPH 460

BLAST of CSPI01G16600 vs. TrEMBL
Match: A0A059AYA0_EUCGR (Uncharacterized protein (Fragment) OS=Eucalyptus grandis GN=EUGRSUZ_H01381 PE=4 SV=1)

HSP 1 Score: 569.7 bits (1467), Expect = 3.4e-159
Identity = 263/381 (69.03%), Positives = 313/381 (82.15%), Query Frame = 1

Query: 96  IEEGLSEARAAIRLAIVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKE 155
           +E  L+ ARAAIR A+  RNYTS+KEE+FIPRG +YRNAYAFHQSHIEM KR KIW Y+E
Sbjct: 9   VERDLARARAAIRDAVRARNYTSDKEETFIPRGAIYRNAYAFHQSHIEMVKRFKIWNYRE 68

Query: 156 GEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKPI 215
           GE P+VH GP+ +IYSIEGHFIDE++SG SPF A  P++A  FFLPISI  I+ ++Y+P+
Sbjct: 69  GELPMVHIGPVNNIYSIEGHFIDELESGLSPFLARHPDQAHAFFLPISIAGIITFLYRPL 128

Query: 216 TTYARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIRV 275
            +Y RD L+  FTDYV VVA KYP+WNR+ GADHFMVSCHDWAP+VT+EDP+ FK F+RV
Sbjct: 129 VSYDRDPLIHTFTDYVDVVARKYPFWNRSLGADHFMVSCHDWAPDVTREDPDKFKNFMRV 188

Query: 276 LCNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHILM 335
           LCNANTSEGFNP RDASLPE NL P F + +P LG  P  R I AFF+GG HG IR IL+
Sbjct: 189 LCNANTSEGFNPTRDASLPEFNLHP-FKITIPHLGLRPSRRDIFAFFSGGPHGDIRKILL 248

Query: 336 QHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISD 395
            HWKDKD E+QVHEYLP  QNY + + RSKFCLCPSGYEVASPRLVEAIH GCVPV++S 
Sbjct: 249 HHWKDKDSEVQVHEYLPKGQNYMQTMGRSKFCLCPSGYEVASPRLVEAIHSGCVPVILSA 308

Query: 396 YYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAK 455
           YY LPF DVLDWSKFS+ +P E+IPE+K IL+ VS ++YLKLQR V++V++HFE++RPAK
Sbjct: 309 YYPLPFSDVLDWSKFSITVPVEKIPELKAILKRVSERRYLKLQRRVVQVRQHFEVNRPAK 368

Query: 456 AFDMFHMVLHSVWLRRLNVKL 477
            FD+ HMVLHSVWLRRLNV L
Sbjct: 369 PFDVLHMVLHSVWLRRLNVGL 388

BLAST of CSPI01G16600 vs. TAIR10
Match: AT5G20260.1 (AT5G20260.1 Exostosin family protein)

HSP 1 Score: 539.7 bits (1389), Expect = 1.9e-153
Identity = 262/462 (56.71%), Positives = 352/462 (76.19%), Query Frame = 1

Query: 16  LLPFFLLLLLLLCFFPPNDQINPFSSILSKNLFPFHSSKQPQPPLSPPQSTLQFPPTTAT 75
           L+P  LLLL+LL F+  +   N  S+ LS  +     +  P P LS     ++F   ++ 
Sbjct: 5   LVPTLLLLLVLLVFYQHHSSPNLNSNALSSFVDATSLAPSPSPSLS-----MEFSVASSN 64

Query: 76  ATAPSQPQDYSSTRKKKSEMIEEGLSEARAAIRLAIVTRNYTSEKEESFIPRGRVYRNAY 135
            +  S P +    +  K  +IEEGL+++R+AIR A+  + + S+KEE+F+PRG VYRNA+
Sbjct: 65  LSTISSPPE---NKGNKRNIIEEGLAKSRSAIREAVRLKKFVSDKEETFVPRGAVYRNAF 124

Query: 136 AFHQSHIEMKKRLKIWTYKEGEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAHEPEEA 195
           AFHQSHIEM+K+ K+W Y+EGE PLVH GPM +IYSIEG F+DE+++G SPF+A+ PEEA
Sbjct: 125 AFHQSHIEMEKKFKVWVYREGETPLVHMGPMNNIYSIEGQFMDEIETGMSPFAANNPEEA 184

Query: 196 QVFFLPISIVYIVDYIYKPITTYARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCH 255
             F LP+S+  IV Y+Y+P+ TY+R++L ++F DYV VVA+KYPYWNR+ GADHF VSCH
Sbjct: 185 HAFLLPVSVANIVHYLYRPLVTYSREQLHKVFLDYVDVVAHKYPYWNRSLGADHFYVSCH 244

Query: 256 DWAPEVTKEDPNLFKYFIRVLCNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPP-Q 315
           DWAP+V+  +P L K  IRVLCNANTSEGF P RD S+PEIN+P   HL  PRL +    
Sbjct: 245 DWAPDVSGSNPELMKNLIRVLCNANTSEGFMPQRDVSIPEINIPGG-HLGPPRLSRSSGH 304

Query: 316 NRSILAFFAGGAHGFIRHILMQHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYE 375
           +R ILAFFAGG+HG+IR IL+QHWKDKD E+QVHEYL  +++Y +L+  ++FCLCPSGYE
Sbjct: 305 DRPILAFFAGGSHGYIRRILLQHWKDKDEEVQVHEYLAKNKDYFKLMATARFCLCPSGYE 364

Query: 376 VASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKY 435
           VASPR+V AI+ GCVPV+ISD+Y+LPF DVLDW+KF++ +PS++IPEIKTIL+ +S ++Y
Sbjct: 365 VASPRVVAAINLGCVPVIISDHYALPFSDVLDWTKFTIHVPSKKIPEIKTILKSISWRRY 424

Query: 436 LKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVKL 477
             LQR V++VQRHF I+RP++ FDM  M+LHSVWLRRLN++L
Sbjct: 425 RVLQRRVLQVQRHFVINRPSQPFDMLRMLLHSVWLRRLNLRL 457

BLAST of CSPI01G16600 vs. TAIR10
Match: AT3G42180.1 (AT3G42180.1 Exostosin family protein)

HSP 1 Score: 498.8 bits (1283), Expect = 3.7e-141
Identity = 267/478 (55.86%), Positives = 331/478 (69.25%), Query Frame = 1

Query: 14  FLLLPFFLLLLLLLCFFP----PNDQINP---FSSILSKNLFPFHSSKQPQPPLSPPQST 73
           F LL F L+L+LLL F      PN++  P   FSS+   +L    ++ Q     S   S+
Sbjct: 9   FCLLGFPLILILLLSFLLFSSFPNNESPPQQFFSSLTMSSLLVHTNALQS----SSSSSS 68

Query: 74  LQFPPTTATATAPSQPQDYSSTRKKKSEMIEEGLSEARAAIRLAIVTRNYTSEKEE-SFI 133
           L  PP T               R+   E  EE L +ARAAIR A+  +N TS +E  ++I
Sbjct: 69  LYSPPITVK-------------RRSNLEKREEELRKARAAIRRAVRFKNCTSNEEVITYI 128

Query: 134 PRGRVYRNAYAFHQSHIEMKKRLKIWTYKEGEQPLVHDGPMKHIYSIEGHFIDE----MD 193
           P G++YRN++AFHQSHIEM K  K+W+YKEGEQPLVHDGP+  IY IEG FIDE    M 
Sbjct: 129 PTGQIYRNSFAFHQSHIEMMKTFKVWSYKEGEQPLVHDGPVNDIYGIEGQFIDELSYVMG 188

Query: 194 SGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKPITTYA---RDRLVRIFTDYVRVVANKY 253
                F A  PEEA  FFLP S+  IV Y+Y+PIT+ A   R RL RIF DYV VVA+K+
Sbjct: 189 GPSGRFRASRPEEAHAFFLPFSVANIVHYVYQPITSPADFNRARLHRIFNDYVDVVAHKH 248

Query: 254 PYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIRVLCNANTSEGFNPMRDASLPEINL 313
           P+WN++ GADHFMVSCHDWAP+V    P  FK F+R LCNANTSEGF    D S+PEIN+
Sbjct: 249 PFWNQSNGADHFMVSCHDWAPDVPDSKPEFFKNFMRGLCNANTSEGFRRNIDFSIPEINI 308

Query: 314 PPTFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHILMQHWKDKDHEIQVHEYLPPSQNYT 373
           P    L  P +GQ P+NR+ILAFFAG AHG+IR +L  HWK KD ++QV+++L   QNY 
Sbjct: 309 PKR-KLKPPFMGQNPENRTILAFFAGRAHGYIREVLFSHWKGKDKDVQVYDHLTKGQNYH 368

Query: 374 ELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWSKFSMRIPSER 433
           ELI  SKFCLCPSGYEVASPR VEAI+ GCVPVVISD YSLPF+DVLDWSKFS+ IP ++
Sbjct: 369 ELIGHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLPFNDVLDWSKFSVEIPVDK 428

Query: 434 IPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVKL 477
           IP+IK IL+ +   KYL++ R VMKV+RHF ++RPA+ FD+ HM+LHSVWLRRLN++L
Sbjct: 429 IPDIKKILQEIPHDKYLRMYRNVMKVRRHFVVNRPAQPFDVIHMILHSVWLRRLNIRL 468

BLAST of CSPI01G16600 vs. TAIR10
Match: AT5G11130.1 (AT5G11130.1 Exostosin family protein)

HSP 1 Score: 493.8 bits (1270), Expect = 1.2e-139
Identity = 241/476 (50.63%), Positives = 339/476 (71.22%), Query Frame = 1

Query: 14  FLLLPFFLLLLLLLCFFPPNDQINPFSSILSKNLFPFHSSKQPQPPLSPPQSTLQFPPTT 73
           FLL P    L+++L F+  N     FSS++  +      S  PQ   S    + +  P  
Sbjct: 10  FLLYPS---LVIILFFYSINHHNQIFSSVVDDDP-SCRLSSSPQAVFS----SFRIFPFR 69

Query: 74  ATATAPSQPQDYSSTRK--------KKSEMIEEGLSEARAAIRLAIVT-----RNYTSEK 133
           ++++  +   + +ST +        +  E IEEGL+ ARAAIR A        R+ T+  
Sbjct: 70  SSSSCLNITSNNNSTSEVVVVEEVDEAVERIEEGLAMARAAIRKAGEKNLRRDRDRTNNS 129

Query: 134 EESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKEGEQPLVHDGPMKHIYSIEGHFIDEM 193
           +   +  G VY NA+ FHQSH EM+KR KIWTY+EGE PL H GP+ +IY+IEG F+DE+
Sbjct: 130 DVGVVSNGSVYLNAFTFHQSHKEMEKRFKIWTYREGEAPLFHKGPLNNIYAIEGQFMDEI 189

Query: 194 DSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKPITTYARDRLVRIFTDYVRVVANKYPY 253
           ++G S F A  PEEA VF++P+ IV I+ ++Y+P T+YARDRL  I  DY+ +++N+YPY
Sbjct: 190 ENGNSRFKAASPEEATVFYIPVGIVNIIRFVYRPYTSYARDRLQNIVKDYISLISNRYPY 249

Query: 254 WNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIRVLCNANTSEGFNPMRDASLPEINLPP 313
           WNR+RGADHF +SCHDWAP+V+  DP L+K+FIR LCNAN+SEGF PMRD SLPEIN+P 
Sbjct: 250 WNRSRGADHFFLSCHDWAPDVSAVDPELYKHFIRALCNANSSEGFTPMRDVSLPEINIPH 309

Query: 314 TFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHILMQHWKDKDHEIQVHEYLPPSQNYTEL 373
           +  L     G+PPQNR +LAFFAGG+HG +R IL QHWK+KD ++ V+E LP + NYT++
Sbjct: 310 S-QLGFVHTGEPPQNRKLLAFFAGGSHGDVRKILFQHWKEKDKDVLVYENLPKTMNYTKM 369

Query: 374 IDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWSKFSMRIPSERIP 433
           +D++KFCLCPSG+EVASPR+VE+++ GCVPV+I+DYY LPF DVL+W  FS+ IP  ++P
Sbjct: 370 MDKAKFCLCPSGWEVASPRIVESLYSGCVPVIIADYYVLPFSDVLNWKTFSVHIPISKMP 429

Query: 434 EIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVKL 477
           +IK IL  ++ ++YL +QR V++V++HF I+RP+K +DM HM++HS+WLRRLNV++
Sbjct: 430 DIKKILEAITEEEYLNMQRRVLEVRKHFVINRPSKPYDMLHMIMHSIWLRRLNVRI 476

BLAST of CSPI01G16600 vs. TAIR10
Match: AT5G33290.1 (AT5G33290.1 xylogalacturonan deficient 1)

HSP 1 Score: 453.8 bits (1166), Expect = 1.4e-127
Identity = 232/440 (52.73%), Positives = 302/440 (68.64%), Query Frame = 1

Query: 58  PPLSPPQSTLQFPPTTATATAPSQPQDYSSTRKKKS--------------EMIEEGLSEA 117
           PPLSP   +       A++++ S   D+ +  K  S              + IE  L++A
Sbjct: 70  PPLSPLGQSNTTNTILASSSSSSSFSDHQNQNKSPSPTSKKIVIRKRSGLDKIESDLAKA 129

Query: 118 RAAIRLAIVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKEGEQPLVHD 177
           RAAI+ A  T+NY S           +Y+N  AFHQSH EM  R K+WTY EGE PL HD
Sbjct: 130 RAAIKKAASTQNYVSS----------LYKNPAAFHQSHTEMMNRFKVWTYTEGEVPLFHD 189

Query: 178 GPMKHIYSIEGHFIDEM----DSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKPITT-- 237
           GP+  IY IEG F+DEM       +S F A  PE A VFF+P S+  ++ ++YKPIT+  
Sbjct: 190 GPVNDIYGIEGQFMDEMCVDGPKSRSRFRADRPENAHVFFIPFSVAKVIHFVYKPITSVE 249

Query: 238 -YARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIRVL 297
            ++R RL R+  DYV VVA K+PYWNR++G DHFMVSCHDWAP+V   +P LF+ FIR L
Sbjct: 250 GFSRARLHRLIEDYVDVVATKHPYWNRSQGGDHFMVSCHDWAPDVIDGNPKLFEKFIRGL 309

Query: 298 CNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHILMQ 357
           CNANTSEGF P  D S+PEI LP    L    LG+ P+ RSILAFFAG +HG IR IL Q
Sbjct: 310 CNANTSEGFRPNVDVSIPEIYLPKG-KLGPSFLGKSPRVRSILAFFAGRSHGEIRKILFQ 369

Query: 358 HWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISDY 417
           HWK+ D+E+QV++ LPP ++YT+ +  SKFCLCPSG+EVASPR VEAI+ GCVPV+ISD 
Sbjct: 370 HWKEMDNEVQVYDRLPPGKDYTKTMGMSKFCLCPSGWEVASPREVEAIYAGCVPVIISDN 429

Query: 418 YSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAKA 477
           YSLPF DVL+W  FS++IP  RI EIKTIL+ VS+ +YLK+ + V++V++HF ++RPAK 
Sbjct: 430 YSLPFSDVLNWDSFSIQIPVSRIKEIKTILQSVSLVRYLKMYKRVLEVKQHFVLNRPAKP 489

BLAST of CSPI01G16600 vs. TAIR10
Match: AT5G03795.1 (AT5G03795.1 Exostosin family protein)

HSP 1 Score: 405.2 bits (1040), Expect = 5.5e-113
Identity = 204/418 (48.80%), Positives = 286/418 (68.42%), Query Frame = 1

Query: 66  TLQFPPTTATATAPSQPQDYSSTRKKKS-----EMIEEGLSEARAAIRLAIVTRNYTSEK 125
           T+Q      TAT+ +     S   KK+      E IE  L +ARA+I+ A +        
Sbjct: 106 TIQLNMINVTATSNNVSSTASLEPKKRRVLSNLEKIEFKLQKARASIKAASMD---DPVD 165

Query: 126 EESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKEGEQPLVHDGPMKHIYSIEGHFIDEM 185
           +  ++P G +Y NA  FH+S++EM+K+ KI+ YKEGE PL HDGP K IYS+EG FI E+
Sbjct: 166 DPDYVPLGPMYWNAKVFHRSYLEMEKQFKIYVYKEGEPPLFHDGPCKSIYSMEGSFIYEI 225

Query: 186 DSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKPITTYARD--RLVRIFTDYVRVVANKY 245
           ++  + F  + P++A VF+LP S+V +V Y+Y+     +RD   +     DY+ +V +KY
Sbjct: 226 ETD-TRFRTNNPDKAHVFYLPFSVVKMVRYVYE---RNSRDFSPIRNTVKDYINLVGDKY 285

Query: 246 PYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIRVLCNANTSEGFNPMRDASLPEINL 305
           PYWNR+ GADHF++SCHDW PE +   P+L    IR LCNANTSE F P +D S+PEINL
Sbjct: 286 PYWNRSIGADHFILSCHDWGPEASFSHPHLGHNSIRALCNANTSERFKPRKDVSIPEINL 345

Query: 306 PPTFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHILMQHWKDKDHEIQVHEYLPPSQNYT 365
                  L   G  P +R ILAFFAGG HG +R +L+QHW++KD++I+VH+YLP   +Y+
Sbjct: 346 RTGSLTGLVG-GPSPSSRPILAFFAGGVHGPVRPVLLQHWENKDNDIRVHKYLPRGTSYS 405

Query: 366 ELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWSKFSMRIPSER 425
           +++  SKFC+CPSGYEVASPR+VEA++ GCVPV+I+  Y  PF DVL+W  FS+ +  E 
Sbjct: 406 DMMRNSKFCICPSGYEVASPRIVEALYSGCVPVLINSGYVPPFSDVLNWRSFSVIVSVED 465

Query: 426 IPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVKL 477
           IP +KTIL  +S ++YL++ R V+KV+RHFE++ PAK FD+FHM+LHS+W+RRLNVK+
Sbjct: 466 IPNLKTILTSISPRQYLRMYRRVLKVRRHFEVNSPAKRFDVFHMILHSIWVRRLNVKI 515

BLAST of CSPI01G16600 vs. NCBI nr
Match: gi|778659929|ref|XP_011655344.1| (PREDICTED: probable glycosyltransferase At5g20260 [Cucumis sativus])

HSP 1 Score: 976.9 bits (2524), Expect = 1.3e-281
Identity = 477/478 (99.79%), Positives = 478/478 (100.00%), Query Frame = 1

Query: 1   MASSLEFPHKLSFFLLLPFFLLLLLLLCFFPPNDQINPFSSILSKNLFPFHSSKQPQPPL 60
           MASSLEFPHKLSFFLLLPFFLLLLLLLCFFPPNDQINPFSSILSKNLFPFHSSKQPQPPL
Sbjct: 1   MASSLEFPHKLSFFLLLPFFLLLLLLLCFFPPNDQINPFSSILSKNLFPFHSSKQPQPPL 60

Query: 61  SPPQSTLQFPPTTATATAPSQPQDYSSTRKKKSEMIEEGLSEARAAIRLAIVTRNYTSEK 120
           SPPQSTLQFPPTTATATAPSQPQDYSSTRKKKSEMIEEGL+EARAAIRLAIVTRNYTSEK
Sbjct: 61  SPPQSTLQFPPTTATATAPSQPQDYSSTRKKKSEMIEEGLAEARAAIRLAIVTRNYTSEK 120

Query: 121 EESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKEGEQPLVHDGPMKHIYSIEGHFIDEM 180
           EESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKEGEQPLVHDGPMKHIYSIEGHFIDEM
Sbjct: 121 EESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKEGEQPLVHDGPMKHIYSIEGHFIDEM 180

Query: 181 DSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKPITTYARDRLVRIFTDYVRVVANKYPY 240
           DSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKPITTYARDRLVRIFTDYVRVVANKYPY
Sbjct: 181 DSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKPITTYARDRLVRIFTDYVRVVANKYPY 240

Query: 241 WNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIRVLCNANTSEGFNPMRDASLPEINLPP 300
           WNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIRVLCNANTSEGFNPMRDASLPEINLPP
Sbjct: 241 WNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIRVLCNANTSEGFNPMRDASLPEINLPP 300

Query: 301 TFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHILMQHWKDKDHEIQVHEYLPPSQNYTEL 360
           TFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHILMQHWKDKDHEIQVHEYLPPSQNYTEL
Sbjct: 301 TFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHILMQHWKDKDHEIQVHEYLPPSQNYTEL 360

Query: 361 IDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWSKFSMRIPSERIP 420
           IDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWSKFSMRIPSERIP
Sbjct: 361 IDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWSKFSMRIPSERIP 420

Query: 421 EIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVKLTH 479
           EIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVKLTH
Sbjct: 421 EIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVKLTH 478

BLAST of CSPI01G16600 vs. NCBI nr
Match: gi|659086442|ref|XP_008443936.1| (PREDICTED: probable glycosyltransferase At5g20260 [Cucumis melo])

HSP 1 Score: 925.6 bits (2391), Expect = 3.5e-266
Identity = 454/480 (94.58%), Positives = 462/480 (96.25%), Query Frame = 1

Query: 1   MASSLEFPHKLSFFLLLPFFLLLLLLLCFFPPNDQINPFSSILSKNLFPFHSSKQPQPPL 60
           MASS EF HKLSFFLLLPFFLLLLLLLCFFPPN+QINPFSSILSKNLF FHS KQPQ P 
Sbjct: 1   MASSFEFLHKLSFFLLLPFFLLLLLLLCFFPPNEQINPFSSILSKNLFLFHSFKQPQQPF 60

Query: 61  SPPQSTLQFPPTTATATA--PSQPQDYSSTRKKKSEMIEEGLSEARAAIRLAIVTRNYTS 120
           SPPQSTLQFPP TA +    PS PQDYSSTRKKKSEMIEEGL+EARAAIR AIVTRNYTS
Sbjct: 61  SPPQSTLQFPPATAPSAIIPPSPPQDYSSTRKKKSEMIEEGLAEARAAIRQAIVTRNYTS 120

Query: 121 EKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKEGEQPLVHDGPMKHIYSIEGHFID 180
           EKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKEGEQPLVHDGPMKHIYSIEGHFID
Sbjct: 121 EKEESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKEGEQPLVHDGPMKHIYSIEGHFID 180

Query: 181 EMDSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKPITTYARDRLVRIFTDYVRVVANKY 240
           EMDSGKSPFSAH+PEEA VFFLPISIVYIVDYIYKPITTYARDRLVRIFTDYVRVVANKY
Sbjct: 181 EMDSGKSPFSAHDPEEAHVFFLPISIVYIVDYIYKPITTYARDRLVRIFTDYVRVVANKY 240

Query: 241 PYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIRVLCNANTSEGFNPMRDASLPEINL 300
           PYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIRVLCNANTSEGFNPMRDASLPEINL
Sbjct: 241 PYWNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIRVLCNANTSEGFNPMRDASLPEINL 300

Query: 301 PPTFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHILMQHWKDKDHEIQVHEYLPPSQNYT 360
           PPTFHLNLPR GQPPQNRSILAFFAGGAHGFIRH+LMQHWKDKD EIQVHEYLPP++NYT
Sbjct: 301 PPTFHLNLPRSGQPPQNRSILAFFAGGAHGFIRHVLMQHWKDKDDEIQVHEYLPPAKNYT 360

Query: 361 ELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWSKFSMRIPSER 420
           ELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPV+ISDYYSLPFDDVLDWSKFSMRIPSER
Sbjct: 361 ELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVIISDYYSLPFDDVLDWSKFSMRIPSER 420

Query: 421 IPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVKLTH 479
           IPEIK ILRGVSMKKYLKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVKLTH
Sbjct: 421 IPEIKKILRGVSMKKYLKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVKLTH 480

BLAST of CSPI01G16600 vs. NCBI nr
Match: gi|1009110120|ref|XP_015893842.1| (PREDICTED: probable glycosyltransferase At5g20260 [Ziziphus jujuba])

HSP 1 Score: 583.6 bits (1503), Expect = 3.2e-163
Identity = 286/472 (60.59%), Positives = 353/472 (74.79%), Query Frame = 1

Query: 14  FLLLPFFLLLLLLLCFFPPNDQINPFSSILSKNLFPFHSSKQPQPPLSPPQSTLQFPPTT 73
           FL +  F+++LL++CF    +  N F+       F F S+       +   +T      T
Sbjct: 10  FLFVQTFIVILLIICFCYYPEHQNHFT------FFNFSSNTTTDITTTTTTTTTTTTANT 69

Query: 74  ATATAPSQPQD---YSST----RKKKSEMIEEGLSEARAAIRLAIVTRNYTSEKEESFIP 133
            +  + + PQ    YSS     RK + + +EE L+ ARA+I  A+  RNYTS+K ESFIP
Sbjct: 70  TSNLSHTNPQTHHHYSSANQNKRKSRLDKVEEDLARARASILKAVRYRNYTSDKVESFIP 129

Query: 134 RGRVYRNAYAFHQSHIEMKKRLKIWTYKEGEQPLVHDGPMKHIYSIEGHFIDEMDSGKSP 193
           RG  YRNAYAFHQSHIEM KR KIW YKEG++PL H GPMKHIYSIEGHFIDEM+SGKSP
Sbjct: 130 RGCAYRNAYAFHQSHIEMVKRFKIWAYKEGDRPLFHSGPMKHIYSIEGHFIDEMESGKSP 189

Query: 194 FSAHEPEEAQVFFLPISIVYIVDYIYKPITTYARDRLVRIFTDYVRVVANKYPYWNRTRG 253
           F AH P+EA  FFLP+S+  IV+YIY PIT+Y RDRLVR+ TDYV++V +KYP WNR+ G
Sbjct: 190 FMAHHPDEAHAFFLPVSVANIVEYIYLPITSYDRDRLVRVVTDYVKIVGDKYPCWNRSSG 249

Query: 254 ADHFMVSCHDWAPEVTKEDPNLFKYFIRVLCNANTSEGFNPMRDASLPEINLPPTFHLNL 313
           ADHFM+SCHDWAPE T +DP+LFK FIRVLCNANTSEGF P+RD SLPE+NL P + L+ 
Sbjct: 250 ADHFMLSCHDWAPEATHDDPHLFKNFIRVLCNANTSEGFKPLRDVSLPELNLSPYWELSK 309

Query: 314 PRLGQPPQNRSILAFFAGGAHGFIRHILMQHWKDKDHEIQVHEYLPPSQNYTELIDRSKF 373
           P   QPP NR++LAFFAGGAHG+IR IL  HWK+KD E+QV++ LP   NY +++ +SKF
Sbjct: 310 PSFAQPPDNRTVLAFFAGGAHGYIRQILFDHWKEKDDEVQVYKDLPKDLNYDKMMGQSKF 369

Query: 374 CLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWSKFSMRIPSERIPEIKTIL 433
           CLCPSGYEVASPR+VE+I  GCVPV+ISD+Y+LPF DVLDWSKFS+ IPS RIPEIK IL
Sbjct: 370 CLCPSGYEVASPRVVESIQAGCVPVIISDHYALPFSDVLDWSKFSLHIPSNRIPEIKKIL 429

Query: 434 RGVSMKKYLKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVKLTH 479
             V   +YLK+Q    KV+RHF+++RPAK FD+FHMVLHSVWLRRLN+ + H
Sbjct: 430 ISVPHSRYLKMQERGTKVRRHFQLNRPAKPFDVFHMVLHSVWLRRLNIGILH 475

BLAST of CSPI01G16600 vs. NCBI nr
Match: gi|590578652|ref|XP_007013570.1| (Exostosin family protein [Theobroma cacao])

HSP 1 Score: 582.4 bits (1500), Expect = 7.2e-163
Identity = 295/476 (61.97%), Positives = 359/476 (75.42%), Query Frame = 1

Query: 15  LLLPFFLLLLLLLCFFPPNDQINPFSSILSKNLFPF--------------HSSKQPQPPL 74
           LL P F+ L+LL C  P          +  KNLF F              HS+KQ    L
Sbjct: 11  LLFPAFIPLILL-CLSP----------MYQKNLFIFFPSFSITFTYQNSNHSTKQLLAEL 70

Query: 75  SPPQSTLQFPPTTATATAPSQPQDYSSTRKKKSEMIEEGLSEARAAIRLAIVTRNYTSEK 134
           S         P+ + +T PS        +K +SE +E  L+ ARAAIR AI TRNYTS K
Sbjct: 71  S-----FNISPSPSPST-PSYNAVSCIRKKGRSERVEADLASARAAIREAIRTRNYTSYK 130

Query: 135 EESFIPRGRVYRNAYAFHQSHIEMKKRLKIWTYKEGEQPLVHDGPMKHIYSIEGHFIDEM 194
           EE FIPRG +YRN YAFHQSHIEM +R KIWTYKEGE+PLVH GPMKHIY+IEG FI+E+
Sbjct: 131 EEKFIPRGCMYRNEYAFHQSHIEMVERFKIWTYKEGERPLVHTGPMKHIYAIEGQFIEEI 190

Query: 195 DSGKSPFSAHEPEEAQVFFLPISIVYIVDYIYKPITTYARDRLVRIFTDYVRVVANKYPY 254
           + GKSPF A  P+EA VFFLP+S+ YIV+YIY PITTY+RDRLVRIFTDY++VVA KYPY
Sbjct: 191 EGGKSPFKAQHPDEAHVFFLPVSVAYIVNYIYLPITTYSRDRLVRIFTDYIKVVAKKYPY 250

Query: 255 WNRTRGADHFMVSCHDWAPEVTKEDPNLFKYFIRVLCNANTSEGFNPMRDASLPEINLPP 314
           W+RT+GADHFMVSCHDWAPEV  +DP L+K  IRVLCNAN+SEGF+P RD +LPE+NLPP
Sbjct: 251 WSRTKGADHFMVSCHDWAPEVAGQDPELYKNLIRVLCNANSSEGFHPKRDVALPELNLPP 310

Query: 315 TFHLNLPRLGQPPQNRSILAFFAGGAHGFIRHILMQHWKDKDHEIQVHEYLPPSQNYTEL 374
               +  R  QPP  R+ILAFFAGGAHG IR IL+ HWKDKD+E+QVHEYL   Q+Y++L
Sbjct: 311 R-GFSPRRFAQPPDKRTILAFFAGGAHGNIRKILLHHWKDKDNEVQVHEYLSKGQDYSKL 370

Query: 375 IDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWSKFSMRIPSERIP 434
           + RSKFCLCPSG+EVASPR+VE+ + GCVPV+ISD Y LPF DVLDWSKFS++IP E+IP
Sbjct: 371 MGRSKFCLCPSGFEVASPRVVESFYAGCVPVIISDNYVLPFSDVLDWSKFSVQIPVEKIP 430

Query: 435 EIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVKL 477
           +IKTIL+ +   KYL++QR V+K++RHFE++RPAK FD+ HMVLHS+WLRRLN++L
Sbjct: 431 QIKTILQSIPGNKYLEMQRRVLKLRRHFELNRPAKPFDIIHMVLHSIWLRRLNLRL 468

BLAST of CSPI01G16600 vs. NCBI nr
Match: gi|778666208|ref|XP_011648705.1| (PREDICTED: probable glycosyltransferase At3g42180 [Cucumis sativus])

HSP 1 Score: 571.2 bits (1471), Expect = 1.7e-159
Identity = 266/395 (67.34%), Positives = 330/395 (83.54%), Query Frame = 1

Query: 89  RKKKS--EMIEEGLSEARAAIRLAIVTRNYTSEKEESFIPRGRVYRNAYAFHQSHIEMKK 148
           +KKK+  +MIE  L+EARA+IR A++ +N+TSEK+E++IPRG +YRN YAFHQSHIEM K
Sbjct: 4   KKKKTSLKMIEASLAEARASIRKAVLWKNFTSEKKETYIPRGPIYRNPYAFHQSHIEMVK 63

Query: 149 RLKIWTYKEGEQPLVHDGPMKHIYSIEGHFIDEMDSGKSPFSAHEPEEAQVFFLPISIVY 208
           R K+W+Y+EGEQPL HDGP+  IY+IEG FIDE+D  KSPF A  P+EA VF LP+SI  
Sbjct: 64  RFKVWSYREGEQPLFHDGPLNSIYAIEGQFIDELDCSKSPFRASHPDEAHVFLLPLSITN 123

Query: 209 IVDYIYKPITT---YARDRLVRIFTDYVRVVANKYPYWNRTRGADHFMVSCHDWAPEVTK 268
           I+ +IY+PIT+   Y RDR+ R+ TDY+RVVAN+YPYWNR+ GADHF+VSCHDWAPE++ 
Sbjct: 124 IIHFIYRPITSPADYNRDRMHRVTTDYIRVVANRYPYWNRSNGADHFVVSCHDWAPEISD 183

Query: 269 EDPNLFKYFIRVLCNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQPPQNRSILAFFA 328
            +P LFK FIRV+CNAN +EGF P  D  LPEIN+ P   L  P LGQPP+ R ILAFFA
Sbjct: 184 ANPQLFKNFIRVVCNANITEGFRPNIDIPLPEINIHPGT-LGPPDLGQPPERRPILAFFA 243

Query: 329 GGAHGFIRHILMQHWKDKDHEIQVHEYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEA 388
           GGAHG+IR IL++HWK+KD+E+QVHEYLP +QNYT+LI  SKFCLCPSGYEVASPR+VEA
Sbjct: 244 GGAHGYIRKILIKHWKEKDNEVQVHEYLPKTQNYTKLIGESKFCLCPSGYEVASPRVVEA 303

Query: 389 IHGGCVPVVISDYYSLPFDDVLDWSKFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMK 448
           I+GGCVPV+ISD YSLPF DVLDWS+FS++IP +RIPEIKTIL+ +S +KYLKL +GV+K
Sbjct: 304 IYGGCVPVIISDNYSLPFSDVLDWSRFSVQIPVQRIPEIKTILKAISEEKYLKLYKGVIK 363

Query: 449 VQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVKLTH 479
           V+RHF+I+RPAK FD+ HM+LHS+WLRRLN  L H
Sbjct: 364 VKRHFKINRPAKPFDVIHMLLHSLWLRRLNFGLPH 397

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
GLYT5_ARATH  3.3e-152  56.71     Probable glycosyltransferase At5g20260 OS=Arabidopsis thaliana GN=At5g20260 PE=3...
GLYT2_ARATH  6.5e-140  55.86     Probable glycosyltransferase At3g42180 OS=Arabidopsis thaliana GN=At3g42180 PE=2...
GLYT4_ARATH  2.1e-138  50.63     Probable glycosyltransferase At5g11130 OS=Arabidopsis thaliana GN=At5g11120/At5g...
XGD1_ARATH   2.4e-126  52.73     Xylogalacturonan beta-1,3-xylosyltransferase OS=Arabidopsis thaliana GN=XGD1 PE=...
GLYT3_ARATH  9.8e-112  48.80     Probable glycosyltransferase At5g03795 OS=Arabidopsis thaliana GN=At5g03795 PE=3...

Match Name        E-value   Identity  Description
A0A0A0LTL1_CUCSA  9.1e-282  99.79     Uncharacterized protein OS=Cucumis sativus GN=Csa_1G226410 PE=4 SV=1
A0A061GVS6_THECC  5.0e-163  61.97     Exostosin family protein OS=Theobroma cacao GN=TCM_038164 PE=4 SV=1
U5G659_POPTR      2.6e-159  60.61     Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0006s06330g PE=4 SV=1
A0A0A0LJ06_CUCSA  3.4e-159  59.92     Uncharacterized protein OS=Cucumis sativus GN=Csa_2G008045 PE=4 SV=1
A0A059AYA0_EUCGR  3.4e-159  69.03     Uncharacterized protein (Fragment) OS=Eucalyptus grandis GN=EUGRSUZ_H01381 PE=4 ...

Match Name   E-value   Identity  Description
AT5G20260.1  1.9e-153  56.71     Exostosin family protein
AT3G42180.1  3.7e-141  55.86     Exostosin family protein
AT5G11130.1  1.2e-139  50.63     Exostosin family protein
AT5G33290.1  1.4e-127  52.73     xylogalacturonan deficient 1
AT5G03795.1  5.5e-113  48.80     Exostosin family protein

Match Name                         E-value   Identity  Description
gi|778659929|ref|XP_011655344.1|   1.3e-281  99.79     PREDICTED: probable glycosyltransferase At5g20260 [Cucumis sativus]
gi|659086442|ref|XP_008443936.1|   3.5e-266  94.58     PREDICTED: probable glycosyltransferase At5g20260 [Cucumis melo]
gi|1009110120|ref|XP_015893842.1|  3.2e-163  60.59     PREDICTED: probable glycosyltransferase At5g20260 [Ziziphus jujuba]
gi|590578652|ref|XP_007013570.1|   7.2e-163  61.97     Exostosin family protein [Theobroma cacao]
gi|778666208|ref|XP_011648705.1|   1.7e-159  67.34     PREDICTED: probable glycosyltransferase At3g42180 [Cucumis sativus]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR004263  Exostosin
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008152      metabolic process
biological_process  GO:0008150      biological_process
biological_process  GO:0006486      protein glycosylation
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0050508      glucuronosyl-N-acetylglucosaminyl-proteoglycan 4-alpha-N-acetylglucosaminyltransferase activity
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0016757      transferase activity, transferring glycosyl groups
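The GO assignments listed above mix informative terms with the three ontology root terms (GO:0008150, GO:0005575, GO:0003674), which only restate the category name. A minimal sketch of grouping the rows by category and setting the roots aside follows. The helper name `group_go_terms` is hypothetical, and the rows are copied from the listing above as whitespace-separated text.

```python
from collections import defaultdict

# Rows copied from the GO listing: category, accession, term name.
GO_ROWS = """\
biological_process GO:0008152 metabolic process
biological_process GO:0008150 biological_process
biological_process GO:0006486 protein glycosylation
cellular_component GO:0005575 cellular_component
molecular_function GO:0050508 glucuronosyl-N-acetylglucosaminyl-proteoglycan 4-alpha-N-acetylglucosaminyltransferase activity
molecular_function GO:0003674 molecular_function
molecular_function GO:0016757 transferase activity, transferring glycosyl groups
"""

# Root terms of the three GO namespaces; they carry no specific annotation.
GO_ROOTS = {"GO:0008150", "GO:0005575", "GO:0003674"}


def group_go_terms(text, drop_roots=True):
    """Group (accession, name) pairs by GO category."""
    grouped = defaultdict(list)
    for row in text.splitlines():
        if not row.strip():
            continue
        # The term name may contain spaces, so split on the first two gaps only.
        category, accession, name = row.split(None, 2)
        if drop_roots and accession in GO_ROOTS:
            continue
        grouped[category].append((accession, name.strip()))
    return dict(grouped)


for category, terms in group_go_terms(GO_ROWS).items():
    print(category, [accession for accession, _ in terms])
```

With the roots dropped, no cellular_component annotation remains, since the only term in that category is the root itself.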

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI01G16600.1  CSPI01G16600.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description   Source   Source Term     Source Description                                      Alignment
IPR004263  Exostosin-like    PFAM     PF03016         Exostosin                                               coord: 144..427; score: 1.1
None       No IPR available  PANTHER  PTHR11062       EXOSTOSIN HEPARAN SULFATE GLYCOSYLTRANSFERASE -RELATED  coord: 94..476; score: 7.3E
None       No IPR available  PANTHER  PTHR11062:SF61  XYLOGALACTURONAN BETA-1,3-XYLOSYLTRANSFERASE            coord: 94..476; score: 7.3E
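The `coord:` entries in the InterPro annotations give the start and end residue of each match on the protein. A small sketch for turning such a range into a span length is shown below. It assumes the coordinates are 1-based and inclusive, and `span_length` is a hypothetical helper name.

```python
import re

# Matches a "coord: start..end" range as written in the Alignment field.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_length(alignment_field):
    """Number of residues covered by a 'coord: start..end' range.

    Assumes 1-based, inclusive coordinates.
    """
    m = COORD.search(alignment_field)
    if m is None:
        raise ValueError("no 'coord: start..end' range found")
    start, end = int(m.group(1)), int(m.group(2))
    return end - start + 1


print(span_length("coord: 144..427"))  # PF03016 Exostosin match -> 284
print(span_length("coord: 94..476"))   # PTHR11062 matches -> 383
```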