CSPI01G34330 (gene) Wild cucumber (PI 183967)

NameCSPI01G34330
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionExostosin family protein
LocationChr1 : 29233209 .. 29237365 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AACTTTGTAAGTTTGATCGTATTTCTGATATTTCCGAAGCTCACAGAAATCATGAGAGAAAGAAAAACAAAAGAAAACAAAAGTATATTGGAATTGGAAATTGTAGAATTCCTCGCGGACAATCGACGGTGCTTCTCACGCGCTCTACCAATGTATTCCGCCATTAAAGCCTGAACCTTAGTCTTCTTCCTCACAGTTCCTTCCCTTCTTTGCCGTTCTACAAAATTCAATTCAATTCACATTCACATTCACAGCTTCCATTCTCTCCAACAATGGCGCGAAAATCTTCTCTCCTCAAGCGAACCCTAGCTTCTCTCTGCTTTATACTTGCCCTATATGCCATCATCAACACCTTCATCAGCTCCACCGCTACTCTCAAGCTCGACCGTTCCTTCCCCTTCAGTTCTGCCAATTCCGTTATCGTCTCCGATGAATTTTCTTCCCAGGACACTGATCTTCTCAATTCCTCTGGGAAATCCCTCTCCCCGGTCAAGATTTACCTCTACGATGTCCCCACTAGGTTTACGTACGGTGTTATTGAGAACCACGGCATAGCTCGCGGTGGAAAACCCGTCCCTGACGTCACCGACCTTAAGTATCCTGGTCATCAGCATATGGCCGAGTGGTTTCTGTTCACGGATCTGCTTCGACCTGAGTCGGAGCGTATTGGGTCTGCTGTGGTTAGGGTGTTCGATCCAGAGGAGGCGGATTTGTTCTATGTACCCTTCTTCTCGTCTTTGAGCTTGATCGTCAACCCTATCCGACCTGCAACCGGGTCGGATCAGCAGCAGCGGAAGCTTGTGTACAGTGATGAGGAGACGCAGGATGCCTTTATGGAGTGGTTGGAGAAGCAAGAGTACTGGAAACGGAGCAATGGGCGGGATCATGTCATTATAGCGCAGGACCCGAATGCATTGTACAGGCTGATTGATAGGGTTAAGAACAGTATCTTGCTTGTTTCGGATTTCGGGAGGTTGAGGGCTGACCAAGCGTCGCTAGTTAAGGACGTCATTGTACCATACTCACATAGGATCAATACTTATACTGGTGACATTGGTGTTGAGAATCGGAAGACTTTATTGTTCTTCATGGGCAATCGATATCGTAAAGAGGTATATTTATGCAATGTCTTGAAATTTAATTGACAATGGATATTGTTGAGGATCCTTCTGTTATTGTTCCTTAAACCTTTCTTTTGGTTATTATACATAATTAGCTTTATGGGTTTAGAAAGTTGTGAAGACTTCCATGGTTGATCGTCATGGGGGAGGAAATTGCATGTCTGGAAGAAACTTGTGTGGGAAGATAGCAGACTGTTAGTAGAGTAGAAATTGCATATAACCCTTTTAGCATTAAGCTCTAGAACTCTGCAGCTGGTCCCACTAAATGTTGGCGGTAGAAGTTGTCATCTGTTCTTTTAAAGGTTCCTTTAATTCTACTGTTAGCTGAAAAGGATAGTGTTAAGAGGGCCTTTAGTTTGTTTACGTTTAGGATATGATGGTTATACATTCTAGGACCAAGTCTAGGAACTCATTTATTCTTGAGTTCCTCCCTAAATTGCACATAATAATATGAATTTGATTTTGATGTTTTTTGGTTTGTATAGGGGGTAGTTTCTTTTTAAGCACGGGGATTTGAAATTGTTTTTTCTTCCTTTTTCTTTTCTTTGTAATCTCTTAGTTTCCTAACATGAACCTCAACTAGTAATGCTGAACTATTACTAGATCTTATGGAAATAATGTAGGAAACACAGATATCTATACAATGGATGATCCTACATTTGAATGGATAAATCATCTTTAAGTTTTTTACTAACTTTTTCCTCCCGACGAGTCAATTCAATTCTTTACTTCTTTTTTTTTTTTGGAGAACTTGGAGATAACAGCCAACTCTCAACACTTCCAAAAATCCCCACTCATCTTGGGATCTCAATTATGGAACATATCAATGCATGAGAAAAGTAAAAAAATTAATGTACTTGAATCTTTGAGAAAAGGAAAAAATCAACTTCGAATTTTGAAGAAACTTGTGTCATATGATTCTTATCAACAGTGTCAATAGTTAGAACCAAGTAATGGTTTGACCGTGGAGTATCAAATGACTCGGATATTATGTTTGGGTTGGTCATCGATCAACCCTCTCTGATTAACACCCTGTTGCTCTTTTCATGTCATGAATGTAATTGTGACCCATGTTCCTAGGCCTAATTTCTGATATTGTAATATGTGTTGCCATTGATGTAGAAATGCGCAGGCTGCATCACAACATTTTCTTCTTTTTTTCTTTTTTTTCACAATTCCTTACTAGTCTCTAGCGTGTTATTTTTGGAGATTATAGTTACAATCTTCTTTTTGAAGTTCTCTTTCCAGACTTTTTTGCTCATTTGGCACCTTTTTCTTTCACTTGGGGGAATTTTCAGGAGCTCCCTTGGCCTTTTTCAGTCTGATGCCTATCACTCTCCGTACATTTATGCCTTGCTTTGTTAATATTATGTGTAGTTTCCAACTAAGATGAAAAGAAAATGTTGAGACCATAACTGATTTGTTTGCCGGTGATTCTATTGTTTGTAGGGAGGAAAGATACGTGACATGCTTTTCAATATACTTGAGCAAGAACAGGATGTTATAATCAAACACGGAACACAATCACGAGAGAGTCGGCGTGCTGCTACACATGGGATGCATACATCTAAGTTCTGCTTGAATCCAGCTGGTGACACTCCCTCTGCGTGTCGGCTTTTTGATTCTGTCGTGAGCTTATGTGTACCAGTTATTGTCAGTGATAGTATTGAGTTGCCTTTTGAAGATGTTATAGATTACTCCAAGATTGCAGTGTTTTTTGATTCGGTTTCTGCTGTGAAACCTGAATTTCTGATTTCAAAGCTGAGGAGAATATCTGAAGAGAGAATTTTGGACTATCAGAGAGAAATGAAAAAGGTTAGAACTGTGTGAATGCTTCGTTTCTTCCTATAAACTTTTATTATGTTAATTGACCATAAAACTTCTAATGCTTAAGCAGATGGTTAGGGTTAGTTTAATCCTATATCAATTCTTTCAATAATTCTTCACTTGTGAGCTTGAAAATTCATAGAAAGTCCAACAGGTGAATCAATTTTTATTTAGGAGAAAATGACGTTAGAGTAGTAAATGTTGGACGTGTTACTTGGATATCATGTTAAATTAGGGAGAATAGTTATGTTCTATCTAAATGTGTACATCTTAGGTTTATTATTAAATAGGAGAGAGACCCATCCCATGATAAGATGGGGGAAGAAGATTAAGGTTATAAATACTAAATATTGACCGCTAATATTTACAGAATTGTAAAGACTAGACATTCCAGCATTTTTAAGTTCTACTTCTGTTCGTGTAGTTTGTGTATGTATTTTTTGTTTTGAACCAATTCCCAAATAGGTAGTGCTGACTAATAAGAGATTCCTATTCCCCTGATAAATCGAAATGGTTGCAGTACACTTTTAAGTCTAACAATTTAAGCCAACAAAAACTTGCTTGAAACATGTACAGAGTAGTTACCATGTATCAGTTACTTTGAATAAATAATTAAGTCATGCATATTAATGAAGTTTATTCTATTTGATATGTGCTACCCAGATAAAGAGGTACTTTGAGTATACAGACTCAAATGGCACAGTGAACGAAATATGGCGCCAAGTCTCACAGAAGCTACCTTTAATTAAGCTAATGATTAATCGCGAAAAGAGGGTCATTCATAGAGATGGTGATGAACCAAACTGCTCTTGTCTCTGTTCAAACCAAACTGGCATCAGGGCCAGACTATAGTTCTTTATTTATGACTTTCTGGGAGAATCAACTACTTTTTACAAGCGGCAGGCTGGCTGCAAGACTTCCGAGTTGGAGAAAACTGAGGATGAAGTTATATATATGCCATGCCCTACTAGTTCCCAACCTTGAGGTAGCCATATTACTTAGCTAAGCTGGTTTTTTAATCTAACGAGGTAGGTAGAGGTTGCCATATTTATAGGCCTTTTCACTTATTACTATCTATTCTTTACTGTTGAGATGGTTATGTAGAACTGACATAGCCATGCATAATCTGGTAATATTTGTAATTATTATTATGATCTTTTGCTTCATTTTTTGTGCGTTGTATAGAATAAGGGTCTGTTTGATCCTTCC

mRNA sequence

ATGGCGCGAAAATCTTCTCTCCTCAAGCGAACCCTAGCTTCTCTCTGCTTTATACTTGCCCTATATGCCATCATCAACACCTTCATCAGCTCCACCGCTACTCTCAAGCTCGACCGTTCCTTCCCCTTCAGTTCTGCCAATTCCGTTATCGTCTCCGATGAATTTTCTTCCCAGGACACTGATCTTCTCAATTCCTCTGGGAAATCCCTCTCCCCGGTCAAGATTTACCTCTACGATGTCCCCACTAGGTTTACGTACGGTGTTATTGAGAACCACGGCATAGCTCGCGGTGGAAAACCCGTCCCTGACGTCACCGACCTTAAGTATCCTGGTCATCAGCATATGGCCGAGTGGTTTCTGTTCACGGATCTGCTTCGACCTGAGTCGGAGCGTATTGGGTCTGCTGTGGTTAGGGTGTTCGATCCAGAGGAGGCGGATTTGTTCTATGTACCCTTCTTCTCGTCTTTGAGCTTGATCGTCAACCCTATCCGACCTGCAACCGGGTCGGATCAGCAGCAGCGGAAGCTTGTGTACAGTGATGAGGAGACGCAGGATGCCTTTATGGAGTGGTTGGAGAAGCAAGAGTACTGGAAACGGAGCAATGGGCGGGATCATGTCATTATAGCGCAGGACCCGAATGCATTGTACAGGCTGATTGATAGGGTTAAGAACAGTATCTTGCTTGTTTCGGATTTCGGGAGGTTGAGGGCTGACCAAGCGTCGCTAGTTAAGGACGTCATTGTACCATACTCACATAGGATCAATACTTATACTGGTGACATTGGTGTTGAGAATCGGAAGACTTTATTGTTCTTCATGGGCAATCGATATCGTAAAGAGGGAGGAAAGATACGTGACATGCTTTTCAATATACTTGAGCAAGAACAGGATGTTATAATCAAACACGGAACACAATCACGAGAGAGTCGGCGTGCTGCTACACATGGGATGCATACATCTAAGTTCTGCTTGAATCCAGCTGGTGACACTCCCTCTGCGTGTCGGCTTTTTGATTCTGTCGTGAGCTTATGTGTACCAGTTATTGTCAGTGATAGTATTGAGTTGCCTTTTGAAGATGTTATAGATTACTCCAAGATTGCAGTGTTTTTTGATTCGGTTTCTGCTGTGAAACCTGAATTTCTGATTTCAAAGCTGAGGAGAATATCTGAAGAGAGAATTTTGGACTATCAGAGAGAAATGAAAAAGATAAAGAGGTACTTTGAGTATACAGACTCAAATGGCACAGTGAACGAAATATGGCGCCAAGTCTCACAGAAGCTACCTTTAATTAAGCTAATGATTAATCGCGAAAAGAGGGTCATTCATAGAGATGGTGATGAACCAAACTGCTCTTGTCTCTGTTCAAACCAAACTGGCATCAGGGCCAGACTATAG

Coding sequence (CDS)

ATGGCGCGAAAATCTTCTCTCCTCAAGCGAACCCTAGCTTCTCTCTGCTTTATACTTGCCCTATATGCCATCATCAACACCTTCATCAGCTCCACCGCTACTCTCAAGCTCGACCGTTCCTTCCCCTTCAGTTCTGCCAATTCCGTTATCGTCTCCGATGAATTTTCTTCCCAGGACACTGATCTTCTCAATTCCTCTGGGAAATCCCTCTCCCCGGTCAAGATTTACCTCTACGATGTCCCCACTAGGTTTACGTACGGTGTTATTGAGAACCACGGCATAGCTCGCGGTGGAAAACCCGTCCCTGACGTCACCGACCTTAAGTATCCTGGTCATCAGCATATGGCCGAGTGGTTTCTGTTCACGGATCTGCTTCGACCTGAGTCGGAGCGTATTGGGTCTGCTGTGGTTAGGGTGTTCGATCCAGAGGAGGCGGATTTGTTCTATGTACCCTTCTTCTCGTCTTTGAGCTTGATCGTCAACCCTATCCGACCTGCAACCGGGTCGGATCAGCAGCAGCGGAAGCTTGTGTACAGTGATGAGGAGACGCAGGATGCCTTTATGGAGTGGTTGGAGAAGCAAGAGTACTGGAAACGGAGCAATGGGCGGGATCATGTCATTATAGCGCAGGACCCGAATGCATTGTACAGGCTGATTGATAGGGTTAAGAACAGTATCTTGCTTGTTTCGGATTTCGGGAGGTTGAGGGCTGACCAAGCGTCGCTAGTTAAGGACGTCATTGTACCATACTCACATAGGATCAATACTTATACTGGTGACATTGGTGTTGAGAATCGGAAGACTTTATTGTTCTTCATGGGCAATCGATATCGTAAAGAGGGAGGAAAGATACGTGACATGCTTTTCAATATACTTGAGCAAGAACAGGATGTTATAATCAAACACGGAACACAATCACGAGAGAGTCGGCGTGCTGCTACACATGGGATGCATACATCTAAGTTCTGCTTGAATCCAGCTGGTGACACTCCCTCTGCGTGTCGGCTTTTTGATTCTGTCGTGAGCTTATGTGTACCAGTTATTGTCAGTGATAGTATTGAGTTGCCTTTTGAAGATGTTATAGATTACTCCAAGATTGCAGTGTTTTTTGATTCGGTTTCTGCTGTGAAACCTGAATTTCTGATTTCAAAGCTGAGGAGAATATCTGAAGAGAGAATTTTGGACTATCAGAGAGAAATGAAAAAGATAAAGAGGTACTTTGAGTATACAGACTCAAATGGCACAGTGAACGAAATATGGCGCCAAGTCTCACAGAAGCTACCTTTAATTAAGCTAATGATTAATCGCGAAAAGAGGGTCATTCATAGAGATGGTGATGAACCAAACTGCTCTTGTCTCTGTTCAAACCAAACTGGCATCAGGGCCAGACTATAG
BLAST of CSPI01G34330 vs. Swiss-Prot
Match: ARAD1_ARATH (Probable arabinosyltransferase ARAD1 OS=Arabidopsis thaliana GN=ARAD1 PE=1 SV=1)

HSP 1 Score: 580.9 bits (1496), Expect = 1.3e-164
Identity = 289/463 (62.42%), Postives = 360/463 (77.75%), Query Frame = 1

Query: 1   MARKSSLLKRTLASLCFILALYAIINTFISSTATLKLDRSFPFSSANSVIVSDEFSSQDT 60
           MARKSSLLKR   ++  ++A+Y I+N  +S        RS P SS     +  +   +D 
Sbjct: 1   MARKSSLLKRAAIAVVSVIAIYVILNASVS--------RSLPSSSD----LPRQLIREDD 60

Query: 61  DLLNSSGKSLSP-VKIYLYDVPTRFTYGVIENHGIARGG--KPVPDVTDLKYPGHQHMAE 120
           D  +     + P V++Y+Y++P RFTYG+IE H IARGG  KPV DVT LKYPGHQHM E
Sbjct: 61  D--DEGRAPIQPRVRVYMYNLPKRFTYGLIEQHSIARGGIKKPVGDVTTLKYPGHQHMHE 120

Query: 121 WFLFTDLLRPESERIGSAVVRVFDPEEADLFYVPFFSSLSLIVNPIRPATGSDQQQRKLV 180
           W+LF+DL +PE +R GS +VRV DP +ADLFYVP FSSLSLIVN  RP            
Sbjct: 121 WYLFSDLNQPEVDRSGSPIVRVSDPADADLFYVPVFSSLSLIVNAGRPVEAGSG------ 180

Query: 181 YSDEETQDAFMEWLEKQEYWKRSNGRDHVIIAQDPNALYRLIDRVKNSILLVSDFGRLRA 240
           YSDE+ Q+  +EWLE QE+W+R+ GRDHVI A DPNALYR++DRVKN++LLVSDFGRLR 
Sbjct: 181 YSDEKMQEGLVEWLEGQEWWRRNAGRDHVIPAGDPNALYRILDRVKNAVLLVSDFGRLRP 240

Query: 241 DQASLVKDVIVPYSHRINTYTGDIGVENRKTLLFFMGNRYRKEGGKIRDMLFNILEQEQD 300
           DQ S VKDV++PYSHR+N + G+IGVE+R TLLFFMGNRYRK+GGK+RD+LF +LE+E D
Sbjct: 241 DQGSFVKDVVIPYSHRVNLFNGEIGVEDRNTLLFFMGNRYRKDGGKVRDLLFQVLEKEDD 300

Query: 301 VIIKHGTQSRESRRAATHGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPF 360
           V IKHGTQSRE+RRAAT GMHTSKFCLNPAGDTPSACRLFDS+VSLCVP+IVSDSIELPF
Sbjct: 301 VTIKHGTQSRENRRAATKGMHTSKFCLNPAGDTPSACRLFDSIVSLCVPLIVSDSIELPF 360

Query: 361 EDVIDYSKIAVFFDSVSAVKPEFLISKLRRISEERILDYQREMKKIKRYFEYTDSNGTVN 420
           EDVIDY K ++F ++ +A++P FL+  LR+I  ++IL+YQREMK ++RYF+Y + NG V 
Sbjct: 361 EDVIDYRKFSIFVEANAALQPGFLVQMLRKIKTKKILEYQREMKSVRRYFDYDNPNGAVK 420

Query: 421 EIWRQVSQKLPLIKLMINREKRVIHRDGDEPNCSCLCSNQTGI 461
           EIWRQVS KLPLIKLM NR++R++ R+  EPNCSCLC+NQTG+
Sbjct: 421 EIWRQVSHKLPLIKLMSNRDRRLVLRNLTEPNCSCLCTNQTGL 443

BLAST of CSPI01G34330 vs. Swiss-Prot
Match: ARAD2_ARATH (Probable arabinosyltransferase ARAD2 OS=Arabidopsis thaliana GN=ARAD2 PE=1 SV=1)

HSP 1 Score: 542.7 bits (1397), Expect = 3.8e-153
Identity = 256/387 (66.15%), Postives = 316/387 (81.65%), Query Frame = 1

Query: 74  KIYLYDVPTRFTYGVIENHGIARGGKPVPDVTDLKYPGHQHMAEWFLFTDLLRPESERIG 133
           K+Y+Y++PT FTYGVIE HG    G+   DVT LKYPGHQHM EW+L++DL RPE +R+G
Sbjct: 66  KVYMYELPTNFTYGVIEQHG----GEKSDDVTGLKYPGHQHMHEWYLYSDLTRPEVKRVG 125

Query: 134 SAVVRVFDPEEADLFYVPFFSSLSLIVNPIRPATGSDQQQRKLVYSDEETQDAFMEWLEK 193
           S +VRVFDP EADLFYV  FSSLSLIV+  RP  G         YSDEE Q++ + WLE 
Sbjct: 126 SPIVRVFDPAEADLFYVSAFSSLSLIVDSGRPGFG---------YSDEEMQESLVSWLES 185

Query: 194 QEYWKRSNGRDHVIIAQDPNALYRLIDRVKNSILLVSDFGRLRADQASLVKDVIVPYSHR 253
           QE+W+R+NGRDHVI+A DPNAL R++DRVKN++LLV+DF RLRADQ SLVKDVI+PYSHR
Sbjct: 186 QEWWRRNNGRDHVIVAGDPNALKRVMDRVKNAVLLVTDFDRLRADQGSLVKDVIIPYSHR 245

Query: 254 INTYTGDIGVENRKTLLFFMGNRYRKEGGKIRDMLFNILEQEQDVIIKHGTQSRESRRAA 313
           I+ Y G++GV+ R  LLFFMGNRYRK+GGK+RD+LF +LE+E+DV+IK GTQSRE+ RA 
Sbjct: 246 IDAYEGELGVKQRTNLLFFMGNRYRKDGGKVRDLLFKLLEKEEDVVIKRGTQSRENMRAV 305

Query: 314 THGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPFEDVIDYSKIAVFFDSV 373
             GMHTSKFCL+ AGDT SACRLFD++ SLCVPVIVSD IELPFEDVIDY K ++F    
Sbjct: 306 KQGMHTSKFCLHLAGDTSSACRLFDAIASLCVPVIVSDGIELPFEDVIDYRKFSIFLRRD 365

Query: 374 SAVKPEFLISKLRRISEERILDYQREMKKIKRYFEYTDSNGTVNEIWRQVSQKLPLIKLM 433
           +A+KP F++ KLR++   +IL YQ+ MK+++RYF+YT  NG+VNEIWRQV++K+PLIKLM
Sbjct: 366 AALKPGFVVKKLRKVKPGKILKYQKVMKEVRRYFDYTHLNGSVNEIWRQVTKKIPLIKLM 425

Query: 434 INREKRVIHRDGDEPNCSCLCSNQTGI 461
           INREKR+I RDG +P CSCLCSNQTGI
Sbjct: 426 INREKRMIKRDGSDPQCSCLCSNQTGI 439

BLAST of CSPI01G34330 vs. Swiss-Prot
Match: GLYT4_ARATH (Probable glycosyltransferase At5g11130 OS=Arabidopsis thaliana GN=At5g11120/At5g11130 PE=3 SV=2)

HSP 1 Score: 92.0 bits (227), Expect = 1.8e-17
Identity = 86/320 (26.88%), Postives = 152/320 (47.50%), Query Frame = 1

Query: 133 GSAVVRVFDPEEADLFYVPFFSSLSLIVNPIRPATGSDQQQRKLVYSDEETQDAFMEWLE 192
           G++  +   PEEA +FY+P    +++I    RP T          Y+ +  Q+   +++ 
Sbjct: 184 GNSRFKAASPEEATVFYIPV-GIVNIIRFVYRPYTS---------YARDRLQNIVKDYIS 243

Query: 193 ----KQEYWKRSNGRDHVII----------AQDPNALYRLIDRVKNSILLVSDFGRLRAD 252
               +  YW RS G DH  +          A DP      I  + N+    S  G     
Sbjct: 244 LISNRYPYWNRSRGADHFFLSCHDWAPDVSAVDPELYKHFIRALCNAN---SSEGFTPMR 303

Query: 253 QASLVKDVIVPYSHRINTYTGDIGVENRKTLLFFMGNRYRKEGGKIRDMLF-NILEQEQD 312
             SL  ++ +P+S     +TG+   +NRK L FF G  +    G +R +LF +  E+++D
Sbjct: 304 DVSL-PEINIPHSQLGFVHTGE-PPQNRKLLAFFAGGSH----GDVRKILFQHWKEKDKD 363

Query: 313 VIIKHGTQSRESRRAATHGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPF 372
           V++        +    T  M  +KFCL P+G   ++ R+ +S+ S CVPVI++D   LPF
Sbjct: 364 VLVYENLPKTMNY---TKMMDKAKFCLCPSGWEVASPRIVESLYSGCVPVIIADYYVLPF 423

Query: 373 EDVIDYSKIAVFFDSVSAVKPEFLISKLRRISEERILDYQREMKKIKRYF-------EYT 430
            DV+++   +V    +   K   +   L  I+EE  L+ QR + +++++F        Y 
Sbjct: 424 SDVLNWKTFSV---HIPISKMPDIKKILEAITEEEYLNMQRRVLEVRKHFVINRPSKPYD 478

BLAST of CSPI01G34330 vs. Swiss-Prot
Match: GLYT1_ARATH (Probable glycosyltransferase At3g07620 OS=Arabidopsis thaliana GN=At3g07620 PE=3 SV=1)

HSP 1 Score: 89.0 bits (219), Expect = 1.5e-16
Identity = 85/307 (27.69%), Postives = 155/307 (50.49%), Query Frame = 1

Query: 138 RVFDPEEADLFYVPFFSSLSLIVNPIRPATGSDQQQRKLVYSDEETQDAFMEWLEKQ-EY 197
           R  DP++A ++++PF  S+ +I++ +      D+   + V +D      +++ + K+  Y
Sbjct: 183 RTRDPDKAHVYFLPF--SVVMILHHLFDPVVRDKAVLERVIAD------YVQIISKKYPY 242

Query: 198 WKRSNGRDHVIIAQDP---NALYRLIDRVKNSILLVSDFGRLRADQASLVKDVIVP---- 257
           W  S+G DH +++       A + +     NSI ++ +     ++  +  KD   P    
Sbjct: 243 WNTSDGFDHFMLSCHDWGHRATWYVKKLFFNSIRVLCNANI--SEYFNPEKDAPFPEINL 302

Query: 258 YSHRINTYTGDIGVENRKTLLFFMGNRYRKEGGKIRDMLFNIL-EQEQDVIIKHGTQSRE 317
            +  IN  TG +   +R TL FF G    K  GKIR +L N   E+++D+++    ++  
Sbjct: 303 LTGDINNLTGGLDPISRTTLAFFAG----KSHGKIRPVLLNHWKEKDKDILVY---ENLP 362

Query: 318 SRRAATHGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPFEDVIDYSKIAV 377
                T  M  S+FC+ P+G   ++ R+ +++ S CVPV++S++  LPF DV+++ K +V
Sbjct: 363 DGLDYTEMMRKSRFCICPSGHEVASPRVPEAIYSGCVPVLISENYVLPFSDVLNWEKFSV 422

Query: 378 FFDSVSAVKPEFLISKLRRISEERILDYQREMKKIKRYF-------EYTDSNGTVNEIW- 428
              SVS  +   L   L  I EER +     +KK+KR+         Y   N  ++ IW 
Sbjct: 423 ---SVSVKEIPELKRILMDIPEERYMRLYEGVKKVKRHILVNDPPKRYDVFNMIIHSIWL 469

BLAST of CSPI01G34330 vs. Swiss-Prot
Match: IX10L_ARATH (Probable beta-1,4-xylosyltransferase IRX10L OS=Arabidopsis thaliana GN=IRX10L PE=2 SV=1)

HSP 1 Score: 88.6 bits (218), Expect = 2.0e-16
Identity = 113/455 (24.84%), Postives = 182/455 (40.00%), Query Frame = 1

Query: 21  LYAIINTFISSTATLKLDRSFPFSSANSVIVSDEFSSQDTDLLNSSGKSLSPVKIYLYDV 80
           ++ + NTF SS +  +L RS P         ++  S    D+L      +  +K+++Y++
Sbjct: 9   IFLLCNTF-SSISAFRLSRSQP---------TERISGSAGDVLEDD--PVGRLKVFVYEL 68

Query: 81  PTRFTYGVIENHGIARGGKPVPDVTDLKYPGHQHMAEWFLFTDLLRPESERIGSAVVRVF 140
           P+++   +++               D +   H   AE ++   LL        S+ VR  
Sbjct: 69  PSKYNKKILQK--------------DPRCLNHMFAAEIYMQRFLL--------SSPVRTL 128

Query: 141 DPEEADLFYVPFFSSLSLIVNPIRPATGSDQQQRKLVYSDEETQDAFMEWLEKQEYWKRS 200
           +PEEAD FYVP +++  L  N +     S +  R  +      Q     W     YW R+
Sbjct: 129 NPEEADWFYVPVYTTCDLTPNGLPLPFKSPRMMRSAI------QLIASNW----PYWNRT 188

Query: 201 NGRDHVIIA----------QDPNALYRLIDRVKNSILLVSDFGRLRADQASLVKDVIVPY 260
            G DH  +           Q+  A+ R I  +     LV  FG+            + PY
Sbjct: 189 EGADHFFVVPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQRNHVCLKEGSITVPPY 248

Query: 261 -------SHRINTYTGDIGVENRKTLLFFMGNRYRKEGGKI-RDMLFNILEQEQD----- 320
                  SH I   T        + L + +GN    EGG   R     + E  +D     
Sbjct: 249 APPQKMQSHLIPEKTPRSIFVYFRGLFYDVGND--PEGGYYARGARAAVWENFKDNPLFD 308

Query: 321 VIIKHGTQSRESRRAATHGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPF 380
           +  +H T   E  + A        FCL P G  P + RL ++V+  C+PVI++D I LPF
Sbjct: 309 ISTEHPTTYYEDMQRAI-------FCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPF 368

Query: 381 EDVIDYSKIAVFFDSVSAVKPEFLISKLRRISEERILDYQR-----EMKKIKRYFEYTDS 440
            D I +  I VF D        +L + L  I  E IL  QR      MK+   + +    
Sbjct: 369 ADAIPWEDIGVFVDEKDV---PYLDTILTSIPPEVILRKQRLLANPSMKQAMLFPQPAQP 400

Query: 441 NGTVNEIWRQVSQKLPLIKLMINREKRVIHRDGDE 448
               +++   +++KLP        E+ V  R G++
Sbjct: 429 GDAFHQVLNGLARKLP-------HERSVYLRPGEK 400

BLAST of CSPI01G34330 vs. TrEMBL
Match: A0A0A0M0S0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G701350 PE=4 SV=1)

HSP 1 Score: 918.3 bits (2372), Expect = 3.7e-264
Identity = 463/464 (99.78%), Postives = 463/464 (99.78%), Query Frame = 1

Query: 1   MARKSSLLKRTLASLCFILALYAIINTFISSTATLKLDRSFPFSSANSVIVSDEFSSQDT 60
           MARKSSLLKRTLASLCFILALYAIINTFISSTATLKLDRSFPFSSANSVIVSDEFSSQDT
Sbjct: 1   MARKSSLLKRTLASLCFILALYAIINTFISSTATLKLDRSFPFSSANSVIVSDEFSSQDT 60

Query: 61  DLLNSSGKSLSPVKIYLYDVPTRFTYGVIENHGIARGGKPVPDVTDLKYPGHQHMAEWFL 120
           DLLNSSGKSLSPVKIYLYDVPTRFTYGVIENHGIARGGKPVPDVTDLKYPGHQHMAEWFL
Sbjct: 61  DLLNSSGKSLSPVKIYLYDVPTRFTYGVIENHGIARGGKPVPDVTDLKYPGHQHMAEWFL 120

Query: 121 FTDLLRPESERIGSAVVRVFDPEEADLFYVPFFSSLSLIVNPIRPATGSDQQQRKLVYSD 180
           FTDLLRPESERIGSAVVRVFDPE ADLFYVPFFSSLSLIVNPIRPATGSDQQQRKLVYSD
Sbjct: 121 FTDLLRPESERIGSAVVRVFDPEVADLFYVPFFSSLSLIVNPIRPATGSDQQQRKLVYSD 180

Query: 181 EETQDAFMEWLEKQEYWKRSNGRDHVIIAQDPNALYRLIDRVKNSILLVSDFGRLRADQA 240
           EETQDAFMEWLEKQEYWKRSNGRDHVIIAQDPNALYRLIDRVKNSILLVSDFGRLRADQA
Sbjct: 181 EETQDAFMEWLEKQEYWKRSNGRDHVIIAQDPNALYRLIDRVKNSILLVSDFGRLRADQA 240

Query: 241 SLVKDVIVPYSHRINTYTGDIGVENRKTLLFFMGNRYRKEGGKIRDMLFNILEQEQDVII 300
           SLVKDVIVPYSHRINTYTGDIGVENRKTLLFFMGNRYRKEGGKIRDMLFNILEQEQDVII
Sbjct: 241 SLVKDVIVPYSHRINTYTGDIGVENRKTLLFFMGNRYRKEGGKIRDMLFNILEQEQDVII 300

Query: 301 KHGTQSRESRRAATHGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPFEDV 360
           KHGTQSRESRRAATHGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPFEDV
Sbjct: 301 KHGTQSRESRRAATHGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPFEDV 360

Query: 361 IDYSKIAVFFDSVSAVKPEFLISKLRRISEERILDYQREMKKIKRYFEYTDSNGTVNEIW 420
           IDYSKIAVFFDSVSAVKPEFLISKLRRISEERILDYQREMKKIKRYFEYTDSNGTVNEIW
Sbjct: 361 IDYSKIAVFFDSVSAVKPEFLISKLRRISEERILDYQREMKKIKRYFEYTDSNGTVNEIW 420

Query: 421 RQVSQKLPLIKLMINREKRVIHRDGDEPNCSCLCSNQTGIRARL 465
           RQVSQKLPLIKLMINREKRVIHRDGDEPNCSCLCSNQTGIRARL
Sbjct: 421 RQVSQKLPLIKLMINREKRVIHRDGDEPNCSCLCSNQTGIRARL 464

BLAST of CSPI01G34330 vs. TrEMBL
Match: M5X179_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005538mg PE=4 SV=1)

HSP 1 Score: 667.2 bits (1720), Expect = 1.5e-188
Identity = 328/460 (71.30%), Postives = 391/460 (85.00%), Query Frame = 1

Query: 1   MARKSSLLKRTLASLCFILALYAIINTFISSTATLKLDRSFPFSSANSVIVSDEFSSQDT 60
           MARKSSLLK++LA++  +L +YA +NTF++ T T KL+ + P  S+ S I SD F+S++ 
Sbjct: 1   MARKSSLLKQSLATIVVVLLIYAFLNTFLTPTTTAKLETALPSFSSASSISSDVFASREN 60

Query: 61  DLLNSSGKSLSPVKIYLYDVPTRFTYGVIENHGIARGGKPVPDVTDLKYPGHQHMAEWFL 120
            L N  GK   PVK+YLYD+P RFTYGVIE+H +ARGG+P  DV+ LKYPGHQHM EW+L
Sbjct: 61  QL-NFPGK---PVKVYLYDLPKRFTYGVIEHHSLARGGRPDEDVSKLKYPGHQHMGEWYL 120

Query: 121 FTDLLRPESERIGSAVVRVFDPEEADLFYVPFFSSLSLIVNPIRPATGSDQQQRKLVYSD 180
           F DLL+PE+ER GS V +V DPEEAD FYVPFFSSLSLIVNP RPA+GSD    K +YSD
Sbjct: 121 FKDLLKPEAERFGSPVQKVLDPEEADFFYVPFFSSLSLIVNPARPASGSD----KPLYSD 180

Query: 181 EETQDAFMEWLEKQEYWKRSNGRDHVIIAQDPNALYRLIDRVKNSILLVSDFGRLRADQA 240
           EE Q A +EWLE+Q YWKR+NGRDHVI+A DPNALY++ID+VKNS+LLV DFGRL+ DQ 
Sbjct: 181 EENQVALIEWLEEQVYWKRNNGRDHVIMASDPNALYKVIDKVKNSVLLVCDFGRLKEDQG 240

Query: 241 SLVKDVIVPYSHRINTYTGDIGVENRKTLLFFMGNRYRKEGGKIRDMLFNILEQEQDVII 300
           SLVKDVIVPYSHRINTY+GDI VE+R TLLFFMGNR+RKEGGKIRD+LF +LE E+DVII
Sbjct: 241 SLVKDVIVPYSHRINTYSGDISVEDRNTLLFFMGNRFRKEGGKIRDLLFQLLENEEDVII 300

Query: 301 KHGTQSRESRRAATHGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPFEDV 360
           KHGTQSRESRRAA+HGMHTSKFCLNPAGDTPSACRLFDS+VSLCVPVIVSDSIELPFEDV
Sbjct: 301 KHGTQSRESRRAASHGMHTSKFCLNPAGDTPSACRLFDSIVSLCVPVIVSDSIELPFEDV 360

Query: 361 IDYSKIAVFFDSVSAVKPEFLISKLRRISEERILDYQREMKKIKRYFEYTDSNGTVNEIW 420
           IDY KIA+F +S +A+KPEFL+S LR I+ ERIL+YQ+E+ ++KRYF+Y   NGTVNEIW
Sbjct: 361 IDYRKIAIFVESNAALKPEFLVSMLRGITTERILEYQKELNEVKRYFQYGVPNGTVNEIW 420

Query: 421 RQVSQKLPLIKLMINREKRVIHRDGDEPNCSCLCSNQTGI 461
           RQV+QKLP IKL INR++R++ RD +  +CSCLCSNQTGI
Sbjct: 421 RQVAQKLPFIKLSINRDRRLVKRDLNVRDCSCLCSNQTGI 452

BLAST of CSPI01G34330 vs. TrEMBL
Match: W9S3J0_9ROSA (Putative glucuronosyltransferase GUT1 OS=Morus notabilis GN=L484_018501 PE=4 SV=1)

HSP 1 Score: 629.4 bits (1622), Expect = 3.5e-177
Identity = 314/466 (67.38%), Postives = 378/466 (81.12%), Query Frame = 1

Query: 1   MARKSSLLKRTLASLCFILALYAIINTFISSTATLKLDRSFPFSSANSVIVSDEFSSQDT 60
           MARK+SL K++LA++CF+LA+YA+IN F              F SA+   ++ +FS + T
Sbjct: 1   MARKASLAKKSLATVCFVLAIYAVINIF-------------HFPSASDPNLALDFSDR-T 60

Query: 61  DLLNSSGKSL------SPVKIYLYDVPTRFTYGVIENHGIARGGKPVPDVTDLKYPGHQH 120
            + +S  K+L      S VKIYLYD+P RFTYGVI +H +ARGG+P  D T L YPGHQH
Sbjct: 61  SVTSSFRKNLQFHNPESSVKIYLYDLPHRFTYGVIRHHSLARGGRPPEDATALSYPGHQH 120

Query: 121 MAEWFLFTDLLRPESERIGSAVVRVFDPEEADLFYVPFFSSLSLIVNPIRPATGSDQQQR 180
           MAEW LF DL RPES+R+GSA+VRV DP+EADLFYVPFFSSLSLIVNP+R   G++    
Sbjct: 121 MAEWHLFKDLTRPESDRLGSAIVRVSDPDEADLFYVPFFSSLSLIVNPVRSEPGAEP--- 180

Query: 181 KLVYSDEETQDAFMEWLEKQEYWKRSNGRDHVIIAQDPNALYRLIDRVKNSILLVSDFGR 240
             VYSDEETQ A  EWLE+QEYWKR+ GRDHVI+A DPNALY++IDRVKNS+LLVSDFGR
Sbjct: 181 --VYSDEETQVALAEWLEEQEYWKRNTGRDHVIVASDPNALYKVIDRVKNSVLLVSDFGR 240

Query: 241 LRADQASLVKDVIVPYSHRINTYTGDIGVENRKTLLFFMGNRYRKEGGKIRDMLFNILEQ 300
           L+ DQ SLVKDVI+PYSHR+N Y GD+GV NR TLLFFMG RYRKEGGKIR++LF +LE 
Sbjct: 241 LKNDQGSLVKDVILPYSHRVNAYNGDVGVGNRGTLLFFMGARYRKEGGKIRNLLFQLLEN 300

Query: 301 EQDVIIKHGTQSRESRRAATHGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIE 360
           E+DV+IKHG QSRESRRAA+HGMHTSKFCLNPAGDTPSACRLFDS+VSLCVPVI+SD IE
Sbjct: 301 EEDVVIKHGAQSRESRRAASHGMHTSKFCLNPAGDTPSACRLFDSIVSLCVPVIISDDIE 360

Query: 361 LPFEDVIDYSKIAVFFDSVSAVKPEFLISKLRRISEERILDYQREMKKIKRYFEYTDSNG 420
           LPFEDVIDY KIA+F ++  A+KP +L+S LR IS+ERIL+YQ+E+K++K YFEY  S  
Sbjct: 361 LPFEDVIDYRKIAIFIETTVALKPGYLVSMLRAISDERILEYQKELKEVKHYFEYGSS-- 420

Query: 421 TVNEIWRQVSQKLPLIKLMINREKRVIHRDGDEPNCSCLCSNQTGI 461
           TVNEIWRQV+QKLPLIKLMINR+KR++  +   P+CSC+CSNQTGI
Sbjct: 421 TVNEIWRQVAQKLPLIKLMINRDKRLVKTNFTGPDCSCICSNQTGI 445

BLAST of CSPI01G34330 vs. TrEMBL
Match: A0A061E3Q4_THECC (Exostosin family protein isoform 1 OS=Theobroma cacao GN=TCM_006051 PE=4 SV=1)

HSP 1 Score: 622.9 bits (1605), Expect = 3.2e-175
Identity = 313/466 (67.17%), Postives = 376/466 (80.69%), Query Frame = 1

Query: 1   MARKSSLLKRTL-ASLCFILALYAIINTFISSTATLKLDRSFPFSSANSVIVSDEFSSQD 60
           MARKSSL K+TL A+  FILA+YA+  TF  +   +  D   P   A  V  S EF  + 
Sbjct: 37  MARKSSLFKQTLIATAFFILAIYALFTTFFHTPLPVS-DTVSPSDDAADVS-SVEFPERR 96

Query: 61  TDLLNSSGKSLSPVKIYLYDVPTRFTYGVIENHGIARGGKPVPDVTDLKYPGHQHMAEWF 120
            D   S GK    VK+++YD+P +FTYG+I+ HG+ARGG PV DVT LKYPGHQHM EWF
Sbjct: 97  ADGSGSVGK----VKVFMYDLPHKFTYGLIQQHGLARGGSPVDDVTTLKYPGHQHMHEWF 156

Query: 121 LFTDLLRPESERIGSAVVRVFDPEEADLFYVPFFSSLSLIVNPIRP-ATGSDQQQRKLVY 180
           LF DL  PES+R+GS +V+V DPEEADLFYVP FSSLSLIVN  RP  TGS        Y
Sbjct: 157 LFADLAPPESDRLGSPIVKVADPEEADLFYVPVFSSLSLIVNAGRPPGTGSG-------Y 216

Query: 181 SDEETQDAFMEWLEKQEYWKRSNGRDHVIIAQDPNALYRLIDRVKNSILLVSDFGRLRAD 240
           SDE+ Q+  +EWL  QEYWKR+NG DHVIIA DPNALYR++DRVKN++LLV+DFGRLR D
Sbjct: 217 SDEQMQEELVEWLNGQEYWKRNNGWDHVIIAGDPNALYRVVDRVKNAVLLVADFGRLRPD 276

Query: 241 QASLVKDVIVPYSHRINTYTGDIGVENRKTLLFFMGNRYRKEGGKIRDMLFNILEQEQDV 300
           Q SLVKDVI+PYSHRI+ YTGD GVE RKTLLFFMGNRYRKEGGKIRD+LF ILE E+DV
Sbjct: 277 QGSLVKDVIIPYSHRISAYTGDFGVEERKTLLFFMGNRYRKEGGKIRDLLFQILESEEDV 336

Query: 301 IIKHGTQSRESRRAATHGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPFE 360
           IIKHGTQSRE+RRAA+HGMHTSKFCLNPAGDTPSACRLFD++VSLCVPVIVSD+IELPFE
Sbjct: 337 IIKHGTQSRENRRAASHGMHTSKFCLNPAGDTPSACRLFDAIVSLCVPVIVSDNIELPFE 396

Query: 361 DVIDYSKIAVFFDSVSAVKPEFLISKLRRISEERILDYQREMKKIKRYFEYTDSNGTVNE 420
           D+IDY K +VF ++ +A+KP +L+S LR++  E+I++YQ+ MK++K+Y++YT  NGTVNE
Sbjct: 397 DIIDYKKFSVFVETTAALKPGYLVSLLRQVPAEKIIEYQKAMKEVKQYYDYTVPNGTVNE 456

Query: 421 IWRQVSQKLPLIKLMINREKRVIHRDGDEPNCSCLCSNQTGIRARL 465
           IWRQV+QKLPLIKLMINR+KR++  + +EPNCSCLCSNQTGI + L
Sbjct: 457 IWRQVAQKLPLIKLMINRDKRLVKMELNEPNCSCLCSNQTGIISSL 489

BLAST of CSPI01G34330 vs. TrEMBL
Match: B9SCC6_RICCO (Catalytic, putative OS=Ricinus communis GN=RCOM_0891750 PE=4 SV=1)

HSP 1 Score: 619.8 bits (1597), Expect = 2.7e-174
Identity = 305/461 (66.16%), Postives = 378/461 (82.00%), Query Frame = 1

Query: 1   MARKSSLLKRTLA-SLCFILALYAIINTFISSTATLKLDRSFPFSSANSVIVSDEFSSQD 60
           MARKSSLLK+TL  S+C ILALYA+ NTF + T +  L   +P    NS+     F  + 
Sbjct: 1   MARKSSLLKQTLIFSICLILALYAVFNTFFNPTTSSLL---YPSPEDNSL---SGFPGKV 60

Query: 61  TDLLNSSGKSLSPVKIYLYDVPTRFTYGVIENHGIARGGKPVPDVTDLKYPGHQHMAEWF 120
           T+  N+   +++ VKI++YD+P +FT G+I+ H +ARG K   D +++KYPGHQHM EW+
Sbjct: 61  TE--NNDNNNINKVKIFMYDLPKKFTTGIIQQHALARGSK---DTSNVKYPGHQHMGEWY 120

Query: 121 LFTDLLRPESERIGSAVVRVFDPEEADLFYVPFFSSLSLIVNPIRPATGSDQQQRKLVYS 180
           LF+DL RPE  RIGS VV+V DP+EADLFYVP FSSLSLIVNP+RPA       +   YS
Sbjct: 121 LFSDLNRPEHGRIGSPVVKVDDPDEADLFYVPVFSSLSLIVNPVRPAGTEPGLVQH--YS 180

Query: 181 DEETQDAFMEWLEKQEYWKRSNGRDHVIIAQDPNALYRLIDRVKNSILLVSDFGRLRADQ 240
           DEE Q+  +EWLE+QEYWKR+NGRDHVIIA DPNALYR++DRVKN+ILL+SDFGR+R DQ
Sbjct: 181 DEEMQEQLVEWLEQQEYWKRNNGRDHVIIAGDPNALYRVLDRVKNAILLLSDFGRVRPDQ 240

Query: 241 ASLVKDVIVPYSHRINTYTGDIGVENRKTLLFFMGNRYRKEGGKIRDMLFNILEQEQDVI 300
            SLVKD+IVPYSHRIN Y GDIGV +R TLLFFMGNRYRK+GGKIRD+LF +LE E+DV+
Sbjct: 241 GSLVKDIIVPYSHRINVYNGDIGVRDRNTLLFFMGNRYRKDGGKIRDLLFQMLESEEDVV 300

Query: 301 IKHGTQSRESRRAATHGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPFED 360
           IKHGTQSRE+RRAA+ GMHTSKFCLNPAGDTPSACRLFDS+VSLCVPVIVSDSIELPFED
Sbjct: 301 IKHGTQSRENRRAASRGMHTSKFCLNPAGDTPSACRLFDSIVSLCVPVIVSDSIELPFED 360

Query: 361 VIDYSKIAVFFDSVSAVKPEFLISKLRRISEERILDYQREMKKIKRYFEYTDSNGTVNEI 420
           VIDY+KIA+F ++  ++KP +L+  LR ++ ERIL+YQ+E+KK+ RYFEY +SNGTVNEI
Sbjct: 361 VIDYTKIAIFVETTDSLKPGYLVKLLREVTSERILEYQKELKKVTRYFEYDNSNGTVNEI 420

Query: 421 WRQVSQKLPLIKLMINREKRVIHRDGDEPNCSCLCSNQTGI 461
           WRQV+QKLPLI+LM NR++R++ RD  +P+CSCLC+NQTG+
Sbjct: 421 WRQVAQKLPLIRLMTNRDRRLVKRDWSQPDCSCLCTNQTGL 448

BLAST of CSPI01G34330 vs. TAIR10
Match: AT2G35100.1 (AT2G35100.1 Exostosin family protein)

HSP 1 Score: 580.9 bits (1496), Expect = 7.2e-166
Identity = 289/463 (62.42%), Postives = 360/463 (77.75%), Query Frame = 1

Query: 1   MARKSSLLKRTLASLCFILALYAIINTFISSTATLKLDRSFPFSSANSVIVSDEFSSQDT 60
           MARKSSLLKR   ++  ++A+Y I+N  +S        RS P SS     +  +   +D 
Sbjct: 1   MARKSSLLKRAAIAVVSVIAIYVILNASVS--------RSLPSSSD----LPRQLIREDD 60

Query: 61  DLLNSSGKSLSP-VKIYLYDVPTRFTYGVIENHGIARGG--KPVPDVTDLKYPGHQHMAE 120
           D  +     + P V++Y+Y++P RFTYG+IE H IARGG  KPV DVT LKYPGHQHM E
Sbjct: 61  D--DEGRAPIQPRVRVYMYNLPKRFTYGLIEQHSIARGGIKKPVGDVTTLKYPGHQHMHE 120

Query: 121 WFLFTDLLRPESERIGSAVVRVFDPEEADLFYVPFFSSLSLIVNPIRPATGSDQQQRKLV 180
           W+LF+DL +PE +R GS +VRV DP +ADLFYVP FSSLSLIVN  RP            
Sbjct: 121 WYLFSDLNQPEVDRSGSPIVRVSDPADADLFYVPVFSSLSLIVNAGRPVEAGSG------ 180

Query: 181 YSDEETQDAFMEWLEKQEYWKRSNGRDHVIIAQDPNALYRLIDRVKNSILLVSDFGRLRA 240
           YSDE+ Q+  +EWLE QE+W+R+ GRDHVI A DPNALYR++DRVKN++LLVSDFGRLR 
Sbjct: 181 YSDEKMQEGLVEWLEGQEWWRRNAGRDHVIPAGDPNALYRILDRVKNAVLLVSDFGRLRP 240

Query: 241 DQASLVKDVIVPYSHRINTYTGDIGVENRKTLLFFMGNRYRKEGGKIRDMLFNILEQEQD 300
           DQ S VKDV++PYSHR+N + G+IGVE+R TLLFFMGNRYRK+GGK+RD+LF +LE+E D
Sbjct: 241 DQGSFVKDVVIPYSHRVNLFNGEIGVEDRNTLLFFMGNRYRKDGGKVRDLLFQVLEKEDD 300

Query: 301 VIIKHGTQSRESRRAATHGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPF 360
           V IKHGTQSRE+RRAAT GMHTSKFCLNPAGDTPSACRLFDS+VSLCVP+IVSDSIELPF
Sbjct: 301 VTIKHGTQSRENRRAATKGMHTSKFCLNPAGDTPSACRLFDSIVSLCVPLIVSDSIELPF 360

Query: 361 EDVIDYSKIAVFFDSVSAVKPEFLISKLRRISEERILDYQREMKKIKRYFEYTDSNGTVN 420
           EDVIDY K ++F ++ +A++P FL+  LR+I  ++IL+YQREMK ++RYF+Y + NG V 
Sbjct: 361 EDVIDYRKFSIFVEANAALQPGFLVQMLRKIKTKKILEYQREMKSVRRYFDYDNPNGAVK 420

Query: 421 EIWRQVSQKLPLIKLMINREKRVIHRDGDEPNCSCLCSNQTGI 461
           EIWRQVS KLPLIKLM NR++R++ R+  EPNCSCLC+NQTG+
Sbjct: 421 EIWRQVSHKLPLIKLMSNRDRRLVLRNLTEPNCSCLCTNQTGL 443

BLAST of CSPI01G34330 vs. TAIR10
Match: AT5G44930.1 (AT5G44930.1 Exostosin family protein)

HSP 1 Score: 542.7 bits (1397), Expect = 2.2e-154
Identity = 256/387 (66.15%), Postives = 316/387 (81.65%), Query Frame = 1

Query: 74  KIYLYDVPTRFTYGVIENHGIARGGKPVPDVTDLKYPGHQHMAEWFLFTDLLRPESERIG 133
           K+Y+Y++PT FTYGVIE HG    G+   DVT LKYPGHQHM EW+L++DL RPE +R+G
Sbjct: 66  KVYMYELPTNFTYGVIEQHG----GEKSDDVTGLKYPGHQHMHEWYLYSDLTRPEVKRVG 125

Query: 134 SAVVRVFDPEEADLFYVPFFSSLSLIVNPIRPATGSDQQQRKLVYSDEETQDAFMEWLEK 193
           S +VRVFDP EADLFYV  FSSLSLIV+  RP  G         YSDEE Q++ + WLE 
Sbjct: 126 SPIVRVFDPAEADLFYVSAFSSLSLIVDSGRPGFG---------YSDEEMQESLVSWLES 185

Query: 194 QEYWKRSNGRDHVIIAQDPNALYRLIDRVKNSILLVSDFGRLRADQASLVKDVIVPYSHR 253
           QE+W+R+NGRDHVI+A DPNAL R++DRVKN++LLV+DF RLRADQ SLVKDVI+PYSHR
Sbjct: 186 QEWWRRNNGRDHVIVAGDPNALKRVMDRVKNAVLLVTDFDRLRADQGSLVKDVIIPYSHR 245

Query: 254 INTYTGDIGVENRKTLLFFMGNRYRKEGGKIRDMLFNILEQEQDVIIKHGTQSRESRRAA 313
           I+ Y G++GV+ R  LLFFMGNRYRK+GGK+RD+LF +LE+E+DV+IK GTQSRE+ RA 
Sbjct: 246 IDAYEGELGVKQRTNLLFFMGNRYRKDGGKVRDLLFKLLEKEEDVVIKRGTQSRENMRAV 305

Query: 314 THGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPFEDVIDYSKIAVFFDSV 373
             GMHTSKFCL+ AGDT SACRLFD++ SLCVPVIVSD IELPFEDVIDY K ++F    
Sbjct: 306 KQGMHTSKFCLHLAGDTSSACRLFDAIASLCVPVIVSDGIELPFEDVIDYRKFSIFLRRD 365

Query: 374 SAVKPEFLISKLRRISEERILDYQREMKKIKRYFEYTDSNGTVNEIWRQVSQKLPLIKLM 433
           +A+KP F++ KLR++   +IL YQ+ MK+++RYF+YT  NG+VNEIWRQV++K+PLIKLM
Sbjct: 366 AALKPGFVVKKLRKVKPGKILKYQKVMKEVRRYFDYTHLNGSVNEIWRQVTKKIPLIKLM 425

Query: 434 INREKRVIHRDGDEPNCSCLCSNQTGI 461
           INREKR+I RDG +P CSCLCSNQTGI
Sbjct: 426 INREKRMIKRDGSDPQCSCLCSNQTGI 439

BLAST of CSPI01G34330 vs. TAIR10
Match: AT1G67410.1 (AT1G67410.1 Exostosin family protein)

HSP 1 Score: 282.3 bits (721), Expect = 5.3e-76
Identity = 157/389 (40.36%), Postives = 232/389 (59.64%), Query Frame = 1

Query: 64  NSSGKSLSPVKIYLYDVPTRFTYGVIENHGI---ARGGKPVPDVTDLKYPGHQHMAEWFL 123
           +SSGK   P+++++YD+P +F   +++ H        GK +P          QH  E++L
Sbjct: 47  SSSGK---PLRVFMYDLPRKFNIAMMDPHSSDVEPITGKNLPSWPQTSGIKRQHSVEYWL 106

Query: 124 FTDLLRPESERIGSAVVRVFDPEEADLFYVPFFSSLSLIVNPIRPATGSDQQQRKLVYSD 183
              LL    +   +  +RVFDP+ AD+FYVPFFSSLS   +  +  T  D +  +L+   
Sbjct: 107 MASLLNGGEDE--NEAIRVFDPDLADVFYVPFFSSLSFNTHG-KNMTDPDTEFDRLL--- 166

Query: 184 EETQDAFMEWLEKQEYWKRSNGRDHVIIAQDPNALYRLIDRVKNSILLVSDFGRLRADQA 243
              Q   ME+LE  +YW RS G+DHVI    PNA   L  +V  SIL+V DFGR   D A
Sbjct: 167 ---QVELMEFLENSKYWNRSGGKDHVIPMTHPNAFRFLRQQVNASILIVVDFGRYSKDMA 226

Query: 244 SLVKDVIVPYSHRINTYT--GDIGV----ENRKTLLFFMGNRYRKEGGKIRDMLFNILEQ 303
            L KDV+ PY H + +    GD G+    E R TLL+F GN  RK+ GKIR  L  +L  
Sbjct: 227 RLSKDVVSPYVHVVESLNEEGDDGMGDPFEARTTLLYFRGNTVRKDEGKIRLRLEKLLAG 286

Query: 304 EQDVIIKHGTQSRESRRAATHGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIE 363
             DV  +    + ++ + +T GM +SKFCL+PAGDTPS+CRLFD++VS C+PVI+SD IE
Sbjct: 287 NSDVHFEKSVATTQNIKVSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIISDKIE 346

Query: 364 LPFEDVIDYSKIAVFFDSVSAVKPEFLISKLRRISEERILDYQREMKKIKRYFEY---TD 423
           LPFED IDYS+ ++FF    +++P ++++ LR+  +E+ L+  + +K +  +FE+     
Sbjct: 347 LPFEDEIDYSEFSLFFSIKESLEPGYILNNLRQFPKEKWLEMWKRLKNVSHHFEFQYPPK 406

Query: 424 SNGTVNEIWRQVSQKLPLIKLMINREKRV 441
               VN +WRQV  K+P +KL ++R +R+
Sbjct: 407 REDAVNMLWRQVKHKIPYVKLAVHRNRRL 423

BLAST of CSPI01G34330 vs. TAIR10
Match: AT3G45400.1 (AT3G45400.1 exostosin family protein)

HSP 1 Score: 266.5 bits (680), Expect = 3.0e-71
Identity = 171/460 (37.17%), Postives = 255/460 (55.43%), Query Frame = 1

Query: 1   MARKSSLLKRTLASLCFILALYAII-----NTFISSTATLKLDRSFPFSSANSVIVSDEF 60
           +AR   L    + ++ F L+ Y ++     N F+SST   K         AN   V DE 
Sbjct: 14  VARNLLLSLFVVTTILFALSCYFVLRSTAHNRFLSSTFPSKSFVDVRPEKANCRCVKDEK 73

Query: 61  SSQDTDLLNSSGKSLSPVKIYLYDVPTRFTYGVIENHGIARGGKPV-PDVTDL--KYPGH 120
           SS              P+K+Y+Y++   F +G+++          V PD+      YPG 
Sbjct: 74  SSVIA----------GPLKVYMYNMDPEFHFGLLDWKKKEGSDSSVWPDIQKYIPPYPGG 133

Query: 121 ---QHMAEWFLFTDLLRPESERIGSAVV--RVFDPEEADLFYVPFFSSLSLI----VNPI 180
              QH  E++L  DLL  E E    +V   RV++  EAD+ +VPFFSSLS      VNP 
Sbjct: 134 LNLQHSIEYWLTLDLLASEYENAPRSVAAKRVYNSSEADVIFVPFFSSLSYNRFSKVNPH 193

Query: 181 RPATGSDQQQRKLVYSDEETQDAFMEWLEKQEYWKRSNGRDHVIIAQDPNALYRLIDRVK 240
           +  + +   Q KLV            +L  QE WKRS GRDHV++A  PN++    +++ 
Sbjct: 194 QKTSRNKDLQGKLV-----------TFLTAQEEWKRSGGRDHVVLAHHPNSMLDARNKLF 253

Query: 241 NSILLVSDFGRLRADQASLVKDVIVPYSHRINTYTGDI-GVENRKTLLFFMGNRYRKEGG 300
            ++ ++SDFGR     A++ KDVI PY H I  Y  D  G ++R  LL+F G  YRK+GG
Sbjct: 254 PAMFILSDFGRYPPTVANVEKDVIAPYKHVIKAYENDTSGFDSRPILLYFQGAIYRKDGG 313

Query: 301 KIRDMLFNILEQEQDVIIKHGTQSRESRRAATHGMHTSKFCLNPAGDTPSACRLFDSVVS 360
            +R  LF +L+ E+DV    G+        A+ GMH SKFCLN AGDTPS+ RLFD++ S
Sbjct: 314 FVRQELFYLLQDEKDVHFSFGSVRNGGINKASQGMHNSKFCLNIAGDTPSSNRLFDAIAS 373

Query: 361 LCVPVIVSDSIELPFEDVIDYSKIAVFFDSVSAVKPEFLISKLRRISEERILDYQREMKK 420
            CVPVI+SD IELPFEDVIDYS+ +VF  +  A+K  FL++ +R I++E        +K+
Sbjct: 374 HCVPVIISDDIELPFEDVIDYSEFSVFVRTSDALKENFLVNLIRGITKEEWTRMWNRLKE 433

Query: 421 IKRYFEY---TDSNGTVNEIWRQVSQKLPLIKLMINREKR 440
           +++Y+E+   +  +  V  IW+ +++K+P +K+ I++ +R
Sbjct: 434 VEKYYEFHFPSKVDDAVQMIWQAIARKVPGVKMRIHKSRR 452

BLAST of CSPI01G34330 vs. TAIR10
Match: AT1G74680.1 (AT1G74680.1 Exostosin family protein)

HSP 1 Score: 260.0 bits (663), Expect = 2.8e-69
Identity = 167/461 (36.23%), Postives = 257/461 (55.75%), Query Frame = 1

Query: 1   MARKSSLLKRTLASLCFILALYAIINTFISSTATLKLDRSFPFSSANSVIV------SDE 60
           M+ KS L  + L     +  L  I+++ +      + D SF  S    +I+      ++E
Sbjct: 4   MSEKSLLSSKFLFYTITVSTLLFIVSSLVFLQ---RHDSSFTSSLVRKLILPRTDIKNEE 63

Query: 61  FSSQDTDLLNSSGKSLSPVKIYLYDVPTRFTYGVIENHGIARGGKPVPDVTDLK----YP 120
           F   DT       +    +K+++YD+P+ F +G++  H   +G +  P+V ++     YP
Sbjct: 64  FGLIDT----KCDRDRDVLKVFMYDLPSEFHFGILNWH--KKGSEIWPNVNNISTIPSYP 123

Query: 121 G---HQHMAEWFLFTDLLRPESERI----GSAVVRVFDPEEADLFYVPFFSSLSLIVNPI 180
           G    QH  E++L  DLL  E+  I     SA +RV +  EAD+ +VPFF+SLS   N  
Sbjct: 124 GGLNRQHSVEYWLTLDLLASETPEIKRPCSSAAIRVKNSNEADIVFVPFFASLSY--NRK 183

Query: 181 RPATGSDQQQRKLVYSDEETQDAFMEWLEKQEYWKRSNGRDHVIIAQDPNALYRLIDRVK 240
               G++         D   Q+  +E+L+ Q+ WKR +G+DH+I+A  PN+L    + + 
Sbjct: 184 SKLRGNETSS-----DDRLLQERLVEFLKSQDEWKRFDGKDHLIVAHHPNSLLYARNFLG 243

Query: 241 NSILLVSDFGRLRADQASLVKDVIVPYSHRINTYTGD--IGVENRKTLLFFMGNRYRKEG 300
           +++ ++SDFGR  +  A+L KD+I PY H + T + +     E R  L +F G  YRK+G
Sbjct: 244 SAMFVLSDFGRYSSAIANLEKDIIAPYVHVVKTISNNESASFEKRPVLAYFQGAIYRKDG 303

Query: 301 GKIRDMLFNILEQEQDVIIKHGTQSRESRRAATHGMHTSKFCLNPAGDTPSACRLFDSVV 360
           G IR  L+N+L+ E+DV    GT      +    GM +SKFCLN AGDTPS+ RLFD++V
Sbjct: 304 GTIRQELYNLLKDEKDVHFAFGTVRGNGTKQTGKGMASSKFCLNIAGDTPSSNRLFDAIV 363

Query: 361 SLCVPVIVSDSIELPFEDVIDYSKIAVFFDSVSAVKPEFLISKLRRISEERILDYQREMK 420
           S CVPVI+SD IELPFED +DYS  +VF  +  AVK EFL++ LR I+E++       +K
Sbjct: 364 SHCVPVIISDQIELPFEDTLDYSGFSVFVHASEAVKKEFLVNILRGITEDQWKKKWGRLK 423

Query: 421 KIKRYFEY---TDSNGTVNEIWRQVSQKLPLIKLMINREKR 440
           ++   FEY   +    +VN IW  VS KL  ++  ++R+ R
Sbjct: 424 EVAGCFEYRFPSQVGDSVNMIWSAVSHKLSSLQFDVHRKNR 448

BLAST of CSPI01G34330 vs. NCBI nr
Match: gi|449455387|ref|XP_004145434.1| (PREDICTED: probable arabinosyltransferase ARAD1 [Cucumis sativus])

HSP 1 Score: 918.3 bits (2372), Expect = 5.4e-264
Identity = 463/464 (99.78%), Postives = 463/464 (99.78%), Query Frame = 1

Query: 1   MARKSSLLKRTLASLCFILALYAIINTFISSTATLKLDRSFPFSSANSVIVSDEFSSQDT 60
           MARKSSLLKRTLASLCFILALYAIINTFISSTATLKLDRSFPFSSANSVIVSDEFSSQDT
Sbjct: 1   MARKSSLLKRTLASLCFILALYAIINTFISSTATLKLDRSFPFSSANSVIVSDEFSSQDT 60

Query: 61  DLLNSSGKSLSPVKIYLYDVPTRFTYGVIENHGIARGGKPVPDVTDLKYPGHQHMAEWFL 120
           DLLNSSGKSLSPVKIYLYDVPTRFTYGVIENHGIARGGKPVPDVTDLKYPGHQHMAEWFL
Sbjct: 61  DLLNSSGKSLSPVKIYLYDVPTRFTYGVIENHGIARGGKPVPDVTDLKYPGHQHMAEWFL 120

Query: 121 FTDLLRPESERIGSAVVRVFDPEEADLFYVPFFSSLSLIVNPIRPATGSDQQQRKLVYSD 180
           FTDLLRPESERIGSAVVRVFDPE ADLFYVPFFSSLSLIVNPIRPATGSDQQQRKLVYSD
Sbjct: 121 FTDLLRPESERIGSAVVRVFDPEVADLFYVPFFSSLSLIVNPIRPATGSDQQQRKLVYSD 180

Query: 181 EETQDAFMEWLEKQEYWKRSNGRDHVIIAQDPNALYRLIDRVKNSILLVSDFGRLRADQA 240
           EETQDAFMEWLEKQEYWKRSNGRDHVIIAQDPNALYRLIDRVKNSILLVSDFGRLRADQA
Sbjct: 181 EETQDAFMEWLEKQEYWKRSNGRDHVIIAQDPNALYRLIDRVKNSILLVSDFGRLRADQA 240

Query: 241 SLVKDVIVPYSHRINTYTGDIGVENRKTLLFFMGNRYRKEGGKIRDMLFNILEQEQDVII 300
           SLVKDVIVPYSHRINTYTGDIGVENRKTLLFFMGNRYRKEGGKIRDMLFNILEQEQDVII
Sbjct: 241 SLVKDVIVPYSHRINTYTGDIGVENRKTLLFFMGNRYRKEGGKIRDMLFNILEQEQDVII 300

Query: 301 KHGTQSRESRRAATHGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPFEDV 360
           KHGTQSRESRRAATHGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPFEDV
Sbjct: 301 KHGTQSRESRRAATHGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPFEDV 360

Query: 361 IDYSKIAVFFDSVSAVKPEFLISKLRRISEERILDYQREMKKIKRYFEYTDSNGTVNEIW 420
           IDYSKIAVFFDSVSAVKPEFLISKLRRISEERILDYQREMKKIKRYFEYTDSNGTVNEIW
Sbjct: 361 IDYSKIAVFFDSVSAVKPEFLISKLRRISEERILDYQREMKKIKRYFEYTDSNGTVNEIW 420

Query: 421 RQVSQKLPLIKLMINREKRVIHRDGDEPNCSCLCSNQTGIRARL 465
           RQVSQKLPLIKLMINREKRVIHRDGDEPNCSCLCSNQTGIRARL
Sbjct: 421 RQVSQKLPLIKLMINREKRVIHRDGDEPNCSCLCSNQTGIRARL 464

BLAST of CSPI01G34330 vs. NCBI nr
Match: gi|659118235|ref|XP_008459015.1| (PREDICTED: probable arabinosyltransferase ARAD1 [Cucumis melo])

HSP 1 Score: 905.6 bits (2339), Expect = 3.6e-260
Identity = 455/464 (98.06%), Postives = 461/464 (99.35%), Query Frame = 1

Query: 1   MARKSSLLKRTLASLCFILALYAIINTFISSTATLKLDRSFPFSSANSVIVSDEFSSQDT 60
           MARKSSLLKRTLASLCFILALYAIINTFISSTATLKLDRSFPFSSANSVIVSDEFSSQDT
Sbjct: 1   MARKSSLLKRTLASLCFILALYAIINTFISSTATLKLDRSFPFSSANSVIVSDEFSSQDT 60

Query: 61  DLLNSSGKSLSPVKIYLYDVPTRFTYGVIENHGIARGGKPVPDVTDLKYPGHQHMAEWFL 120
           DLLNSSGKSLSPVKIYLYDVPTRFTYGVIENHGIARGGKPVP+VTDLKYPGHQHMAEWFL
Sbjct: 61  DLLNSSGKSLSPVKIYLYDVPTRFTYGVIENHGIARGGKPVPEVTDLKYPGHQHMAEWFL 120

Query: 121 FTDLLRPESERIGSAVVRVFDPEEADLFYVPFFSSLSLIVNPIRPATGSDQQQRKLVYSD 180
           FTDLLRPESERIGSAVVRVFDPEEADLFYVPFFSSLSLIVNPIRPATGSDQQQRKLVYSD
Sbjct: 121 FTDLLRPESERIGSAVVRVFDPEEADLFYVPFFSSLSLIVNPIRPATGSDQQQRKLVYSD 180

Query: 181 EETQDAFMEWLEKQEYWKRSNGRDHVIIAQDPNALYRLIDRVKNSILLVSDFGRLRADQA 240
           EETQDAFMEWL KQ+YWKR+NGRDHVIIAQDPNALYRLIDRVKNSILLVSDFGRLRADQA
Sbjct: 181 EETQDAFMEWLGKQDYWKRNNGRDHVIIAQDPNALYRLIDRVKNSILLVSDFGRLRADQA 240

Query: 241 SLVKDVIVPYSHRINTYTGDIGVENRKTLLFFMGNRYRKEGGKIRDMLFNILEQEQDVII 300
           SLVKDVIVPYSHRINTYTGDIGVENRKTLLFFMGNRYRKEGGKIRDMLFNILEQEQDVII
Sbjct: 241 SLVKDVIVPYSHRINTYTGDIGVENRKTLLFFMGNRYRKEGGKIRDMLFNILEQEQDVII 300

Query: 301 KHGTQSRESRRAATHGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPFEDV 360
           KHGTQSRESRRAATHGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPFEDV
Sbjct: 301 KHGTQSRESRRAATHGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPFEDV 360

Query: 361 IDYSKIAVFFDSVSAVKPEFLISKLRRISEERILDYQREMKKIKRYFEYTDSNGTVNEIW 420
           IDYSKIAVFFDSVSA KP FL+SKLRRIS+ERILDYQREMKKIKRYFEYTDSNGTVNEIW
Sbjct: 361 IDYSKIAVFFDSVSAAKPGFLMSKLRRISKERILDYQREMKKIKRYFEYTDSNGTVNEIW 420

Query: 421 RQVSQKLPLIKLMINREKRVIHRDGDEPNCSCLCSNQTGIRARL 465
           RQVSQKLPLIKLMINREKRVIHRDGDEP+CSCLCSNQTGIRARL
Sbjct: 421 RQVSQKLPLIKLMINREKRVIHRDGDEPDCSCLCSNQTGIRARL 464

BLAST of CSPI01G34330 vs. NCBI nr
Match: gi|595863593|ref|XP_007211616.1| (hypothetical protein PRUPE_ppa005538mg [Prunus persica])

HSP 1 Score: 667.2 bits (1720), Expect = 2.2e-188
Identity = 328/460 (71.30%), Postives = 391/460 (85.00%), Query Frame = 1

Query: 1   MARKSSLLKRTLASLCFILALYAIINTFISSTATLKLDRSFPFSSANSVIVSDEFSSQDT 60
           MARKSSLLK++LA++  +L +YA +NTF++ T T KL+ + P  S+ S I SD F+S++ 
Sbjct: 1   MARKSSLLKQSLATIVVVLLIYAFLNTFLTPTTTAKLETALPSFSSASSISSDVFASREN 60

Query: 61  DLLNSSGKSLSPVKIYLYDVPTRFTYGVIENHGIARGGKPVPDVTDLKYPGHQHMAEWFL 120
            L N  GK   PVK+YLYD+P RFTYGVIE+H +ARGG+P  DV+ LKYPGHQHM EW+L
Sbjct: 61  QL-NFPGK---PVKVYLYDLPKRFTYGVIEHHSLARGGRPDEDVSKLKYPGHQHMGEWYL 120

Query: 121 FTDLLRPESERIGSAVVRVFDPEEADLFYVPFFSSLSLIVNPIRPATGSDQQQRKLVYSD 180
           F DLL+PE+ER GS V +V DPEEAD FYVPFFSSLSLIVNP RPA+GSD    K +YSD
Sbjct: 121 FKDLLKPEAERFGSPVQKVLDPEEADFFYVPFFSSLSLIVNPARPASGSD----KPLYSD 180

Query: 181 EETQDAFMEWLEKQEYWKRSNGRDHVIIAQDPNALYRLIDRVKNSILLVSDFGRLRADQA 240
           EE Q A +EWLE+Q YWKR+NGRDHVI+A DPNALY++ID+VKNS+LLV DFGRL+ DQ 
Sbjct: 181 EENQVALIEWLEEQVYWKRNNGRDHVIMASDPNALYKVIDKVKNSVLLVCDFGRLKEDQG 240

Query: 241 SLVKDVIVPYSHRINTYTGDIGVENRKTLLFFMGNRYRKEGGKIRDMLFNILEQEQDVII 300
           SLVKDVIVPYSHRINTY+GDI VE+R TLLFFMGNR+RKEGGKIRD+LF +LE E+DVII
Sbjct: 241 SLVKDVIVPYSHRINTYSGDISVEDRNTLLFFMGNRFRKEGGKIRDLLFQLLENEEDVII 300

Query: 301 KHGTQSRESRRAATHGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPFEDV 360
           KHGTQSRESRRAA+HGMHTSKFCLNPAGDTPSACRLFDS+VSLCVPVIVSDSIELPFEDV
Sbjct: 301 KHGTQSRESRRAASHGMHTSKFCLNPAGDTPSACRLFDSIVSLCVPVIVSDSIELPFEDV 360

Query: 361 IDYSKIAVFFDSVSAVKPEFLISKLRRISEERILDYQREMKKIKRYFEYTDSNGTVNEIW 420
           IDY KIA+F +S +A+KPEFL+S LR I+ ERIL+YQ+E+ ++KRYF+Y   NGTVNEIW
Sbjct: 361 IDYRKIAIFVESNAALKPEFLVSMLRGITTERILEYQKELNEVKRYFQYGVPNGTVNEIW 420

Query: 421 RQVSQKLPLIKLMINREKRVIHRDGDEPNCSCLCSNQTGI 461
           RQV+QKLP IKL INR++R++ RD +  +CSCLCSNQTGI
Sbjct: 421 RQVAQKLPFIKLSINRDRRLVKRDLNVRDCSCLCSNQTGI 452

BLAST of CSPI01G34330 vs. NCBI nr
Match: gi|645238788|ref|XP_008225839.1| (PREDICTED: probable arabinosyltransferase ARAD1 [Prunus mume])

HSP 1 Score: 661.0 bits (1704), Expect = 1.5e-186
Identity = 327/460 (71.09%), Postives = 388/460 (84.35%), Query Frame = 1

Query: 1   MARKSSLLKRTLASLCFILALYAIINTFISSTATLKLDRSFPFSSANSVIVSDEFSSQDT 60
           MARKSSLLK++LA++  +L +YA +NTF++ T T KL+ + P  S+ S I SD F+S++ 
Sbjct: 41  MARKSSLLKQSLATIVVVLLIYAFLNTFLTPTTTAKLETALPSFSSASSISSDVFASREN 100

Query: 61  DLLNSSGKSLSPVKIYLYDVPTRFTYGVIENHGIARGGKPVPDVTDLKYPGHQHMAEWFL 120
            L N  GK   PVK+YLYD+P RFTYGVIE+H +ARG +P  DV+ LKYPGHQHM EW+L
Sbjct: 101 QL-NFPGK---PVKVYLYDLPKRFTYGVIEHHSLARGARPDEDVSKLKYPGHQHMGEWYL 160

Query: 121 FTDLLRPESERIGSAVVRVFDPEEADLFYVPFFSSLSLIVNPIRPATGSDQQQRKLVYSD 180
           F DLL+PESER GS V +V DPEEAD FYVPFFSSLSLIVNP RPA+GSD    K +YSD
Sbjct: 161 FKDLLKPESERFGSPVQKVLDPEEADFFYVPFFSSLSLIVNPARPASGSD----KPLYSD 220

Query: 181 EETQDAFMEWLEKQEYWKRSNGRDHVIIAQDPNALYRLIDRVKNSILLVSDFGRLRADQA 240
           EE Q A +EWLE+Q YWKR+NGRDHVI+A DPNALY++ID VKNS+LLV DFGRL+ DQ 
Sbjct: 221 EENQVALIEWLEEQVYWKRNNGRDHVIMASDPNALYKVIDIVKNSVLLVCDFGRLKEDQG 280

Query: 241 SLVKDVIVPYSHRINTYTGDIGVENRKTLLFFMGNRYRKEGGKIRDMLFNILEQEQDVII 300
           SLVKDVIVPYSHRINTY+GDI VE+R TLLFFMGNR+RKEGGKIRD+LF +LE E+DVII
Sbjct: 281 SLVKDVIVPYSHRINTYSGDISVEDRNTLLFFMGNRFRKEGGKIRDLLFQLLENEEDVII 340

Query: 301 KHGTQSRESRRAATHGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPFEDV 360
           KHGTQSRESRRAA+ GMHTSKFCLNPAGDTPSACRLFDS+VSLCVPVIVSDSIELPFEDV
Sbjct: 341 KHGTQSRESRRAASRGMHTSKFCLNPAGDTPSACRLFDSIVSLCVPVIVSDSIELPFEDV 400

Query: 361 IDYSKIAVFFDSVSAVKPEFLISKLRRISEERILDYQREMKKIKRYFEYTDSNGTVNEIW 420
           IDY KIA+F +S +A+KPEFL+S LR I+ ERIL+YQ+E+ ++KRYF+Y   NGTVNEIW
Sbjct: 401 IDYRKIAIFVESNAALKPEFLVSMLRGITTERILEYQKELNEVKRYFQYGVPNGTVNEIW 460

Query: 421 RQVSQKLPLIKLMINREKRVIHRDGDEPNCSCLCSNQTGI 461
           RQV+QKLP IKL INR++R++ RD +  +CSCLCSNQTGI
Sbjct: 461 RQVAQKLPFIKLSINRDRRLVKRDLNVRDCSCLCSNQTGI 492

BLAST of CSPI01G34330 vs. NCBI nr
Match: gi|657960632|ref|XP_008371893.1| (PREDICTED: probable arabinosyltransferase ARAD1 [Malus domestica])

HSP 1 Score: 660.6 bits (1703), Expect = 2.0e-186
Identity = 327/462 (70.78%), Postives = 389/462 (84.20%), Query Frame = 1

Query: 1   MARKSSLLKRTLASLCFILALYAIINTFISSTATLKLDRSFP-FSS-ANSVIVSDEFSSQ 60
           MARKSSLLK++LA++  +L LYA +NTF++  AT KL+ +FP FSS A++ I SD F++ 
Sbjct: 36  MARKSSLLKQSLATIVGVLVLYAFLNTFLTPAATSKLENAFPSFSSVASNSISSDVFATL 95

Query: 61  DTDLLNSSGKSLSPVKIYLYDVPTRFTYGVIENHGIARGGKPVPDVTDLKYPGHQHMAEW 120
           +  L N  GK   PVK+YLYD+P RFTYGVIE+H +ARGG+P  DV+ LKYPGHQHM EW
Sbjct: 96  ENQL-NLPGK---PVKVYLYDLPKRFTYGVIEHHSLARGGRPDDDVSKLKYPGHQHMGEW 155

Query: 121 FLFTDLLRPESERIGSAVVRVFDPEEADLFYVPFFSSLSLIVNPIRPATGSDQQQRKLVY 180
           +LF DLL+PESER+GS V R  DPEEADLFYVPFFSSLSLIVNP RPA+GS+    K +Y
Sbjct: 156 YLFQDLLKPESERVGSPVERALDPEEADLFYVPFFSSLSLIVNPARPASGSE----KPLY 215

Query: 181 SDEETQDAFMEWLEKQEYWKRSNGRDHVIIAQDPNALYRLIDRVKNSILLVSDFGRLRAD 240
           SDEE Q A +EWLE QEYWKR+ GRDHVI+A DPNALY++I++VKN +LLV DFGRL+ D
Sbjct: 216 SDEENQVALIEWLESQEYWKRNXGRDHVIMASDPNALYKVINKVKNCVLLVCDFGRLKED 275

Query: 241 QASLVKDVIVPYSHRINTYTGDIGVENRKTLLFFMGNRYRKEGGKIRDMLFNILEQEQDV 300
           Q SLVKDVIVPYSHRINTYTGDI VENR  LLFFMGNR+RK+GGKIRD+LF +LE E+DV
Sbjct: 276 QGSLVKDVIVPYSHRINTYTGDISVENRNALLFFMGNRFRKDGGKIRDLLFQLLENEEDV 335

Query: 301 IIKHGTQSRESRRAATHGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPFE 360
           I+KHGTQSRESRRAATHGMHTSKFCLNPAGDTPSACRLFDS+VSLCVPVI+SDSIELPFE
Sbjct: 336 IVKHGTQSRESRRAATHGMHTSKFCLNPAGDTPSACRLFDSIVSLCVPVIISDSIELPFE 395

Query: 361 DVIDYSKIAVFFDSVSAVKPEFLISKLRRISEERILDYQREMKKIKRYFEYTDSNGTVNE 420
           DVIDY KIA+F +S +A+KP FL+S LR I  ERIL+YQ E+ ++K YF+Y + NGTVNE
Sbjct: 396 DVIDYRKIAIFVESNAALKPGFLVSMLRGIPTERILEYQTELNEVKHYFQYGEPNGTVNE 455

Query: 421 IWRQVSQKLPLIKLMINREKRVIHRDGDEPNCSCLCSNQTGI 461
           IWRQ++QKLP IKL INRE+R++ RD +  +CSCLCSNQTGI
Sbjct: 456 IWRQIAQKLPFIKLTINRERRLVKRDSNVLDCSCLCSNQTGI 489

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
ARAD1_ARATH1.3e-16462.42Probable arabinosyltransferase ARAD1 OS=Arabidopsis thaliana GN=ARAD1 PE=1 SV=1[more]
ARAD2_ARATH3.8e-15366.15Probable arabinosyltransferase ARAD2 OS=Arabidopsis thaliana GN=ARAD2 PE=1 SV=1[more]
GLYT4_ARATH1.8e-1726.88Probable glycosyltransferase At5g11130 OS=Arabidopsis thaliana GN=At5g11120/At5g... [more]
GLYT1_ARATH1.5e-1627.69Probable glycosyltransferase At3g07620 OS=Arabidopsis thaliana GN=At3g07620 PE=3... [more]
IX10L_ARATH2.0e-1624.84Probable beta-1,4-xylosyltransferase IRX10L OS=Arabidopsis thaliana GN=IRX10L PE... [more]
Match NameE-valueIdentityDescription
A0A0A0M0S0_CUCSA3.7e-26499.78Uncharacterized protein OS=Cucumis sativus GN=Csa_1G701350 PE=4 SV=1[more]
M5X179_PRUPE1.5e-18871.30Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005538mg PE=4 SV=1[more]
W9S3J0_9ROSA3.5e-17767.38Putative glucuronosyltransferase GUT1 OS=Morus notabilis GN=L484_018501 PE=4 SV=... [more]
A0A061E3Q4_THECC3.2e-17567.17Exostosin family protein isoform 1 OS=Theobroma cacao GN=TCM_006051 PE=4 SV=1[more]
B9SCC6_RICCO2.7e-17466.16Catalytic, putative OS=Ricinus communis GN=RCOM_0891750 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G35100.17.2e-16662.42 Exostosin family protein[more]
AT5G44930.12.2e-15466.15 Exostosin family protein[more]
AT1G67410.15.3e-7640.36 Exostosin family protein[more]
AT3G45400.13.0e-7137.17 exostosin family protein[more]
AT1G74680.12.8e-6936.23 Exostosin family protein[more]
Match NameE-valueIdentityDescription
gi|449455387|ref|XP_004145434.1|5.4e-26499.78PREDICTED: probable arabinosyltransferase ARAD1 [Cucumis sativus][more]
gi|659118235|ref|XP_008459015.1|3.6e-26098.06PREDICTED: probable arabinosyltransferase ARAD1 [Cucumis melo][more]
gi|595863593|ref|XP_007211616.1|2.2e-18871.30hypothetical protein PRUPE_ppa005538mg [Prunus persica][more]
gi|645238788|ref|XP_008225839.1|1.5e-18671.09PREDICTED: probable arabinosyltransferase ARAD1 [Prunus mume][more]
gi|657960632|ref|XP_008371893.1|2.0e-18670.78PREDICTED: probable arabinosyltransferase ARAD1 [Malus domestica][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR004263Exostosin
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
biological_process GO:0008150 biological_process
biological_process GO:0006486 protein glycosylation
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0016740 transferase activity
molecular_function GO:0003674 molecular_function
molecular_function GO:0016757 transferase activity, transferring glycosyl groups

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI01G34330.1CSPI01G34330.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR004263Exostosin-likePFAMPF03016Exostosincoord: 71..384
score: 2.0
NoneNo IPR availablePANTHERPTHR11062EXOSTOSIN HEPARAN SULFATE GLYCOSYLTRANSFERASE -RELATEDcoord: 18..449
score: 1.4E
NoneNo IPR availablePANTHERPTHR11062:SF50SUBFAMILY NOT NAMEDcoord: 18..449
score: 1.4E