Cla020014 (gene) Watermelon (97103) v1

Name: Cla020014
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Exostosin family protein (AHRD V1 **-- D7M892_ARALL); contains InterPro domain(s) IPR004263 Exostosin-like
Location: Chr2 : 24834145 .. 24839464 (-)
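
The length of the feature can be worked out from the coordinates on the Location line. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common genome-browser convention that this page does not state explicitly). The function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Parse a location string such as 'Chr2 : 24834145 .. 24839464 (-)'."""
    m = re.search(r"(\w+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Chr2 : 24834145 .. 24839464 (-)")
print(loc["chrom"], loc["strand"], loc["length"])  # Chr2 - 5320
```

Under that assumption the gene spans 5320 bp on the minus strand of Chr2.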

The following sequences are available for this feature:

Gene sequence (with intron)

ATGCGAGGAGGAAGGAGCCTTCCATTTTCTTCATCAATGTCTGCTCAGATCCAAAGATCCAATCGATCTCCTCTTCTTCTTTTCACACTCTCTCTTCTCGCTCTTTCCGTCCTCTTCATCCTCGTTTTTCTCTCCCCTTCCAATCCAAATCCCACTTCCTTCCACCCTCCAATTTCCTCTGTGAAACCTGAAACTTCGTTCGTCCTTTCACTCGAACACTTCCTCACTCACAAGGCTCCGAAATCGCCTCCGCCTCACGATGACACGGTTCCTGCGGCCGGAGATGTTGAAGAGGCCTCAAGGAAGCTCGACGAGGCACTTTCAGAGGCAGAAATGGAGCGGGTGGTTCGAGATCCGTACTATCCATTGGGGTCGCCGATTAGAGTGTATGTTTATGAAATGCCGTGGAAATTCACTTACGATTTGCTCTGGACGTTCAGGAATACCTACAGAGAAACCTCTAATCTCACCTCCAATGGCAGCCCCGTCCACCGCCTTATTGAACAGGTATTTACACATAATTGAAAGTTTCTTAGTGTAGGTTATATATCTATATATATTTTACGCTCACCTCACTTGTGCCTTCGAGAATTTACACTATACTCGAGTGATTTTCAATTTTACTTGAGAGAAGATGACATTGTTGTATAATTCTCAATCTTAATTGAGAGAAAATGGCATTACTAATCCGAATATAAGATGCTCTAATACTGGAAGTTTCATTAATAGAATTAGGACTGTTTAATCAATTATATATACACTTAACAAGTGAACTAGTTTGATTCGCATTTCTTACATGGAATGAAAACGGTGTTTATATCATTCTTCTAGTTGATATGAACTATGGACTTCGCATTTTATTAAAGGGTTAAGTTATTTTCTTAAAAATTCCGGAAAATGTATAATGGTTAAGGTATTATACTGTTTAAAAGTGTCGATTATTTTTATGGCATTTTCATATTAGATGCGTTTATCTTGTTCCTTGTTGATATTATGAAGGAGTGGATAAAGTCATATAGAACTCGATGAGTAGCTTATTGCTTCGGTGTATATAGTTCTTTTAGTTCTCCATATACTTCAGTTTTTATTTCAGCATTACATGTTTCTATTTGTAGTAACTAAAATGCCAAAAATGGGTTTCTTTTGTAGCATTCTATTGATTACTGGCTGTGGGCCGACTTAATTGCTCCAGAGTCAGAGAGACTGCTGAAAGGTGTGGTGAGGGTTTATCGGCAGGAGGAAGCAGATCTCTTCTATATTCCCTTCTTCACAACTATCAGCTTCTTCTTGTTGGAGAAACAACAGTGCAAAGCACTTTACAGGGTGTGAGTGAGCTTCCAAGGTTTTCTTTATTGTTCTTTCAATTGTTATGCATGACATGCCTATAGTGAAATTAGGCTTTTTCTCACTTATCAACTTTTGTAAAGGCAGATCTTTTTCTTCTTTGTTTTATTCTATGAAAGGATACGATAATGTTTTTTTAGGAGAGTTAAAATACCGGCTTTATTTCTCACGTGTGAGTTGATTGGTTAGTTTGAGTATGTGCTCATTGAGTGGATTAGTTATAATATTAGGAGATCATTCAGTACTTCCTACTGCTGAACTAATTTATGTCGATTTCGTTAAGGCGGAAGTGGATTTCTTTATTCTATTAAGAGTGGGGGCAGAATTGCTCTAAAGCGAGCAATCTCTAGAAGTGAGCAGGTAGTACTTTTAAGGGAGGATTCAGTTTCCAATCTGACTCCCATTCCTTCTTTAGGATAGGAAGAGATCCCATTGGCTATGTAACTAAATACTGGGGAAATCAAGGTTACGTCGACCAATGTATCTAATGAACAGGCTTAGCGACCAAGGTATTCAATGAACAGGCTGAGTTCACCATGTCCCTATTCCTAGAAGAAAACAGCCCAAGGGAAATGACTTTTGTGGCATTTGAAACTGTTGTTGATGGGTTAGGGCGGTGCAATGTATAGGTGTACTTTTGAGAATCCTAGTGGTAG
AGTAGAGGAATAAGGAGTACGAGAACAAAGAACTGAATTGAATTTGCTAATCCCACTAACGACATTTGGTTTAGGTATAATTTTGTGGAATTTGGTTTTCTTTTATTTGGATAGGTGTACTTTTGATTCACTAATGGCACCAATGACTTGAATATGCTTATCTCACTTGCCAATGGAAGTTTGACTTGGTCCAGTCATGGAGGGTAGCTGATCTGATCACCGCTCTGATAGATAGAGTTATTGTTCTTTTATTTTAAAGCCTCAAAACTTTAGGGTTTACCGAGAAGAAAAAGTTCCAAAACAAAATTTGAAGTTTAGTGGTATTAATGCATTCTATTTTTTGGAGCATAATTATGATTACTTCGGAATGTTATTTTTGTTCAAAATAGAGTGCTTTCGTCTGTGCTCCATAAAGCATTGATGTACTTTTACATGTGGATTCAGGAGGCTCTAAAGTGGGTGACAGATCAACCTGCATGGAAACGATCTGAAGGCAGGGATCACATACTTCCAGTTCATCATCCATGGTCTTTTAAGACCGTTAGAAAATTCATGAAGAATGCCATTTGGTTGCTACCGGATATGGACTCGACAGGGAACTGGTAGACACTAAGATGATAGAACCTCTTAGAAGTTATTGAATGTGCTGTGAAGCTTGATTGAGTGACATGTACTGATATTTTAGGTACAAGCCTGGGCAAGTCTATCTGGAAAAGGACCTTATTCTTCCCTACGTTCCAAATGTTGATTTGTGTGACAGCAAATGCTTATCAGATGGGCAATCAAAGAGAAGCATACTGCTCTTTTTCCGTGGTCGGCTCAAAAGAAATGCAGTAAGTTTTGGAAAATATCAGTGGAATTTTCGGCACTTTATACTGCTTCTTTATGCATTTGCTTAATGTATAGTAAGATGCATGAACTTTCTTTAGTGAGTCACATGAAATTTCGTTGGTAAGTTATGAGTTGATTTCTTAAAGGGGAACCAAATACAGCCATGTAAATATTTTTAAAATTCAGCTGGCAGTGGATGAGTGCTTTGGTAATAGACTAGAATCATTATTCATGACATAGCTCTTGATAGACTATTCTACTTACTCTTAGGGAGGTAAGATACGGGCAAAACTTGTTGGAGAGCTAAGTGGTGCAGATGATGTAGTAATAGAAGAGGGAACGGCTGGAGAAGGAGGGAAAGCAGCAGCTCAGGCTGGAATGCGCAAGTAACATATAAGTTTTTATTTTGATCTTTCTGCATTTTCTTTTGTTACCATTGCAAATTTCAGATTGTCAAATTTCTTCTTCCATCTCCAGGTCCACCTTTTGCTTAAGTCCTGCTGGCGACACTCCATCATCTGCCAGATTGTTTGATGCTATTGTCAGCGGCTGCATTCCTGTCATTGTTAGTGATGAATTGGAGCTTCCATTTGAAGGAATACTTGACTACAGGAAGGTAAATTGATGAATCATCTTGAAGCAAAATTGGTAACAAATTAAATTTTCAGTTACATTAGTGAATTTGAGACATTCCGTTGCATTAAGAATATTGCAATTGATATAATCTTTGATTATATAAATTTGATATTGAATTTCATCAGCCAACCAAAATCCGAAAAAACATCTCGAAAGTAGGAAAAGAAAAAATCATTAATAAATTCTTCGTTTATCTAGACCGCATGATCAATAGCATGATTTCATCTCAATGAACAAGAAGTTGAAAACAAGTATACGAGAACTTGTGGAACGTTTATCTATGATGTTCTAGAGCTCAAATGGCATTTTTCTCATGCAGATTGCGTTATTTGTTTCCTCTAGCGATGCTCTAAAATCAGGATGGCTTTTAACATATTTGAGGAGTGTTAGTGCTGGGGATATCAGAAGATTGCAACAAAATCTCGCCAAGGTGTGTAGCTCCTCCAAAATTAAAGTGCCCATTGGAAGAGTTTTTTTGTTGAGTAGTTACAAACACAACAATCAAGTTCAAAGTATTTATAGATATAGGACAA
TGCAAAAAAATTGAAATATAGCAAAAAAATTTTGCTTTCCTATGTTAATATCAGGGTCTATCAGTGATAAATTTTGCTATATTTGCAATTTTTTAAAATGTTGCTATACACCTAATTATTATCTCTAAAAGCTCTATCCAATAAATTACCTTTTTTTTTTTTTTGTTAACTTTCTTAGCTTGGGGCTTGGCTTTCCTCCCATCTCTTTTTAAATTTCGAATATCTTTTTTCTCATACAAAATTTTGAGCAACTGAAACTAATTTCTACTCTTATTTCGTGGCATTCGCCAGTTCTCAAGACATTTCCTTTATTCCAGTCCTGCTCAACCTATGGGTCCGGAAGACCTTGCTTGGAAAATGGTTAGTTTCTCGTCCTTCATTTCTACTTGTCCTGATTTTGTGATATTAACAATTTGTCTGCTGGCAAATATCTAATCTAGCATATGCGTGTGAAAGTCATTTTAAAATGGTTAAAATCATGACATATTTTATCATTAAAAACCAATATTTTAATGCTATGTAAAACCAAACATTAAAATGATTTTGAATGATTAAAGGATGGTTTTTGAGTTATTTTGAATATGACAACTTAACGAGAACCAGACATGTCCCAAATCAAATAGTTTAACAAACTCTTTTGCCATGCTACCAAAAACCAACCACTAGCACATGTCGGCGTTCTTTTTCTTTCACGAAGAAACTAACTTTTATGGAGAAGAAATGAAATAGTACACATGGACAAAATCGAAAACAAAAGGAGTCGAGAAAAACAAAGAACTGCGGTGGTTGATTCATAACTTTCTGGCATTCTAATTATGAGCAATGAAGTTTGGTGGCTGGTTACGGTTTGTGCAAATCTTTTAGGTGAAATGAATGCAATTGTGACAAAAGATTGACATCCGTCCATGCGTCAATATATCCTTAAATTGAGGAATTCAACATCACAAGAATTAATCAATATTTTTGTTATCTTTTATCTTACCCATAAAACATAAAAAATATCAACTATAAATAAAATTTATATGAATATGAATACTAATAACAAGGTCTTAATGTTAGTTTGTTTTGATAATTTTTTTGATTTACATAATTATACGTTGATGTGGATATTGTATCAATATATCCGTGGATATATTCGTATAATTGAAATACAAATACCAATATCGTCATCGAATTTGTGATTGAAATTTGCAGATTGCTGGTAAGTTGGTGAATGTAAAGCTTCATACGAGGAGAACCCAACGTGTGGTAAAAGAATCGAGGAGTATATGTAGTTGTGACTGCAGGCGTTCAAATTTTACCAACTCCCCTCCTTCCTAA

mRNA sequence

ATGCGAGGAGGAAGGAGCCTTCCATTTTCTTCATCAATGTCTGCTCAGATCCAAAGATCCAATCGATCTCCTCTTCTTCTTTTCACACTCTCTCTTCTCGCTCTTTCCGTCCTCTTCATCCTCGTTTTTCTCTCCCCTTCCAATCCAAATCCCACTTCCTTCCACCCTCCAATTTCCTCTGTGAAACCTGAAACTTCGTTCGTCCTTTCACTCGAACACTTCCTCACTCACAAGGCTCCGAAATCGCCTCCGCCTCACGATGACACGGTTCCTGCGGCCGGAGATGTTGAAGAGGCCTCAAGGAAGCTCGACGAGGCACTTTCAGAGGCAGAAATGGAGCGGGTGGTTCGAGATCCGTACTATCCATTGGGGTCGCCGATTAGAGTGTATGTTTATGAAATGCCGTGGAAATTCACTTACGATTTGCTCTGGACGTTCAGGAATACCTACAGAGAAACCTCTAATCTCACCTCCAATGGCAGCCCCGTCCACCGCCTTATTGAACAGCATTCTATTGATTACTGGCTGTGGGCCGACTTAATTGCTCCAGAGTCAGAGAGACTGCTGAAAGGTGTGGTGAGGGTTTATCGGCAGGAGGAAGCAGATCTCTTCTATATTCCCTTCTTCACAACTATCAGCTTCTTCTTGTTGGAGAAACAACAGTGCAAAGCACTTTACAGGGTAGTGCTTTCGTCTGTGCTCCATAAAGCATTGATGTACTTTTACATGTGGATTCAGGAGGCTCTAAAGTGGGTGACAGATCAACCTGCATGGAAACGATCTGAAGGCAGGGATCACATACTTCCAGTTCATCATCCATGGTCTTTTAAGACCGTTAGAAAATTCATGAAGAATGCCATTTGGTTGCTACCGGATATGGACTCGACAGGGAACTGGTACAAGCCTGGGCAAGTCTATCTGGAAAAGGACCTTATTCTTCCCTACGTTCCAAATGTTGATTTGTGTGACAGCAAATGCTTATCAGATGGGCAATCAAAGAGAAGCATACTGCTCTTTTTCCGTGGTCGGCTCAAAAGAAATGCAGGAGGTAAGATACGGGCAAAACTTGTTGGAGAGCTAAGTGGTGCAGATGATGTAGTAATAGAAGAGGGAACGGCTGGAGAAGGAGGGAAAGCAGCAGCTCAGGCTGGAATGCGCAAGTCCACCTTTTGCTTAAGTCCTGCTGGCGACACTCCATCATCTGCCAGATTGTTTGATGCTATTGTCAGCGGCTGCATTCCTGTCATTGTTAGTGATGAATTGGAGCTTCCATTTGAAGGAATACTTGACTACAGGAAGATTGCGTTATTTGTTTCCTCTAGCGATGCTCTAAAATCAGGATGGCTTTTAACATATTTGAGGAGTGTTAGTGCTGGGGATATCAGAAGATTGCAACAAAATCTCGCCAAGTTCTCAAGACATTTCCTTTATTCCAGTCCTGCTCAACCTATGGGTCCGGAAGACCTTGCTTGGAAAATGATTGCTGGTAAGTTGGTGAATGTAAAGCTTCATACGAGGAGAACCCAACGTGTGGTAAAAGAATCGAGGAGTATATGTAGTTGTGACTGCAGGCGTTCAAATTTTACCAACTCCCCTCCTTCCTAA

Coding sequence (CDS)

ATGCGAGGAGGAAGGAGCCTTCCATTTTCTTCATCAATGTCTGCTCAGATCCAAAGATCCAATCGATCTCCTCTTCTTCTTTTCACACTCTCTCTTCTCGCTCTTTCCGTCCTCTTCATCCTCGTTTTTCTCTCCCCTTCCAATCCAAATCCCACTTCCTTCCACCCTCCAATTTCCTCTGTGAAACCTGAAACTTCGTTCGTCCTTTCACTCGAACACTTCCTCACTCACAAGGCTCCGAAATCGCCTCCGCCTCACGATGACACGGTTCCTGCGGCCGGAGATGTTGAAGAGGCCTCAAGGAAGCTCGACGAGGCACTTTCAGAGGCAGAAATGGAGCGGGTGGTTCGAGATCCGTACTATCCATTGGGGTCGCCGATTAGAGTGTATGTTTATGAAATGCCGTGGAAATTCACTTACGATTTGCTCTGGACGTTCAGGAATACCTACAGAGAAACCTCTAATCTCACCTCCAATGGCAGCCCCGTCCACCGCCTTATTGAACAGCATTCTATTGATTACTGGCTGTGGGCCGACTTAATTGCTCCAGAGTCAGAGAGACTGCTGAAAGGTGTGGTGAGGGTTTATCGGCAGGAGGAAGCAGATCTCTTCTATATTCCCTTCTTCACAACTATCAGCTTCTTCTTGTTGGAGAAACAACAGTGCAAAGCACTTTACAGGGTAGTGCTTTCGTCTGTGCTCCATAAAGCATTGATGTACTTTTACATGTGGATTCAGGAGGCTCTAAAGTGGGTGACAGATCAACCTGCATGGAAACGATCTGAAGGCAGGGATCACATACTTCCAGTTCATCATCCATGGTCTTTTAAGACCGTTAGAAAATTCATGAAGAATGCCATTTGGTTGCTACCGGATATGGACTCGACAGGGAACTGGTACAAGCCTGGGCAAGTCTATCTGGAAAAGGACCTTATTCTTCCCTACGTTCCAAATGTTGATTTGTGTGACAGCAAATGCTTATCAGATGGGCAATCAAAGAGAAGCATACTGCTCTTTTTCCGTGGTCGGCTCAAAAGAAATGCAGGAGGTAAGATACGGGCAAAACTTGTTGGAGAGCTAAGTGGTGCAGATGATGTAGTAATAGAAGAGGGAACGGCTGGAGAAGGAGGGAAAGCAGCAGCTCAGGCTGGAATGCGCAAGTCCACCTTTTGCTTAAGTCCTGCTGGCGACACTCCATCATCTGCCAGATTGTTTGATGCTATTGTCAGCGGCTGCATTCCTGTCATTGTTAGTGATGAATTGGAGCTTCCATTTGAAGGAATACTTGACTACAGGAAGATTGCGTTATTTGTTTCCTCTAGCGATGCTCTAAAATCAGGATGGCTTTTAACATATTTGAGGAGTGTTAGTGCTGGGGATATCAGAAGATTGCAACAAAATCTCGCCAAGTTCTCAAGACATTTCCTTTATTCCAGTCCTGCTCAACCTATGGGTCCGGAAGACCTTGCTTGGAAAATGATTGCTGGTAAGTTGGTGAATGTAAAGCTTCATACGAGGAGAACCCAACGTGTGGTAAAAGAATCGAGGAGTATATGTAGTTGTGACTGCAGGCGTTCAAATTTTACCAACTCCCCTCCTTCCTAA

Protein sequence

MRGGRSLPFSSSMSAQIQRSNRSPLLLFTLSLLALSVLFILVFLSPSNPNPTSFHPPISSVKPETSFVLSLEHFLTHKAPKSPPPHDDTVPAAGDVEEASRKLDEALSEAEMERVVRDPYYPLGSPIRVYVYEMPWKFTYDLLWTFRNTYRETSNLTSNGSPVHRLIEQHSIDYWLWADLIAPESERLLKGVVRVYRQEEADLFYIPFFTTISFFLLEKQQCKALYRVVLSSVLHKALMYFYMWIQEALKWVTDQPAWKRSEGRDHILPVHHPWSFKTVRKFMKNAIWLLPDMDSTGNWYKPGQVYLEKDLILPYVPNVDLCDSKCLSDGQSKRSILLFFRGRLKRNAGGKIRAKLVGELSGADDVVIEEGTAGEGGKAAAQAGMRKSTFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDYRKIALFVSSSDALKSGWLLTYLRSVSAGDIRRLQQNLAKFSRHFLYSSPAQPMGPEDLAWKMIAGKLVNVKLHTRRTQRVVKESRSICSCDCRRSNFTNSPPS
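
The protein sequence above corresponds to a codon-by-codon translation of the CDS. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code; the `translate` helper is hypothetical and is not the pipeline used by the database. Only the first 33 nucleotides of the CDS are translated here.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 33 nucleotides of the CDS listed above.
print(translate("ATGCGAGGAGGAAGGAGCCTTCCATTTTCTTCA"))  # MRGGRSLPFSS
```

The result matches the first eleven residues of the listed protein sequence.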
BLAST of Cla020014 vs. Swiss-Prot
Match: ARAD1_ARATH (Probable arabinosyltransferase ARAD1 OS=Arabidopsis thaliana GN=ARAD1 PE=1 SV=1)

HSP 1 Score: 236.5 bits (602), Expect = 6.7e-61
Identity = 144/413 (34.87%), Positives = 221/413 (53.51%), Query Frame = 1

Query: 122 PLGSPIRVYVYEMPWKFTYDLLWTFR----NTYRETSNLTSNGSPVHRLIEQHSIDYWLW 181
           P+   +RVY+Y +P +FTY L+           +   ++T+   P H    QH  +++L+
Sbjct: 55  PIQPRVRVYMYNLPKRFTYGLIEQHSIARGGIKKPVGDVTTLKYPGH----QHMHEWYLF 114

Query: 182 ADLIAPESERLLKGVVRVYRQEEADLFYIPFFTTISFFLLEKQQCKALYRVVLSSVLHKA 241
           +DL  PE +R    +VRV    +ADLFY+P F+++S  +   +  +A             
Sbjct: 115 SDLNQPEVDRSGSPIVRVSDPADADLFYVPVFSSLSLIVNAGRPVEA------------G 174

Query: 242 LMYFYMWIQEAL-KWVTDQPAWKRSEGRDHILPVHHPWSFKTVRKFMKNAIWLLPDMDST 301
             Y    +QE L +W+  Q  W+R+ GRDH++P   P +   +   +KNA+ L+ D    
Sbjct: 175 SGYSDEKMQEGLVEWLEGQEWWRRNAGRDHVIPAGDPNALYRILDRVKNAVLLVSDFGRL 234

Query: 302 GNWYKPGQVYLEKDLILPYVPNVDLCDSKCLSDGQSKRSILLFFRGRLKRNAGGKIRAKL 361
               +P Q    KD+++PY   V+L + +    G   R+ LLFF G   R  GGK+R  L
Sbjct: 235 ----RPDQGSFVKDVVIPYSHRVNLFNGEI---GVEDRNTLLFFMGNRYRKDGGKVRDLL 294

Query: 362 VGELSGADDVVIEEGTAGEGGKAAAQAGMRKSTFCLSPAGDTPSSARLFDAIVSGCIPVI 421
              L   DDV I+ GT     + AA  GM  S FCL+PAGDTPS+ RLFD+IVS C+P+I
Sbjct: 295 FQVLEKEDDVTIKHGTQSRENRRAATKGMHTSKFCLNPAGDTPSACRLFDSIVSLCVPLI 354

Query: 422 VSDELELPFEGILDYRKIALFVSSSDALKSGWLLTYLRSVSAGDIRRLQQNLAKFSRHFL 481
           VSD +ELPFE ++DYRK ++FV ++ AL+ G+L+  LR +    I   Q+ +    R+F 
Sbjct: 355 VSDSIELPFEDVIDYRKFSIFVEANAALQPGFLVQMLRKIKTKKILEYQREMKSVRRYFD 414

Query: 482 YSSPAQPMGPEDLAWKMIAGKLVNVKLHTRRTQRVVKESRSICSCDCRRSNFT 530
           Y +   P G     W+ ++ KL  +KL + R +R+V  + +  +C C  +N T
Sbjct: 415 YDN---PNGAVKEIWRQVSHKLPLIKLMSNRDRRLVLRNLTEPNCSCLCTNQT 441
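
The percentages in the HSP header of the ARAD1_ARATH hit can be reproduced from the reported counts: each is a count of aligned columns divided by the alignment length. A minimal sketch follows; the function name `hsp_percentages` is hypothetical.

```python
def hsp_percentages(identical, positives, aligned_length):
    """Return (percent identity, percent positives), rounded to two decimals."""
    return (
        round(100 * identical / aligned_length, 2),
        round(100 * positives / aligned_length, 2),
    )

# Counts taken from the ARAD1_ARATH HSP header: 144/413 identical, 221/413 positive.
print(hsp_percentages(144, 221, 413))  # (34.87, 53.51)
```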

BLAST of Cla020014 vs. Swiss-Prot
Match: ARAD2_ARATH (Probable arabinosyltransferase ARAD2 OS=Arabidopsis thaliana GN=ARAD2 PE=1 SV=1)

HSP 1 Score: 226.1 bits (575), Expect = 9.1e-58
Identity = 142/403 (35.24%), Positives = 214/403 (53.10%), Query Frame = 1

Query: 128 RVYVYEMPWKFTYDLLWTFRNTYRETSNLTSNGSPVHRLIEQHSIDYWLWADLIAPESER 187
           +VY+YE+P  FTY ++   ++   ++ ++T    P H    QH  +++L++DL  PE +R
Sbjct: 66  KVYMYELPTNFTYGVIE--QHGGEKSDDVTGLKYPGH----QHMHEWYLYSDLTRPEVKR 125

Query: 188 LLKGVVRVYRQEEADLFYIPFFTTISFFLLEKQQCKALYRVVLSSVLHKALMYFYMWIQE 247
           +   +VRV+   EADLFY+  F+++S  +   +                   Y    +QE
Sbjct: 126 VGSPIVRVFDPAEADLFYVSAFSSLSLIVDSGRP---------------GFGYSDEEMQE 185

Query: 248 AL-KWVTDQPAWKRSEGRDHILPVHHPWSFKTVRKFMKNAIWLLPDMDSTGNWYKPGQVY 307
           +L  W+  Q  W+R+ GRDH++    P + K V   +KNA+ L+ D D      +  Q  
Sbjct: 186 SLVSWLESQEWWRRNNGRDHVIVAGDPNALKRVMDRVKNAVLLVTDFDRL----RADQGS 245

Query: 308 LEKDLILPYVPNVDLCDSKCLSDGQSKRSILLFFRGRLKRNAGGKIRAKLVGELSGADDV 367
           L KD+I+PY   +D  + +    G  +R+ LLFF G   R  GGK+R  L   L   +DV
Sbjct: 246 LVKDVIIPYSHRIDAYEGEL---GVKQRTNLLFFMGNRYRKDGGKVRDLLFKLLEKEEDV 305

Query: 368 VIEEGTAGEGGKAAAQAGMRKSTFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFE 427
           VI+ GT       A + GM  S FCL  AGDT S+ RLFDAI S C+PVIVSD +ELPFE
Sbjct: 306 VIKRGTQSRENMRAVKQGMHTSKFCLHLAGDTSSACRLFDAIASLCVPVIVSDGIELPFE 365

Query: 428 GILDYRKIALFVSSSDALKSGWLLTYLRSVSAGDIRRLQQNLAKFSRHFLYSSPAQPMGP 487
            ++DYRK ++F+    ALK G+++  LR V  G I + Q+ + +  R+F Y+      G 
Sbjct: 366 DVIDYRKFSIFLRRDAALKPGFVVKKLRKVKPGKILKYQKVMKEVRRYFDYT---HLNGS 425

Query: 488 EDLAWKMIAGKLVNVKLHTRRTQRVVKESRSICSCDCRRSNFT 530
            +  W+ +  K+  +KL   R +R++K   S   C C  SN T
Sbjct: 426 VNEIWRQVTKKIPLIKLMINREKRMIKRDGSDPQCSCLCSNQT 437

BLAST of Cla020014 vs. Swiss-Prot
Match: F8H_ARATH (Probable glucuronoxylan glucuronosyltransferase F8H OS=Arabidopsis thaliana GN=F8H PE=2 SV=1)

HSP 1 Score: 100.9 bits (250), Expect = 4.4e-20
Identity = 90/349 (25.79%), Positives = 154/349 (44.13%), Query Frame = 1

Query: 188 LLKGVVRVYRQEEADLFYIPFFTTISFFLLEKQQCKALYRVVLSSVLHKALMYFYMWIQE 247
           LL   VR    +EAD F++P + + +F         +  R +LSS               
Sbjct: 146 LLSSDVRTLDPDEADYFFVPVYVSCNFSTSNGFPSLSHARSLLSS--------------- 205

Query: 248 ALKWVTDQ-PAWKRSEGRDHILPVHHPWSF-----------KTVRKFMKNAIWLLPDMDS 307
           A+ +++D  P W RS+G DH+    H +             + + KFMK +I L     +
Sbjct: 206 AVDFLSDHYPFWNRSQGSDHVFVASHDFGACFHAMEDMAIEEGIPKFMKRSIIL----QT 265

Query: 308 TGNWYKPGQVYLEKDLILPYVPNVDLCDSKCLSDGQSKRSILLFFRGRLK---RNAGGK- 367
            G  YK     +E  +I PY+P   +  +   +    +R I  FFRG+++   +N  G+ 
Sbjct: 266 FGVKYKHPCQEVEHVVIPPYIPPESVQKAIEKAPVNGRRDIWAFFRGKMEVNPKNISGRF 325

Query: 368 ----IRAKLVGELSGADDVVIEEGTAGEGGKAAAQAGMRKSTFCLSPAGDTPSSARLFDA 427
               +R  ++ +  G     +          A  ++ + +S FCL P G  P S RL ++
Sbjct: 326 YSKGVRTAILKKFGGRRRFYLNRHRF-----AGYRSEIVRSVFCLCPLGWAPWSPRLVES 385

Query: 428 IVSGCIPVIVSDELELPFEGILDYRKIALFVSSSDALKSGWLLTYLRSVSAGDIRRLQQN 487
            V GC+PV+++D ++LPF   + + +I+L V+  D      L   L  V+A ++  +Q+N
Sbjct: 386 AVLGCVPVVIADGIQLPFSETVQWPEISLTVAEKDVRN---LRKVLEHVAATNLSAIQRN 445

Query: 488 LAK--FSRHFLYSSPAQPMGPEDLAWKMIAGKLVNVKLHTRRTQRVVKE 515
           L +  F R  LY+    PM   D  W ++      +   + R  RV+ +
Sbjct: 446 LHEPVFKRALLYN---VPMKEGDATWHILESLWRKLDDRSYRRSRVLSQ 464

BLAST of Cla020014 vs. Swiss-Prot
Match: IRX7_ARATH (Probable glucuronoxylan glucuronosyltransferase IRX7 OS=Arabidopsis thaliana GN=IRX7 PE=2 SV=1)

HSP 1 Score: 95.9 bits (237), Expect = 1.4e-18
Identity = 96/404 (23.76%), Positives = 173/404 (42.82%), Query Frame = 1

Query: 127 IRVYVYEMPWKFTYDLLWTFRNTYRETSNLTSNGSPVHRLIEQHSIDYWLWADLIAPESE 186
           +++YVY++P KF  D L   R T                       ++   A++   ++ 
Sbjct: 94  LKIYVYDLPSKFNKDWLANDRCT-----------------------NHLFAAEVALHKAF 153

Query: 187 RLLKGVVRVYRQEEADLFYIPFFTTISFFLLEKQQCKALYRVVLSSVLHKALMYFYMWIQ 246
             L+G VR     EAD F++P + + +F  +                   A+ +    I 
Sbjct: 154 LSLEGDVRTEDPYEADFFFVPVYVSCNFSTING---------------FPAIGHARSLIN 213

Query: 247 EALKWVTDQ-PAWKRSEGRDHILPVHHPWS--FKTVRK---------FMKNAIWLLPDMD 306
           +A+K V+ Q P W R+ G DH+    H +   F T+           F++N+I L     
Sbjct: 214 DAIKLVSTQYPFWNRTSGSDHVFTATHDFGSCFHTMEDRAIADGVPIFLRNSIIL----Q 273

Query: 307 STGNWYKPGQVYLEKDLILPYVPNVDLCDSKCLSDGQSKRSILLFFRGRLK---RNAGGK 366
           + G  +      +E  +I PY+    L  ++       +R I +FFRG+++   +N  G+
Sbjct: 274 TFGVTFNHPCQEVENVVIPPYISPESLHKTQKNIPVTKERDIWVFFRGKMELHPKNISGR 333

Query: 367 -----IRAKLVGELSGADDVVIEEGTAGEGGKAAAQAGMRKSTFCLSPAGDTPSSARLFD 426
                +R  +     G     ++         A  Q+ + +S FCL P G  P S RL +
Sbjct: 334 FYSKRVRTNIWRSYGGDRRFYLQRQRF-----AGYQSEIARSVFCLCPLGWAPWSPRLVE 393

Query: 427 AIVSGCIPVIVSDELELPFEGILDYRKIALFVSSSDALKSGWLLTYLRSVSAGDIRRLQQ 486
           ++  GC+PVI++D + LPF   + +  I+L V+  D  K G +L +   V+A ++  +Q+
Sbjct: 394 SVALGCVPVIIADGIRLPFPSTVRWPDISLTVAERDVGKLGDILEH---VAATNLSVIQR 444

Query: 487 NLAKFS--RHFLYSSPAQPMGPEDLAWKMIAGKLVNVKLHTRRT 509
           NL   S  R  +++ P++     D  W+++      +    RR+
Sbjct: 454 NLEDPSVRRALMFNVPSR---EGDATWQVLEALSKKLNRSVRRS 444

BLAST of Cla020014 vs. Swiss-Prot
Match: GT14_ORYSJ (Probable glucuronosyltransferase Os01g0926600 OS=Oryza sativa subsp. japonica GN=Os01g0926600 PE=2 SV=1)

HSP 1 Score: 93.6 bits (231), Expect = 7.1e-18
Identity = 106/421 (25.18%), Positives = 165/421 (39.19%), Query Frame = 1

Query: 92  AAGDVEEASRKLDEALSEAEMERVVRDPYYPLGSPIRVYVYEMPWKFTYDLLWTFRNTYR 151
           AA  +E+ +R  D    E     V+ D   P+G  ++VYVYE+P K+   ++       R
Sbjct: 17  AATALEDVARGQDTERIEGSAGDVLEDD--PVGR-LKVYVYELPTKYNKKMV---AKDSR 76

Query: 152 ETSNLTSNGSPVHRLIEQHSIDYWLWADLIAPESERLLKGVVRVYRQEEADLFYIPFFTT 211
             S++ +    +HR                      LL   +R    EEAD FY P +TT
Sbjct: 77  CLSHMFAAEIFMHRF---------------------LLSSAIRTLNPEEADWFYTPVYTT 136

Query: 212 ISFFLLEKQQCKALYRVVLSSVLHKALMYFYMWIQEALKWVTDQ-PAWKRSEGRDHILPV 271
                             L+   H         ++ A+++++   P W R++G DH   V
Sbjct: 137 CD----------------LTPWGHPLPFKSPRIMRSAIQFISSHWPYWNRTDGADHFFVV 196

Query: 272 HHPWS----FKTVRKFMKNAIWLLPDMDSTGNWYKPGQVYLEKDLIL--PYVPNVDLCDS 331
            H +     ++  +   +  + LL        + +   V L++  I   PY P   +   
Sbjct: 197 PHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKDHVCLKEGSITIPPYAPPQKM--K 256

Query: 332 KCLSDGQSKRSILLFFRGRLKRNAGGKIRAKLVGELSGADDVVIEEGTAGEGGKAAAQAG 391
             L   ++ RSI ++FRG     A                    E G    G +A+    
Sbjct: 257 THLVPPETPRSIFVYFRGLFYDTANDP-----------------EGGYYARGARASVWEN 316

Query: 392 --------------------MRKSTFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELP 451
                               M++S FCL P G  P S RL +A+V GCIPVI++D++ LP
Sbjct: 317 FKNNPLFDISTDHPPTYYEDMQRSIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 372

Query: 452 FEGILDYRKIALFVSSSDALKSGWLLTYLRSVSAGDIRRLQQNLA--KFSRHFLYSSPAQ 484
           F   + + +I +FV+  D  K   L T L S+    I R Q+ LA     +  L+  PAQ
Sbjct: 377 FADAIPWDEIGVFVAEDDVPK---LDTILTSIPMDVILRKQRLLANPSMKQAMLFPQPAQ 372

BLAST of Cla020014 vs. TrEMBL
Match: A0A0A0LM68_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G035510 PE=4 SV=1)

HSP 1 Score: 972.6 bits (2513), Expect = 1.9e-280
Identity = 484/533 (90.81%), Positives = 497/533 (93.25%), Query Frame = 1

Query: 4   GRSLPFSSSMSAQIQRSNRSPLLLFTLSLLALSVLFILVFLSPSNP--NPTSFHPPISSV 63
           GR+LPFSSSMSAQIQRSNRSPLLLFTLSLLALSVLFILVFLSPSNP  NPTSFH PISS+
Sbjct: 3   GRNLPFSSSMSAQIQRSNRSPLLLFTLSLLALSVLFILVFLSPSNPHPNPTSFHSPISSL 62

Query: 64  KPETSFVLSLEHFLTHKAPKSPPPHDDTVPAAGDVEEASRKLDEALSEAEMERVVRDPYY 123
           KPETSFV+SLEHFLTHK PKSPP  DDT P AGDVE+ASRKLDEALSEAEMERV+RDPY+
Sbjct: 63  KPETSFVVSLEHFLTHKVPKSPPLRDDTAPVAGDVEDASRKLDEALSEAEMERVIRDPYF 122

Query: 124 PLGSPIRVYVYEMPWKFTYDLLWTFRNTYRETSNLTSNGSPVHRLIEQHSIDYWLWADLI 183
           PLGSPIRVYVYEMPWKFTYDLLWTFRNTYRETSNLTSNGSPVHRLIEQHSIDYWLWADLI
Sbjct: 123 PLGSPIRVYVYEMPWKFTYDLLWTFRNTYRETSNLTSNGSPVHRLIEQHSIDYWLWADLI 182

Query: 184 APESERLLKGVVRVYRQEEADLFYIPFFTTISFFLLEKQQCKALYRVVLSSVLHKALMYF 243
           APESERLLKGVVRVYRQEEADLFYIPFFTTISFFLLEKQQCKALYR              
Sbjct: 183 APESERLLKGVVRVYRQEEADLFYIPFFTTISFFLLEKQQCKALYR-------------- 242

Query: 244 YMWIQEALKWVTDQPAWKRSEGRDHILPVHHPWSFKTVRKFMKNAIWLLPDMDSTGNWYK 303
                EALKWVTDQPAWKRSEGRDHILPVHHPWSFKTVRKFMKNAIWLLPDMDSTGNWYK
Sbjct: 243 -----EALKWVTDQPAWKRSEGRDHILPVHHPWSFKTVRKFMKNAIWLLPDMDSTGNWYK 302

Query: 304 PGQVYLEKDLILPYVPNVDLCDSKCLSDGQSKRSILLFFRGRLKRNAGGKIRAKLVGELS 363
           PGQV+LEKDLILPYVPNV+LCDSKCLS  QSKRSILLFFRGRLKRNAGGKIRAKL GELS
Sbjct: 303 PGQVFLEKDLILPYVPNVELCDSKCLSYQQSKRSILLFFRGRLKRNAGGKIRAKLGGELS 362

Query: 364 GADDVVIEEGTAGEGGKAAAQAGMRKSTFCLSPAGDTPSSARLFDAIVSGCIPVIVSDEL 423
           GADDV+IEEGTAGEGGKAAAQ GMRKS FCLSPAGDTPSSARLFDAIVSGCIPVIVSDEL
Sbjct: 363 GADDVLIEEGTAGEGGKAAAQTGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDEL 422

Query: 424 ELPFEGILDYRKIALFVSSSDALKSGWLLTYLRSVSAGDIRRLQQNLAKFSRHFLYSSPA 483
           ELPFEGILDYRKIALFVSSSDALKSGWLLTYLRS SA DIRRLQQNLAK SRHF+YSSPA
Sbjct: 423 ELPFEGILDYRKIALFVSSSDALKSGWLLTYLRSFSAADIRRLQQNLAKLSRHFIYSSPA 482

Query: 484 QPMGPEDLAWKMIAGKLVNVKLHTRRTQRVVKESRSICSCDCRRSNFTNSPPS 535
           QPMGPEDLAWKMI GKLVN+KLHTRR+QRVVKESRS+CSCDCRRSNFTNSPPS
Sbjct: 483 QPMGPEDLAWKMIGGKLVNIKLHTRRSQRVVKESRSVCSCDCRRSNFTNSPPS 516

BLAST of Cla020014 vs. TrEMBL
Match: A0A067JQH1_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_00551 PE=4 SV=1)

HSP 1 Score: 772.3 bits (1993), Expect = 3.8e-220
Identity = 389/510 (76.27%), Positives = 431/510 (84.51%), Query Frame = 1

Query: 21  NRSPLLLFTLSLLALSVLFILVFLSPSNP-NPTSFHPPISSVKPETSFVLSLEHFLTHKA 80
           +RSP+LL TLS LALS+LF L  LS S   N +SF  P  ++KPETSFV SLEHFLT KA
Sbjct: 13  SRSPILLCTLSFLALSLLFFLFSLSSSQTRNLSSFPDPNLTLKPETSFVTSLEHFLTVKA 72

Query: 81  PKSPPPHDDTVPAAGDVEEASRKLDEALSEAEMERVVRDPYYPLGSPIRVYVYEMPWKFT 140
            KS P  DDTV      EE  ++LDE +   E ERV  + YYP+  PIRVYVYEMP KFT
Sbjct: 73  SKSSPIRDDTVLQV--TEEDVKELDEKMFVKESERVYGNVYYPVEFPIRVYVYEMPRKFT 132

Query: 141 YDLLWTFRNTYRETSNLTSNGSPVHRLIEQHSIDYWLWADLIAPESERLLKGVVRVYRQE 200
           Y+LLW FRNTYRET NLTSNGSPVHRLIEQHSIDYWLWADLIAPESERLLK VVRV+RQE
Sbjct: 133 YELLWLFRNTYRETGNLTSNGSPVHRLIEQHSIDYWLWADLIAPESERLLKTVVRVHRQE 192

Query: 201 EADLFYIPFFTTISFFLLEKQQCKALYRVVLSSVLHKALMYFYMWIQEALKWVTDQPAWK 260
           EADLFYIPFFTTISFFLLEKQQ KALYR                   E LKWVTDQPAWK
Sbjct: 193 EADLFYIPFFTTISFFLLEKQQSKALYR-------------------EVLKWVTDQPAWK 252

Query: 261 RSEGRDHILPVHHPWSFKTVRKFMKNAIWLLPDMDSTGNWYKPGQVYLEKDLILPYVPNV 320
           RSEGRDHILPVHHPWSFK+VR++MKNAIWLLPDMDSTGNWYKPGQV+LEKDLILPYVPNV
Sbjct: 253 RSEGRDHILPVHHPWSFKSVRRYMKNAIWLLPDMDSTGNWYKPGQVFLEKDLILPYVPNV 312

Query: 321 DLCDSKCLSDGQSKRSILLFFRGRLKRNAGGKIRAKLVGELSGADDVVIEEGTAGEGGKA 380
           DLCD+KCLS+ +SKR+ L+FFRGRLKRNAGGKIRAKLV ELSGAD VVIEEG+AGE GKA
Sbjct: 313 DLCDAKCLSESESKRTTLIFFRGRLKRNAGGKIRAKLVAELSGADGVVIEEGSAGEEGKA 372

Query: 381 AAQAGMRKSTFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDYRKIALFVS 440
           AAQ GMRKS FCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDYRKIALFVS
Sbjct: 373 AAQFGMRKSVFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDYRKIALFVS 432

Query: 441 SSDALKSGWLLTYLRSVSAGDIRRLQQNLAKFSRHFLYSSPAQPMGPEDLAWKMIAGKLV 500
           SSDA++ GWLL +L+ +S   +R +++NLAK+SRHFLYSSPAQP GPEDL W+M+AGKLV
Sbjct: 433 SSDAIQPGWLLQFLKGISPVQLRDMRKNLAKYSRHFLYSSPAQPWGPEDLVWRMMAGKLV 492

Query: 501 NVKLHTRRTQRVVKESRSICSCDCRRSNFT 530
           N+KLHTRR+QR+VKESRSIC+C+C+R+NFT
Sbjct: 493 NIKLHTRRSQRLVKESRSICTCECKRTNFT 501

BLAST of Cla020014 vs. TrEMBL
Match: B9S214_RICCO (Catalytic, putative OS=Ricinus communis GN=RCOM_1325460 PE=4 SV=1)

HSP 1 Score: 767.7 bits (1981), Expect = 9.4e-219
Identity = 385/513 (75.05%), Positives = 429/513 (83.63%), Query Frame = 1

Query: 21  NRSPLLLFTLSLLALSVLFILVFLSPS---NPNPTSFHPPISSVKPETSFVLSLEHFLTH 80
           +RSP+LLFTLSLLA S+LF L  LS S   NP P+    P  ++KP TSF+ SLE FLT 
Sbjct: 15  SRSPILLFTLSLLAFSLLFFLFSLSSSQTHNPYPS----PNFTLKPVTSFLASLELFLTK 74

Query: 81  KAPKSPPPH-DDTVPAAGDVEEASRKLDEALSEAEMERVVRDPYYPLGSPIRVYVYEMPW 140
           K+  S   H DDTV     +E+   +LDE +   E  R+  DPYYPL  PIRVYVYEMP 
Sbjct: 75  KSLSSSSSHRDDTVREV--IEDDLHRLDEKMFAKESARLYSDPYYPLQFPIRVYVYEMPN 134

Query: 141 KFTYDLLWTFRNTYRETSNLTSNGSPVHRLIEQHSIDYWLWADLIAPESERLLKGVVRVY 200
           KFTYDLLW FRNTYR+T NLTSNGSPVHRLIEQHSIDYWLWADLIAPE+ERLLK VVRVY
Sbjct: 135 KFTYDLLWLFRNTYRDTVNLTSNGSPVHRLIEQHSIDYWLWADLIAPETERLLKSVVRVY 194

Query: 201 RQEEADLFYIPFFTTISFFLLEKQQCKALYRVVLSSVLHKALMYFYMWIQEALKWVTDQP 260
           RQEEADLFYIPFFTTISFFLLEKQQCKALYR                   EALKWVTDQP
Sbjct: 195 RQEEADLFYIPFFTTISFFLLEKQQCKALYR-------------------EALKWVTDQP 254

Query: 261 AWKRSEGRDHILPVHHPWSFKTVRKFMKNAIWLLPDMDSTGNWYKPGQVYLEKDLILPYV 320
           AWKRS GRDHILPVHHPWSFK+VR+++KNAIWLLPDMDSTGNWYKPGQV+LEKDLILPYV
Sbjct: 255 AWKRSGGRDHILPVHHPWSFKSVRRYVKNAIWLLPDMDSTGNWYKPGQVFLEKDLILPYV 314

Query: 321 PNVDLCDSKCLSDGQSKRSILLFFRGRLKRNAGGKIRAKLVGELSGADDVVIEEGTAGEG 380
           PNVDLCD+KC S+ +SKR+ LLFFRGRLKRNAGGKIRAKLV ELSGA+ VV+EEGTAGEG
Sbjct: 315 PNVDLCDAKCASENESKRTTLLFFRGRLKRNAGGKIRAKLVAELSGAEGVVVEEGTAGEG 374

Query: 381 GKAAAQAGMRKSTFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDYRKIAL 440
           GKAAAQ GMRKS FCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDYRKIA+
Sbjct: 375 GKAAAQTGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDYRKIAV 434

Query: 441 FVSSSDALKSGWLLTYLRSVSAGDIRRLQQNLAKFSRHFLYSSPAQPMGPEDLAWKMIAG 500
           FVSSSDA++ GWL+ +L+ VS    R +Q+NL K+SRHFLYSSPAQP+GPEDL W+M+AG
Sbjct: 435 FVSSSDAIQPGWLIKFLKDVSPAQTREMQRNLVKYSRHFLYSSPAQPLGPEDLVWRMMAG 494

Query: 501 KLVNVKLHTRRTQRVVKESRSICSCDCRRSNFT 530
           KLVN+KLHTRR+QRVVKESRS+C+CDC+R+NFT
Sbjct: 495 KLVNIKLHTRRSQRVVKESRSVCTCDCKRANFT 502

BLAST of Cla020014 vs. TrEMBL
Match: A0A0D2RAQ2_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_004G257200 PE=4 SV=1)

HSP 1 Score: 767.7 bits (1981), Expect = 9.4e-219
Identity = 377/522 (72.22%), Positives = 434/522 (83.14%), Query Frame = 1

Query: 14  SAQIQRSNRSPLLLFTLSLLALSVLFILVFLSPSNPNPTSFHPPIS--SVKPETSFVLSL 73
           S+ + RS    +LLF + +LALS  F  +F SPS+    +   P    SV+PETSFV SL
Sbjct: 8   SSILTRSKSPVILLFAVIILALSFFFFFLF-SPSSTTTAAVTVPYRHPSVRPETSFVASL 67

Query: 74  EHFLTHKAPKSPPPHDDTVPAAGDVEEASRKLDEALSEAEMERVVRDPYYPLGSPIRVYV 133
           EHFL HKAPK P   DDTV +   ++   R+LDE     EME +  DPYYP+  P+RVYV
Sbjct: 68  EHFLAHKAPKIPASSDDTVGSV--IDRDVRRLDERKFVKEMEWLRGDPYYPMSMPVRVYV 127

Query: 134 YEMPWKFTYDLLWTFRNTYRETSNLTSNGSPVHRLIEQHSIDYWLWADLIAPESERLLKG 193
           YEMP KFTYDLLW FRNTYRETSN+TSNGSPVHRLIEQHSIDYWLWADLIAPESERLLK 
Sbjct: 128 YEMPSKFTYDLLWLFRNTYRETSNITSNGSPVHRLIEQHSIDYWLWADLIAPESERLLKN 187

Query: 194 VVRVYRQEEADLFYIPFFTTISFFLLEKQQCKALYRVVLSSVLHKALMYFYMWIQEALKW 253
           VVRV +QEEADLFY+PFFTTISFFLLEKQQCKALYR                   EALKW
Sbjct: 188 VVRVVKQEEADLFYVPFFTTISFFLLEKQQCKALYR-------------------EALKW 247

Query: 254 VTDQPAWKRSEGRDHILPVHHPWSFKTVRKFMKNAIWLLPDMDSTGNWYKPGQVYLEKDL 313
           VTDQPAWKRSEGRDHI P+HHPWSFK+VR+++KNAIWLLPDMDSTGNWYKPGQV LEKDL
Sbjct: 248 VTDQPAWKRSEGRDHIFPIHHPWSFKSVRRYVKNAIWLLPDMDSTGNWYKPGQVSLEKDL 307

Query: 314 ILPYVPNVDLCDSKCLSDGQSKRSILLFFRGRLKRNAGGKIRAKLVGELSGADDVVIEEG 373
           ILPYVPNVDLCD+KCL + +SKR+ LLFFRGRLKRNAGGKIRAK+  EL+GA DVVIEEG
Sbjct: 308 ILPYVPNVDLCDAKCLLESESKRTTLLFFRGRLKRNAGGKIRAKIGAELTGAKDVVIEEG 367

Query: 374 TAGEGGKAAAQAGMRKSTFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDY 433
           TAGEGGKAAAQ GMR+S FCLSPAGDTPSSARLFDAIVSGCIPVI+SDELELPFEGILDY
Sbjct: 368 TAGEGGKAAAQKGMRRSIFCLSPAGDTPSSARLFDAIVSGCIPVIISDELELPFEGILDY 427

Query: 434 RKIALFVSSSDALKSGWLLTYLRSVSAGDIRRLQQNLAKFSRHFLYSSPAQPMGPEDLAW 493
           RK+A+F+SS+DA++ GW+L YL+S+S+  IR +++NLA++SRHF+YS+PAQP+GPEDL W
Sbjct: 428 RKMAVFISSTDAVQPGWILRYLKSISSTQIREMRRNLAEYSRHFVYSNPAQPLGPEDLVW 487

Query: 494 KMIAGKLVNVKLHTRRTQRVVKESRSICSCDCRRSNFTNSPP 534
           +M+AGKLVN+KLHTRR+QRVVKESRS+C+CDCRRSN T+S P
Sbjct: 488 RMMAGKLVNIKLHTRRSQRVVKESRSVCTCDCRRSNSTHSNP 507

BLAST of Cla020014 vs. TrEMBL
Match: F6H1U3_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_19s0014g00560 PE=4 SV=1)

HSP 1 Score: 767.7 bits (1981), Expect = 9.4e-219
Identity = 385/525 (73.33%), Positives = 433/525 (82.48%), Query Frame = 1

Query: 10  SSSMSAQIQRSNRSPLLLFTLSLLALSVLFILVFLSPSNPNPT-SFHPPISSVKPETSFV 69
           SSS+      S RSP+LL T SLL+LS+L  L  + PS+PNP  + +P ++++ PE+SFV
Sbjct: 8   SSSLPNPRGGSTRSPILLLTFSLLSLSLLLFLFSILPSHPNPNPNRNPNLNTLLPESSFV 67

Query: 70  LSLEHFLTHKAPKSPPPHDDTVPAAGDVEEASRKLDEALSEAEMERVVRDPYYPLGS--- 129
            SLEHFL  K+P+SPP  DDTV    D  EA +KLD+ + + E+ RV  DPYYP  S   
Sbjct: 68  ASLEHFLISKSPRSPPIRDDTV--GSDDPEAVKKLDDLVWQREIRRVYEDPYYPAASGVT 127

Query: 130 -PIRVYVYEMPWKFTYDLLWTFRNTYRETSNLTSNGSPVHRLIEQHSIDYWLWADLIAPE 189
             IRVYVYEMP KFTYDLLW FRNTY+ETSN TSNGSPVHRLIEQHSIDYWLWADL APE
Sbjct: 128 SAIRVYVYEMPAKFTYDLLWLFRNTYKETSNRTSNGSPVHRLIEQHSIDYWLWADLTAPE 187

Query: 190 SERLLKGVVRVYRQEEADLFYIPFFTTISFFLLEKQQCKALYRVVLSSVLHKALMYFYMW 249
           SERLLK VVRV+RQEEADLFYIPFFTTISFFLLE +Q K LYR                 
Sbjct: 188 SERLLKNVVRVHRQEEADLFYIPFFTTISFFLLEPEQWKPLYR----------------- 247

Query: 250 IQEALKWVTDQPAWKRSEGRDHILPVHHPWSFKTVRKFMKNAIWLLPDMDSTGNWYKPGQ 309
             EALKWVTDQPAWKRSEGRDHILPVHHPWSFKTVRK MKNAIWLLPDMDSTGNWYKPGQ
Sbjct: 248 --EALKWVTDQPAWKRSEGRDHILPVHHPWSFKTVRKSMKNAIWLLPDMDSTGNWYKPGQ 307

Query: 310 VYLEKDLILPYVPNVDLCDSKCLSDGQSKRSILLFFRGRLKRNAGGKIRAKLVGELSGAD 369
           V LEKDLILPYVPNVDLCD+KC S+ +SKR  LLFFRGRLKRNAGGKIRAKL+ ELSG D
Sbjct: 308 VSLEKDLILPYVPNVDLCDAKCSSESESKRKTLLFFRGRLKRNAGGKIRAKLMAELSGDD 367

Query: 370 DVVIEEGTAGEGGKAAAQAGMRKSTFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELP 429
            VVI+EGTAGEGGK AAQ GMRKS FCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELP
Sbjct: 368 GVVIQEGTAGEGGKEAAQRGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELP 427

Query: 430 FEGILDYRKIALFVSSSDALKSGWLLTYLRSVSAGDIRRLQQNLAKFSRHFLYSSPAQPM 489
           FEGILDYRKIALFVSSSDA++ GWLLT+L+S+S   I+ +Q+NLAK+SRHF+YSSPAQ +
Sbjct: 428 FEGILDYRKIALFVSSSDAMQPGWLLTFLKSISPAQIKEMQRNLAKYSRHFVYSSPAQLL 487

Query: 490 GPEDLAWKMIAGKLVNVKLHTRRTQRVVKESRSICSCDCRRSNFT 530
           GPEDL W+M+AGKL+N+KLHTRR QRVV+ESR +C+CDC+R+NFT
Sbjct: 488 GPEDLVWRMMAGKLMNIKLHTRRLQRVVRESRRLCTCDCKRANFT 511

BLAST of Cla020014 vs. NCBI nr
Match: gi|659069993|ref|XP_008452796.1| (PREDICTED: probable arabinosyltransferase ARAD1 [Cucumis melo])

HSP 1 Score: 974.2 bits (2517), Expect = 9.5e-281
Identity = 485/533 (90.99%), Positives = 499/533 (93.62%), Query Frame = 1

Query: 4   GRSLPFSSSMSAQIQRSNRSPLLLFTLSLLALSVLFILVFLSPSNPNP--TSFHPPISSV 63
           GR+LPFSSSMSAQIQRSNRSPLLLFTLSLLALSVLFILVFLSPSNPNP  TSFH PISS+
Sbjct: 3   GRNLPFSSSMSAQIQRSNRSPLLLFTLSLLALSVLFILVFLSPSNPNPNPTSFHSPISSL 62

Query: 64  KPETSFVLSLEHFLTHKAPKSPPPHDDTVPAAGDVEEASRKLDEALSEAEMERVVRDPYY 123
           KPETSFV+SLEHFLTHKAPKSPP  DDT P AGDVE+ASRKLDEALSEAEMERVVRDPYY
Sbjct: 63  KPETSFVVSLEHFLTHKAPKSPPLRDDTAPVAGDVEDASRKLDEALSEAEMERVVRDPYY 122

Query: 124 PLGSPIRVYVYEMPWKFTYDLLWTFRNTYRETSNLTSNGSPVHRLIEQHSIDYWLWADLI 183
           PLGSPIRVYVYEMPWKFTYDLLWTFRNTYRETSNLTSNGSPVHRLIEQHSIDYWLWADLI
Sbjct: 123 PLGSPIRVYVYEMPWKFTYDLLWTFRNTYRETSNLTSNGSPVHRLIEQHSIDYWLWADLI 182

Query: 184 APESERLLKGVVRVYRQEEADLFYIPFFTTISFFLLEKQQCKALYRVVLSSVLHKALMYF 243
           APESERLLKGVVRV+RQEEADLFYIPFFTTISFFLLEKQQCKALYR              
Sbjct: 183 APESERLLKGVVRVHRQEEADLFYIPFFTTISFFLLEKQQCKALYR-------------- 242

Query: 244 YMWIQEALKWVTDQPAWKRSEGRDHILPVHHPWSFKTVRKFMKNAIWLLPDMDSTGNWYK 303
                EALKWVTDQPAWKRSEGRDHILPVHHPWSFKTVRKFMKNAIWLLPDMDSTGNWYK
Sbjct: 243 -----EALKWVTDQPAWKRSEGRDHILPVHHPWSFKTVRKFMKNAIWLLPDMDSTGNWYK 302

Query: 304 PGQVYLEKDLILPYVPNVDLCDSKCLSDGQSKRSILLFFRGRLKRNAGGKIRAKLVGELS 363
           PGQVYLEKDLILPYVPNVDLCDSKCL++ QSKRSILLFFRGRLKRNAGGKIRAKL GELS
Sbjct: 303 PGQVYLEKDLILPYVPNVDLCDSKCLANQQSKRSILLFFRGRLKRNAGGKIRAKLGGELS 362

Query: 364 GADDVVIEEGTAGEGGKAAAQAGMRKSTFCLSPAGDTPSSARLFDAIVSGCIPVIVSDEL 423
           GADDV+IEEGTAGEGGKAAAQ GMRKS FCLSPAGDTPSSARLFDAIVSGCIPVIVSDEL
Sbjct: 363 GADDVLIEEGTAGEGGKAAAQTGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDEL 422

Query: 424 ELPFEGILDYRKIALFVSSSDALKSGWLLTYLRSVSAGDIRRLQQNLAKFSRHFLYSSPA 483
           ELPFEGILDYRKIALFVSSSDALKSGWLLTYLRS SA DIRRLQQNLAK SRHF+YS+PA
Sbjct: 423 ELPFEGILDYRKIALFVSSSDALKSGWLLTYLRSFSAADIRRLQQNLAKLSRHFVYSNPA 482

Query: 484 QPMGPEDLAWKMIAGKLVNVKLHTRRTQRVVKESRSICSCDCRRSNFTNSPPS 535
           QPMGPEDLAWKMI GKLVN+KLHTRR+QRVVKESRS+CSCDCRRSNFT+SPPS
Sbjct: 483 QPMGPEDLAWKMIGGKLVNIKLHTRRSQRVVKESRSVCSCDCRRSNFTDSPPS 516

BLAST of Cla020014 vs. NCBI nr
Match: gi|449453962|ref|XP_004144725.1| (PREDICTED: probable arabinosyltransferase ARAD1 isoform X1 [Cucumis sativus])

HSP 1 Score: 972.6 bits (2513), Expect = 2.8e-280
Identity = 484/533 (90.81%), Positives = 497/533 (93.25%), Query Frame = 1

Query: 4   GRSLPFSSSMSAQIQRSNRSPLLLFTLSLLALSVLFILVFLSPSNP--NPTSFHPPISSV 63
           GR+LPFSSSMSAQIQRSNRSPLLLFTLSLLALSVLFILVFLSPSNP  NPTSFH PISS+
Sbjct: 3   GRNLPFSSSMSAQIQRSNRSPLLLFTLSLLALSVLFILVFLSPSNPHPNPTSFHSPISSL 62

Query: 64  KPETSFVLSLEHFLTHKAPKSPPPHDDTVPAAGDVEEASRKLDEALSEAEMERVVRDPYY 123
           KPETSFV+SLEHFLTHK PKSPP  DDT P AGDVE+ASRKLDEALSEAEMERV+RDPY+
Sbjct: 63  KPETSFVVSLEHFLTHKVPKSPPLRDDTAPVAGDVEDASRKLDEALSEAEMERVIRDPYF 122

Query: 124 PLGSPIRVYVYEMPWKFTYDLLWTFRNTYRETSNLTSNGSPVHRLIEQHSIDYWLWADLI 183
           PLGSPIRVYVYEMPWKFTYDLLWTFRNTYRETSNLTSNGSPVHRLIEQHSIDYWLWADLI
Sbjct: 123 PLGSPIRVYVYEMPWKFTYDLLWTFRNTYRETSNLTSNGSPVHRLIEQHSIDYWLWADLI 182

Query: 184 APESERLLKGVVRVYRQEEADLFYIPFFTTISFFLLEKQQCKALYRVVLSSVLHKALMYF 243
           APESERLLKGVVRVYRQEEADLFYIPFFTTISFFLLEKQQCKALYR              
Sbjct: 183 APESERLLKGVVRVYRQEEADLFYIPFFTTISFFLLEKQQCKALYR-------------- 242

Query: 244 YMWIQEALKWVTDQPAWKRSEGRDHILPVHHPWSFKTVRKFMKNAIWLLPDMDSTGNWYK 303
                EALKWVTDQPAWKRSEGRDHILPVHHPWSFKTVRKFMKNAIWLLPDMDSTGNWYK
Sbjct: 243 -----EALKWVTDQPAWKRSEGRDHILPVHHPWSFKTVRKFMKNAIWLLPDMDSTGNWYK 302

Query: 304 PGQVYLEKDLILPYVPNVDLCDSKCLSDGQSKRSILLFFRGRLKRNAGGKIRAKLVGELS 363
           PGQV+LEKDLILPYVPNV+LCDSKCLS  QSKRSILLFFRGRLKRNAGGKIRAKL GELS
Sbjct: 303 PGQVFLEKDLILPYVPNVELCDSKCLSYQQSKRSILLFFRGRLKRNAGGKIRAKLGGELS 362

Query: 364 GADDVVIEEGTAGEGGKAAAQAGMRKSTFCLSPAGDTPSSARLFDAIVSGCIPVIVSDEL 423
           GADDV+IEEGTAGEGGKAAAQ GMRKS FCLSPAGDTPSSARLFDAIVSGCIPVIVSDEL
Sbjct: 363 GADDVLIEEGTAGEGGKAAAQTGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDEL 422

Query: 424 ELPFEGILDYRKIALFVSSSDALKSGWLLTYLRSVSAGDIRRLQQNLAKFSRHFLYSSPA 483
           ELPFEGILDYRKIALFVSSSDALKSGWLLTYLRS SA DIRRLQQNLAK SRHF+YSSPA
Sbjct: 423 ELPFEGILDYRKIALFVSSSDALKSGWLLTYLRSFSAADIRRLQQNLAKLSRHFIYSSPA 482

Query: 484 QPMGPEDLAWKMIAGKLVNVKLHTRRTQRVVKESRSICSCDCRRSNFTNSPPS 535
           QPMGPEDLAWKMI GKLVN+KLHTRR+QRVVKESRS+CSCDCRRSNFTNSPPS
Sbjct: 483 QPMGPEDLAWKMIGGKLVNIKLHTRRSQRVVKESRSVCSCDCRRSNFTNSPPS 516

BLAST of Cla020014 vs. NCBI nr
Match: gi|802777497|ref|XP_012090902.1| (PREDICTED: probable arabinosyltransferase ARAD1 [Jatropha curcas])

HSP 1 Score: 772.3 bits (1993), Expect = 5.5e-220
Identity = 389/510 (76.27%), Positives = 431/510 (84.51%), Query Frame = 1

Query: 21  NRSPLLLFTLSLLALSVLFILVFLSPSNP-NPTSFHPPISSVKPETSFVLSLEHFLTHKA 80
           +RSP+LL TLS LALS+LF L  LS S   N +SF  P  ++KPETSFV SLEHFLT KA
Sbjct: 13  SRSPILLCTLSFLALSLLFFLFSLSSSQTRNLSSFPDPNLTLKPETSFVTSLEHFLTVKA 72

Query: 81  PKSPPPHDDTVPAAGDVEEASRKLDEALSEAEMERVVRDPYYPLGSPIRVYVYEMPWKFT 140
            KS P  DDTV      EE  ++LDE +   E ERV  + YYP+  PIRVYVYEMP KFT
Sbjct: 73  SKSSPIRDDTVLQV--TEEDVKELDEKMFVKESERVYGNVYYPVEFPIRVYVYEMPRKFT 132

Query: 141 YDLLWTFRNTYRETSNLTSNGSPVHRLIEQHSIDYWLWADLIAPESERLLKGVVRVYRQE 200
           Y+LLW FRNTYRET NLTSNGSPVHRLIEQHSIDYWLWADLIAPESERLLK VVRV+RQE
Sbjct: 133 YELLWLFRNTYRETGNLTSNGSPVHRLIEQHSIDYWLWADLIAPESERLLKTVVRVHRQE 192

Query: 201 EADLFYIPFFTTISFFLLEKQQCKALYRVVLSSVLHKALMYFYMWIQEALKWVTDQPAWK 260
           EADLFYIPFFTTISFFLLEKQQ KALYR                   E LKWVTDQPAWK
Sbjct: 193 EADLFYIPFFTTISFFLLEKQQSKALYR-------------------EVLKWVTDQPAWK 252

Query: 261 RSEGRDHILPVHHPWSFKTVRKFMKNAIWLLPDMDSTGNWYKPGQVYLEKDLILPYVPNV 320
           RSEGRDHILPVHHPWSFK+VR++MKNAIWLLPDMDSTGNWYKPGQV+LEKDLILPYVPNV
Sbjct: 253 RSEGRDHILPVHHPWSFKSVRRYMKNAIWLLPDMDSTGNWYKPGQVFLEKDLILPYVPNV 312

Query: 321 DLCDSKCLSDGQSKRSILLFFRGRLKRNAGGKIRAKLVGELSGADDVVIEEGTAGEGGKA 380
           DLCD+KCLS+ +SKR+ L+FFRGRLKRNAGGKIRAKLV ELSGAD VVIEEG+AGE GKA
Sbjct: 313 DLCDAKCLSESESKRTTLIFFRGRLKRNAGGKIRAKLVAELSGADGVVIEEGSAGEEGKA 372

Query: 381 AAQAGMRKSTFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDYRKIALFVS 440
           AAQ GMRKS FCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDYRKIALFVS
Sbjct: 373 AAQFGMRKSVFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDYRKIALFVS 432

Query: 441 SSDALKSGWLLTYLRSVSAGDIRRLQQNLAKFSRHFLYSSPAQPMGPEDLAWKMIAGKLV 500
           SSDA++ GWLL +L+ +S   +R +++NLAK+SRHFLYSSPAQP GPEDL W+M+AGKLV
Sbjct: 433 SSDAIQPGWLLQFLKGISPVQLRDMRKNLAKYSRHFLYSSPAQPWGPEDLVWRMMAGKLV 492

Query: 501 NVKLHTRRTQRVVKESRSICSCDCRRSNFT 530
           N+KLHTRR+QR+VKESRSIC+C+C+R+NFT
Sbjct: 493 NIKLHTRRSQRLVKESRSICTCECKRTNFT 501

BLAST of Cla020014 vs. NCBI nr
Match: gi|225461772|ref|XP_002285599.1| (PREDICTED: probable arabinosyltransferase ARAD1 isoform X2 [Vitis vinifera])

HSP 1 Score: 767.7 bits (1981), Expect = 1.3e-218
Identity = 385/525 (73.33%), Positives = 433/525 (82.48%), Query Frame = 1

Query: 10  SSSMSAQIQRSNRSPLLLFTLSLLALSVLFILVFLSPSNPNPT-SFHPPISSVKPETSFV 69
           SSS+      S RSP+LL T SLL+LS+L  L  + PS+PNP  + +P ++++ PE+SFV
Sbjct: 8   SSSLPNPRGGSTRSPILLLTFSLLSLSLLLFLFSILPSHPNPNPNRNPNLNTLLPESSFV 67

Query: 70  LSLEHFLTHKAPKSPPPHDDTVPAAGDVEEASRKLDEALSEAEMERVVRDPYYPLGS--- 129
            SLEHFL  K+P+SPP  DDTV    D  EA +KLD+ + + E+ RV  DPYYP  S   
Sbjct: 68  ASLEHFLISKSPRSPPIRDDTV--GSDDPEAVKKLDDLVWQREIRRVYEDPYYPAASGVT 127

Query: 130 -PIRVYVYEMPWKFTYDLLWTFRNTYRETSNLTSNGSPVHRLIEQHSIDYWLWADLIAPE 189
             IRVYVYEMP KFTYDLLW FRNTY+ETSN TSNGSPVHRLIEQHSIDYWLWADL APE
Sbjct: 128 SAIRVYVYEMPAKFTYDLLWLFRNTYKETSNRTSNGSPVHRLIEQHSIDYWLWADLTAPE 187

Query: 190 SERLLKGVVRVYRQEEADLFYIPFFTTISFFLLEKQQCKALYRVVLSSVLHKALMYFYMW 249
           SERLLK VVRV+RQEEADLFYIPFFTTISFFLLE +Q K LYR                 
Sbjct: 188 SERLLKNVVRVHRQEEADLFYIPFFTTISFFLLEPEQWKPLYR----------------- 247

Query: 250 IQEALKWVTDQPAWKRSEGRDHILPVHHPWSFKTVRKFMKNAIWLLPDMDSTGNWYKPGQ 309
             EALKWVTDQPAWKRSEGRDHILPVHHPWSFKTVRK MKNAIWLLPDMDSTGNWYKPGQ
Sbjct: 248 --EALKWVTDQPAWKRSEGRDHILPVHHPWSFKTVRKSMKNAIWLLPDMDSTGNWYKPGQ 307

Query: 310 VYLEKDLILPYVPNVDLCDSKCLSDGQSKRSILLFFRGRLKRNAGGKIRAKLVGELSGAD 369
           V LEKDLILPYVPNVDLCD+KC S+ +SKR  LLFFRGRLKRNAGGKIRAKL+ ELSG D
Sbjct: 308 VSLEKDLILPYVPNVDLCDAKCSSESESKRKTLLFFRGRLKRNAGGKIRAKLMAELSGDD 367

Query: 370 DVVIEEGTAGEGGKAAAQAGMRKSTFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELP 429
            VVI+EGTAGEGGK AAQ GMRKS FCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELP
Sbjct: 368 GVVIQEGTAGEGGKEAAQRGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELP 427

Query: 430 FEGILDYRKIALFVSSSDALKSGWLLTYLRSVSAGDIRRLQQNLAKFSRHFLYSSPAQPM 489
           FEGILDYRKIALFVSSSDA++ GWLLT+L+S+S   I+ +Q+NLAK+SRHF+YSSPAQ +
Sbjct: 428 FEGILDYRKIALFVSSSDAMQPGWLLTFLKSISPAQIKEMQRNLAKYSRHFVYSSPAQLL 487

Query: 490 GPEDLAWKMIAGKLVNVKLHTRRTQRVVKESRSICSCDCRRSNFT 530
           GPEDL W+M+AGKL+N+KLHTRR QRVV+ESR +C+CDC+R+NFT
Sbjct: 488 GPEDLVWRMMAGKLMNIKLHTRRLQRVVRESRRLCTCDCKRANFT 511

BLAST of Cla020014 vs. NCBI nr
Match: gi|823153948|ref|XP_012476826.1| (PREDICTED: probable arabinosyltransferase ARAD1 isoform X1 [Gossypium raimondii])

HSP 1 Score: 767.7 bits (1981), Expect = 1.3e-218
Identity = 377/522 (72.22%), Positives = 434/522 (83.14%), Query Frame = 1

Query: 14  SAQIQRSNRSPLLLFTLSLLALSVLFILVFLSPSNPNPTSFHPPIS--SVKPETSFVLSL 73
           S+ + RS    +LLF + +LALS  F  +F SPS+    +   P    SV+PETSFV SL
Sbjct: 8   SSILTRSKSPVILLFAVIILALSFFFFFLF-SPSSTTTAAVTVPYRHPSVRPETSFVASL 67

Query: 74  EHFLTHKAPKSPPPHDDTVPAAGDVEEASRKLDEALSEAEMERVVRDPYYPLGSPIRVYV 133
           EHFL HKAPK P   DDTV +   ++   R+LDE     EME +  DPYYP+  P+RVYV
Sbjct: 68  EHFLAHKAPKIPASSDDTVGSV--IDRDVRRLDERKFVKEMEWLRGDPYYPMSMPVRVYV 127

Query: 134 YEMPWKFTYDLLWTFRNTYRETSNLTSNGSPVHRLIEQHSIDYWLWADLIAPESERLLKG 193
           YEMP KFTYDLLW FRNTYRETSN+TSNGSPVHRLIEQHSIDYWLWADLIAPESERLLK 
Sbjct: 128 YEMPSKFTYDLLWLFRNTYRETSNITSNGSPVHRLIEQHSIDYWLWADLIAPESERLLKN 187

Query: 194 VVRVYRQEEADLFYIPFFTTISFFLLEKQQCKALYRVVLSSVLHKALMYFYMWIQEALKW 253
           VVRV +QEEADLFY+PFFTTISFFLLEKQQCKALYR                   EALKW
Sbjct: 188 VVRVVKQEEADLFYVPFFTTISFFLLEKQQCKALYR-------------------EALKW 247

Query: 254 VTDQPAWKRSEGRDHILPVHHPWSFKTVRKFMKNAIWLLPDMDSTGNWYKPGQVYLEKDL 313
           VTDQPAWKRSEGRDHI P+HHPWSFK+VR+++KNAIWLLPDMDSTGNWYKPGQV LEKDL
Sbjct: 248 VTDQPAWKRSEGRDHIFPIHHPWSFKSVRRYVKNAIWLLPDMDSTGNWYKPGQVSLEKDL 307

Query: 314 ILPYVPNVDLCDSKCLSDGQSKRSILLFFRGRLKRNAGGKIRAKLVGELSGADDVVIEEG 373
           ILPYVPNVDLCD+KCL + +SKR+ LLFFRGRLKRNAGGKIRAK+  EL+GA DVVIEEG
Sbjct: 308 ILPYVPNVDLCDAKCLLESESKRTTLLFFRGRLKRNAGGKIRAKIGAELTGAKDVVIEEG 367

Query: 374 TAGEGGKAAAQAGMRKSTFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDY 433
           TAGEGGKAAAQ GMR+S FCLSPAGDTPSSARLFDAIVSGCIPVI+SDELELPFEGILDY
Sbjct: 368 TAGEGGKAAAQKGMRRSIFCLSPAGDTPSSARLFDAIVSGCIPVIISDELELPFEGILDY 427

Query: 434 RKIALFVSSSDALKSGWLLTYLRSVSAGDIRRLQQNLAKFSRHFLYSSPAQPMGPEDLAW 493
           RK+A+F+SS+DA++ GW+L YL+S+S+  IR +++NLA++SRHF+YS+PAQP+GPEDL W
Sbjct: 428 RKMAVFISSTDAVQPGWILRYLKSISSTQIREMRRNLAEYSRHFVYSNPAQPLGPEDLVW 487

Query: 494 KMIAGKLVNVKLHTRRTQRVVKESRSICSCDCRRSNFTNSPP 534
           +M+AGKLVN+KLHTRR+QRVVKESRS+C+CDCRRSN T+S P
Sbjct: 488 RMMAGKLVNIKLHTRRSQRVVKESRSVCTCDCRRSNSTHSNP 507
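
The percentages in each HSP summary line follow from the residue counts printed beside them (for example, 377 identical positions over a 522-column alignment gives 72.22%). The Python sketch below recomputes them as a consistency check. The function name `check_hsp_stats` and the regular expression are illustrative assumptions, not part of any BLAST tool; the pattern accepts any label beginning with "Pos" for the second statistic.

```python
import re

# Hypothetical pattern for the HSP summary line shown in the alignments above.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_stats(line):
    """Recompute identity/positives percentages from the counts in one HSP line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    id_n, id_len, id_pct, pos_n, pos_len, pos_pct = match.groups()
    identity = round(100 * int(id_n) / int(id_len), 2)
    positives = round(100 * int(pos_n) / int(pos_len), 2)
    # The reported values are rounded to two decimals, so allow a small tolerance.
    consistent = (
        abs(identity - float(id_pct)) < 0.01
        and abs(positives - float(pos_pct)) < 0.01
    )
    return {
        "identity_pct": identity,
        "positives_pct": positives,
        "consistent": consistent,
    }


if __name__ == "__main__":
    line = "Identity = 377/522 (72.22%), Positives = 434/522 (83.14%), Query Frame = 1"
    print(check_hsp_stats(line))
```

Applied to the four HSP summary lines in this section, the recomputed values agree with the reported percentages to two decimal places.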

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
ARAD1_ARATH	6.7e-61	34.87	Probable arabinosyltransferase ARAD1 OS=Arabidopsis thaliana GN=ARAD1 PE=1 SV=1
ARAD2_ARATH	9.1e-58	35.24	Probable arabinosyltransferase ARAD2 OS=Arabidopsis thaliana GN=ARAD2 PE=1 SV=1
F8H_ARATH	4.4e-20	25.79	Probable glucuronoxylan glucuronosyltransferase F8H OS=Arabidopsis thaliana GN=F...
IRX7_ARATH	1.4e-18	23.76	Probable glucuronoxylan glucuronosyltransferase IRX7 OS=Arabidopsis thaliana GN=...
GT14_ORYSJ	7.1e-18	25.18	Probable glucuronosyltransferase Os01g0926600 OS=Oryza sativa subsp. japonica GN...

Match Name	E-value	Identity	Description
A0A0A0LM68_CUCSA	1.9e-280	90.81	Uncharacterized protein OS=Cucumis sativus GN=Csa_2G035510 PE=4 SV=1
A0A067JQH1_JATCU	3.8e-220	76.27	Uncharacterized protein OS=Jatropha curcas GN=JCGZ_00551 PE=4 SV=1
B9S214_RICCO	9.4e-219	75.05	Catalytic, putative OS=Ricinus communis GN=RCOM_1325460 PE=4 SV=1
A0A0D2RAQ2_GOSRA	9.4e-219	72.22	Uncharacterized protein OS=Gossypium raimondii GN=B456_004G257200 PE=4 SV=1
F6H1U3_VITVI	9.4e-219	73.33	Putative uncharacterized protein OS=Vitis vinifera GN=VIT_19s0014g00560 PE=4 SV=...

Match Name	E-value	Identity	Description
gi|659069993|ref|XP_008452796.1|	9.5e-281	90.99	PREDICTED: probable arabinosyltransferase ARAD1 [Cucumis melo]
gi|449453962|ref|XP_004144725.1|	2.8e-280	90.81	PREDICTED: probable arabinosyltransferase ARAD1 isoform X1 [Cucumis sativus]
gi|802777497|ref|XP_012090902.1|	5.5e-220	76.27	PREDICTED: probable arabinosyltransferase ARAD1 [Jatropha curcas]
gi|225461772|ref|XP_002285599.1|	1.3e-218	73.33	PREDICTED: probable arabinosyltransferase ARAD1 isoform X2 [Vitis vinifera]
gi|823153948|ref|XP_012476826.1|	1.3e-218	72.22	PREDICTED: probable arabinosyltransferase ARAD1 isoform X1 [Gossypium raimondii]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR004263	Exostosin
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
biological_process GO:0008150 biological_process
biological_process GO:0006486 protein glycosylation
cellular_component GO:0005794 Golgi apparatus
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0005575 cellular_component
cellular_component GO:0016020 membrane
molecular_function GO:0016740 transferase activity
molecular_function GO:0050508 glucuronosyl-N-acetylglucosaminyl-proteoglycan 4-alpha-N-acetylglucosaminyltransferase activity
molecular_function GO:0016757 transferase activity, transferring glycosyl groups
molecular_function GO:0003674 molecular_function
This gene is associated with the following unigenes:
Unigene Name	Analysis Name	Sequence type in Unigene
WMU60249	watermelon EST collection version 2.0	transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cla020014	Cla020014.1	mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name	Unique Name	Type
WMU60249	WMU60249	transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR004263	Exostosin-like	PFAM	PF03016	Exostosin	coord: 125..452; score: 4.3
None	No IPR available	unknown	Coil	Coil	coord: 93..113; scor
None	No IPR available	PANTHER	PTHR11062	EXOSTOSIN HEPARAN SULFATE GLYCOSYLTRANSFERASE -RELATED	coord: 247..518; score: 1.9E-247; coord: 106..227; score: 1.9E
None	No IPR available	PANTHER	PTHR11062:SF54	EXOSTOSIN FAMILY PROTEIN	coord: 247..518; score: 1.9E-247; coord: 106..227; score: 1.9E