ClCG01G005220 (gene) Watermelon (Charleston Gray)

Name: ClCG01G005220
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: Xyloglucan galactosyltransferase KATAMARI1, putative
Location: CG_Chr01 : 5560450 .. 5562925 (+)
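The location above can be turned into a feature length with a few lines of Python. This is a minimal sketch, not part of the database record: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (as in GFF3), which the record does not state explicitly.

```python
import re

# Assumed layout of the location value: "<seqid> : <start> .. <end> (<strand>)".
_LOCATION = re.compile(r"^\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    match = _LOCATION.match(text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


seqid, start, end, strand = parse_location("CG_Chr01 : 5560450 .. 5562925 (+)")
print(seqid, strand, span_length(start, end))  # CG_Chr01 + 2476
```

Under the inclusive-coordinate assumption the gene spans 2476 bp on the plus strand of CG_Chr01.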
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTTCCTCTTTCCGGTAACTCCTCGCCGGCGGAGCACCACCCGAAGAAATTCAAAGCCTCTGAGTTGTTCGATAAGAAGAACTCGTTTAATTCGTTGTTACAGCAGTTTCACCATCAACAATCTCGGCTATGGCTTCTCCTTGTAATCCTCTGCCTTCAAATCCTCCTCCTCTTCACCATCCGCTTCCTCCCTCTTCCTCTCCCGCCGGCGCTTTCCTCCTCCACCAACCGAATCCACCGCTTCCCCTCCGTTGTCTCCCCCGCCGTTAATGACGGCGACAACTGCAAAAACGGCCGGATCTTCGTCTACGACCTGCCGACGCTATTTAACCGAGACATTCTCGAAAACTGCGACAATCTGAATCCATGGAGCTCGAGCTGCAGCGCGATGGCGAACGGCGGATTCGGCCAGAGAGCCGATTCCCTGGCCGGAATCATACCGGAGAATCTCCTTCCGTCGTGGTATTGGACGGACCAATTCGTAACGGAGATCGTTTTCCATAATCGGATTATGAAGCATAAATGCCGCGTTTCAGAACCGGAATCCGCTACGGCGTTCTACATACCGTTTTACGCTGGACTCGCAGTGGGGAAATTCCTCTGGACGAATTCGACCGCGGAGGAACGGGATCAGCATTGCCGTATGATTCTAAAATGGCTTTCAGATCAAAAGTATTACAAAAGATCCAACGGCTGGGATCATTTCATTACAATGGGCCGCATCACATGGGATTTCCGCCGGAGCAAGGATAAAGATTGGGGATCAGCGTGTATTTATTTACCTGGAATGAGAAATATTACTCGTCTTTTGATTGAGCGAAATCCATGGGATTATTTTGACGTCGGTGTACCTTACCCCACAGGATTCCACCCCCGAGGCCACGACGACATATCGGCTTGGCAGGACTTTGTTCGCACGCGCCGTCGTACGCATCTCTTCTGCTTCGCCGGAGCCACCCGCGCCGCCTTCCGCAACGACTTTCGAGCGGTGCTTCTCGACCAGTGCCGCAACTCCGCAGCCGAGAAATGCCGAGTTGTCGACTGCGCTGGCAGCCGCTGCTCCAACGGCACGTCGGCGATTCTCGAGACATTTCTCTCTTCAGACTTCTGCCTCCAGCCGAGAGGTGACAGCTTCACGCGACGTTCGATCTTCGATTGCATGGTGGCCGGATCGATTCCGGTGTTCTTCTGGCGGCGGACGGCGTATTACCAATACGAGTGGTTCTTACCAGGCGAACCGGAAAGTTACTCGGTTTTTATAGACCGAAACGCGGTGACGAACGGGACGGCCTCCATTGAAGCGATTCTGGAGAAGTTTAGCAGAGATGAAGTGAGGGAGATGAGAGAGAGAGTGATCGAGTCAATTCCGAATTTCATTTACGGCACCGGGGGAGTTAGAGACGCATTCGACGTCGCCATTGAAGGGGTTTTGAGGAGGTTTAAAGAGCAAGAAGAATGGGAATACAAGTGGAAATAAGGAAGAGTACGGGGGTAGAAATGCAAATAAAAAAGAGAAAAGGAAAACTGGAAAAGCTTGTTAATGTTATTAGTATATATATTAGGGATTAATATACGATTCTTTTGGTAATGGGGTTTTTTTAGGGAATAAAATTTTGGTGGGTAATATTATTTGAGGGGGAGGTAATTTGATAAAGTTTGGGGGAGGGGTAAAATTGCAAAAAGGAGAAAAGGCATAATGGGAATTTTGGTTGTCCTTTGTGTAAGTTGGGCCCACAGGTTTGGGTGTTTGGCATGATAGAAAATAGAAATTTAGTTTTTATTTATACAACAAAGTTGTGTGTTAATTTGATAAGAAAATGAAAGGTTTGGAACCAACTTTTTCATTTCTTTAGCCATTGCTTTTATTCCAATTGCTTTGGATTTGCTAACAATTTAGGTGGAATTTTTTTGTTCTCTTTTTGGGAGTGTTATGGTTGTGGTTTTGGAGATGATGTCAATGGGTCAAAACTTAAATTTCTTTTTATGTCATAATTCTCT
TTTTATTTAATACTTTAGACTTAGAAGGAAAAAAGAAAAAAGAAAAAAGAGTGGTTACAAATAAAGCCTAAAGCTGAAAAAATAATCTCCTTATAATTTAAGTATTAGGGATAAGACCACTCTGAAGTTTAAATAACAAATTCAAGTCATTCAATATTTTCATTATATTCAACTCAATTAAATTTATTATTCTAACTTTTCTTTTCTTAAGTTGAATTGGGGTGCTTTCATTGTCAAAAGATGCTTGTCTGCTCACTACGATGTGTGGTTTCAGGGAGAGGGATCTTTCTCCTCTATGTGTTGTGACTCAGGCAGTTTTGGACGGATTATCTTTTGCGAAGGGGAAGGATTTGGGTGACCTCACCGTTTTCTCTGATTGTAAAGGGTTGGTTGATATGCTGAATCGAAGTAGTAAGTCATTTTTGGATGTCCGTCCCTTTTTGGACAATATTTGGCTTTTAGCTTCTGGTTTTTAG

mRNA sequence

ATGCTTCCTCTTTCCGGTAACTCCTCGCCGGCGGAGCACCACCCGAAGAAATTCAAAGCCTCTGAGTTGTTCGATAAGAAGAACTCGTTTAATTCGTTGTTACAGCAGTTTCACCATCAACAATCTCGGCTATGGCTTCTCCTTGTAATCCTCTGCCTTCAAATCCTCCTCCTCTTCACCATCCGCTTCCTCCCTCTTCCTCTCCCGCCGGCGCTTTCCTCCTCCACCAACCGAATCCACCGCTTCCCCTCCGTTGTCTCCCCCGCCGTTAATGACGGCGACAACTGCAAAAACGGCCGGATCTTCGTCTACGACCTGCCGACGCTATTTAACCGAGACATTCTCGAAAACTGCGACAATCTGAATCCATGGAGCTCGAGCTGCAGCGCGATGGCGAACGGCGGATTCGGCCAGAGAGCCGATTCCCTGGCCGGAATCATACCGGAGAATCTCCTTCCGTCGTGGTATTGGACGGACCAATTCGTAACGGAGATCGTTTTCCATAATCGGATTATGAAGCATAAATGCCGCGTTTCAGAACCGGAATCCGCTACGGCGTTCTACATACCGTTTTACGCTGGACTCGCAGTGGGGAAATTCCTCTGGACGAATTCGACCGCGGAGGAACGGGATCAGCATTGCCGTATGATTCTAAAATGGCTTTCAGATCAAAAGTATTACAAAAGATCCAACGGCTGGGATCATTTCATTACAATGGGCCGCATCACATGGGATTTCCGCCGGAGCAAGGATAAAGATTGGGGATCAGCGTGTATTTATTTACCTGGAATGAGAAATATTACTCGTCTTTTGATTGAGCGAAATCCATGGGATTATTTTGACGTCGGTGTACCTTACCCCACAGGATTCCACCCCCGAGGCCACGACGACATATCGGCTTGGCAGGACTTTGTTCGCACGCGCCGTCGTACGCATCTCTTCTGCTTCGCCGGAGCCACCCGCGCCGCCTTCCGCAACGACTTTCGAGCGGTGCTTCTCGACCAGTGCCGCAACTCCGCAGCCGAGAAATGCCGAGTTGTCGACTGCGCTGGCAGCCGCTGCTCCAACGGCACGTCGGCGATTCTCGAGACATTTCTCTCTTCAGACTTCTGCCTCCAGCCGAGAGGTGACAGCTTCACGCGACGTTCGATCTTCGATTGCATGGTGGCCGGATCGATTCCGGTGTTCTTCTGGCGGCGGACGGCGTATTACCAATACGAGTGGTTCTTACCAGGCGAACCGGAAAGTTACTCGGTTTTTATAGACCGAAACGCGGTGACGAACGGGACGGCCTCCATTGAAGCGATTCTGGAGAAGTTTAGCAGAGATGAAGTGAGGGAGATGAGAGAGAGAGTGATCGAGTCAATTCCGAATTTCATTTACGGCACCGGGGGAGTTAGAGACGCATTCGACGTCGCCATTGAAGGGGTTTTGAGGAGGGAGAGGGATCTTTCTCCTCTATGTGTTGTGACTCAGGCAGTTTTGGACGGATTATCTTTTGCGAAGGGGAAGGATTTGGGTGACCTCACCGTTTTCTCTGATTGTAAAGGGTTGGTTGATATGCTGAATCGAAGTAGTAAGTCATTTTTGGATGTCCGTCCCTTTTTGGACAATATTTGGCTTTTAGCTTCTGGTTTTTAG

Coding sequence (CDS)

ATGCTTCCTCTTTCCGGTAACTCCTCGCCGGCGGAGCACCACCCGAAGAAATTCAAAGCCTCTGAGTTGTTCGATAAGAAGAACTCGTTTAATTCGTTGTTACAGCAGTTTCACCATCAACAATCTCGGCTATGGCTTCTCCTTGTAATCCTCTGCCTTCAAATCCTCCTCCTCTTCACCATCCGCTTCCTCCCTCTTCCTCTCCCGCCGGCGCTTTCCTCCTCCACCAACCGAATCCACCGCTTCCCCTCCGTTGTCTCCCCCGCCGTTAATGACGGCGACAACTGCAAAAACGGCCGGATCTTCGTCTACGACCTGCCGACGCTATTTAACCGAGACATTCTCGAAAACTGCGACAATCTGAATCCATGGAGCTCGAGCTGCAGCGCGATGGCGAACGGCGGATTCGGCCAGAGAGCCGATTCCCTGGCCGGAATCATACCGGAGAATCTCCTTCCGTCGTGGTATTGGACGGACCAATTCGTAACGGAGATCGTTTTCCATAATCGGATTATGAAGCATAAATGCCGCGTTTCAGAACCGGAATCCGCTACGGCGTTCTACATACCGTTTTACGCTGGACTCGCAGTGGGGAAATTCCTCTGGACGAATTCGACCGCGGAGGAACGGGATCAGCATTGCCGTATGATTCTAAAATGGCTTTCAGATCAAAAGTATTACAAAAGATCCAACGGCTGGGATCATTTCATTACAATGGGCCGCATCACATGGGATTTCCGCCGGAGCAAGGATAAAGATTGGGGATCAGCGTGTATTTATTTACCTGGAATGAGAAATATTACTCGTCTTTTGATTGAGCGAAATCCATGGGATTATTTTGACGTCGGTGTACCTTACCCCACAGGATTCCACCCCCGAGGCCACGACGACATATCGGCTTGGCAGGACTTTGTTCGCACGCGCCGTCGTACGCATCTCTTCTGCTTCGCCGGAGCCACCCGCGCCGCCTTCCGCAACGACTTTCGAGCGGTGCTTCTCGACCAGTGCCGCAACTCCGCAGCCGAGAAATGCCGAGTTGTCGACTGCGCTGGCAGCCGCTGCTCCAACGGCACGTCGGCGATTCTCGAGACATTTCTCTCTTCAGACTTCTGCCTCCAGCCGAGAGGTGACAGCTTCACGCGACGTTCGATCTTCGATTGCATGGTGGCCGGATCGATTCCGGTGTTCTTCTGGCGGCGGACGGCGTATTACCAATACGAGTGGTTCTTACCAGGCGAACCGGAAAGTTACTCGGTTTTTATAGACCGAAACGCGGTGACGAACGGGACGGCCTCCATTGAAGCGATTCTGGAGAAGTTTAGCAGAGATGAAGTGAGGGAGATGAGAGAGAGAGTGATCGAGTCAATTCCGAATTTCATTTACGGCACCGGGGGAGTTAGAGACGCATTCGACGTCGCCATTGAAGGGGTTTTGAGGAGGGAGAGGGATCTTTCTCCTCTATGTGTTGTGACTCAGGCAGTTTTGGACGGATTATCTTTTGCGAAGGGGAAGGATTTGGGTGACCTCACCGTTTTCTCTGATTGTAAAGGGTTGGTTGATATGCTGAATCGAAGTAGTAAGTCATTTTTGGATGTCCGTCCCTTTTTGGACAATATTTGGCTTTTAGCTTCTGGTTTTTAG

Protein sequence

MLPLSGNSSPAEHHPKKFKASELFDKKNSFNSLLQQFHHQQSRLWLLLVILCLQILLLFTIRFLPLPLPPALSSSTNRIHRFPSVVSPAVNDGDNCKNGRIFVYDLPTLFNRDILENCDNLNPWSSSCSAMANGGFGQRADSLAGIIPENLLPSWYWTDQFVTEIVFHNRIMKHKCRVSEPESATAFYIPFYAGLAVGKFLWTNSTAEERDQHCRMILKWLSDQKYYKRSNGWDHFITMGRITWDFRRSKDKDWGSACIYLPGMRNITRLLIERNPWDYFDVGVPYPTGFHPRGHDDISAWQDFVRTRRRTHLFCFAGATRAAFRNDFRAVLLDQCRNSAAEKCRVVDCAGSRCSNGTSAILETFLSSDFCLQPRGDSFTRRSIFDCMVAGSIPVFFWRRTAYYQYEWFLPGEPESYSVFIDRNAVTNGTASIEAILEKFSRDEVREMRERVIESIPNFIYGTGGVRDAFDVAIEGVLRRERDLSPLCVVTQAVLDGLSFAKGKDLGDLTVFSDCKGLVDMLNRSSKSFLDVRPFLDNIWLLASGF
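The protein sequence corresponds to a translation of the coding sequence. The sketch below illustrates that relationship on the first seven codons and the last five codons of the CDS. It assumes the standard nuclear genetic code (NCBI translation table 1); the function name `translate` is hypothetical and not taken from the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), listed in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an ambiguous codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCTTCCTCTTTCCGGTAAC"))  # MLPLSGN
print(translate("GCTTCTGGTTTTTAG"))        # ASGF
```

The two outputs match the first seven and last four residues of the protein sequence listed above.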
BLAST of ClCG01G005220 vs. Swiss-Prot
Match: GT18_ARATH (Xyloglucan galactosyltransferase XLT2 OS=Arabidopsis thaliana GN=XLT2 PE=1 SV=1)

HSP 1 Score: 634.4 bits (1635), Expect = 1.1e-180
Identity = 310/511 (60.67%), Positives = 379/511 (74.17%), Query Frame = 1

Query: 1   MLPLSGNSSPAEHHPKKFKASEL---FDKKNSFNSLLQQFHHQQ----SR---LWLLLVI 60
           MLP+S  SSP EH  KK +  +     D+KNSFNSL    +       SR    WL+L +
Sbjct: 1   MLPVSNPSSP-EHLLKKSRTPDSTTSIDRKNSFNSLHSVGNRSSYIAASRSHCTWLILSL 60

Query: 61  LCLQILLLFTIRFLPLP---LP-----PALSSSTNRIHRFPSVVSP-----AVNDGDNCK 120
           L LQ++L  T+R +P P   +P     PA   +T       S  S      + +  + C 
Sbjct: 61  LSLQLILFLTLRSIPFPHRHIPENFPSPAAVVTTTVTTTVISAASSNPPLSSSSSDERCD 120

Query: 121 NGRIFVYDLPTLFNRDILENCDNLNPWSSSCSAMANGGFGQRADSLAGIIPENLLPSWYW 180
           +GR+FVYD+P +FN  IL+ CDNLNPWSS C A++N GFGQ A SL+ +IP++L+ SW+W
Sbjct: 121 SGRVFVYDMPKIFNEVILQQCDNLNPWSSRCDALSNDGFGQEATSLSNVIPKDLVQSWFW 180

Query: 181 TDQFVTEIVFHNRIMKHKCRVSEPESATAFYIPFYAGLAVGKFLWTNSTAEERDQHCRMI 240
           TDQFVTEI+FHNRI+ H+CR  +PESATAFYIPFYAGLAVG++LW+N  A +RD+HC+M+
Sbjct: 181 TDQFVTEIIFHNRILNHRCRTLDPESATAFYIPFYAGLAVGQYLWSNYAAADRDRHCKMM 240

Query: 241 LKWLSDQKYYKRSNGWDHFITMGRITWDFRRSKDKDWGSACIYLPGMRNITRLLIERNPW 300
            +W+ +Q Y+ RSNGWDHFITMGRITWDFRRSKD+DWGS CIY+PGMRNITRLLIERN W
Sbjct: 241 TQWVKNQPYWNRSNGWDHFITMGRITWDFRRSKDEDWGSNCIYIPGMRNITRLLIERNSW 300

Query: 301 DYFDVGVPYPTGFHPRGHDDISAWQDFVRTRRRTHLFCFAGATRAAFRNDFRAVLLDQCR 360
           D+FDVGVPYPTGFHPR   D+  WQDFVR RRR  LFCFAGA RA   NDFR +LL  C 
Sbjct: 301 DHFDVGVPYPTGFHPRSDSDVVNWQDFVRNRRRETLFCFAGAPRAGIVNDFRGLLLRHCE 360

Query: 361 NSAAEKCRVVDCAGSRCSNGTSAILETFLSSDFCLQPRGDSFTRRSIFDCMVAGSIPVFF 420
            S   KCR VDC   +CSNG+SAILETFL SDFCLQPRGDSFTRRSIFDCM+AGSIPVFF
Sbjct: 361 ESRG-KCRTVDCTVGKCSNGSSAILETFLGSDFCLQPRGDSFTRRSIFDCMLAGSIPVFF 420

Query: 421 WRRTAYYQYEWFLPGEPESYSVFIDRNAVTNGTASIEAILEKFSRDEVREMRERVIESIP 480
           WRR+AY QY+WFLP +P+SYSVFIDRN VTNGT SI+ +LE++S+++VR+MRERVI+ IP
Sbjct: 421 WRRSAYMQYQWFLPDKPDSYSVFIDRNEVTNGTTSIKEVLERYSKEDVRKMRERVIDLIP 480

Query: 481 NFIY-----GTGGVRDAFDVAIEGVLRRERD 484
           N +Y     G    +DAFDVAI+GV RR ++
Sbjct: 481 NLVYAKSPNGLETFKDAFDVAIDGVFRRFKE 509
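The percentages in an HSP summary line are the matched-position counts divided by the alignment length. The sketch below recomputes them for the alignment above. The function name `hsp_percentages` is hypothetical, and the regular expression is an assumption about the line layout shown on this page, not a general BLAST parser.

```python
import re

# Matches "Identity = a/b (...), <label> = c/d (...)".
# The second label is matched loosely so spelling variants are tolerated.
_HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Pos\w+\s*=\s*(\d+)/(\d+)"
)


def hsp_percentages(line):
    """Return (percent identity, percent positives), rounded to 2 decimals."""
    match = _HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


line = "Identity = 310/511 (60.67%), Positives = 379/511 (74.17%), Query Frame = 1"
print(hsp_percentages(line))  # (60.67, 74.17)
```

The alignment length (511) exceeds the 484 query residues covered because gap columns are counted in the denominator.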

BLAST of ClCG01G005220 vs. Swiss-Prot
Match: GT11_ARATH (Probable xyloglucan galactosyltransferase GT11 OS=Arabidopsis thaliana GN=GT11 PE=2 SV=1)

HSP 1 Score: 328.9 bits (842), Expect = 1.0e-88
Identity = 169/402 (42.04%), Positives = 239/402 (59.45%), Query Frame = 1

Query: 94  DNCKNGRIFVYDLPTLFNRDILENCDNLNPWSSSCSAMANGGFGQRADSLAGIIPENLLP 153
           D CK   ++++++P LFN ++L+NC  L+ W+  C   +N G G R  ++ G+       
Sbjct: 287 DPCKGKYVYMHEVPALFNEELLKNCWTLSRWTDMCELTSNFGLGPRLPNMEGV------S 346

Query: 154 SWYWTDQFVTEIVFHNRIMKHKCRVSEPESATAFYIPFYAGLAVGKFLWTNSTAEERDQH 213
            WY T+QF  E++FHNR+ ++KC   +   A+A Y+P+Y GL + +FLW       RD  
Sbjct: 347 GWYATNQFTLEVIFHNRMKQYKCLTKDSSLASAVYVPYYPGLDLMRFLW-GPFPFMRDAA 406

Query: 214 CRMILKWLSDQKYYKRSNGWDHFITMGRITWDFRRS--KDKDWGSACIYLPGMRNITRLL 273
              ++KWL + + +KR +G DHF+  GR TWDF R+   + DWG+  + LP +RN+T LL
Sbjct: 407 ALDLMKWLRESQEWKRMDGRDHFMVAGRTTWDFMRTPENESDWGNRLMILPEVRNMTMLL 466

Query: 274 IERNPWDYFDVGVPYPTGFHPRGHDDISAWQDFVRTRRRTHLFCFAGATRAAFRNDFRAV 333
           IE +PW+Y    VPYPT FHP  + +I  WQ  +R   R +LF F GA R    +  R  
Sbjct: 467 IESSPWNYHGFAVPYPTYFHPSTYAEIIQWQMRMRRINRRYLFSFVGAPRPNLGDSIRTE 526

Query: 334 LLDQCRNSAAEKCRVVDC-AGSRCSNGTSAILETFLSSDFCLQPRGDSFTRRSIFDCMVA 393
           ++DQC+ ++  KC++++C +GS+       I++ FLSS FCLQP GDS+TRRS FD ++A
Sbjct: 527 IMDQCK-ASKRKCKLLECISGSQKCYKPDQIMKFFLSSTFCLQPPGDSYTRRSTFDSILA 586

Query: 394 GSIPVFFWRRTAYYQYEWFLPGEPESYSVFIDRNAVTNGTASIEAILEKFSRDEVREMRE 453
           G IPVFF   +AY QY W LP +   YSVFI    V  G  SIE +L +  R +V  MRE
Sbjct: 587 GCIPVFFHPGSAYAQYIWHLPKDIAKYSVFIPEKNVKEGKVSIENVLSRIPRTKVFAMRE 646

Query: 454 RVIESIPNFIY--------GTGGVRDAFDVAIEGVLRRERDL 485
           +VI  IP  +Y         TG   DAFDVA+EGVL R   L
Sbjct: 647 QVIRLIPRLMYFHPSSKSEDTGRFEDAFDVAVEGVLERVEGL 680

BLAST of ClCG01G005220 vs. Swiss-Prot
Match: KATAM_ORYSJ (Xyloglucan galactosyltransferase KATAMARI1 homolog OS=Oryza sativa subsp. japonica GN=Os03g0144800 PE=2 SV=1)

HSP 1 Score: 316.6 bits (810), Expect = 5.2e-85
Identity = 165/425 (38.82%), Positives = 237/425 (55.76%), Query Frame = 1

Query: 67  PLPPALSSSTNRIHRFPSVVSPAVNDGDNCKNGRIFVYDLPTLFNRDILENCDNLNPWSS 126
           P   A   +  R + F   +  A N  D C    I+V++LP  FN D+L  C+ L+ W++
Sbjct: 99  PTAVAHQEAAPRDYAFQRALKTAENKSDPCGGRYIYVHELPPRFNDDMLRECERLSLWTN 158

Query: 127 SCSAMANGGFGQRADSLAGIIPENLLPSWYWTDQFVTEIVFHNRIMKHKCRVSEPESATA 186
            C  M+N G G    +  G+        WY T+QF+ +++F NR+ +++C   +   A A
Sbjct: 159 MCKFMSNEGLGPPLGNEEGVFSNT---GWYATNQFMVDVIFRNRMKQYECLTKDSSIAAA 218

Query: 187 FYIPFYAGLAVGKFLWTNSTAEERDQHCRMILKWLSDQKYYKRSNGWDHFITMGRITWDF 246
            ++PFYAG  V ++LW ++ +  RD     ++ WL  +  +    G DHF+  GRI WDF
Sbjct: 219 VFVPFYAGFDVARYLWGHNIST-RDAASLDLIDWLRKRPEWNVMGGRDHFLVGGRIAWDF 278

Query: 247 RRSKDK--DWGSACIYLPGMRNITRLLIERNPWDYFDVGVPYPTGFHPRGHDDISAWQDF 306
           RR  D+  DWG+  +++P  +N++ L++E +PW+  D  +PYPT FHP    D+  WQD 
Sbjct: 279 RRLTDEESDWGNKLLFMPAAKNMSMLVVESSPWNANDFAIPYPTYFHPAKDADVLLWQDR 338

Query: 307 VRTRRRTHLFCFAGATRAAFRNDFRAVLLDQCRNSAAEKCRVVDCAGSRCSNGTSAILET 366
           +R+  R  LF FAGA R       R+ L+DQCR S+  K    D   S+C +  SAI+  
Sbjct: 339 MRSLERPWLFSFAGAPRPDDPKSIRSQLIDQCRTSSVCKLLECDLGESKC-HSPSAIMNM 398

Query: 367 FLSSDFCLQPRGDSFTRRSIFDCMVAGSIPVFFWRRTAYYQYEWFLPGEPESYSVFIDRN 426
           F +S FCLQP+GDS+TRRS FD M+AG IPVFF   +AY QY W LP     YSVFI  +
Sbjct: 399 FQNSLFCLQPQGDSYTRRSAFDSMLAGCIPVFFHPGSAYVQYTWHLPKNYTRYSVFIPED 458

Query: 427 AVTNGTASIEAILEKFSRDEVREMRERVIESIPNFIYG-----TGGVRDAFDVAIEGVLR 485
            V  G  SIE  L+    D V++MRE VI  IP  IY         ++DAFDV++E ++ 
Sbjct: 459 GVRKGNVSIEDRLKSIHPDMVKKMREEVISLIPRVIYADPRSKLETLKDAFDVSVEAIIN 518

BLAST of ClCG01G005220 vs. Swiss-Prot
Match: MUR3_ARATH (Xyloglucan galactosyltransferase MUR3 OS=Arabidopsis thaliana GN=MUR3 PE=1 SV=1)

HSP 1 Score: 311.2 bits (796), Expect = 2.2e-83
Identity = 166/430 (38.60%), Positives = 238/430 (55.35%), Query Frame = 1

Query: 67  PLPPALSSSTNRIHR-----------FPSVVSPAVNDGDNCKNGRIFVYDLPTLFNRDIL 126
           P P A SSST +  R           F   +    N  D C    I+V++LP+ FN D+L
Sbjct: 111 PAPVANSSSTFKPPRIVESGKKQEFSFIRALKTVDNKSDPCGGKYIYVHNLPSKFNEDML 170

Query: 127 ENCDNLNPWSSSCSAMANGGFGQRADSLAGIIPENLLPSWYWTDQFVTEIVFHNRIMKHK 186
            +C  L+ W++ C    N G G   +++ G+  +     WY T+QF  +++F NR+ ++K
Sbjct: 171 RDCKKLSLWTNMCKFTTNAGLGPPLENVEGVFSDE---GWYATNQFAVDVIFSNRMKQYK 230

Query: 187 CRVSEPESATAFYIPFYAGLAVGKFLWTNSTAEERDQHCRMILKWLSDQKYYKRSNGWDH 246
           C  ++   A A ++PFYAG  + ++LW  + +  RD     ++ WL  +  +    G DH
Sbjct: 231 CLTNDSSLAAAIFVPFYAGFDIARYLWGYNISR-RDAASLELVDWLMKRPEWDIMRGKDH 290

Query: 247 FITMGRITWDFRR--SKDKDWGSACIYLPGMRNITRLLIERNPWDYFDVGVPYPTGFHPR 306
           F+  GRITWDFRR   ++ DWG+  ++LP  +N++ L++E +PW+  D G+PYPT FHP 
Sbjct: 291 FLVAGRITWDFRRLSEEETDWGNKLLFLPAAKNMSMLVVESSPWNANDFGIPYPTYFHPA 350

Query: 307 GHDDISAWQDFVRTRRRTHLFCFAGATRAAFRNDFRAVLLDQCRNSAAEKCRVVDCAGSR 366
              ++  WQD +R   R  LF FAGA R       R  ++DQCRNS   K    D   S+
Sbjct: 351 KDSEVFEWQDRMRNLERKWLFSFAGAPRPDNPKSIRGQIIDQCRNSNVGKLLECDFGESK 410

Query: 367 CSNGTSAILETFLSSDFCLQPRGDSFTRRSIFDCMVAGSIPVFFWRRTAYYQYEWFLPGE 426
           C +  S+I++ F SS FCLQP+GDS+TRRS FD M+AG IPVFF   +AY QY W LP  
Sbjct: 411 C-HAPSSIMQMFQSSLFCLQPQGDSYTRRSAFDSMLAGCIPVFFHPGSAYTQYTWHLPKN 470

Query: 427 PESYSVFIDRNAVTNGTASIEAILEKFSRDEVREMRERVIESIPNFIYG-----TGGVRD 479
             +YSVFI  + V     SIE  L +    +V+ MRE VI  IP  IY          +D
Sbjct: 471 YTTYSVFIPEDDVRKRNISIEERLLQIPAKQVKIMRENVINLIPRLIYADPRSELETQKD 530

BLAST of ClCG01G005220 vs. Swiss-Prot
Match: GT14_ARATH (Probable xyloglucan galactosyltransferase GT14 OS=Arabidopsis thaliana GN=GT14 PE=2 SV=1)

HSP 1 Score: 306.6 bits (784), Expect = 5.4e-82
Identity = 172/455 (37.80%), Positives = 253/455 (55.60%), Query Frame = 1

Query: 46  LLLVILCLQILLLFT-IRFLPLPLPPALSSSTN-------RIHRFPSVVSPAVNDGDNCK 105
           L  V+LC     LFT        +P     ST+          RFP   SP      +C 
Sbjct: 41  LCFVLLCFDYSALFTDTDETAFSIPDVTQKSTSSEFTKDDNFSRFPDDPSP----DSSCS 100

Query: 106 NGRIFVYDLPTLFNRDILENCDNLNPWSSS--CSAMANGGFGQRADSLAGIIPENLLPSW 165
              I+V++LP  FN D+L+NC  +   +    C  + N GFG    +   ++   L  SW
Sbjct: 101 GRYIYVHELPYRFNGDLLDNCFKITRGTEKDICPYIENYGFGPVIKNYENVL---LKQSW 160

Query: 166 YWTDQFVTEIVFHNRIMKHKCRVSEPESATAFYIPFYAGLAVGKFLWT-NSTAEERDQHC 225
           + T+QF+ E++FHN+++ ++C  ++   A+A ++PFYAGL + ++LW  N T  +   H 
Sbjct: 161 FTTNQFMLEVIFHNKMINYRCLTNDSSLASAVFVPFYAGLDMSRYLWGFNITVRDSSSH- 220

Query: 226 RMILKWLSDQKYYKRSNGWDHFITMGRITWDFRRSKDK--DWGSACIYLPGMRNITRLLI 285
             ++ WL  QK + R +G DHF+  GRI WDFRR  D   DWGS   +LP  RN++ L I
Sbjct: 221 -ELMDWLVVQKEWGRMSGRDHFLVSGRIAWDFRRQTDNESDWGSKLRFLPESRNMSMLSI 280

Query: 286 ERNPWDYFDVGVPYPTGFHPRGHDDISAWQDFVRTRRRTHLFCFAGATRAAFRNDFRAVL 345
           E + W   D  +PYPT FHPR  D+I  WQ+ +R+R+R +LF FAGA R  +++  R  +
Sbjct: 281 ESSSWKN-DYAIPYPTCFHPRSVDEIVEWQELMRSRKREYLFTFAGAPRPEYKDSVRGKI 340

Query: 346 LDQCRNSAAEKCRVVDC--AGSRCSNGTSAILETFLSSDFCLQPRGDSFTRRSIFDCMVA 405
           +D+C  S  ++C ++DC      C N  + +++ F +S FCLQP GDS+TRRS+FD ++A
Sbjct: 341 IDECLESK-KQCYLLDCNYGNVNCDNPVN-VMKVFRNSVFCLQPPGDSYTRRSMFDSILA 400

Query: 406 GSIPVFFWRRTAYYQYEWFLPGEPESYSVFIDRNAVTNGTASIEAILEKFSRDEVREMRE 465
           G IPVFF   TAY QY+W LP    SYSV++    V      I+  L +   + V  +RE
Sbjct: 401 GCIPVFFHPGTAYAQYKWHLPKNHSSYSVYLPVKDVKEWNIKIKERLIEIPEERVVRLRE 460

Query: 466 RVIESIPNFI-----YGTGGVRDAFDVAIEGVLRR 481
            VI  IP  +     YG+ G  DAF++A++G+L R
Sbjct: 461 EVIRLIPKVVYADPKYGSDGSEDAFELAVKGMLER 483

BLAST of ClCG01G005220 vs. TrEMBL
Match: A0A0A0KNB0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G171710 PE=4 SV=1)

HSP 1 Score: 913.3 bits (2359), Expect = 1.4e-262
Identity = 435/486 (89.51%), Positives = 460/486 (94.65%), Query Frame = 1

Query: 1   MLPLSGNSSPAEHHPKKFKAS-ELFDKKNSFNSLLQQFHHQQSRLWLLLVILCLQILLLF 60
           MLPLSGNSSPAEHHPKKFKAS E FD+KNSFNSLLQQFHHQQSRLWLLLVIL LQILLLF
Sbjct: 1   MLPLSGNSSPAEHHPKKFKASSEFFDRKNSFNSLLQQFHHQQSRLWLLLVILFLQILLLF 60

Query: 61  TIRFLPLPLPPALSSSTNR-IHRFPSV-VSPAVNDGDNCKNGRIFVYDLPTLFNRDILEN 120
           TIR+LPLPLPPALSSSTN+ +HRFPSV VSPA  DG NCKNGRIFVYDLP LFN+DILEN
Sbjct: 61  TIRYLPLPLPPALSSSTNQQLHRFPSVAVSPADIDGGNCKNGRIFVYDLPKLFNQDILEN 120

Query: 121 CDNLNPWSSSCSAMANGGFGQRADSLAGIIPENLLPSWYWTDQFVTEIVFHNRIMKHKCR 180
           CDNLNPWSSSCSAMANGGFGQ+ADSLAGIIPENLL SWYWTDQFVTEI+FHNRI+KHKCR
Sbjct: 121 CDNLNPWSSSCSAMANGGFGQKADSLAGIIPENLLQSWYWTDQFVTEIIFHNRILKHKCR 180

Query: 181 VSEPESATAFYIPFYAGLAVGKFLWTNSTAEERDQHCRMILKWLSDQKYYKRSNGWDHFI 240
           V EPESATAFY+PFYAGLAVGKFLWTNST EERDQHCR ILKWLSDQ+YYKRSNGWDHFI
Sbjct: 181 VLEPESATAFYVPFYAGLAVGKFLWTNSTPEERDQHCRSILKWLSDQEYYKRSNGWDHFI 240

Query: 241 TMGRITWDFRRSKDKDWGSACIYLPGMRNITRLLIERNPWDYFDVGVPYPTGFHPRGHDD 300
           TMGRITWDFRRSKDKDWGS CIYLPGMRNITRLLIERNPWDYFDVGVPYPTGFHP+  +D
Sbjct: 241 TMGRITWDFRRSKDKDWGSGCIYLPGMRNITRLLIERNPWDYFDVGVPYPTGFHPKSLND 300

Query: 301 ISAWQDFVRTRRRTHLFCFAGATRAAFRNDFRAVLLDQCRNSAAEKCRVVDCAGSRCSNG 360
           ISAWQ+F+RTRRRTHLFCFAGATRAAF NDFRA+LL QC+NS  EKCRVVDCAGSRCSNG
Sbjct: 301 ISAWQEFIRTRRRTHLFCFAGATRAAFHNDFRAMLLHQCKNSTGEKCRVVDCAGSRCSNG 360

Query: 361 TSAILETFLSSDFCLQPRGDSFTRRSIFDCMVAGSIPVFFWRRTAYYQYEWFLPGEPESY 420
           TSAILETFL+SDFCLQPRGDSFTRRSIFDCMVAG+IPVFFWRRTAYYQYEWFLPGEPESY
Sbjct: 361 TSAILETFLTSDFCLQPRGDSFTRRSIFDCMVAGAIPVFFWRRTAYYQYEWFLPGEPESY 420

Query: 421 SVFIDRNAVTNGTASIEAILEKFSRDEVREMRERVIESIPNFIYGTGGVRDAFDVAIEGV 480
           SVFIDRNAV NGT SIEA+LE+FSR+EV+EMRERVIESIP FIYGTG VRDA DVA+EGV
Sbjct: 421 SVFIDRNAVKNGTTSIEAVLERFSREEVKEMRERVIESIPKFIYGTGEVRDALDVAVEGV 480

Query: 481 LRRERD 484
           LRR ++
Sbjct: 481 LRRFKE 486

BLAST of ClCG01G005220 vs. TrEMBL
Match: M5X3U0_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003851mg PE=4 SV=1)

HSP 1 Score: 677.2 bits (1746), Expect = 1.7e-191
Identity = 330/490 (67.35%), Positives = 386/490 (78.78%), Query Frame = 1

Query: 2   LPLSGNSSPAEHHPKK--FKASELFDKKNSFNSLLQQFHHQQSRLWLLLVILCLQILLLF 61
           LPLS   S +EH  K+   K  +  D  NSFNSLLQ  HH   R  LL  IL LQ++LLF
Sbjct: 55  LPLSLMLSISEHPSKEPQLKRPKSPDPLNSFNSLLQLIHHP--RTCLLFAILALQVMLLF 114

Query: 62  TIRFLPLPLPPALSSSTNRIHRFPSVVSPAVNDGDNCKNGRIFVYDLPTLFNRDILENCD 121
           T+R LP        S  N     P +V       D C +GR+FVYDLP   N++IL+NCD
Sbjct: 115 TLRSLPFSRQHF--SPPNSPKPEPLLVP---EPEDRCGSGRVFVYDLPKTLNQEILQNCD 174

Query: 122 NLNPWSSSCSAMANGGFGQRADSLAGIIPENLLPSWYWTDQFVTEIVFHNRIMKHKCRVS 181
           +LNPWSS C A+AN G GQ+A  +AG++PENL P+WYWTDQFV+E++FHNRI+ HKCRV 
Sbjct: 175 DLNPWSSRCKALANEGLGQQATGIAGVVPENLAPAWYWTDQFVSEVIFHNRILNHKCRVM 234

Query: 182 EPESATAFYIPFYAGLAVGKFLWTNS-TAEERDQHCRMILKWLSDQKYYKRSNGWDHFIT 241
           EPESATAFYIPFYAGLAVGK+LW+NS TA++RD+HC M+L+W+ DQ YYKRS GWDHFIT
Sbjct: 235 EPESATAFYIPFYAGLAVGKYLWSNSSTAQDRDRHCEMMLRWVQDQPYYKRSEGWDHFIT 294

Query: 242 MGRITWDFRRSKDKDWGSACIYLPGMRNITRLLIERNPWDYFDVGVPYPTGFHPRGHDDI 301
           MGRITWDFRRS D+DWGS CIYLPGMRNITRLLIERNPWDYFDVGVPYPTGFHPR   D+
Sbjct: 295 MGRITWDFRRSNDQDWGSRCIYLPGMRNITRLLIERNPWDYFDVGVPYPTGFHPRSDSDV 354

Query: 302 SAWQDFVRTRRRTHLFCFAGATRAAFRNDFRAVLLDQCRNSAAEKCRVVDCAGSRCSNGT 361
           + WQ FVRTR RT LFCFAGA R A +NDFR +LL  C+ S +E CRVVDCAG++CSNGT
Sbjct: 355 AEWQSFVRTRNRTKLFCFAGAKRGAIKNDFRGLLLSHCQ-SESESCRVVDCAGTKCSNGT 414

Query: 362 SAILETFLSSDFCLQPRGDSFTRRSIFDCMVAGSIPVFFWRRTAYYQYEWFLPGEPESYS 421
           SAILETFL SDFCLQPRGDSFTRRSIFDCMVAGSIPVFFW+RTAY QYEWFLPGEPESYS
Sbjct: 415 SAILETFLDSDFCLQPRGDSFTRRSIFDCMVAGSIPVFFWKRTAYIQYEWFLPGEPESYS 474

Query: 422 VFIDRNAVTNGTASIEAILEKFSRDEVREMRERVIESIPNFIY-----GTGGVRDAFDVA 481
           V+IDRNAVTNGT SI+ +L+ FSR+EV +MR +VI+ IP F+Y     G   V+DAFD+A
Sbjct: 475 VYIDRNAVTNGT-SIKNVLQGFSREEVEKMRGKVIDYIPKFLYAKPQEGLESVKDAFDIA 534

Query: 482 IEGVLRRERD 484
           +EGV+RR ++
Sbjct: 535 LEGVMRRFKE 535

BLAST of ClCG01G005220 vs. TrEMBL
Match: B9R9L8_RICCO (Xyloglucan galactosyltransferase KATAMARI1, putative OS=Ricinus communis GN=RCOM_1498620 PE=4 SV=1)

HSP 1 Score: 674.5 bits (1739), Expect = 1.1e-190
Identity = 321/500 (64.20%), Positives = 387/500 (77.40%), Query Frame = 1

Query: 1   MLPLSGNSSPAEH--HPKKFKASELFDKKNSFNSLLQQFHHQ---QSRLWLLLVILCLQI 60
           ML LS  SSP  +   PK      +  +KNSF SL     H    QSR WLLL +L  Q+
Sbjct: 1   MLSLSRPSSPEPYIRKPKSPPDDAVLPRKNSFTSLSSLLSHSYLNQSRTWLLLSVLSFQL 60

Query: 61  LLLFTIRFLPLPLPPALSSSTNRIHRFPS-------VVSPAVNDGDNCKNGRIFVYDLPT 120
           ++L   R LPL       S T+  H FPS       + +P  +D   C+ GR+FVYDLP+
Sbjct: 61  IILLAFRSLPL-------SFTHHRHHFPSPYTAHHFITNPTADD--ECRLGRVFVYDLPS 120

Query: 121 LFNRDILENCDNLNPWSSSCSAMANGGFGQRADSLAGIIPENLLPSWYWTDQFVTEIVFH 180
            FN ++++NCD LNPWSS C A+ N GFGQ+A  L+GI+PENL+P+WYWTDQFV+EI+FH
Sbjct: 121 KFNAELVQNCDELNPWSSRCDALTNDGFGQKATGLSGIVPENLVPAWYWTDQFVSEIIFH 180

Query: 181 NRIMKHKCRVSEPESATAFYIPFYAGLAVGKFLWTNSTAEERDQHCRMILKWLSDQKYYK 240
           NRI+ HKCR +EP +ATAFYIPFYAGLAVGKFLW N TA++RD+HC ++L W+ DQ YYK
Sbjct: 181 NRILNHKCRTTEPSNATAFYIPFYAGLAVGKFLWFNYTAKDRDRHCEIMLDWVRDQPYYK 240

Query: 241 RSNGWDHFITMGRITWDFRRSKDKDWGSACIYLPGMRNITRLLIERNPWDYFDVGVPYPT 300
           RSNGW+HF+TMGRI+WDFRRSK++DWGS+CIY+PGMRNITRLLIERNPWDYFDVGVPYPT
Sbjct: 241 RSNGWNHFLTMGRISWDFRRSKEEDWGSSCIYMPGMRNITRLLIERNPWDYFDVGVPYPT 300

Query: 301 GFHPRGHDDISAWQDFVRTRRRTHLFCFAGATRAAFRNDFRAVLLDQCRNSAAEKCRVVD 360
           GFHPR  +DI  WQDFVRTR R  LFCFAGA R A +NDFR +LL  C N  ++ CRVVD
Sbjct: 301 GFHPRSDNDILQWQDFVRTRNRNSLFCFAGAKRGAIKNDFRGLLLRHCYNE-SDSCRVVD 360

Query: 361 CAGSRCSNGTSAILETFLSSDFCLQPRGDSFTRRSIFDCMVAGSIPVFFWRRTAYYQYEW 420
           C+GSRCSNGTSAIL+TFL SDFCLQPRGDSFTRRSIFDCM+AGSIPV FW+RTAYYQYEW
Sbjct: 361 CSGSRCSNGTSAILKTFLDSDFCLQPRGDSFTRRSIFDCMLAGSIPVLFWKRTAYYQYEW 420

Query: 421 FLPGEPESYSVFIDRNAVTNGTASIEAILEKFSRDEVREMRERVIESIPNFIY-----GT 480
           FLPGEP+SYSVFI R+ V NGT S+  +LE +S++EVR+MRE+VIE IP F+Y     G 
Sbjct: 421 FLPGEPDSYSVFIHRDEVKNGT-SVRKVLESYSKEEVRKMREKVIEYIPKFVYARPNEGL 480

Query: 481 GGVRDAFDVAIEGVLRRERD 484
           G ++DAFDVAI+GVLRR ++
Sbjct: 481 GSIKDAFDVAIDGVLRRFKE 489

BLAST of ClCG01G005220 vs. TrEMBL
Match: K7N3K7_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_20G152000 PE=4 SV=1)

HSP 1 Score: 672.5 bits (1734), Expect = 4.2e-190
Identity = 322/490 (65.71%), Positives = 384/490 (78.37%), Query Frame = 1

Query: 1   MLPLSGNSSPAEHHPKKFKASELFDKKNSFNSLLQ-QFHHQQSRLWLLLVILCLQILLLF 60
           MLP S    PAE  P+           + F SLLQ +F  Q  R W+L  +L +QILLL 
Sbjct: 1   MLPKS--VPPAEPQPQTLNTKTTLT--SFFTSLLQPRFSPQNPRTWILFTVLFIQILLLC 60

Query: 61  TIRFLPLP-LPPALSSSTNRIHRFPSVVSPAVNDGDNCKNGRIFVYDLPTLFNRDILENC 120
            +R  P P +PP L ++ +      +    +     +  +G++FVY+LP  FN+ I+ NC
Sbjct: 61  NLRSFPSPSIPPPLPAAADTKRTTNTTGHHSYRTVYHSGSGKVFVYNLPDTFNQQIILNC 120

Query: 121 DNLNPWSSSCSAMANGGFGQRADSLAGIIPENLLPSWYWTDQFVTEIVFHNRIMKHKCRV 180
           DNLNPWSS C A++N GFG+ A SLAGI+PE+LLP+W+WTDQFVTEI+FHNR++ HKCRV
Sbjct: 121 DNLNPWSSRCDALSNDGFGRAATSLAGILPEDLLPAWHWTDQFVTEIIFHNRLINHKCRV 180

Query: 181 SEPESATAFYIPFYAGLAVGKFLWTNSTAEERDQHCRMILKWLSDQKYYKRSNGWDHFIT 240
            EPESATAFYIPFYAGLAVGK+LW NSTAEERD+HC M+L+W+ DQ ++KRSNGWDHFIT
Sbjct: 181 MEPESATAFYIPFYAGLAVGKYLWFNSTAEERDRHCDMMLQWIQDQPFFKRSNGWDHFIT 240

Query: 241 MGRITWDFRRSKDKDWGSACIYLPGMRNITRLLIERNPWDYFDVGVPYPTGFHPRGHDDI 300
           MGRITWDFRRSKD+DWGS+CIY PG+RN+TRLLIERNPWDYFDVGVPYPTGFHPR   D+
Sbjct: 241 MGRITWDFRRSKDRDWGSSCIYKPGIRNVTRLLIERNPWDYFDVGVPYPTGFHPRSKSDV 300

Query: 301 SAWQDFVRTRRRTHLFCFAGATRAAFRNDFRAVLLDQCRNSAAEKCRVVDCAGSRCSNGT 360
           + WQ FVR R+R  LFCFAGA R AFR+DFRA+LL QCR+S  E CR V+C G+RCSNGT
Sbjct: 301 TRWQSFVRERQRHALFCFAGAPRRAFRDDFRAILLSQCRDS-GESCRAVNCTGTRCSNGT 360

Query: 361 SAILETFLSSDFCLQPRGDSFTRRSIFDCMVAGSIPVFFWRRTAYYQYEWFLPGEPESYS 420
           SAILETFL SDFCLQPRGDSFTRRSIFDCMVAGSIPVFFWRRTAY QYEWFLPGEPESYS
Sbjct: 361 SAILETFLDSDFCLQPRGDSFTRRSIFDCMVAGSIPVFFWRRTAYLQYEWFLPGEPESYS 420

Query: 421 VFIDRNAVTNGTASIEAILEKFSRDEVREMRERVIESIPNFIY-----GTGGVRDAFDVA 480
           VFIDRNAV NGT +++ +LE+F+++EVR MRE+VIE IP  +Y     G  GV DAFDVA
Sbjct: 421 VFIDRNAVKNGTLTVKNVLERFTKEEVRRMREKVIEYIPRLVYANTKQGLEGVNDAFDVA 480

Query: 481 IEGVLRRERD 484
           IEGV +R ++
Sbjct: 481 IEGVFKRIKE 485

BLAST of ClCG01G005220 vs. TrEMBL
Match: V7BFP7_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_007G063500g PE=4 SV=1)

HSP 1 Score: 664.1 bits (1712), Expect = 1.5e-187
Identity = 318/489 (65.03%), Positives = 379/489 (77.51%), Query Frame = 1

Query: 1   MLPLSGNSSPAEHHPKKFKASELFDKKNSFNSLLQ-QFHHQQSRLWLLLVILCLQILLLF 60
           MLP S + S  E  PK           +S  SLL+ +F  Q  R W++  +L +QILLL 
Sbjct: 1   MLPKSIHPS-LEPQPKTIHTKATIT--SSLTSLLKPRFSPQNPRTWIIFTVLFIQILLLC 60

Query: 61  TIRFLPLPLPPALSSSTNRIHRFPSVVSPAVNDGDNCKNGRIFVYDLPTLFNRDILENCD 120
            +R     +P  L S+         VV  +    D C +G++FVYDLP  FN +IL+NCD
Sbjct: 61  NLRSFSSSIPTPLPST---------VVHHSSRAQDQCGSGKVFVYDLPRTFNHEILQNCD 120

Query: 121 NLNPWSSSCSAMANGGFGQRADSLAGIIPENLLPSWYWTDQFVTEIVFHNRIMKHKCRVS 180
           NLNPWSS C +++N GFGQ A +LAGIIPE++LP+W+WTDQFVTEI+FHNR++ H+CRV 
Sbjct: 121 NLNPWSSRCDSLSNQGFGQSAAALAGIIPEDILPAWHWTDQFVTEIIFHNRLLNHRCRVK 180

Query: 181 EPESATAFYIPFYAGLAVGKFLWTNSTAEERDQHCRMILKWLSDQKYYKRSNGWDHFITM 240
           EPESATAFYIPFYAGL+VGKFLW N TA ERD+HC M+L+W+ DQ ++KRSNGWDHFITM
Sbjct: 181 EPESATAFYIPFYAGLSVGKFLWLNWTAVERDRHCDMMLRWVMDQPFFKRSNGWDHFITM 240

Query: 241 GRITWDFRRSKDKDWGSACIYLPGMRNITRLLIERNPWDYFDVGVPYPTGFHPRGHDDIS 300
           GRITWDFRRS D DWGS+CIY PGMRN+TRLLIERNPWDYFDVGVPYPTGFHPR   D++
Sbjct: 241 GRITWDFRRSVDHDWGSSCIYKPGMRNVTRLLIERNPWDYFDVGVPYPTGFHPRSKSDVT 300

Query: 301 AWQDFVRTRRRTHLFCFAGATRAAFRNDFRAVLLDQCRNSAAEKCRVVDCAGSRCSNGTS 360
            WQ+FVRTR+R  LFCFAGA R AFRNDFR VLL QCR+S  E CR V+C G+RCSNGTS
Sbjct: 301 LWQNFVRTRQRNSLFCFAGAPRRAFRNDFRGVLLSQCRDS-GESCRTVNCTGTRCSNGTS 360

Query: 361 AILETFLSSDFCLQPRGDSFTRRSIFDCMVAGSIPVFFWRRTAYYQYEWFLPGEPESYSV 420
            ILETFL SDFCLQPRGDSFTRRSIFDCMVAGSIPVFFWRR+AY QYEWFLP EP SYSV
Sbjct: 361 VILETFLDSDFCLQPRGDSFTRRSIFDCMVAGSIPVFFWRRSAYLQYEWFLPAEPGSYSV 420

Query: 421 FIDRNAVTNGTASIEAILEKFSRDEVREMRERVIESIPNFIY-----GTGGVRDAFDVAI 480
           +IDRNAV NGT +++ +LEKF+++EVR+MRE+VIE IP  IY     G  G++DAFDVAI
Sbjct: 421 YIDRNAVNNGTVTVKNVLEKFTKEEVRQMREKVIEYIPRLIYANTKHGLEGMKDAFDVAI 476

Query: 481 EGVLRRERD 484
           EGV +R ++
Sbjct: 481 EGVFKRFKE 476

BLAST of ClCG01G005220 vs. TAIR10
Match: AT5G62220.1 (AT5G62220.1 glycosyltransferase 18)

HSP 1 Score: 634.4 bits (1635), Expect = 6.4e-182
Identity = 310/511 (60.67%), Positives = 379/511 (74.17%), Query Frame = 1

Query: 1   MLPLSGNSSPAEHHPKKFKASEL---FDKKNSFNSLLQQFHHQQ----SR---LWLLLVI 60
           MLP+S  SSP EH  KK +  +     D+KNSFNSL    +       SR    WL+L +
Sbjct: 1   MLPVSNPSSP-EHLLKKSRTPDSTTSIDRKNSFNSLHSVGNRSSYIAASRSHCTWLILSL 60

Query: 61  LCLQILLLFTIRFLPLP---LP-----PALSSSTNRIHRFPSVVSP-----AVNDGDNCK 120
           L LQ++L  T+R +P P   +P     PA   +T       S  S      + +  + C 
Sbjct: 61  LSLQLILFLTLRSIPFPHRHIPENFPSPAAVVTTTVTTTVISAASSNPPLSSSSSDERCD 120

Query: 121 NGRIFVYDLPTLFNRDILENCDNLNPWSSSCSAMANGGFGQRADSLAGIIPENLLPSWYW 180
           +GR+FVYD+P +FN  IL+ CDNLNPWSS C A++N GFGQ A SL+ +IP++L+ SW+W
Sbjct: 121 SGRVFVYDMPKIFNEVILQQCDNLNPWSSRCDALSNDGFGQEATSLSNVIPKDLVQSWFW 180

Query: 181 TDQFVTEIVFHNRIMKHKCRVSEPESATAFYIPFYAGLAVGKFLWTNSTAEERDQHCRMI 240
           TDQFVTEI+FHNRI+ H+CR  +PESATAFYIPFYAGLAVG++LW+N  A +RD+HC+M+
Sbjct: 181 TDQFVTEIIFHNRILNHRCRTLDPESATAFYIPFYAGLAVGQYLWSNYAAADRDRHCKMM 240

Query: 241 LKWLSDQKYYKRSNGWDHFITMGRITWDFRRSKDKDWGSACIYLPGMRNITRLLIERNPW 300
            +W+ +Q Y+ RSNGWDHFITMGRITWDFRRSKD+DWGS CIY+PGMRNITRLLIERN W
Sbjct: 241 TQWVKNQPYWNRSNGWDHFITMGRITWDFRRSKDEDWGSNCIYIPGMRNITRLLIERNSW 300

Query: 301 DYFDVGVPYPTGFHPRGHDDISAWQDFVRTRRRTHLFCFAGATRAAFRNDFRAVLLDQCR 360
           D+FDVGVPYPTGFHPR   D+  WQDFVR RRR  LFCFAGA RA   NDFR +LL  C 
Sbjct: 301 DHFDVGVPYPTGFHPRSDSDVVNWQDFVRNRRRETLFCFAGAPRAGIVNDFRGLLLRHCE 360

Query: 361 NSAAEKCRVVDCAGSRCSNGTSAILETFLSSDFCLQPRGDSFTRRSIFDCMVAGSIPVFF 420
            S   KCR VDC   +CSNG+SAILETFL SDFCLQPRGDSFTRRSIFDCM+AGSIPVFF
Sbjct: 361 ESRG-KCRTVDCTVGKCSNGSSAILETFLGSDFCLQPRGDSFTRRSIFDCMLAGSIPVFF 420

Query: 421 WRRTAYYQYEWFLPGEPESYSVFIDRNAVTNGTASIEAILEKFSRDEVREMRERVIESIP 480
           WRR+AY QY+WFLP +P+SYSVFIDRN VTNGT SI+ +LE++S+++VR+MRERVI+ IP
Sbjct: 421 WRRSAYMQYQWFLPDKPDSYSVFIDRNEVTNGTTSIKEVLERYSKEDVRKMRERVIDLIP 480

Query: 481 NFIY-----GTGGVRDAFDVAIEGVLRRERD 484
           N +Y     G    +DAFDVAI+GV RR ++
Sbjct: 481 NLVYAKSPNGLETFKDAFDVAIDGVFRRFKE 509

BLAST of ClCG01G005220 vs. TAIR10
Match: AT2G29040.1 (AT2G29040.1 Exostosin family protein)

HSP 1 Score: 328.9 bits (842), Expect = 5.8e-90
Identity = 169/402 (42.04%), Positives = 239/402 (59.45%), Query Frame = 1

Query: 94  DNCKNGRIFVYDLPTLFNRDILENCDNLNPWSSSCSAMANGGFGQRADSLAGIIPENLLP 153
           D CK   ++++++P LFN ++L+NC  L+ W+  C   +N G G R  ++ G+       
Sbjct: 287 DPCKGKYVYMHEVPALFNEELLKNCWTLSRWTDMCELTSNFGLGPRLPNMEGV------S 346

Query: 154 SWYWTDQFVTEIVFHNRIMKHKCRVSEPESATAFYIPFYAGLAVGKFLWTNSTAEERDQH 213
            WY T+QF  E++FHNR+ ++KC   +   A+A Y+P+Y GL + +FLW       RD  
Sbjct: 347 GWYATNQFTLEVIFHNRMKQYKCLTKDSSLASAVYVPYYPGLDLMRFLW-GPFPFMRDAA 406

Query: 214 CRMILKWLSDQKYYKRSNGWDHFITMGRITWDFRRS--KDKDWGSACIYLPGMRNITRLL 273
              ++KWL + + +KR +G DHF+  GR TWDF R+   + DWG+  + LP +RN+T LL
Sbjct: 407 ALDLMKWLRESQEWKRMDGRDHFMVAGRTTWDFMRTPENESDWGNRLMILPEVRNMTMLL 466

Query: 274 IERNPWDYFDVGVPYPTGFHPRGHDDISAWQDFVRTRRRTHLFCFAGATRAAFRNDFRAV 333
           IE +PW+Y    VPYPT FHP  + +I  WQ  +R   R +LF F GA R    +  R  
Sbjct: 467 IESSPWNYHGFAVPYPTYFHPSTYAEIIQWQMRMRRINRRYLFSFVGAPRPNLGDSIRTE 526

Query: 334 LLDQCRNSAAEKCRVVDC-AGSRCSNGTSAILETFLSSDFCLQPRGDSFTRRSIFDCMVA 393
           ++DQC+ ++  KC++++C +GS+       I++ FLSS FCLQP GDS+TRRS FD ++A
Sbjct: 527 IMDQCK-ASKRKCKLLECISGSQKCYKPDQIMKFFLSSTFCLQPPGDSYTRRSTFDSILA 586

Query: 394 GSIPVFFWRRTAYYQYEWFLPGEPESYSVFIDRNAVTNGTASIEAILEKFSRDEVREMRE 453
           G IPVFF   +AY QY W LP +   YSVFI    V  G  SIE +L +  R +V  MRE
Sbjct: 587 GCIPVFFHPGSAYAQYIWHLPKDIAKYSVFIPEKNVKEGKVSIENVLSRIPRTKVFAMRE 646

Query: 454 RVIESIPNFIY--------GTGGVRDAFDVAIEGVLRRERDL 485
           +VI  IP  +Y         TG   DAFDVA+EGVL R   L
Sbjct: 647 QVIRLIPRLMYFHPSSKSEDTGRFEDAFDVAVEGVLERVEGL 680

BLAST of ClCG01G005220 vs. TAIR10
Match: AT2G20370.1 (AT2G20370.1 Exostosin family protein)

HSP 1 Score: 311.2 bits (796), Expect = 1.2e-84
Identity = 166/430 (38.60%), Positives = 238/430 (55.35%), Query Frame = 1

Query: 67  PLPPALSSSTNRIHR-----------FPSVVSPAVNDGDNCKNGRIFVYDLPTLFNRDIL 126
           P P A SSST +  R           F   +    N  D C    I+V++LP+ FN D+L
Sbjct: 111 PAPVANSSSTFKPPRIVESGKKQEFSFIRALKTVDNKSDPCGGKYIYVHNLPSKFNEDML 170

Query: 127 ENCDNLNPWSSSCSAMANGGFGQRADSLAGIIPENLLPSWYWTDQFVTEIVFHNRIMKHK 186
            +C  L+ W++ C    N G G   +++ G+  +     WY T+QF  +++F NR+ ++K
Sbjct: 171 RDCKKLSLWTNMCKFTTNAGLGPPLENVEGVFSDE---GWYATNQFAVDVIFSNRMKQYK 230

Query: 187 CRVSEPESATAFYIPFYAGLAVGKFLWTNSTAEERDQHCRMILKWLSDQKYYKRSNGWDH 246
           C  ++   A A ++PFYAG  + ++LW  + +  RD     ++ WL  +  +    G DH
Sbjct: 231 CLTNDSSLAAAIFVPFYAGFDIARYLWGYNISR-RDAASLELVDWLMKRPEWDIMRGKDH 290

Query: 247 FITMGRITWDFRR--SKDKDWGSACIYLPGMRNITRLLIERNPWDYFDVGVPYPTGFHPR 306
           F+  GRITWDFRR   ++ DWG+  ++LP  +N++ L++E +PW+  D G+PYPT FHP 
Sbjct: 291 FLVAGRITWDFRRLSEEETDWGNKLLFLPAAKNMSMLVVESSPWNANDFGIPYPTYFHPA 350

Query: 307 GHDDISAWQDFVRTRRRTHLFCFAGATRAAFRNDFRAVLLDQCRNSAAEKCRVVDCAGSR 366
              ++  WQD +R   R  LF FAGA R       R  ++DQCRNS   K    D   S+
Sbjct: 351 KDSEVFEWQDRMRNLERKWLFSFAGAPRPDNPKSIRGQIIDQCRNSNVGKLLECDFGESK 410

Query: 367 CSNGTSAILETFLSSDFCLQPRGDSFTRRSIFDCMVAGSIPVFFWRRTAYYQYEWFLPGE 426
           C +  S+I++ F SS FCLQP+GDS+TRRS FD M+AG IPVFF   +AY QY W LP  
Sbjct: 411 C-HAPSSIMQMFQSSLFCLQPQGDSYTRRSAFDSMLAGCIPVFFHPGSAYTQYTWHLPKN 470

Query: 427 PESYSVFIDRNAVTNGTASIEAILEKFSRDEVREMRERVIESIPNFIYG-----TGGVRD 479
             +YSVFI  + V     SIE  L +    +V+ MRE VI  IP  IY          +D
Sbjct: 471 YTTYSVFIPEDDVRKRNISIEERLLQIPAKQVKIMRENVINLIPRLIYADPRSELETQKD 530

BLAST of ClCG01G005220 vs. TAIR10
Match: AT4G13990.1 (AT4G13990.1 Exostosin family protein)

HSP 1 Score: 306.6 bits (784), Expect = 3.1e-83
Identity = 172/455 (37.80%), Positives = 253/455 (55.60%), Query Frame = 1

Query: 46  LLLVILCLQILLLFT-IRFLPLPLPPALSSSTN-------RIHRFPSVVSPAVNDGDNCK 105
           L  V+LC     LFT        +P     ST+          RFP   SP      +C 
Sbjct: 41  LCFVLLCFDYSALFTDTDETAFSIPDVTQKSTSSEFTKDDNFSRFPDDPSP----DSSCS 100

Query: 106 NGRIFVYDLPTLFNRDILENCDNLNPWSSS--CSAMANGGFGQRADSLAGIIPENLLPSW 165
              I+V++LP  FN D+L+NC  +   +    C  + N GFG    +   ++   L  SW
Sbjct: 101 GRYIYVHELPYRFNGDLLDNCFKITRGTEKDICPYIENYGFGPVIKNYENVL---LKQSW 160

Query: 166 YWTDQFVTEIVFHNRIMKHKCRVSEPESATAFYIPFYAGLAVGKFLWT-NSTAEERDQHC 225
           + T+QF+ E++FHN+++ ++C  ++   A+A ++PFYAGL + ++LW  N T  +   H 
Sbjct: 161 FTTNQFMLEVIFHNKMINYRCLTNDSSLASAVFVPFYAGLDMSRYLWGFNITVRDSSSH- 220

Query: 226 RMILKWLSDQKYYKRSNGWDHFITMGRITWDFRRSKDK--DWGSACIYLPGMRNITRLLI 285
             ++ WL  QK + R +G DHF+  GRI WDFRR  D   DWGS   +LP  RN++ L I
Sbjct: 221 -ELMDWLVVQKEWGRMSGRDHFLVSGRIAWDFRRQTDNESDWGSKLRFLPESRNMSMLSI 280

Query: 286 ERNPWDYFDVGVPYPTGFHPRGHDDISAWQDFVRTRRRTHLFCFAGATRAAFRNDFRAVL 345
           E + W   D  +PYPT FHPR  D+I  WQ+ +R+R+R +LF FAGA R  +++  R  +
Sbjct: 281 ESSSWKN-DYAIPYPTCFHPRSVDEIVEWQELMRSRKREYLFTFAGAPRPEYKDSVRGKI 340

Query: 346 LDQCRNSAAEKCRVVDC--AGSRCSNGTSAILETFLSSDFCLQPRGDSFTRRSIFDCMVA 405
           +D+C  S  ++C ++DC      C N  + +++ F +S FCLQP GDS+TRRS+FD ++A
Sbjct: 341 IDECLESK-KQCYLLDCNYGNVNCDNPVN-VMKVFRNSVFCLQPPGDSYTRRSMFDSILA 400

Query: 406 GSIPVFFWRRTAYYQYEWFLPGEPESYSVFIDRNAVTNGTASIEAILEKFSRDEVREMRE 465
           G IPVFF   TAY QY+W LP    SYSV++    V      I+  L +   + V  +RE
Sbjct: 401 GCIPVFFHPGTAYAQYKWHLPKNHSSYSVYLPVKDVKEWNIKIKERLIEIPEERVVRLRE 460

Query: 466 RVIESIPNFI-----YGTGGVRDAFDVAIEGVLRR 481
            VI  IP  +     YG+ G  DAF++A++G+L R
Sbjct: 461 EVIRLIPKVVYADPKYGSDGSEDAFELAVKGMLER 483

BLAST of ClCG01G005220 vs. TAIR10
Match: AT1G63450.1 (AT1G63450.1 root hair specific 8)

HSP 1 Score: 306.6 bits (784), Expect = 3.1e-83
Identity = 151/393 (38.42%), Positives = 233/393 (59.29%), Query Frame = 1

Query: 95  NCKNGRIFVYDLPTLFNRDILENCDNLNPWSSSCSAMANGGFGQRADSLAGIIPENLLPS 154
           +C+   ++VYDLP+ FN+D+L  C ++ PW+  C+   N  FG+  +S+           
Sbjct: 278 SCEGKGVYVYDLPSKFNKDLLRECSDMVPWADFCNYFKNDAFGELMESMG--------KG 337

Query: 155 WYWTDQFVTEIVFHNRIMKHKCRVSEPESATAFYIPFYAGLAVGKFLWTNSTAEERDQHC 214
           W+ T Q+  E +FH+RI+KH CRV     A  FY+PFY G+ V ++ + N +++ +D   
Sbjct: 338 WFRTHQYSLEPIFHSRILKHPCRVHNETQAKLFYVPFYGGMDVLRWHFKNVSSDVKDVLP 397

Query: 215 RMILKWLSDQKYYKRSNGWDHFITMGRITWDFRRSKDKDWGSACIYLPGMRNITRLLIER 274
             I+KWL  +K +++++G DH   +G+I+WDFRR     WGS+ + +  M+N T+LLIER
Sbjct: 398 IEIVKWLGSKKSWRKNSGKDHVFVLGKISWDFRRVDKYSWGSSLLEMQEMKNPTKLLIER 457

Query: 275 NPWDYFDVGVPYPTGFHPRGHDDISAWQDFVRTRRRTHLFCFAGATRAAFRNDFRAVLLD 334
           NPW+  D+ +P+PT FHP+   DI+ WQ+ +  + R  L  FAGA R       R++L+D
Sbjct: 458 NPWEVNDIAIPHPTYFHPKTDTDIAIWQNKILGKPRRSLISFAGAARPGNPESIRSILID 517

Query: 335 QCRNSAAEKCRVVDCAGSRCSNGTSAILETFLSSDFCLQPRGDSFTRRSIFDCMVAGSIP 394
           QCR S+  +CR ++C    C    S ++E F  S+FCLQP GDS TR+SIFD ++ G IP
Sbjct: 518 QCR-SSPNQCRFLNCTDGGCDKSES-VIELFRDSEFCLQPPGDSPTRKSIFDSLILGCIP 577

Query: 395 VFFWRRTAYYQYEWFLPGEPESYSVFIDRNAVTNGTAS-IEAILEKFSRDEVREMRERVI 454
           V F   +AYYQY W LP +   YSV+I++  V     + IE ++ K  R E  +MR  ++
Sbjct: 578 VIFDPYSAYYQYTWHLPEDHRRYSVYINKEDVKLKRVNVIEKLMSKTLR-EREDMRSYIV 637

Query: 455 -ESIPNFIYGTGGV-----RDAFDVAIEGVLRR 481
            E +P  +YG         RDAFD+ ++ + ++
Sbjct: 638 HELLPGLVYGDSNAKFERFRDAFDITMDSLFKK 659

BLAST of ClCG01G005220 vs. NCBI nr
Match: gi|778700528|ref|XP_011654883.1| (PREDICTED: xyloglucan galactosyltransferase KATAMARI1 [Cucumis sativus])

HSP 1 Score: 913.3 bits (2359), Expect = 2.0e-262
Identity = 435/486 (89.51%), Positives = 460/486 (94.65%), Query Frame = 1

Query: 1   MLPLSGNSSPAEHHPKKFKAS-ELFDKKNSFNSLLQQFHHQQSRLWLLLVILCLQILLLF 60
           MLPLSGNSSPAEHHPKKFKAS E FD+KNSFNSLLQQFHHQQSRLWLLLVIL LQILLLF
Sbjct: 1   MLPLSGNSSPAEHHPKKFKASSEFFDRKNSFNSLLQQFHHQQSRLWLLLVILFLQILLLF 60

Query: 61  TIRFLPLPLPPALSSSTNR-IHRFPSV-VSPAVNDGDNCKNGRIFVYDLPTLFNRDILEN 120
           TIR+LPLPLPPALSSSTN+ +HRFPSV VSPA  DG NCKNGRIFVYDLP LFN+DILEN
Sbjct: 61  TIRYLPLPLPPALSSSTNQQLHRFPSVAVSPADIDGGNCKNGRIFVYDLPKLFNQDILEN 120

Query: 121 CDNLNPWSSSCSAMANGGFGQRADSLAGIIPENLLPSWYWTDQFVTEIVFHNRIMKHKCR 180
           CDNLNPWSSSCSAMANGGFGQ+ADSLAGIIPENLL SWYWTDQFVTEI+FHNRI+KHKCR
Sbjct: 121 CDNLNPWSSSCSAMANGGFGQKADSLAGIIPENLLQSWYWTDQFVTEIIFHNRILKHKCR 180

Query: 181 VSEPESATAFYIPFYAGLAVGKFLWTNSTAEERDQHCRMILKWLSDQKYYKRSNGWDHFI 240
           V EPESATAFY+PFYAGLAVGKFLWTNST EERDQHCR ILKWLSDQ+YYKRSNGWDHFI
Sbjct: 181 VLEPESATAFYVPFYAGLAVGKFLWTNSTPEERDQHCRSILKWLSDQEYYKRSNGWDHFI 240

Query: 241 TMGRITWDFRRSKDKDWGSACIYLPGMRNITRLLIERNPWDYFDVGVPYPTGFHPRGHDD 300
           TMGRITWDFRRSKDKDWGS CIYLPGMRNITRLLIERNPWDYFDVGVPYPTGFHP+  +D
Sbjct: 241 TMGRITWDFRRSKDKDWGSGCIYLPGMRNITRLLIERNPWDYFDVGVPYPTGFHPKSLND 300

Query: 301 ISAWQDFVRTRRRTHLFCFAGATRAAFRNDFRAVLLDQCRNSAAEKCRVVDCAGSRCSNG 360
           ISAWQ+F+RTRRRTHLFCFAGATRAAF NDFRA+LL QC+NS  EKCRVVDCAGSRCSNG
Sbjct: 301 ISAWQEFIRTRRRTHLFCFAGATRAAFHNDFRAMLLHQCKNSTGEKCRVVDCAGSRCSNG 360

Query: 361 TSAILETFLSSDFCLQPRGDSFTRRSIFDCMVAGSIPVFFWRRTAYYQYEWFLPGEPESY 420
           TSAILETFL+SDFCLQPRGDSFTRRSIFDCMVAG+IPVFFWRRTAYYQYEWFLPGEPESY
Sbjct: 361 TSAILETFLTSDFCLQPRGDSFTRRSIFDCMVAGAIPVFFWRRTAYYQYEWFLPGEPESY 420

Query: 421 SVFIDRNAVTNGTASIEAILEKFSRDEVREMRERVIESIPNFIYGTGGVRDAFDVAIEGV 480
           SVFIDRNAV NGT SIEA+LE+FSR+EV+EMRERVIESIP FIYGTG VRDA DVA+EGV
Sbjct: 421 SVFIDRNAVKNGTTSIEAVLERFSREEVKEMRERVIESIPKFIYGTGEVRDALDVAVEGV 480

Query: 481 LRRERD 484
           LRR ++
Sbjct: 481 LRRFKE 486

BLAST of ClCG01G005220 vs. NCBI nr
Match: gi|659073116|ref|XP_008467262.1| (PREDICTED: LOW QUALITY PROTEIN: xyloglucan galactosyltransferase KATAMARI1 [Cucumis melo])

HSP 1 Score: 907.1 bits (2343), Expect = 1.5e-260
Identity = 436/487 (89.53%), Positives = 459/487 (94.25%), Query Frame = 1

Query: 1   MLPLSGNSSPAEHHPKKFKAS-ELFDKKNSFNSLLQQFHHQQSRLWLLLVILCLQILLLF 60
           MLPLSGNSSPAEHHPKKFKAS E FDKKNSFNSLLQQFHHQQSRLWLLLVIL LQILLLF
Sbjct: 1   MLPLSGNSSPAEHHPKKFKASSEFFDKKNSFNSLLQQFHHQQSRLWLLLVILFLQILLLF 60

Query: 61  TIRFLPLPLPPALSSSTNR-IHRFPSV-VSPAVNDGDNCKNGRIFVYDLPTLFNRDILEN 120
           TIR+LPLPLPPALSSSTN+ +HRFPSV VSPA  DG NCKNGRIFVYDLPT+FN+DIL+N
Sbjct: 61  TIRYLPLPLPPALSSSTNKQLHRFPSVAVSPADIDGGNCKNGRIFVYDLPTVFNQDILDN 120

Query: 121 CDNLNPWSSSCSAMANGGFGQRADSLAGIIPENLLPSWYWTDQFVTEIVFHNRIMKHKCR 180
           CDNLNPWSSSCSAMANGGFGQRADSLAGIIPENLL SWYWTDQFVTEI+FHNRI+KHKCR
Sbjct: 121 CDNLNPWSSSCSAMANGGFGQRADSLAGIIPENLLQSWYWTDQFVTEIIFHNRILKHKCR 180

Query: 181 VSEPESATAFYIPFYAGLAVGKFLWTNSTAEERDQHCRMILKWLSDQKYYKRSNGWDHFI 240
           V EPESATAFYIPFYAGLAVGKFLWTNSTAEERDQHCR ILKWLSDQ+YYK+SNGWDHFI
Sbjct: 181 VLEPESATAFYIPFYAGLAVGKFLWTNSTAEERDQHCRSILKWLSDQEYYKKSNGWDHFI 240

Query: 241 TMGRITWDFRRSKDKDWGSACIYLPGMRNITRLLIERNPWDYFDVGVPYPTGFHPRGHDD 300
           TMGRITWDFRRSKDKDWGS CIYLPGMRNITRLLIERNPWDYFDVGVPYPTGFHP+  +D
Sbjct: 241 TMGRITWDFRRSKDKDWGSGCIYLPGMRNITRLLIERNPWDYFDVGVPYPTGFHPKSQND 300

Query: 301 ISAWQDFVRTRRRTHLFCFAGATRAAFRNDFRAVLLDQCRNSAAEKCRVVDCAGSRCSNG 360
           ISAWQ F+RTR RTHLFCFAGATRAAFR DFRA+LL QC+NS  EKCRVVDCAGSRCSNG
Sbjct: 301 ISAWQQFIRTRPRTHLFCFAGATRAAFRKDFRAMLLHQCKNSTGEKCRVVDCAGSRCSNG 360

Query: 361 TSAILETFLSSDFCLQPRGDSFTRRSIFDCMVAGSIPVFFWRRTAYYQYEWFLPGEPESY 420
           TSAILETFL+SDFCLQPRGDSFTRRSIFDCMVAG+IPVFFWRRTAYYQYEWFLPGEP SY
Sbjct: 361 TSAILETFLTSDFCLQPRGDSFTRRSIFDCMVAGAIPVFFWRRTAYYQYEWFLPGEPGSY 420

Query: 421 SVFIDRNAVTNGTASIEAILEKFSRDEVREMRERVIESIPNFI-YGTGGVRDAFDVAIEG 480
           SVFIDRNAV NGT SIEA+LE+FSR+EV+EMRERVIESIP FI  GTG VRDAFDVAIEG
Sbjct: 421 SVFIDRNAVKNGTTSIEAVLERFSREEVKEMRERVIESIPKFILXGTGDVRDAFDVAIEG 480

Query: 481 VLRRERD 484
           VLRR ++
Sbjct: 481 VLRRFKE 487

BLAST of ClCG01G005220 vs. NCBI nr
Match: gi|596021109|ref|XP_007218972.1| (hypothetical protein PRUPE_ppa003851mg [Prunus persica])

HSP 1 Score: 677.2 bits (1746), Expect = 2.4e-191
Identity = 330/490 (67.35%), Positives = 386/490 (78.78%), Query Frame = 1

Query: 2   LPLSGNSSPAEHHPKK--FKASELFDKKNSFNSLLQQFHHQQSRLWLLLVILCLQILLLF 61
           LPLS   S +EH  K+   K  +  D  NSFNSLLQ  HH   R  LL  IL LQ++LLF
Sbjct: 55  LPLSLMLSISEHPSKEPQLKRPKSPDPLNSFNSLLQLIHHP--RTCLLFAILALQVMLLF 114

Query: 62  TIRFLPLPLPPALSSSTNRIHRFPSVVSPAVNDGDNCKNGRIFVYDLPTLFNRDILENCD 121
           T+R LP        S  N     P +V       D C +GR+FVYDLP   N++IL+NCD
Sbjct: 115 TLRSLPFSRQHF--SPPNSPKPEPLLVP---EPEDRCGSGRVFVYDLPKTLNQEILQNCD 174

Query: 122 NLNPWSSSCSAMANGGFGQRADSLAGIIPENLLPSWYWTDQFVTEIVFHNRIMKHKCRVS 181
           +LNPWSS C A+AN G GQ+A  +AG++PENL P+WYWTDQFV+E++FHNRI+ HKCRV 
Sbjct: 175 DLNPWSSRCKALANEGLGQQATGIAGVVPENLAPAWYWTDQFVSEVIFHNRILNHKCRVM 234

Query: 182 EPESATAFYIPFYAGLAVGKFLWTNS-TAEERDQHCRMILKWLSDQKYYKRSNGWDHFIT 241
           EPESATAFYIPFYAGLAVGK+LW+NS TA++RD+HC M+L+W+ DQ YYKRS GWDHFIT
Sbjct: 235 EPESATAFYIPFYAGLAVGKYLWSNSSTAQDRDRHCEMMLRWVQDQPYYKRSEGWDHFIT 294

Query: 242 MGRITWDFRRSKDKDWGSACIYLPGMRNITRLLIERNPWDYFDVGVPYPTGFHPRGHDDI 301
           MGRITWDFRRS D+DWGS CIYLPGMRNITRLLIERNPWDYFDVGVPYPTGFHPR   D+
Sbjct: 295 MGRITWDFRRSNDQDWGSRCIYLPGMRNITRLLIERNPWDYFDVGVPYPTGFHPRSDSDV 354

Query: 302 SAWQDFVRTRRRTHLFCFAGATRAAFRNDFRAVLLDQCRNSAAEKCRVVDCAGSRCSNGT 361
           + WQ FVRTR RT LFCFAGA R A +NDFR +LL  C+ S +E CRVVDCAG++CSNGT
Sbjct: 355 AEWQSFVRTRNRTKLFCFAGAKRGAIKNDFRGLLLSHCQ-SESESCRVVDCAGTKCSNGT 414

Query: 362 SAILETFLSSDFCLQPRGDSFTRRSIFDCMVAGSIPVFFWRRTAYYQYEWFLPGEPESYS 421
           SAILETFL SDFCLQPRGDSFTRRSIFDCMVAGSIPVFFW+RTAY QYEWFLPGEPESYS
Sbjct: 415 SAILETFLDSDFCLQPRGDSFTRRSIFDCMVAGSIPVFFWKRTAYIQYEWFLPGEPESYS 474

Query: 422 VFIDRNAVTNGTASIEAILEKFSRDEVREMRERVIESIPNFIY-----GTGGVRDAFDVA 481
           V+IDRNAVTNGT SI+ +L+ FSR+EV +MR +VI+ IP F+Y     G   V+DAFD+A
Sbjct: 475 VYIDRNAVTNGT-SIKNVLQGFSREEVEKMRGKVIDYIPKFLYAKPQEGLESVKDAFDIA 534

Query: 482 IEGVLRRERD 484
           +EGV+RR ++
Sbjct: 535 LEGVMRRFKE 535

BLAST of ClCG01G005220 vs. NCBI nr
Match: gi|255539657|ref|XP_002510893.1| (PREDICTED: xyloglucan galactosyltransferase KATAMARI1 homolog [Ricinus communis])

HSP 1 Score: 674.5 bits (1739), Expect = 1.6e-190
Identity = 321/500 (64.20%), Positives = 387/500 (77.40%), Query Frame = 1

Query: 1   MLPLSGNSSPAEH--HPKKFKASELFDKKNSFNSLLQQFHHQ---QSRLWLLLVILCLQI 60
           ML LS  SSP  +   PK      +  +KNSF SL     H    QSR WLLL +L  Q+
Sbjct: 1   MLSLSRPSSPEPYIRKPKSPPDDAVLPRKNSFTSLSSLLSHSYLNQSRTWLLLSVLSFQL 60

Query: 61  LLLFTIRFLPLPLPPALSSSTNRIHRFPS-------VVSPAVNDGDNCKNGRIFVYDLPT 120
           ++L   R LPL       S T+  H FPS       + +P  +D   C+ GR+FVYDLP+
Sbjct: 61  IILLAFRSLPL-------SFTHHRHHFPSPYTAHHFITNPTADD--ECRLGRVFVYDLPS 120

Query: 121 LFNRDILENCDNLNPWSSSCSAMANGGFGQRADSLAGIIPENLLPSWYWTDQFVTEIVFH 180
            FN ++++NCD LNPWSS C A+ N GFGQ+A  L+GI+PENL+P+WYWTDQFV+EI+FH
Sbjct: 121 KFNAELVQNCDELNPWSSRCDALTNDGFGQKATGLSGIVPENLVPAWYWTDQFVSEIIFH 180

Query: 181 NRIMKHKCRVSEPESATAFYIPFYAGLAVGKFLWTNSTAEERDQHCRMILKWLSDQKYYK 240
           NRI+ HKCR +EP +ATAFYIPFYAGLAVGKFLW N TA++RD+HC ++L W+ DQ YYK
Sbjct: 181 NRILNHKCRTTEPSNATAFYIPFYAGLAVGKFLWFNYTAKDRDRHCEIMLDWVRDQPYYK 240

Query: 241 RSNGWDHFITMGRITWDFRRSKDKDWGSACIYLPGMRNITRLLIERNPWDYFDVGVPYPT 300
           RSNGW+HF+TMGRI+WDFRRSK++DWGS+CIY+PGMRNITRLLIERNPWDYFDVGVPYPT
Sbjct: 241 RSNGWNHFLTMGRISWDFRRSKEEDWGSSCIYMPGMRNITRLLIERNPWDYFDVGVPYPT 300

Query: 301 GFHPRGHDDISAWQDFVRTRRRTHLFCFAGATRAAFRNDFRAVLLDQCRNSAAEKCRVVD 360
           GFHPR  +DI  WQDFVRTR R  LFCFAGA R A +NDFR +LL  C N  ++ CRVVD
Sbjct: 301 GFHPRSDNDILQWQDFVRTRNRNSLFCFAGAKRGAIKNDFRGLLLRHCYNE-SDSCRVVD 360

Query: 361 CAGSRCSNGTSAILETFLSSDFCLQPRGDSFTRRSIFDCMVAGSIPVFFWRRTAYYQYEW 420
           C+GSRCSNGTSAIL+TFL SDFCLQPRGDSFTRRSIFDCM+AGSIPV FW+RTAYYQYEW
Sbjct: 361 CSGSRCSNGTSAILKTFLDSDFCLQPRGDSFTRRSIFDCMLAGSIPVLFWKRTAYYQYEW 420

Query: 421 FLPGEPESYSVFIDRNAVTNGTASIEAILEKFSRDEVREMRERVIESIPNFIY-----GT 480
           FLPGEP+SYSVFI R+ V NGT S+  +LE +S++EVR+MRE+VIE IP F+Y     G 
Sbjct: 421 FLPGEPDSYSVFIHRDEVKNGT-SVRKVLESYSKEEVRKMREKVIEYIPKFVYARPNEGL 480

Query: 481 GGVRDAFDVAIEGVLRRERD 484
           G ++DAFDVAI+GVLRR ++
Sbjct: 481 GSIKDAFDVAIDGVLRRFKE 489

BLAST of ClCG01G005220 vs. NCBI nr
Match: gi|356574438|ref|XP_003555354.1| (PREDICTED: xyloglucan galactosyltransferase KATAMARI1 homolog [Glycine max])

HSP 1 Score: 672.5 bits (1734), Expect = 6.0e-190
Identity = 322/490 (65.71%), Positives = 384/490 (78.37%), Query Frame = 1

Query: 1   MLPLSGNSSPAEHHPKKFKASELFDKKNSFNSLLQ-QFHHQQSRLWLLLVILCLQILLLF 60
           MLP S    PAE  P+           + F SLLQ +F  Q  R W+L  +L +QILLL 
Sbjct: 1   MLPKS--VPPAEPQPQTLNTKTTLT--SFFTSLLQPRFSPQNPRTWILFTVLFIQILLLC 60

Query: 61  TIRFLPLP-LPPALSSSTNRIHRFPSVVSPAVNDGDNCKNGRIFVYDLPTLFNRDILENC 120
            +R  P P +PP L ++ +      +    +     +  +G++FVY+LP  FN+ I+ NC
Sbjct: 61  NLRSFPSPSIPPPLPAAADTKRTTNTTGHHSYRTVYHSGSGKVFVYNLPDTFNQQIILNC 120

Query: 121 DNLNPWSSSCSAMANGGFGQRADSLAGIIPENLLPSWYWTDQFVTEIVFHNRIMKHKCRV 180
           DNLNPWSS C A++N GFG+ A SLAGI+PE+LLP+W+WTDQFVTEI+FHNR++ HKCRV
Sbjct: 121 DNLNPWSSRCDALSNDGFGRAATSLAGILPEDLLPAWHWTDQFVTEIIFHNRLINHKCRV 180

Query: 181 SEPESATAFYIPFYAGLAVGKFLWTNSTAEERDQHCRMILKWLSDQKYYKRSNGWDHFIT 240
            EPESATAFYIPFYAGLAVGK+LW NSTAEERD+HC M+L+W+ DQ ++KRSNGWDHFIT
Sbjct: 181 MEPESATAFYIPFYAGLAVGKYLWFNSTAEERDRHCDMMLQWIQDQPFFKRSNGWDHFIT 240

Query: 241 MGRITWDFRRSKDKDWGSACIYLPGMRNITRLLIERNPWDYFDVGVPYPTGFHPRGHDDI 300
           MGRITWDFRRSKD+DWGS+CIY PG+RN+TRLLIERNPWDYFDVGVPYPTGFHPR   D+
Sbjct: 241 MGRITWDFRRSKDRDWGSSCIYKPGIRNVTRLLIERNPWDYFDVGVPYPTGFHPRSKSDV 300

Query: 301 SAWQDFVRTRRRTHLFCFAGATRAAFRNDFRAVLLDQCRNSAAEKCRVVDCAGSRCSNGT 360
           + WQ FVR R+R  LFCFAGA R AFR+DFRA+LL QCR+S  E CR V+C G+RCSNGT
Sbjct: 301 TRWQSFVRERQRHALFCFAGAPRRAFRDDFRAILLSQCRDS-GESCRAVNCTGTRCSNGT 360

Query: 361 SAILETFLSSDFCLQPRGDSFTRRSIFDCMVAGSIPVFFWRRTAYYQYEWFLPGEPESYS 420
           SAILETFL SDFCLQPRGDSFTRRSIFDCMVAGSIPVFFWRRTAY QYEWFLPGEPESYS
Sbjct: 361 SAILETFLDSDFCLQPRGDSFTRRSIFDCMVAGSIPVFFWRRTAYLQYEWFLPGEPESYS 420

Query: 421 VFIDRNAVTNGTASIEAILEKFSRDEVREMRERVIESIPNFIY-----GTGGVRDAFDVA 480
           VFIDRNAV NGT +++ +LE+F+++EVR MRE+VIE IP  +Y     G  GV DAFDVA
Sbjct: 421 VFIDRNAVKNGTLTVKNVLERFTKEEVRRMREKVIEYIPRLVYANTKQGLEGVNDAFDVA 480

Query: 481 IEGVLRRERD 484
           IEGV +R ++
Sbjct: 481 IEGVFKRIKE 485
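
The percentages on each HSP statistics line follow from the counts printed beside them: matching columns divided by the alignment length, gaps included. A minimal Python sketch of that arithmetic is given here; the helper names `parse_hsp_stats` and `recomputed_percent` are illustrative and not part of any BLAST tool. The label is matched generically, so spelling variants of the label are accepted as well.

```python
import re

# Matches "Label = hits/length (pct%)". "Query Frame = 1" has no slash and is skipped.
STAT = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+\.\d+)%\)")


def parse_hsp_stats(line):
    """Return {label: (hits, aligned_length, reported_percent)} for one statistics line."""
    return {
        label: (int(hits), int(length), float(pct))
        for label, hits, length, pct in STAT.findall(line)
    }


def recomputed_percent(hits, length):
    """Share of alignment columns counted as hits, rounded to two decimals."""
    return round(100.0 * hits / length, 2)


# Statistics line of the AT2G20370.1 hit, as listed above.
line = "Identity = 166/430 (38.60%), Positives = 238/430 (55.35%), Query Frame = 1"
stats = parse_hsp_stats(line)

for label, (hits, length, reported) in stats.items():
    # Print the recomputed value next to the one reported on the page.
    print(label, recomputed_percent(hits, length), reported)
```

Recomputing the value this way is a quick consistency check when alignment statistics are copied between tables.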

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
GT18_ARATH   1.1e-180  60.67     Xyloglucan galactosyltransferase XLT2 OS=Arabidopsis thaliana GN=XLT2 PE=1 SV=1
GT11_ARATH   1.0e-88   42.04     Probable xyloglucan galactosyltransferase GT11 OS=Arabidopsis thaliana GN=GT11 P...
KATAM_ORYSJ  5.2e-85   38.82     Xyloglucan galactosyltransferase KATAMARI1 homolog OS=Oryza sativa subsp. japoni...
MUR3_ARATH   2.2e-83   38.60     Xyloglucan galactosyltransferase MUR3 OS=Arabidopsis thaliana GN=MUR3 PE=1 SV=1
GT14_ARATH   5.4e-82   37.80     Probable xyloglucan galactosyltransferase GT14 OS=Arabidopsis thaliana GN=GT14 P...
Match Name        E-value   Identity  Description
A0A0A0KNB0_CUCSA  1.4e-262  89.51     Uncharacterized protein OS=Cucumis sativus GN=Csa_5G171710 PE=4 SV=1
M5X3U0_PRUPE      1.7e-191  67.35     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003851mg PE=4 SV=1
B9R9L8_RICCO      1.1e-190  64.20     Xyloglucan galactosyltransferase KATAMARI1, putative OS=Ricinus communis GN=RCOM...
K7N3K7_SOYBN      4.2e-190  65.71     Uncharacterized protein OS=Glycine max GN=GLYMA_20G152000 PE=4 SV=1
V7BFP7_PHAVU      1.5e-187  65.03     Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_007G063500g PE=4 SV=1
Match Name   E-value   Identity  Description
AT5G62220.1  6.4e-182  60.67     glycosyltransferase 18
AT2G29040.1  5.8e-90   42.04     Exostosin family protein
AT2G20370.1  1.2e-84   38.60     Exostosin family protein
AT4G13990.1  3.1e-83   37.80     Exostosin family protein
AT1G63450.1  3.1e-83   38.42     root hair specific 8
Match Name                        E-value   Identity  Description
gi|778700528|ref|XP_011654883.1|  2.0e-262  89.51     PREDICTED: xyloglucan galactosyltransferase KATAMARI1 [Cucumis sativus]
gi|659073116|ref|XP_008467262.1|  1.5e-260  89.53     PREDICTED: LOW QUALITY PROTEIN: xyloglucan galactosyltransferase KATAMARI1 [Cucu...
gi|596021109|ref|XP_007218972.1|  2.4e-191  67.35     hypothetical protein PRUPE_ppa003851mg [Prunus persica]
gi|255539657|ref|XP_002510893.1|  1.6e-190  64.20     PREDICTED: xyloglucan galactosyltransferase KATAMARI1 homolog [Ricinus communis]
gi|356574438|ref|XP_003555354.1|  6.0e-190  65.71     PREDICTED: xyloglucan galactosyltransferase KATAMARI1 homolog [Glycine max]
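
The summary rows pair each match with an E-value and a percent identity, and the two do not always rank hits the same way: the Cucumis melo entry has the higher identity, the Cucumis sativus entry the lower E-value. The Python sketch below filters and orders the NCBI nr hits under assumed cutoffs. The function `rank_hits` and its default thresholds are hypothetical choices for illustration, not settings used by this database.

```python
# (RefSeq accession, E-value, percent identity) copied from the NCBI nr summary rows.
hits = [
    ("XP_011654883.1", 2.0e-262, 89.51),
    ("XP_008467262.1", 1.5e-260, 89.53),
    ("XP_007218972.1", 2.4e-191, 67.35),
    ("XP_002510893.1", 1.6e-190, 64.20),
    ("XP_003555354.1", 6.0e-190, 65.71),
]


def rank_hits(rows, max_evalue=1e-5, min_identity=0.0):
    """Keep rows passing both cutoffs (assumed defaults), then sort by
    ascending E-value, breaking ties by descending identity."""
    kept = [r for r in rows if r[1] <= max_evalue and r[2] >= min_identity]
    return sorted(kept, key=lambda r: (r[1], -r[2]))


# All five hits pass the default cutoffs; only the two Cucumis hits exceed 80% identity.
for accession, evalue, identity in rank_hits(hits, min_identity=80.0):
    print(accession, evalue, identity)
```

E-values this small still fit in a double-precision float, so they can be compared numerically without special handling.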
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR004263  Exostosin
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006007 glucose catabolic process
biological_process GO:0009969 xyloglucan biosynthetic process
biological_process GO:0044723 single-organism carbohydrate metabolic process
biological_process GO:0008150 biological_process
biological_process GO:0006486 protein glycosylation
cellular_component GO:0005794 Golgi apparatus
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0008378 galactosyltransferase activity
molecular_function GO:0016757 transferase activity, transferring glycosyl groups
molecular_function GO:0016740 transferase activity
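
Each row of the GO listing has three fields: the category, the accession, and a free-text term name that may contain spaces. A small Python sketch for grouping the rows by category follows, assuming whitespace-separated input exactly as printed above; `group_go_terms` is an illustrative helper, not part of any GO library.

```python
from collections import defaultdict

# Rows copied from the GO assignment listing above.
GO_LINES = """\
biological_process GO:0006007 glucose catabolic process
biological_process GO:0009969 xyloglucan biosynthetic process
biological_process GO:0044723 single-organism carbohydrate metabolic process
biological_process GO:0008150 biological_process
biological_process GO:0006486 protein glycosylation
cellular_component GO:0005794 Golgi apparatus
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0008378 galactosyltransferase activity
molecular_function GO:0016757 transferase activity, transferring glycosyl groups
molecular_function GO:0016740 transferase activity
"""


def group_go_terms(text):
    """Map each category to a list of (accession, term name) tuples."""
    groups = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue
        # Split on the first two runs of whitespace only; the rest is the term name.
        category, accession, name = line.split(maxsplit=2)
        groups[category].append((accession, name))
    return dict(groups)


groups = group_go_terms(GO_LINES)
for category, terms in groups.items():
    print(category, len(terms))
```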

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG01G005220.1  ClCG01G005220.1  mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term   IPR Description   Source   Source Term     Source Description                                      Alignment
IPR004263  Exostosin-like    PFAM     PF03016         Exostosin                                               coord: 96..433, score: 7.4
None       No IPR available  PANTHER  PTHR11062       EXOSTOSIN HEPARAN SULFATE GLYCOSYLTRANSFERASE -RELATED  coord: 148..537, score: 7.7E-245; coord: 43..118, score: 7.7E
None       No IPR available  PANTHER  PTHR11062:SF52  GLYCOSYLTRANSFERASE 18                                  coord: 148..537, score: 7.7E-245; coord: 43..118, score: 7.7E
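
The `coord:` entries in the alignment column give start and end positions of each match. A minimal Python sketch for extracting them and computing span lengths is shown here, under the assumption that the coordinates are 1-based and inclusive; the page itself does not state this. The helpers `match_spans` and `span_length` are illustrative names.

```python
import re

# Captures the start and end of every "coord: start..end" entry.
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def match_spans(text):
    """Return all (start, end) pairs found in an alignment cell."""
    return [(int(start), int(end)) for start, end in COORD.findall(text)]


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Alignment entry of the PFAM PF03016 (Exostosin) row.
pfam_spans = match_spans("coord: 96..433")
for start, end in pfam_spans:
    print(start, end, span_length(start, end))
```

Only the coordinates are parsed; the score fields are left untouched because some of them are truncated in the listing.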