ClCG01G006940 (gene) Watermelon (Charleston Gray)

Name: ClCG01G006940
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: Nucleotide-diphospho-sugar transferase family protein
Location: CG_Chr01 : 8047913 .. 8052278 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
AGAACGCGTACGATGAACTGTTGATTGATGTTGCTGTTCTTGCTCCCTGCTTCGAATATGGTGGAAATTGGAATGCGATCTTCAACCCTAGACGTTTCTATATCGGTTCGTGATGCGCCGCTGTAATGTTTATGATCTCTGCTTCTAATTCTTACGTTTCCGCCATGTTTAGTTACTCCTCCTCCTCCCGTACCCTTCACATTCTTCTCCTCTTCACTGCCATTTCCCTCTCTTGCCTTGTTATTCTCAGAGAACTCAACTCCCTTCGCTACTTCCCTCTTTTCTCCTTCTCTACTTTTTCCGGTTCTCCTCCTGTTTCCCCTTTCGTCCCCTCCCTCGATGATGACGACGAGTCTTTTCCGGTGAGTTACCTCGGTTTTCTTATTTGTCTTCCGTTCATGTTTTTTTTTTCCCCATTTTGGTTGATTAATTGATTGCTTTTCGTTCTTTTAGTTTGGGTTGTCGATTAATCTACCTTCCTTAATCGTCTTTCTTGCTTCTCGTTCGTTTCCAATTGTGAGTTTGAATATCATTGATGAATTGGCCTTGATGATGAGGGATGGATATCTAATGATGATTCGTTTTGGTAGTGTATTTAAAATACAGGACGGCTATGACTTCTCTTTCTTGCTGAAATTCTTTAGCAACCTTTTGATGTTTAATAGACTATCTTTTCTTTACTTAGCTTGCATGGTTGCGCTCAGTGTGCCTCTTCTATTCCCCATTTCTATCTCTCTGTTTTTCATATGTGAGCGCAGAGTAGATTTTGTTTTTGCCTTTCAATTTGAACCTTCAAATCTTTGCCCCATTCTGCTTATTTGCTTTCCTTTGGGTCTGTTAGGGATAGTGAAAATGCCTTTAGGAAGGACAATTCTCGGTTTTCGGTTCCCGAATTGTGATCCATTTTCTTACGAAAAAACAAGAGATGAGTGATTCATTTTCCATGTAGCCAAATGGATAATTTTGTGTTGTGAATCTCTGAGTAGAAAGTGTACTGTTGTTTTGGAAGATGAATTTACCTGCTAGTATAACAATCATATTACTCACGGTGTAGCTGATATGTACAATGCATGGGAATGTAGCAAGACTTCTTTACTTATTATTTTTCTAGTTTATTCACAAGTTGCTTAACAACTCTTATGCTTGAGTATTGAACTGATTTGGATTGGATTTATCTTTTATGCTTATAGTTGTACTTTTATTTGCTTTTGATTTTTGCCTCTTTCGGCTGTGATTGTAGGATGTTGATGAGTATGGACTGGACAAGGTCTTAATGGATGCTGCAACAGATGACAAAACTGTTATTTTAACTACTTTAAATGAAGCATGGGCATCTCCAAATTCAGTCATTGATCTCTTTCTACAAAGCTTTAGAATTGGAAATCGAACTCACCAACTATTAGACCATTTGGTTATTATTGCATTGGACAAAAAGGCATTTGTTCGTTGCTTGGATATCCATGTCCATTGCTTTGCTCTTGCTACTGAAGGAGTTGATTTTCATTCTGAGGCACATTTTATGTCACCTGACTACTTGAAGATGATGTGGAGAAGGATTGATTTTTTGCGAATTGTTCTTGAGATGGGGTACAATTTTGTATTCACGGTATCATACTCTCTCCTTTTCCTTTGAGATGGTTGCTTCCTCAGTTTTCATTTACGTCAATAATTCTTTTATTTATTTATTTTTGTTTCCTTTTGAAGGTTGAGATATGTAATATAGTTTAATGTAATGCTCAGGAAGGTTGGATAAACAGCACGTAGGAAACTGATAATGGGTGGACAATCAATCTATGTTCAGTGAATTTTGGTATTACTAGGAAAGAAATTTAGAGTTTTCCTTGTGTAGAAGTTTTCTTGGAATTTATTATTTATCAATGTATATGGTAGACCATAAAGAACCACATAGATAAACAATACGTCTAGAATGTGGTAGGGTCACCTCTTTTGAAGTATGTTTTTGTACACGAATATATTAAGGACGTGTCTGAAACTAATTT
TAAAATTTGAAATTTCAGTGAAAACGATTTTAGGAATCTGTGATTTCCTTTATGATTTATAGGAAGTGATTTTCACATAATCATCTTATGTTTCATTTTACACTTTTAAATGCTATTTCCATAGATCCAAATTGATTTTTGATAAAACCATGTTTGAGAGTGATTTTAAACACAACAAAAGTGATTTTAATAGTTTCAAAATCACTTTCAAACATGCCCTAAATCTTTTTGTGCTCAAGAGTTTCTATGGAAAATTTTGATATTTGCATTTAAAATTTAAATGAAATGAGTGACAAGACTAAGGTTGCCTGCAAGGAAGTTGGAGATGTCAATAATTTATAAGTGTTACACGTGTATCATGGTTTCTTTCCATTTCATACTCTAATTTGTGTATCCAACATGTATGGGTCTTAAGTCTTCATTTGGGAATCCAACATATATTAGGGTGTCTGGAGTCTCAAGTGTATGGACAGCATACACAATATCAGAAACCAGTTGTTTGTGCTTCACAAATACTTTACATGGAGAATAGGGAATCAAACTGGATGTCAATGCATCTTTTTTTGGTTTGGAATTACTTAAGAAGTCTATTTCCCAATGGATGAACCTATTCTCTAGGAACCCAAGATTTCCTCATATAGTTAAGGGAGAGGCTTCTGGGAAAGGAAAAATTCCTTTATTAATTAAACCTCACCTCTGCTCGACTTGTTCATTCTGAATTAGGAATCTAAGGCCACTCCAGTGACCACTTGGATGTCTTTCTCCTCCACTGTTTGAACATGCTTGGGTGGTTTCTTCAGGGGAAGATTTTTTTGGAGAGGGGTTCAATCAATTGTCACCTCTACTCCCCCTAGCTTGATTTAAGATAAGGAATTAGGGTCAATCCATTGCCTATTTGGTTGTCGTTCTCTTCCTCCACTATCAGCAGTAATTTCCATATTAGGTCTCCCTTGCCTTTTGTTTCAATAGTCAAACATTTGTGTACGATAGCATGACATGCCTTGCTTAAACTTTTGTCGTTTGGGATCTGTTATTTAGCTGTTCAATGTCTCTTATACATTCACCACTTGGCCGTTTACCAATTACTTCCATTAGGATATATCAGTTTTTATTTTCATGTATGTTGAAGGATGGAAATTTGCATGCGCCTCTCATGTCTTTATCAAACACTAGATCCGTGTATAATTTTTTCAAGTTTTCCTTTTGTATTTCTTTTAATTCCCATACTTTTGTTTCTTCCTTCAGGATGCTGATGTTATGTGGTTCAGGGATCCGTTCCCTTTCTTTGATATCAATGCAGATTTCCAAATTGCTTGTGATCAATACCTGGGCATCCCTGATGATTTAGATAACAGACCAAATGGAGGGTTTAACTATGTAAAATCCAATAATCGGTCAATTGAGTTCTACAAATATTGGTACTCAGCTCGGGAAACTTATCCAGGATACCATGATCAGGACGTTCTAAATAGGATCAAATACGATTTTTTCATCGATGAAATTGGACTAAAGATTAGATTCTTGGATACTGCTTACTTTGGTGGGTTCTGTGAACCCAGCAAAGATTTGAATCGTGTACTAACCATGCATGCAAACTGCTGTATTGGAATGGACAGTAAACTTCATGATCTCAGAATTATGCTCGAGGATTGGAAACATTACATGTCGATGCCGCCATATCGTAAGACATCATCAGCGTTGTCTTGGAGGGTTCCTCAGAACTGCAGGTATGGAACCATTTCTATGTCTTAGACAATTTTGCGCTATAATTAGGTACGTTTCTATTTCTTTATTCTTGAATTGCATTTTTTATATTCGAATCTTTTTCTCCTCCCGTTCCAGTGTCTGATCTTCTAATCTCCAAACAAAGAATTGATGAATCCTACCAAACTTGGAGTGATGCACTTTGAGCAATCATTTTTGCATTGCACATTTCGAGTTTTGTGAACCAAAGATATTTCTGCACTACAGCTTATACCAAAGTGAATGTTAGAAGAGTA
TTGTGGCCGTAGTCTTTATTTGCCAGTTTGTGTAAATTTATACCCAAATAAATTTCGAGGTCCCGCAATTGTACATCGCCAAATTAGAAAGCCTAACCATCATTAGGTAGTTCGTTTGCTAGAAATTTTGGTAAAACGATCAAATTCAGGTTTAGGTATACCTTTAGTTTTCAGATATAAGGAAATTTTCTCTTGCATTTTAATGCAACTATTAAGTCCACTTTTATGTACAACTCTATTACGCCGTAGCCACGATCCCCTATTATATAAAATGATCAAGCTTATAACCACATAAGATAGAATAAACTACACTTTCTCTCGTGGCCTATAAAAGGCTGATATTGTTTATGGGTAAGATATGTGTTT

mRNA sequence

AGAACGCGTACGATGAACTGTTGATTGATGTTGCTGTTCTTGCTCCCTGCTTCGAATATGGTGGAAATTGGAATGCGATCTTCAACCCTAGACGTTTCTATATCGGTTCGTGATGCGCCGCTGTAATGTTTATGATCTCTGCTTCTAATTCTTACGTTTCCGCCATGTTTAGTTACTCCTCCTCCTCCCGTACCCTTCACATTCTTCTCCTCTTCACTGCCATTTCCCTCTCTTGCCTTGTTATTCTCAGAGAACTCAACTCCCTTCGCTACTTCCCTCTTTTCTCCTTCTCTACTTTTTCCGGTTCTCCTCCTGTTTCCCCTTTCGTCCCCTCCCTCGATGATGACGACGAGTCTTTTCCGTTTGGGTTGTCGATTAATCTACCTTCCTTAATCGTCTTTCTTGCTTCTCGTTCGTTTCCAATTGATGTTGATGAGTATGGACTGGACAAGGTCTTAATGGATGCTGCAACAGATGACAAAACTGTTATTTTAACTACTTTAAATGAAGCATGGGCATCTCCAAATTCAGTCATTGATCTCTTTCTACAAAGCTTTAGAATTGGAAATCGAACTCACCAACTATTAGACCATTTGGTTATTATTGCATTGGACAAAAAGGCATTTGTTCGTTGCTTGGATATCCATGTCCATTGCTTTGCTCTTGCTACTGAAGGAGTTGATTTTCATTCTGAGGCACATTTTATGTCACCTGACTACTTGAAGATGATGTGGAGAAGGATTGATTTTTTGCGAATTGTTCTTGAGATGGGGTACAATTTTGTATTCACGGATGCTGATGTTATGTGGTTCAGGGATCCGTTCCCTTTCTTTGATATCAATGCAGATTTCCAAATTGCTTGTGATCAATACCTGGGCATCCCTGATGATTTAGATAACAGACCAAATGGAGGGTTTAACTATGTAAAATCCAATAATCGGTCAATTGAGTTCTACAAATATTGGTACTCAGCTCGGGAAACTTATCCAGGATACCATGATCAGGACGTTCTAAATAGGATCAAATACGATTTTTTCATCGATGAAATTGGACTAAAGATTAGATTCTTGGATACTGCTTACTTTGGTGGGTTCTGTGAACCCAGCAAAGATTTGAATCGTGTACTAACCATGCATGCAAACTGCTGTATTGGAATGGACAGTAAACTTCATGATCTCAGAATTATGCTCGAGGATTGGAAACATTACATGTCGATGCCGCCATATCGTAAGACATCATCAGCGTTGTCTTGGAGGGTTCCTCAGAACTGCAGGTATGGAACCATTTCTATGTCTTAGACAATTTTGCGCTATAATTAGGTACGTTTCTATTTCTTTATTCTTGAATTGCATTTTTTATATTCGAATCTTTTTCTCCTCCCGTTCCAGTGTCTGATCTTCTAATCTCCAAACAAAGAATTGATGAATCCTACCAAACTTGGAGTGATGCACTTTGAGCAATCATTTTTGCATTGCACATTTCGAGTTTTGTGAACCAAAGATATTTCTGCACTACAGCTTATACCAAAGTGAATGTTAGAAGAGTATTGTGGCCGTAGTCTTTATTTGCCAGTTTGTGTAAATTTATACCCAAATAAATTTCGAGGTCCCGCAATTGTACATCGCCAAATTAGAAAGCCTAACCATCATTAGGTAGTTCGTTTGCTAGAAATTTTGGTAAAACGATCAAATTCAGGTTTAGGTATACCTTTAGTTTTCAGATATAAGGAAATTTTCTCTTGCATTTTAATGCAACTATTAAGTCCACTTTTATGTACAACTCTATTACGCCGTAGCCACGATCCCCTATTATATAAAATGATCAAGCTTATAACCACATAAGATAGAATAAACTACACTTTCTCTCGTGGCCTATAAAAGGCTGATATTGTTTATGGGTAAGATATGTGTTT

Coding sequence (CDS)

ATGTTTATGATCTCTGCTTCTAATTCTTACGTTTCCGCCATGTTTAGTTACTCCTCCTCCTCCCGTACCCTTCACATTCTTCTCCTCTTCACTGCCATTTCCCTCTCTTGCCTTGTTATTCTCAGAGAACTCAACTCCCTTCGCTACTTCCCTCTTTTCTCCTTCTCTACTTTTTCCGGTTCTCCTCCTGTTTCCCCTTTCGTCCCCTCCCTCGATGATGACGACGAGTCTTTTCCGTTTGGGTTGTCGATTAATCTACCTTCCTTAATCGTCTTTCTTGCTTCTCGTTCGTTTCCAATTGATGTTGATGAGTATGGACTGGACAAGGTCTTAATGGATGCTGCAACAGATGACAAAACTGTTATTTTAACTACTTTAAATGAAGCATGGGCATCTCCAAATTCAGTCATTGATCTCTTTCTACAAAGCTTTAGAATTGGAAATCGAACTCACCAACTATTAGACCATTTGGTTATTATTGCATTGGACAAAAAGGCATTTGTTCGTTGCTTGGATATCCATGTCCATTGCTTTGCTCTTGCTACTGAAGGAGTTGATTTTCATTCTGAGGCACATTTTATGTCACCTGACTACTTGAAGATGATGTGGAGAAGGATTGATTTTTTGCGAATTGTTCTTGAGATGGGGTACAATTTTGTATTCACGGATGCTGATGTTATGTGGTTCAGGGATCCGTTCCCTTTCTTTGATATCAATGCAGATTTCCAAATTGCTTGTGATCAATACCTGGGCATCCCTGATGATTTAGATAACAGACCAAATGGAGGGTTTAACTATGTAAAATCCAATAATCGGTCAATTGAGTTCTACAAATATTGGTACTCAGCTCGGGAAACTTATCCAGGATACCATGATCAGGACGTTCTAAATAGGATCAAATACGATTTTTTCATCGATGAAATTGGACTAAAGATTAGATTCTTGGATACTGCTTACTTTGGTGGGTTCTGTGAACCCAGCAAAGATTTGAATCGTGTACTAACCATGCATGCAAACTGCTGTATTGGAATGGACAGTAAACTTCATGATCTCAGAATTATGCTCGAGGATTGGAAACATTACATGTCGATGCCGCCATATCGTAAGACATCATCAGCGTTGTCTTGGAGGGTTCCTCAGAACTGCAGGTATGGAACCATTTCTATGTCTTAG

Protein sequence

MFMISASNSYVSAMFSYSSSSRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFVPSLDDDDESFPFGLSINLPSLIVFLASRSFPIDVDEYGLDKVLMDAATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLKMMWRRIDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQNCRYGTISMS
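
The protein sequence above can be cross-checked against the coding sequence by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; the function name `translate` and the short test fragments (the first and last codons of the CDS listed above) are illustrative choices, not part of the database record.

```python
# Standard genetic code, laid out with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First nine codons of the CDS above, and its final six codons.
print(translate("ATGTTTATGATCTCTGCTTCTAATTCT"))  # MFMISASNS
print(translate("ACCATTTCTATGTCTTAG"))           # TISMS
```

Passing the full CDS string to `translate` should reproduce the listed protein if the two records are consistent.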
BLAST of ClCG01G006940 vs. Swiss-Prot
Match: Y4597_ARATH (Uncharacterized protein At4g15970 OS=Arabidopsis thaliana GN=At4g15970 PE=2 SV=1)

HSP 1 Score: 315.5 bits (807), Expect = 8.3e-85
Identity = 149/279 (53.41%), Positives = 199/279 (71.33%), Query Frame = 1

Query: 107 LDKVLMDAATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKA 166
           L K+L +AAT+DKTVI+TTLN+AW+ PNS  DLFL SF +G  T  LL HLV+  LD++A
Sbjct: 42  LGKILTEAATEDKTVIITTLNKAWSEPNSTFDLFLHSFHVGKGTKPLLRHLVVACLDEEA 101

Query: 167 FVRCLDIHVH-CFALATEGVDFHSEAHFMSPDYLKMMWRRIDFLRIVLEMGYNFVFTDAD 226
           + RC ++H H C+ + T G+DF  +  FM+PDYLKMMWRRI+FL  +L++ YNF+FT   
Sbjct: 102 YSRCSEVHPHRCYFMKTPGIDFAGDKMFMTPDYLKMMWRRIEFLGTLLKLRYNFIFTI-- 161

Query: 227 VMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARE 286
                 PFP      DFQIACD+Y G   D+ N  NGGF +VK+N R+I+FY YWY +R 
Sbjct: 162 ------PFPRLSKEVDFQIACDRYSGDDKDIHNAVNGGFAFVKANQRTIDFYNYWYMSRL 221

Query: 287 TYPGYHDQDVLNRIKYDFFIDEIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCIGMD 346
            YP  HDQDVL++IK   +  +IGLK+RFLDT YFGGFCEPS+DL++V TMHANCC+G++
Sbjct: 222 RYPDRHDQDVLDQIKGGGYPAKIGLKMRFLDTKYFGGFCEPSRDLDKVCTMHANCCVGLE 281

Query: 347 SKLHDLRIMLEDWKHYMSMPPYRKTSSA--LSWRVPQNC 383
           +K+ DLR ++ DW++Y+S     KT+    ++WR P+NC
Sbjct: 282 NKIKDLRQVIVDWENYVSA---AKTTDGQIMTWRDPENC 309

BLAST of ClCG01G006940 vs. Swiss-Prot
Match: Y1869_ARATH (Uncharacterized protein At1g28695 OS=Arabidopsis thaliana GN=At1g28695 PE=2 SV=1)

HSP 1 Score: 209.5 bits (532), Expect = 6.5e-53
Identity = 111/282 (39.36%), Positives = 167/282 (59.22%), Query Frame = 1

Query: 85  NLPSLIVFLASRSFPIDVDEYGLDKVLMDAATDDKTVILTTLNEAWASP----NSVIDLF 144
           N  + +  L +  +P+D  E  L      AA ++KTVI+T +N+A+       ++++DLF
Sbjct: 27  NYQNYVNTLRTTQYPVDELEAALYTA---AAGNNKTVIITMVNKAYVKEVGRGSTMLDLF 86

Query: 145 LQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATE-GVDFHSEAHFMSPDYL 204
           L+SF  G  T  LLDHL+++A+D+ A+ RC    +HC+ + TE GVD   E  FMS D++
Sbjct: 87  LESFWEGEGTLPLLDHLMVVAVDQTAYDRCRFKRLHCYKMETEDGVDLEGEKVFMSKDFI 146

Query: 205 KMMWRRIDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNR 264
           +MMWRR   +  VL  GYN +FTD DVMW R P    +++ D QI+ D+ + +   L N 
Sbjct: 147 EMMWRRTRLILDVLRRGYNVIFTDTDVMWLRSPLSRLNMSLDMQISVDR-INVGGQLINT 206

Query: 265 PNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIRFLDTAY 324
              GF +V+SNN++I  ++ WY  R    G  +QDVL  +    F +++GL + FL T  
Sbjct: 207 ---GFYHVRSNNKTISLFQKWYDMRLNSTGMKEQDVLKNLLDSGFFNQLGLNVGFLSTTE 266

Query: 325 FGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHY 362
           F GFC+ S  +  V T+HANCC+ + +K+ DL  +L DWK Y
Sbjct: 267 FSGFCQDSPHMGVVTTVHANCCLHIPAKVFDLTRVLRDWKRY 301

BLAST of ClCG01G006940 vs. TrEMBL
Match: A0A0A0KPV2_CUCSA (Glycosyltransferase OS=Cucumis sativus GN=Csa_5G107050 PE=3 SV=1)

HSP 1 Score: 641.0 bits (1652), Expect = 9.7e-181
Identity = 312/369 (84.55%), Positives = 324/369 (87.80%), Query Frame = 1

Query: 14  MFSYSSSSRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFVPSLDD 73
           MF YSS   + HILLLFTAISLSCLVILRELNSLRYFPLFSFST SG PP+ PF+ SL  
Sbjct: 1   MFIYSSFRCSPHILLLFTAISLSCLVILRELNSLRYFPLFSFSTSSGPPPLPPFLLSLPH 60

Query: 74  DDESFPFGLSINLPSLIVFLASRSFPIDVDEYGLDKVLMDAATDDKTVILTTLNEAWASP 133
            D   P                     + DEYGLDKVL DAAT+DKTVILTTLNEAWASP
Sbjct: 61  HDHLSP---------------------EADEYGLDKVLKDAATEDKTVILTTLNEAWASP 120

Query: 134 NSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHF 193
           N+VIDLFLQSFRIGNRTHQLLDHLVIIALDKKAF+RCLDIH+HC +L TEGVDF SEA+F
Sbjct: 121 NAVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFMRCLDIHIHCVSLVTEGVDFRSEAYF 180

Query: 194 MSPDYLKMMWRRIDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIP 253
           MSPDYLKMMWRRIDFLR VLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIP
Sbjct: 181 MSPDYLKMMWRRIDFLRTVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIP 240

Query: 254 DDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIR 313
           DDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFI+EIGLKIR
Sbjct: 241 DDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIEEIGLKIR 300

Query: 314 FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSA 373
           FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRI+LEDWKHYMSMPPY KTSS 
Sbjct: 301 FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRILLEDWKHYMSMPPYLKTSSI 348

Query: 374 LSWRVPQNC 383
            SWRVPQNC
Sbjct: 361 QSWRVPQNC 348

BLAST of ClCG01G006940 vs. TrEMBL
Match: A0A072VSD7_MEDTR (Glycosyltransferase OS=Medicago truncatula GN=MTR_1g112300 PE=3 SV=1)

HSP 1 Score: 473.8 bits (1218), Expect = 2.0e-130
Identity = 234/368 (63.59%), Positives = 275/368 (74.73%), Query Frame = 1

Query: 21  SRTLHI------LLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFVPSLDDD 80
           S+TLH+       L F  +SLSCL++ R+++S R+F    FS+    P    F P +  D
Sbjct: 5   SKTLHLRCALAAALFFATVSLSCLILFRDVDSYRFFS--GFSSSYALPRFPTFFPLVTVD 64

Query: 81  DESFPFGLSINLPSLIVFLASRSFPIDVDEYGLDKVLMDAATDDKTVILTTLNEAWASPN 140
                       PS           +  +EY L+++L DAA +DKTVILTTLNEAWA+PN
Sbjct: 65  ------------PS-----------VTTNEYPLERILNDAAMEDKTVILTTLNEAWAAPN 124

Query: 141 SVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFM 200
           SVIDLFLQSFRIG+ T +LL+HLVIIALD+KAF RC  IH HCF+LA E  DFH EA+FM
Sbjct: 125 SVIDLFLQSFRIGDHTSRLLNHLVIIALDQKAFARCQVIHTHCFSLANEEADFHEEAYFM 184

Query: 201 SPDYLKMMWRRIDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPD 260
           +P YL MMWRRIDFLR VLE GYNFVFTDAD+MWFRDPFP F ++ADFQIACD + G  D
Sbjct: 185 TPSYLMMMWRRIDFLRSVLEKGYNFVFTDADIMWFRDPFPRFHLDADFQIACDHFTGGFD 244

Query: 261 DLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIRF 320
           D+ NRPNGGFN+VKSNNRSIEFYK+WYS+RETYPGYHDQDVLN IK   FI +IGLK+RF
Sbjct: 245 DVMNRPNGGFNFVKSNNRSIEFYKFWYSSRETYPGYHDQDVLNFIKVHPFIADIGLKMRF 304

Query: 321 LDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSAL 380
           LDT  FGG CEPS+DLN+V TMHANCC GMDSKLHDLRIML+DWKHY+S+PP  K  S +
Sbjct: 305 LDTTNFGGLCEPSRDLNQVCTMHANCCFGMDSKLHDLRIMLQDWKHYLSLPPNLKKLSVV 347

Query: 381 SWRVPQNC 383
           SWRVPQ C
Sbjct: 365 SWRVPQKC 347

BLAST of ClCG01G006940 vs. TrEMBL
Match: A0A067KPV4_JATCU (Glycosyltransferase OS=Jatropha curcas GN=JCGZ_04849 PE=3 SV=1)

HSP 1 Score: 471.9 bits (1213), Expect = 7.8e-130
Identity = 223/372 (59.95%), Positives = 280/372 (75.27%), Query Frame = 1

Query: 12  SAMFSYSSSSRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFVPSL 71
           SA F+    S     +L+F  + +SCL++ R  +SLR+                      
Sbjct: 30  SATFNMLPESAIPRAVLVFAVLCISCLLLYRAADSLRFL--------------------- 89

Query: 72  DDDDESFPFGLSINLPSLIVFLASRSFPIDV-DEYGLDKVLMDAATDDKTVILTTLNEAW 131
                SFP G S++ P +   L   +  + V +E  L+ VL DAA +DKTVILTTLNEAW
Sbjct: 90  -----SFPSGSSVSFPHIFPSLVFDNDSVPVRNEQKLENVLEDAAMEDKTVILTTLNEAW 149

Query: 132 ASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSE 191
           A+PNS++DLFL SFRIG  T +LL+HLVIIALD+KA+ RC+++H+HCFAL TEG+DF  E
Sbjct: 150 AAPNSIVDLFLASFRIGVHTRRLLNHLVIIALDQKAYTRCMELHIHCFALVTEGIDFRKE 209

Query: 192 AHFMSPDYLKMMWRRIDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYL 251
           A+FM+P Y+KMMWRRIDFLR VLE+GYNFVFTDADVMWFRDPFP F  +ADFQIACD + 
Sbjct: 210 AYFMTPAYVKMMWRRIDFLRSVLELGYNFVFTDADVMWFRDPFPRFYSDADFQIACDHFT 269

Query: 252 GIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGL 311
           G   +++N+PNGGFN+V+SNNRSIEFYK+WYS+RETYPGYHDQDVLN IK+D F++++GL
Sbjct: 270 GSSVNIENKPNGGFNFVRSNNRSIEFYKFWYSSRETYPGYHDQDVLNFIKFDSFVEDLGL 329

Query: 312 KIRFLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKT 371
           K+RFLDTAYFGG CEPSKDL+ V TMHANCC G+DSKLHDLR+ML+DW H++S+PP  K 
Sbjct: 330 KMRFLDTAYFGGLCEPSKDLSLVCTMHANCCFGLDSKLHDLRVMLQDWMHFLSLPPSMKR 375

Query: 372 SSALSWRVPQNC 383
           S  +SWRVPQNC
Sbjct: 390 SLIVSWRVPQNC 375

BLAST of ClCG01G006940 vs. TrEMBL
Match: A0A0B2RYA6_GLYSO (Uncharacterized protein OS=Glycine soja GN=glysoja_013623 PE=4 SV=1)

HSP 1 Score: 468.8 bits (1205), Expect = 6.6e-129
Identity = 234/371 (63.07%), Positives = 277/371 (74.66%), Query Frame = 1

Query: 21  SRTLHIL------LLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFV---PSL 80
           S+T+H+         F  +SLSCLV+L +++S R+      S+F  S  +S F    PS+
Sbjct: 5   SKTIHLRRAVAASFFFATVSLSCLVLLGDVDSHRFL-----SSFHSSYSLSGFTRIFPSV 64

Query: 81  DDDDESFPFGLSINLPSLIVFLASRSFPIDVDEYGLDKVLMDAATDDKTVILTTLNEAWA 140
            +D    P   S                   +EY L+K+L DAA  D+TVILTTLNEAWA
Sbjct: 65  YND----PVATS-------------------NEYPLEKILNDAAMKDRTVILTTLNEAWA 124

Query: 141 SPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEA 200
           +PNSVIDLFL+SFRIG+RT   L+HLVIIALD+KAF RC  IH HCF+L +E  DFH EA
Sbjct: 125 TPNSVIDLFLESFRIGDRTSTFLNHLVIIALDQKAFARCQVIHTHCFSLVSEEADFHEEA 184

Query: 201 HFMSPDYLKMMWRRIDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLG 260
           +FM+P YL MMW+RIDFLR VLEMGYNFVFTDAD+MWFRDPFP FD++ADFQIACD + G
Sbjct: 185 YFMTPRYLMMMWKRIDFLRTVLEMGYNFVFTDADIMWFRDPFPQFDLHADFQIACDHFTG 244

Query: 261 IPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLK 320
             DD+ NRPNGGFNYVKSNNRSIEFYK+WYS+RETYPGYHDQDVLN IK   FI +IGLK
Sbjct: 245 GFDDVQNRPNGGFNYVKSNNRSIEFYKFWYSSRETYPGYHDQDVLNFIKVHPFITDIGLK 304

Query: 321 IRFLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTS 380
           +RFLDT  FGG CEPS+DLN+V TMHANCC+GMDSKLHDLRIML+DWKHY+S+PP  K  
Sbjct: 305 MRFLDTTNFGGLCEPSRDLNQVCTMHANCCLGMDSKLHDLRIMLQDWKHYLSLPPSLKRL 347

Query: 381 SALSWRVPQNC 383
           S +SWRVPQ C
Sbjct: 365 SVVSWRVPQKC 347

BLAST of ClCG01G006940 vs. TrEMBL
Match: K7LLW7_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_10G280600 PE=4 SV=1)

HSP 1 Score: 468.8 bits (1205), Expect = 6.6e-129
Identity = 234/371 (63.07%), Positives = 277/371 (74.66%), Query Frame = 1

Query: 21  SRTLHIL------LLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFV---PSL 80
           S+T+H+         F  +SLSCLV+L +++S R+      S+F  S  +S F    PS+
Sbjct: 5   SKTIHLRRAVAASFFFATVSLSCLVLLGDVDSHRFL-----SSFHSSYSLSGFTRIFPSV 64

Query: 81  DDDDESFPFGLSINLPSLIVFLASRSFPIDVDEYGLDKVLMDAATDDKTVILTTLNEAWA 140
            +D    P   S                   +EY L+K+L DAA  D+TVILTTLNEAWA
Sbjct: 65  YND----PVATS-------------------NEYPLEKILNDAAMKDRTVILTTLNEAWA 124

Query: 141 SPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEA 200
           +PNSVIDLFL+SFRIG+RT   L+HLVIIALD+KAF RC  IH HCF+L +E  DFH EA
Sbjct: 125 TPNSVIDLFLESFRIGDRTSTFLNHLVIIALDQKAFARCQVIHTHCFSLVSEEADFHEEA 184

Query: 201 HFMSPDYLKMMWRRIDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLG 260
           +FM+P YL MMW+RIDFLR VLEMGYNFVFTDAD+MWFRDPFP FD++ADFQIACD + G
Sbjct: 185 YFMTPRYLMMMWKRIDFLRTVLEMGYNFVFTDADIMWFRDPFPQFDLHADFQIACDHFTG 244

Query: 261 IPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLK 320
             DD+ NRPNGGFNYVKSNNRSIEFYK+WYS+RETYPGYHDQDVLN IK   FI +IGLK
Sbjct: 245 GFDDVQNRPNGGFNYVKSNNRSIEFYKFWYSSRETYPGYHDQDVLNFIKVHPFITDIGLK 304

Query: 321 IRFLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTS 380
           +RFLDT  FGG CEPS+DLN+V TMHANCC+GMDSKLHDLRIML+DWKHY+S+PP  K  
Sbjct: 305 MRFLDTTNFGGLCEPSRDLNQVCTMHANCCLGMDSKLHDLRIMLQDWKHYLSLPPSLKRL 347

Query: 381 SALSWRVPQNC 383
           S +SWRVPQ C
Sbjct: 365 SVVSWRVPQKC 347

BLAST of ClCG01G006940 vs. TAIR10
Match: AT1G14590.1 (AT1G14590.1 Nucleotide-diphospho-sugar transferase family protein)

HSP 1 Score: 427.9 bits (1099), Expect = 6.5e-120
Identity = 199/283 (70.32%), Positives = 233/283 (82.33%), Query Frame = 1

Query: 100 IDVDEYGLDKVLMDAATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVI 159
           +D +E  L+ VL  AAT D+TV+LTTLN AWA+P SVIDLF +SFRIG  T Q+LDHLVI
Sbjct: 78  LDNEEPKLEDVLSKAATRDRTVVLTTLNAAWAAPGSVIDLFFESFRIGEETSQILDHLVI 137

Query: 160 IALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLKMMWRRIDFLRIVLEMGYNF 219
           +ALD KA+ RCL++H HCF+L TEGVDF  EA+FM+  YLKMMWRRID LR VLEMGYNF
Sbjct: 138 VALDAKAYSRCLELHKHCFSLVTEGVDFSREAYFMTRSYLKMMWRRIDLLRSVLEMGYNF 197

Query: 220 VFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKY 279
           VFTDADVMWFR+PFP F + ADFQIACD YLG  +DL NRPNGGFN+V+SNNR+I FYKY
Sbjct: 198 VFTDADVMWFRNPFPRFYMYADFQIACDHYLGRSNDLHNRPNGGFNFVRSNNRTILFYKY 257

Query: 280 WYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHAN 339
           WY++R  +PGYHDQDVLN +K + F+  IGLK+RFL+TAYFGG CEPS+DLN V TMHAN
Sbjct: 258 WYASRLRFPGYHDQDVLNFLKAEPFVFRIGLKMRFLNTAYFGGLCEPSRDLNLVRTMHAN 317

Query: 340 CCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQNC 383
           CC GM+SKLHDLRIML+DWK +MS+P + K SS  SW+VPQNC
Sbjct: 318 CCYGMESKLHDLRIMLQDWKDFMSLPLHLKQSSGFSWKVPQNC 360

BLAST of ClCG01G006940 vs. TAIR10
Match: AT2G02061.1 (AT2G02061.1 Nucleotide-diphospho-sugar transferase family protein)

HSP 1 Score: 400.2 bits (1027), Expect = 1.5e-111
Identity = 201/337 (59.64%), Positives = 239/337 (70.92%), Query Frame = 1

Query: 47  LRYFPLFSFSTFSGSPPVSPFVPSLDDDDESFPFGLSINLPSLIVFLASRSFPIDVDEYG 106
           L +F  F FS +  S  +     S       FP   S+N  S     +    P +++E  
Sbjct: 51  LLFFICFCFSLYRTSGYLRIVSDSSSTLSRIFP---SVNDSSSSPSPSPSLSPEEIEEPK 110

Query: 107 LDKVLMDAATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKA 166
           L++VL  AAT D TVILTTLNEAWA+P SVIDLF +SFRIG  T +LL HLVIIALD KA
Sbjct: 111 LEEVLRRAATKDGTVILTTLNEAWAAPGSVIDLFFESFRIGKGTRRLLKHLVIIALDAKA 170

Query: 167 FVRCLDIHVHCFALATEGVDFHS-EAHFMSPDYLKMMWRRIDFLRIVLEMGYNFVFTDAD 226
           + RC ++H HCF L TEGVDF   EA+FM+P YL MMWRRI FLR VLE GYNFVFTDAD
Sbjct: 171 YSRCQELHKHCFRLETEGVDFSGGEAYFMTPSYLTMMWRRISFLRSVLEKGYNFVFTDAD 230

Query: 227 VMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARE 286
           VMWFR+PF  F  + DFQIACD Y+G P+D  NRPNGGF +V++NNRSI FYK+WY +R 
Sbjct: 231 VMWFRNPFRRFYEDGDFQIACDHYIGRPNDFRNRPNGGFTFVRANNRSIGFYKFWYDSRT 290

Query: 287 TYPGYHDQDVLNRIKYDFFIDEIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCIGMD 346
            YP  HDQDVLN IK D F+ ++ ++IRFL+T YFGGFCEPSKDLN V TMHANCC G+D
Sbjct: 291 KYPKNHDQDVLNFIKTDPFLWKLRIRIRFLNTVYFGGFCEPSKDLNLVCTMHANCCFGLD 350

Query: 347 SKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQNC 383
           SKLHDLRIML+DW+ + S+P +   SS  +W VPQNC
Sbjct: 351 SKLHDLRIMLQDWRDFKSLPLHSNQSSGFTWSVPQNC 384

BLAST of ClCG01G006940 vs. TAIR10
Match: AT5G44820.1 (AT5G44820.1 Nucleotide-diphospho-sugar transferase family protein)

HSP 1 Score: 365.5 bits (937), Expect = 4.0e-101
Identity = 178/358 (49.72%), Positives = 236/358 (65.92%), Query Frame = 1

Query: 26  ILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFVPSLDDDDESFPFGLSIN 85
           IL+LF  ++ SCLV+ +    L+   + + ++   SP  SP +P+L+  + S        
Sbjct: 33  ILILFLGLTASCLVLYKTAYPLQRLNVSNLTSLQASP--SPLLPNLNSSEIS----PETT 92

Query: 86  LPSLIVFLASRSFPIDVDEYGLDKVLMDAATDDKTVILTTLNEAWASPNSVIDLFLQSFR 145
            P L                   ++L +A+T + TVI+TTLN+AWA PNS+ DLFL+SFR
Sbjct: 93  KPKL----------------SFKEILENASTKNNTVIITTLNQAWAEPNSLFDLFLESFR 152

Query: 146 IGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLKMMWRR 205
           IG  T QLL H+V++ LD KAF RC  +H +C+ + T   DF  E  + +PDYLKMMW R
Sbjct: 153 IGQGTQQLLKHVVVVCLDIKAFERCSQLHTNCYHIETSETDFSGEKVYNTPDYLKMMWAR 212

Query: 206 IDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFN 265
           ID L  VLEMG+NF+FTDAD+MW RDPFP    + DFQ+ACD++ G P D DN  NGGF 
Sbjct: 213 IDLLTQVLEMGFNFIFTDADIMWLRDPFPRLYPDGDFQMACDRFFGNPYDSDNWVNGGFT 272

Query: 266 YVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIRFLDTAYFGGFCE 325
           YV+SNNRSIEFYK+W+ +R  YP  HDQDV NRIK++ FI EIG+++RF DT YFGGFC+
Sbjct: 273 YVRSNNRSIEFYKFWHKSRLDYPDLHDQDVFNRIKHEPFISEIGIQMRFFDTVYFGGFCQ 332

Query: 326 PSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSM-PPYRKTSSALSWRVPQNC 383
            S+D+N V TMHANCCIG+D KLHDL ++L+DW+ Y+S+  P + T    +W VP  C
Sbjct: 333 TSRDINLVCTMHANCCIGLDKKLHDLNLVLDDWRKYLSLSEPVQNT----TWSVPMKC 364

BLAST of ClCG01G006940 vs. TAIR10
Match: AT4G19970.1 (AT4G19970.1 Nucleotide-diphospho-sugar transferase, predicted (InterPro:IPR005069))

HSP 1 Score: 357.5 bits (916), Expect = 1.1e-98
Identity = 160/275 (58.18%), Positives = 207/275 (75.27%), Query Frame = 1

Query: 109 KVLMDAATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFV 168
           +VL +A+T+++TVI+TTLN+AWA PNS+ DLFL+SFRIG  T +LL H+V++ LD KAF 
Sbjct: 444 EVLENASTENRTVIVTTLNQAWAEPNSLFDLFLESFRIGQGTKKLLQHVVVVCLDSKAFA 503

Query: 169 RCLDIHVHCFALATEGVDFHSEAHFMSPDYLKMMWRRIDFLRIVLEMGYNFVFTDADVMW 228
           RC  +H +C+ L T G DF  E  F +PDYLKMMWRRI+ L  VLEMGYNF+FTDAD+MW
Sbjct: 504 RCSQLHPNCYYLKTTGTDFSGEKLFATPDYLKMMWRRIELLTQVLEMGYNFIFTDADIMW 563

Query: 229 FRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYP 288
            RDPFP    + DFQ+ACD++ G P D DN  NGGF YVKSN+RSIEFYK+WY++R  YP
Sbjct: 564 LRDPFPRLYPDGDFQMACDRFFGDPHDSDNWVNGGFTYVKSNHRSIEFYKFWYNSRLDYP 623

Query: 289 GYHDQDVLNRIKYDFFIDEIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKL 348
             HDQDV N+IK+   + EIG+++RF DT YFGGFC+ S+D+N V TMHANCC+G+  KL
Sbjct: 624 KMHDQDVFNQIKHKALVSEIGIQMRFFDTVYFGGFCQTSRDINLVCTMHANCCVGLAKKL 683

Query: 349 HDLRIMLEDWKHYMSM-PPYRKTSSALSWRVPQNC 383
           HDL ++L+DW++Y+S+  P + T    +W VP  C
Sbjct: 684 HDLNLVLDDWRNYLSLSEPVKNT----TWSVPMKC 714


HSP 2 Score: 326.6 bits (836), Expect = 2.0e-89
Identity = 144/274 (52.55%), Positives = 196/274 (71.53%), Query Frame = 1

Query: 107 LDKVLMDAATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKA 166
           L++VLM+AA +D TVI+T LN+AWA PNS  D+F +SF++G  T +LL H++ + LD KA
Sbjct: 102 LERVLMNAAMEDNTVIITALNQAWAEPNSTFDVFRESFKVGIETERLLKHVIAVCLDIKA 161

Query: 167 FVRCLDIHVHCFAL-ATEGVDFHSEAHFMSPDYLKMMWRRIDFLRIVLEMGYNFVFTDAD 226
           + +CL +H HC+ + AT+         FM+P YLK++WRR+D LR V+ +GYNF+FTDAD
Sbjct: 162 YDQCLKVHPHCYLINATDSDQLSGPNRFMTPGYLKLIWRRMDLLRQVIGLGYNFIFTDAD 221

Query: 227 VMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARE 286
           ++W RDPFP F  +ADFQI CD Y G P D  N  N GF YVK+NN++ +FYKYW  +  
Sbjct: 222 ILWLRDPFPRFFPDADFQITCDDYNGRPSDKKNHVNSGFTYVKANNKTSKFYKYWIRSSR 281

Query: 287 TYPGYHDQDVLNRIKYDFFIDEIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCIGMD 346
            +PG HDQDV N IK D  ++++G+K+RF DT YFGGFC+PS+D+N V TMHANCCIG+D
Sbjct: 282 KFPGKHDQDVFNFIKNDLHVEKLGIKMRFFDTVYFGGFCQPSRDINVVNTMHANCCIGLD 341

Query: 347 SKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVP 380
           +K+++L+  LEDWK Y+S+     T S   W +P
Sbjct: 342 NKVNNLKAALEDWKRYVSL---NTTVSETKWNIP 372

BLAST of ClCG01G006940 vs. TAIR10
Match: AT4G15970.1 (AT4G15970.1 Nucleotide-diphospho-sugar transferase family protein)

HSP 1 Score: 316.2 bits (809), Expect = 2.8e-86
Identity = 149/279 (53.41%), Positives = 199/279 (71.33%), Query Frame = 1

Query: 107 LDKVLMDAATDDKTVILTTLNEAWASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKA 166
           L K+L +AAT+DKTVI+TTLN+AW+ PNS  DLFL SF +G  T  LL HLV+  LD++A
Sbjct: 33  LGKILTEAATEDKTVIITTLNKAWSEPNSTFDLFLHSFHVGKGTKPLLRHLVVACLDEEA 92

Query: 167 FVRCLDIHVH-CFALATEGVDFHSEAHFMSPDYLKMMWRRIDFLRIVLEMGYNFVFTDAD 226
           + RC ++H H C+ + T G+DF  +  FM+PDYLKMMWRRI+FL  +L++ YNF+FT   
Sbjct: 93  YSRCSEVHPHRCYFMKTPGIDFAGDKMFMTPDYLKMMWRRIEFLGTLLKLRYNFIFTI-- 152

Query: 227 VMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARE 286
                 PFP      DFQIACD+Y G   D+ N  NGGF +VK+N R+I+FY YWY +R 
Sbjct: 153 ------PFPRLSKEVDFQIACDRYSGDDKDIHNAVNGGFTFVKANQRTIDFYNYWYMSRL 212

Query: 287 TYPGYHDQDVLNRIKYDFFIDEIGLKIRFLDTAYFGGFCEPSKDLNRVLTMHANCCIGMD 346
            YP  HDQDVL++IK   +  +IGLK+RFLDT YFGGFCEPS+DL++V TMHANCC+G++
Sbjct: 213 RYPDRHDQDVLDQIKGGGYPAKIGLKMRFLDTKYFGGFCEPSRDLDKVCTMHANCCVGLE 272

Query: 347 SKLHDLRIMLEDWKHYMSMPPYRKTSSA--LSWRVPQNC 383
           +K+ DLR ++ DW++Y+S     KT+    ++WR P+NC
Sbjct: 273 NKIKDLRQVIVDWENYVSA---AKTTDGQIMTWRDPENC 300

BLAST of ClCG01G006940 vs. NCBI nr
Match: gi|449463499|ref|XP_004149471.1| (PREDICTED: uncharacterized protein At4g15970 [Cucumis sativus])

HSP 1 Score: 641.0 bits (1652), Expect = 1.4e-180
Identity = 312/369 (84.55%), Positives = 324/369 (87.80%), Query Frame = 1

Query: 14  MFSYSSSSRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFVPSLDD 73
           MF YSS   + HILLLFTAISLSCLVILRELNSLRYFPLFSFST SG PP+ PF+ SL  
Sbjct: 1   MFIYSSFRCSPHILLLFTAISLSCLVILRELNSLRYFPLFSFSTSSGPPPLPPFLLSLPH 60

Query: 74  DDESFPFGLSINLPSLIVFLASRSFPIDVDEYGLDKVLMDAATDDKTVILTTLNEAWASP 133
            D   P                     + DEYGLDKVL DAAT+DKTVILTTLNEAWASP
Sbjct: 61  HDHLSP---------------------EADEYGLDKVLKDAATEDKTVILTTLNEAWASP 120

Query: 134 NSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHF 193
           N+VIDLFLQSFRIGNRTHQLLDHLVIIALDKKAF+RCLDIH+HC +L TEGVDF SEA+F
Sbjct: 121 NAVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFMRCLDIHIHCVSLVTEGVDFRSEAYF 180

Query: 194 MSPDYLKMMWRRIDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIP 253
           MSPDYLKMMWRRIDFLR VLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIP
Sbjct: 181 MSPDYLKMMWRRIDFLRTVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIP 240

Query: 254 DDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIR 313
           DDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFI+EIGLKIR
Sbjct: 241 DDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIEEIGLKIR 300

Query: 314 FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSA 373
           FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRI+LEDWKHYMSMPPY KTSS 
Sbjct: 301 FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRILLEDWKHYMSMPPYLKTSSI 348

Query: 374 LSWRVPQNC 383
            SWRVPQNC
Sbjct: 361 QSWRVPQNC 348

BLAST of ClCG01G006940 vs. NCBI nr
Match: gi|659072690|ref|XP_008466761.1| (PREDICTED: uncharacterized protein At4g15970 [Cucumis melo])

HSP 1 Score: 640.6 bits (1651), Expect = 1.8e-180
Identity = 315/369 (85.37%), Positives = 326/369 (88.35%), Query Frame = 1

Query: 14  MFSYSSSSRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFVPSLDD 73
           MF Y S   +LHILLLFTAISLSCLVILRELNSLRYFPLFSFST SG PPV PF  SL  
Sbjct: 1   MFIYYSFRCSLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTPSGPPPVPPFFLSLPH 60

Query: 74  DDESFPFGLSINLPSLIVFLASRSFPIDVDEYGLDKVLMDAATDDKTVILTTLNEAWASP 133
           DD+     LS+                  DEYGLDKVL DAAT+DKTVILTTLNEAWA+P
Sbjct: 61  DDD-----LSL-----------------ADEYGLDKVLKDAATEDKTVILTTLNEAWAAP 120

Query: 134 NSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHF 193
           N+VIDLFLQSFRIGN+THQLLDHLVIIALDKKAF+RCLDIHVHC AL TEGVDF SEA+F
Sbjct: 121 NAVIDLFLQSFRIGNQTHQLLDHLVIIALDKKAFMRCLDIHVHCVALVTEGVDFRSEAYF 180

Query: 194 MSPDYLKMMWRRIDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIP 253
           MSPDYLKMMWRRIDFLR VLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIP
Sbjct: 181 MSPDYLKMMWRRIDFLRTVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIP 240

Query: 254 DDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIR 313
           DDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFI+EIGLKIR
Sbjct: 241 DDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIEEIGLKIR 300

Query: 314 FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSA 373
           FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRI+LEDWKHYMSMPPY KTSS 
Sbjct: 301 FLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRILLEDWKHYMSMPPYLKTSSI 347

Query: 374 LSWRVPQNC 383
            SWRVPQNC
Sbjct: 361 QSWRVPQNC 347

BLAST of ClCG01G006940 vs. NCBI nr
Match: gi|1009152804|ref|XP_015894295.1| (PREDICTED: uncharacterized protein At4g15970 [Ziziphus jujuba])

HSP 1 Score: 480.3 bits (1235), Expect = 3.1e-132
Identity = 238/365 (65.21%), Positives = 278/365 (76.16%), Query Frame = 1

Query: 20  SSRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFVPSLDDDDESFP 79
           S+  L  LL    + L+ LV+LR+  SLR+              +S F PS         
Sbjct: 41  SASLLRRLLFLAVVLLAGLVLLRDTESLRF--------------LSRFTPS--------- 100

Query: 80  FGLSINLPSLIVFLASRSFPIDV--DEYGLDKVLMDAATDDKTVILTTLNEAWASPNSVI 139
                  PS   F +S   P  V  +EY L+KVL DAA  DKTVILTTLNEAWA+PNSV+
Sbjct: 101 -------PSAYFFPSSSYLPQSVSEEEYTLEKVLKDAAMADKTVILTTLNEAWAAPNSVV 160

Query: 140 DLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPD 199
           DLFL+SFRIG+RT +LL+HLVIIALD KAF RCLD+H HCF L +EG+DFH EA+FM+P 
Sbjct: 161 DLFLESFRIGDRTSRLLNHLVIIALDMKAFKRCLDVHRHCFFLVSEGIDFHQEAYFMTPA 220

Query: 200 YLKMMWRRIDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLD 259
           YLKMMWRRIDFLR VLEMGYNFVFTDAD+MWFRDPFP F ++ADFQIACD +LG PDD++
Sbjct: 221 YLKMMWRRIDFLRSVLEMGYNFVFTDADIMWFRDPFPRFYLDADFQIACDHFLGSPDDVN 280

Query: 260 NRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIRFLDT 319
           N PNGGFNYVKSNN+SIEFYK+WY+++ TYPGYHDQDVLN IK+  FIDEIGLK+RFLDT
Sbjct: 281 NIPNGGFNYVKSNNQSIEFYKFWYASQHTYPGYHDQDVLNIIKFHPFIDEIGLKMRFLDT 340

Query: 320 AYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWR 379
           AYFGG CEPSKDLN V TMHANCC G++SKLHDLRIML+DWK +MS+ P  K SS +SWR
Sbjct: 341 AYFGGLCEPSKDLNEVCTMHANCCFGLNSKLHDLRIMLQDWKKFMSLQPSLKRSSFISWR 375

Query: 380 VPQNC 383
           VPQNC
Sbjct: 401 VPQNC 375

BLAST of ClCG01G006940 vs. NCBI nr
Match: gi|502121038|ref|XP_004497167.1| (PREDICTED: uncharacterized protein At4g15970 [Cicer arietinum])

HSP 1 Score: 478.8 bits (1231), Expect = 9.1e-132
Identity = 233/361 (64.54%), Positives = 275/361 (76.18%), Query Frame = 1

Query: 22  RTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFVPSLDDDDESFPFG 81
           R L   LLF ++SLSCL++ R+++S R+F  F  S     P  S F P + +D    P  
Sbjct: 12  RALATALLFASVSLSCLILFRDVDSYRFFSRFPSSY--PLPRFSSFFPLVSND----PTA 71

Query: 82  LSINLPSLIVFLASRSFPIDVDEYGLDKVLMDAATDDKTVILTTLNEAWASPNSVIDLFL 141
            S                   +EY L+K+L +AA +D+TVILTTLNEAWA+PNSVIDLFL
Sbjct: 72  TS-------------------NEYPLEKILNEAAMEDRTVILTTLNEAWAAPNSVIDLFL 131

Query: 142 QSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSEAHFMSPDYLKM 201
           QSFRIG+RT +LL+HLVIIALD+KAF RC  IH HCF LA+E  DFH EA+FM+P YL M
Sbjct: 132 QSFRIGDRTRRLLNHLVIIALDQKAFARCKVIHAHCFLLASEEADFHEEAYFMTPSYLMM 191

Query: 202 MWRRIDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYLGIPDDLDNRPN 261
           MWRRIDFLR VLEMGYNFVFTDAD+MWFRDPFP F ++ADFQIACD + G  DD+ NRPN
Sbjct: 192 MWRRIDFLRSVLEMGYNFVFTDADIMWFRDPFPQFHLDADFQIACDHFTGSFDDVQNRPN 251

Query: 262 GGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGLKIRFLDTAYFG 321
           GGFN+VKSNNRSIEFYK+WYS+RETYPGYHDQDVLN IKY  FI +IGLK+ FLDT  FG
Sbjct: 252 GGFNFVKSNNRSIEFYKFWYSSRETYPGYHDQDVLNIIKYHPFIADIGLKMTFLDTTNFG 311

Query: 322 GFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKTSSALSWRVPQN 381
           G CEPS+DLN+V TMHANCC GMDSKLHDL++ML+DWKHY+S+PP  K  S +SWRVPQ 
Sbjct: 312 GLCEPSRDLNQVCTMHANCCFGMDSKLHDLKVMLQDWKHYLSLPPSLKKMSVVSWRVPQK 347

Query: 382 C 383
           C
Sbjct: 372 C 347

BLAST of ClCG01G006940 vs. NCBI nr
Match: gi|802597746|ref|XP_012072412.1| (PREDICTED: uncharacterized protein At4g15970 isoform X2 [Jatropha curcas])

HSP 1 Score: 474.9 bits (1221), Expect = 1.3e-130
Identity = 224/374 (59.89%), Positives = 282/374 (75.40%), Query Frame = 1

Query: 12  SAMFSYSSSSRTLHILLLFTAISLSCLVILRELNSLRYFPLFSFSTFSGSPPVSPFVPSL 71
           SA F+    S     +L+F  + +SCL++ R  +SLR+                      
Sbjct: 30  SATFNMLPESAIPRAVLVFAVLCISCLLLYRAADSLRFL--------------------- 89

Query: 72  DDDDESFPFGLSINLPSLIVFLASRSFPIDV-DEYGLDKVLMDAATDDKTVILTTLNEAW 131
                SFP G S++ P +   L   +  + V +E  L+ VL DAA +DKTVILTTLNEAW
Sbjct: 90  -----SFPSGSSVSFPHIFPSLVFDNDSVPVRNEQKLENVLEDAAMEDKTVILTTLNEAW 149

Query: 132 ASPNSVIDLFLQSFRIGNRTHQLLDHLVIIALDKKAFVRCLDIHVHCFALATEGVDFHSE 191
           A+PNS++DLFL SFRIG  T +LL+HLVIIALD+KA+ RC+++H+HCFAL TEG+DF  E
Sbjct: 150 AAPNSIVDLFLASFRIGVHTRRLLNHLVIIALDQKAYTRCMELHIHCFALVTEGIDFRKE 209

Query: 192 AHFMSPDYLKMMWRRIDFLRIVLEMGYNFVFTDADVMWFRDPFPFFDINADFQIACDQYL 251
           A+FM+P Y+KMMWRRIDFLR VLE+GYNFVFTDADVMWFRDPFP F  +ADFQIACD + 
Sbjct: 210 AYFMTPAYVKMMWRRIDFLRSVLELGYNFVFTDADVMWFRDPFPRFYSDADFQIACDHFT 269

Query: 252 GIPDDLDNRPNGGFNYVKSNNRSIEFYKYWYSARETYPGYHDQDVLNRIKYDFFIDEIGL 311
           G   +++N+PNGGFN+V+SNNRSIEFYK+WYS+RETYPGYHDQDVLN IK+D F++++GL
Sbjct: 270 GSSVNIENKPNGGFNFVRSNNRSIEFYKFWYSSRETYPGYHDQDVLNFIKFDSFVEDLGL 329

Query: 312 KIRFLDTAYFGGFCEPSKDLNRVLTMHANCCIGMDSKLHDLRIMLEDWKHYMSMPPYRKT 371
           K+RFLDTAYFGG CEPSKDL+ V TMHANCC G+DSKLHDLR+ML+DW H++S+PP  K 
Sbjct: 330 KMRFLDTAYFGGLCEPSKDLSLVCTMHANCCFGLDSKLHDLRVMLQDWMHFLSLPPSMKR 377

Query: 372 SSALSWRVPQNCRY 385
           S  +SWRVPQNCR+
Sbjct: 390 SLIVSWRVPQNCRF 377
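The percentages in each HSP summary line follow from the reported counts (for example, 238/365 gives 65.21%). The following is a minimal Python sketch for re-deriving them, assuming the summary line keeps the layout used in the blocks above. The function name `parse_hsp_stats` and the returned keys are hypothetical and not part of any BLAST tool.

```python
import re

# Matches the "Identity = a/b (x%), Positives = c/d (y%)" summary line.
# The optional "i" also accepts the "Postives" spelling some reports emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return recomputed and reported identity/positives percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        # Recomputed from the raw counts, rounded to two decimals.
        "identity": round(100 * int(ident) / int(ident_len), 2),
        "positives": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, for comparison.
        "reported_identity": float(ident_pct),
        "reported_positives": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 224/374 (59.89%), Positives = 282/374 (75.40%), Query Frame = 1"
)
```

Comparing the recomputed and reported values is a quick way to catch copy or extraction errors in a summary line.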

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
Y4597_ARATH	8.3e-85	53.41	Uncharacterized protein At4g15970 OS=Arabidopsis thaliana GN=At4g15970 PE=2 SV=1
Y1869_ARATH	6.5e-53	39.36	Uncharacterized protein At1g28695 OS=Arabidopsis thaliana GN=At1g28695 PE=2 SV=1
Match Name	E-value	Identity	Description
A0A0A0KPV2_CUCSA	9.7e-181	84.55	Glycosyltransferase OS=Cucumis sativus GN=Csa_5G107050 PE=3 SV=1
A0A072VSD7_MEDTR	2.0e-130	63.59	Glycosyltransferase OS=Medicago truncatula GN=MTR_1g112300 PE=3 SV=1
A0A067KPV4_JATCU	7.8e-130	59.95	Glycosyltransferase OS=Jatropha curcas GN=JCGZ_04849 PE=3 SV=1
A0A0B2RYA6_GLYSO	6.6e-129	63.07	Uncharacterized protein OS=Glycine soja GN=glysoja_013623 PE=4 SV=1
K7LLW7_SOYBN	6.6e-129	63.07	Uncharacterized protein OS=Glycine max GN=GLYMA_10G280600 PE=4 SV=1
Match Name	E-value	Identity	Description
AT1G14590.1	6.5e-120	70.32	Nucleotide-diphospho-sugar transferase family protein
AT2G02061.1	1.5e-111	59.64	Nucleotide-diphospho-sugar transferase family protein
AT5G44820.1	4.0e-101	49.72	Nucleotide-diphospho-sugar transferase family protein
AT4G19970.1	1.1e-98	58.18	Nucleotide-diphospho-sugar transferase, predicted (InterPro:IPR00506...
AT4G15970.1	2.8e-86	53.41	Nucleotide-diphospho-sugar transferase family protein
Match Name	E-value	Identity	Description
gi|449463499|ref|XP_004149471.1|	1.4e-180	84.55	PREDICTED: uncharacterized protein At4g15970 [Cucumis sativus]
gi|659072690|ref|XP_008466761.1|	1.8e-180	85.37	PREDICTED: uncharacterized protein At4g15970 [Cucumis melo]
gi|1009152804|ref|XP_015894295.1|	3.1e-132	65.21	PREDICTED: uncharacterized protein At4g15970 [Ziziphus jujuba]
gi|502121038|ref|XP_004497167.1|	9.1e-132	64.54	PREDICTED: uncharacterized protein At4g15970 [Cicer arietinum]
gi|802597746|ref|XP_012072412.1|	1.3e-130	59.89	PREDICTED: uncharacterized protein At4g15970 isoform X2 [Jatropha curcas]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR005069	Nucl-diP-sugar_transferase
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0016310 phosphorylation
biological_process GO:0008152 metabolic process
biological_process GO:0008150 biological_process
biological_process GO:0071555 cell wall organization
cellular_component GO:0005575 cellular_component
cellular_component GO:0000139 Golgi membrane
cellular_component GO:0005794 Golgi apparatus
molecular_function GO:0016301 kinase activity
molecular_function GO:0016740 transferase activity
molecular_function GO:0003674 molecular_function
molecular_function GO:0016757 transferase activity, transferring glycosyl groups
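The GO table above lists one term per row as category, accession, and term name. The following is a small Python sketch for grouping such rows by category, assuming each row is whitespace-separated with the term name as the free-text remainder. The helper name `group_go_terms` is hypothetical, and only a few rows from the table are included as sample input.

```python
from collections import defaultdict

# A few rows copied from the GO table above, used as sample input.
GO_ROWS = [
    "biological_process GO:0016310 phosphorylation",
    "biological_process GO:0071555 cell wall organization",
    "cellular_component GO:0000139 Golgi membrane",
    "molecular_function GO:0016757 transferase activity, transferring glycosyl groups",
]


def group_go_terms(rows):
    """Group 'category accession name' rows into {category: [(accession, name)]}."""
    grouped = defaultdict(list)
    for row in rows:
        # Split only twice so that multi-word term names stay intact.
        category, accession, name = row.split(maxsplit=2)
        grouped[category].append((accession, name))
    return dict(grouped)


terms_by_category = group_go_terms(GO_ROWS)
```

Splitting on the first two whitespace runs only keeps term names such as "cell wall organization" in one piece.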

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
ClCG01G006940.1	ClCG01G006940.1	mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR005069	Nucleotide-diphospho-sugar transferase	PFAM	PF03407	Nucleotid_trans	coord: 153..351, score: 7.7
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED	coord: 231..364, score: 9.9E-43; coord: 73..190, score: 9.9E-43; coord: 1..12, score: 9.9
None	No IPR available	PANTHER	PTHR24015:SF419	NUCLEOTIDE-DIPHOSPHO-SUGAR TRANSFERASE DOMAIN-CONTAINING PROTEIN-RELATED	coord: 1..12, score: 9.9E-43; coord: 73..190, score: 9.9E-43; coord: 231..364, score: 9.9