ClCG01G010820 (gene) Watermelon (Charleston Gray)

Name: ClCG01G010820
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: Galactinol synthase family protein
Location: CG_Chr01 : 16947797 .. 16949281 (-)
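
The location string above can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the `Location` type and `parse_location` helper are hypothetical names, and the length calculation assumes the coordinates are 1-based and inclusive, which is a common convention for such records but is not stated on this page.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'seqid : start .. end (strand)' record."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, see lead-in).
        return self.end - self.start + 1


# Matches strings such as "CG_Chr01 : 16947797 .. 16949281 (-)".
LOCATION_RE = re.compile(r"^\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text: str) -> Location:
    """Parse a location string into seqid, start, end and strand."""
    match = LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("CG_Chr01 : 16947797 .. 16949281 (-)")
print(loc.seqid, loc.strand, loc.length)  # CG_Chr01 - 1485
```

Under the inclusive-coordinate assumption the gene spans 1485 bases on the minus strand of CG_Chr01.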
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
AAAGAAAAAGAAAAAGAAAAAGAAGAAAGTGGCAATGGCTCCTGAATTGAGCGGTGGTAAGTGCGGATATGTGACGTTTCTGGCTGGAAACGGAGATTATGTGAAGGGAGTGGTGGGACTGGCGAAGGGGTTGAGGAAAGTGAAGAGCAAGTATCCGTTGGTGGTGGCGGTTCTTCCGGACGTGCCGGAGGAACACCGGGAATTGCTGAGGTGGCAGGGGTGCGTGGTGAAAGAGATAGAACCGGTTTACCCACCTGAGAACCAAACCCACTTCGCCATGCCTTACTACGTCATCAATTACTCCAAGCTAAGGATTTGGGAGGTACGTTTTGTTTTTTTTTCTATCTTTATAATGAATTATATGCTTGAACTTATTAATTGAAATTGGTGTGTAGTTTGTGGAGTTCAAGAAAATGATATATTTGGATGGAGACATTCAAGTGATGGAGAATATAGACCATCTATTCCAGATGGAGGAGTCGTACTTTTACGCAGTGATGGATTGTTTTTGCGAGAAGACGTGGAGCCATACGGCTCAGTACAAGATTGGGTATTGCCAGCAGAGGCCGAACGAAGTGGAGTGGGGTCCCCAGTTGGGGCCGAAGCCTCCTCTTTACTTCAACGCTGGGATGTTCGTTTACGAGCCCAATTTCCAGACTTACCGTGCCCTTCTCTCCACTCTCAACTCCACCCCTCCCACCCCCTTCGCCGAACAGGTACCCTTTTCACTTGTTACTATTACCATTTCAAAACTGGATTCTGTAATTTCTGATAAATAAAACCTTTTTTTGTTATTGGACAGGACTTCTTAAACATGTTCTTCAAGGAGAAATACAGGCCAATTCCAGCAGTTTACAACCTGGTGATGGCGATGCTATGGCGCCACCCGGAAAACGTGGACCTCCACAAAGTCAAGGTGGTTCACTACTGCGCTGCCGTAAGTATATTATAATTAATTTACAATACTAATTCATAGTTTAATTAATATTCATTTTCATGATATTTTTAATAATTAATTAATGAGTTTGTTGTTGATCAGGGTTCGAAGCCATGGAGGTACACAGGCAAGGAGGAGAACATGGACAGGGAAGACGTAAAGATGCTGGTCAAGAAATGGTGGGAAATTTACGACGACGAGACCTTGGACTTCATCAACTATAAGATCGATGATGGTGACACCGACGCTCGACACCCATTCCTAGCGGCGCTGTCGGAAGCTGGTGCTGTTCACTACCACAATGCCCCATCCGCGGCTTAATTTCATGTCGGACCTGACTCAATAAGACTCATTATCAAATACATAAATACATTGCTGATATAGTTAATTACCTGAAAATATATATATATTTTGAGTTTGAGTTTGTATTATTAATTTTGAGTTGGATTCTCCCATTAAGAATCAAAAAGATAGAGGAGAGAGTGTTGTTATGAATTTGTGAGTTTTAATATGTAAATTATCAATTTGTGAGCTTTAATAATTAATTC

mRNA sequence

AAAGAAAAAGAAAAAGAAAAAGAAGAAAGTGGCAATGGCTCCTGAATTGAGCGGTGGTAAGTGCGGATATGTGACGTTTCTGGCTGGAAACGGAGATTATGTGAAGGGAGTGGTGGGACTGGCGAAGGGGTTGAGGAAAGTGAAGAGCAAGTATCCGTTGGTGGTGGCGGTTCTTCCGGACGTGCCGGAGGAACACCGGGAATTGCTGAGGTGGCAGGGGTGCGTGGTGAAAGAGATAGAACCGGTTTACCCACCTGAGAACCAAACCCACTTCGCCATGCCTTACTACGTCATCAATTACTCCAAGCTAAGGATTTGGGAGTTTGTGGAGTTCAAGAAAATGATATATTTGGATGGAGACATTCAAGTGATGGAGAATATAGACCATCTATTCCAGATGGAGGAGTCGTACTTTTACGCAGTGATGGATTGTTTTTGCGAGAAGACGTGGAGCCATACGGCTCAGTACAAGATTGGGTATTGCCAGCAGAGGCCGAACGAAGTGGAGTGGGGTCCCCAGTTGGGGCCGAAGCCTCCTCTTTACTTCAACGCTGGGATGTTCGTTTACGAGCCCAATTTCCAGACTTACCGTGCCCTTCTCTCCACTCTCAACTCCACCCCTCCCACCCCCTTCGCCGAACAGGACTTCTTAAACATGTTCTTCAAGGAGAAATACAGGCCAATTCCAGCAGTTTACAACCTGGTGATGGCGATGCTATGGCGCCACCCGGAAAACGTGGACCTCCACAAAGTCAAGGTGGTTCACTACTGCGCTGCCGGTTCGAAGCCATGGAGGTACACAGGCAAGGAGGAGAACATGGACAGGGAAGACGTAAAGATGCTGGTCAAGAAATGGTGGGAAATTTACGACGACGAGACCTTGGACTTCATCAACTATAAGATCGATGATGGTGACACCGACGCTCGACACCCATTCCTAGCGGCGCTGTCGGAAGCTGGTGCTGTTCACTACCACAATGCCCCATCCGCGGCTTAATTTCATGTCGGACCTGACTCAATAAGACTCATTATCAAATACATAAATACATTGCTGATATAGTTAATTACCTGAAAATATATATATATTTTGAGTTTGAGTTTGTATTATTAATTTTGAGTTGGATTCTCCCATTAAGAATCAAAAAGATAGAGGAGAGAGTGTTGTTATGAATTTGTGAGTTTTAATATGTAAATTATCAATTTGTGAGCTTTAATAATTAATTC

Coding sequence (CDS)

ATGGCTCCTGAATTGAGCGGTGGTAAGTGCGGATATGTGACGTTTCTGGCTGGAAACGGAGATTATGTGAAGGGAGTGGTGGGACTGGCGAAGGGGTTGAGGAAAGTGAAGAGCAAGTATCCGTTGGTGGTGGCGGTTCTTCCGGACGTGCCGGAGGAACACCGGGAATTGCTGAGGTGGCAGGGGTGCGTGGTGAAAGAGATAGAACCGGTTTACCCACCTGAGAACCAAACCCACTTCGCCATGCCTTACTACGTCATCAATTACTCCAAGCTAAGGATTTGGGAGTTTGTGGAGTTCAAGAAAATGATATATTTGGATGGAGACATTCAAGTGATGGAGAATATAGACCATCTATTCCAGATGGAGGAGTCGTACTTTTACGCAGTGATGGATTGTTTTTGCGAGAAGACGTGGAGCCATACGGCTCAGTACAAGATTGGGTATTGCCAGCAGAGGCCGAACGAAGTGGAGTGGGGTCCCCAGTTGGGGCCGAAGCCTCCTCTTTACTTCAACGCTGGGATGTTCGTTTACGAGCCCAATTTCCAGACTTACCGTGCCCTTCTCTCCACTCTCAACTCCACCCCTCCCACCCCCTTCGCCGAACAGGACTTCTTAAACATGTTCTTCAAGGAGAAATACAGGCCAATTCCAGCAGTTTACAACCTGGTGATGGCGATGCTATGGCGCCACCCGGAAAACGTGGACCTCCACAAAGTCAAGGTGGTTCACTACTGCGCTGCCGGTTCGAAGCCATGGAGGTACACAGGCAAGGAGGAGAACATGGACAGGGAAGACGTAAAGATGCTGGTCAAGAAATGGTGGGAAATTTACGACGACGAGACCTTGGACTTCATCAACTATAAGATCGATGATGGTGACACCGACGCTCGACACCCATTCCTAGCGGCGCTGTCGGAAGCTGGTGCTGTTCACTACCACAATGCCCCATCCGCGGCTTAA
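
The 5' UTR length can be recovered by locating the coding sequence inside the mRNA. The sketch below is illustrative only: `cds_offset` is a hypothetical helper, and the two constants hold just the leading bases of the mRNA and CDS records above so the block stays short. In practice the full sequences would be passed in.

```python
def cds_offset(mrna: str, cds: str) -> int:
    """Return the 0-based position where the CDS starts within the mRNA.

    That offset equals the number of bases in the 5' UTR.
    """
    index = mrna.upper().find(cds.upper())
    if index < 0:
        raise ValueError("CDS not found in mRNA")
    return index


# Leading bases copied from the mRNA and CDS records above (truncated on purpose).
MRNA_HEAD = "AAAGAAAAAGAAAAAGAAAAAGAAGAAAGTGGCAATGGCTCCTGAATTGAGC"
CDS_HEAD = "ATGGCTCCTGAATTGAGC"

five_prime_utr_length = cds_offset(MRNA_HEAD, CDS_HEAD)
print(five_prime_utr_length)  # 34
```

The same function applied to the end of the sequences would give the 3' UTR length as `len(mrna) - offset - len(cds)`.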

Protein sequence

MAPELSGGKCGYVTFLAGNGDYVKGVVGLAKGLRKVKSKYPLVVAVLPDVPEEHRELLRWQGCVVKEIEPVYPPENQTHFAMPYYVINYSKLRIWEFVEFKKMIYLDGDIQVMENIDHLFQMEESYFYAVMDCFCEKTWSHTAQYKIGYCQQRPNEVEWGPQLGPKPPLYFNAGMFVYEPNFQTYRALLSTLNSTPPTPFAEQDFLNMFFKEKYRPIPAVYNLVMAMLWRHPENVDLHKVKVVHYCAAGSKPWRYTGKEENMDREDVKMLVKKWWEIYDDETLDFINYKIDDGDTDARHPFLAALSEAGAVHYHNAPSAA
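
The protein sequence follows from the CDS by codon translation. The following sketch assumes the standard genetic code (NCBI translation table 1); `translate` is a hypothetical helper written for this illustration, and the input is only the first 30 bases of the CDS above rather than the full record.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_at_stop: bool = True) -> str:
    """Translate a coding sequence in frame 1; unknown codons become 'X'."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and stop_at_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS above (truncated for brevity).
CDS_START = "ATGGCTCCTGAATTGAGCGGTGGTAAGTGC"
print(translate(CDS_START))  # MAPELSGGKC
```

The output matches the first ten residues of the protein sequence above, and the final CDS codons `GCG GCT TAA` translate to the terminal `AA` followed by a stop.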
BLAST of ClCG01G010820 vs. Swiss-Prot
Match: GOLS2_SOLLC (Galactinol synthase 2 OS=Solanum lycopersicum GN=GOLS2 PE=2 SV=1)

HSP 1 Score: 506.9 bits (1304), Expect = 1.6e-142
Identity = 237/313 (75.72%), Positives = 262/313 (83.71%), Query Frame = 1

Query: 12  YVTFLAGNGDYVKGVVGLAKGLRKVKSKYPLVVAVLPDVPEEHRELLRWQGCVVKEIEPV 71
           YVTFLAGNGDY KGVVGL KGLRK KS YPLVVA LPDVPEEHR +L  QGC+V+EIEPV
Sbjct: 26  YVTFLAGNGDYWKGVVGLVKGLRKAKSAYPLVVACLPDVPEEHRRILINQGCIVREIEPV 85

Query: 72  YPPENQTHFAMPYYVINYSKLRIWEFVEFKKMIYLDGDIQVMENIDHLFQMEESYFYAVM 131
           YPP NQT FAM YYVINYSKLRIWEFVE+ KMIYLDGDIQV +NIDHLF + + YFYAVM
Sbjct: 86  YPPHNQTQFAMAYYVINYSKLRIWEFVEYSKMIYLDGDIQVFDNIDHLFDLPDGYFYAVM 145

Query: 132 DCFCEKTWSHTAQYKIGYCQQRPNEVEWGPQLGPKPPLYFNAGMFVYEPNFQTYRALLST 191
           DCFCEKTWSHT QYK+GYCQQ P++V+W   LGPKP LYFNAGMFVYEP+  TY  LL T
Sbjct: 146 DCFCEKTWSHTPQYKVGYCQQCPDKVQWTEDLGPKPSLYFNAGMFVYEPSLSTYDDLLKT 205

Query: 192 LNSTPPTPFAEQDFLNMFFKEKYRPIPAVYNLVMAMLWRHPENVDLHKVKVVHYCAAGSK 251
           L  TPPTPFAEQDFLNM+F++ Y+PIP  YNLV+AMLWRHPENVDL KVKVVHYCAAGSK
Sbjct: 206 LKVTPPTPFAEQDFLNMYFRDVYKPIPNDYNLVLAMLWRHPENVDLEKVKVVHYCAAGSK 265

Query: 252 PWRYTGKEENMDREDVKMLVKKWWEIYDDETLDFINYKI----DDGDTDARHPFLAALSE 311
           PWRYTGKEENMDRED+KML+KKWW+IYDDE+LD+ N  +     DG+ +A+   + ALSE
Sbjct: 266 PWRYTGKEENMDREDIKMLIKKWWDIYDDESLDYKNSNVVMNAVDGEVEAQ-KIMEALSE 325

Query: 312 AGAVHYHNAPSAA 321
           AG VHY  APSAA
Sbjct: 326 AGVVHYITAPSAA 337
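
The percentages in each HSP header are simply the identity and positive-match counts divided by the alignment length. As a rough sketch, the hypothetical helper `hsp_percentages` below pulls the two `count/length` ratios out of a header line and recomputes them. It relies only on the ratio pattern, not on the field labels.

```python
import re


def hsp_percentages(header_line: str) -> tuple[float, float]:
    """Recompute (identity %, positives %) from an HSP header line.

    The first 'a/b' ratio is taken as identities, the second as positives.
    """
    ratios = re.findall(r"(\d+)/(\d+)", header_line)
    if len(ratios) < 2:
        raise ValueError("expected two count/length ratios in the header line")
    (ident, ident_len), (pos, pos_len) = ratios[:2]
    identity_pct = round(100 * int(ident) / int(ident_len), 2)
    positives_pct = round(100 * int(pos) / int(pos_len), 2)
    return identity_pct, positives_pct


line = "Identity = 237/313 (75.72%), Positives = 262/313 (83.71%), Query Frame = 1"
print(hsp_percentages(line))  # (75.72, 83.71)
```

The recomputed values agree with the figures reported for the GOLS2_SOLLC match.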

BLAST of ClCG01G010820 vs. Swiss-Prot
Match: GOLS2_ARATH (Galactinol synthase 2 OS=Arabidopsis thaliana GN=GOLS2 PE=1 SV=1)

HSP 1 Score: 498.4 bits (1282), Expect = 5.7e-140
Identity = 241/335 (71.94%), Positives = 269/335 (80.30%), Query Frame = 1

Query: 1   MAPELS------------GGKCGYVTFLAGNGDYVKGVVGLAKGLRKVKSKYPLVVAVLP 60
           MAPE++            G K  YVTFLAG GDYVKGVVGLAKGLRK KSKYPLVVAVLP
Sbjct: 1   MAPEINTKLTVPVHSATGGEKRAYVTFLAGTGDYVKGVVGLAKGLRKAKSKYPLVVAVLP 60

Query: 61  DVPEEHRELLRWQGCVVKEIEPVYPPENQTHFAMPYYVINYSKLRIWEFVEFKKMIYLDG 120
           DVPE+HR+ L  QGCVVKEIEPVYPPENQT FAM YYVINYSKLRIWEFVE+ KMIYLDG
Sbjct: 61  DVPEDHRKQLVDQGCVVKEIEPVYPPENQTEFAMAYYVINYSKLRIWEFVEYNKMIYLDG 120

Query: 121 DIQVMENIDHLFQMEESYFYAVMDCFCEKTWSHTAQYKIGYCQQRPNEVEW-GPQLGPKP 180
           DIQV +NIDHLF +    FYAVMDCFCEKTWSH+ QYKIGYCQQ P++V W   +LGPKP
Sbjct: 121 DIQVFDNIDHLFDLPNGQFYAVMDCFCEKTWSHSPQYKIGYCQQCPDKVTWPEAKLGPKP 180

Query: 181 PLYFNAGMFVYEPNFQTYRALLSTLNSTPPTPFAEQDFLNMFFKEKYRPIPAVYNLVMAM 240
           PLYFNAGMFVYEPN  TY  LL T+   PPT FAEQDFLNM+FK+ Y+PIP VYNLV+AM
Sbjct: 181 PLYFNAGMFVYEPNLSTYHNLLETVKIVPPTLFAEQDFLNMYFKDIYKPIPPVYNLVLAM 240

Query: 241 LWRHPENVDLHKVKVVHYCAAGSKPWRYTGKEENMDREDVKMLVKKWWEIYDDETLDFIN 300
           LWRHPEN++L +VKVVHYCAAG+KPWR+TG+EENMDRED+KMLVKKWW+IY+DE+LD+ N
Sbjct: 241 LWRHPENIELDQVKVVHYCAAGAKPWRFTGEEENMDREDIKMLVKKWWDIYNDESLDYKN 300

Query: 301 YKIDDGDTDAR--HPFLAALSEAGAVHYHNAPSAA 321
             I D     +    F+ ALSEAGA+ Y  APSAA
Sbjct: 301 VVIGDSHKKQQTLQQFIEALSEAGALQYVKAPSAA 335

BLAST of ClCG01G010820 vs. Swiss-Prot
Match: GOLS1_ARATH (Galactinol synthase 1 OS=Arabidopsis thaliana GN=GOLS1 PE=1 SV=1)

HSP 1 Score: 495.4 bits (1274), Expect = 4.8e-139
Identity = 236/322 (73.29%), Positives = 266/322 (82.61%), Query Frame = 1

Query: 3   PELSGGKCGYVTFLAGNGDYVKGVVGLAKGLRKVKSKYPLVVAVLPDVPEEHRELLRWQG 62
           P +      YVTFLAGNGDYVKGVVGLAKGLRKVKS YPLVVA+LPDVPEEHR +L  QG
Sbjct: 23  PSVQDSDRAYVTFLAGNGDYVKGVVGLAKGLRKVKSAYPLVVAMLPDVPEEHRRILVDQG 82

Query: 63  CVVKEIEPVYPPENQTHFAMPYYVINYSKLRIWEFVEFKKMIYLDGDIQVMENIDHLFQM 122
           C+V+EIEPVYPPENQT FAM YYVINYSKLRIW+FVE+ KMIYLDGDIQV ENIDHLF +
Sbjct: 83  CIVREIEPVYPPENQTQFAMAYYVINYSKLRIWKFVEYSKMIYLDGDIQVYENIDHLFDL 142

Query: 123 EESYFYAVMDCFCEKTWSHTAQYKIGYCQQRPNEVEW-GPQLGPKPPLYFNAGMFVYEPN 182
            + Y YAVMDCFCEKTWSHT QYKI YCQQ P++V+W   +LG  P LYFNAGMF+YEPN
Sbjct: 143 PDGYLYAVMDCFCEKTWSHTPQYKIRYCQQCPDKVQWPKAELGEPPALYFNAGMFLYEPN 202

Query: 183 FQTYRALLSTLNSTPPTPFAEQDFLNMFFKEKYRPIPAVYNLVMAMLWRHPENVDLHKVK 242
            +TY  LL TL  TPPTPFAEQDFLNM+FK+ Y+PIP VYNLV+AMLWRHPENV+L KVK
Sbjct: 203 LETYEDLLRTLKITPPTPFAEQDFLNMYFKKIYKPIPLVYNLVLAMLWRHPENVELGKVK 262

Query: 243 VVHYCAAGSKPWRYTGKEENMDREDVKMLVKKWWEIYDDETLDFIN-YKIDDGDTDARH- 302
           VVHYCAAGSKPWRYTGKE NM+RED+KMLVKKWW+IYDDE+LD+     + D + D  + 
Sbjct: 263 VVHYCAAGSKPWRYTGKEANMEREDIKMLVKKWWDIYDDESLDYKKPVTVVDTEVDLVNL 322

Query: 303 -PFLAALSEAGAVHYHNAPSAA 321
            PF+ AL+EAG ++Y  APSAA
Sbjct: 323 KPFITALTEAGRLNYVTAPSAA 344

BLAST of ClCG01G010820 vs. Swiss-Prot
Match: GOLS1_AJURE (Galactinol synthase 1 OS=Ajuga reptans GN=GOLS1 PE=1 SV=1)

HSP 1 Score: 488.0 bits (1255), Expect = 7.7e-137
Identity = 232/315 (73.65%), Positives = 258/315 (81.90%), Query Frame = 1

Query: 7   GGKCGYVTFLAGNGDYVKGVVGLAKGLRKVKSKYPLVVAVLPDVPEEHRELLRWQGCVVK 66
           G K GYVTFLAGNGDYVKGVVGLAKGLRKVKS YPLVVA+LPDVPEEHRELLR QGC+VK
Sbjct: 20  GAKKGYVTFLAGNGDYVKGVVGLAKGLRKVKSAYPLVVAILPDVPEEHRELLRSQGCIVK 79

Query: 67  EIEPVYPPENQTHFAMPYYVINYSKLRIWEFVEFKKMIYLDGDIQVMENIDHLFQMEESY 126
           EIEP+YPP NQ  FAM YYVINYSKLRIW F E+ KM+YLD DIQV ENIDHL    + Y
Sbjct: 80  EIEPIYPPANQIQFAMAYYVINYSKLRIWNFEEYSKMVYLDADIQVYENIDHLLDTPDGY 139

Query: 127 FYAVMDCFCEKTWSHTAQYKIGYCQQRPNEVEWGPQLGPKPPLYFNAGMFVYEPNFQTYR 186
           FYAVMDCFCEKTWSH+ Q+ IGYCQQ PN+V W  Q+G  PPLYFNAGMFV+EP+  TY+
Sbjct: 140 FYAVMDCFCEKTWSHSRQFSIGYCQQCPNKVTWPAQMGSPPPLYFNAGMFVFEPSKTTYQ 199

Query: 187 ALLSTLNSTPPTPFAEQDFLNMFFKEKYRPIPAVYNLVMAMLWRHPENVDLHKVKVVHYC 246
            LL TL  TPPTPFAEQDFLNMFF+  Y+PIP VYNLV+AMLWRHPENV+L KV+VVHYC
Sbjct: 200 TLLHTLRITPPTPFAEQDFLNMFFEPIYKPIPLVYNLVLAMLWRHPENVELEKVQVVHYC 259

Query: 247 AAGSKPWRYTGKEENMDREDVKMLVKKWWEIYDDETLDF-INYKIDDGDTDARHPFLAAL 306
           AAGSKPWRYTG+E NMDRED+KMLVKKWW++Y+DE+LDF     I   +T +   F+A+L
Sbjct: 260 AAGSKPWRYTGQEANMDREDIKMLVKKWWDVYNDESLDFKAEDSIAGEETFSMPSFIASL 319

Query: 307 SEAGAVHYHNAPSAA 321
            E  AV Y  APSAA
Sbjct: 320 PEP-AVSYIPAPSAA 333

BLAST of ClCG01G010820 vs. Swiss-Prot
Match: GOLS3_ARATH (Galactinol synthase 3 OS=Arabidopsis thaliana GN=GOLS3 PE=1 SV=1)

HSP 1 Score: 478.0 bits (1229), Expect = 8.0e-134
Identity = 227/334 (67.96%), Positives = 263/334 (78.74%), Query Frame = 1

Query: 1   MAPELSGG------KCGYVTFLAGNGDYVKGVVGLAKGLRKVKSKYPLVVAVLPDVPEEH 60
           MAPE++        K  YVTFLAG GDYVKGVVGLAKGLRK KSKYPLVVAVLPDVP +H
Sbjct: 1   MAPEMNNKLSYGEKKRAYVTFLAGTGDYVKGVVGLAKGLRKTKSKYPLVVAVLPDVPADH 60

Query: 61  RELLRWQGCVVKEIEPVYPPENQTHFAMPYYVINYSKLRIWEFVEFKKMIYLDGDIQVME 120
           R  L  QGCV+KEI+PVYPP+NQT FAM YYV+NYSKLRIW+FVE+ K+IYLDGDIQV E
Sbjct: 61  RRQLLDQGCVIKEIQPVYPPDNQTQFAMAYYVLNYSKLRIWKFVEYSKLIYLDGDIQVFE 120

Query: 121 NIDHLFQMEESYFYAVMDCFCEKTWSHTAQYKIGYCQQRPNEVEW-GPQLGPKPPLYFNA 180
           NIDHLF + +  FYAV DCFCEKTWSHT QYKIGYCQQ P++V W   +LGPKPPLYFNA
Sbjct: 121 NIDHLFDLPDGNFYAVKDCFCEKTWSHTPQYKIGYCQQCPDKVTWPESELGPKPPLYFNA 180

Query: 181 GMFVYEPNFQTYRALLSTLNSTPPTPFAEQDFLNMFFKEKYRPIPAVYNLVMAMLWRHPE 240
           GMFVYEP+  TY  LL TL   PPTPFAEQDFLNM+FK+ Y+PIP VYNLV+AMLWRHPE
Sbjct: 181 GMFVYEPSLPTYYNLLETLKVVPPTPFAEQDFLNMYFKDIYKPIPPVYNLVLAMLWRHPE 240

Query: 241 NVDLHKVKVVHYCAAGSKPWRYTGKEENMDREDVKMLVKKWWEIYDDETLDFINYKIDDG 300
           N++L++ KVVHYCAAG+KPWR+TG+E NM+RED+KMLV+KWW+IY+DE+LD+ N+ +  G
Sbjct: 241 NIELNEAKVVHYCAAGAKPWRFTGQEGNMEREDIKMLVEKWWDIYNDESLDYKNFNVHCG 300

Query: 301 DTDARH-------PFLAALSEAGAVHYHNAPSAA 321
             +  H        F   LSEA  +    APSAA
Sbjct: 301 QKEDVHRKPKTLPQFFTDLSEADVLQCAKAPSAA 334

BLAST of ClCG01G010820 vs. TrEMBL
Match: A0A067JHE5_JATCU (Hexosyltransferase OS=Jatropha curcas GN=JCGZ_23194 PE=2 SV=1)

HSP 1 Score: 527.7 bits (1358), Expect = 9.8e-147
Identity = 241/321 (75.08%), Positives = 276/321 (85.98%), Query Frame = 1

Query: 1   MAPELSGGKCGYVTFLAGNGDYVKGVVGLAKGLRKVKSKYPLVVAVLPDVPEEHRELLRW 60
           +  + S   C YVTFLAGNGDY+KGVVGLAKGLRKVKSKYPLVVA+LPDVPE+HR++L  
Sbjct: 15  LVKQASISSCAYVTFLAGNGDYIKGVVGLAKGLRKVKSKYPLVVAILPDVPEDHRKILVS 74

Query: 61  QGCVVKEIEPVYPPENQTHFAMPYYVINYSKLRIWEFVEFKKMIYLDGDIQVMENIDHLF 120
           QGC+VKEIEPVYPPENQT FAM YYVINYSKLRIWEFVE+ KMIYLDGDIQV ENIDHLF
Sbjct: 75  QGCIVKEIEPVYPPENQTQFAMAYYVINYSKLRIWEFVEYSKMIYLDGDIQVFENIDHLF 134

Query: 121 QMEESYFYAVMDCFCEKTWSHTAQYKIGYCQQRPNEVEWGPQLGPKPPLYFNAGMFVYEP 180
            +++ YFYAVMDC+CEKTWSH+ Q+KIGYCQQ P+ V+W  +LGP PPLYFNAGMFVYEP
Sbjct: 135 DLQDGYFYAVMDCYCEKTWSHSVQHKIGYCQQCPDRVKWPAELGPAPPLYFNAGMFVYEP 194

Query: 181 NFQTYRALLSTLNSTPPTPFAEQDFLNMFFKEKYRPIPAVYNLVMAMLWRHPENVDLHKV 240
           +  TY  LL TL  TPPTPFAEQDFLNMFFK+ Y+PIP +YNLV+A++WRHPEN++++K 
Sbjct: 195 SLSTYDDLLKTLKVTPPTPFAEQDFLNMFFKDIYKPIPPIYNLVLALIWRHPENIEVNKA 254

Query: 241 KVVHYCAAGSKPWRYTGKEENMDREDVKMLVKKWWEIYDDETLDFIN-YKIDDGDTDARH 300
           KVVHYCAAGSKPWRYTGKEENMDRED+KMLV+KWW+IY+DE+LD+ N      GD     
Sbjct: 255 KVVHYCAAGSKPWRYTGKEENMDREDIKMLVQKWWDIYNDESLDYRNTVAAAGGDEGGMQ 314

Query: 301 PFLAALSEAGAVHYHNAPSAA 321
           PFLAALSEAG VHY NAPSAA
Sbjct: 315 PFLAALSEAGVVHYVNAPSAA 335

BLAST of ClCG01G010820 vs. TrEMBL
Match: B9RFM7_RICCO (Hexosyltransferase OS=Ricinus communis GN=RCOM_1435920 PE=3 SV=1)

HSP 1 Score: 525.0 bits (1351), Expect = 6.3e-146
Identity = 243/322 (75.47%), Positives = 276/322 (85.71%), Query Frame = 1

Query: 1   MAPELSGGKCGYVTFLAGNGDYVKGVVGLAKGLRKVKSKYPLVVAVLPDVPEEHRELLRW 60
           +  + S   C YVTFLAG+GDYVKGVVGLAKGLRKVKSKYPLVVA+LPDVPE+HR++L  
Sbjct: 17  LVKQASISSCAYVTFLAGDGDYVKGVVGLAKGLRKVKSKYPLVVAILPDVPEDHRKILVS 76

Query: 61  QGCVVKEIEPVYPPENQTHFAMPYYVINYSKLRIWEFVEFKKMIYLDGDIQVMENIDHLF 120
           QGC+VKEIEPVYPPENQT FAM YYVINYSKLRIWEFVE+ KMIYLDGDIQV ENIDHLF
Sbjct: 77  QGCIVKEIEPVYPPENQTQFAMAYYVINYSKLRIWEFVEYSKMIYLDGDIQVFENIDHLF 136

Query: 121 QMEESYFYAVMDCFCEKTWSHTAQYKIGYCQQRPNEVEWGPQLGPKPPLYFNAGMFVYEP 180
            ++  YFYAVMDCFCEKTWSH+ QYKIGYCQQ P+ V+W  ++GPKPPLYFNAGMFV+EP
Sbjct: 137 DLQNGYFYAVMDCFCEKTWSHSPQYKIGYCQQCPDRVKWPAEMGPKPPLYFNAGMFVFEP 196

Query: 181 NFQTYRALLSTLNSTPPTPFAEQDFLNMFFKEKYRPIPAVYNLVMAMLWRHPENVDLHKV 240
           +  TY  LL+T+  TPPTPFAEQDFLNMFFK+ YRPIP +YNLV+A+LWRHPEN++  KV
Sbjct: 197 SLSTYDDLLNTVKLTPPTPFAEQDFLNMFFKDIYRPIPPIYNLVLALLWRHPENIEFEKV 256

Query: 241 KVVHYCAAGSKPWRYTGKEENMDREDVKMLVKKWWEIYDDETLDFIN-YKIDDGDTDAR- 300
           KVVHYCAAGSKPWRYTGKE+NMDRED+KMLVKKWW+IY+DE+LD+ N      G T+   
Sbjct: 257 KVVHYCAAGSKPWRYTGKEDNMDREDIKMLVKKWWDIYEDESLDYKNTVAATGGATEGEL 316

Query: 301 HPFLAALSEAGAVHYHNAPSAA 321
            PFLAALSEAG VHY  APSAA
Sbjct: 317 QPFLAALSEAGVVHYVTAPSAA 338

BLAST of ClCG01G010820 vs. TrEMBL
Match: A0A061F4Y8_THECC (Hexosyltransferase OS=Theobroma cacao GN=TCM_026798 PE=3 SV=1)

HSP 1 Score: 520.8 bits (1340), Expect = 1.2e-144
Identity = 244/316 (77.22%), Positives = 274/316 (86.71%), Query Frame = 1

Query: 12  YVTFLAGNGDYVKGVVGLAKGLRKVKSKYPLVVAVLPDVPEEHRELLRWQGCVVKEIEPV 71
           YVTFLAGNGDYVKGVVGLAKGLRKVKS+YPL+VA+LPDVPE+HR++L  QGC+VKEIEPV
Sbjct: 33  YVTFLAGNGDYVKGVVGLAKGLRKVKSQYPLLVAILPDVPEDHRKILVDQGCIVKEIEPV 92

Query: 72  YPPENQTHFAMPYYVINYSKLRIWEFVEFKKMIYLDGDIQVMENIDHLFQMEESYFYAVM 131
           YPPENQT FAM YYVINYSKLRIWEFVE+ KMIYLDGDIQV ENIDHLF ME+  FYAVM
Sbjct: 93  YPPENQTQFAMAYYVINYSKLRIWEFVEYCKMIYLDGDIQVFENIDHLFDMEDGSFYAVM 152

Query: 132 DCFCEKTWSHTAQYKIGYCQQRPNEVEWGPQLGPKPPLYFNAGMFVYEPNFQTYRALLST 191
           DCFCEKTWSHT QYKIGYCQQ P++V+W  QLGPKPPLYFNAGMFVYEP+ + Y  LL T
Sbjct: 153 DCFCEKTWSHTPQYKIGYCQQCPDKVQWPSQLGPKPPLYFNAGMFVYEPSLRVYDELLRT 212

Query: 192 LNSTPPTPFAEQDFLNMFFKEKYRPIPAVYNLVMAMLWRHPENVDLHKVKVVHYCAAGSK 251
           L  TPPTPFAEQD+LNMFF++ Y+PIP VYNLVMAMLWRHPEN++L KVKV HYCAAGSK
Sbjct: 213 LKVTPPTPFAEQDYLNMFFRDIYKPIPPVYNLVMAMLWRHPENIELEKVKVAHYCAAGSK 272

Query: 252 PWRYTGKEENMDREDVKMLVKKWWEIYDDETLDFINY------KIDDGDTDARHPFLAAL 311
           PWR+TGKEENMDRED+KMLV KWW+IY+DE+LD+ N+      ++D  +     PFLAAL
Sbjct: 273 PWRFTGKEENMDREDIKMLVSKWWDIYNDESLDYKNFVASGEAEVDRDERTGLQPFLAAL 332

Query: 312 SEAGAV-HYHNAPSAA 321
           SEAG V HY NAPSAA
Sbjct: 333 SEAGVVDHYINAPSAA 348

BLAST of ClCG01G010820 vs. TrEMBL
Match: B9RFM8_RICCO (Hexosyltransferase OS=Ricinus communis GN=RCOM_1436030 PE=3 SV=1)

HSP 1 Score: 519.6 bits (1337), Expect = 2.7e-144
Identity = 241/323 (74.61%), Positives = 275/323 (85.14%), Query Frame = 1

Query: 1   MAPELSGGKCGYVTFLAGNGDYVKGVVGLAKGLRKVKSKYPLVVAVLPDVPEEHRELLRW 60
           +  + S   C YVTFLAGNGDYVKGVVGLAKGLRKV SKYPLVVA+LPDVPE+HR++L  
Sbjct: 17  LVKQASISSCAYVTFLAGNGDYVKGVVGLAKGLRKVNSKYPLVVAILPDVPEDHRKILVS 76

Query: 61  QGCVVKEIEPVYPPENQTHFAMPYYVINYSKLRIWEFVEFKKMIYLDGDIQVMENIDHLF 120
           QGC++KEIEPVYPPENQT FAM YYVINYSKLRIWEFVE+ KMIYLDGDIQV ENIDHLF
Sbjct: 77  QGCIIKEIEPVYPPENQTQFAMAYYVINYSKLRIWEFVEYSKMIYLDGDIQVFENIDHLF 136

Query: 121 QMEESYFYAVMDCFCEKTWSHTAQYKIGYCQQRPNEVEWGPQLGPKPPLYFNAGMFVYEP 180
            +++ YFYAVMDCFCEKTWSH+ QYKIGYCQQ P+ V+W  ++GPKPPLYFNAGMFV+EP
Sbjct: 137 DLQDGYFYAVMDCFCEKTWSHSPQYKIGYCQQCPDRVKWPAEMGPKPPLYFNAGMFVFEP 196

Query: 181 NFQTYRALLSTLNSTPPTPFAEQDFLNMFFKEKYRPIPAVYNLVMAMLWRHPENVDLHKV 240
           +  TY  LL+T+  TPPTPFAEQDFLNMFFK+ YRPIP +YNLV+A+LWRHPEN++L KV
Sbjct: 197 SLPTYDDLLNTVKLTPPTPFAEQDFLNMFFKDIYRPIPPIYNLVLALLWRHPENIELEKV 256

Query: 241 KVVHYCAAGSKPWRYTGKEENMDREDVKMLVKKWWEIYDDETLDFIN--YKIDDGDTDA- 300
           KVVHYCAAGSKPWRYTGKEENMDRED+K LVKKWW+IY+DE+LD+ N       G T+A 
Sbjct: 257 KVVHYCAAGSKPWRYTGKEENMDREDIKTLVKKWWDIYEDESLDYKNTAAAATGGATEAG 316

Query: 301 RHPFLAALSEAGAVHYHNAPSAA 321
             P LAA+SEA  VHY  APSAA
Sbjct: 317 LQPLLAAMSEASEVHYITAPSAA 339

BLAST of ClCG01G010820 vs. TrEMBL
Match: A0A068J7T2_MANES (Hexosyltransferase OS=Manihot esculenta GN=GolS5 PE=2 SV=1)

HSP 1 Score: 517.3 bits (1331), Expect = 1.3e-143
Identity = 238/321 (74.14%), Positives = 274/321 (85.36%), Query Frame = 1

Query: 1   MAPELSGGKCGYVTFLAGNGDYVKGVVGLAKGLRKVKSKYPLVVAVLPDVPEEHRELLRW 60
           +  + S   C YVTFLAGNGDYVKGVVGLAKGLRKV+SKYPLVVA+LPDVP+EHR++L  
Sbjct: 15  LVKQASISSCAYVTFLAGNGDYVKGVVGLAKGLRKVRSKYPLVVAILPDVPDEHRKILVS 74

Query: 61  QGCVVKEIEPVYPPENQTHFAMPYYVINYSKLRIWEFVEFKKMIYLDGDIQVMENIDHLF 120
           QGC+VKEIEPVYPPENQT FAM YYVINYSKLRIWEFVE+ KMIYLDGDIQV +NIDHLF
Sbjct: 75  QGCIVKEIEPVYPPENQTQFAMAYYVINYSKLRIWEFVEYSKMIYLDGDIQVFDNIDHLF 134

Query: 121 QMEESYFYAVMDCFCEKTWSHTAQYKIGYCQQRPNEVEWGPQLGPKPPLYFNAGMFVYEP 180
            +++ YFY VMDCFCE+TWS + QYKIGYCQQ P+ V+W  ++GPKPPLYFNAGMFV+EP
Sbjct: 135 DLQDGYFYGVMDCFCEQTWSFSPQYKIGYCQQCPDRVQWPAEMGPKPPLYFNAGMFVFEP 194

Query: 181 NFQTYRALLSTLNSTPPTPFAEQDFLNMFFKEKYRPIPAVYNLVMAMLWRHPENVDLHKV 240
           +  TY  LL T+  T PT FAEQDFLNMFFK+ YRP+P +YNLV+AMLWRHPEN++L KV
Sbjct: 195 SLSTYDDLLQTVKVTTPTLFAEQDFLNMFFKDIYRPLPPIYNLVLAMLWRHPENIELEKV 254

Query: 241 KVVHYCAAGSKPWRYTGKEENMDREDVKMLVKKWWEIYDDETLDFINYKIDDGDTDA-RH 300
           KVVHYCAAGSKPWRYTGKEENMDRED+K+LVKKWW+IY+DE+LD+ N  +  G T+A   
Sbjct: 255 KVVHYCAAGSKPWRYTGKEENMDREDIKILVKKWWDIYNDESLDYKNTVVAAGGTEADLQ 314

Query: 301 PFLAALSEAGAVHYHNAPSAA 321
           PFLAALSEAG  HY  APSAA
Sbjct: 315 PFLAALSEAGVAHYLTAPSAA 335

BLAST of ClCG01G010820 vs. TAIR10
Match: AT1G56600.1 (AT1G56600.1 galactinol synthase 2)

HSP 1 Score: 498.4 bits (1282), Expect = 3.2e-141
Identity = 241/335 (71.94%), Positives = 269/335 (80.30%), Query Frame = 1

Query: 1   MAPELS------------GGKCGYVTFLAGNGDYVKGVVGLAKGLRKVKSKYPLVVAVLP 60
           MAPE++            G K  YVTFLAG GDYVKGVVGLAKGLRK KSKYPLVVAVLP
Sbjct: 1   MAPEINTKLTVPVHSATGGEKRAYVTFLAGTGDYVKGVVGLAKGLRKAKSKYPLVVAVLP 60

Query: 61  DVPEEHRELLRWQGCVVKEIEPVYPPENQTHFAMPYYVINYSKLRIWEFVEFKKMIYLDG 120
           DVPE+HR+ L  QGCVVKEIEPVYPPENQT FAM YYVINYSKLRIWEFVE+ KMIYLDG
Sbjct: 61  DVPEDHRKQLVDQGCVVKEIEPVYPPENQTEFAMAYYVINYSKLRIWEFVEYNKMIYLDG 120

Query: 121 DIQVMENIDHLFQMEESYFYAVMDCFCEKTWSHTAQYKIGYCQQRPNEVEW-GPQLGPKP 180
           DIQV +NIDHLF +    FYAVMDCFCEKTWSH+ QYKIGYCQQ P++V W   +LGPKP
Sbjct: 121 DIQVFDNIDHLFDLPNGQFYAVMDCFCEKTWSHSPQYKIGYCQQCPDKVTWPEAKLGPKP 180

Query: 181 PLYFNAGMFVYEPNFQTYRALLSTLNSTPPTPFAEQDFLNMFFKEKYRPIPAVYNLVMAM 240
           PLYFNAGMFVYEPN  TY  LL T+   PPT FAEQDFLNM+FK+ Y+PIP VYNLV+AM
Sbjct: 181 PLYFNAGMFVYEPNLSTYHNLLETVKIVPPTLFAEQDFLNMYFKDIYKPIPPVYNLVLAM 240

Query: 241 LWRHPENVDLHKVKVVHYCAAGSKPWRYTGKEENMDREDVKMLVKKWWEIYDDETLDFIN 300
           LWRHPEN++L +VKVVHYCAAG+KPWR+TG+EENMDRED+KMLVKKWW+IY+DE+LD+ N
Sbjct: 241 LWRHPENIELDQVKVVHYCAAGAKPWRFTGEEENMDREDIKMLVKKWWDIYNDESLDYKN 300

Query: 301 YKIDDGDTDAR--HPFLAALSEAGAVHYHNAPSAA 321
             I D     +    F+ ALSEAGA+ Y  APSAA
Sbjct: 301 VVIGDSHKKQQTLQQFIEALSEAGALQYVKAPSAA 335

BLAST of ClCG01G010820 vs. TAIR10
Match: AT2G47180.1 (AT2G47180.1 galactinol synthase 1)

HSP 1 Score: 495.4 bits (1274), Expect = 2.7e-140
Identity = 236/322 (73.29%), Positives = 266/322 (82.61%), Query Frame = 1

Query: 3   PELSGGKCGYVTFLAGNGDYVKGVVGLAKGLRKVKSKYPLVVAVLPDVPEEHRELLRWQG 62
           P +      YVTFLAGNGDYVKGVVGLAKGLRKVKS YPLVVA+LPDVPEEHR +L  QG
Sbjct: 23  PSVQDSDRAYVTFLAGNGDYVKGVVGLAKGLRKVKSAYPLVVAMLPDVPEEHRRILVDQG 82

Query: 63  CVVKEIEPVYPPENQTHFAMPYYVINYSKLRIWEFVEFKKMIYLDGDIQVMENIDHLFQM 122
           C+V+EIEPVYPPENQT FAM YYVINYSKLRIW+FVE+ KMIYLDGDIQV ENIDHLF +
Sbjct: 83  CIVREIEPVYPPENQTQFAMAYYVINYSKLRIWKFVEYSKMIYLDGDIQVYENIDHLFDL 142

Query: 123 EESYFYAVMDCFCEKTWSHTAQYKIGYCQQRPNEVEW-GPQLGPKPPLYFNAGMFVYEPN 182
            + Y YAVMDCFCEKTWSHT QYKI YCQQ P++V+W   +LG  P LYFNAGMF+YEPN
Sbjct: 143 PDGYLYAVMDCFCEKTWSHTPQYKIRYCQQCPDKVQWPKAELGEPPALYFNAGMFLYEPN 202

Query: 183 FQTYRALLSTLNSTPPTPFAEQDFLNMFFKEKYRPIPAVYNLVMAMLWRHPENVDLHKVK 242
            +TY  LL TL  TPPTPFAEQDFLNM+FK+ Y+PIP VYNLV+AMLWRHPENV+L KVK
Sbjct: 203 LETYEDLLRTLKITPPTPFAEQDFLNMYFKKIYKPIPLVYNLVLAMLWRHPENVELGKVK 262

Query: 243 VVHYCAAGSKPWRYTGKEENMDREDVKMLVKKWWEIYDDETLDFIN-YKIDDGDTDARH- 302
           VVHYCAAGSKPWRYTGKE NM+RED+KMLVKKWW+IYDDE+LD+     + D + D  + 
Sbjct: 263 VVHYCAAGSKPWRYTGKEANMEREDIKMLVKKWWDIYDDESLDYKKPVTVVDTEVDLVNL 322

Query: 303 -PFLAALSEAGAVHYHNAPSAA 321
            PF+ AL+EAG ++Y  APSAA
Sbjct: 323 KPFITALTEAGRLNYVTAPSAA 344

BLAST of ClCG01G010820 vs. TAIR10
Match: AT1G09350.1 (AT1G09350.1 galactinol synthase 3)

HSP 1 Score: 478.0 bits (1229), Expect = 4.5e-135
Identity = 227/334 (67.96%), Positives = 263/334 (78.74%), Query Frame = 1

Query: 1   MAPELSGG------KCGYVTFLAGNGDYVKGVVGLAKGLRKVKSKYPLVVAVLPDVPEEH 60
           MAPE++        K  YVTFLAG GDYVKGVVGLAKGLRK KSKYPLVVAVLPDVP +H
Sbjct: 1   MAPEMNNKLSYGEKKRAYVTFLAGTGDYVKGVVGLAKGLRKTKSKYPLVVAVLPDVPADH 60

Query: 61  RELLRWQGCVVKEIEPVYPPENQTHFAMPYYVINYSKLRIWEFVEFKKMIYLDGDIQVME 120
           R  L  QGCV+KEI+PVYPP+NQT FAM YYV+NYSKLRIW+FVE+ K+IYLDGDIQV E
Sbjct: 61  RRQLLDQGCVIKEIQPVYPPDNQTQFAMAYYVLNYSKLRIWKFVEYSKLIYLDGDIQVFE 120

Query: 121 NIDHLFQMEESYFYAVMDCFCEKTWSHTAQYKIGYCQQRPNEVEW-GPQLGPKPPLYFNA 180
           NIDHLF + +  FYAV DCFCEKTWSHT QYKIGYCQQ P++V W   +LGPKPPLYFNA
Sbjct: 121 NIDHLFDLPDGNFYAVKDCFCEKTWSHTPQYKIGYCQQCPDKVTWPESELGPKPPLYFNA 180

Query: 181 GMFVYEPNFQTYRALLSTLNSTPPTPFAEQDFLNMFFKEKYRPIPAVYNLVMAMLWRHPE 240
           GMFVYEP+  TY  LL TL   PPTPFAEQDFLNM+FK+ Y+PIP VYNLV+AMLWRHPE
Sbjct: 181 GMFVYEPSLPTYYNLLETLKVVPPTPFAEQDFLNMYFKDIYKPIPPVYNLVLAMLWRHPE 240

Query: 241 NVDLHKVKVVHYCAAGSKPWRYTGKEENMDREDVKMLVKKWWEIYDDETLDFINYKIDDG 300
           N++L++ KVVHYCAAG+KPWR+TG+E NM+RED+KMLV+KWW+IY+DE+LD+ N+ +  G
Sbjct: 241 NIELNEAKVVHYCAAGAKPWRFTGQEGNMEREDIKMLVEKWWDIYNDESLDYKNFNVHCG 300

Query: 301 DTDARH-------PFLAALSEAGAVHYHNAPSAA 321
             +  H        F   LSEA  +    APSAA
Sbjct: 301 QKEDVHRKPKTLPQFFTDLSEADVLQCAKAPSAA 334

BLAST of ClCG01G010820 vs. TAIR10
Match: AT1G60470.1 (AT1G60470.1 galactinol synthase 4)

HSP 1 Score: 473.4 bits (1217), Expect = 1.1e-133
Identity = 224/311 (72.03%), Positives = 255/311 (81.99%), Query Frame = 1

Query: 12  YVTFLAGNGDYVKGVVGLAKGLRKVKSKYPLVVAVLPDVPEEHRELLRWQGCVVKEIEPV 71
           YVTFLAGNGDYVKGVVGLAKGLRKVKS YPLVVA+LPDVPEEHRE+LR QGCVV+EIEPV
Sbjct: 25  YVTFLAGNGDYVKGVVGLAKGLRKVKSAYPLVVAMLPDVPEEHREILRSQGCVVREIEPV 84

Query: 72  YPPENQTHFAMPYYVINYSKLRIWEFVEFKKMIYLDGDIQVMENIDHLFQMEESYFYAVM 131
           YPP+NQ  FAM YYV+NYSKLRIW F E+ KMIYLD DIQV +NIDHLF + ++YFYAVM
Sbjct: 85  YPPDNQVEFAMAYYVLNYSKLRIWNFEEYSKMIYLDADIQVFDNIDHLFDLSDAYFYAVM 144

Query: 132 DCFCEKTWSHTAQYKIGYCQQRPNEVEWGPQL-GPKPPLYFNAGMFVYEPNFQTYRALLS 191
           DCFCEKTWSH+ QY IGYCQQ P +V W   +  P PPLYFNAGMFV+EP+  TY +LL 
Sbjct: 145 DCFCEKTWSHSLQYSIGYCQQCPEKVTWPEDMESPPPPLYFNAGMFVFEPSPLTYESLLQ 204

Query: 192 TLNSTPPTPFAEQDFLNMFFKEKYRPIPAVYNLVMAMLWRHPENVDLHKVKVVHYCAAGS 251
           TL  TPP+PFAEQDFLNMFF++ Y+PIP VYNLV+AMLWRHPENV+L KVKVVHYCAAGS
Sbjct: 205 TLEITPPSPFAEQDFLNMFFEKVYKPIPLVYNLVLAMLWRHPENVELEKVKVVHYCAAGS 264

Query: 252 KPWRYTGKEENMDREDVKMLVKKWWEIYDDETLDF-INYKIDDGDTDARHPFLAALSEAG 311
           KPWRYTG+E NMDRED+KMLV KWW++Y+DE+LDF      D  +T  +   LA++ E  
Sbjct: 265 KPWRYTGEEANMDREDIKMLVDKWWDVYNDESLDFKSKIPADAEETVTKSSILASVLEP- 324

Query: 312 AVHYHNAPSAA 321
            + Y  APSAA
Sbjct: 325 EMTYFPAPSAA 334

BLAST of ClCG01G010820 vs. TAIR10
Match: AT4G26250.1 (AT4G26250.1 galactinol synthase 6)

HSP 1 Score: 429.1 bits (1102), Expect = 2.4e-120
Identity = 202/313 (64.54%), Positives = 238/313 (76.04%), Query Frame = 1

Query: 9   KCGYVTFLAGNGDYVKGVVGLAKGLRKVKSKYPLVVAVLPDVPEEHRELLRWQGCVVKEI 68
           K  YVTFLAGN DY  GVVGLAKGLRKVKS YPLVVA+LPDVPEEHR++L  QGC+++EI
Sbjct: 24  KRAYVTFLAGNKDYWMGVVGLAKGLRKVKSAYPLVVAILPDVPEEHRQILLAQGCIIREI 83

Query: 69  EPVYPPENQTHFAMPYYVINYSKLRIWEFVEFKKMIYLDGDIQVMENIDHLFQMEESYFY 128
           EPVYPPEN+T ++M YYVINYSKLRIWEFVE++KMIYLDGDIQV  NIDHLF     Y Y
Sbjct: 84  EPVYPPENKTGYSMAYYVINYSKLRIWEFVEYEKMIYLDGDIQVFSNIDHLFDTPRGYLY 143

Query: 129 AVMDCFCEKTWSHTAQYKIGYCQQRPNEVEWGPQ-LGPKPPLYFNAGMFVYEPNFQTYRA 188
           AV DCFCE +WS T Q+KIGYCQQ P +V W  + LG  PP+YFNAGM V+EPN  TY  
Sbjct: 144 AVKDCFCEISWSKTPQFKIGYCQQCPEKVTWPVESLGSPPPVYFNAGMLVFEPNLLTYED 203

Query: 189 LLSTLNSTPPTPFAEQDFLNMFFKEKYRPIPAVYNLVMAMLWRHPENVDLHKVKVVHYCA 248
           LL  +  T PT FAEQDFLN +F + Y+PIP+ YNLVMAMLWRHPE++DL ++ V+HYCA
Sbjct: 204 LLRVVQITTPTYFAEQDFLNEYFTDIYKPIPSTYNLVMAMLWRHPEHIDLDQISVIHYCA 263

Query: 249 AGSKPWRYTGKEENMDREDVKMLVKKWWEIYDDETLDFINYKIDDGDTDARHPFLAALSE 308
            GSKPWR+   EE+MDRED+KMLVKKWW+IY+D +LD+ N+   +      +  LA+   
Sbjct: 264 NGSKPWRFDETEEHMDREDIKMLVKKWWDIYEDSSLDYKNFVETESKLSPINATLASKES 323

Query: 309 AGAVHYHNAPSAA 321
            G V    APSAA
Sbjct: 324 VGDVLISLAPSAA 336

BLAST of ClCG01G010820 vs. NCBI nr
Match: gi|449443518|ref|XP_004139524.1| (PREDICTED: galactinol synthase 2-like [Cucumis sativus])

HSP 1 Score: 609.0 bits (1569), Expect = 4.8e-171
Identity = 288/326 (88.34%), Positives = 305/326 (93.56%), Query Frame = 1

Query: 1   MAPEL---SGGKCGYVTFLAGNGDYVKGVVGLAKGLRKVKSKYPLVVAVLPDVPEEHREL 60
           MAPEL     GK GYVTFLAGNGDYVKGVVGLAKGLRKVKSKYPL+VAVLPDVPEEHREL
Sbjct: 1   MAPELLSSGAGKFGYVTFLAGNGDYVKGVVGLAKGLRKVKSKYPLLVAVLPDVPEEHREL 60

Query: 61  LRWQGCVVKEIEPVYPPENQTHFAMPYYVINYSKLRIWEFVEFKKMIYLDGDIQVMENID 120
           LRWQGCVVKEI+PVYPP+N T FAMPYYVINYSKLRIWEFVE+KK+IYLDGDIQVMENID
Sbjct: 61  LRWQGCVVKEIQPVYPPQNHTQFAMPYYVINYSKLRIWEFVEYKKLIYLDGDIQVMENID 120

Query: 121 HLFQMEESYFYAVMDCFCEKTWSHTAQYKIGYCQQRPNEVEW-GPQLGPKPPLYFNAGMF 180
           HLFQME+S+FYAVMDCFCEKTWSHTAQY+IGYCQQRPNEV+W   +LGPKPPLYFNAGMF
Sbjct: 121 HLFQMEDSFFYAVMDCFCEKTWSHTAQYEIGYCQQRPNEVQWPASELGPKPPLYFNAGMF 180

Query: 181 VYEPNFQTYRALLSTLNSTPPTPFAEQDFLNMFFKEKYRPIPAVYNLVMAMLWRHPENVD 240
           VYEPN +TY +LLSTLN TPPTPFAEQDFLNMFFK+KY+PIP VYNLVMAMLWRHPEN++
Sbjct: 181 VYEPNLETYHSLLSTLNITPPTPFAEQDFLNMFFKDKYKPIPPVYNLVMAMLWRHPENIE 240

Query: 241 LHKVKVVHYCAAGSKPWRYTGKEENMDREDVKMLVKKWWEIYDDETLDFINYK-IDDGDT 300
           LHKVKVVHYCAAGSKPWRYTGKEENMDREDVKMLVKKWWEIYDDETLD+INYK IDDGDT
Sbjct: 241 LHKVKVVHYCAAGSKPWRYTGKEENMDREDVKMLVKKWWEIYDDETLDYINYKMIDDGDT 300

Query: 301 DARHPFLAALSEAGAVHY-HNAPSAA 321
           D R PFLAALSEAG VHY H APSAA
Sbjct: 301 DTRQPFLAALSEAGVVHYGHTAPSAA 326

BLAST of ClCG01G010820 vs. NCBI nr
Match: gi|659127227|ref|XP_008463593.1| (PREDICTED: galactinol synthase 2-like [Cucumis melo])

HSP 1 Score: 605.5 bits (1560), Expect = 5.3e-170
Identity = 287/325 (88.31%), Positives = 303/325 (93.23%), Query Frame = 1

Query: 1   MAPEL--SG-GKCGYVTFLAGNGDYVKGVVGLAKGLRKVKSKYPLVVAVLPDVPEEHREL 60
           MAPE   SG GK GYVTFLAGNGDYVKGVVGLAKGLRKVKSKYPL+VAVLPDVPEEHREL
Sbjct: 1   MAPEFLSSGVGKFGYVTFLAGNGDYVKGVVGLAKGLRKVKSKYPLLVAVLPDVPEEHREL 60

Query: 61  LRWQGCVVKEIEPVYPPENQTHFAMPYYVINYSKLRIWEFVEFKKMIYLDGDIQVMENID 120
           LRWQGCVVKEI+PVYPP+N T FAM YYVINYSKLRIWEFVE+KK+IYLDGDIQVMENID
Sbjct: 61  LRWQGCVVKEIQPVYPPQNHTQFAMAYYVINYSKLRIWEFVEYKKLIYLDGDIQVMENID 120

Query: 121 HLFQMEESYFYAVMDCFCEKTWSHTAQYKIGYCQQRPNEVEWGPQLGPKPPLYFNAGMFV 180
           HLFQME+S+FYAVMDCFCEKTWSHT QY IGYCQQRPNEV+W  +LGPKPPLYFNAGMFV
Sbjct: 121 HLFQMEDSFFYAVMDCFCEKTWSHTPQYNIGYCQQRPNEVQWPSELGPKPPLYFNAGMFV 180

Query: 181 YEPNFQTYRALLSTLNSTPPTPFAEQDFLNMFFKEKYRPIPAVYNLVMAMLWRHPENVDL 240
           YEPN +TY +LLSTLN TPPTPFAEQDFLNMFFK+KY+PIP VYNLVMAMLWRHPEN++L
Sbjct: 181 YEPNLETYHSLLSTLNVTPPTPFAEQDFLNMFFKDKYKPIPPVYNLVMAMLWRHPENIEL 240

Query: 241 HKVKVVHYCAAGSKPWRYTGKEENMDREDVKMLVKKWWEIYDDETLDFINYK-IDDGDTD 300
           HKVKVVHYCAAGSKPWRYTGKEENMDREDVKMLVKKWWEIYDDETLD+INYK IDDGDTD
Sbjct: 241 HKVKVVHYCAAGSKPWRYTGKEENMDREDVKMLVKKWWEIYDDETLDYINYKMIDDGDTD 300

Query: 301 ARHPFLAALSEAGAVHY-HNAPSAA 321
            R PFLAALSEAG VHY H APSAA
Sbjct: 301 TRQPFLAALSEAGVVHYGHTAPSAA 325

BLAST of ClCG01G010820 vs. NCBI nr
Match: gi|821324983|ref|NP_001295628.1| (galactinol synthase 2 [Jatropha curcas])

HSP 1 Score: 527.7 bits (1358), Expect = 1.4e-146
Identity = 241/321 (75.08%), Positives = 276/321 (85.98%), Query Frame = 1

Query: 1   MAPELSGGKCGYVTFLAGNGDYVKGVVGLAKGLRKVKSKYPLVVAVLPDVPEEHRELLRW 60
           +  + S   C YVTFLAGNGDY+KGVVGLAKGLRKVKSKYPLVVA+LPDVPE+HR++L  
Sbjct: 15  LVKQASISSCAYVTFLAGNGDYIKGVVGLAKGLRKVKSKYPLVVAILPDVPEDHRKILVS 74

Query: 61  QGCVVKEIEPVYPPENQTHFAMPYYVINYSKLRIWEFVEFKKMIYLDGDIQVMENIDHLF 120
           QGC+VKEIEPVYPPENQT FAM YYVINYSKLRIWEFVE+ KMIYLDGDIQV ENIDHLF
Sbjct: 75  QGCIVKEIEPVYPPENQTQFAMAYYVINYSKLRIWEFVEYSKMIYLDGDIQVFENIDHLF 134

Query: 121 QMEESYFYAVMDCFCEKTWSHTAQYKIGYCQQRPNEVEWGPQLGPKPPLYFNAGMFVYEP 180
            +++ YFYAVMDC+CEKTWSH+ Q+KIGYCQQ P+ V+W  +LGP PPLYFNAGMFVYEP
Sbjct: 135 DLQDGYFYAVMDCYCEKTWSHSVQHKIGYCQQCPDRVKWPAELGPAPPLYFNAGMFVYEP 194

Query: 181 NFQTYRALLSTLNSTPPTPFAEQDFLNMFFKEKYRPIPAVYNLVMAMLWRHPENVDLHKV 240
           +  TY  LL TL  TPPTPFAEQDFLNMFFK+ Y+PIP +YNLV+A++WRHPEN++++K 
Sbjct: 195 SLSTYDDLLKTLKVTPPTPFAEQDFLNMFFKDIYKPIPPIYNLVLALIWRHPENIEVNKA 254

Query: 241 KVVHYCAAGSKPWRYTGKEENMDREDVKMLVKKWWEIYDDETLDFIN-YKIDDGDTDARH 300
           KVVHYCAAGSKPWRYTGKEENMDRED+KMLV+KWW+IY+DE+LD+ N      GD     
Sbjct: 255 KVVHYCAAGSKPWRYTGKEENMDREDIKMLVQKWWDIYNDESLDYRNTVAAAGGDEGGMQ 314

Query: 301 PFLAALSEAGAVHYHNAPSAA 321
           PFLAALSEAG VHY NAPSAA
Sbjct: 315 PFLAALSEAGVVHYVNAPSAA 335

BLAST of ClCG01G010820 vs. NCBI nr
Match: gi|255542966|ref|XP_002512546.1| (PREDICTED: galactinol synthase 2 [Ricinus communis])

HSP 1 Score: 525.0 bits (1351), Expect = 9.1e-146
Identity = 243/322 (75.47%), Positives = 276/322 (85.71%), Query Frame = 1

Query: 1   MAPELSGGKCGYVTFLAGNGDYVKGVVGLAKGLRKVKSKYPLVVAVLPDVPEEHRELLRW 60
           +  + S   C YVTFLAG+GDYVKGVVGLAKGLRKVKSKYPLVVA+LPDVPE+HR++L  
Sbjct: 17  LVKQASISSCAYVTFLAGDGDYVKGVVGLAKGLRKVKSKYPLVVAILPDVPEDHRKILVS 76

Query: 61  QGCVVKEIEPVYPPENQTHFAMPYYVINYSKLRIWEFVEFKKMIYLDGDIQVMENIDHLF 120
           QGC+VKEIEPVYPPENQT FAM YYVINYSKLRIWEFVE+ KMIYLDGDIQV ENIDHLF
Sbjct: 77  QGCIVKEIEPVYPPENQTQFAMAYYVINYSKLRIWEFVEYSKMIYLDGDIQVFENIDHLF 136

Query: 121 QMEESYFYAVMDCFCEKTWSHTAQYKIGYCQQRPNEVEWGPQLGPKPPLYFNAGMFVYEP 180
            ++  YFYAVMDCFCEKTWSH+ QYKIGYCQQ P+ V+W  ++GPKPPLYFNAGMFV+EP
Sbjct: 137 DLQNGYFYAVMDCFCEKTWSHSPQYKIGYCQQCPDRVKWPAEMGPKPPLYFNAGMFVFEP 196

Query: 181 NFQTYRALLSTLNSTPPTPFAEQDFLNMFFKEKYRPIPAVYNLVMAMLWRHPENVDLHKV 240
           +  TY  LL+T+  TPPTPFAEQDFLNMFFK+ YRPIP +YNLV+A+LWRHPEN++  KV
Sbjct: 197 SLSTYDDLLNTVKLTPPTPFAEQDFLNMFFKDIYRPIPPIYNLVLALLWRHPENIEFEKV 256

Query: 241 KVVHYCAAGSKPWRYTGKEENMDREDVKMLVKKWWEIYDDETLDFIN-YKIDDGDTDAR- 300
           KVVHYCAAGSKPWRYTGKE+NMDRED+KMLVKKWW+IY+DE+LD+ N      G T+   
Sbjct: 257 KVVHYCAAGSKPWRYTGKEDNMDREDIKMLVKKWWDIYEDESLDYKNTVAATGGATEGEL 316

Query: 301 HPFLAALSEAGAVHYHNAPSAA 321
            PFLAALSEAG VHY  APSAA
Sbjct: 317 QPFLAALSEAGVVHYVTAPSAA 338

BLAST of ClCG01G010820 vs. NCBI nr
Match: gi|590644899|ref|XP_007031210.1| (Galactinol synthase 2 [Theobroma cacao])

HSP 1 Score: 520.8 bits (1340), Expect = 1.7e-144
Identity = 244/316 (77.22%), Positives = 274/316 (86.71%), Query Frame = 1

Query: 12  YVTFLAGNGDYVKGVVGLAKGLRKVKSKYPLVVAVLPDVPEEHRELLRWQGCVVKEIEPV 71
           YVTFLAGNGDYVKGVVGLAKGLRKVKS+YPL+VA+LPDVPE+HR++L  QGC+VKEIEPV
Sbjct: 33  YVTFLAGNGDYVKGVVGLAKGLRKVKSQYPLLVAILPDVPEDHRKILVDQGCIVKEIEPV 92

Query: 72  YPPENQTHFAMPYYVINYSKLRIWEFVEFKKMIYLDGDIQVMENIDHLFQMEESYFYAVM 131
           YPPENQT FAM YYVINYSKLRIWEFVE+ KMIYLDGDIQV ENIDHLF ME+  FYAVM
Sbjct: 93  YPPENQTQFAMAYYVINYSKLRIWEFVEYCKMIYLDGDIQVFENIDHLFDMEDGSFYAVM 152

Query: 132 DCFCEKTWSHTAQYKIGYCQQRPNEVEWGPQLGPKPPLYFNAGMFVYEPNFQTYRALLST 191
           DCFCEKTWSHT QYKIGYCQQ P++V+W  QLGPKPPLYFNAGMFVYEP+ + Y  LL T
Sbjct: 153 DCFCEKTWSHTPQYKIGYCQQCPDKVQWPSQLGPKPPLYFNAGMFVYEPSLRVYDELLRT 212

Query: 192 LNSTPPTPFAEQDFLNMFFKEKYRPIPAVYNLVMAMLWRHPENVDLHKVKVVHYCAAGSK 251
           L  TPPTPFAEQD+LNMFF++ Y+PIP VYNLVMAMLWRHPEN++L KVKV HYCAAGSK
Sbjct: 213 LKVTPPTPFAEQDYLNMFFRDIYKPIPPVYNLVMAMLWRHPENIELEKVKVAHYCAAGSK 272

Query: 252 PWRYTGKEENMDREDVKMLVKKWWEIYDDETLDFINY------KIDDGDTDARHPFLAAL 311
           PWR+TGKEENMDRED+KMLV KWW+IY+DE+LD+ N+      ++D  +     PFLAAL
Sbjct: 273 PWRFTGKEENMDREDIKMLVSKWWDIYNDESLDYKNFVASGEAEVDRDERTGLQPFLAAL 332

Query: 312 SEAGAV-HYHNAPSAA 321
           SEAG V HY NAPSAA
Sbjct: 333 SEAGVVDHYINAPSAA 348
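The percentages in each HSP summary line follow directly from the reported counts (for example, 243 identical positions over an alignment length of 322). The sketch below recomputes them as a sanity check. The function name `check_hsp_stats` and the regular expression are illustrative assumptions, not part of any BLAST tool or of this database.

```python
import re

# Hypothetical pattern for the HSP summary line shown above.
# "Posi?tives" accepts the label with or without its second "i".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_stats(line):
    """Recompute identity and positives percentages from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    id_n, id_len, id_pct, pos_n, pos_len, pos_pct = match.groups()
    return {
        # matches / alignment length, as a percentage with two decimals
        "identity": round(100 * int(id_n) / int(id_len), 2),
        "positives": round(100 * int(pos_n) / int(pos_len), 2),
        "reported_identity": float(id_pct),
        "reported_positives": float(pos_pct),
    }


ricinus = check_hsp_stats(
    "Identity = 243/322 (75.47%), Positives = 276/322 (85.71%), Query Frame = 1"
)
theobroma = check_hsp_stats(
    "Identity = 244/316 (77.22%), Positives = 274/316 (86.71%), Query Frame = 1"
)
print(ricinus)
print(theobroma)
```

For both hits the recomputed values agree with the percentages reported in the summary lines.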

The following BLAST results are available for this feature:
Match Name     E-value    Identity   Description
GOLS2_SOLLC    1.6e-142   75.72      Galactinol synthase 2 OS=Solanum lycopersicum GN=GOLS2 PE=2 SV=1
GOLS2_ARATH    5.7e-140   71.94      Galactinol synthase 2 OS=Arabidopsis thaliana GN=GOLS2 PE=1 SV=1
GOLS1_ARATH    4.8e-139   73.29      Galactinol synthase 1 OS=Arabidopsis thaliana GN=GOLS1 PE=1 SV=1
GOLS1_AJURE    7.7e-137   73.65      Galactinol synthase 1 OS=Ajuga reptans GN=GOLS1 PE=1 SV=1
GOLS3_ARATH    8.0e-134   67.96      Galactinol synthase 3 OS=Arabidopsis thaliana GN=GOLS3 PE=1 SV=1
Match Name          E-value    Identity   Description
A0A067JHE5_JATCU    9.8e-147   75.08      Hexosyltransferase OS=Jatropha curcas GN=JCGZ_23194 PE=2 SV=1
B9RFM7_RICCO        6.3e-146   75.47      Hexosyltransferase OS=Ricinus communis GN=RCOM_1435920 PE=3 SV=1
A0A061F4Y8_THECC    1.2e-144   77.22      Hexosyltransferase OS=Theobroma cacao GN=TCM_026798 PE=3 SV=1
B9RFM8_RICCO        2.7e-144   74.61      Hexosyltransferase OS=Ricinus communis GN=RCOM_1436030 PE=3 SV=1
A0A068J7T2_MANES    1.3e-143   74.14      Hexosyltransferase OS=Manihot esculenta GN=GolS5 PE=2 SV=1
Match Name     E-value    Identity   Description
AT1G56600.1    3.2e-141   71.94      galactinol synthase 2
AT2G47180.1    2.7e-140   73.29      galactinol synthase 1
AT1G09350.1    4.5e-135   67.96      galactinol synthase 3
AT1G60470.1    1.1e-133   72.03      galactinol synthase 4
AT4G26250.1    2.4e-120   64.54      galactinol synthase 6
Match Name                          E-value    Identity   Description
gi|449443518|ref|XP_004139524.1|    4.8e-171   88.34      PREDICTED: galactinol synthase 2-like [Cucumis sativus]
gi|659127227|ref|XP_008463593.1|    5.3e-170   88.31      PREDICTED: galactinol synthase 2-like [Cucumis melo]
gi|821324983|ref|NP_001295628.1|    1.4e-146   75.08      galactinol synthase 2 [Jatropha curcas]
gi|255542966|ref|XP_002512546.1|    9.1e-146   75.47      PREDICTED: galactinol synthase 2 [Ricinus communis]
gi|590644899|ref|XP_007031210.1|    1.7e-144   77.22      Galactinol synthase 2 [Theobroma cacao]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR002495    Glyco_trans_8
Vocabulary: Molecular Function
Term         Definition
GO:0016757   transferase activity, transferring glycosyl groups
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006012 galactose metabolic process
biological_process GO:0009409 response to cold
biological_process GO:0008150 biological_process
cellular_component GO:0005737 cytoplasm
cellular_component GO:0005575 cellular_component
molecular_function GO:0047216 inositol 3-alpha-galactosyltransferase activity
molecular_function GO:0046872 metal ion binding
molecular_function GO:0016757 transferase activity, transferring glycosyl groups
molecular_function GO:0016740 transferase activity
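Each row of the GO assignment list above has three whitespace-separated fields: category, accession, and a free-text term name. A minimal sketch of grouping the rows by category follows; the helper name `group_go_terms` is a hypothetical choice for illustration, and the input is copied from the list above.

```python
from collections import defaultdict

# GO assignments copied from the listing above, one term per line.
GO_LINES = """\
biological_process GO:0006012 galactose metabolic process
biological_process GO:0009409 response to cold
biological_process GO:0008150 biological_process
cellular_component GO:0005737 cytoplasm
cellular_component GO:0005575 cellular_component
molecular_function GO:0047216 inositol 3-alpha-galactosyltransferase activity
molecular_function GO:0046872 metal ion binding
molecular_function GO:0016757 transferase activity, transferring glycosyl groups
molecular_function GO:0016740 transferase activity
"""


def group_go_terms(text):
    """Map each GO category to a list of (accession, term name) pairs."""
    grouped = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue
        # Split only twice so the term name keeps its internal spaces.
        category, accession, name = line.split(maxsplit=2)
        grouped[category].append((accession, name))
    return dict(grouped)


grouped = group_go_terms(GO_LINES)
for category, terms in grouped.items():
    print(category, len(terms))
```

Splitting with `maxsplit=2` is the only design choice of note: term names such as "transferase activity, transferring glycosyl groups" contain spaces and commas, so they must be kept as a single field.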

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
ClCG01G010820.1    ClCG01G010820.1    mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term    IPR Description                  Source    Source Term      Source Description              Alignment
IPR002495   Glycosyl transferase, family 8   PFAM      PF01501          Glyco_transf_8                  coord: 14..256, score: 8.3
None        No IPR available                 PANTHER   PTHR11183        GLYCOGENIN                      coord: 9..120, score: 2.8E-191; coord: 149..298, score: 2.8E
None        No IPR available                 PANTHER   PTHR11183:SF59   GALACTINOL SYNTHASE 2-RELATED   coord: 9..120, score: 2.8E-191; coord: 149..298, score: 2.8E