ClCG07G004050 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG07G004050
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: glycosyltransferase family 64 protein C4
Location: CG_Chr07: 4620340 .. 4624499 (+)
RNA-Seq Expression: ClCG07G004050
Synteny: ClCG07G004050
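
The Location field above can be turned into a chromosome, strand, and span length. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this) and using a hypothetical helper named parse_location that is not part of any database API:

```python
import re

def parse_location(text):
    """Parse 'CHROM: START .. END (STRAND)' into its four parts.

    Hypothetical helper for illustration; the pattern only covers the
    location format shown in the Overview of this page.
    """
    m = re.fullmatch(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("CG_Chr07: 4620340 .. 4624499 (+)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

Under the inclusive-coordinate assumption the gene spans 4160 bases on the plus strand; with half-open coordinates the span would be one base shorter.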
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGAGGGAGCTCTTTTCGACGTTCGGTGATGATTCAACGGCTCCGGCAAATTGCTGTCACGATCAAGATCAAGCTTCTTCTGTGTTGTTGCATTGGACTCGCCGTCGTCTTCTTCGCCGCCCGCGCTTCCGATCTCATGGGATGGACGTGCGACGACTGCACTACGCCACTCCCATACTCTTCGCCGCGGTTTGAAATTCGAACCCTTTTTTCGTTCATGAATTTTGTGCTTCTTTCTTATGTGATCTTCACGTGAATTGGTTATGATTTCTGGATCTTTTCGTGTTTAATTATCTCTGATGGGATGAAAATTAGAATCTCCTAAGTTAAATAATTTAGGCCCCGTTTCATATTAAATAGTTTATGTCCGTTTGTAACTAGTTAGGTCTCGTTTTGTAATCTATTTGATTTTAGTATTTTAAAATTAAGTTTACAAATATCTTTTCATCTCAATTTTTTTTTTTTAATATCTACTTTTTACCCATATTTTCATAAACAAATAGATTTTAAAAACTTATTTTTGTTTTTGAAATTTAGCTAAGAATTCAGCTCTGCAGGGTAAGAAAATAAGAGTAAATAGTCTTGATTTTAAAAAAACAAAAACTAAAAACAAAATAATTACCAAACGAGATGTTAGTTTTTTGTTTCTTGTTTTTGAAATTTAAGACTACAAACATATCTTTCACTTCTAAATTTCTTACTTATTTATATCCTTTCTATCTACAATTTAAAAAACCTAGTCAAGTTTCGAAAACTAAAGAAAATAAAAAAGAATTTTTAAAATTTTTGTTTTTGTTTTTGTTTTCTATTCTTTTATCTAAGAAAGATAAAAATTTTATCTTTCTTGAACAAAAGAGTTGAATAGTTAACTATGGTAAGGAACTAAGATTAAATATGCTTAATTTTAAAATTTCAAATAGTTATAAGATGGTGCCTGAATTTTTAGTTTTTAAAACTTATCTTTATTTTCTCATAATTTTGTTTTTTTACAACACTCTTCATCTTTCGAACCTCTGACAATTTTTAGATATGATTTAATGATGTGTTATAGAGGTGAAAGTAGTATTTATAAGCTAGAATGGATTTGAGGTTACTGTATCATTAACTTTTTGAATTAAATAGAAGTTTCCATGTATTTTGTCTTCATTGTAAAAGTAATTATAGCCCACATTTTCTTATATGGGAATATATATGAAGAAAATGATGAAAAGTTGTGTGTATTGAACAATTTCTAGACATAGTTTATTCTATTGATGAGTTATTGTATTATATGAACGTGAAATTGTAGGAAACGTTATGCAATTCTAATGAACACATGGAAGCGTTATGATCTTTTAAAGAAGTCAATTTCTCACTACACAAAATGTTTGGGAGTTGAGTCCATACATATTGTATGGAGTGAACCAGCTCCTCCTCCTGTTTCTTTGGTAACTTTTCTACAACGGACGGCGAAGGCGAATTCCCGACACGGTCGAGAAGCCGAGTTGAGATTTGAAATAAATGAAGAAGATAGCTTGAACAATAGGTTTAAGGAAATAAAGGGATTGAAAACAGAAGCCATATTTTCAGTAGATGATGATGTTATATTTGCTTGTTCTACTTTGGAGTTTGCTTTTACTGTTTGGCAAAGTGCACCTCAAACTATGGTTGGCTTTGTGCCTCGCATGCATTGGATTGACCGCTCGGTATGTTTTTCCACCTTTCCATTCGTTTCTTTTCTTTTACATCATCTTCTTCATGTTTTTTTTTCTTTTGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGTATGGATAAGGTTCTTTTTGTTGGGGTAAATTATAAGTTTAGGCCGTGTTTGATAATTATTTGGATTTTTTGTTTTGGTTTTTGAAAATTATGTTTGTTTTCTTACCATTTCTTTACCATGGTTTTCGTCTTTCCTAGCTAAATACTTTAGTTCTTGGTCATAATCCCAAAACAAAAACAAATTTTTAAAGAACTACCATTTTTTAGT
TTTCAAATTTTGATTTTTTTAAAGGTAGATAATGAAATAAGAAAACAATAAATTAAGGTAGTGTTTATAAGTTTAGTTTTCAAAAACAAAAAACAAAATGATTATCAAAGAAAATTTTTATGGTTGTGATTTAGAAGTTTGTGTGATCTTGAACTTTTAATTTTGTGTCTAGTAAGTCTTTGAACTTTAAAAGTATTCAAGGTGGTGTTTGGCCCAAAGAGTTGTGGGAGTAAGAGTTGTGAACTCCACTCCTAGTTTGTCCCAAGAAGCTCGTGGGCCACACGATTAAAAAACATCGATTACTTATTAACTTCTCACACTGTGGGCCTAGGTTTGACTTACTAAAATTTTCAACTCTTTGAAGTTCAATAACTCCACTCCAGGCCCCAAACATCCTCTCTCATAGATCCCTTAGACTTCAATCTTTGTTTGTCCTAAGAAGTTCGTGGGCCACACAACTAAAAAACATAAACTTTATGTCTTATTAACTTCTATATTGTAGGCTCCCTAAACTTCTTTATTATTTGGAGTTCACAATTTCACTTCTTGTCTCAAATGCCCCATAATAGATCCCTTAGGCCCAGTGTGATAATCATTTAATTTTTAGTTTTTTTGAAAATTGTGTTTATTTTCTTATACCTTCTTTACCATGGTTTTCATTTTTTTAAAGTAAACATTTGTATCCTAAAACACCTCTAAAAGTAGATAATAAAACAAAGAAATCTATAAGTGGAAGTAACGTTTATAAGCTTAATCTTCAAAAACTGAAGAACAAAAACCAAATGGTTATAAAACATAACCTTAACTTTGAATTTTGTGTCCAATAGATCTATGACACGAATTTTAATGTCGAATAAGTCGAAAACCTTTAAGAAACAAAATTGAAAGTCTAAGAAGGTATTAGATAATTTGAAAAACCTAAAGGACATGAAGTTAAAAAGTAAAGAGACGTATTAAACACTTTGGCCAAACGTAAAAGTTTGGGGACCAACTAGGTTTGGTTATTGAAACGTTAATATATATTTTGTAAAAGTGTTTTCTAGTTATTATTATTAAAAGATAATTTAGAACTTGGAAGTTCTATTCTCTTCTTAAAACCAAATACACATGATTAAGTACTTGTAACATAAACAATATATATAAATCTATAGGGTGGCATGCATATAACATACTTATTAAACCTAATAATGTAATAGAAATTTTCAAATATTAAAGTAGGAAATTAGGATTGAAAAAATTGAAATGTTTGGTTAATGTTTGATGATTATTAGCAATGCAGAAGGGAGAAATGGGGAGATTTAGATATGGAGGATGGTGGTCAGTTTGGTGGACTGGTACATACAGTATGGTGCTTTCAAAAGCTGCATTTTTCCACTCTAAATATTTGGATATTTACACTAACAATATGCCTTCTTCCATCAGTGATTATATTACCAAAAACAGGTTTCTTTGATCACTTTTTCCCTTTCCATTAATCCCTTTTATGTCACTTGCTAATTTTGAATTTGAATTCTTTAGGAACTGTGAAGACATTGCAATGTCTTTTCTTGTTGCTAACGTAAGTGGTGCTCCACCTGTGTGGGTCCAAGGTACGAAACTGTGGATAAGAATAATACTTGAGTACATTCTCAACTACAATAAATATTTTGAAGATAAAAGAGGAAAGTATTTATTACATGACAAATATATAATTAGGCTGTAAAAATATTTTCTTGGCTAAAAGTTATACTTTTGATCCCTAAGGTTTGAAATTTATGTTCATTGAGTTTAAAAACCAAAAAATACTTTATTTTGGTTCCTTGATTAACAAAAACCATTAATTATTAACCAAGAGTTAGCAGATTTTGTTAGTATATGGATCAAACCTAGAGTGATTTGACACAAAACTCAAACATTGAGAAACCAAGTGTAGTTTACCCATTTTTGTTTTAATAATATGGTTACAAAAATTGGATAAGAAGCGGGTTTAATTTTGTTAATATGTGTGGAAATGCAGGTAA
GATTTATGAAATAGGGTCGAGTGGAATTAGTAGTTTAGGAGGTCATAGTGAAAGAAGAAGCCAATGCTTGAATTGGTTTGTTGAAGAATATGGTGGAATTATGCCTTTGCTACCTTCAACTCTCAAGGCTGTTGATAGTCGTCACATTTGGTCTTGGTAA

mRNA sequence

ATGAGAGGGAGCTCTTTTCGACGTTCGGTGATGATTCAACGGCTCCGGCAAATTGCTGTCACGATCAAGATCAAGCTTCTTCTGTGTTGTTGCATTGGACTCGCCGTCGTCTTCTTCGCCGCCCGCGCTTCCGATCTCATGGGATGGACGTGCGACGACTGCACTACGCCACTCCCATACTCTTCGCCGCGGTTTGAAATTCGAACCCTTTTTTCGTTCATGAATTTTGTGCTTCTTTCTTATGTGATCTTCACGAAACGTTATGCAATTCTAATGAACACATGGAAGCGTTATGATCTTTTAAAGAAGTCAATTTCTCACTACACAAAATGTTTGGGAGTTGAGTCCATACATATTGTATGGAGTGAACCAGCTCCTCCTCCTGTTTCTTTGGTAACTTTTCTACAACGGACGGCGAAGGCGAATTCCCGACACGGTCGAGAAGCCGAGTTGAGATTTGAAATAAATGAAGAAGATAGCTTGAACAATAGGTTTAAGGAAATAAAGGGATTGAAAACAGAAGCCATATTTTCAGTAGATGATGATGTTATATTTGCTTGTTCTACTTTGGAGTTTGCTTTTACTGTTTGGCAAAGTGCACCTCAAACTATGGTTGGCTTTGTGCCTCGCATGCATTGGATTGACCGCTCGAAGGGAGAAATGGGGAGATTTAGATATGGAGGATGGTGGTCAGTTTGGTGGACTGGTACATACAGTATGGTGCTTTCAAAAGCTGCATTTTTCCACTCTAAATATTTGGATATTTACACTAACAATATGCCTTCTTCCATCAGTGATTATATTACCAAAAACAGGAACTGTGAAGACATTGCAATGTCTTTTCTTGTTGCTAACGTAAGTGGTGCTCCACCTGTGTGGGTCCAAGGTAAGATTTATGAAATAGGGTCGAGTGGAATTAGTAGTTTAGGAGGTCATAGTGAAAGAAGAAGCCAATGCTTGAATTGGTTTGTTGAAGAATATGGTGGAATTATGCCTTTGCTACCTTCAACTCTCAAGGCTGTTGATAGTCGTCACATTTGGTCTTGGTAA

Coding sequence (CDS)

ATGAGAGGGAGCTCTTTTCGACGTTCGGTGATGATTCAACGGCTCCGGCAAATTGCTGTCACGATCAAGATCAAGCTTCTTCTGTGTTGTTGCATTGGACTCGCCGTCGTCTTCTTCGCCGCCCGCGCTTCCGATCTCATGGGATGGACGTGCGACGACTGCACTACGCCACTCCCATACTCTTCGCCGCGGTTTGAAATTCGAACCCTTTTTTCGTTCATGAATTTTGTGCTTCTTTCTTATGTGATCTTCACGAAACGTTATGCAATTCTAATGAACACATGGAAGCGTTATGATCTTTTAAAGAAGTCAATTTCTCACTACACAAAATGTTTGGGAGTTGAGTCCATACATATTGTATGGAGTGAACCAGCTCCTCCTCCTGTTTCTTTGGTAACTTTTCTACAACGGACGGCGAAGGCGAATTCCCGACACGGTCGAGAAGCCGAGTTGAGATTTGAAATAAATGAAGAAGATAGCTTGAACAATAGGTTTAAGGAAATAAAGGGATTGAAAACAGAAGCCATATTTTCAGTAGATGATGATGTTATATTTGCTTGTTCTACTTTGGAGTTTGCTTTTACTGTTTGGCAAAGTGCACCTCAAACTATGGTTGGCTTTGTGCCTCGCATGCATTGGATTGACCGCTCGAAGGGAGAAATGGGGAGATTTAGATATGGAGGATGGTGGTCAGTTTGGTGGACTGGTACATACAGTATGGTGCTTTCAAAAGCTGCATTTTTCCACTCTAAATATTTGGATATTTACACTAACAATATGCCTTCTTCCATCAGTGATTATATTACCAAAAACAGGAACTGTGAAGACATTGCAATGTCTTTTCTTGTTGCTAACGTAAGTGGTGCTCCACCTGTGTGGGTCCAAGGTAAGATTTATGAAATAGGGTCGAGTGGAATTAGTAGTTTAGGAGGTCATAGTGAAAGAAGAAGCCAATGCTTGAATTGGTTTGTTGAAGAATATGGTGGAATTATGCCTTTGCTACCTTCAACTCTCAAGGCTGTTGATAGTCGTCACATTTGGTCTTGGTAA

Protein sequence

MRGSSFRRSVMIQRLRQIAVTIKIKLLLCCCIGLAVVFFAARASDLMGWTCDDCTTPLPYSSPRFEIRTLFSFMNFVLLSYVIFTKRYAILMNTWKRYDLLKKSISHYTKCLGVESIHIVWSEPAPPPVSLVTFLQRTAKANSRHGREAELRFEINEEDSLNNRFKEIKGLKTEAIFSVDDDVIFACSTLEFAFTVWQSAPQTMVGFVPRMHWIDRSKGEMGRFRYGGWWSVWWTGTYSMVLSKAAFFHSKYLDIYTNNMPSSISDYITKNRNCEDIAMSFLVANVSGAPPVWVQGKIYEIGSSGISSLGGHSERRSQCLNWFVEEYGGIMPLLPSTLKAVDSRHIWSW
Homology
BLAST of ClCG07G004050 vs. NCBI nr
Match: XP_038893187.1 (glycosyltransferase family 64 protein C4 [Benincasa hispida])

HSP 1 Score: 604.4 bits (1557), Expect = 6.4e-169
Identity = 300/349 (85.96%), Positives = 308/349 (88.25%), Query Frame = 0

Query: 1   MRGSSFRRSVMIQRLRQIAVTIKIKLLLCCCIGLAVVFFAARASDLMGWTCDDCTTPLPY 60
           MRG S RR VMIQRL Q+AVTIKIKLLLCCCIGLAV+ F ARASDLMGWT DD  + L Y
Sbjct: 1   MRGRSLRRPVMIQRLWQVAVTIKIKLLLCCCIGLAVILFTARASDLMGWTSDDGASALRY 60

Query: 61  SSPRFEIRTLFSFMNFVLLSYVIFTKRYAILMNTWKRYDLLKKSISHYTKCLGVESIHIV 120
           S+PR                     KRYAILMNTWKRYDLLKKSISHYT CLGVESIHIV
Sbjct: 61  STPR---------------------KRYAILMNTWKRYDLLKKSISHYTTCLGVESIHIV 120

Query: 121 WSEPAPPPVSLVTFLQRTAKANSRHGREAELRFEINEEDSLNNRFKEIKGLKTEAIFSVD 180
           WSEPAPPPVSLV+FLQ TAKANSR GREAELRFE+NEEDSLNNRFKEIKGLKTEAIFSVD
Sbjct: 121 WSEPAPPPVSLVSFLQGTAKANSRDGREAELRFEMNEEDSLNNRFKEIKGLKTEAIFSVD 180

Query: 181 DDVIFACSTLEFAFTVWQSAPQTMVGFVPRMHWIDRSKGEMGRFRYGGWWSVWWTGTYSM 240
           DDVIF CSTLEFAF+VWQSAPQTMVGFVPRMHWIDRSKGEMGRFRYGGWWSVWWTGTYSM
Sbjct: 181 DDVIFGCSTLEFAFSVWQSAPQTMVGFVPRMHWIDRSKGEMGRFRYGGWWSVWWTGTYSM 240

Query: 241 VLSKAAFFHSKYLDIYTNNMPSSISDYITKNRNCEDIAMSFLVANVSGAPPVWVQGKIYE 300
           VLSKAAFFHSKYL IYTN+MPSSI DYITKNRNCEDIAMSFLVANVSGAPPVWVQGKIYE
Sbjct: 241 VLSKAAFFHSKYLSIYTNHMPSSIRDYITKNRNCEDIAMSFLVANVSGAPPVWVQGKIYE 300

Query: 301 IGSSGISSLGGHSERRSQCLNWFVEEYGGIMPLLPSTLKAVDSRHIWSW 350
           IGSSGISSLGGHSERRSQCLN FVEEYGGIMPLLPSTLKAVDSR  WSW
Sbjct: 301 IGSSGISSLGGHSERRSQCLNIFVEEYGGIMPLLPSTLKAVDSRQFWSW 328
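
The percentages in the HSP header for this hit follow directly from the reported counts: 300 identical and 308 similar positions over an alignment length of 349. A minimal sketch of that arithmetic, with percent as a hypothetical helper name:

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)

# Counts taken from the XP_038893187.1 HSP header above.
identity_pct = percent(300, 349)
positives_pct = percent(308, 349)
print(identity_pct, positives_pct)
```

The same calculation applies to every other HSP header on this page; note that the denominator is the alignment length including gap columns, not the length of the query protein.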

BLAST of ClCG07G004050 vs. NCBI nr
Match: XP_004149253.1 (glycosyltransferase family 64 protein C4 [Cucumis sativus] >KAE8648017.1 hypothetical protein Csa_021415 [Cucumis sativus])

HSP 1 Score: 577.0 bits (1486), Expect = 1.1e-160
Identity = 282/350 (80.57%), Positives = 307/350 (87.71%), Query Frame = 0

Query: 1   MRGSSFRRSVMIQRLRQIAVTIKIKLLLCCCIGLAVVFFAARASDLMGWTCDDCTTPLPY 60
           MRG+S RR VM+QRLRQIAVTIKIKLLLCCCI LA+VFFA+RASDLMGWTCDDC+T + Y
Sbjct: 1   MRGTSLRRPVMVQRLRQIAVTIKIKLLLCCCIVLAIVFFASRASDLMGWTCDDCSTAVRY 60

Query: 61  SSPRFEIRTLFSFMNFVLLSYVIFTKRYAILMNTWKRYDLLKKSISHYTKCLGVESIHIV 120
           S+PR                     KRYAI+MNTWKR+DLLKKSI HYT C+GVESIHIV
Sbjct: 61  STPR---------------------KRYAIVMNTWKRHDLLKKSIDHYTACIGVESIHIV 120

Query: 121 WSEPAPPPVSLVTFLQRTAKANSRHGREAELRFEINEED-SLNNRFKEIKGLKTEAIFSV 180
           WSEP+PPP SLV++LQRT KANSR GRE ELRFE+NEED SLNNRFKEIKGLKTEAIFSV
Sbjct: 121 WSEPSPPPDSLVSYLQRTVKANSRDGRETELRFEMNEEDSSLNNRFKEIKGLKTEAIFSV 180

Query: 181 DDDVIFACSTLEFAFTVWQSAPQTMVGFVPRMHWIDRSKGEMGRFRYGGWWSVWWTGTYS 240
           DDDVIFACSTLEFAF+VWQ+AP TMVGFVPRMHWIDRSK   GR+RYGGWWSVWW+GTYS
Sbjct: 181 DDDVIFACSTLEFAFSVWQTAPHTMVGFVPRMHWIDRSK---GRYRYGGWWSVWWSGTYS 240

Query: 241 MVLSKAAFFHSKYLDIYTNNMPSSISDYITKNRNCEDIAMSFLVANVSGAPPVWVQGKIY 300
           MVLSKAAFFHSKYLD YTN+MPSSI  YIT NRNCEDIAMSF+VAN+SG+PPVWVQGKIY
Sbjct: 241 MVLSKAAFFHSKYLDFYTNHMPSSIRHYITNNRNCEDIAMSFVVANLSGSPPVWVQGKIY 300

Query: 301 EIGSSGISSLGGHSERRSQCLNWFVEEYGGIMPLLPSTLKAVDSRHIWSW 350
           E+GSSGISSLGGHSERRSQCLN FVEEYGGIMPLLPSTLKAVD+R +WSW
Sbjct: 301 EVGSSGISSLGGHSERRSQCLNIFVEEYGGIMPLLPSTLKAVDARRLWSW 326

BLAST of ClCG07G004050 vs. NCBI nr
Match: XP_008463246.1 (PREDICTED: glycosyltransferase family 64 protein C4 [Cucumis melo])

HSP 1 Score: 563.1 bits (1450), Expect = 1.6e-156
Identity = 278/350 (79.43%), Positives = 302/350 (86.29%), Query Frame = 0

Query: 1   MRGSSFRRSVMIQRLRQIAVTIKIKLLLCCCIGLAVVFFAARASDLMGWTCDDCTTPLPY 60
           MRG+S RR V IQRLRQIAVTIKIKLLLCCCI LAVVFFA+RASDLMGWTCDDC+T + Y
Sbjct: 1   MRGTSLRRPVTIQRLRQIAVTIKIKLLLCCCIVLAVVFFASRASDLMGWTCDDCSTAVRY 60

Query: 61  SSPRFEIRTLFSFMNFVLLSYVIFTKRYAILMNTWKRYDLLKKSISHYTKCLGVESIHIV 120
           S+PR                     K YAI++NTWKR+DLLKKSI HYT C+GVESIHIV
Sbjct: 61  STPR---------------------KGYAIVINTWKRHDLLKKSIDHYTACMGVESIHIV 120

Query: 121 WSEPAPPPVSLVTFLQRTAKANSRHGREAELRFEINEED-SLNNRFKEIKGLKTEAIFSV 180
           WSEP+PPP SLV++LQ+T KANSR  RE ELRFEINEED SLNNRFKEIKGL+TEAIFSV
Sbjct: 121 WSEPSPPPDSLVSYLQQTVKANSRDSRETELRFEINEEDSSLNNRFKEIKGLRTEAIFSV 180

Query: 181 DDDVIFACSTLEFAFTVWQSAPQTMVGFVPRMHWIDRSKGEMGRFRYGGWWSVWWTGTYS 240
           DDDVIFACSTLEFAF+VWQ+AP TMVGFVPRMHWIDRSK   GR+RYGGWWSVWW+GTYS
Sbjct: 181 DDDVIFACSTLEFAFSVWQTAPHTMVGFVPRMHWIDRSK---GRYRYGGWWSVWWSGTYS 240

Query: 241 MVLSKAAFFHSKYLDIYTNNMPSSISDYITKNRNCEDIAMSFLVANVSGAPPVWVQGKIY 300
           MVLSKAAFFHSKYLD YTN+MPSSI  YIT NRNCEDIAMSF+VAN+SG+PPVWVQGKIY
Sbjct: 241 MVLSKAAFFHSKYLDFYTNHMPSSIRHYITNNRNCEDIAMSFVVANLSGSPPVWVQGKIY 300

Query: 301 EIGSSGISSLGGHSERRSQCLNWFVEEYGGIMPLLPSTLKAVDSRHIWSW 350
           EIGSSGISSLG HSERRSQCLN FVEEYGGIMPLLPST KAVD+R +WSW
Sbjct: 301 EIGSSGISSLGEHSERRSQCLNIFVEEYGGIMPLLPSTFKAVDARRLWSW 326

BLAST of ClCG07G004050 vs. NCBI nr
Match: KAG6600937.1 (Glycosyltransferase family 64 protein C4, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 551.2 bits (1419), Expect = 6.4e-153
Identity = 269/349 (77.08%), Positives = 298/349 (85.39%), Query Frame = 0

Query: 1   MRGSSFRRSVMIQRLRQIAVTIKIKLLLCCCIGLAVVFFAARASDLMGWTCDDCTTPLPY 60
           MRGSS RR VMIQRLRQIAVTIKIK LLCCCI LA V FA RAS+LMGWT DD  T L Y
Sbjct: 1   MRGSSLRRPVMIQRLRQIAVTIKIKHLLCCCIVLAFVLFATRASNLMGWTSDDDATALRY 60

Query: 61  SSPRFEIRTLFSFMNFVLLSYVIFTKRYAILMNTWKRYDLLKKSISHYTKCLGVESIHIV 120
           S+PR                     KRYAILMNTWKR+DLLK+SISHYT C+GVESIHIV
Sbjct: 61  STPR---------------------KRYAILMNTWKRHDLLKQSISHYTTCMGVESIHIV 120

Query: 121 WSEPAPPPVSLVTFLQRTAKANSRHGREAELRFEINEEDSLNNRFKEIKGLKTEAIFSVD 180
           WSEP PPP SLV FLQRT K NSR+ RE ELRFE+N+EDSLNNRFKEIK L T+AIFSVD
Sbjct: 121 WSEPDPPPDSLVAFLQRTVKENSRNDREIELRFEMNKEDSLNNRFKEIKNLNTDAIFSVD 180

Query: 181 DDVIFACSTLEFAFTVWQSAPQTMVGFVPRMHWIDRSKGEMGRFRYGGWWSVWWTGTYSM 240
           DDVIFACSTLEFAFTVWQSA +TMVGFVPRMHWID+++ EMG++ YGGWWSVWWTGTYSM
Sbjct: 181 DDVIFACSTLEFAFTVWQSASETMVGFVPRMHWIDQAQEEMGKYIYGGWWSVWWTGTYSM 240

Query: 241 VLSKAAFFHSKYLDIYTNNMPSSISDYITKNRNCEDIAMSFLVANVSGAPPVWVQGKIYE 300
           VLSKAAFFHSKYL IYT++MP+SI +Y+TKNRNCEDIAMSF+VANVSGAPPVWV+G+IYE
Sbjct: 241 VLSKAAFFHSKYLGIYTHHMPASIRNYVTKNRNCEDIAMSFVVANVSGAPPVWVKGRIYE 300

Query: 301 IGSSGISSLGGHSERRSQCLNWFVEEYGGIMPLLPSTLKAVDSRHIWSW 350
           IGS GISSLGGHSERRS+C+N FVEEYGG+MPL+PST+KAVDSRH+W W
Sbjct: 301 IGSGGISSLGGHSERRSKCVNTFVEEYGGMMPLVPSTVKAVDSRHLWFW 328

BLAST of ClCG07G004050 vs. NCBI nr
Match: XP_022987145.1 (glycosyltransferase family 64 protein C4 [Cucurbita maxima])

HSP 1 Score: 549.7 bits (1415), Expect = 1.9e-152
Identity = 267/349 (76.50%), Positives = 296/349 (84.81%), Query Frame = 0

Query: 1   MRGSSFRRSVMIQRLRQIAVTIKIKLLLCCCIGLAVVFFAARASDLMGWTCDDCTTPLPY 60
           MRGSS RR +MIQRLRQIAVTIKIK LLCCCI LA V FA RAS+LMGWT DD  T L Y
Sbjct: 1   MRGSSLRRPIMIQRLRQIAVTIKIKHLLCCCIVLAFVLFATRASNLMGWTSDDGATALRY 60

Query: 61  SSPRFEIRTLFSFMNFVLLSYVIFTKRYAILMNTWKRYDLLKKSISHYTKCLGVESIHIV 120
           S+PR                     KRYAILMNTWKR+DLLK+S+SHYT C+GVESIHIV
Sbjct: 61  STPR---------------------KRYAILMNTWKRHDLLKQSVSHYTTCMGVESIHIV 120

Query: 121 WSEPAPPPVSLVTFLQRTAKANSRHGREAELRFEINEEDSLNNRFKEIKGLKTEAIFSVD 180
           WSEP PPP SLV FLQRT K NSR+ RE +LRFE+NEEDSLNNRFKEIK L T+AIFSVD
Sbjct: 121 WSEPDPPPDSLVAFLQRTVKENSRNDREIKLRFEMNEEDSLNNRFKEIKDLNTDAIFSVD 180

Query: 181 DDVIFACSTLEFAFTVWQSAPQTMVGFVPRMHWIDRSKGEMGRFRYGGWWSVWWTGTYSM 240
           DDVIFACSTLEFAFTVWQSA +TMVGFVPRMHWID++K EMGR+ YGGWWSVWWTGTYSM
Sbjct: 181 DDVIFACSTLEFAFTVWQSASETMVGFVPRMHWIDQAKEEMGRYIYGGWWSVWWTGTYSM 240

Query: 241 VLSKAAFFHSKYLDIYTNNMPSSISDYITKNRNCEDIAMSFLVANVSGAPPVWVQGKIYE 300
           VLSKAAFFHSKYL IYT++MP+SI +Y+TKNRNCEDIAMSF+VANVSGAPPVWV+G+IYE
Sbjct: 241 VLSKAAFFHSKYLGIYTHHMPASIRNYVTKNRNCEDIAMSFVVANVSGAPPVWVKGRIYE 300

Query: 301 IGSSGISSLGGHSERRSQCLNWFVEEYGGIMPLLPSTLKAVDSRHIWSW 350
           IGS GISSLGGHSERR +C+N FVEEYGG+MPL+PST+KAVD RH+W W
Sbjct: 301 IGSGGISSLGGHSERRGKCVNTFVEEYGGMMPLVPSTVKAVDGRHLWFW 328

BLAST of ClCG07G004050 vs. ExPASy Swiss-Prot
Match: Q9LY62 (Glycosylinositol phosphorylceramide mannosyl transferase 1 OS=Arabidopsis thaliana OX=3702 GN=GMT1 PE=1 SV=1)

HSP 1 Score: 395.2 bits (1014), Expect = 7.7e-109
Identity = 189/337 (56.08%), Positives = 236/337 (70.03%), Query Frame = 0

Query: 13  QRLRQIAVTIKIKLLLCCCIGLAVVFFAARASDLMGWTCDDCTTPLPYSSPRFEIRTLFS 72
           Q+LR+       K LL CCI   +V    R+S    W           S  R        
Sbjct: 22  QKLRKFVTARSTKFLLFCCIAFVLVTIVCRSS--RPWVNSSIAVADRISGSR-------- 81

Query: 73  FMNFVLLSYVIFTKRYAILMNTWKRYDLLKKSISHYTKCLGVESIHIVWSEPAPPPVSLV 132
                        K Y +LMNTWKRYDLLKKS+SHY  C  ++SIHIVWSEP PP  SL 
Sbjct: 82  -------------KGYTLLMNTWKRYDLLKKSVSHYASCSRLDSIHIVWSEPNPPSESLK 141

Query: 133 TFLQRTAKANSRHGREAELRFEINEEDSLNNRFKEIKGLKTEAIFSVDDDVIFACSTLEF 192
            +L    K  +R G E ELRF+IN+EDSLNNRFKEIK LKT+A+FS+DDD+IF C T++F
Sbjct: 142 EYLHNVLKKKTRDGHEVELRFDINKEDSLNNRFKEIKDLKTDAVFSIDDDIIFPCHTVDF 201

Query: 193 AFTVWQSAPQTMVGFVPRMHWIDRSKGEMGRFRYGGWWSVWWTGTYSMVLSKAAFFHSKY 252
           AF VW+SAP TMVGFVPR+HW ++S  +   + Y GWWSVWW+GTYSMVLSKAAFFH KY
Sbjct: 202 AFNVWESAPDTMVGFVPRVHWPEKSNDKANYYTYSGWWSVWWSGTYSMVLSKAAFFHKKY 261

Query: 253 LDIYTNNMPSSISDYITKNRNCEDIAMSFLVANVSGAPPVWVQGKIYEIGSSGISSLGGH 312
           L +YTN+MP+SI ++ TKNRNCEDIAMSFL+AN + AP +WV+GKIYEIGS+GISS+GGH
Sbjct: 262 LSLYTNSMPASIREFTTKNRNCEDIAMSFLIANATNAPAIWVKGKIYEIGSTGISSIGGH 321

Query: 313 SERRSQCLNWFVEEYGGIMPLLPSTLKAVDSRHIWSW 350
           +E+R+ C+N FV E+G  MPL+ +++KAVDSR++W W
Sbjct: 322 TEKRTHCVNRFVAEFGK-MPLVYTSMKAVDSRNLWFW 334

BLAST of ClCG07G004050 vs. ExPASy Swiss-Prot
Match: Q5IGR7 (Exostosin-1b OS=Danio rerio OX=7955 GN=ext1b PE=2 SV=1)

HSP 1 Score: 125.6 bits (314), Expect = 1.1e-27
Identity = 71/190 (37.37%), Positives = 111/190 (58.42%), Query Frame = 0

Query: 157 EEDSLNNRFKEIKGLKTEAIFSVDDDVIFACSTLEFAFTVWQSAPQTMVGFVPRMHWIDR 216
           E   +++RF+  + L ++A+ S+D+D + + + ++FAFTVWQS P+ +VG+  R H+ D 
Sbjct: 537 ENKVMSSRFQPYESLISDAVLSLDEDTVLSTTEVDFAFTVWQSFPERIVGYPARSHFWDN 596

Query: 217 SKGEMGRFRYGGWWSVWWTGTYSMVLSKAAFFHSKYLDIYTNNMPSSISDYITKNRNCED 276
           +K   G       ++  WT  YSMVL+ AA +H  Y  +YT+ +PSS+   + +  NCED
Sbjct: 597 NKERWG-------YTSKWTNDYSMVLTGAAIYHRYYHFLYTHFLPSSLKSMVDQLANCED 656

Query: 277 IAMSFLVANVSGAPPVWV-QGKIYEIGSSGISSLGG------HSERRSQCLNWFVEEYGG 336
           I M+FLV+ V+  PPV V Q K Y+    G SS         H  +R  C+N F   +GG
Sbjct: 657 ILMNFLVSAVTKLPPVKVTQKKQYKETMMGQSSRASRWADPDHFAQRQTCMNKFASWFGG 716

Query: 337 IMPLLPSTLK 340
            MPL+ S ++
Sbjct: 717 -MPLVHSQMR 718

BLAST of ClCG07G004050 vs. ExPASy Swiss-Prot
Match: A5D7I4 (Exostosin-1 OS=Bos taurus OX=9913 GN=EXT1 PE=2 SV=1)

HSP 1 Score: 122.5 bits (306), Expect = 9.6e-27
Identity = 68/190 (35.79%), Positives = 109/190 (57.37%), Query Frame = 0

Query: 157 EEDSLNNRFKEIKGLKTEAIFSVDDDVIFACSTLEFAFTVWQSAPQTMVGFVPRMHWIDR 216
           E   +++RF     + T+A+ S+D+D + + + ++FAFTVWQS P+ +VG+  R H+ D 
Sbjct: 542 ESKVMSSRFLPYDNIITDAVLSLDEDTVLSTTEVDFAFTVWQSFPERIVGYPARSHFWDN 601

Query: 217 SKGEMGRFRYGGWWSVWWTGTYSMVLSKAAFFHSKYLDIYTNNMPSSISDYITKNRNCED 276
           SK   G       ++  WT  YSMVL+ AA +H  Y  +YT+ +P+S+ + + +  NCED
Sbjct: 602 SKERWG-------YTSKWTNDYSMVLTGAAIYHKYYHYLYTHYLPASLKNMVDQLANCED 661

Query: 277 IAMSFLVANVSGAPPVWV-QGKIYEIGSSGISSLGG------HSERRSQCLNWFVEEYGG 336
           I M+FLV+ V+  PP+ V Q K Y+    G +S         H  +R  C+N F   + G
Sbjct: 662 ILMNFLVSAVTKLPPIKVTQKKQYKETMMGQTSRASRWADPDHFAQRQSCMNTFASWF-G 721

Query: 337 IMPLLPSTLK 340
            MPL+ S ++
Sbjct: 722 YMPLIHSQMR 723

BLAST of ClCG07G004050 vs. ExPASy Swiss-Prot
Match: Q9JK82 (Exostosin-1 OS=Cricetulus griseus OX=10029 GN=EXT1 PE=1 SV=1)

HSP 1 Score: 122.5 bits (306), Expect = 9.6e-27
Identity = 68/190 (35.79%), Positives = 109/190 (57.37%), Query Frame = 0

Query: 157 EEDSLNNRFKEIKGLKTEAIFSVDDDVIFACSTLEFAFTVWQSAPQTMVGFVPRMHWIDR 216
           E   +++RF     + T+A+ S+D+D + + + ++FAFTVWQS P+ +VG+  R H+ D 
Sbjct: 542 ESKVMSSRFLPYDNIITDAVLSLDEDTVLSTTEVDFAFTVWQSFPERIVGYPARSHFWDN 601

Query: 217 SKGEMGRFRYGGWWSVWWTGTYSMVLSKAAFFHSKYLDIYTNNMPSSISDYITKNRNCED 276
           SK   G       ++  WT  YSMVL+ AA +H  Y  +YT+ +P+S+ + + +  NCED
Sbjct: 602 SKERWG-------YTSKWTNDYSMVLTGAAIYHKYYHYLYTHYLPASLKNMVDQLANCED 661

Query: 277 IAMSFLVANVSGAPPVWV-QGKIYEIGSSGISSLGG------HSERRSQCLNWFVEEYGG 336
           I M+FLV+ V+  PP+ V Q K Y+    G +S         H  +R  C+N F   + G
Sbjct: 662 ILMNFLVSAVTKLPPIKVTQKKQYKETMMGQTSRASRWADPDHFAQRQSCMNTFASWF-G 721

Query: 337 IMPLLPSTLK 340
            MPL+ S ++
Sbjct: 722 YMPLIHSQMR 723

BLAST of ClCG07G004050 vs. ExPASy Swiss-Prot
Match: Q5IGR8 (Exostosin-1a OS=Danio rerio OX=7955 GN=ext1a PE=2 SV=1)

HSP 1 Score: 121.3 bits (303), Expect = 2.1e-26
Identity = 68/190 (35.79%), Positives = 109/190 (57.37%), Query Frame = 0

Query: 157 EEDSLNNRFKEIKGLKTEAIFSVDDDVIFACSTLEFAFTVWQSAPQTMVGFVPRMHWIDR 216
           E   +++RF   + + T+A+ S+D+D + + + ++FAFTVWQS P+ +VG+  R H+ D 
Sbjct: 526 ESKVMSSRFLPYENIITDAVLSLDEDTVLSTTEVDFAFTVWQSFPERIVGYPARSHFWDS 585

Query: 217 SKGEMGRFRYGGWWSVWWTGTYSMVLSKAAFFHSKYLDIYTNNMPSSISDYITKNRNCED 276
           +K   G       ++  WT  YSMVL+ AAF+H  Y  +YT+ +P S+   + +  NCED
Sbjct: 586 NKERWG-------YTSKWTNDYSMVLTGAAFYHRYYNYLYTHYLPGSLKGLVDQLSNCED 645

Query: 277 IAMSFLVANVSGAPPVWV-QGKIYEIGSSGISSLGG------HSERRSQCLNWFVEEYGG 336
           I M+FLV+ V+  PP+ V Q K Y+    G +S         H  +R  C+N F   + G
Sbjct: 646 ILMNFLVSAVTKMPPIKVTQKKQYKETMMGQTSRASRWADPDHFAQRQTCMNKFASWF-G 705

Query: 337 IMPLLPSTLK 340
            MPL+ S ++
Sbjct: 706 TMPLVHSQMR 707

BLAST of ClCG07G004050 vs. ExPASy TrEMBL
Match: A0A1S3CIU1 (glycosyltransferase family 64 protein C4 OS=Cucumis melo OX=3656 GN=LOC103501448 PE=3 SV=1)

HSP 1 Score: 563.1 bits (1450), Expect = 7.9e-157
Identity = 278/350 (79.43%), Positives = 302/350 (86.29%), Query Frame = 0

Query: 1   MRGSSFRRSVMIQRLRQIAVTIKIKLLLCCCIGLAVVFFAARASDLMGWTCDDCTTPLPY 60
           MRG+S RR V IQRLRQIAVTIKIKLLLCCCI LAVVFFA+RASDLMGWTCDDC+T + Y
Sbjct: 1   MRGTSLRRPVTIQRLRQIAVTIKIKLLLCCCIVLAVVFFASRASDLMGWTCDDCSTAVRY 60

Query: 61  SSPRFEIRTLFSFMNFVLLSYVIFTKRYAILMNTWKRYDLLKKSISHYTKCLGVESIHIV 120
           S+PR                     K YAI++NTWKR+DLLKKSI HYT C+GVESIHIV
Sbjct: 61  STPR---------------------KGYAIVINTWKRHDLLKKSIDHYTACMGVESIHIV 120

Query: 121 WSEPAPPPVSLVTFLQRTAKANSRHGREAELRFEINEED-SLNNRFKEIKGLKTEAIFSV 180
           WSEP+PPP SLV++LQ+T KANSR  RE ELRFEINEED SLNNRFKEIKGL+TEAIFSV
Sbjct: 121 WSEPSPPPDSLVSYLQQTVKANSRDSRETELRFEINEEDSSLNNRFKEIKGLRTEAIFSV 180

Query: 181 DDDVIFACSTLEFAFTVWQSAPQTMVGFVPRMHWIDRSKGEMGRFRYGGWWSVWWTGTYS 240
           DDDVIFACSTLEFAF+VWQ+AP TMVGFVPRMHWIDRSK   GR+RYGGWWSVWW+GTYS
Sbjct: 181 DDDVIFACSTLEFAFSVWQTAPHTMVGFVPRMHWIDRSK---GRYRYGGWWSVWWSGTYS 240

Query: 241 MVLSKAAFFHSKYLDIYTNNMPSSISDYITKNRNCEDIAMSFLVANVSGAPPVWVQGKIY 300
           MVLSKAAFFHSKYLD YTN+MPSSI  YIT NRNCEDIAMSF+VAN+SG+PPVWVQGKIY
Sbjct: 241 MVLSKAAFFHSKYLDFYTNHMPSSIRHYITNNRNCEDIAMSFVVANLSGSPPVWVQGKIY 300

Query: 301 EIGSSGISSLGGHSERRSQCLNWFVEEYGGIMPLLPSTLKAVDSRHIWSW 350
           EIGSSGISSLG HSERRSQCLN FVEEYGGIMPLLPST KAVD+R +WSW
Sbjct: 301 EIGSSGISSLGEHSERRSQCLNIFVEEYGGIMPLLPSTFKAVDARRLWSW 326

BLAST of ClCG07G004050 vs. ExPASy TrEMBL
Match: A0A6J1JG04 (glycosyltransferase family 64 protein C4 OS=Cucurbita maxima OX=3661 GN=LOC111484781 PE=3 SV=1)

HSP 1 Score: 549.7 bits (1415), Expect = 9.0e-153
Identity = 267/349 (76.50%), Positives = 296/349 (84.81%), Query Frame = 0

Query: 1   MRGSSFRRSVMIQRLRQIAVTIKIKLLLCCCIGLAVVFFAARASDLMGWTCDDCTTPLPY 60
           MRGSS RR +MIQRLRQIAVTIKIK LLCCCI LA V FA RAS+LMGWT DD  T L Y
Sbjct: 1   MRGSSLRRPIMIQRLRQIAVTIKIKHLLCCCIVLAFVLFATRASNLMGWTSDDGATALRY 60

Query: 61  SSPRFEIRTLFSFMNFVLLSYVIFTKRYAILMNTWKRYDLLKKSISHYTKCLGVESIHIV 120
           S+PR                     KRYAILMNTWKR+DLLK+S+SHYT C+GVESIHIV
Sbjct: 61  STPR---------------------KRYAILMNTWKRHDLLKQSVSHYTTCMGVESIHIV 120

Query: 121 WSEPAPPPVSLVTFLQRTAKANSRHGREAELRFEINEEDSLNNRFKEIKGLKTEAIFSVD 180
           WSEP PPP SLV FLQRT K NSR+ RE +LRFE+NEEDSLNNRFKEIK L T+AIFSVD
Sbjct: 121 WSEPDPPPDSLVAFLQRTVKENSRNDREIKLRFEMNEEDSLNNRFKEIKDLNTDAIFSVD 180

Query: 181 DDVIFACSTLEFAFTVWQSAPQTMVGFVPRMHWIDRSKGEMGRFRYGGWWSVWWTGTYSM 240
           DDVIFACSTLEFAFTVWQSA +TMVGFVPRMHWID++K EMGR+ YGGWWSVWWTGTYSM
Sbjct: 181 DDVIFACSTLEFAFTVWQSASETMVGFVPRMHWIDQAKEEMGRYIYGGWWSVWWTGTYSM 240

Query: 241 VLSKAAFFHSKYLDIYTNNMPSSISDYITKNRNCEDIAMSFLVANVSGAPPVWVQGKIYE 300
           VLSKAAFFHSKYL IYT++MP+SI +Y+TKNRNCEDIAMSF+VANVSGAPPVWV+G+IYE
Sbjct: 241 VLSKAAFFHSKYLGIYTHHMPASIRNYVTKNRNCEDIAMSFVVANVSGAPPVWVKGRIYE 300

Query: 301 IGSSGISSLGGHSERRSQCLNWFVEEYGGIMPLLPSTLKAVDSRHIWSW 350
           IGS GISSLGGHSERR +C+N FVEEYGG+MPL+PST+KAVD RH+W W
Sbjct: 301 IGSGGISSLGGHSERRGKCVNTFVEEYGGMMPLVPSTVKAVDGRHLWFW 328

BLAST of ClCG07G004050 vs. ExPASy TrEMBL
Match: A0A6J1GZ09 (glycosyltransferase family 64 protein C4 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111458813 PE=3 SV=1)

HSP 1 Score: 544.7 bits (1402), Expect = 2.9e-151
Identity = 266/349 (76.22%), Positives = 297/349 (85.10%), Query Frame = 0

Query: 1   MRGSSFRRSVMIQRLRQIAVTIKIKLLLCCCIGLAVVFFAARASDLMGWTCDDCTTPLPY 60
           MRGSS RR VMIQRLRQIAVTIKIK LLCCCI LA V FA RAS+LMGWT DD  T L Y
Sbjct: 1   MRGSSLRRPVMIQRLRQIAVTIKIKHLLCCCIVLAFVLFATRASNLMGWTSDDDATALRY 60

Query: 61  SSPRFEIRTLFSFMNFVLLSYVIFTKRYAILMNTWKRYDLLKKSISHYTKCLGVESIHIV 120
           S+PR                     KRYAILMNTWKR+DLLK+SISHYT C+GVESIHIV
Sbjct: 61  STPR---------------------KRYAILMNTWKRHDLLKQSISHYTTCMGVESIHIV 120

Query: 121 WSEPAPPPVSLVTFLQRTAKANSRHGREAELRFEINEEDSLNNRFKEIKGLKTEAIFSVD 180
           WSEP PPP SLV FLQRT K NSR+ RE +LRFE+N+EDSLNNRFKEIK L T+AIFSVD
Sbjct: 121 WSEPDPPPDSLVAFLQRTVKENSRNDREIKLRFEMNKEDSLNNRFKEIKNLNTDAIFSVD 180

Query: 181 DDVIFACSTLEFAFTVWQSAPQTMVGFVPRMHWIDRSKGEMGRFRYGGWWSVWWTGTYSM 240
           DDVIFACSTLEFAFTVWQSA +TMVGFVPRMHWID+++ EMG++ YGGWWSVWWTGTYSM
Sbjct: 181 DDVIFACSTLEFAFTVWQSASETMVGFVPRMHWIDQAQEEMGKYIYGGWWSVWWTGTYSM 240

Query: 241 VLSKAAFFHSKYLDIYTNNMPSSISDYITKNRNCEDIAMSFLVANVSGAPPVWVQGKIYE 300
           VLSKAAFFHSKYL IYT++MP+SI +Y++KNRNCEDIAMSF+VANVSGAPPVWV+G+IYE
Sbjct: 241 VLSKAAFFHSKYLGIYTHHMPASIRNYVSKNRNCEDIAMSFVVANVSGAPPVWVKGRIYE 300

Query: 301 IGSSGISSLGGHSERRSQCLNWFVEEYGGIMPLLPSTLKAVDSRHIWSW 350
           IGS GISSLGGHSERRS+C+N FVEEYGG+MPL+ ST+KAVDSRH+W W
Sbjct: 301 IGSGGISSLGGHSERRSKCVNTFVEEYGGMMPLVRSTVKAVDSRHLWFW 328

BLAST of ClCG07G004050 vs. ExPASy TrEMBL
Match: A0A6J1H042 (glycosyltransferase family 64 protein C4 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111458813 PE=3 SV=1)

HSP 1 Score: 540.8 bits (1392), Expect = 4.2e-150
Identity = 266/349 (76.22%), Positives = 297/349 (85.10%), Query Frame = 0

Query: 1   MRGSSFRRSVMIQRLRQIAVTIKIKLLLCCCIGLAVVFFAARASDLMGWTCDDCTTPLPY 60
           MRGSS RR VMIQRLRQIAVTIKIK LLCCCI LA V FA RAS+LMGWT DD  T L Y
Sbjct: 1   MRGSSLRRPVMIQRLRQIAVTIKIKHLLCCCIVLAFVLFATRASNLMGWTSDDDATALRY 60

Query: 61  SSPRFEIRTLFSFMNFVLLSYVIFTKRYAILMNTWKRYDLLKKSISHYTKCLGVESIHIV 120
           S+PR                     KRYAILMNTWKR+DLLK+SISHYT C+GVESIHIV
Sbjct: 61  STPR---------------------KRYAILMNTWKRHDLLKQSISHYTTCMGVESIHIV 120

Query: 121 WSEPAPPPVSLVTFLQRTAKANSRHGREAELRFEINEEDSLNNRFKEIKGLKTEAIFSVD 180
           WSEP PPP SLV FLQRT K NSR+ RE +LRFE+N+EDSLNNRFKEIK L T+AIFSVD
Sbjct: 121 WSEPDPPPDSLVAFLQRTVKENSRNDREIKLRFEMNKEDSLNNRFKEIKNLNTDAIFSVD 180

Query: 181 DDVIFACSTLEFAFTVWQSAPQTMVGFVPRMHWIDRSKGEMGRFRYGGWWSVWWTGTYSM 240
           DDVIFACSTLEFAFTVWQSA +TMVGFVPRMHWID+++ EMG++ YGGWWSVWWTGTYSM
Sbjct: 181 DDVIFACSTLEFAFTVWQSASETMVGFVPRMHWIDQAE-EMGKYIYGGWWSVWWTGTYSM 240

Query: 241 VLSKAAFFHSKYLDIYTNNMPSSISDYITKNRNCEDIAMSFLVANVSGAPPVWVQGKIYE 300
           VLSKAAFFHSKYL IYT++MP+SI +Y++KNRNCEDIAMSF+VANVSGAPPVWV+G+IYE
Sbjct: 241 VLSKAAFFHSKYLGIYTHHMPASIRNYVSKNRNCEDIAMSFVVANVSGAPPVWVKGRIYE 300

Query: 301 IGSSGISSLGGHSERRSQCLNWFVEEYGGIMPLLPSTLKAVDSRHIWSW 350
           IGS GISSLGGHSERRS+C+N FVEEYGG+MPL+ ST+KAVDSRH+W W
Sbjct: 301 IGSGGISSLGGHSERRSKCVNTFVEEYGGMMPLVRSTVKAVDSRHLWFW 327

BLAST of ClCG07G004050 vs. ExPASy TrEMBL
Match: A0A6J1DV09 (glycosyltransferase family 64 protein C4 OS=Momordica charantia OX=3673 GN=LOC111023748 PE=3 SV=1)

HSP 1 Score: 499.6 bits (1285), Expect = 1.1e-137
Identity = 260/355 (73.24%), Positives = 288/355 (81.13%), Query Frame = 0

Query: 1   MRGSSFRRSVMIQRLRQIAV--TIKIK-LLLCCCIGLAVVFFAARASDLMGWTCDDCTTP 60
           MRGSS RR    QRLRQ     ++KIK +LL CCIG AVV FAARA +L+ WT       
Sbjct: 1   MRGSSSRRP-ETQRLRQWTALGSMKIKIILLFCCIGFAVVVFAARAPNLVVWTAGRLMVE 60

Query: 61  LPYSSPRFEIRTLFSFMNFVLLSYVIFTKRYAILMNTWKRYDLLKKSISHYTKCLGVESI 120
            P+S PR                     KRYAIL+NTWKRYDLLK+SI HY+ CLGVESI
Sbjct: 61  -PFSLPR---------------------KRYAILINTWKRYDLLKQSIFHYSTCLGVESI 120

Query: 121 HIVWSEPAPPPVSLVTFLQRTAKANS-RHGREA--ELRFEINEEDSLNNRFKEIKGLKTE 180
           HIVWSEP PP  SLVTFL++ AK NS RHGRE   ELRFEINEEDSLNNRFKEIKGLKTE
Sbjct: 121 HIVWSEPEPPSESLVTFLRQRAKENSRRHGRELEDELRFEINEEDSLNNRFKEIKGLKTE 180

Query: 181 AIFSVDDDVIFACSTLEFAFTVWQSAPQTMVGFVPRMHWIDRSKGEMGRFRYGGWWSVWW 240
           AIFSVDDDVIF CST+EFAF+VWQSAP+TMVGFVPRMHW D S+GEMGRFRYGGWWSVWW
Sbjct: 181 AIFSVDDDVIFPCSTVEFAFSVWQSAPETMVGFVPRMHWPDHSEGEMGRFRYGGWWSVWW 240

Query: 241 TGTYSMVLSKAAFFHSKYLDIYTNNMPSSISDYITKNRNCEDIAMSFLVANVSGAPPVWV 300
           +GTYSMVLSKAAFFHSKYL IY+NNMP+SI DY++KNRNCEDIAMSFLVANVSG PP+WV
Sbjct: 241 SGTYSMVLSKAAFFHSKYLAIYSNNMPASIRDYVSKNRNCEDIAMSFLVANVSGTPPIWV 300

Query: 301 QGKIYEIGSSGISSLGGHSERRSQCLNWFVEEYGGIMPLLPSTLKAVDSRHIWSW 350
           +GKI+EIGSSGISSLGGH+ERRS+C+N FVEEYGG MPLLPST+KAVDSR+ W W
Sbjct: 301 KGKIFEIGSSGISSLGGHNERRSECVNRFVEEYGG-MPLLPSTVKAVDSRYTWFW 331

BLAST of ClCG07G004050 vs. TAIR 10
Match: AT3G55830.1 (Nucleotide-diphospho-sugar transferases superfamily protein )

HSP 1 Score: 395.2 bits (1014), Expect = 5.5e-110
Identity = 189/337 (56.08%), Positives = 236/337 (70.03%), Query Frame = 0

Query: 13  QRLRQIAVTIKIKLLLCCCIGLAVVFFAARASDLMGWTCDDCTTPLPYSSPRFEIRTLFS 72
           Q+LR+       K LL CCI   +V    R+S    W           S  R        
Sbjct: 22  QKLRKFVTARSTKFLLFCCIAFVLVTIVCRSS--RPWVNSSIAVADRISGSR-------- 81

Query: 73  FMNFVLLSYVIFTKRYAILMNTWKRYDLLKKSISHYTKCLGVESIHIVWSEPAPPPVSLV 132
                        K Y +LMNTWKRYDLLKKS+SHY  C  ++SIHIVWSEP PP  SL 
Sbjct: 82  -------------KGYTLLMNTWKRYDLLKKSVSHYASCSRLDSIHIVWSEPNPPSESLK 141

Query: 133 TFLQRTAKANSRHGREAELRFEINEEDSLNNRFKEIKGLKTEAIFSVDDDVIFACSTLEF 192
            +L    K  +R G E ELRF+IN+EDSLNNRFKEIK LKT+A+FS+DDD+IF C T++F
Sbjct: 142 EYLHNVLKKKTRDGHEVELRFDINKEDSLNNRFKEIKDLKTDAVFSIDDDIIFPCHTVDF 201

Query: 193 AFTVWQSAPQTMVGFVPRMHWIDRSKGEMGRFRYGGWWSVWWTGTYSMVLSKAAFFHSKY 252
           AF VW+SAP TMVGFVPR+HW ++S  +   + Y GWWSVWW+GTYSMVLSKAAFFH KY
Sbjct: 202 AFNVWESAPDTMVGFVPRVHWPEKSNDKANYYTYSGWWSVWWSGTYSMVLSKAAFFHKKY 261

Query: 253 LDIYTNNMPSSISDYITKNRNCEDIAMSFLVANVSGAPPVWVQGKIYEIGSSGISSLGGH 312
           L +YTN+MP+SI ++ TKNRNCEDIAMSFL+AN + AP +WV+GKIYEIGS+GISS+GGH
Sbjct: 262 LSLYTNSMPASIREFTTKNRNCEDIAMSFLIANATNAPAIWVKGKIYEIGSTGISSIGGH 321

Query: 313 SERRSQCLNWFVEEYGGIMPLLPSTLKAVDSRHIWSW 350
           +E+R+ C+N FV E+G  MPL+ +++KAVDSR++W W
Sbjct: 322 TEKRTHCVNRFVAEFGK-MPLVYTSMKAVDSRNLWFW 334

BLAST of ClCG07G004050 vs. TAIR 10
Match: AT5G04500.1 (glycosyltransferase family protein 47 )

HSP 1 Score: 109.0 bits (271), Expect = 7.8e-24
Identity = 68/242 (28.10%), Positives = 119/242 (49.17%), Query Frame = 0

Query: 101 LKKSISHYTKCLGVESIHIVWSEPAPPPVSLVTFLQRTAKANSRHGREAELRFEINEEDS 160
           LK  +  Y++C  V+ I ++W++  PP +S +                  +R  + +++S
Sbjct: 532 LKMYVKRYSRCPSVKEIVVIWNKGPPPDLSEL-------------DSAVPVRIRVQKQNS 591

Query: 161 LNNRFKEIKGLKTEAIFSVDDDVIFACSTLEFAFTVWQSAPQTMVGFVPRM--HWIDRSK 220
           LNNRF+    +KT A+  +DDD++  C  +E  F VW+  P+ +VGF PR     +  S 
Sbjct: 592 LNNRFEIDPLIKTRAVLELDDDIMMPCDDIEKGFRVWREHPERLVGFYPRFVDQTMTYSA 651

Query: 221 GEMGRFRYGGWWSVWWTGTYSMVLSKAAFFHSKY-LDIYTNNMPSSISDYITKNRNCEDI 280
            +  R   G          Y+M+L+ AAF   ++  D+Y ++       ++ +  NCEDI
Sbjct: 652 EKFARSHKG----------YNMILTGAAFMDVRFAFDMYQSDKAKLGRVFVDEQFNCEDI 711

Query: 281 AMSFLVANVSGAPPV--WVQGKIYEIGSSGISSL------GGHSERRSQCLNWFVEEYGG 332
            ++FL AN SG+     +V+  +  I +S  S +        H  +RS+CL  F + YG 
Sbjct: 712 LLNFLYANASGSGKAVEYVRPSLVTIDTSKFSGVAISGNTNQHYRKRSKCLRRFSDLYGS 750

BLAST of ClCG07G004050 vs. TAIR 10
Match: AT1G80290.1 (Nucleotide-diphospho-sugar transferases superfamily protein )

HSP 1 Score: 89.4 bits (220), Expect = 6.4e-18
Identity = 79/275 (28.73%), Positives = 118/275 (42.91%), Query Frame = 0

Query: 90  ILMNTWKRY--DLLKKSISHYTKCLGVESIHIVWSEPAPPPVSLVTFLQRTAKANSRHGR 149
           +L+N +  Y   LL+  ++ Y+    V SI ++W  P+ P   L    Q   + +     
Sbjct: 50  VLINGYSEYRIPLLQTIVASYSSSSIVSSILVLWGNPSTPDQLLDQLYQNLTQYSP---G 109

Query: 150 EAELRFEINEEDSLNNRFKEIKGLKTEAIFSVDDDVIFACSTLEFAFTVWQSAPQTMVGF 209
            A +        SLN RF     + T A+   DDDV     +LEFAF+VW+S P  +VG 
Sbjct: 110 SASISLIQQSSSSLNARFLPRSSVDTRAVLICDDDVEIDQRSLEFAFSVWKSNPDRLVGT 169

Query: 210 VPRMHWIDRSKGEMGRFRYGGWWSVWWTGTYSMVLSKAAFFHSKYLDIYTNNMPSSISD- 269
             R H  D    E        W        YS+VL+K       YL  Y+      + + 
Sbjct: 170 FVRSHGFDLQGKE--------WIYTVHPDKYSIVLTKFMMMKQDYLFEYSCKGGVEMEEM 229

Query: 270 --YITKNRNCEDIAMSFLVANVSGAPPV---------W-------VQGKIYEIGSSGISS 329
              + + RNCEDI M+F+ A+   A P+         W       V+ ++ ++G S  S 
Sbjct: 230 RMIVDQMRNCEDILMNFVAADRLRAGPIMVGAERVRDWGDARNEEVEERVRDVGLS--SR 289

Query: 330 LGGHSERRSQCLNWFVEEYGGIMPLLPSTLKAVDS 344
              H +RR  C+  F     G MPL+ S  K V+S
Sbjct: 290 RVEHRKRRGNCIREF-HRVMGKMPLMYSYGKVVNS 310

BLAST of ClCG07G004050 vs. TAIR 10
Match: AT1G80290.2 (Nucleotide-diphospho-sugar transferases superfamily protein)

HSP 1 Score: 89.4 bits (220), Expect = 6.4e-18
Identity = 79/275 (28.73%), Positives = 118/275 (42.91%), Query Frame = 0

Query: 90  ILMNTWKRY--DLLKKSISHYTKCLGVESIHIVWSEPAPPPVSLVTFLQRTAKANSRHGR 149
           +L+N +  Y   LL+  ++ Y+    V SI ++W  P+ P   L    Q   + +     
Sbjct: 58  VLINGYSEYRIPLLQTIVASYSSSSIVSSILVLWGNPSTPDQLLDQLYQNLTQYSP---G 117

Query: 150 EAELRFEINEEDSLNNRFKEIKGLKTEAIFSVDDDVIFACSTLEFAFTVWQSAPQTMVGF 209
            A +        SLN RF     + T A+   DDDV     +LEFAF+VW+S P  +VG 
Sbjct: 118 SASISLIQQSSSSLNARFLPRSSVDTRAVLICDDDVEIDQRSLEFAFSVWKSNPDRLVGT 177

Query: 210 VPRMHWIDRSKGEMGRFRYGGWWSVWWTGTYSMVLSKAAFFHSKYLDIYTNNMPSSISD- 269
             R H  D    E        W        YS+VL+K       YL  Y+      + + 
Sbjct: 178 FVRSHGFDLQGKE--------WIYTVHPDKYSIVLTKFMMMKQDYLFEYSCKGGVEMEEM 237

Query: 270 --YITKNRNCEDIAMSFLVANVSGAPPV---------W-------VQGKIYEIGSSGISS 329
              + + RNCEDI M+F+ A+   A P+         W       V+ ++ ++G S  S 
Sbjct: 238 RMIVDQMRNCEDILMNFVAADRLRAGPIMVGAERVRDWGDARNEEVEERVRDVGLS--SR 297

Query: 330 LGGHSERRSQCLNWFVEEYGGIMPLLPSTLKAVDS 344
              H +RR  C+  F     G MPL+ S  K V+S
Sbjct: 298 RVEHRKRRGNCIREF-HRVMGKMPLMYSYGKVVNS 318
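
The identity and positives percentages in each HSP header are the match counts divided by the alignment length. As a minimal sketch, the figures can be re-derived from a header line. The helper name `parse_hsp_stats` and the regular expression are illustrative assumptions and are not part of any BLAST tool.

```python
import re

# Hypothetical pattern for the HSP summary lines shown in this record.
# "Posi?tives" tolerates a misspelled label; the frame field is ignored.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return (identity %, positives %, alignment length) recomputed from counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, length, positives, pos_length = map(int, match.groups())
    if length != pos_length:
        raise ValueError("identity and positives use different lengths")
    return (
        round(100 * identical / length, 2),
        round(100 * positives / length, 2),
        length,
    )


if __name__ == "__main__":
    header = "Identity = 68/242 (28.10%), Positives = 119/242 (49.17%), Query Frame = 0"
    print(parse_hsp_stats(header))
```

For the AT5G04500.1 hit this recomputes 28.1 and 49.17 over 242 aligned columns, which agrees with the percentages printed in the header.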

The following BLAST results are available for this feature:
Match Name      E-value    Identity (%)  Description
XP_038893187.1  6.4e-169   85.96         glycosyltransferase family 64 protein C4 [Benincasa hispida]
XP_004149253.1  1.1e-160   80.57         glycosyltransferase family 64 protein C4 [Cucumis sativus] >KAE8648017.1 hypothe...
XP_008463246.1  1.6e-156   79.43         PREDICTED: glycosyltransferase family 64 protein C4 [Cucumis melo]
KAG6600937.1    6.4e-153   77.08         Glycosyltransferase family 64 protein C4, partial [Cucurbita argyrosperma subsp....
XP_022987145.1  1.9e-152   76.50         glycosyltransferase family 64 protein C4 [Cucurbita maxima]

Match Name      E-value    Identity (%)  Description
Q9LY62          7.7e-109   56.08         Glycosylinositol phosphorylceramide mannosyl transferase 1 OS=Arabidopsis thalia...
Q5IGR7          1.1e-27    37.37         Exostosin-1b OS=Danio rerio OX=7955 GN=ext1b PE=2 SV=1
A5D7I4          9.6e-27    35.79         Exostosin-1 OS=Bos taurus OX=9913 GN=EXT1 PE=2 SV=1
Q9JK82          9.6e-27    35.79         Exostosin-1 OS=Cricetulus griseus OX=10029 GN=EXT1 PE=1 SV=1
Q5IGR8          2.1e-26    35.79         Exostosin-1a OS=Danio rerio OX=7955 GN=ext1a PE=2 SV=1

Match Name      E-value    Identity (%)  Description
A0A1S3CIU1      7.9e-157   79.43         glycosyltransferase family 64 protein C4 OS=Cucumis melo OX=3656 GN=LOC103501448...
A0A6J1JG04      9.0e-153   76.50         glycosyltransferase family 64 protein C4 OS=Cucurbita maxima OX=3661 GN=LOC11148...
A0A6J1GZ09      2.9e-151   76.22         glycosyltransferase family 64 protein C4 isoform X1 OS=Cucurbita moschata OX=366...
A0A6J1H042      4.2e-150   76.22         glycosyltransferase family 64 protein C4 isoform X2 OS=Cucurbita moschata OX=366...
A0A6J1DV09      1.1e-137   73.24         glycosyltransferase family 64 protein C4 OS=Momordica charantia OX=3673 GN=LOC11...

Match Name      E-value    Identity (%)  Description
AT3G55830.1     5.5e-110   56.08         Nucleotide-diphospho-sugar transferases superfamily protein
AT5G04500.1     7.8e-24    28.10         glycosyltransferase family protein 47
AT1G80290.1     6.4e-18    28.73         Nucleotide-diphospho-sugar transferases superfamily protein
AT1G80290.2     6.4e-18    28.73         Nucleotide-diphospho-sugar transferases superfamily protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term   IPR Description                          Source       Source Term      Source Description                                            Alignment
IPR015338  Glycosyl transferase 64 domain           PFAM         PF09258          Glyco_transf_64                                               coord: 88..340; e-value: 1.1E-73; score: 247.6
IPR029044  Nucleotide-diphospho-sugar transferases  GENE3D       3.90.550.10      Spore Coat Polysaccharide Biosynthesis Protein SpsA; Chain A  coord: 84..345; e-value: 6.3E-83; score: 279.7
IPR029044  Nucleotide-diphospho-sugar transferases  SUPERFAMILY  53448            Nucleotide-diphospho-sugar transferases                       coord: 86..339
None       No IPR available                         PANTHER      PTHR11062:SF286  GLYCOSYLTRANSFERASE FAMILY 64 PROTEIN C4                      coord: 10..346
IPR004263  Exostosin-like                           PANTHER      PTHR11062        EXOSTOSIN HEPARAN SULFATE GLYCOSYLTRANSFERASE -RELATED        coord: 10..346

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG07G004050.2  ClCG07G004050.2  mRNA

GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0010383 cell wall polysaccharide metabolic process
biological_process GO:0009664 plant-type cell wall organization
biological_process GO:0006486 protein glycosylation
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0016757 glycosyltransferase activity
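
For downstream use, the GO assignments above can be grouped by ontology category. The following is a minimal sketch, assuming each row is whitespace-separated as category, accession, then term name; the function name `group_go_terms` is hypothetical.

```python
from collections import defaultdict

# GO rows copied from this record; the header row is omitted.
GO_ROWS = """\
biological_process GO:0010383 cell wall polysaccharide metabolic process
biological_process GO:0009664 plant-type cell wall organization
biological_process GO:0006486 protein glycosylation
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0016757 glycosyltransferase activity
"""


def group_go_terms(text):
    """Map each GO category to a list of (accession, term name) pairs."""
    groups = defaultdict(list)
    for line in text.strip().splitlines():
        # Split on the first two runs of whitespace only, so that
        # multi-word term names stay intact.
        category, accession, name = line.split(None, 2)
        groups[category].append((accession, name))
    return dict(groups)


if __name__ == "__main__":
    for category, terms in group_go_terms(GO_ROWS).items():
        print(category, [accession for accession, _ in terms])
```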