CsaV3_4G028900 (gene) Cucumber (Chinese Long) v3

Name: CsaV3_4G028900
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v3)
Description: LOW QUALITY PROTEIN: EGF domain-specific O-linked N-acetylglucosamine transferase-like
Location: chr4 : 18380937 .. 18384089 (-)
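The location above gives a start and an end coordinate on chr4. As a minimal sketch, the span of the feature can be derived from those two numbers, assuming the coordinates are 1-based and inclusive (a common genome-annotation convention, not stated on this page). The function name `feature_length` is hypothetical.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    # Both endpoints are counted, hence the +1.
    return end - start + 1


# Coordinates taken from the Location field of this record.
print(feature_length(18380937, 18384089))  # 3153
```

The (-) strand marker does not change the span; it only indicates that the gene is read from the reverse strand.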
The following sequences are available for this feature:

Gene sequence (with intron)

AATAAATAAAAAAAAAGATACATATAGATTAAGAAGATGAAGTTCCCAAATTTCTTGTTCAAAAAGCCCTAAATATAAATATAAATATATATATATATATATAAAAGAAAAGGTTCTGTAGTTCCCTAATCTTCCTGAACAGGCAATATACAGCCATATTTGACGCCTCCATTTTTGCCTTTTTCTTCCTTACTCATTCCCTGCAGCAGCAAACCCCAGAAATCACCTCTCTTTGTTTCTCTCTCCTTCCTATATAAACACAGACAAACCCCATTTCAATTTCTGTACTCTTTTTTCTTTCTCTCTCCTTTGTTTTTTCTTTTAAAAATAAAGCTGCCGAATTCCAAGAAAAAAAAAAAGAAAAAGGCTTTTTCTTACAGCTTTGGTTTTGGAATATGATTCATCAATACATGCGTTATCAGCCATGGAAAAAAGGGGTGAGTTCGGGAAAAACTCATTTTCATCGTGTAGAAGATGAAGAAGGAGGAGAAGTGGGTATGATGGGATGTGAAGAGTTTTATTACTCTGCCTCTGCTTACAAGAAAGCTAATAAACCCAAGTTTTTGTTTCTTCTTTTTCTTTCTTTTCTCTCTTGTTCCATCATTTTTGCTCCCCATTTCTTCTCTTCTTCTTTCTCTCCTTTCTGTAAGTCCTTTCTTCTCTCTCTCTCTCTCTCTTTTCGGCTTTTCTTTTGCTTTTCTCTCTTTTTTTAATGTTTTTGTTTTTGTTTTTTTTTTGTTTTTCAGATTCTTTTGGTGTTCAAAATGATGATCTCTCTGTTGATAAGGAGGTTTTCGCTCCTCTCTGTTCTTCTATCCCTAATGGTATTTTCCCTTCTCTCTCTTTTTCTATGCTTTAACTTTAATTACATGATTCTTTGTCTCTGTCTCTCTCTGTTTATTGAAATTTTCTTCATGCTTATTAAATCTATCTCTTACCTTCTATACTTCATCAAAAAAATTTCTCAAATGAGAATCACTTTCCCCCCCTTTTCTCTCTGTCTTTCGAAGTATTCCATGGTGTTCTTCTTTCCCCCTGCTTTTTTTTTTTCTTTAAAAAATAATTTTCCCTAATGGAACCAGCTCTCTTTTTTACGCAAAGAAAATGAAAGAAACAAAGCAAAAAAAAAAAAAAAAAAAAAAAAAAGAATAGTAAGAAGAAGAAATCAGCTGTTTCCAAATTTTCTCTACAAATGGAAATTAGTTTCTGTGTTTTCAACTTGTCATTCATCCCAAGTTTCGTGTTGGGTTTAATGTAAACATATATATGTTTTCAGATATTGGGGTAAAAGGGGAAATTTCCCACTAGAATCTGATCTGTTTCAACTTATCATTTATTTGAATAAAAAATGTTCAACTTTAGTTTTCACAGCCTAATGAAACCAAGGGAGAAAATAACAAAATATTTCATTAAAATATAGAAATCCCCTGTTTTACTTCATAATCTTTCCAATGGGATTTTCTCTTTTTCTTTTTTCTTCTCAAAATTGTTTTTGAGTTCTAACCAAATCTCTTTAAGTTTTTTAATTGATTCATGTGAATTTGTTTGTAAATTCTTATGGGATTGTTTCTGTTTTCTTTGTTAGGAACCATATGTTGCGACAGAAGTAGCATTCGCTCTGATATCTGCATTATGAAAGGAGATATAAGAACGGATTCTTCTTCCTCCTCAATCTTCCTCTACACCTCTCCTGATTCCCCGATTGAGTTTGACGATGATCACGGAGTGATCCAAGTCGAAAAAATTAAACCATACACTAGAAAATGGGAGAAAAACACCATGGATACAATCGATGAATTGGAGCTTATTGTGAAACGAAAGAGCAATGATATTGATCAAAAGCACCGGTGCGATGTAAGACATAACGTCCCCGCCGTGTTCTTCTCGACGGGAGGCTATACGGGCAATGTTTATCACGAATTCAACGACGGGATTTTGCCACTTTATATAACTTCTCATAGTATGAATAAGGAGGTTGTTTTTGTGATCCTTGAGTATCA
CAAGTGGTGGCTAACAAAATATGCTGATATTCTTTCCCAACTCTCCAATTACCCTGTAATTGACCTGAGAAAAAACAATAAAACTCATTGCTTCCCACAAGTTATTGCTGGTTTGAGAATCCACGATGAGTTAACTGTGGACCCTTCATTGATGGAAGGGGGGAAGAGCATAGTTGATTTTCGGAACCTCTTGGACAAGGCGTACCAACCTCGAATCCGTGAGTTGATTCGACAAGAGGAGTTAGAAGCGAAGATTTCTTTGCATAGATCAAAGCGACCGAAATTGGTGGTTTTGTCACGGAAGGGGTCGTCAAGAGTGATAACGAATGAGAAGTTGATGGTGAAGATGGCGGAGAGAATGGGGTTCGAGGTGAAAGTTTTGAGGCCAGATAAAACCACAGAGTTGGCCAAGATTTATAGAGAGGTGAATGAAAGTAATGTATTGGTAGGAGTTCATGGAGCAGCAATGACACACTCTCTATTCATGAGGCCTAATGCGGTTTTCATCCAAATAATTCCACTGGGGACTGTTTGGGCAGCGGAAACATACTACGGGGAGCCTGCGAAGAAGCTGGGTTTGAAGTACATTGGGTACGAAATTGGGGCGAAGGAGAGTTCGTTATATAGTAATCACAACAAAGATGATCCTGTTCTAGTGAATCCGGATAGCATTACGAAAAAAGGGTGGGAATACACGAAGAAAATATATCTGGATGGCCAAAATGTGAGATTGAATCTTGGACGGTTTGAGAAGCGATTAGAGCGTGCTTATTACTATTGCATCGCCCGAGCTCGGGATGGTCGATCCCACTGATTTTGTTCTTAATTATGTGGATAAACATCCCATTATTCTTCTCTCTTTATTTTGCACATAAAAATCACCAACTTCACTTCCCCACTCAATAATTTACAAAGGAGATCAAAACACTGTAAATTCTTCCTTCCTTCTTCCTTTTAGGTTAGGATTGTTTTTTTATTTTTTATTTTCATTCTTGGGTATACTTATGTGAGATTCAATCCTTATCTTTAATGGAATAAAACTACTATTTTTAAATATTTTCATTTTGTCCTTTTTATTTTTAAAATTCTTTGTAATTTTTTATATTTTTCACATTTTTAATGCCTACAAAATAACTTTTGGTTCTTCAACTTC

mRNA sequence

ATGATTCATCAATACATGCGTTATCAGCCATGGAAAAAAGGGGTGAGTTCGGGAAAAACTCATTTTCATCGTGTAGAAGATGAAGAAGGAGGAGAAGTGGGTATGATGGGATGTGAAGAGTTTTATTACTCTGCCTCTGCTTACAAGAAAGCTAATAAACCCAAGTTTTTGTTTCTTCTTTTTCTTTCTTTTCTCTCTTGTTCCATCATTTTTGCTCCCCATTTCTTCTCTTCTTCTTTCTCTCCTTTCTATTCTTTTGGTGTTCAAAATGATGATCTCTCTGTTGATAAGGAGGTTTTCGCTCCTCTCTGTTCTTCTATCCCTAATGGAACCATATGTTGCGACAGAAGTAGCATTCGCTCTGATATCTGCATTATGAAAGGAGATATAAGAACGGATTCTTCTTCCTCCTCAATCTTCCTCTACACCTCTCCTGATTCCCCGATTGAGTTTGACGATGATCACGGAGTGATCCAAGTCGAAAAAATTAAACCATACACTAGAAAATGGGAGAAAAACACCATGGATACAATCGATGAATTGGAGCTTATTGTGAAACGAAAGAGCAATGATATTGATCAAAAGCACCGGTGCGATGTAAGACATAACGTCCCCGCCGTGTTCTTCTCGACGGGAGGCTATACGGGCAATGTTTATCACGAATTCAACGACGGGATTTTGCCACTTTATATAACTTCTCATAGTATGAATAAGGAGGTTGTTTTTGTGATCCTTGAGTATCACAAGTGGTGGCTAACAAAATATGCTGATATTCTTTCCCAACTCTCCAATTACCCTGTAATTGACCTGAGAAAAAACAATAAAACTCATTGCTTCCCACAAGTTATTGCTGGTTTGAGAATCCACGATGAGTTAACTGTGGACCCTTCATTGATGGAAGGGGGGAAGAGCATAGTTGATTTTCGGAACCTCTTGGACAAGGCGTACCAACCTCGAATCCGTGAGTTGATTCGACAAGAGGAGTTAGAAGCGAAGATTTCTTTGCATAGATCAAAGCGACCGAAATTGGTGGTTTTGTCACGGAAGGGGTCGTCAAGAGTGATAACGAATGAGAAGTTGATGGTGAAGATGGCGGAGAGAATGGGGTTCGAGGTGAAAGTTTTGAGGCCAGATAAAACCACAGAGTTGGCCAAGATTTATAGAGAGGTGAATGAAAGTAATGTATTGGTAGGAGTTCATGGAGCAGCAATGACACACTCTCTATTCATGAGGCCTAATGCGGTTTTCATCCAAATAATTCCACTGGGGACTGTTTGGGCAGCGGAAACATACTACGGGGAGCCTGCGAAGAAGCTGGGTTTGAAGTACATTGGGTACGAAATTGGGGCGAAGGAGAGTTCGTTATATAGTAATCACAACAAAGATGATCCTGTTCTAGTGAATCCGGATAGCATTACGAAAAAAGGGTGGGAATACACGAAGAAAATATATCTGGATGGCCAAAATGTGAGATTGAATCTTGGACGGTTTGAGAAGCGATTAGAGCGTGCTTATTACTATTGCATCGCCCGAGCTCGGGATGGTCGATCCCACTGA

Coding sequence (CDS)

ATGATTCATCAATACATGCGTTATCAGCCATGGAAAAAAGGGGTGAGTTCGGGAAAAACTCATTTTCATCGTGTAGAAGATGAAGAAGGAGGAGAAGTGGGTATGATGGGATGTGAAGAGTTTTATTACTCTGCCTCTGCTTACAAGAAAGCTAATAAACCCAAGTTTTTGTTTCTTCTTTTTCTTTCTTTTCTCTCTTGTTCCATCATTTTTGCTCCCCATTTCTTCTCTTCTTCTTTCTCTCCTTTCTATTCTTTTGGTGTTCAAAATGATGATCTCTCTGTTGATAAGGAGGTTTTCGCTCCTCTCTGTTCTTCTATCCCTAATGGAACCATATGTTGCGACAGAAGTAGCATTCGCTCTGATATCTGCATTATGAAAGGAGATATAAGAACGGATTCTTCTTCCTCCTCAATCTTCCTCTACACCTCTCCTGATTCCCCGATTGAGTTTGACGATGATCACGGAGTGATCCAAGTCGAAAAAATTAAACCATACACTAGAAAATGGGAGAAAAACACCATGGATACAATCGATGAATTGGAGCTTATTGTGAAACGAAAGAGCAATGATATTGATCAAAAGCACCGGTGCGATGTAAGACATAACGTCCCCGCCGTGTTCTTCTCGACGGGAGGCTATACGGGCAATGTTTATCACGAATTCAACGACGGGATTTTGCCACTTTATATAACTTCTCATAGTATGAATAAGGAGGTTGTTTTTGTGATCCTTGAGTATCACAAGTGGTGGCTAACAAAATATGCTGATATTCTTTCCCAACTCTCCAATTACCCTGTAATTGACCTGAGAAAAAACAATAAAACTCATTGCTTCCCACAAGTTATTGCTGGTTTGAGAATCCACGATGAGTTAACTGTGGACCCTTCATTGATGGAAGGGGGGAAGAGCATAGTTGATTTTCGGAACCTCTTGGACAAGGCGTACCAACCTCGAATCCGTGAGTTGATTCGACAAGAGGAGTTAGAAGCGAAGATTTCTTTGCATAGATCAAAGCGACCGAAATTGGTGGTTTTGTCACGGAAGGGGTCGTCAAGAGTGATAACGAATGAGAAGTTGATGGTGAAGATGGCGGAGAGAATGGGGTTCGAGGTGAAAGTTTTGAGGCCAGATAAAACCACAGAGTTGGCCAAGATTTATAGAGAGGTGAATGAAAGTAATGTATTGGTAGGAGTTCATGGAGCAGCAATGACACACTCTCTATTCATGAGGCCTAATGCGGTTTTCATCCAAATAATTCCACTGGGGACTGTTTGGGCAGCGGAAACATACTACGGGGAGCCTGCGAAGAAGCTGGGTTTGAAGTACATTGGGTACGAAATTGGGGCGAAGGAGAGTTCGTTATATAGTAATCACAACAAAGATGATCCTGTTCTAGTGAATCCGGATAGCATTACGAAAAAAGGGTGGGAATACACGAAGAAAATATATCTGGATGGCCAAAATGTGAGATTGAATCTTGGACGGTTTGAGAAGCGATTAGAGCGTGCTTATTACTATTGCATCGCCCGAGCTCGGGATGGTCGATCCCACTGA

Protein sequence

MIHQYMRYQPWKKGVSSGKTHFHRVEDEEGGEVGMMGCEEFYYSASAYKKANKPKFLFLLFLSFLSCSIIFAPHFFSSSFSPFYSFGVQNDDLSVDKEVFAPLCSSIPNGTICCDRSSIRSDICIMKGDIRTDSSSSSIFLYTSPDSPIEFDDDHGVIQVEKIKPYTRKWEKNTMDTIDELELIVKRKSNDIDQKHRCDVRHNVPAVFFSTGGYTGNVYHEFNDGILPLYITSHSMNKEVVFVILEYHKWWLTKYADILSQLSNYPVIDLRKNNKTHCFPQVIAGLRIHDELTVDPSLMEGGKSIVDFRNLLDKAYQPRIRELIRQEELEAKISLHRSKRPKLVVLSRKGSSRVITNEKLMVKMAERMGFEVKVLRPDKTTELAKIYREVNESNVLVGVHGAAMTHSLFMRPNAVFIQIIPLGTVWAAETYYGEPAKKLGLKYIGYEIGAKESSLYSNHNKDDPVLVNPDSITKKGWEYTKKIYLDGQNVRLNLGRFEKRLERAYYYCIARARDGRSH
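The protein sequence above corresponds to a translation of the CDS. The sketch below shows how such a translation could be reproduced with the standard genetic code; it is an illustration, not the pipeline used by the database. The names `CODON_TABLE` and `translate` are hypothetical, and the choice of the standard code (NCBI translation table 1) is an assumption.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 bases of the CDS listed in this record.
print(translate("ATGATTCATCAATACATGCGTTATCAGCCATGG"))  # MIHQYMRYQPW
```

The printed residues agree with the start of the protein sequence, and the last four codons of the CDS (`CGATCCCACTGA`) translate to `RSH` followed by a stop, which agrees with its end.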
BLAST of CsaV3_4G028900 vs. NCBI nr
Match: XP_004151727.2 (PREDICTED: EGF domain-specific O-linked N-acetylglucosamine transferase-like [Cucumis sativus] >KGN54637.1 hypothetical protein Csa_4G411390 [Cucumis sativus])

HSP 1 Score: 1050.8 bits (2716), Expect = 1.5e-303
Identity = 518/518 (100.00%), Positives = 518/518 (100.00%), Query Frame = 0

Query: 1   MIHQYMRYQPWKKGVSSGKTHFHRVEDEEGGEVGMMGCEEFYYSASAYKKANKPKFLFLL 60
           MIHQYMRYQPWKKGVSSGKTHFHRVEDEEGGEVGMMGCEEFYYSASAYKKANKPKFLFLL
Sbjct: 1   MIHQYMRYQPWKKGVSSGKTHFHRVEDEEGGEVGMMGCEEFYYSASAYKKANKPKFLFLL 60

Query: 61  FLSFLSCSIIFAPHFFSSSFSPFYSFGVQNDDLSVDKEVFAPLCSSIPNGTICCDRSSIR 120
           FLSFLSCSIIFAPHFFSSSFSPFYSFGVQNDDLSVDKEVFAPLCSSIPNGTICCDRSSIR
Sbjct: 61  FLSFLSCSIIFAPHFFSSSFSPFYSFGVQNDDLSVDKEVFAPLCSSIPNGTICCDRSSIR 120

Query: 121 SDICIMKGDIRTDSSSSSIFLYTSPDSPIEFDDDHGVIQVEKIKPYTRKWEKNTMDTIDE 180
           SDICIMKGDIRTDSSSSSIFLYTSPDSPIEFDDDHGVIQVEKIKPYTRKWEKNTMDTIDE
Sbjct: 121 SDICIMKGDIRTDSSSSSIFLYTSPDSPIEFDDDHGVIQVEKIKPYTRKWEKNTMDTIDE 180

Query: 181 LELIVKRKSNDIDQKHRCDVRHNVPAVFFSTGGYTGNVYHEFNDGILPLYITSHSMNKEV 240
           LELIVKRKSNDIDQKHRCDVRHNVPAVFFSTGGYTGNVYHEFNDGILPLYITSHSMNKEV
Sbjct: 181 LELIVKRKSNDIDQKHRCDVRHNVPAVFFSTGGYTGNVYHEFNDGILPLYITSHSMNKEV 240

Query: 241 VFVILEYHKWWLTKYADILSQLSNYPVIDLRKNNKTHCFPQVIAGLRIHDELTVDPSLME 300
           VFVILEYHKWWLTKYADILSQLSNYPVIDLRKNNKTHCFPQVIAGLRIHDELTVDPSLME
Sbjct: 241 VFVILEYHKWWLTKYADILSQLSNYPVIDLRKNNKTHCFPQVIAGLRIHDELTVDPSLME 300

Query: 301 GGKSIVDFRNLLDKAYQPRIRELIRQEELEAKISLHRSKRPKLVVLSRKGSSRVITNEKL 360
           GGKSIVDFRNLLDKAYQPRIRELIRQEELEAKISLHRSKRPKLVVLSRKGSSRVITNEKL
Sbjct: 301 GGKSIVDFRNLLDKAYQPRIRELIRQEELEAKISLHRSKRPKLVVLSRKGSSRVITNEKL 360

Query: 361 MVKMAERMGFEVKVLRPDKTTELAKIYREVNESNVLVGVHGAAMTHSLFMRPNAVFIQII 420
           MVKMAERMGFEVKVLRPDKTTELAKIYREVNESNVLVGVHGAAMTHSLFMRPNAVFIQII
Sbjct: 361 MVKMAERMGFEVKVLRPDKTTELAKIYREVNESNVLVGVHGAAMTHSLFMRPNAVFIQII 420

Query: 421 PLGTVWAAETYYGEPAKKLGLKYIGYEIGAKESSLYSNHNKDDPVLVNPDSITKKGWEYT 480
           PLGTVWAAETYYGEPAKKLGLKYIGYEIGAKESSLYSNHNKDDPVLVNPDSITKKGWEYT
Sbjct: 421 PLGTVWAAETYYGEPAKKLGLKYIGYEIGAKESSLYSNHNKDDPVLVNPDSITKKGWEYT 480

Query: 481 KKIYLDGQNVRLNLGRFEKRLERAYYYCIARARDGRSH 518
           KKIYLDGQNVRLNLGRFEKRLERAYYYCIARARDGRSH
Sbjct: 481 KKIYLDGQNVRLNLGRFEKRLERAYYYCIARARDGRSH 518

BLAST of CsaV3_4G028900 vs. NCBI nr
Match: XP_008455788.2 (PREDICTED: LOW QUALITY PROTEIN: EGF domain-specific O-linked N-acetylglucosamine transferase-like [Cucumis melo])

HSP 1 Score: 987.3 bits (2551), Expect = 2.0e-284
Identity = 483/518 (93.24%), Positives = 500/518 (96.53%), Query Frame = 0

Query: 1   MIHQYMRYQPWKKGVSSGKTHFHRVEDEEGGEVGMMGCEEFYYSASAYKKANKPKFLFLL 60
           M+HQYMRYQPW+KG++ GKTHFH  EDEE GEVGMMGC+EFYYSASAYKKANKPKFLFLL
Sbjct: 1   MVHQYMRYQPWRKGMNLGKTHFHHEEDEEEGEVGMMGCQEFYYSASAYKKANKPKFLFLL 60

Query: 61  FLSFLSCSIIFAPHFFSSSFSPFYSFGVQNDDLSVDKEVFAPLCSSIPNGTICCDRSSIR 120
           FLSFLSCSIIFAPHFFSSSFSPFYSFGVQNDDLSVDKEVF  LCSSIPNGTICCDR+SIR
Sbjct: 61  FLSFLSCSIIFAPHFFSSSFSPFYSFGVQNDDLSVDKEVFCSLCSSIPNGTICCDRNSIR 120

Query: 121 SDICIMKGDIRTDSSSSSIFLYTSPDSPIEFDDDHGVIQVEKIKPYTRKWEKNTMDTIDE 180
           SDICIMKGDIRTDSSSSSIFLYTSPDSPIEF DD GV+QVEKIKPYTRKWEKNTMDTIDE
Sbjct: 121 SDICIMKGDIRTDSSSSSIFLYTSPDSPIEFGDDQGVLQVEKIKPYTRKWEKNTMDTIDE 180

Query: 181 LELIVKRKSNDIDQKHRCDVRHNVPAVFFSTGGYTGNVYHEFNDGILPLYITSHSMNKEV 240
           LELIVKRKSNDIDQ+HRCDVRHNVPAVFFSTGGYTGNVYHEFNDGILPLYITSH+MNKEV
Sbjct: 181 LELIVKRKSNDIDQQHRCDVRHNVPAVFFSTGGYTGNVYHEFNDGILPLYITSHNMNKEV 240

Query: 241 VFVILEYHKWWLTKYADILSQLSNYPVIDLRKNNKTHCFPQVIAGLRIHDELTVDPSLME 300
           VFVILEYHKWWLTKYADILSQLSNYPVID RKNNKTHCFPQVIAGLRIHDEL+VDPSLME
Sbjct: 241 VFVILEYHKWWLTKYADILSQLSNYPVIDFRKNNKTHCFPQVIAGLRIHDELSVDPSLME 300

Query: 301 GGKSIVDFRNLLDKAYQPRIRELIRQEELEAKISLHRSKRPKLVVLSRKGSSRVITNEKL 360
           GGKSIVDFRNLLD AYQPRIRELIRQEE E KISL+ SKRPKLVVLSRKGSSR ITNEKL
Sbjct: 301 GGKSIVDFRNLLDMAYQPRIRELIRQEEEEGKISLYISKRPKLVVLSRKGSSRAITNEKL 360

Query: 361 MVKMAERMGFEVKVLRPDKTTELAKIYREVNESNVLVGVHGAAMTHSLFMRPNAVFIQII 420
           MVKMAERMGFEVKVLRPDKTTELAKIYRE+NES+VL+GVHGAA+TH+LFMRPNAVFIQII
Sbjct: 361 MVKMAERMGFEVKVLRPDKTTELAKIYREMNESDVLIGVHGAALTHTLFMRPNAVFIQII 420

Query: 421 PLGTVWAAETYYGEPAKKLGLKYIGYEIGAKESSLYSNHNKDDPVLVNPDSITKKGWEYT 480
           PLGTVWAAETYYGEPAKKLGLKYIGYEIGAKESSLYSN+ KDDPVLVNPDSITKKGWEYT
Sbjct: 421 PLGTVWAAETYYGEPAKKLGLKYIGYEIGAKESSLYSNYKKDDPVLVNPDSITKKGWEYT 480

Query: 481 KKIYLDGQNVRLNLGRFEKRLERAYYYCIARARDGRSH 518
           KKIYLD QNVRLNL RFEKRLERAYYYCIARAR GRSH
Sbjct: 481 KKIYLDSQNVRLNLARFEKRLERAYYYCIARARQGRSH 518
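
The percentages reported in the HSP summary lines are ratios of matching columns to alignment length. A minimal sketch of that arithmetic, using the counts from the Cucumis melo hit above, is shown here; the helper name `percent` is hypothetical, and two-decimal rounding is assumed from the way the values are printed.

```python
def percent(matches: int, alignment_length: int) -> str:
    """Format matches / alignment_length as a percentage with two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100 * matches / alignment_length:.2f}"


# Counts from the XP_008455788.2 HSP: 483 identities and 500 positives
# over an alignment of 518 columns.
print(percent(483, 518))  # 93.24
print(percent(500, 518))  # 96.53
```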

BLAST of CsaV3_4G028900 vs. NCBI nr
Match: XP_022967775.1 (EGF domain-specific O-linked N-acetylglucosamine transferase-like [Cucurbita maxima])

HSP 1 Score: 802.7 bits (2072), Expect = 7.1e-229
Identity = 411/525 (78.29%), Positives = 452/525 (86.10%), Query Frame = 0

Query: 1   MIHQYMRYQPWKKGVSSGKTHFHRVEDEEGGEVGMMGCEEFYY--SASAYKKANKPKFLF 60
           M+H+Y+RYQ WKKGV+ G+TH+   ED+E  EV  MGCEEFYY  S SAYK+  + KFL 
Sbjct: 1   MVHEYVRYQAWKKGVNLGRTHY---EDDEDEEVEGMGCEEFYYSVSTSAYKR-TRIKFLV 60

Query: 61  LLFLSFLSCSIIFAPHFFSSSFSPFYSFGVQNDDLSVDKEVFAPLCSSIPNGTICCDRSS 120
           L FLSFLSCS IFAP FF   FS FY FGVQN +L  DKEV APLCSS+PNGTICCDR+S
Sbjct: 61  LFFLSFLSCSFIFAPLFF---FSRFYCFGVQNGELFADKEVLAPLCSSVPNGTICCDRNS 120

Query: 121 IRSDICIMKGDIRTDSSSSSIFLYTSPDSPIEFDDDHGVIQVEKIKPYTRKWEKNTMDTI 180
           IRSDIC MKGDIRT SSSSSIFLYTSPD PI+ D+D  VIQVEKIKPYTRKWEKNTMDTI
Sbjct: 121 IRSDICTMKGDIRTHSSSSSIFLYTSPDPPIDNDED-DVIQVEKIKPYTRKWEKNTMDTI 180

Query: 181 DELELIVKRKSNDIDQKHRCDVRHNVPAVFFSTGGYTGNVYHEFNDGILPLYITSHSMNK 240
           DELELIVKR SN  +  HRCDVRH+VPAVFFSTGGYTGNVYHEFNDGILPLYITS+ M K
Sbjct: 181 DELELIVKR-SNASNYHHRCDVRHDVPAVFFSTGGYTGNVYHEFNDGILPLYITSNHMKK 240

Query: 241 EVVFVILEYHKWWLTKYADILSQLSNYPVIDLRKNNKTHCFPQVIAGLRIHDELTVDPSL 300
           EVVFVILEYH WW TKY DILSQLSNYP ID R +N+THCFPQ IAGLRIHDELTV+PSL
Sbjct: 241 EVVFVILEYHNWWFTKYGDILSQLSNYPPIDFRNDNRTHCFPQAIAGLRIHDELTVEPSL 300

Query: 301 MEGGKSIVDFRNLLDKAYQPRIRELIRQEELEAKISLHRS-----KRPKLVVLSRKGSSR 360
           MEG  SIVDFRN+LD AY+PRI+ELIRQEE E K  +  S     KRPKLVVLSRKGSSR
Sbjct: 301 MEGSTSIVDFRNVLDMAYRPRIQELIRQEEGEVKEEVKISTPLEPKRPKLVVLSRKGSSR 360

Query: 361 VITNEKLMVKMAERMGFEVKVLRPDKTTELAKIYREVNESNVLVGVHGAAMTHSLFMRPN 420
            ITNE LMVKMAE +GFEVKVLRPD++TELAKIYRE+N+S+V+VGVHGAAMTH LFMRPN
Sbjct: 361 EITNENLMVKMAENVGFEVKVLRPDRSTELAKIYRELNQSDVMVGVHGAAMTHFLFMRPN 420

Query: 421 AVFIQIIPLGTVWAAETYYGEPAKKLGLKYIGYEIGAKESSLYSNHNKDDPVLVNPDSIT 480
           AVFIQIIPLGTVWAAETYYGEPAKKLGLKYIGYEI  KESSLYS +N++DP+L++PDSIT
Sbjct: 421 AVFIQIIPLGTVWAAETYYGEPAKKLGLKYIGYEIQPKESSLYSKYNEEDPILIDPDSIT 480

Query: 481 KKGWEYTKKIYLDGQNVRLNLGRFEKRLERAYYYCIARARDGRSH 519
           +KGWEYTKKIYLDGQNVRLNL RF+KRL+RAYYYCIAR R  R+H
Sbjct: 481 RKGWEYTKKIYLDGQNVRLNLARFQKRLDRAYYYCIARTRH-RTH 515

BLAST of CsaV3_4G028900 vs. NCBI nr
Match: XP_022928604.1 (EGF domain-specific O-linked N-acetylglucosamine transferase-like [Cucurbita moschata])

HSP 1 Score: 792.0 bits (2044), Expect = 1.3e-225
Identity = 407/519 (78.42%), Positives = 444/519 (85.55%), Query Frame = 0

Query: 1   MIHQYMRYQPWKKGVSSGKTHFHRVEDEEGGEVGMMGCEEFYY--SASAYKKANKPKFLF 60
           M+H+YM+Y  WKKGV  G+TH+   ED+E  EVG MGCEE YY  S SAYK+  + KFL 
Sbjct: 1   MVHEYMQYPAWKKGVILGRTHY---EDDEDEEVGGMGCEEIYYSVSVSAYKR-TRIKFLV 60

Query: 61  LLFLSFLSCSIIFAPHFFSSSFSPFYSFGVQNDDLSVDKEVFAPLCSSIPNGTICCDRSS 120
           L FLSFLSCS IFAP FF   FS FY FGV+N DL  DK V APLCSS+PNGTICCDR+S
Sbjct: 61  LFFLSFLSCSFIFAPLFF---FSRFYCFGVENGDLFADKGVLAPLCSSVPNGTICCDRNS 120

Query: 121 IRSDICIMKGDIRTDSSSSSIFLYTSPDSPIEFDDDHGVIQVEKIKPYTRKWEKNTMDTI 180
           IRSDIC MKGDIRT SSSSSIFLYTSPDSPI+ D+D  VIQVEKIKPYTRKWEKNTMDTI
Sbjct: 121 IRSDICTMKGDIRTHSSSSSIFLYTSPDSPIDNDED-DVIQVEKIKPYTRKWEKNTMDTI 180

Query: 181 DELELIVKRKSNDIDQKHRCDVRHNVPAVFFSTGGYTGNVYHEFNDGILPLYITSHSMNK 240
           DELELIVKR SN  D  HRCDVRH VPAVFFSTGGYTGNVYHEFNDGILPLYITS+ M K
Sbjct: 181 DELELIVKR-SNAGDYHHRCDVRHGVPAVFFSTGGYTGNVYHEFNDGILPLYITSNHMKK 240

Query: 241 EVVFVILEYHKWWLTKYADILSQLSNYPVIDLRKNNKTHCFPQVIAGLRIHDELTVDPSL 300
           EVVFVILEYH WW TKY DILSQLSNYP ID R +N+THCFPQ IAGLRIHDEL+V+PSL
Sbjct: 241 EVVFVILEYHNWWFTKYGDILSQLSNYPPIDFRNDNRTHCFPQAIAGLRIHDELSVEPSL 300

Query: 301 MEGGKSIVDFRNLLDKAYQPRIRELIRQEE---LEAKISLH-RSKRPKLVVLSRKGSSRV 360
           MEG  SIVDFRN+LD AY+PRI+EL R EE    E KIS     KRPKLVVLSRKGSSR 
Sbjct: 301 MEGSTSIVDFRNVLDMAYRPRIQELSRHEEEVKEEVKISTPLEPKRPKLVVLSRKGSSRE 360

Query: 361 ITNEKLMVKMAERMGFEVKVLRPDKTTELAKIYREVNESNVLVGVHGAAMTHSLFMRPNA 420
           ITNE LMVKMAE +GFEVKV+RPD++TELAKIYRE+N+S+V+VGVHGAAMTH LFMRPNA
Sbjct: 361 ITNENLMVKMAENVGFEVKVVRPDRSTELAKIYRELNQSDVMVGVHGAAMTHFLFMRPNA 420

Query: 421 VFIQIIPLGTVWAAETYYGEPAKKLGLKYIGYEIGAKESSLYSNHNKDDPVLVNPDSITK 480
           VFIQI+PLGTVWAAETYYGEPAKKLGLKYIGYEI  KESSLYS +N++DPVLV+PDSIT+
Sbjct: 421 VFIQIVPLGTVWAAETYYGEPAKKLGLKYIGYEIQPKESSLYSKYNEEDPVLVDPDSITQ 480

Query: 481 KGWEYTKKIYLDGQNVRLNLGRFEKRLERAYYYCIARAR 514
           KGWEYTKKIYLDGQNVRLNL RF+KRL+RAYYYCIAR R
Sbjct: 481 KGWEYTKKIYLDGQNVRLNLARFQKRLDRAYYYCIARTR 510

BLAST of CsaV3_4G028900 vs. NCBI nr
Match: XP_023544659.1 (EGF domain-specific O-linked N-acetylglucosamine transferase-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 788.9 bits (2036), Expect = 1.1e-224
Identity = 405/525 (77.14%), Positives = 447/525 (85.14%), Query Frame = 0

Query: 1   MIHQYMRYQPWKKGVSSGKTHFHRVEDEEGGEVGMMGCEEFYY--SASAYKKANKPKFLF 60
           M+H Y+RYQ WKKGV+ G+TH+   E++E  EV  MGCEEFYY  S SAYK+  + KFL 
Sbjct: 1   MVHGYVRYQAWKKGVNLGRTHY---EEDEDDEVEGMGCEEFYYSVSVSAYKR-TRIKFLV 60

Query: 61  LLFLSFLSCSIIFAPHFFSSSFSPFYSFGVQNDDLSVDKEVFAPLCSSIPNGTICCDRSS 120
           L+FLSFL CS IFAP FF   FS FY FGV+N +L  DKEV APLCSS+ NGTICCDR+S
Sbjct: 61  LVFLSFLCCSFIFAPLFF---FSRFYCFGVENGELFADKEVLAPLCSSVSNGTICCDRNS 120

Query: 121 IRSDICIMKGDIRTDSSSSSIFLYTSPDSPIEFDDDHGVIQVEKIKPYTRKWEKNTMDTI 180
           IRSDIC MKGDIRT SSSSSIFLYTSPDSPI  D D  VIQVEKIKPYTRKWEKNTMDTI
Sbjct: 121 IRSDICTMKGDIRTHSSSSSIFLYTSPDSPINNDKD-DVIQVEKIKPYTRKWEKNTMDTI 180

Query: 181 DELELIVKRKSNDIDQKHRCDVRHNVPAVFFSTGGYTGNVYHEFNDGILPLYITSHSMNK 240
           DELELIVKR SN  D  HRCDVRH+VPAVFFSTGGYTGNVYHEFNDGILPLYITS+ M K
Sbjct: 181 DELELIVKR-SNASDYHHRCDVRHDVPAVFFSTGGYTGNVYHEFNDGILPLYITSNHMKK 240

Query: 241 EVVFVILEYHKWWLTKYADILSQLSNYPVIDLRKNNKTHCFPQVIAGLRIHDELTVDPSL 300
           EVVFVILEYH WW TKY DILSQLSNYP ID R +N+THCFPQ IAGLRIHDELTV+ SL
Sbjct: 241 EVVFVILEYHNWWFTKYGDILSQLSNYPPIDFRNDNRTHCFPQAIAGLRIHDELTVESSL 300

Query: 301 MEGGKSIVDFRNLLDKAYQPRIRELIRQEELEAKISLH-----RSKRPKLVVLSRKGSSR 360
           MEG  SIVDFRN+LD AY+PRI+ELIR EE E K  +      + K+PKLVVLSRKGSSR
Sbjct: 301 MEGSTSIVDFRNVLDMAYRPRIQELIRHEEEEVKEEVKISTPLKPKKPKLVVLSRKGSSR 360

Query: 361 VITNEKLMVKMAERMGFEVKVLRPDKTTELAKIYREVNESNVLVGVHGAAMTHSLFMRPN 420
            ITNE LMVK+AE +GFEVKVLRPD++TELAKIYRE+N+S+V+VGVHGAAMTH LFMRPN
Sbjct: 361 EITNENLMVKIAENVGFEVKVLRPDRSTELAKIYRELNQSDVMVGVHGAAMTHFLFMRPN 420

Query: 421 AVFIQIIPLGTVWAAETYYGEPAKKLGLKYIGYEIGAKESSLYSNHNKDDPVLVNPDSIT 480
           AVFIQI+PLGTVWAAETYYGEPAKKLGLKYIGYEI  KESSLYS +N++DPVLV+PDSIT
Sbjct: 421 AVFIQIVPLGTVWAAETYYGEPAKKLGLKYIGYEIQPKESSLYSKYNEEDPVLVDPDSIT 480

Query: 481 KKGWEYTKKIYLDGQNVRLNLGRFEKRLERAYYYCIARARDGRSH 519
           +KGWEYTKKIYLDGQNVRLNL RF+KRL+RAYYYCIAR R  R+H
Sbjct: 481 QKGWEYTKKIYLDGQNVRLNLARFQKRLDRAYYYCIARTRH-RTH 515

BLAST of CsaV3_4G028900 vs. TAIR10
Match: AT2G41640.1 (Glycosyltransferase family 61 protein)

HSP 1 Score: 556.6 bits (1433), Expect = 1.6e-158
Identity = 282/465 (60.65%), Positives = 349/465 (75.05%), Query Frame = 0

Query: 50  KANKPKFLFLLFLSFLSCSIIFAPH--FFSSSFSPFYSFGVQNDDLSVDKEVFAPLCSSI 109
           K  K KF  LLFLS LSC  + +P+  F  S+ S   SF  + + LS  + V  PLCS I
Sbjct: 35  KRAKQKFRCLLFLSILSCCFVLSPYYLFGFSTLSLLDSFRREIEGLSSYEPVITPLCSEI 94

Query: 110 PNGTICCDRSSIRSDICIMKGDIRTDSSSSSIFLYTSPDSPIEFDDDHGVIQVEKIKPYT 169
            NGTICCDR+ +RSDIC+MKGD+RT+S+SSSIFL+TS          +   + EKIKPYT
Sbjct: 95  SNGTICCDRTGLRSDICVMKGDVRTNSASSSIFLFTS--------STNNNTKPEKIKPYT 154

Query: 170 RKWEKNTMDTIDELELIVKRKSNDIDQKHRCDVRHNVPAVFFSTGGYTGNVYHEFNDGIL 229
           RKWE + MDT+ EL LI K  +   D+   CDV H+VPAVFFSTGGYTGNVYHEFNDGI+
Sbjct: 155 RKWETSVMDTVQELNLITKDSNKSSDRV--CDVYHDVPAVFFSTGGYTGNVYHEFNDGII 214

Query: 230 PLYITSHSMNKEVVFVILEYHKWWLTKYADILSQLSNYPVIDLRKNNKTHCFPQVIAGLR 289
           PL+ITS   NK+VVFVI+EYH WW  KY D++SQLS+YP++D   + +THCF +   GLR
Sbjct: 215 PLFITSQHYNKKVVFVIVEYHDWWEMKYGDVVSQLSDYPLVDFNGDTRTHCFKEATVGLR 274

Query: 290 IHDELTVDPSLMEGGKSIVDFRNLLDKAYQPRIRELIRQEELEAKI-SLHRSKRPKLVVL 349
           IHDELTV+ SL+ G ++IVDFRN+LD+ Y  RI+ L  QEE EA + +L   K+PKLV+L
Sbjct: 275 IHDELTVNSSLVIGNQTIVDFRNVLDRGYSHRIQSL-TQEETEANVTALDFKKKPKLVIL 334

Query: 350 SRKGSSRVITNEKLMVKMAERMGFEVKVLRPDKTTELAKIYREVNESNVLVGVHGAAMTH 409
           SR GSSR I NE L+V++AE+ GF V+VLRP KTTE+AKIYR +N S+V++GVHGAAMTH
Sbjct: 335 SRNGSSRAILNENLLVELAEKTGFNVEVLRPQKTTEMAKIYRSLNTSDVMIGVHGAAMTH 394

Query: 410 SLFMRPNAVFIQIIPLGTVWAAETYYGEPAKKLGLKYIGYEIGAKESSLYSNHNKDDPVL 469
            LF++P  VFIQIIPLGT WAAETYYGEPAKKLGLKY+GY+I  KESSLY  + KDDPV+
Sbjct: 395 FLFLKPKTVFIQIIPLGTDWAAETYYGEPAKKLGLKYVGYKIAPKESSLYEEYGKDDPVI 454

Query: 470 VNPDSITKKGWEYTKKIYLDGQNVRLNLGRFEKRLERAYYYCIAR 512
            +PDS+  KGWEYTKKIYL GQNV+L+L RF + L R+Y + I R
Sbjct: 455 RDPDSLNDKGWEYTKKIYLQGQNVKLDLRRFRETLTRSYDFSIRR 488

BLAST of CsaV3_4G028900 vs. TAIR10
Match: AT3G57380.1 (Glycosyltransferase family 61 protein)

HSP 1 Score: 540.0 bits (1390), Expect = 1.6e-153
Identity = 282/506 (55.73%), Positives = 356/506 (70.36%), Query Frame = 0

Query: 11  WKKGVSSGKTHFHRVEDEEGGEVGMMGCEEFYYSASAYKKANKPKFLFLLFLSFLSCSII 70
           +K+ +  G+   HR+  EEGG     G      S   Y K  K K L  +FLS LSC  I
Sbjct: 4   YKRLIKKGEK--HRLSVEEGGS----GASAVTVSGGVYSKTAKQKLLLTIFLSLLSCCYI 63

Query: 71  FAPHFFSSSFSPFYSFGVQNDDLSVDKEVFAPLCSSIPNGTICCDRSSIRSDICIMKGDI 130
           F+     SSFS   +F  ++      +   APLCS   NGTICCDR+  RSD+CIMKGD+
Sbjct: 64  FS----FSSFSLLGAFSRESKGFGPYELFIAPLCSGTSNGTICCDRTGSRSDVCIMKGDV 123

Query: 131 RTDSSSSSIFLYTSPDSPIEFDDDHGVIQVEKIKPYTRKWEKNTMDTIDELELIVKRKSN 190
           RT S+SSS+FL+TS  +  +          +KIKPYTRKWE + M T+ EL L+ + + N
Sbjct: 124 RTHSASSSVFLFTSLKNKTKI--------TKKIKPYTRKWETSVMQTVQELNLVYRDEEN 183

Query: 191 D----IDQKHRCDVRHNVPAVFFSTGGYTGNVYHEFNDGILPLYITSHSMNKEVVFVILE 250
           +          CDV +NVPAVFFSTGGYTGNVYHEFNDGI+PL+ITSH  NK+VVFVI+E
Sbjct: 184 NSLVVSSVNDICDVFYNVPAVFFSTGGYTGNVYHEFNDGIIPLFITSHHFNKKVVFVIVE 243

Query: 251 YHKWWLTKYADILSQLSNYPVIDLRKNNKTHCFPQVIAGLRIHDELTVDPSLMEGGKSIV 310
           YH WW+ KY DI+SQLS+YP +D   + +THCF + I GL+IHDELTV+ SLM G K+I+
Sbjct: 244 YHSWWIMKYGDIVSQLSDYPPVDFNGDKRTHCFKEAIVGLKIHDELTVESSLMLGNKTIL 303

Query: 311 DFRNLLDKAYQPRIRELIRQEELEAKISLHRS-KRPKLVVLSRKGSSRVITNEKLMVKMA 370
           DFRN+LD+AY PRI  LI++EEL+A        K+P LV+LSR G SR I NE L+V++A
Sbjct: 304 DFRNVLDQAYWPRIHGLIQEEELKAANKTEDGFKKPILVILSRNG-SREILNESLLVELA 363

Query: 371 ERMGFEVKVLRPDKTTELAKIYREVNESNVLVGVHGAAMTHSLFMRPNAVFIQIIPLGTV 430
           E +GF V VLRPDKTTELAKIYR +N S+V++GVHGAAMTH LF++P  VFIQIIP+GT 
Sbjct: 364 EEIGFIVHVLRPDKTTELAKIYRCLNSSDVMIGVHGAAMTHLLFLKPKTVFIQIIPIGTE 423

Query: 431 WAAETYYGEPAKKLGLKYIGYEIGAKESSLYSNHNKDDPVLVNPDSITKKGWEYTKKIYL 490
           WAAETYYG+PAKK+ LKYIGY+I  KESSLY  +  DDP++ +P S T+KGW+YTKKIYL
Sbjct: 424 WAAETYYGKPAKKMRLKYIGYKIKPKESSLYDEYGIDDPIIRDPKSFTQKGWDYTKKIYL 483

Query: 491 DGQNVRLNLGRFEKRLERAYYYCIAR 512
           + QNV+L+L RF K L RAY + + R
Sbjct: 484 ERQNVKLDLKRFRKPLSRAYDFSMKR 490

BLAST of CsaV3_4G028900 vs. TAIR10
Match: AT3G10320.1 (Glycosyltransferase family 61 protein)

HSP 1 Score: 503.8 bits (1296), Expect = 1.2e-142
Identity = 270/464 (58.19%), Positives = 334/464 (71.98%), Query Frame = 0

Query: 50  KANKPKFLFLLFLSFLSCSIIFAPHFFSSSFSPFYSFGVQNDDLSVDKEVFAPLCSSIPN 109
           K +KPK ++LL  S +S   +FAP      + P   F + +    ++  V      S   
Sbjct: 35  KRSKPKLIYLLIFSLISSCFVFAPQLLCFPY-PSALFLIDSSIKEIENRVSESNIESPKT 94

Query: 110 G----TICCDRSSIRSDICIMKGDIRTDSSSSSIFLYTSPDSPIEFDDDHGVIQVEKIKP 169
                +I CDR+  RSDIC MKGDIRT S SSSIFLYTS D   +      V+Q EKIKP
Sbjct: 95  SQKEESISCDRTGYRSDICFMKGDIRTHSPSSSIFLYTSNDLTTD-----QVLQ-EKIKP 154

Query: 170 YTRKWEKNTMDTIDELELIVKRKSNDIDQKHRCDVRHNVPAVFFSTGGYTGNVYHEFNDG 229
           YTRKWE + M+TI EL+L+ K        K +C+V H VPAV FSTGGYTGN+YHEFNDG
Sbjct: 155 YTRKWETSIMETIPELKLVTK-DMKLFGDKRKCEVIHEVPAVLFSTGGYTGNLYHEFNDG 214

Query: 230 ILPLYITSHSMNKEVVFVILEYHKWWLTKYADILSQLSNYPVIDLRKNNKTHCFPQVIAG 289
           ++PLYITS   NK+VVFVI EYHKWW  KY D+LSQLS+Y +ID  K+ +THCF + I G
Sbjct: 215 LIPLYITSKRFNKKVVFVIAEYHKWWEMKYGDVLSQLSDYSLIDFNKDKRTHCFKEAIVG 274

Query: 290 LRIHDELTVDPSLM-EGGKSIVDFRNLLDKAYQPRIRELIRQEE--LEAKISLHR-SKRP 349
           LRIH ELTVDPS M + G +I +FRN+LD+AY+PRI  L R EE    A+++  R +KRP
Sbjct: 275 LRIHGELTVDPSQMQDDGTTINEFRNVLDRAYRPRINRLDRLEEQRFHARLAQRRKAKRP 334

Query: 350 KLVVLSRKGSSRVITNEKLMVKMAERMGFEVKVLRPDKTTELAKIYREVNESNVLVGVHG 409
           KL + SR G SR ITNE LMVKMA+R+GF+++VLRPD+TTELAKIYR +N S V+VGVHG
Sbjct: 335 KLALFSRTG-SRGITNEDLMVKMAQRIGFDIEVLRPDRTTELAKIYRVLNSSKVMVGVHG 394

Query: 410 AAMTHSLFMRPNAVFIQIIPLGTVWAAETYYGEPAKKLGLKYIGYEIGAKESSLYSNHNK 469
           AAMTH LFM+P ++FIQIIPLGT WAAETYYGEPAKKLGL Y GY+I  +ESSLY  ++K
Sbjct: 395 AAMTHFLFMKPGSIFIQIIPLGTDWAAETYYGEPAKKLGLDYNGYKILPRESSLYEKYDK 454

Query: 470 DDPVLVNPDSITKKGWEYTKKIYLDGQNVRLNLGRFEKRLERAY 506
           DDP+L +P+SITKKGW++TK IYL+ Q VRL+L RF+K L  AY
Sbjct: 455 DDPILKDPNSITKKGWQFTKGIYLNDQKVRLDLHRFKKLLIDAY 489

BLAST of CsaV3_4G028900 vs. TAIR10
Match: AT2G03370.1 (Glycosyltransferase family 61 protein)

HSP 1 Score: 292.7 bits (748), Expect = 4.3e-79
Identity = 156/404 (38.61%), Positives = 241/404 (59.65%), Query Frame = 0

Query: 109 NGTICCDRSSIRSDICIMKGDIRTDSSSSSIFLYTSPDSPIEFDDDHGVIQVEKIKPYTR 168
           + TI CDRS    D+C + G    D  + ++ L     +P+          VEKI+PY +
Sbjct: 65  SATITCDRSHTNYDLCSINGSCNLDLKTGTLTLMDPTSAPL----------VEKIRPYPK 124

Query: 169 KWEKNTMDTIDELELIVKRKSNDIDQKHRCDVRHNVPAVFFSTGGYTGNVYHEFNDGILP 228
           K +   M  I EL L     S  +     CD+ H++PA+ FS GGYTG++YH+  DG +P
Sbjct: 125 KADNWIMPRIRELTL----TSGPLGLPRSCDITHDLPAIVFSAGGYTGSIYHDLMDGFIP 184

Query: 229 LYITSHSM--NKEVVFVILEYHKWWLTKYADILSQLSNY-PVIDLRKNN--KTHCFPQVI 288
           L+IT++S+  +++ + V++   +WW+ KY DIL   S + P++ L K +   THCF   I
Sbjct: 185 LFITANSVYPDRDFIPVVVNAKEWWMPKYIDILGTFSKHKPILLLDKESVATTHCFTSAI 244

Query: 289 AGLRIHDELTVDPSLMEGGKSIVDFRNLLDKAYQPRIRELIRQEELEAKISLHRSKRPKL 348
            GL  H  +T+DP+ +   KS+VDF NLL+KA+                IS  ++ +P+L
Sbjct: 245 VGLITHWPMTIDPTQIPNSKSLVDFHNLLEKAF-------------TTNISTPKTHKPRL 304

Query: 349 VVLSRKGS-SRVITNEKLMVKMAERMGFEVKVLRPDKTTELAKIYREVNESNVLVGVHGA 408
           +++SR G+  RVI NE+ + +M E +GFEV + RP KTT L + Y+ +  S+ +VGVHGA
Sbjct: 305 MLVSRYGNIGRVILNEQEIKEMLEDVGFEVIIFRPSKTTNLKEAYKLIKSSHGMVGVHGA 364

Query: 409 AMTHSLFMRPNAVFIQIIPLGTVWAAETYYGEPAKKLGLKYIGYEIGAKESSLYSNHNKD 468
           A+TH LF+RP ++F+Q++PLG  WA++  Y  PAK + L+Y+ Y++  +ESSL   +N+D
Sbjct: 365 ALTHLLFLRPGSIFVQVVPLGLGWASKPCYESPAKTMKLEYLEYKVNVEESSLIEKYNRD 424

Query: 469 DPVLVNPDSITKKGWEYTK-KIYLDGQNVRLNLGRFEKRLERAY 506
           D VL +P +     W  TK K+YL  Q+V L++ RF K +  AY
Sbjct: 425 DLVLKDPIAYRGMDWNATKMKVYLKEQDVSLDVNRFRKHMNEAY 441

BLAST of CsaV3_4G028900 vs. TAIR10
Match: AT2G03360.2 (Glycosyltransferase family 61 protein)

HSP 1 Score: 279.6 bits (714), Expect = 3.8e-75
Identity = 166/459 (36.17%), Positives = 253/459 (55.12%), Query Frame = 0

Query: 57  LFLLFLSFLSCSIIFAPHFFSS-SFSPFYSFGVQNDDLSVDKEVFAPLCSSIPNGTICCD 116
           L ++ + +++ S +F    F S SFS   S G         +    P   +  +  I CD
Sbjct: 14  LVIVMVIYVAFSSVFDRQLFRSLSFSQVSSVGTLQQRWESRRTKQNPKVMA-ASAKITCD 73

Query: 117 RSSIRSDICIMKGDIRTDSSSSSIFLY---TSPDSPIEFDDDHGVIQVEKIKPYTRKWEK 176
           RS    D+C + G    +  + ++ L     +  +P+          VEKI+PY RK E 
Sbjct: 74  RSHTSYDLCSINGSCILNPKTGTLTLMDRTLTTSAPL----------VEKIRPYPRKSEN 133

Query: 177 NTMDTIDELELIVKRKSNDIDQKHRCDVRHNVPAVFFSTGGYTGNVYHEFNDGILPLYIT 236
             M  I EL+L     S   D    CD+ H+ PA+ FS GGYTG++YH+F DG +PL+IT
Sbjct: 134 WIMPRIRELKL----TSGPSDLTRSCDITHDSPAIVFSAGGYTGSIYHDFIDGFIPLFIT 193

Query: 237 SHSM--NKEVVFVILEYHKWWLTKYADILSQLSNYPVIDLRKNNK--THCFPQVIAGLRI 296
           ++S+  +++ + V++   +WW+ KY DIL   S +  I L K N   THCF     GL  
Sbjct: 194 ANSVYPDRDFILVVVNPKEWWMPKYIDILGTFSKHKTILLDKENASITHCFTSATVGLIS 253

Query: 297 HDELTVDPSLMEGGKSIVDFRNLLDKAYQPRIRELIRQEELEAKISLHRSKRPKLVVLSR 356
           H  +T+DP+ +   KS+VDF NLLDKA  P              +S+ +  +P+L+++ R
Sbjct: 254 HGPMTIDPTQIPNSKSLVDFHNLLDKALNP-------------NLSIIKINKPRLILVRR 313

Query: 357 KGS-SRVITNEKLMVKMAERMGFEVKVLRPDKTTELAKIYREVNESNVLVGVHGAAMTHS 416
            G+  RVI NE+ + +M E +GFEV   RP KTT L + Y+ +  S+ ++GVHGAA+T  
Sbjct: 314 YGNIGRVILNEEEIREMLEDVGFEVITFRPSKTTSLREAYKLIKSSHGMIGVHGAALTQL 373

Query: 417 LFMRPNAVFIQIIPLGTVWAAETYYGEPAKKLGLKYIGYEIGAKESSLYSNHNKDDPVLV 476
           LF+RP +V +QI+P+G  W ++T +  PAK + L Y  Y +  +ESSL   +++DD VL 
Sbjct: 374 LFLRPGSVLVQIVPVGLGWVSKTCFETPAKAMKLDYTEYRVNVEESSLIEKYSRDDLVLK 433

Query: 477 NPDSITKKGWEYTK-KIYLDGQNVRLNLGRFEKRLERAY 506
           +P +     W  TK K+YL  Q+VRL++ RF K +  AY
Sbjct: 434 DPIAYRGMDWNVTKMKVYLKDQDVRLDVNRFRKHMNEAY 444

BLAST of CsaV3_4G028900 vs. Swiss-Prot
Match: sp|Q5NDL3|EOGT_CHICK (EGF domain-specific O-linked N-acetylglucosamine transferase OS=Gallus gallus OX=9031 GN=EOGT PE=2 SV=2)

HSP 1 Score: 58.9 bits (141), Expect = 1.9e-07
Identity = 74/303 (24.42%), Positives = 131/303 (43.23%), Query Frame = 0

Query: 197 RCDVRHNVPAVFFSTGGYTGNVYHEFNDGILPLYITSH---SMNKEVVFVILEYHKWWLT 256
           +CD+    P  F        N+YH F D  + LYIT H   S + +V  V+ +   +   
Sbjct: 234 KCDIVIEKPTYFMKLDAGV-NMYHHFCD-FVNLYITQHINNSFSTDVNIVMWDTSSY--- 293

Query: 257 KYADILSQ----LSNYPVIDLRK-NNKTHCFPQVIAGL--RIHDELTVDPSLMEGGKSIV 316
            Y D+ S+     ++Y +I L+  ++K  CF + +  L  R+   L  +  L+ G     
Sbjct: 294 GYGDLFSETWKAFTDYDIIYLKTFDSKRVCFKEAVFSLLPRMRYGLFYNTPLISGCHGTG 353

Query: 317 DFRNLLDKAYQPRI--RELIRQEELEAKISLHRSKRPKLVVLSRKGSSRVITNEKLMVKM 376
            FR     A+   +  R  I QE         +  + ++ +L+R    R I N+  +V  
Sbjct: 354 LFR-----AFSQHVLHRLNITQEG-------PKDGKIRVTILARSTDYRKILNQNELVNA 413

Query: 377 AERMG-FEVKVL-RPDKTTELAKIYREVNESNVLVGVHGAAMTHSLFMRPNAVFIQIIPL 436
            + +   EVKV+    K  E ++  R  + S++ +G+HGA +TH LF+   AV  ++   
Sbjct: 414 LKTVSTLEVKVVDYKYKELEFSEQLRITHNSDIFIGMHGAGLTHLLFLPDWAVVFELYNC 473

Query: 437 GTVWAAETYYGEPAKKLGLKYIGYEIGAKESSLYSNHNKDDPVLVNPDSITKKGWEYTKK 486
                 E  Y + A+  G+ YI +    K + ++       P L      T   ++  + 
Sbjct: 474 ----EDERCYLDLARLRGIHYITWR---KRNKVFPQDQGHHPTLGEHPKFTNYSFDVEEF 512
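
The Swiss-Prot match lines in this section pack several fields into one string: a database prefix, an accession, an entry name, and `OS=`, `OX=`, `GN=`, `PE=`, `SV=` tags. The sketch below shows one way such a line could be split into named fields. The function `parse_match` and the regular expression are illustrative assumptions and are not part of any database tooling.

```python
import re

# Matches tagged fields such as "OS=Gallus gallus" up to the next tag or the end.
FIELD = re.compile(r"\b(OS|OX|GN|PE|SV)=(.+?)(?=\s+(?:OS|OX|GN|PE|SV)=|$)")


def parse_match(match: str) -> dict:
    """Split a 'sp|ACCESSION|ENTRY (description ...)' match line into fields."""
    identifier, _, description = match.partition(" ")
    database, accession, entry_name = identifier.split("|")
    description = description.strip()
    # The page wraps the description in parentheses; drop them if present.
    if description.startswith("(") and description.endswith(")"):
        description = description[1:-1]
    record = {
        "database": database,
        "accession": accession,
        "entry_name": entry_name,
        # Everything before the first OS= tag is the protein name.
        "protein_name": description.split(" OS=", 1)[0],
    }
    record.update(FIELD.findall(description))
    return record


line = ("sp|Q5NDL3|EOGT_CHICK (EGF domain-specific O-linked "
        "N-acetylglucosamine transferase OS=Gallus gallus OX=9031 "
        "GN=EOGT PE=2 SV=2)")
record = parse_match(line)
print(record["accession"], record["OS"], record["OX"])  # Q5NDL3 Gallus gallus 9031
```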

BLAST of CsaV3_4G028900 vs. Swiss-Prot
Match: sp|A0JND3|EOGT_BOVIN (EGF domain-specific O-linked N-acetylglucosamine transferase OS=Bos taurus OX=9913 GN=EOGT PE=2 SV=1)

HSP 1 Score: 57.8 bits (138), Expect = 4.2e-07
Identity = 76/304 (25.00%), Positives = 130/304 (42.76%), Query Frame = 0

Query: 197 RCDVRHNVPAVFFSTGGYTGNVYHEFNDGILPLYITSHSMNKEVVFVILEYHKWWLTK-- 256
           +CD+    P  F        N+YH F D  + LYIT H  N    F    Y   W T   
Sbjct: 226 QCDIVIEKPTYFMKLDAGV-NMYHHFCD-FINLYITQHVNNS---FSTDVYVVMWDTSSY 285

Query: 257 -YADILSQ----LSNYPVIDLRK-NNKTHCFPQVIAGL--RIHDELTVDPSLMEGGKSIV 316
            Y D+ S      ++Y VI L+  + K  CF + I  L  R+   L  +  L+ G ++  
Sbjct: 286 GYGDLFSDTWKAFTDYDVIHLKTYDAKRVCFKEAIFSLLPRMRYGLFYNTPLISGCQNTG 345

Query: 317 DFRNLLDKAYQPRI--RELIRQEELEAKISLHRSKRPKLVVLSRKGSSRVITNEKLMVKM 376
            FR     A+   +  R  I QE         +  + ++ +L+R    R I N+  +V  
Sbjct: 346 LFR-----AFSQHVLHRLNITQEG-------PKGGKIRVTILARSTEYRKILNQNELVNA 405

Query: 377 AERMG-FEVKVLRPDKTTELAKI--YREVNESNVLVGVHGAAMTHSLFMRPNAVFIQIIP 436
            + +  FEV+++   K  EL  +   R  + +++ +G+HGA +TH LF+   A   ++  
Sbjct: 406 LKTVSTFEVQIV-DYKYKELGFLDQLRITHNTDIFIGMHGAGLTHLLFLPDWAAVFELYN 465

Query: 437 LGTVWAAETYYGEPAKKLGLKYIGYEIGAKESSLYSNHNKDDPVLVNPDSITKKGWEYTK 486
            G     E  Y + A+  G+ YI +    +++ ++       P L      T   ++  +
Sbjct: 466 CGD----ERCYLDLARLRGVHYITWR---RQNKVFPQDKGHHPTLGEHPKFTNYSFDVEE 504

BLAST of CsaV3_4G028900 vs. Swiss-Prot
Match: sp|Q6GQ23|EOGT_XENLA (EGF domain-specific O-linked N-acetylglucosamine transferase OS=Xenopus laevis OX=8355 GN=eogt PE=2 SV=1)

HSP 1 Score: 51.2 bits (121), Expect = 4.0e-05
Identity = 70/293 (23.89%), Positives = 117/293 (39.93%), Query Frame = 0

Query: 170 WEKNTMDTIDELELIVKRKSNDIDQKHRCDVRHNVPAVFFSTGGYTGNVYHEFNDGILPL 229
           W+        EL+     K   I+  H CD+    P  F        N+YH F D  + L
Sbjct: 198 WKSPLQSWFAELQSYSSFKFKPIEDAH-CDIIIEKPTYFMKLDAGV-NMYHHFCD-FVNL 257

Query: 230 YITSHSMNKEVVFVILEYHKWWLTKYADILSQ----LSNYPVIDLRK-NNKTHCFPQVIA 289
           YIT H  N     + +      +  Y D+ S      ++Y +  L+  +NK  CF   + 
Sbjct: 258 YITQHVNNSFSTDINIVMWTTSVYGYGDLFSDTWKAFTDYEITHLKAYDNKRVCFKDAVF 317

Query: 290 GL--RIHDELTVDPSLMEGGKSIVDFRNLLDKAYQPRIRELIRQEELEAKISLHRSKRPK 349
            L  R+   L  +  L+        FR             + +    EAKI        +
Sbjct: 318 ALLPRMRYGLFYNTPLISHCHGSGLFRAFSQHVLHR--LNITQHPATEAKI--------R 377

Query: 350 LVVLSRKGSSRVITNEKLMVKMAERM-GFEVKVLRPDKTTELAKIYREV---NESNVLVG 409
           + +L R    R I N   +V+  E +  F+VKV+  D    +     ++   + S++ +G
Sbjct: 378 VTILVRSTEFRKILNLDELVQALEAVPTFQVKVV--DYKYRVLGFLEQLSITHNSDIFIG 437

Query: 410 VHGAAMTHSLFMRPNAVFIQIIPLGTVWAAETYYGEPAKKLGLKYIGYEIGAK 452
           +HGA +TH LF+   AV  ++            Y + A+  G++Y+ +E G K
Sbjct: 438 MHGAGLTHLLFLPDWAVVFELYNCEDA----RCYLDLARLRGIQYMTWEKGDK 471

BLAST of CsaV3_4G028900 vs. Swiss-Prot
Match: sp|Q08CY9|EOGT_XENTR (EGF domain-specific O-linked N-acetylglucosamine transferase OS=Xenopus tropicalis OX=8364 GN=eogt PE=2 SV=1)

HSP 1 Score: 50.8 bits (120), Expect = 5.2e-05
Identity = 66/287 (23.00%), Positives = 115/287 (40.07%), Query Frame = 0

Query: 170 WEKNTMDTIDELELIVKRKSNDIDQKHRCDVRHNVPAVFFSTGGYTGNVYHEFNDGILPL 229
           W+        EL+         ++  H CD+  + P  F        N+YH F D  + L
Sbjct: 198 WKSPLQSWFAELQSYSSLTFKPVEDAH-CDIIIDKPTYFMKLDAGV-NMYHHFCD-FVNL 257

Query: 230 YITSHSMNKEVVFVILEYHKWWLTKYADILSQ----LSNYPVIDLRK-NNKTHCFPQVIA 289
           YIT H  N     + +      +  Y D+ S      ++Y +  L+  +NK  CF   + 
Sbjct: 258 YITQHVNNSFSTDINIVMWTTSVYGYGDLFSDTWKAFTDYDITHLKAYDNKRVCFKDAVF 317

Query: 290 GL--RIHDELTVDPSLMEGGKSIVDFRNLLDKAYQPRIRELIRQEELEAKISLHRSKRPK 349
            L  R+   L  +  L+        FR             + +Q   EAKI        +
Sbjct: 318 ALLPRMRYGLFYNTPLISNCHGSGLFRAFSQHVLHR--LNITQQLPKEAKI--------R 377

Query: 350 LVVLSRKGSSRVITN-EKLMVKMAERMGFEVKVL-RPDKTTELAKIYREVNESNVLVGVH 409
           + +L R    R I N ++L+  +     F+VKV+    +     +     + S++ +G+H
Sbjct: 378 ITILVRSTEFRKILNLDELVHALEAEPTFQVKVVDYKYRVLGFLEQLEITHNSDIFIGMH 437

Query: 410 GAAMTHSLFMRPNAVFIQIIPLGTVWAAETYYGEPAKKLGLKYIGYE 448
           GA +TH LF+   AV  ++         E  Y + A+  G++Y+ +E
Sbjct: 438 GAGLTHLLFLPDWAVVFELYNC----EDERCYLDLARLRGIRYMTWE 467

BLAST of CsaV3_4G028900 vs. Swiss-Prot
Match: sp|Q5NDE4|PMGT2_TAKRU (Protein O-linked-mannose beta-1,4-N-acetylglucosaminyltransferase 2 OS=Takifugu rubripes OX=31033 GN=pomgnt2 PE=2 SV=1)

HSP 1 Score: 49.3 bits (116), Expect = 1.5e-04
Identity = 56/224 (25.00%), Positives = 104/224 (46.43%), Query Frame = 0

Query: 217 NVYHEFNDGILPLYITSHSM---NKEVVFVILEYHKWWLTKYADILSQLSN-YPVI--DL 276
           N+ H F+D +LP + T       +++   V +E   W    + ++   LSN  P++   L
Sbjct: 162 NLMHVFHDDLLPAFYTMKQFLDSDEDARLVFME--GWEEGPHFELYRLLSNKQPLLKEQL 221

Query: 277 RKNNKTHCFPQVIAGLRIHDELT-------VDP-----SLMEGGKSIVDFRNLLDKAYQP 336
           R   K  CF +   GL    ++T       V P     +++  G  I  F  +L +  + 
Sbjct: 222 RNFGKLMCFTKSYIGL---SKMTTWYQYGFVQPQGPKANILVSGNEIRHFAKVLME--KM 281

Query: 337 RIRELIRQEELEAKISLHRSKRPKLVVLSRKGSSRVITNEKLMVKMAERMGFEVKVLRPD 396
            I      E+ +      + K   +VV SR  ++R+I NE  ++ MA    F+++V+   
Sbjct: 282 NITRAAGGEKDQGNAEDEKPKDEYIVVFSR-STTRLILNEAELI-MALAQEFQMRVVTVS 341

Query: 397 -KTTELAKIYREVNESNVLVGVHGAAMTHSLFMRPNAVFIQIIP 422
            +      I + ++ +++LV +HGA +  SLF+ P AV +++ P
Sbjct: 342 LEEQSFPSIVQVISGASMLVSMHGAQLITSLFLPPGAVVVELYP 376

BLAST of CsaV3_4G028900 vs. TrEMBL
Match: tr|A0A0A0L217|A0A0A0L217_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G411390 PE=4 SV=1)

HSP 1 Score: 1050.8 bits (2716), Expect = 9.9e-304
Identity = 518/518 (100.00%), Positives = 518/518 (100.00%), Query Frame = 0

Query: 1   MIHQYMRYQPWKKGVSSGKTHFHRVEDEEGGEVGMMGCEEFYYSASAYKKANKPKFLFLL 60
           MIHQYMRYQPWKKGVSSGKTHFHRVEDEEGGEVGMMGCEEFYYSASAYKKANKPKFLFLL
Sbjct: 1   MIHQYMRYQPWKKGVSSGKTHFHRVEDEEGGEVGMMGCEEFYYSASAYKKANKPKFLFLL 60

Query: 61  FLSFLSCSIIFAPHFFSSSFSPFYSFGVQNDDLSVDKEVFAPLCSSIPNGTICCDRSSIR 120
           FLSFLSCSIIFAPHFFSSSFSPFYSFGVQNDDLSVDKEVFAPLCSSIPNGTICCDRSSIR
Sbjct: 61  FLSFLSCSIIFAPHFFSSSFSPFYSFGVQNDDLSVDKEVFAPLCSSIPNGTICCDRSSIR 120

Query: 121 SDICIMKGDIRTDSSSSSIFLYTSPDSPIEFDDDHGVIQVEKIKPYTRKWEKNTMDTIDE 180
           SDICIMKGDIRTDSSSSSIFLYTSPDSPIEFDDDHGVIQVEKIKPYTRKWEKNTMDTIDE
Sbjct: 121 SDICIMKGDIRTDSSSSSIFLYTSPDSPIEFDDDHGVIQVEKIKPYTRKWEKNTMDTIDE 180

Query: 181 LELIVKRKSNDIDQKHRCDVRHNVPAVFFSTGGYTGNVYHEFNDGILPLYITSHSMNKEV 240
           LELIVKRKSNDIDQKHRCDVRHNVPAVFFSTGGYTGNVYHEFNDGILPLYITSHSMNKEV
Sbjct: 181 LELIVKRKSNDIDQKHRCDVRHNVPAVFFSTGGYTGNVYHEFNDGILPLYITSHSMNKEV 240

Query: 241 VFVILEYHKWWLTKYADILSQLSNYPVIDLRKNNKTHCFPQVIAGLRIHDELTVDPSLME 300
           VFVILEYHKWWLTKYADILSQLSNYPVIDLRKNNKTHCFPQVIAGLRIHDELTVDPSLME
Sbjct: 241 VFVILEYHKWWLTKYADILSQLSNYPVIDLRKNNKTHCFPQVIAGLRIHDELTVDPSLME 300

Query: 301 GGKSIVDFRNLLDKAYQPRIRELIRQEELEAKISLHRSKRPKLVVLSRKGSSRVITNEKL 360
           GGKSIVDFRNLLDKAYQPRIRELIRQEELEAKISLHRSKRPKLVVLSRKGSSRVITNEKL
Sbjct: 301 GGKSIVDFRNLLDKAYQPRIRELIRQEELEAKISLHRSKRPKLVVLSRKGSSRVITNEKL 360

Query: 361 MVKMAERMGFEVKVLRPDKTTELAKIYREVNESNVLVGVHGAAMTHSLFMRPNAVFIQII 420
           MVKMAERMGFEVKVLRPDKTTELAKIYREVNESNVLVGVHGAAMTHSLFMRPNAVFIQII
Sbjct: 361 MVKMAERMGFEVKVLRPDKTTELAKIYREVNESNVLVGVHGAAMTHSLFMRPNAVFIQII 420

Query: 421 PLGTVWAAETYYGEPAKKLGLKYIGYEIGAKESSLYSNHNKDDPVLVNPDSITKKGWEYT 480
           PLGTVWAAETYYGEPAKKLGLKYIGYEIGAKESSLYSNHNKDDPVLVNPDSITKKGWEYT
Sbjct: 421 PLGTVWAAETYYGEPAKKLGLKYIGYEIGAKESSLYSNHNKDDPVLVNPDSITKKGWEYT 480

Query: 481 KKIYLDGQNVRLNLGRFEKRLERAYYYCIARARDGRSH 519
           KKIYLDGQNVRLNLGRFEKRLERAYYYCIARARDGRSH
Sbjct: 481 KKIYLDGQNVRLNLGRFEKRLERAYYYCIARARDGRSH 518

BLAST of CsaV3_4G028900 vs. TrEMBL
Match: tr|A0A1S3C1N9|A0A1S3C1N9_CUCME (LOW QUALITY PROTEIN: EGF domain-specific O-linked N-acetylglucosamine transferase-like OS=Cucumis melo OX=3656 GN=LOC103495884 PE=4 SV=1)

HSP 1 Score: 987.3 bits (2551), Expect = 1.3e-284
Identity = 483/518 (93.24%), Positives = 500/518 (96.53%), Query Frame = 0

Query: 1   MIHQYMRYQPWKKGVSSGKTHFHRVEDEEGGEVGMMGCEEFYYSASAYKKANKPKFLFLL 60
           M+HQYMRYQPW+KG++ GKTHFH  EDEE GEVGMMGC+EFYYSASAYKKANKPKFLFLL
Sbjct: 1   MVHQYMRYQPWRKGMNLGKTHFHHEEDEEEGEVGMMGCQEFYYSASAYKKANKPKFLFLL 60

Query: 61  FLSFLSCSIIFAPHFFSSSFSPFYSFGVQNDDLSVDKEVFAPLCSSIPNGTICCDRSSIR 120
           FLSFLSCSIIFAPHFFSSSFSPFYSFGVQNDDLSVDKEVF  LCSSIPNGTICCDR+SIR
Sbjct: 61  FLSFLSCSIIFAPHFFSSSFSPFYSFGVQNDDLSVDKEVFCSLCSSIPNGTICCDRNSIR 120

Query: 121 SDICIMKGDIRTDSSSSSIFLYTSPDSPIEFDDDHGVIQVEKIKPYTRKWEKNTMDTIDE 180
           SDICIMKGDIRTDSSSSSIFLYTSPDSPIEF DD GV+QVEKIKPYTRKWEKNTMDTIDE
Sbjct: 121 SDICIMKGDIRTDSSSSSIFLYTSPDSPIEFGDDQGVLQVEKIKPYTRKWEKNTMDTIDE 180

Query: 181 LELIVKRKSNDIDQKHRCDVRHNVPAVFFSTGGYTGNVYHEFNDGILPLYITSHSMNKEV 240
           LELIVKRKSNDIDQ+HRCDVRHNVPAVFFSTGGYTGNVYHEFNDGILPLYITSH+MNKEV
Sbjct: 181 LELIVKRKSNDIDQQHRCDVRHNVPAVFFSTGGYTGNVYHEFNDGILPLYITSHNMNKEV 240

Query: 241 VFVILEYHKWWLTKYADILSQLSNYPVIDLRKNNKTHCFPQVIAGLRIHDELTVDPSLME 300
           VFVILEYHKWWLTKYADILSQLSNYPVID RKNNKTHCFPQVIAGLRIHDEL+VDPSLME
Sbjct: 241 VFVILEYHKWWLTKYADILSQLSNYPVIDFRKNNKTHCFPQVIAGLRIHDELSVDPSLME 300

Query: 301 GGKSIVDFRNLLDKAYQPRIRELIRQEELEAKISLHRSKRPKLVVLSRKGSSRVITNEKL 360
           GGKSIVDFRNLLD AYQPRIRELIRQEE E KISL+ SKRPKLVVLSRKGSSR ITNEKL
Sbjct: 301 GGKSIVDFRNLLDMAYQPRIRELIRQEEEEGKISLYISKRPKLVVLSRKGSSRAITNEKL 360

Query: 361 MVKMAERMGFEVKVLRPDKTTELAKIYREVNESNVLVGVHGAAMTHSLFMRPNAVFIQII 420
           MVKMAERMGFEVKVLRPDKTTELAKIYRE+NES+VL+GVHGAA+TH+LFMRPNAVFIQII
Sbjct: 361 MVKMAERMGFEVKVLRPDKTTELAKIYREMNESDVLIGVHGAALTHTLFMRPNAVFIQII 420

Query: 421 PLGTVWAAETYYGEPAKKLGLKYIGYEIGAKESSLYSNHNKDDPVLVNPDSITKKGWEYT 480
           PLGTVWAAETYYGEPAKKLGLKYIGYEIGAKESSLYSN+ KDDPVLVNPDSITKKGWEYT
Sbjct: 421 PLGTVWAAETYYGEPAKKLGLKYIGYEIGAKESSLYSNYKKDDPVLVNPDSITKKGWEYT 480

Query: 481 KKIYLDGQNVRLNLGRFEKRLERAYYYCIARARDGRSH 519
           KKIYLD QNVRLNL RFEKRLERAYYYCIARAR GRSH
Sbjct: 481 KKIYLDSQNVRLNLARFEKRLERAYYYCIARARQGRSH 518

BLAST of CsaV3_4G028900 vs. TrEMBL
Match: tr|A0A251N788|A0A251N788_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_7G055700 PE=4 SV=1)

HSP 1 Score: 630.2 bits (1624), Expect = 4.2e-177
Identity = 324/530 (61.13%), Positives = 392/530 (73.96%), Query Frame = 0

Query: 4   QYMRYQPWKKGVSSGKTHFHRVEDEEGGEVGMMGCEEFYYSASAYKKANKPKFLFLLFLS 63
           +Y R+  W+KG           ED+E  +  +M      ++ S Y K   PK L+LLFLS
Sbjct: 3   RYQRHHQWRKGEK-------HTEDDEESQTFLM-----EFANSGYYKRTSPKLLYLLFLS 62

Query: 64  FLSCSIIFAPHFFS--SSFSPFYSFGVQNDDLSVDKEVFAPLCSSIPNGTICCDRSSIRS 123
           FLSCS I APH FS  ++FS  YS  V+N+ L+ + +V  PLCSSI NGTICCDRSSIRS
Sbjct: 63  FLSCSFILAPHLFSTNTTFSLLYSTMVENEGLAAEMDVNIPLCSSIANGTICCDRSSIRS 122

Query: 124 DICIMKGDIRTDSSSSSIFLYTSPDSPIEFDDDHGVIQV------EKIKPYTRKWEKNTM 183
           DIC+MKGD+RT S+SSSIFLY S D     +   GV++       EK+KPYTRKWE + M
Sbjct: 123 DICVMKGDVRTHSASSSIFLYRSKDGNSLKNYVAGVVEETEELEHEKVKPYTRKWETSVM 182

Query: 184 DTIDELELIVKRKSNDIDQKHRCDVRHNVPAVFFSTGGYTGNVYHEFNDGILPLYITSHS 243
           DTIDEL+L+ K+  + +   H+CDV+H+VPAVFFSTGGYTGNVYHEFNDGI+PLYITS  
Sbjct: 183 DTIDELQLVAKK--DTLGMHHQCDVQHDVPAVFFSTGGYTGNVYHEFNDGIMPLYITSQH 242

Query: 244 MNKEVVFVILEYHKWWLTKYADILSQLSNYPVIDLRKNNKTHCFPQVIAGLRIHDELTVD 303
            NK+V+FV+L+YH WW+TKY DILSQLS+YP ID   + +THCFP+V  GLRIHDELTVD
Sbjct: 243 FNKKVIFVMLDYHTWWITKYGDILSQLSDYPPIDFSGDKRTHCFPEVTVGLRIHDELTVD 302

Query: 304 PSLMEGGKSIVDFRNLLDKAYQPRIRELIRQEELEAKISLHRS----------------- 363
            SLMEG   IVDFRNLLD+AY PRIR LI+ EE EA+  L  S                 
Sbjct: 303 SSLMEGNMGIVDFRNLLDRAYWPRIRSLIQDEEREAQEKLSASLTSESPLEIENEVQEDQ 362

Query: 364 -KRPKLVVLSRKGSSRVITNEKLMVKMAERMGFEVKVLRPDKTTELAKIYREVNESNVLV 423
            ++PKLV++SR G SR ITNE L+VKMAE +GFEV VLRPD TTELAKIYR +N S+V++
Sbjct: 363 VRKPKLVIISRNG-SRAITNENLLVKMAEEIGFEVNVLRPDPTTELAKIYRALNASDVMI 422

Query: 424 GVHGAAMTHSLFMRPNAVFIQIIPLGTVWAAETYYGEPAKKLGLKYIGYEIGAKESSLYS 483
           GVHGAAMTH LFM+P +VFIQ++PLGTVWAAE YYGEPA+KLGLKYIGY+I  +ESSLY 
Sbjct: 423 GVHGAAMTHFLFMKPGSVFIQVVPLGTVWAAEEYYGEPARKLGLKYIGYQILTRESSLYD 482

Query: 484 NHNKDDPVLVNPDSITKKGWEYTKKIYLDGQNVRLNLGRFEKRLERAYYY 508
            ++KDDPVL +P S+T  GWEYTKKIYLDGQ VRL+LGRF KRL RAYY+
Sbjct: 483 KYDKDDPVLRDPKSVTTMGWEYTKKIYLDGQTVRLDLGRFRKRLVRAYYH 517

BLAST of CsaV3_4G028900 vs. TrEMBL
Match: tr|A0A2R6R5A6|A0A2R6R5A6_ACTCH (EGF domain-specific O-linked N-acetylglucosamine transferase OS=Actinidia chinensis var. chinensis OX=1590841 GN=CEY00_Acc10215 PE=4 SV=1)

HSP 1 Score: 620.9 bits (1600), Expect = 2.5e-174
Identity = 330/548 (60.22%), Positives = 399/548 (72.81%), Query Frame = 0

Query: 1   MIHQYMRYQPWKKGVSSGKTHFHRVEDEEGGEVGMMGCEEFYYSASAYKKANKPKFLFLL 60
           M+H Y RY P KKG   G   F   ++E    VG++      +  + Y K  +P+ L LL
Sbjct: 1   MVH-YQRYHPLKKG---GVDVFAEGDEESQSIVGLI------WGNTGYNKRTRPRLLSLL 60

Query: 61  FLSFLSCSIIFAPHFFS--SSFSPFYSFGVQNDDLSVDKEVFAPLCSSIPNGTICCDRSS 120
           FLS +SCS+I AP FFS  SSFS  YSFG  ++ L  D +V  PLCSSI NGTICCDRSS
Sbjct: 61  FLSLISCSLILAPQFFSCPSSFSLLYSFGHGDESLIADLDVNKPLCSSISNGTICCDRSS 120

Query: 121 IRSDICIMKGDIRTDSSSSSIFLYTS--PDSPIEF------DDDHGVIQVEKIKPYTRKW 180
           IRSDICIMKGD+RT S SSS+FLY+S  P+  I++      DD    +Q EK++PYTRKW
Sbjct: 121 IRSDICIMKGDVRTHSLSSSVFLYSSTHPNDLIDYASVVAEDDKEEELQHEKVRPYTRKW 180

Query: 181 EKNTMDTIDELELIVKRKSNDIDQKHRCDVRHNVPAVFFSTGGYTGNVYHEFNDGILPLY 240
           E  TM TIDEL LI KR+++    +H CDVRH+VPAVFFSTGGYTGN+YHEFNDGILPLY
Sbjct: 181 EAGTMATIDELNLISKRETS--RPQHACDVRHDVPAVFFSTGGYTGNIYHEFNDGILPLY 240

Query: 241 ITSHSMNKEVVFVILEYHKWWLTKYADILSQLSNYPVIDLRKNNKTHCFPQVIAGLRIHD 300
           ITS  +NK VVFVILEYH WW  KY D+LS LS+YP ID   +N+THCFP+ I G+RIHD
Sbjct: 241 ITSQHLNKRVVFVILEYHDWWFMKYGDVLSHLSDYPPIDFSNDNRTHCFPEAIVGMRIHD 300

Query: 301 ELTVDPSLMEGGKSIVDFRNLLDKAYQPRIRELIRQEELE-----------AKISLHRS- 360
           ELTVDPSLMEGGK+I DFR LLD+AY PRI  LI  EE E           ++ S H S 
Sbjct: 301 ELTVDPSLMEGGKTIRDFRTLLDRAYWPRISSLIEGEEREKAQLNSQENPSSQPSSHTSE 360

Query: 361 --------KRPKLVVLSRKGSSRVITNEKLMVKMAERMGFEVKVLRPDKTTELAKIYREV 420
                   K+PKL +LSR G SR ITNE  +V+MAE++GFEV+VLRP KT+ELA+IYR +
Sbjct: 361 EVREHWELKKPKLTILSRNG-SRAITNEDFLVQMAEKIGFEVEVLRPSKTSELAQIYRSL 420

Query: 421 NESNVLVGVHGAAMTHSLFMRPNAVFIQIIPLGTVWAAETYYGEPAKKLGLKYIGYEIGA 480
           N S+V+VGVHGAAMTH LF+RP +VFIQ++PLGT WAAETYYG+PAKKLGLKYI Y+I  
Sbjct: 421 NSSDVMVGVHGAAMTHFLFLRPGSVFIQVVPLGTEWAAETYYGDPAKKLGLKYISYKILP 480

Query: 481 KESSLYSNHNKDDPVLVNPDSITKKGWEYTKKIYLDGQNVRLNLGRFEKRLERAYYYCIA 519
           +ESSLY  ++K DPVL +P S++ KGWE+TKKIYLD QNVRLNL RF KRL RAY Y  A
Sbjct: 481 RESSLYDEYDKSDPVLTDPSSVSDKGWEFTKKIYLDRQNVRLNLARFRKRLVRAYDYSFA 534

BLAST of CsaV3_4G028900 vs. TrEMBL
Match: tr|A0A2C9VEW2|A0A2C9VEW2_MANES (Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_08G052900 PE=4 SV=1)

HSP 1 Score: 620.5 bits (1599), Expect = 3.3e-174
Identity = 325/524 (62.02%), Positives = 391/524 (74.62%), Query Frame = 0

Query: 1   MIHQYMRYQPWKKGVSSGKTHFHRVEDEEGGEVGMMGCEEFYYSASAYKKANKPKFLFLL 60
           M+H Y RY  W+KG           E++EG  +  +         S + K  +PK + LL
Sbjct: 1   MVH-YHRYNQWRKG--------EHAEEDEGLALVCIN--------SGFYKRKRPKLVSLL 60

Query: 61  FLSFLSCSIIFAPHFF--SSSFSPFYSFGVQNDD-LSVDKEVFAPLCSSIPNGTICCDRS 120
            LS LSCS+I  PH F  SS+FS  YSF  + DD + VD +V APLCSSI NGTICCDRS
Sbjct: 61  LLSLLSCSLILLPHLFCSSSAFSLLYSFVAETDDRVIVDMDVNAPLCSSISNGTICCDRS 120

Query: 121 SIRSDICIMKGDIRTDSSSSSIFLYTSPDSPIEFDDDHGVIQVEKIKPYTRKWEKNTMDT 180
           S RSDICIMKGD+RT S+SSSIFLYTS +S     +D      EKIKPYTRKWE + MDT
Sbjct: 121 SFRSDICIMKGDVRTQSASSSIFLYTSSNSSKSIKEDE-EFHHEKIKPYTRKWETSVMDT 180

Query: 181 IDELELIVKRKSNDIDQKHRCDVRHNVPAVFFSTGGYTGNVYHEFNDGILPLYITSHSMN 240
           ID+L+LI K + +     H+CDV H+VPAV FSTGGYTGNVYHEFNDGILPLYITS  +N
Sbjct: 181 IDQLDLISKHEKS--ATHHQCDVTHHVPAVVFSTGGYTGNVYHEFNDGILPLYITSQHLN 240

Query: 241 KEVVFVILEYHKWWLTKYADILSQLSNYPVIDLRKNNKTHCFPQVIAGLRIHDELTVDPS 300
           K+VVFV+LEYH WW+ KY DILS+LS+YP ID   + + HCFP+VI GLRIH+ELT+DPS
Sbjct: 241 KKVVFVMLEYHTWWIMKYGDILSRLSDYPAIDFSGDKRNHCFPEVIVGLRIHNELTIDPS 300

Query: 301 LMEGGKSIVDFRNLLDKAYQPRIRELIRQEELEA-------------KISLHRSKRPKLV 360
           LM+  KSIVDFRNLLDKAY PRIR LI++EELEA              +   + K+PKLV
Sbjct: 301 LMQENKSIVDFRNLLDKAYWPRIRGLIQKEELEALSPSSGTLLEFRKDVQEAKMKKPKLV 360

Query: 361 VLSRKGSSRVITNEKLMVKMAERMGFEVKVLRPDKTTELAKIYREVNESNVLVGVHGAAM 420
           +LSR  +SR ITNE L+VKMA R+GF V+VLRPD+TTELAKIYR +N S V++GVHGAAM
Sbjct: 361 ILSR-NASRAITNEDLLVKMAVRIGFRVEVLRPDRTTELAKIYRSLNSSEVMIGVHGAAM 420

Query: 421 THSLFMRPNAVFIQIIPLGTVWAAETYYGEPAKKLGLKYIGYEIGAKESSLYSNHNKDDP 480
           TH LFM+P +VFIQ+IPLGT WAAETYYG+PAKKLGLKYIGY+I  +ESSL+  ++K+DP
Sbjct: 421 THFLFMKPGSVFIQVIPLGTEWAAETYYGDPAKKLGLKYIGYQIMPRESSLFEKYDKNDP 480

Query: 481 VLVNPDSITKKGWEYTKKIYLDGQNVRLNLGRFEKRLERAYYYC 509
           VL +P SI++KGWEYTKKIYLD QNVRLNL RF+KRL RAY +C
Sbjct: 481 VLQDPRSISEKGWEYTKKIYLDSQNVRLNLARFQKRLVRAYQHC 503

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_004151727.2  1.5e-303  100.00    PREDICTED: EGF domain-specific O-linked N-acetylglucosamine transferase-like [Cu...
XP_008455788.2  2.0e-284  93.24     PREDICTED: LOW QUALITY PROTEIN: EGF domain-specific O-linked N-acetylglucosamine...
XP_022967775.1  7.1e-229  78.29     EGF domain-specific O-linked N-acetylglucosamine transferase-like [Cucurbita max...
XP_022928604.1  1.3e-225  78.42     EGF domain-specific O-linked N-acetylglucosamine transferase-like [Cucurbita mos...
XP_023544659.1  1.1e-224  77.14     EGF domain-specific O-linked N-acetylglucosamine transferase-like isoform X1 [Cu...
Match Name   E-value   Identity  Description
AT2G41640.1  1.6e-158  60.65     Glycosyltransferase family 61 protein
AT3G57380.1  1.6e-153  55.73     Glycosyltransferase family 61 protein
AT3G10320.1  1.2e-142  58.19     Glycosyltransferase family 61 protein
AT2G03370.1  4.3e-79   38.61     Glycosyltransferase family 61 protein
AT2G03360.2  3.8e-75   36.17     Glycosyltransferase family 61 protein
Match Name             E-value  Identity  Description
sp|Q5NDL3|EOGT_CHICK   1.9e-07  24.42     EGF domain-specific O-linked N-acetylglucosamine transferase OS=Gallus gallus OX...
sp|A0JND3|EOGT_BOVIN   4.2e-07  25.00     EGF domain-specific O-linked N-acetylglucosamine transferase OS=Bos taurus OX=99...
sp|Q6GQ23|EOGT_XENLA   4.0e-05  23.89     EGF domain-specific O-linked N-acetylglucosamine transferase OS=Xenopus laevis O...
sp|Q08CY9|EOGT_XENTR   5.2e-05  23.00     EGF domain-specific O-linked N-acetylglucosamine transferase OS=Xenopus tropical...
sp|Q5NDE4|PMGT2_TAKRU  1.5e-04  25.00     Protein O-linked-mannose beta-1,4-N-acetylglucosaminyltransferase 2 OS=Takifugu ...
Match Name                      E-value   Identity  Description
tr|A0A0A0L217|A0A0A0L217_CUCSA  9.9e-304  100.00    Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G411390 PE=4 SV=1
tr|A0A1S3C1N9|A0A1S3C1N9_CUCME  1.3e-284  93.24     LOW QUALITY PROTEIN: EGF domain-specific O-linked N-acetylglucosamine transferas...
tr|A0A251N788|A0A251N788_PRUPE  4.2e-177  61.13     Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_7G055700 PE=4 SV=1
tr|A0A2R6R5A6|A0A2R6R5A6_ACTCH  2.5e-174  60.22     EGF domain-specific O-linked N-acetylglucosamine transferase OS=Actinidia chinen...
tr|A0A2C9VEW2|A0A2C9VEW2_MANES  3.3e-174  62.02     Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_08G052900 PE=4 SV=...
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0016757  transferase activity, transferring glycosyl groups
Vocabulary: INTERPRO
Term       Definition
IPR007657  Glycosyltransferase_61
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0016757 transferase activity, transferring glycosyl groups

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CsaV3_4G028900.1  CsaV3_4G028900.1  mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR Term   IPR Description         Source   Source Term     Source Description   Alignment
IPR007657  Glycosyltransferase 61  PFAM     PF04577         DUF563               coord: 217..446, e-value: 1.4E-21, score: 77.5
IPR007657  Glycosyltransferase 61  PANTHER  PTHR20961       GLYCOSYLTRANSFERASE  coord: 45..513
None       No IPR available        PANTHER  PTHR20961:SF36  GLYCOSYLTRANSFERASE  coord: 45..513

The following gene(s) are orthologous to this gene:
Gene            Orthologue      Organism                   Block
CsaV3_4G028900  CSPI04G18530    Wild cucumber (PI 183967)  cpicucB212
CsaV3_4G028900  Cucsa.268990    Cucumber (Gy14) v1         cgycucB411
CsaV3_4G028900  CsGy4G017610    Cucumber (Gy14) v2         cgybcucB171
CsaV3_4G028900  MELO3C019188    Melon (DHL92) v3.5.1       cucmeB337
CsaV3_4G028900  MELO3C019188.2  Melon (DHL92) v3.6.1       cucmedB330
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene            Organism                      Block
CsaV3_4G028900  Bottle gourd (USVL1VR-Ls)     cuclsiB273
CsaV3_4G028900  Watermelon (Charleston Gray)  cucwcgB314
CsaV3_4G028900  Watermelon (97103) v1         cucwmB352
CsaV3_4G028900  Watermelon (97103) v2         cucwmbB301