Cp4.1LG06g04410 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG06g04410
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Exostosin-like 3
Location: Cp4.1LG06 : 2327328 .. 2328320 (-)
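
As a quick consistency check on the location above, the feature length can be derived from the two coordinates. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (the usual GFF-style convention; the page itself does not state this), and the helper name `feature_length` is illustrative only.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1

# Coordinates taken from the Location field; the (-) strand affects
# orientation only, not length.
length = feature_length(2327328, 2328320)
print(length)       # 993
print(length // 3)  # 331 codon-sized triplets
```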
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAACTTGAAGTCACTAACGATCTTCATCCTCCTCTCTTTCTCCACCGCCGTGTGCTCTCTGCTGAAAGCAACCGCCGCCGCGTGCGACGCCGAATCCCTGCCGGACCGACGGAATCTACGACCGGATCAAATCACGGTCCTCATCAACGGCTATTACGAATCCCGCATCCCTCTACTCCAATCAATAGCCGCCACGTATGCCGCATCCCCAATCGTACACGCCGTACTGATCCTTTGGGGAAATCCCTCAACCTCGTCAAAAACCCTAACGCAATTGGCTCAAAACCTCACCACTGGACCCATTTCCGTAATCCGCCAATCTTCAAACAGCCTCAATTCTCGTTTCCGACCCCATAAGTCCATCCAAACCGGCGCCGTTTTGATCTGCGACGACGACGTCGAGATCGACATATCGTCCCTGGAGTTCGCGTTCAGAGTTTGGGGGAGAAACCCCGAACGACTTGTCGGCTTCTTCGTCCGGTCCCACGATCTAGACCTCTCGAGACGAGAATGGATCTACACCGTTCATCAAGACAAATACTCGATCGTGCTCACGAAGTTGATGATTCTGAAGATGGAGTATTTGTTCGAGTATACTTGCGGAGGCGGAGCGGCGATGGCCGACATGAGAAGAGTTGTCGATCGGGAGCGCAATTGCGAGGATATCTTGATGAATTTCATGGTGGCCGACATGTCGAACGCTGGGCCTATTCTGGTTGCGGCGCAGAGGATCAGAGACTGGGGGGACCCACGGAATGATTACGATGAGAATGAACGGCTGGGATTGGGGGAGGGGGTTAGTGAGGTGGGGCTAAGTAATAGGAAAGGGGAGCATAGGAAGAGAAGGGGACGGTGTATAACGGAGTTTCATAGACGGCTGGGGAGGATGCCGTTACGGTATAGTTACGGGAAGTTAGTTAATTCAGTTGGCGAGCAGGCTTTGTGTAGAAAAGGAGGGAAGTTGGTCCCTTGCGATCATAACGTACTGTGA

mRNA sequence

ATGAACTTGAAGTCACTAACGATCTTCATCCTCCTCTCTTTCTCCACCGCCGTGTGCTCTCTGCTGAAAGCAACCGCCGCCGCGTGCGACGCCGAATCCCTGCCGGACCGACGGAATCTACGACCGGATCAAATCACGGTCCTCATCAACGGCTATTACGAATCCCGCATCCCTCTACTCCAATCAATAGCCGCCACGTATGCCGCATCCCCAATCGTACACGCCGTACTGATCCTTTGGGGAAATCCCTCAACCTCGTCAAAAACCCTAACGCAATTGGCTCAAAACCTCACCACTGGACCCATTTCCGTAATCCGCCAATCTTCAAACAGCCTCAATTCTCGTTTCCGACCCCATAAGTCCATCCAAACCGGCGCCGTTTTGATCTGCGACGACGACGTCGAGATCGACATATCGTCCCTGGAGTTCGCGTTCAGAGTTTGGGGGAGAAACCCCGAACGACTTGTCGGCTTCTTCGTCCGGTCCCACGATCTAGACCTCTCGAGACGAGAATGGATCTACACCGTTCATCAAGACAAATACTCGATCGTGCTCACGAAGTTGATGATTCTGAAGATGGAGTATTTGTTCGAGTATACTTGCGGAGGCGGAGCGGCGATGGCCGACATGAGAAGAGTTGTCGATCGGGAGCGCAATTGCGAGGATATCTTGATGAATTTCATGGTGGCCGACATGTCGAACGCTGGGCCTATTCTGGTTGCGGCGCAGAGGATCAGAGACTGGGGGGACCCACGGAATGATTACGATGAGAATGAACGGCTGGGATTGGGGGAGGGGGTTAGTGAGGTGGGGCTAAGTAATAGGAAAGGGGAGCATAGGAAGAGAAGGGGACGGTGTATAACGGAGTTTCATAGACGGCTGGGGAGGATGCCGTTACGGTATAGTTACGGGAAGTTAGTTAATTCAGTTGGCGAGCAGGCTTTGTGTAGAAAAGGAGGGAAGTTGGTCCCTTGCGATCATAACGTACTGTGA

Coding sequence (CDS)

ATGAACTTGAAGTCACTAACGATCTTCATCCTCCTCTCTTTCTCCACCGCCGTGTGCTCTCTGCTGAAAGCAACCGCCGCCGCGTGCGACGCCGAATCCCTGCCGGACCGACGGAATCTACGACCGGATCAAATCACGGTCCTCATCAACGGCTATTACGAATCCCGCATCCCTCTACTCCAATCAATAGCCGCCACGTATGCCGCATCCCCAATCGTACACGCCGTACTGATCCTTTGGGGAAATCCCTCAACCTCGTCAAAAACCCTAACGCAATTGGCTCAAAACCTCACCACTGGACCCATTTCCGTAATCCGCCAATCTTCAAACAGCCTCAATTCTCGTTTCCGACCCCATAAGTCCATCCAAACCGGCGCCGTTTTGATCTGCGACGACGACGTCGAGATCGACATATCGTCCCTGGAGTTCGCGTTCAGAGTTTGGGGGAGAAACCCCGAACGACTTGTCGGCTTCTTCGTCCGGTCCCACGATCTAGACCTCTCGAGACGAGAATGGATCTACACCGTTCATCAAGACAAATACTCGATCGTGCTCACGAAGTTGATGATTCTGAAGATGGAGTATTTGTTCGAGTATACTTGCGGAGGCGGAGCGGCGATGGCCGACATGAGAAGAGTTGTCGATCGGGAGCGCAATTGCGAGGATATCTTGATGAATTTCATGGTGGCCGACATGTCGAACGCTGGGCCTATTCTGGTTGCGGCGCAGAGGATCAGAGACTGGGGGGACCCACGGAATGATTACGATGAGAATGAACGGCTGGGATTGGGGGAGGGGGTTAGTGAGGTGGGGCTAAGTAATAGGAAAGGGGAGCATAGGAAGAGAAGGGGACGGTGTATAACGGAGTTTCATAGACGGCTGGGGAGGATGCCGTTACGGTATAGTTACGGGAAGTTAGTTAATTCAGTTGGCGAGCAGGCTTTGTGTAGAAAAGGAGGGAAGTTGGTCCCTTGCGATCATAACGTACTGTGA

Protein sequence

MNLKSLTIFILLSFSTAVCSLLKATAAACDAESLPDRRNLRPDQITVLINGYYESRIPLLQSIAATYAASPIVHAVLILWGNPSTSSKTLTQLAQNLTTGPISVIRQSSNSLNSRFRPHKSIQTGAVLICDDDVEIDISSLEFAFRVWGRNPERLVGFFVRSHDLDLSRREWIYTVHQDKYSIVLTKLMILKMEYLFEYTCGGGAAMADMRRVVDRERNCEDILMNFMVADMSNAGPILVAAQRIRDWGDPRNDYDENERLGLGEGVSEVGLSNRKGEHRKRRGRCITEFHRRLGRMPLRYSYGKLVNSVGEQALCRKGGKLVPCDHNVL
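
The protein sequence corresponds to a codon-by-codon translation of the CDS. The sketch below illustrates that relationship using the standard genetic code; it is not the pipeline used to produce this record, and `translate` is a hypothetical helper. Only the first and last few codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First nine codons of the CDS above.
print(translate("ATGAACTTGAAGTCACTAACGATCTTC"))  # MNLKSLTIF
# Last four codons of the CDS above (ends with the TGA stop).
print(translate("AACGTACTGTGA"))                 # NVL
```

Both outputs agree with the start (MNLKSLTIF...) and end (...NVL) of the protein sequence listed above.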
BLAST of Cp4.1LG06g04410 vs. Swiss-Prot
Match: GT643_ARATH (Glycosyltransferase family protein 64 C3 OS=Arabidopsis thaliana GN=At1g80290 PE=2 SV=1)

HSP 1 Score: 387.1 bits (993), Expect = 1.9e-106
Identity = 200/336 (59.52%), Positives = 242/336 (72.02%), Query Frame = 1

Query: 1   MNLKSLTI---FILLSFSTAVCSLLKATAAACDAESLPDRRNLRPDQITVLINGYYESRI 60
           M +KS+     F+ +      C  L      CDA +  + + LR DQITVLINGY E RI
Sbjct: 1   MGVKSVRFSIWFLFVVTDLVFCRTLSGDPDPCDATNQREFQKLRSDQITVLINGYSEYRI 60

Query: 61  PLLQSIAATYAASPIVHAVLILWGNPSTSSKTLTQLAQNLT-----TGPISVIRQSSNSL 120
           PLLQ+I A+Y++S IV ++L+LWGNPST  + L QL QNLT     +  IS+I+QSS+SL
Sbjct: 61  PLLQTIVASYSSSSIVSSILVLWGNPSTPDQLLDQLYQNLTQYSPGSASISLIQQSSSSL 120

Query: 121 NSRFRPHKSIQTGAVLICDDDVEIDISSLEFAFRVWGRNPERLVGFFVRSHDLDLSRREW 180
           N+RF P  S+ T AVLICDDDVEID  SLEFAF VW  NP+RLVG FVRSH  DL  +EW
Sbjct: 121 NARFLPRSSVDTRAVLICDDDVEIDQRSLEFAFSVWKSNPDRLVGTFVRSHGFDLQGKEW 180

Query: 181 IYTVHQDKYSIVLTKLMILKMEYLFEYTCGGGAAMADMRRVVDRERNCEDILMNFMVADM 240
           IYTVH DKYSIVLTK M++K +YLFEY+C GG  M +MR +VD+ RNCEDILMNF+ AD 
Sbjct: 181 IYTVHPDKYSIVLTKFMMMKQDYLFEYSCKGGVEMEEMRMIVDQMRNCEDILMNFVAADR 240

Query: 241 SNAGPILVAAQRIRDWGDPRNDYDENERLGLGEGVSEVGLSNRKGEHRKRRGRCITEFHR 300
             AGPI+V A+R+RDWGD RN+  E       E V +VGLS+R+ EHRKRRG CI EFHR
Sbjct: 241 LRAGPIMVGAERVRDWGDARNEEVE-------ERVRDVGLSSRRVEHRKRRGNCIREFHR 300

Query: 301 RLGRMPLRYSYGKLVNSVGEQALCRKGGKLVPCDHN 329
            +G+MPL YSYGK+VNSVGEQ LCRK GKLV CD +
Sbjct: 301 VMGKMPLMYSYGKVVNSVGEQGLCRKAGKLVFCDRD 329
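
The percentages in the HSP header are ratios over the alignment length, which counts every alignment column including gaps, not just the query residues. A small sketch reproducing the figures for the GT643_ARATH hit above; the function name `percent` is illustrative.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Share of alignment columns that match, as a percentage (2 decimals)."""
    return round(100.0 * matches / aligned_length, 2)

# Counts taken from the GT643_ARATH HSP header.
print(percent(200, 336))  # 59.52 (identical residues)
print(percent(242, 336))  # 72.02 (identical or conservatively substituted)
```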

BLAST of Cp4.1LG06g04410 vs. Swiss-Prot
Match: EXTL3_MOUSE (Exostosin-like 3 OS=Mus musculus GN=Extl3 PE=1 SV=2)

HSP 1 Score: 109.4 bits (272), Expect = 7.7e-23
Identity = 81/266 (30.45%), Positives = 124/266 (46.62%), Query Frame = 1

Query: 39  NLRPDQITVLINGYYESRIPLLQSIAATYAASPIVHAVLILWGNPSTSSKTLTQLAQNLT 98
           N++ +Q TV++  Y   R  +L +        P ++ V+++W +P   S+ L      + 
Sbjct: 656 NVQREQFTVVMLTY--EREEVLMNSLERLNGLPYLNKVVVVWNSPKLPSEDLLWPDIGV- 715

Query: 99  TGPISVIRQSSNSLNSRFRPHKSIQTGAVLICDDDVEIDISSLEFAFRVWGRNPERLVGF 158
             PI V+R   NSLN+RF P   I+T A+L  DDD  +    + F FRVW    +R+VGF
Sbjct: 716 --PIMVVRTEKNSLNNRFLPWNEIETEAILSIDDDAHLRHDEIMFGFRVWREARDRIVGF 775

Query: 159 FVRSHDLDLSRREWIYTVHQD-KYSIVLTKLMILKMEYLFEYTCGGGAAMADMRRVVDRE 218
             R H  D+  + W+Y  +   + S+VLT        Y + Y+     A+ DM   VD  
Sbjct: 776 PGRYHAWDIPHQSWLYNSNYSCELSMVLTGAAFFHKYYAYLYSYVMPQAIRDM---VDEY 835

Query: 219 RNCEDILMNFMVADMSNAGPILVAAQ-RIRDWGDPRNDYDENERLGLGEGVSEVGLSNRK 278
            NCEDI MNF+V+ ++   PI V ++   R  G P+                     +  
Sbjct: 836 INCEDIAMNFLVSHITRKPPIKVTSRWTFRCPGCPQ-------------------ALSHD 894

Query: 279 GEHRKRRGRCITEFHRRLGRMPLRYS 303
             H   R +CI  F +  G MPL Y+
Sbjct: 896 DSHFHERHKCINFFVKVYGYMPLLYT 894

BLAST of Cp4.1LG06g04410 vs. Swiss-Prot
Match: EXTL3_HUMAN (Exostosin-like 3 OS=Homo sapiens GN=EXTL3 PE=1 SV=1)

HSP 1 Score: 108.2 bits (269), Expect = 1.7e-22
Identity = 81/266 (30.45%), Positives = 123/266 (46.24%), Query Frame = 1

Query: 39  NLRPDQITVLINGYYESRIPLLQSIAATYAASPIVHAVLILWGNPSTSSKTLTQLAQNLT 98
           N+  +Q TV++  Y   R  +L +        P ++ V+++W +P   S+ L      + 
Sbjct: 657 NVPREQFTVVMLTY--EREEVLMNSLERLNGLPYLNKVVVVWNSPKLPSEDLLWPDIGV- 716

Query: 99  TGPISVIRQSSNSLNSRFRPHKSIQTGAVLICDDDVEIDISSLEFAFRVWGRNPERLVGF 158
             PI V+R   NSLN+RF P   I+T A+L  DDD  +    + F FRVW    +R+VGF
Sbjct: 717 --PIMVVRTEKNSLNNRFLPWNEIETEAILSIDDDAHLRHDEIMFGFRVWREARDRIVGF 776

Query: 159 FVRSHDLDLSRREWIYTVHQD-KYSIVLTKLMILKMEYLFEYTCGGGAAMADMRRVVDRE 218
             R H  D+  + W+Y  +   + S+VLT        Y + Y+     A+ DM   VD  
Sbjct: 777 PGRYHAWDIPHQSWLYNSNYSCELSMVLTGAAFFHKYYAYLYSYVMPQAIRDM---VDEY 836

Query: 219 RNCEDILMNFMVADMSNAGPILVAAQ-RIRDWGDPRNDYDENERLGLGEGVSEVGLSNRK 278
            NCEDI MNF+V+ ++   PI V ++   R  G P+                     +  
Sbjct: 837 INCEDIAMNFLVSHITRKPPIKVTSRWTFRCPGCPQ-------------------ALSHD 895

Query: 279 GEHRKRRGRCITEFHRRLGRMPLRYS 303
             H   R +CI  F +  G MPL Y+
Sbjct: 897 DSHFHERHKCINFFVKVYGYMPLLYT 895

BLAST of Cp4.1LG06g04410 vs. Swiss-Prot
Match: GT644_ARATH (Glycosyltransferase family 64 protein C4 OS=Arabidopsis thaliana GN=EPC1 PE=2 SV=1)

HSP 1 Score: 107.5 bits (267), Expect = 2.9e-22
Identity = 81/280 (28.93%), Positives = 132/280 (47.14%), Query Frame = 1

Query: 46  TVLINGYYESRIPLLQSIAATYAASPIVHAVLILWGNPSTSSKTLTQLAQNLTT-----G 105
           T+L+N +   R  LL+   + YA+   + ++ I+W  P+  S++L +   N+       G
Sbjct: 75  TLLMNTW--KRYDLLKKSVSHYASCSRLDSIHIVWSEPNPPSESLKEYLHNVLKKKTRDG 134

Query: 106 PISVIR---QSSNSLNSRFRPHKSIQTGAVLICDDDVEIDISSLEFAFRVWGRNPERLVG 165
               +R      +SLN+RF+  K ++T AV   DDD+     +++FAF VW   P+ +VG
Sbjct: 135 HEVELRFDINKEDSLNNRFKEIKDLKTDAVFSIDDDIIFPCHTVDFAFNVWESAPDTMVG 194

Query: 166 FFVRSHDLDLSRRE--------WIYTVHQDKYSIVLTKLMILKMEYLFEYTCGGGAAMAD 225
           F  R H  + S  +        W        YS+VL+K      +YL  YT    +  A 
Sbjct: 195 FVPRVHWPEKSNDKANYYTYSGWWSVWWSGTYSMVLSKAAFFHKKYLSLYT---NSMPAS 254

Query: 226 MRRVVDRERNCEDILMNFMVADMSNAGPILVAAQRIRDWGDPRNDYDENERLGLGEGVSE 285
           +R    + RNCEDI M+F++A+ +NA P +    +I + G                G+S 
Sbjct: 255 IREFTTKNRNCEDIAMSFLIANATNA-PAIWVKGKIYEIG--------------STGISS 314

Query: 286 VGLSNRKGEHRKRRGRCITEFHRRLGRMPLRYSYGKLVNS 310
           +      G H ++R  C+  F    G+MPL Y+  K V+S
Sbjct: 315 I------GGHTEKRTHCVNRFVAEFGKMPLVYTSMKAVDS 328

BLAST of Cp4.1LG06g04410 vs. Swiss-Prot
Match: EXT3_DROME (Exostosin-3 OS=Drosophila melanogaster GN=botv PE=1 SV=1)

HSP 1 Score: 105.5 bits (262), Expect = 1.1e-21
Identity = 85/265 (32.08%), Positives = 124/265 (46.79%), Query Frame = 1

Query: 39  NLRPDQITVLINGYYESRIPLLQSIAATYAASPIVHAVLILWGNPSTSSKTLTQLAQNLT 98
           N   +Q T+++  Y   ++ L+ S+   Y   P +H V+++W +P      L  L     
Sbjct: 708 NYPREQFTIVMLTYEREQV-LMDSLGRLYGL-PYLHKVVVVWNSPKPP---LDDLRWPDI 767

Query: 99  TGPISVIRQSSNSLNSRFRPHKSIQTGAVLICDDDVEIDISSLEFAFRVWGRNPERLVGF 158
             P++V+R   NSLN+RF P   I+T AVL  DDD  +    + F FRVW  + +R+VGF
Sbjct: 768 GVPVAVLRAPRNSLNNRFLPFDVIETEAVLSVDDDAHLRHDEILFGFRVWREHRDRVVGF 827

Query: 159 FVRSHDLDLS--RREWIYTVHQD-KYSIVLTKLMILKMEYLFEYTCGGGAAMADMRRVVD 218
             R H  DL     +W Y  +   + S+VLT    +   YL+ YT     A+ D    VD
Sbjct: 828 PGRYHAWDLGNPNGQWHYNSNYSCELSMVLTGAAFVHKYYLYLYTYHLPQAIRDK---VD 887

Query: 219 RERNCEDILMNFMVADMSNAGPILVAAQ-RIRDWGDPRNDYDENERLGLGEGVSEVGLSN 278
              NCEDI MNF+V+ ++   P+ V ++   R  G P                  V LS 
Sbjct: 888 EYMNCEDIAMNFLVSHITRKPPVKVTSRWTFRCPGCP------------------VSLS- 945

Query: 279 RKGEHRKRRGRCITEFHRRLGRMPL 300
               H + R +CI  F R  G  PL
Sbjct: 948 EDDTHFQERHKCINFFSRVFGYTPL 945

BLAST of Cp4.1LG06g04410 vs. TrEMBL
Match: A0A0A0LDW4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G883010 PE=4 SV=1)

HSP 1 Score: 574.3 bits (1479), Expect = 9.4e-161
Identity = 291/332 (87.65%), Positives = 307/332 (92.47%), Query Frame = 1

Query: 1   MNLKSLTIFILL-SFSTAVCSLLKATAAACDAESLPDRRNLRPDQITVLINGYYESRIPL 60
           MN +SLTI +LL SFSTAV SL K TAAAC AESLPDRRNLR DQITVLINGYYESRIPL
Sbjct: 1   MNFESLTILVLLLSFSTAVYSLHKETAAACAAESLPDRRNLRSDQITVLINGYYESRIPL 60

Query: 61  LQSIAATYAASPIVHAVLILWGNPSTSSKTLTQLAQNLTTGPISVIRQSSNSLNSRFRPH 120
           LQS+AA YAASP VH VLILWGNPSTS++TLT+LAQNLTTGPIS+IRQSSNSLNSRF P 
Sbjct: 61  LQSLAARYAASPFVHTVLILWGNPSTSTETLTKLAQNLTTGPISLIRQSSNSLNSRFLPR 120

Query: 121 KSIQTGAVLICDDDVEIDISSLEFAFRVWGRNPERLVGFFVRSHDLDLSRREWIYTVHQD 180
           KSIQT AVLICDDDVEID  SLEFAFR+WGRNPERLVGFFVRSHDLDLSRREWIYT+HQD
Sbjct: 121 KSIQTFAVLICDDDVEIDTPSLEFAFRIWGRNPERLVGFFVRSHDLDLSRREWIYTIHQD 180

Query: 181 KYSIVLTKLMILKMEYLFEYTCGGGAAMADMRRVVDRERNCEDILMNFMVADMSNAGPIL 240
           KYSIVLTKLMILK EYLFEY+CGGGAAMADMRRVVD ERNCEDILMNF+VADMSNAGPI+
Sbjct: 181 KYSIVLTKLMILKAEYLFEYSCGGGAAMADMRRVVDVERNCEDILMNFVVADMSNAGPIM 240

Query: 241 VAAQRIRDWGDPRNDYDE-NERLGLGEGVSEVGLSNRKGEHRKRRGRCITEFHRRLGRMP 300
           VAAQRIRDWGDPRN+YD+ NERL L EGVSE+GLSNRKGEHRKRRG CITEFHRRLGRMP
Sbjct: 241 VAAQRIRDWGDPRNEYDDGNERLRLREGVSEIGLSNRKGEHRKRRGGCITEFHRRLGRMP 300

Query: 301 LRYSYGKLVNSVGEQALCRKGGKLVPCDHNVL 331
           LRYSYGK VNS+GEQALCRKG KLVPCD NVL
Sbjct: 301 LRYSYGKSVNSIGEQALCRKGRKLVPCDQNVL 332

BLAST of Cp4.1LG06g04410 vs. TrEMBL
Match: W9S9Z8_9ROSA (Exostosin-like 3 OS=Morus notabilis GN=L484_019884 PE=4 SV=1)

HSP 1 Score: 423.3 bits (1087), Expect = 2.7e-115
Identity = 216/327 (66.06%), Positives = 252/327 (77.06%), Query Frame = 1

Query: 9   FILLSFSTAVCSLLKATAAACDAESLPDRRNLRPDQITVLINGYYESRIPLLQSIAATYA 68
           F  + F     SL   T+  CD+ +L D R LR DQITVLINGY ESRIPLLQS+AATY+
Sbjct: 14  FFFVFFFKPSLSLRTLTSDPCDSTALRDPRTLRSDQITVLINGYSESRIPLLQSLAATYS 73

Query: 69  ASPIVHAVLILWGNPSTSSKTLTQLAQNLT-----TGPISVIRQSSNSLNSRFRPHKSIQ 128
           ASP+V AV ILWGNPST S+TL + + NLT     + PIS+IRQSS SLN+RF P  SI 
Sbjct: 74  ASPLVSAVFILWGNPSTPSQTLAEFSHNLTAFSFGSAPISLIRQSSPSLNARFLPRPSIA 133

Query: 129 TGAVLICDDDVEIDISSLEFAFRVWGRNPERLVGFFVRSHDLDLSRREWIYTVHQDKYSI 188
           T AVLICDDDVEID  S  FAFRVW  NP+RL+GFF RSHD+DL+R++WIYT+H DKYSI
Sbjct: 134 TVAVLICDDDVEIDAKSFAFAFRVWESNPDRLIGFFARSHDIDLTRKKWIYTIHPDKYSI 193

Query: 189 VLTKLMILKMEYLFEYTCGGGAAMADMRRVVDRERNCEDILMNFMVADMSNAGPILVAAQ 248
           VLTK MILK  YLFEY+CGGG+AMA  R VVDR RNCEDILMNF+VA+ + AGP+LV A 
Sbjct: 194 VLTKFMILKNRYLFEYSCGGGSAMARARSVVDRARNCEDILMNFVVAEETGAGPVLVGAN 253

Query: 249 RIRDWGDPRND--YDENERLGLGEGVSEVGLSNRKGEHRKRRGRCITEFHRRLGRMPLRY 308
             RDWGD RN+   D + R GL   V++VGLS+R+ EHRKRRG CI+EFHR LG+M LRY
Sbjct: 254 WARDWGDARNEDIGDGDGRRGLSGTVAQVGLSSRRAEHRKRRGECISEFHRVLGKMALRY 313

Query: 309 SYGKLVNSVGEQALCRKGGKLVPCDHN 329
           SYGK+VNSVGEQ LC+KGGKLV CD N
Sbjct: 314 SYGKVVNSVGEQGLCQKGGKLVFCDQN 340

BLAST of Cp4.1LG06g04410 vs. TrEMBL
Match: V4UBE2_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10010680mg PE=4 SV=1)

HSP 1 Score: 420.2 bits (1079), Expect = 2.3e-114
Identity = 215/333 (64.56%), Positives = 254/333 (76.28%), Query Frame = 1

Query: 1   MNLKSLTIFILLSFSTAVCSLLKATAAA--CDAESLPDRRNLRPDQITVLINGYYESRIP 60
           ++L  L + +L++  + V S    T  A  C A +  D R LR DQITVL+NGY E RIP
Sbjct: 5   ISLSFLLVLLLVTRESVVFSYRTVTTDADPCAATNQRDPRTLRWDQITVLMNGYSEDRIP 64

Query: 61  LLQSIAATYAASPIVHAVLILWGNPSTSSKTLTQLAQNLT-----TGPISVIRQSSNSLN 120
           LLQSIAATY ASP+V +VL+LWGNPST ++TL+QL+ NL+     +  IS+IRQ S+SLN
Sbjct: 65  LLQSIAATYTASPLVSSVLVLWGNPSTPTRTLSQLSHNLSLSSFGSASISLIRQPSSSLN 124

Query: 121 SRFRPHKSIQTGAVLICDDDVEIDISSLEFAFRVWGRNPERLVGFFVRSHDLDLSRREWI 180
           +RF P  SI+T AVLICDDDVE+D  SLEFAFR+W  N  RL+G F RSHD+DL  +EWI
Sbjct: 125 ARFLPRSSIRTHAVLICDDDVEMDQKSLEFAFRIWQSNANRLIGVFARSHDVDLVNKEWI 184

Query: 181 YTVHQDKYSIVLTKLMILKMEYLFEYTCGGGAAMADMRRVVDRERNCEDILMNFMVADMS 240
           YTVH DKYSIVLTKLM LK  YLFEY+CGGGAAM +MRR+VD  RNCEDILMNF+VAD  
Sbjct: 185 YTVHPDKYSIVLTKLMFLKSSYLFEYSCGGGAAMGEMRRIVDEMRNCEDILMNFVVADRI 244

Query: 241 NAGPILVAAQRIRDWGDPRNDYDENERLGLGEGVSEVGLSNRKGEHRKRRGRCITEFHRR 300
           NAGP++V A+R+RDWGD RND D+    G    VS VGLS+RK EHRKRRG+CI EFHR 
Sbjct: 245 NAGPLMVGAERVRDWGDARNDRDDGGARGSRSRVSAVGLSSRKMEHRKRRGKCIREFHRV 304

Query: 301 LGRMPLRYSYGKLVNSVGEQALCRKGGKLVPCD 327
           LGRMPLRYSYGK+VNSVGEQ LC  GGKLV CD
Sbjct: 305 LGRMPLRYSYGKVVNSVGEQGLCENGGKLVFCD 337

BLAST of Cp4.1LG06g04410 vs. TrEMBL
Match: M5XE41_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008992mg PE=4 SV=1)

HSP 1 Score: 419.9 bits (1078), Expect = 3.0e-114
Identity = 206/309 (66.67%), Positives = 251/309 (81.23%), Query Frame = 1

Query: 29  CDAESLPDRRNLRPDQITVLINGYYESRIPLLQSIAATYAASPIVHAVLILWGNPSTSSK 88
           C+  +  + + L  DQ+TVLINGY ESRIPLLQSI A+YAAS +V ++L+LWGNPST S+
Sbjct: 4   CNPTAQQEPQTLISDQLTVLINGYSESRIPLLQSIIASYAASSLVSSILVLWGNPSTPSQ 63

Query: 89  TLTQLAQNLTTGP-----ISVIRQSSNSLNSRFRPHKSIQTGAVLICDDDVEIDISSLEF 148
           TL QLA NLT        ISVIRQ+S+SLN+RF P  SI+T AVLICDDDVE+D  S EF
Sbjct: 64  TLAQLAHNLTQSSFGFNGISVIRQTSDSLNNRFLPRPSIKTRAVLICDDDVEVDPKSFEF 123

Query: 149 AFRVWGRNPERLVGFFVRSHDLDLSRREWIYTVHQDKYSIVLTKLMILKMEYLFEYTCGG 208
           AF++WG NP+RLVGFFVRSHD+DLS++EWIYT+H DKYSI+LTK M+LK EYLF Y+C G
Sbjct: 124 AFKMWGSNPDRLVGFFVRSHDIDLSKKEWIYTIHPDKYSIMLTKFMLLKSEYLFRYSCAG 183

Query: 209 GAAMADMRRVVDRERNCEDILMNFMVADMSNAGPILVAAQRIRDWGDPRNDYDENERLG- 268
           G  MA MRR+VD+  NCEDILMNF+VAD  N+GPILV A+R+RDWGD RND+D+++  G 
Sbjct: 184 GPVMAHMRRIVDKMNNCEDILMNFVVADEVNSGPILVGAERVRDWGDARNDHDDDDGNGR 243

Query: 269 ---LGEGVSEVGLSNRKGEHRKRRGRCITEFHRRLGRMPLRYSYGKLVNSVGEQALCRKG 328
              +GE V++VGLS+RKG+HRKRRG CI EFHR LGRMPLR+SYGK+VNSVGEQ LC+KG
Sbjct: 244 HRLIGE-VAQVGLSSRKGKHRKRRGECIGEFHRVLGRMPLRFSYGKVVNSVGEQGLCQKG 303

BLAST of Cp4.1LG06g04410 vs. TrEMBL
Match: A0A0D2PU48_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G132700 PE=4 SV=1)

HSP 1 Score: 411.8 bits (1057), Expect = 8.1e-112
Identity = 203/330 (61.52%), Positives = 257/330 (77.88%), Query Frame = 1

Query: 5   SLTIFILLSFSTAVCSLLKATA--AACDAESLPDRRNLRPDQITVLINGYYESRIPLLQS 64
           SL++ + L        + K T   + C   + P+ R LR DQ+TVLINGY ESRIPLLQS
Sbjct: 13  SLSVVLCLRIPNLSSDIEKDTILLSVCHPGNQPNPRTLRSDQLTVLINGYSESRIPLLQS 72

Query: 65  IAATYAASPIVHAVLILWGNPSTSSKTLTQLAQNLTTGP-----ISVIRQSSNSLNSRFR 124
           IAA+Y+ASP+V +VL+LWGNPSTS  TL+QLA NL+        IS++ Q S+SLN+RF 
Sbjct: 73  IAASYSASPVVSSVLVLWGNPSTSPLTLSQLAYNLSVSSWGDAAISLVPQPSSSLNARFL 132

Query: 125 PHKSIQTGAVLICDDDVEIDISSLEFAFRVWGRNPERLVGFFVRSHDLDLSRREWIYTVH 184
           P  SI T AVL+CDDDVE+D+ ++EFAFR+W  NP+RL+G FVRSHD+D++R+EWIYTVH
Sbjct: 133 PRSSIGTRAVLVCDDDVEVDLKTVEFAFRMWKGNPDRLIGIFVRSHDIDMTRKEWIYTVH 192

Query: 185 QDKYSIVLTKLMILKMEYLFEYTCGGGAAMADMRRVVDRERNCEDILMNFMVADMSNAGP 244
            +KYSIVLTK M++K EYLF+Y+CGGG+ M +MR++VD  RNCEDILMNF+VA+ +N GP
Sbjct: 193 PNKYSIVLTKFMMMKREYLFKYSCGGGSPMHEMRKMVDEMRNCEDILMNFVVAEETNTGP 252

Query: 245 ILVAAQRIRDWGDPRNDYDENERLGLGEGVSEVGLSNRKGEHRKRRGRCITEFHRRLGRM 304
           ++V A+R RDWGDPRN+  + + L L   V +VGLS+RK EHRKRRG+CITEFHR LGRM
Sbjct: 253 LMVGAERARDWGDPRNEDSDGDGLRL---VRDVGLSSRKAEHRKRRGKCITEFHRVLGRM 312

Query: 305 PLRYSYGKLVNSVGEQALCRKGGKLVPCDH 328
           PLRYSYGKLV+SVGEQ LC+KG  LV CDH
Sbjct: 313 PLRYSYGKLVSSVGEQGLCKKGSNLVLCDH 339

BLAST of Cp4.1LG06g04410 vs. TAIR10
Match: AT1G80290.2 (AT1G80290.2 Nucleotide-diphospho-sugar transferases superfamily protein)

HSP 1 Score: 387.1 bits (993), Expect = 1.1e-107
Identity = 200/336 (59.52%), Positives = 242/336 (72.02%), Query Frame = 1

Query: 1   MNLKSLTI---FILLSFSTAVCSLLKATAAACDAESLPDRRNLRPDQITVLINGYYESRI 60
           M +KS+     F+ +      C  L      CDA +  + + LR DQITVLINGY E RI
Sbjct: 9   MGVKSVRFSIWFLFVVTDLVFCRTLSGDPDPCDATNQREFQKLRSDQITVLINGYSEYRI 68

Query: 61  PLLQSIAATYAASPIVHAVLILWGNPSTSSKTLTQLAQNLT-----TGPISVIRQSSNSL 120
           PLLQ+I A+Y++S IV ++L+LWGNPST  + L QL QNLT     +  IS+I+QSS+SL
Sbjct: 69  PLLQTIVASYSSSSIVSSILVLWGNPSTPDQLLDQLYQNLTQYSPGSASISLIQQSSSSL 128

Query: 121 NSRFRPHKSIQTGAVLICDDDVEIDISSLEFAFRVWGRNPERLVGFFVRSHDLDLSRREW 180
           N+RF P  S+ T AVLICDDDVEID  SLEFAF VW  NP+RLVG FVRSH  DL  +EW
Sbjct: 129 NARFLPRSSVDTRAVLICDDDVEIDQRSLEFAFSVWKSNPDRLVGTFVRSHGFDLQGKEW 188

Query: 181 IYTVHQDKYSIVLTKLMILKMEYLFEYTCGGGAAMADMRRVVDRERNCEDILMNFMVADM 240
           IYTVH DKYSIVLTK M++K +YLFEY+C GG  M +MR +VD+ RNCEDILMNF+ AD 
Sbjct: 189 IYTVHPDKYSIVLTKFMMMKQDYLFEYSCKGGVEMEEMRMIVDQMRNCEDILMNFVAADR 248

Query: 241 SNAGPILVAAQRIRDWGDPRNDYDENERLGLGEGVSEVGLSNRKGEHRKRRGRCITEFHR 300
             AGPI+V A+R+RDWGD RN+  E       E V +VGLS+R+ EHRKRRG CI EFHR
Sbjct: 249 LRAGPIMVGAERVRDWGDARNEEVE-------ERVRDVGLSSRRVEHRKRRGNCIREFHR 308

Query: 301 RLGRMPLRYSYGKLVNSVGEQALCRKGGKLVPCDHN 329
            +G+MPL YSYGK+VNSVGEQ LCRK GKLV CD +
Sbjct: 309 VMGKMPLMYSYGKVVNSVGEQGLCRKAGKLVFCDRD 337

BLAST of Cp4.1LG06g04410 vs. TAIR10
Match: AT3G55830.1 (AT3G55830.1 Nucleotide-diphospho-sugar transferases superfamily protein)

HSP 1 Score: 107.5 bits (267), Expect = 1.6e-23
Identity = 81/280 (28.93%), Positives = 132/280 (47.14%), Query Frame = 1

Query: 46  TVLINGYYESRIPLLQSIAATYAASPIVHAVLILWGNPSTSSKTLTQLAQNLTT-----G 105
           T+L+N +   R  LL+   + YA+   + ++ I+W  P+  S++L +   N+       G
Sbjct: 75  TLLMNTW--KRYDLLKKSVSHYASCSRLDSIHIVWSEPNPPSESLKEYLHNVLKKKTRDG 134

Query: 106 PISVIR---QSSNSLNSRFRPHKSIQTGAVLICDDDVEIDISSLEFAFRVWGRNPERLVG 165
               +R      +SLN+RF+  K ++T AV   DDD+     +++FAF VW   P+ +VG
Sbjct: 135 HEVELRFDINKEDSLNNRFKEIKDLKTDAVFSIDDDIIFPCHTVDFAFNVWESAPDTMVG 194

Query: 166 FFVRSHDLDLSRRE--------WIYTVHQDKYSIVLTKLMILKMEYLFEYTCGGGAAMAD 225
           F  R H  + S  +        W        YS+VL+K      +YL  YT    +  A 
Sbjct: 195 FVPRVHWPEKSNDKANYYTYSGWWSVWWSGTYSMVLSKAAFFHKKYLSLYT---NSMPAS 254

Query: 226 MRRVVDRERNCEDILMNFMVADMSNAGPILVAAQRIRDWGDPRNDYDENERLGLGEGVSE 285
           +R    + RNCEDI M+F++A+ +NA P +    +I + G                G+S 
Sbjct: 255 IREFTTKNRNCEDIAMSFLIANATNA-PAIWVKGKIYEIG--------------STGISS 314

Query: 286 VGLSNRKGEHRKRRGRCITEFHRRLGRMPLRYSYGKLVNS 310
           +      G H ++R  C+  F    G+MPL Y+  K V+S
Sbjct: 315 I------GGHTEKRTHCVNRFVAEFGKMPLVYTSMKAVDS 328

BLAST of Cp4.1LG06g04410 vs. TAIR10
Match: AT5G04500.1 (AT5G04500.1 glycosyltransferase family protein 47)

HSP 1 Score: 99.4 bits (246), Expect = 4.5e-21
Identity = 70/253 (27.67%), Positives = 120/253 (47.43%), Query Frame = 1

Query: 53  YESRIPLLQSIAATYAASPIVHAVLILWGNPSTSSKTLTQLAQNLTTGPISVIRQSSNSL 112
           Y++R+  L+     Y+  P V  ++++W            L++  +  P+ +  Q  NSL
Sbjct: 525 YDARLWNLKMYVKRYSRCPSVKEIVVIWNKGPPPD-----LSELDSAVPVRIRVQKQNSL 584

Query: 113 NSRFRPHKSIQTGAVLICDDDVEIDISSLEFAFRVWGRNPERLVGFFVRSHDLDLSRREW 172
           N+RF     I+T AVL  DDD+ +    +E  FRVW  +PERLVGF+ R  D  ++    
Sbjct: 585 NNRFEIDPLIKTRAVLELDDDIMMPCDDIEKGFRVWREHPERLVGFYPRFVDQTMTYSAE 644

Query: 173 IYTVHQDKYSIVLTKLMILKMEYLFEYTCGGGAAMADMRRVVDRERNCEDILMNFMVADM 232
            +      Y+++LT    + + + F+      A +   R  VD + NCEDIL+NF+ A+ 
Sbjct: 645 KFARSHKGYNMILTGAAFMDVRFAFDMYQSDKAKLG--RVFVDEQFNCEDILLNFLYANA 704

Query: 233 SNAGPILVAAQRIRDWGDPRNDYDENERLGLGEGVSEVGLSNRKGEHRKRRGRCITEFHR 292
           S +G    A + +R    P     +  +       S V +S    +H ++R +C+  F  
Sbjct: 705 SGSGK---AVEYVR----PSLVTIDTSKF------SGVAISGNTNQHYRKRSKCLRRFSD 757

Query: 293 RLGRM-PLRYSYG 305
             G +   R+ +G
Sbjct: 765 LYGSLVDRRWEFG 757

BLAST of Cp4.1LG06g04410 vs. NCBI nr
Match: gi|659132573|ref|XP_008466270.1| (PREDICTED: exostosin-like 2 [Cucumis melo])

HSP 1 Score: 575.9 bits (1483), Expect = 4.6e-161
Identity = 291/332 (87.65%), Positives = 305/332 (91.87%), Query Frame = 1

Query: 1   MNLKSLTIFILL-SFSTAVCSLLKATAAACDAESLPDRRNLRPDQITVLINGYYESRIPL 60
           MN KSLTI +LL SFSTAV SL   T AAC AESLPDRRNLR DQITVLINGYYESRIPL
Sbjct: 19  MNFKSLTILVLLLSFSTAVYSLHNETTAACAAESLPDRRNLRSDQITVLINGYYESRIPL 78

Query: 61  LQSIAATYAASPIVHAVLILWGNPSTSSKTLTQLAQNLTTGPISVIRQSSNSLNSRFRPH 120
           LQS+AA YAASP VH VLILWGNPSTS+KTLT+LAQNLTTGPI++IRQSSNSLNSRF P 
Sbjct: 79  LQSLAARYAASPFVHTVLILWGNPSTSTKTLTKLAQNLTTGPITLIRQSSNSLNSRFLPR 138

Query: 121 KSIQTGAVLICDDDVEIDISSLEFAFRVWGRNPERLVGFFVRSHDLDLSRREWIYTVHQD 180
           KSIQT AVLICDDDVEID  SLEFAFR+WGRNPERLVGFFVRSHDLDLSRREWIYTVHQD
Sbjct: 139 KSIQTSAVLICDDDVEIDTPSLEFAFRIWGRNPERLVGFFVRSHDLDLSRREWIYTVHQD 198

Query: 181 KYSIVLTKLMILKMEYLFEYTCGGGAAMADMRRVVDRERNCEDILMNFMVADMSNAGPIL 240
           KYSIVLTKLMILK EYLFEY+CGGGAAMADMRRVVD ERNCEDILMNF+VADMSNAGPI+
Sbjct: 199 KYSIVLTKLMILKAEYLFEYSCGGGAAMADMRRVVDLERNCEDILMNFVVADMSNAGPIM 258

Query: 241 VAAQRIRDWGDPRNDYDE-NERLGLGEGVSEVGLSNRKGEHRKRRGRCITEFHRRLGRMP 300
           VAAQRIRDWGDPRN+YD+ NERL L EG SE+GLSNRKGEHRKRRGRCITEFHRRLGRMP
Sbjct: 259 VAAQRIRDWGDPRNEYDDGNERLRLREGASEIGLSNRKGEHRKRRGRCITEFHRRLGRMP 318

Query: 301 LRYSYGKLVNSVGEQALCRKGGKLVPCDHNVL 331
           LRYSYGK VNS+GEQALCRKG KLVPCD NVL
Sbjct: 319 LRYSYGKSVNSIGEQALCRKGRKLVPCDQNVL 350

BLAST of Cp4.1LG06g04410 vs. NCBI nr
Match: gi|449437316|ref|XP_004136438.1| (PREDICTED: glycosyltransferase family protein 64 C3 [Cucumis sativus])

HSP 1 Score: 574.3 bits (1479), Expect = 1.3e-160
Identity = 291/332 (87.65%), Positives = 307/332 (92.47%), Query Frame = 1

Query: 1   MNLKSLTIFILL-SFSTAVCSLLKATAAACDAESLPDRRNLRPDQITVLINGYYESRIPL 60
           MN +SLTI +LL SFSTAV SL K TAAAC AESLPDRRNLR DQITVLINGYYESRIPL
Sbjct: 1   MNFESLTILVLLLSFSTAVYSLHKETAAACAAESLPDRRNLRSDQITVLINGYYESRIPL 60

Query: 61  LQSIAATYAASPIVHAVLILWGNPSTSSKTLTQLAQNLTTGPISVIRQSSNSLNSRFRPH 120
           LQS+AA YAASP VH VLILWGNPSTS++TLT+LAQNLTTGPIS+IRQSSNSLNSRF P 
Sbjct: 61  LQSLAARYAASPFVHTVLILWGNPSTSTETLTKLAQNLTTGPISLIRQSSNSLNSRFLPR 120

Query: 121 KSIQTGAVLICDDDVEIDISSLEFAFRVWGRNPERLVGFFVRSHDLDLSRREWIYTVHQD 180
           KSIQT AVLICDDDVEID  SLEFAFR+WGRNPERLVGFFVRSHDLDLSRREWIYT+HQD
Sbjct: 121 KSIQTFAVLICDDDVEIDTPSLEFAFRIWGRNPERLVGFFVRSHDLDLSRREWIYTIHQD 180

Query: 181 KYSIVLTKLMILKMEYLFEYTCGGGAAMADMRRVVDRERNCEDILMNFMVADMSNAGPIL 240
           KYSIVLTKLMILK EYLFEY+CGGGAAMADMRRVVD ERNCEDILMNF+VADMSNAGPI+
Sbjct: 181 KYSIVLTKLMILKAEYLFEYSCGGGAAMADMRRVVDVERNCEDILMNFVVADMSNAGPIM 240

Query: 241 VAAQRIRDWGDPRNDYDE-NERLGLGEGVSEVGLSNRKGEHRKRRGRCITEFHRRLGRMP 300
           VAAQRIRDWGDPRN+YD+ NERL L EGVSE+GLSNRKGEHRKRRG CITEFHRRLGRMP
Sbjct: 241 VAAQRIRDWGDPRNEYDDGNERLRLREGVSEIGLSNRKGEHRKRRGGCITEFHRRLGRMP 300

Query: 301 LRYSYGKLVNSVGEQALCRKGGKLVPCDHNVL 331
           LRYSYGK VNS+GEQALCRKG KLVPCD NVL
Sbjct: 301 LRYSYGKSVNSIGEQALCRKGRKLVPCDQNVL 332

BLAST of Cp4.1LG06g04410 vs. NCBI nr
Match: gi|657972676|ref|XP_008378124.1| (PREDICTED: exostosin-like 3 [Malus domestica])

HSP 1 Score: 425.2 bits (1092), Expect = 1.0e-115
Identity = 213/329 (64.74%), Positives = 260/329 (79.03%), Query Frame = 1

Query: 6   LTIFILLSFSTAVCSLLKATAAACDAESLPDRRNLRPDQITVLINGYYESRIPLLQSIAA 65
           + + IL S    V S+   +   C+ ++  D +NL  DQ+TVLINGY ESRIPLLQSI +
Sbjct: 23  MXLVILSSGGVHVLSVRTLSTDPCNPKAQQDPQNLISDQLTVLINGYSESRIPLLQSIVS 82

Query: 66  TYAASPIVHAVLILWGNPSTSSKTLTQLAQNLTT-----GPISVIRQSSNSLNSRFRPHK 125
           TYAAS +V  +L+LWGNPST S+TL+QLA+NLT      G ISVIRQ+S+SLN+RF P  
Sbjct: 83  TYAASSLVSYILVLWGNPSTPSQTLSQLARNLTDSSFGFGGISVIRQASDSLNNRFLPRP 142

Query: 126 SIQTGAVLICDDDVEIDISSLEFAFRVWGRNPERLVGFFVRSHDLDLSRREWIYTVHQDK 185
            I+T AVL+CDDDVE+D  S EFAF++WG NP+RLVGFFVRSHD+DLSR+EWIYTVH DK
Sbjct: 143 EIKTRAVLVCDDDVEVDPKSFEFAFKMWGSNPDRLVGFFVRSHDIDLSRKEWIYTVHPDK 202

Query: 186 YSIVLTKLMILKMEYLFEYTCGGGAAMADMRRVVDRERNCEDILMNFMVADMSNAGPILV 245
           YSI+LTK M+LK EYLF Y+C GG  MA MR++VDR +NCEDILMNF+ AD  NAGPILV
Sbjct: 203 YSIMLTKFMLLKSEYLFRYSCAGGPVMASMRKIVDRMQNCEDILMNFVAADEVNAGPILV 262

Query: 246 AAQRIRDWGDPRNDYDENE-RLGLGEGVSEVGLSNRKGEHRKRRGRCITEFHRRLGRMPL 305
            A+R+RDWGD RND+D+ + R GL   V++VGLS+RK +HRKRRG CI EFHR LGRMPL
Sbjct: 263 GAERVRDWGDARNDHDDGDARRGLIGEVAQVGLSSRKTKHRKRRGECIGEFHRVLGRMPL 322

Query: 306 RYSYGKLVNSVGEQALCRKGGKLVPCDHN 329
           R+SYGK+VNSVGEQ LC+KGGKLV CD +
Sbjct: 323 RFSYGKVVNSVGEQGLCQKGGKLVFCDQS 351

BLAST of Cp4.1LG06g04410 vs. NCBI nr
Match: gi|703159030|ref|XP_010112145.1| (Exostosin-like 3 [Morus notabilis])

HSP 1 Score: 423.3 bits (1087), Expect = 3.8e-115
Identity = 216/327 (66.06%), Positives = 252/327 (77.06%), Query Frame = 1

Query: 9   FILLSFSTAVCSLLKATAAACDAESLPDRRNLRPDQITVLINGYYESRIPLLQSIAATYA 68
           F  + F     SL   T+  CD+ +L D R LR DQITVLINGY ESRIPLLQS+AATY+
Sbjct: 14  FFFVFFFKPSLSLRTLTSDPCDSTALRDPRTLRSDQITVLINGYSESRIPLLQSLAATYS 73

Query: 69  ASPIVHAVLILWGNPSTSSKTLTQLAQNLT-----TGPISVIRQSSNSLNSRFRPHKSIQ 128
           ASP+V AV ILWGNPST S+TL + + NLT     + PIS+IRQSS SLN+RF P  SI 
Sbjct: 74  ASPLVSAVFILWGNPSTPSQTLAEFSHNLTAFSFGSAPISLIRQSSPSLNARFLPRPSIA 133

Query: 129 TGAVLICDDDVEIDISSLEFAFRVWGRNPERLVGFFVRSHDLDLSRREWIYTVHQDKYSI 188
           T AVLICDDDVEID  S  FAFRVW  NP+RL+GFF RSHD+DL+R++WIYT+H DKYSI
Sbjct: 134 TVAVLICDDDVEIDAKSFAFAFRVWESNPDRLIGFFARSHDIDLTRKKWIYTIHPDKYSI 193

Query: 189 VLTKLMILKMEYLFEYTCGGGAAMADMRRVVDRERNCEDILMNFMVADMSNAGPILVAAQ 248
           VLTK MILK  YLFEY+CGGG+AMA  R VVDR RNCEDILMNF+VA+ + AGP+LV A 
Sbjct: 194 VLTKFMILKNRYLFEYSCGGGSAMARARSVVDRARNCEDILMNFVVAEETGAGPVLVGAN 253

Query: 249 RIRDWGDPRND--YDENERLGLGEGVSEVGLSNRKGEHRKRRGRCITEFHRRLGRMPLRY 308
             RDWGD RN+   D + R GL   V++VGLS+R+ EHRKRRG CI+EFHR LG+M LRY
Sbjct: 254 WARDWGDARNEDIGDGDGRRGLSGTVAQVGLSSRRAEHRKRRGECISEFHRVLGKMALRY 313

Query: 309 SYGKLVNSVGEQALCRKGGKLVPCDHN 329
           SYGK+VNSVGEQ LC+KGGKLV CD N
Sbjct: 314 SYGKVVNSVGEQGLCQKGGKLVFCDQN 340

BLAST of Cp4.1LG06g04410 vs. NCBI nr
Match: gi|1009156326|ref|XP_015896190.1| (PREDICTED: glycosyltransferase family protein 64 C3 [Ziziphus jujuba])

HSP 1 Score: 423.3 bits (1087), Expect = 3.8e-115
Identity = 220/340 (64.71%), Positives = 258/340 (75.88%), Query Frame = 1

Query: 1   MNLKSLTIFILLSF-------STAVCSLLKATAAACDAESLPDRRNLRPDQITVLINGYY 60
           MNL   ++F+L  F       S +V SL    A  C   S  D R LR DQIT+LING+ 
Sbjct: 1   MNLLLTSLFLLFFFFFFFFFQSYSVLSLRMLEADPCGPTSHRDPRTLRSDQITILINGFS 60

Query: 61  ESRIPLLQSIAATYAASPIVHAVLILWGNPSTSSKTLTQLAQNLT-----TGPISVIRQS 120
           ESRIPLLQSI A Y+ASPIV AVL+LWGNPSTS++TL +LA NLT     +  IS+ RQ 
Sbjct: 61  ESRIPLLQSITAAYSASPIVSAVLVLWGNPSTSAQTLEELAHNLTLSSFGSATISLFRQP 120

Query: 121 SNSLNSRFRPHKSIQTGAVLICDDDVEIDISSLEFAFRVWGRNPERLVGFFVRSHDLDLS 180
           S+SLN+RF P  +++T AVLICDDDVEID  S  FAFR+WG NP+RL+GFF RSHD DL 
Sbjct: 121 SSSLNARFLPRPTVETRAVLICDDDVEIDSKSFAFAFRIWGANPDRLIGFFARSHDFDLL 180

Query: 181 RREWIYTVHQDKYSIVLTKLMILKMEYLFEYTCGGGAAMADMRRVVDRERNCEDILMNFM 240
           R+EWIYTVH D+YSIVLTK MILK EYLF+Y+CGGG AM  MR +VDR RNCEDILMNF+
Sbjct: 181 RKEWIYTVHPDRYSIVLTKSMILKTEYLFQYSCGGGVAMHRMRDIVDRMRNCEDILMNFV 240

Query: 241 VADMSNAGPILVAAQRIRDWGDPRNDYD--ENERLGLGEGVSEVGLSNRKGEHRKRRGRC 300
           VAD  NAGPILV A+R RDWGD RND D  ++ R GL   V++VGLS+R+GEHRKRRG C
Sbjct: 241 VADEVNAGPILVGAERARDWGDARNDKDDIDDGRRGLMGEVAQVGLSSRRGEHRKRRGEC 300

Query: 301 ITEFHRRLGRMPLRYSYGKLVNSVGEQALCRKGGKLVPCD 327
           I EFHR +G+MPLRYSYGK+VNSVGEQ LC+KG KLV CD
Sbjct: 301 IREFHRVMGKMPLRYSYGKVVNSVGEQGLCQKGRKLVFCD 340

The following BLAST results are available for this feature:

Swiss-Prot
Match Name                         E-value   Identity (%)  Description
GT643_ARATH                        1.9e-106  59.52         Glycosyltransferase family protein 64 C3 OS=Arabidopsis thaliana GN=At1g80290 PE...
EXTL3_MOUSE                        7.7e-23   30.45         Exostosin-like 3 OS=Mus musculus GN=Extl3 PE=1 SV=2
EXTL3_HUMAN                        1.7e-22   30.45         Exostosin-like 3 OS=Homo sapiens GN=EXTL3 PE=1 SV=1
GT644_ARATH                        2.9e-22   28.93         Glycosyltransferase family 64 protein C4 OS=Arabidopsis thaliana GN=EPC1 PE=2 SV...
EXT3_DROME                         1.1e-21   32.08         Exostosin-3 OS=Drosophila melanogaster GN=botv PE=1 SV=1

TrEMBL
Match Name                         E-value   Identity (%)  Description
A0A0A0LDW4_CUCSA                   9.4e-161  87.65         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G883010 PE=4 SV=1
W9S9Z8_9ROSA                       2.7e-115  66.06         Exostosin-like 3 OS=Morus notabilis GN=L484_019884 PE=4 SV=1
V4UBE2_9ROSI                       2.3e-114  64.56         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10010680mg PE=4 SV=1
M5XE41_PRUPE                       3.0e-114  66.67         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008992mg PE=4 SV=1
A0A0D2PU48_GOSRA                   8.1e-112  61.52         Uncharacterized protein OS=Gossypium raimondii GN=B456_008G132700 PE=4 SV=1

TAIR10
Match Name                         E-value   Identity (%)  Description
AT1G80290.2                        1.1e-107  59.52         Nucleotide-diphospho-sugar transferases superfamily protein
AT3G55830.1                        1.6e-23   28.93         Nucleotide-diphospho-sugar transferases superfamily protein
AT5G04500.1                        4.5e-21   27.67         glycosyltransferase family protein 47

NCBI nr
Match Name                         E-value   Identity (%)  Description
gi|659132573|ref|XP_008466270.1|   4.6e-161  87.65         PREDICTED: exostosin-like 2 [Cucumis melo]
gi|449437316|ref|XP_004136438.1|   1.3e-160  87.65         PREDICTED: glycosyltransferase family protein 64 C3 [Cucumis sativus]
gi|657972676|ref|XP_008378124.1|   1.0e-115  64.74         PREDICTED: exostosin-like 3 [Malus domestica]
gi|703159030|ref|XP_010112145.1|   3.8e-115  66.06         Exostosin-like 3 [Morus notabilis]
gi|1009156326|ref|XP_015896190.1|  3.8e-115  64.71         PREDICTED: glycosyltransferase family protein 64 C3 [Ziziphus jujuba]

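
When working with a summary like this, hits are usually ranked and filtered by E-value. The following sketch uses the five Swiss-Prot rows from this record; the cutoff of 1e-50 and the helper name `strong_hits` are arbitrary assumptions chosen for illustration, not thresholds used by the database.

```python
# (match name, E-value as printed, identity in percent), from the Swiss-Prot summary.
SWISSPROT_HITS = [
    ("GT643_ARATH", "1.9e-106", 59.52),
    ("EXTL3_MOUSE", "7.7e-23", 30.45),
    ("EXTL3_HUMAN", "1.7e-22", 30.45),
    ("GT644_ARATH", "2.9e-22", 28.93),
    ("EXT3_DROME", "1.1e-21", 32.08),
]

def strong_hits(hits, max_evalue=1e-50):
    """Return hits at or below the E-value cutoff, best (smallest) first."""
    kept = [h for h in hits if float(h[1]) <= max_evalue]
    return sorted(kept, key=lambda h: float(h[1]))

for name, evalue, identity in strong_hits(SWISSPROT_HITS):
    print(name, evalue, identity)
```

With this illustrative cutoff only GT643_ARATH remains; the exostosin hits from animals fall well short of it.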
The following terms have been associated with this gene:
Vocabulary: Cellular Component
Term        Definition
GO:0016021  integral component of membrane
Vocabulary: Biological Process
Term        Definition
GO:0015012  heparan sulfate proteoglycan biosynthetic process
GO:0006024  glycosaminoglycan biosynthetic process
Vocabulary: INTERPRO
Term        Definition
IPR015338   EXT_C
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006024      glycosaminoglycan biosynthetic process
biological_process  GO:0015012      heparan sulfate proteoglycan biosynthetic process
biological_process  GO:0009733      response to auxin
biological_process  GO:0008150      biological_process
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0016757      transferase activity, transferring glycosyl groups

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG06g04410.1  Cp4.1LG06g04410.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description        Source   Source Term     Source Description                                     Alignment
IPR015338  Exostosin, C-terminal  PFAM     PF09258         Glyco_transf_64                                        coord: 45..308; score: 2.1
None       No IPR available       PANTHER  PTHR11062       EXOSTOSIN HEPARAN SULFATE GLYCOSYLTRANSFERASE-RELATED  coord: 36..155; score: 1.4
None       No IPR available       PANTHER  PTHR11062:SF94  GLYCOSYLTRANSFERASE FAMILY PROTEIN 64                  coord: 36..155; score: 1.4

The following gene(s) are paralogous to this gene:

None