Cp4.1LG10g04320 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG10g04320
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionExostosin family protein
LocationCp4.1LG10 : 1534647 .. 1536651 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAGTTGCCGCTTTTTATGATTCCTCTGATCGCTGCTTGTCTTTTTGTCATTTTGGGTCCGCCGTTCTTGTCGGACTGGCTCACTTTCCTTGATCGATATGCTTCATTCCGGAGCTCTCTTTCTTCTTCCTCCTCAGAAATGAATTCTCATTCAAATGCCACGGCGGCGCTACCGGTGCCGGTGACGGTGGATGGTCAAACAGTGGAACAACAACAACAACAACAACAACAACAACAACAACCTCTGTTTAACCACTCTGTTTCAAATCACTCTGTTCTTGCTTCTTCTTCTTCTTCTTCTCCACCGCCACCACCGATCGACGAAATGCAGATTCTTCCACCGGTCTTTGTTCGATTCTCTTTTAATCTTTCGTTCACTTAAACTATGATGAACCCAGAAAAGCCTCTTTCTGTGATGCATGCATCTTTCAAGATTTTGAATTCTTGTTTTGACTGAAACTGTAATGTAGCTTATTTGGCTATGGATAATTATGTAAACAGAATAGGACGAGAGTCAACGAAGAGTTTCTAGAAACCGCCATTAATGAAGTGACCAAGAACGCTTCTGCTGGAAGTAATTATGAACCGGCGGCGAAAGCCAGAAAACAGAGGAGATATACAAAATTGGAGAGAATCGAAGCGGAGCTAAGAGGAGCAAGAGCCGCCATTAGAGAAGCCATGTTCCAAAACCAAACACAAGATTCCGACTTCGTTCCTTCTGGACCAATGTATTGGAATGCCAAAACTTTTCACAGGTCAAATTTGATGGATAAACAGAGAATGTCTCTGTAAATTTGGTTTTTCTCTTGGAGTTTCTTTCAAATTTCGGTTTCTTCTTGGGAATTGTAATTTGTTTGTTCTTCCATTAATGGGAATTTCAGGAGTTACCAAGAAATGGAGAAGGAGCTGAAGATCTTCGTGTACGAAGAAGGAGAGCCTCCCCTGTTTCACAATGGCCCCTGCAAGAACATTTACTCAACGGAAGGGAATTTCATCCATGCCATCGAGATGGACTCCCGGTTTCGAACCAACGACCCCAACAAAGCCCATGTCTTCTTCCTTCCTCTGAGTGTTGTCATGCTCGTTCGGTTCGTCTACGTTCGCGACTCCCATGACTTCACTCCACTACGACGCACCGTCGCTGATTATGTCAATGTCATCGGAACCAAATATCCATTCTGGAACCGCACCCTCGGCGCTGATCATTTCATGCTCTCCTGCCACGATTGGGTAAGTATCACCTCACTTACCACAAATTCATCCAATAAACAGCCCATAATTGATTTGATCACAGGGTCCGGAAGTATCAAAATCAGAACCCCATTTGTACAAGAACTCCATTCGGGTACTGTGCAACGCCAACACATCCGAAGGCTTCAACCCATCAAAGGACGTATCACTCCCGGAAATCAATCTGCAGACGGGTGTGTTGACAGGATTTCTAGGTGGCCCATCTCCCTCGCGCCGCCCAATTCTGGCTTTCTTCGCCGGCGGACTTCACGGCCCAATAAGGCCCATTCTAATCCAGCAATGGGAGAACAAGGACCAGGATATTCGAGTCCACCAGTACCTCCCGGAAGGGGTCTCTTACATTGAGATGATGAGGAAGAGCAGGTTCTGCCTCTGCCCCAGCGGCTACGAAGTCGCCAGTCCCAGAATCGTGGAGGCCATTTACACCGGCTGCGTTCCGGTCCTCATTTCCGATCACTACGTCCCGCCGTTCAGCGACGTCCTCAATTGGAAATCTTTCTCCGTCGAAGTCTCCGTCAACGACATTCCGAACTTGAAGAAGATCCTGGCCGGAATATCGACGCGGCAGTACCTGAGAATGTACCGGAGAGTGGTCAACGTTAGAAGACACTTCGTAGTCAACTTTCCGCCGAAGAGATTCGATGTTTACCATATGATCCTTCATTCTGTGTGGCTTAGAAGACTCAATCTTGGACTACGGGATCATTGACCGTTAATTTTTGGGTTAGAATTTGAATTTGTACACGTATTTA

mRNA sequence

ATGAAGTTGCCGCTTTTTATGATTCCTCTGATCGCTGCTTGTCTTTTTGTCATTTTGGGTCCGCCGTTCTTGTCGGACTGGCTCACTTTCCTTGATCGATATGCTTCATTCCGGAGCTCTCTTTCTTCTTCCTCCTCAGAAATGAATTCTCATTCAAATGCCACGGCGGCGCTACCGGTGCCGGTGACGGTGGATGGTCAAACAGTGGAACAACAACAACAACAACAACAACAACAACAACAACCTCTGTTTAACCACTCTGTTTCAAATCACTCTGTTCTTGCTTCTTCTTCTTCTTCTTCTCCACCGCCACCACCGATCGACGAAATGCAGATTCTTCCACCGAATAGGACGAGAGTCAACGAAGAGTTTCTAGAAACCGCCATTAATGAAGTGACCAAGAACGCTTCTGCTGGAAGTAATTATGAACCGGCGGCGAAAGCCAGAAAACAGAGGAGATATACAAAATTGGAGAGAATCGAAGCGGAGCTAAGAGGAGCAAGAGCCGCCATTAGAGAAGCCATGTTCCAAAACCAAACACAAGATTCCGACTTCGTTCCTTCTGGACCAATGTATTGGAATGCCAAAACTTTTCACAGGAGTTACCAAGAAATGGAGAAGGAGCTGAAGATCTTCGTGTACGAAGAAGGAGAGCCTCCCCTGTTTCACAATGGCCCCTGCAAGAACATTTACTCAACGGAAGGGAATTTCATCCATGCCATCGAGATGGACTCCCGGTTTCGAACCAACGACCCCAACAAAGCCCATGTCTTCTTCCTTCCTCTGAGTGTTGTCATGCTCGTTCGGTTCGTCTACGTTCGCGACTCCCATGACTTCACTCCACTACGACGCACCGTCGCTGATTATGTCAATGTCATCGGAACCAAATATCCATTCTGGAACCGCACCCTCGGCGCTGATCATTTCATGCTCTCCTGCCACGATTGGGGTCCGGAAGTATCAAAATCAGAACCCCATTTGTACAAGAACTCCATTCGGGTACTGTGCAACGCCAACACATCCGAAGGCTTCAACCCATCAAAGGACGTATCACTCCCGGAAATCAATCTGCAGACGGGTGTGTTGACAGGATTTCTAGGTGGCCCATCTCCCTCGCGCCGCCCAATTCTGGCTTTCTTCGCCGGCGGACTTCACGGCCCAATAAGGCCCATTCTAATCCAGCAATGGGAGAACAAGGACCAGGATATTCGAGTCCACCAGTACCTCCCGGAAGGGGTCTCTTACATTGAGATGATGAGGAAGAGCAGGTTCTGCCTCTGCCCCAGCGGCTACGAAGTCGCCAGTCCCAGAATCGTGGAGGCCATTTACACCGGCTGCGTTCCGGTCCTCATTTCCGATCACTACGTCCCGCCGTTCAGCGACGTCCTCAATTGGAAATCTTTCTCCGTCGAAGTCTCCGTCAACGACATTCCGAACTTGAAGAAGATCCTGGCCGGAATATCGACGCGGCAGTACCTGAGAATGTACCGGAGAGTGGTCAACGTTAGAAGACACTTCGTAGTCAACTTTCCGCCGAAGAGATTCGATGTTTACCATATGATCCTTCATTCTGTGTGGCTTAGAAGACTCAATCTTGGACTACGGGATCATTGACCGTTAATTTTTGGGTTAGAATTTGAATTTGTACACGTATTTA

Coding sequence (CDS)

ATGAAGTTGCCGCTTTTTATGATTCCTCTGATCGCTGCTTGTCTTTTTGTCATTTTGGGTCCGCCGTTCTTGTCGGACTGGCTCACTTTCCTTGATCGATATGCTTCATTCCGGAGCTCTCTTTCTTCTTCCTCCTCAGAAATGAATTCTCATTCAAATGCCACGGCGGCGCTACCGGTGCCGGTGACGGTGGATGGTCAAACAGTGGAACAACAACAACAACAACAACAACAACAACAACAACCTCTGTTTAACCACTCTGTTTCAAATCACTCTGTTCTTGCTTCTTCTTCTTCTTCTTCTCCACCGCCACCACCGATCGACGAAATGCAGATTCTTCCACCGAATAGGACGAGAGTCAACGAAGAGTTTCTAGAAACCGCCATTAATGAAGTGACCAAGAACGCTTCTGCTGGAAGTAATTATGAACCGGCGGCGAAAGCCAGAAAACAGAGGAGATATACAAAATTGGAGAGAATCGAAGCGGAGCTAAGAGGAGCAAGAGCCGCCATTAGAGAAGCCATGTTCCAAAACCAAACACAAGATTCCGACTTCGTTCCTTCTGGACCAATGTATTGGAATGCCAAAACTTTTCACAGGAGTTACCAAGAAATGGAGAAGGAGCTGAAGATCTTCGTGTACGAAGAAGGAGAGCCTCCCCTGTTTCACAATGGCCCCTGCAAGAACATTTACTCAACGGAAGGGAATTTCATCCATGCCATCGAGATGGACTCCCGGTTTCGAACCAACGACCCCAACAAAGCCCATGTCTTCTTCCTTCCTCTGAGTGTTGTCATGCTCGTTCGGTTCGTCTACGTTCGCGACTCCCATGACTTCACTCCACTACGACGCACCGTCGCTGATTATGTCAATGTCATCGGAACCAAATATCCATTCTGGAACCGCACCCTCGGCGCTGATCATTTCATGCTCTCCTGCCACGATTGGGGTCCGGAAGTATCAAAATCAGAACCCCATTTGTACAAGAACTCCATTCGGGTACTGTGCAACGCCAACACATCCGAAGGCTTCAACCCATCAAAGGACGTATCACTCCCGGAAATCAATCTGCAGACGGGTGTGTTGACAGGATTTCTAGGTGGCCCATCTCCCTCGCGCCGCCCAATTCTGGCTTTCTTCGCCGGCGGACTTCACGGCCCAATAAGGCCCATTCTAATCCAGCAATGGGAGAACAAGGACCAGGATATTCGAGTCCACCAGTACCTCCCGGAAGGGGTCTCTTACATTGAGATGATGAGGAAGAGCAGGTTCTGCCTCTGCCCCAGCGGCTACGAAGTCGCCAGTCCCAGAATCGTGGAGGCCATTTACACCGGCTGCGTTCCGGTCCTCATTTCCGATCACTACGTCCCGCCGTTCAGCGACGTCCTCAATTGGAAATCTTTCTCCGTCGAAGTCTCCGTCAACGACATTCCGAACTTGAAGAAGATCCTGGCCGGAATATCGACGCGGCAGTACCTGAGAATGTACCGGAGAGTGGTCAACGTTAGAAGACACTTCGTAGTCAACTTTCCGCCGAAGAGATTCGATGTTTACCATATGATCCTTCATTCTGTGTGGCTTAGAAGACTCAATCTTGGACTACGGGATCATTGA

Protein sequence

MKLPLFMIPLIAACLFVILGPPFLSDWLTFLDRYASFRSSLSSSSSEMNSHSNATAALPVPVTVDGQTVEQQQQQQQQQQQPLFNHSVSNHSVLASSSSSSPPPPPIDEMQILPPNRTRVNEEFLETAINEVTKNASAGSNYEPAAKARKQRRYTKLERIEAELRGARAAIREAMFQNQTQDSDFVPSGPMYWNAKTFHRSYQEMEKELKIFVYEEGEPPLFHNGPCKNIYSTEGNFIHAIEMDSRFRTNDPNKAHVFFLPLSVVMLVRFVYVRDSHDFTPLRRTVADYVNVIGTKYPFWNRTLGADHFMLSCHDWGPEVSKSEPHLYKNSIRVLCNANTSEGFNPSKDVSLPEINLQTGVLTGFLGGPSPSRRPILAFFAGGLHGPIRPILIQQWENKDQDIRVHQYLPEGVSYIEMMRKSRFCLCPSGYEVASPRIVEAIYTGCVPVLISDHYVPPFSDVLNWKSFSVEVSVNDIPNLKKILAGISTRQYLRMYRRVVNVRRHFVVNFPPKRFDVYHMILHSVWLRRLNLGLRDH
BLAST of Cp4.1LG10g04320 vs. Swiss-Prot
Match: GLYT3_ARATH (Probable glycosyltransferase At5g03795 OS=Arabidopsis thaliana GN=At5g03795 PE=3 SV=2)

HSP 1 Score: 609.8 bits (1571), Expect = 2.9e-173
Identity = 287/449 (63.92%), Postives = 355/449 (79.06%), Query Frame = 1

Query: 97  SSSSSPPPPPI--DEMQILPPNRTRVNEEFLETAINEVTK----NASAGSNYEPAAKA-- 156
           S++ +P P P+  + +  LP +      E ++   N   +    N +A SN   +  +  
Sbjct: 69  STAPAPAPSPLLPEILPSLPASSLSTKVESIQGDYNRTIQLNMINVTATSNNVSSTASLE 128

Query: 157 -RKQRRYTKLERIEAELRGARAAIREAMFQNQTQDSDFVPSGPMYWNAKTFHRSYQEMEK 216
            +K+R  + LE+IE +L+ ARA+I+ A   +   D D+VP GPMYWNAK FHRSY EMEK
Sbjct: 129 PKKRRVLSNLEKIEFKLQKARASIKAASMDDPVDDPDYVPLGPMYWNAKVFHRSYLEMEK 188

Query: 217 ELKIFVYEEGEPPLFHNGPCKNIYSTEGNFIHAIEMDSRFRTNDPNKAHVFFLPLSVVML 276
           + KI+VY+EGEPPLFH+GPCK+IYS EG+FI+ IE D+RFRTN+P+KAHVF+LP SVV +
Sbjct: 189 QFKIYVYKEGEPPLFHDGPCKSIYSMEGSFIYEIETDTRFRTNNPDKAHVFYLPFSVVKM 248

Query: 277 VRFVYVRDSHDFTPLRRTVADYVNVIGTKYPFWNRTLGADHFMLSCHDWGPEVSKSEPHL 336
           VR+VY R+S DF+P+R TV DY+N++G KYP+WNR++GADHF+LSCHDWGPE S S PHL
Sbjct: 249 VRYVYERNSRDFSPIRNTVKDYINLVGDKYPYWNRSIGADHFILSCHDWGPEASFSHPHL 308

Query: 337 YKNSIRVLCNANTSEGFNPSKDVSLPEINLQTGVLTGFLGGPSPSRRPILAFFAGGLHGP 396
             NSIR LCNANTSE F P KDVS+PEINL+TG LTG +GGPSPS RPILAFFAGG+HGP
Sbjct: 309 GHNSIRALCNANTSERFKPRKDVSIPEINLRTGSLTGLVGGPSPSSRPILAFFAGGVHGP 368

Query: 397 IRPILIQQWENKDQDIRVHQYLPEGVSYIEMMRKSRFCLCPSGYEVASPRIVEAIYTGCV 456
           +RP+L+Q WENKD DIRVH+YLP G SY +MMR S+FC+CPSGYEVASPRIVEA+Y+GCV
Sbjct: 369 VRPVLLQHWENKDNDIRVHKYLPRGTSYSDMMRNSKFCICPSGYEVASPRIVEALYSGCV 428

Query: 457 PVLISDHYVPPFSDVLNWKSFSVEVSVNDIPNLKKILAGISTRQYLRMYRRVVNVRRHFV 516
           PVLI+  YVPPFSDVLNW+SFSV VSV DIPNLK IL  IS RQYLRMYRRV+ VRRHF 
Sbjct: 429 PVLINSGYVPPFSDVLNWRSFSVIVSVEDIPNLKTILTSISPRQYLRMYRRVLKVRRHFE 488

Query: 517 VNFPPKRFDVYHMILHSVWLRRLNLGLRD 537
           VN P KRFDV+HMILHS+W+RRLN+ +R+
Sbjct: 489 VNSPAKRFDVFHMILHSIWVRRLNVKIRE 517

BLAST of Cp4.1LG10g04320 vs. Swiss-Prot
Match: GLYT1_ARATH (Probable glycosyltransferase At3g07620 OS=Arabidopsis thaliana GN=At3g07620 PE=3 SV=1)

HSP 1 Score: 456.4 bits (1173), Expect = 4.2e-127
Identity = 218/383 (56.92%), Postives = 273/383 (71.28%), Query Frame = 1

Query: 159 RIEAELRGARAAIREAMFQNQTQ------DSDFVPSGPMYWNAKTFHRSYQEMEKELKIF 218
           ++EAEL  AR  IREA     +       D D+VP G +Y N   FHRSY  MEK  KI+
Sbjct: 87  KVEAELATARVLIREAQLNYSSTTSSPLGDEDYVPHGDIYRNPYAFHRSYLLMEKMFKIY 146

Query: 219 VYEEGEPPLFHNGPCKNIYSTEGNFIHAIEMDS-RFRTNDPNKAHVFFLPLSVVMLVRFV 278
           VYEEG+PP+FH G CK+IYS EG F++ +E D  ++RT DP+KAHV+FLP SVVM++  +
Sbjct: 147 VYEEGDPPIFHYGLCKDIYSMEGLFLNFMENDVLKYRTRDPDKAHVYFLPFSVVMILHHL 206

Query: 279 YVRDSHDFTPLRRTVADYVNVIGTKYPFWNRTLGADHFMLSCHDWGPEVSKSEPHLYKNS 338
           +     D   L R +ADYV +I  KYP+WN + G DHFMLSCHDWG   +     L+ NS
Sbjct: 207 FDPVVRDKAVLERVIADYVQIISKKYPYWNTSDGFDHFMLSCHDWGHRATWYVKKLFFNS 266

Query: 339 IRVLCNANTSEGFNPSKDVSLPEINLQTGVLTGFLGGPSPSRRPILAFFAGGLHGPIRPI 398
           IRVLCNAN SE FNP KD   PEINL TG +    GG  P  R  LAFFAG  HG IRP+
Sbjct: 267 IRVLCNANISEYFNPEKDAPFPEINLLTGDINNLTGGLDPISRTTLAFFAGKSHGKIRPV 326

Query: 399 LIQQWENKDQDIRVHQYLPEGVSYIEMMRKSRFCLCPSGYEVASPRIVEAIYTGCVPVLI 458
           L+  W+ KD+DI V++ LP+G+ Y EMMRKSRFC+CPSG+EVASPR+ EAIY+GCVPVLI
Sbjct: 327 LLNHWKEKDKDILVYENLPDGLDYTEMMRKSRFCICPSGHEVASPRVPEAIYSGCVPVLI 386

Query: 459 SDHYVPPFSDVLNWKSFSVEVSVNDIPNLKKILAGISTRQYLRMYRRVVNVRRHFVVNFP 518
           S++YV PFSDVLNW+ FSV VSV +IP LK+IL  I   +Y+R+Y  V  V+RH +VN P
Sbjct: 387 SENYVLPFSDVLNWEKFSVSVSVKEIPELKRILMDIPEERYMRLYEGVKKVKRHILVNDP 446

Query: 519 PKRFDVYHMILHSVWLRRLNLGL 535
           PKR+DV++MI+HS+WLRRLN+ L
Sbjct: 447 PKRYDVFNMIIHSIWLRRLNVKL 469

BLAST of Cp4.1LG10g04320 vs. Swiss-Prot
Match: GLYT4_ARATH (Probable glycosyltransferase At5g11130 OS=Arabidopsis thaliana GN=At5g11120/At5g11130 PE=3 SV=2)

HSP 1 Score: 435.3 bits (1118), Expect = 1.0e-120
Identity = 219/460 (47.61%), Postives = 307/460 (66.74%), Query Frame = 1

Query: 83  LFNHSVSNHSVLASSSSSSPPPPPIDEMQILPPNRTRVNEEFLETAINEVTKNASAGSNY 142
           LF +S+++H+ + SS     P   +        +  R+      ++   +T N ++ S  
Sbjct: 20  LFFYSINHHNQIFSSVVDDDPSCRLSSSPQAVFSSFRIFPFRSSSSCLNITSNNNSTSEV 79

Query: 143 EPAAKARKQRRYTKLERIEAELRGARAAIREAMFQNQTQDSD--------FVPSGPMYWN 202
               +  +      +ERIE  L  ARAAIR+A  +N  +D D         V +G +Y N
Sbjct: 80  VVVEEVDEA-----VERIEEGLAMARAAIRKAGEKNLRRDRDRTNNSDVGVVSNGSVYLN 139

Query: 203 AKTFHRSYQEMEKELKIFVYEEGEPPLFHNGPCKNIYSTEGNFIHAIEM-DSRFRTNDPN 262
           A TFH+S++EMEK  KI+ Y EGE PLFH GP  NIY+ EG F+  IE  +SRF+   P 
Sbjct: 140 AFTFHQSHKEMEKRFKIWTYREGEAPLFHKGPLNNIYAIEGQFMDEIENGNSRFKAASPE 199

Query: 263 KAHVFFLPLSVVMLVRFVY-VRDSHDFTPLRRTVADYVNVIGTKYPFWNRTLGADHFMLS 322
           +A VF++P+ +V ++RFVY    S+    L+  V DY+++I  +YP+WNR+ GADHF LS
Sbjct: 200 EATVFYIPVGIVNIIRFVYRPYTSYARDRLQNIVKDYISLISNRYPYWNRSRGADHFFLS 259

Query: 323 CHDWGPEVSKSEPHLYKNSIRVLCNANTSEGFNPSKDVSLPEINLQTGVLTGFLGGPSPS 382
           CHDW P+VS  +P LYK+ IR LCNAN+SEGF P +DVSLPEIN+    L     G  P 
Sbjct: 260 CHDWAPDVSAVDPELYKHFIRALCNANSSEGFTPMRDVSLPEINIPHSQLGFVHTGEPPQ 319

Query: 383 RRPILAFFAGGLHGPIRPILIQQWENKDQDIRVHQYLPEGVSYIEMMRKSRFCLCPSGYE 442
            R +LAFFAGG HG +R IL Q W+ KD+D+ V++ LP+ ++Y +MM K++FCLCPSG+E
Sbjct: 320 NRKLLAFFAGGSHGDVRKILFQHWKEKDKDVLVYENLPKTMNYTKMMDKAKFCLCPSGWE 379

Query: 443 VASPRIVEAIYTGCVPVLISDHYVPPFSDVLNWKSFSVEVSVNDIPNLKKILAGISTRQY 502
           VASPRIVE++Y+GCVPV+I+D+YV PFSDVLNWK+FSV + ++ +P++KKIL  I+  +Y
Sbjct: 380 VASPRIVESLYSGCVPVIIADYYVLPFSDVLNWKTFSVHIPISKMPDIKKILEAITEEEY 439

Query: 503 LRMYRRVVNVRRHFVVNFPPKRFDVYHMILHSVWLRRLNL 533
           L M RRV+ VR+HFV+N P K +D+ HMI+HS+WLRRLN+
Sbjct: 440 LNMQRRVLEVRKHFVINRPSKPYDMLHMIMHSIWLRRLNV 474

BLAST of Cp4.1LG10g04320 vs. Swiss-Prot
Match: GLYT6_ARATH (Probable glycosyltransferase At5g25310 OS=Arabidopsis thaliana GN=At5g25310 PE=3 SV=2)

HSP 1 Score: 430.3 bits (1105), Expect = 3.2e-119
Identity = 205/390 (52.56%), Postives = 279/390 (71.54%), Query Frame = 1

Query: 150 KQRRYTKLERIEAELRGARAAIREAMFQ-NQTQDSDFVPSGPMYWNAKTFHRSYQEMEKE 209
           K  +  +   +E  L  ARA+I EA    N T     +P+  +Y N    +RSY EMEK 
Sbjct: 91  KPEKLNRRNLVEQGLAKARASILEASSNVNTTLFKSDLPNSEIYRNPSALYRSYLEMEKR 150

Query: 210 LKIFVYEEGEPPLFHNGPCKNIYSTEGNFIHAIEMD-SRFRTNDPNKAHVFFLPLSVVML 269
            K++VYEEGEPPL H+GPCK++Y+ EG FI  +E   ++FRT DPN+A+V+FLP SV  L
Sbjct: 151 FKVYVYEEGEPPLVHDGPCKSVYAVEGRFITEMEKRRTKFRTYDPNQAYVYFLPFSVTWL 210

Query: 270 VRFVYVRDSHDFTPLRRTVADYVNVIGTKYPFWNRTLGADHFMLSCHDWGPEVSKSEPHL 329
           VR++Y  +S D  PL+  V+DY+ ++ T +PFWNRT GADHFML+CHDWGP  S++   L
Sbjct: 211 VRYLYEGNS-DAKPLKTFVSDYIRLVSTNHPFWNRTNGADHFMLTCHDWGPLTSQANRDL 270

Query: 330 YKNSIRVLCNANTSEGFNPSKDVSLPEINLQTGVLTGFLGGP---SPSRRPILAFFAGGL 389
           +  SIRV+CNAN+SEGFNP+KDV+LPEI L  G +   L      S S RP L FFAGG+
Sbjct: 271 FNTSIRVMCNANSSEGFNPTKDVTLPEIKLYGGEVDHKLRLSKTLSASPRPYLGFFAGGV 330

Query: 390 HGPIRPILIQQWENKDQDIRVHQYLPEGVSYIEMMRKSRFCLCPSGYEVASPRIVEAIYT 449
           HGP+RPIL++ W+ +D D+ V++YLP+ ++Y + MR S+FC CPSGYEVASPR++EAIY+
Sbjct: 331 HGPVRPILLKHWKQRDLDMPVYEYLPKHLNYYDFMRSSKFCFCPSGYEVASPRVIEAIYS 390

Query: 450 GCVPVLISDHYVPPFSDVLNWKSFSVEVSVNDIPNLKKILAGISTRQYLRMYRRVVNVRR 509
            C+PV++S ++V PF+DVL W++FSV V V++IP LK+IL  IS  +Y  +   +  VRR
Sbjct: 391 ECIPVILSVNFVLPFTDVLRWETFSVLVDVSEIPRLKEILMSISNEKYEWLKSNLRYVRR 450

Query: 510 HFVVNFPPKRFDVYHMILHSVWLRRLNLGL 535
           HF +N PP+RFD +H+ LHS+WLRRLNL L
Sbjct: 451 HFELNDPPQRFDAFHLTLHSIWLRRLNLKL 479

BLAST of Cp4.1LG10g04320 vs. Swiss-Prot
Match: GLYT2_ARATH (Probable glycosyltransferase At3g42180 OS=Arabidopsis thaliana GN=At3g42180 PE=2 SV=2)

HSP 1 Score: 422.5 bits (1085), Expect = 6.7e-117
Identity = 214/418 (51.20%), Postives = 284/418 (67.94%), Query Frame = 1

Query: 130 NEVTKNASAGSNYEPAAKARKQRRYTKLERIEAELRGARAAIREAM-FQNQTQDSD---F 189
           N +  ++S+ S Y P    +++   + LE+ E ELR ARAAIR A+ F+N T + +   +
Sbjct: 54  NALQSSSSSSSLYSPPITVKRR---SNLEKREEELRKARAAIRRAVRFKNCTSNEEVITY 113

Query: 190 VPSGPMYWNAKTFHRSYQEMEKELKIFVYEEGEPPLFHNGPCKNIYSTEGNFIHAIEM-- 249
           +P+G +Y N+  FH+S+ EM K  K++ Y+EGE PL H+GP  +IY  EG FI  +    
Sbjct: 114 IPTGQIYRNSFAFHQSHIEMMKTFKVWSYKEGEQPLVHDGPVNDIYGIEGQFIDELSYVM 173

Query: 250 ---DSRFRTNDPNKAHVFFLPLSVVMLVRFVY--VRDSHDFTPLR--RTVADYVNVIGTK 309
                RFR + P +AH FFLP SV  +V +VY  +    DF   R  R   DYV+V+  K
Sbjct: 174 GGPSGRFRASRPEEAHAFFLPFSVANIVHYVYQPITSPADFNRARLHRIFNDYVDVVAHK 233

Query: 310 YPFWNRTLGADHFMLSCHDWGPEVSKSEPHLYKNSIRVLCNANTSEGFNPSKDVSLPEIN 369
           +PFWN++ GADHFM+SCHDW P+V  S+P  +KN +R LCNANTSEGF  + D S+PEIN
Sbjct: 234 HPFWNQSNGADHFMVSCHDWAPDVPDSKPEFFKNFMRGLCNANTSEGFRRNIDFSIPEIN 293

Query: 370 LQTGVLTGFLGGPSPSRRPILAFFAGGLHGPIRPILIQQWENKDQDIRVHQYLPEGVSYI 429
           +    L     G +P  R ILAFFAG  HG IR +L   W+ KD+D++V+ +L +G +Y 
Sbjct: 294 IPKRKLKPPFMGQNPENRTILAFFAGRAHGYIREVLFSHWKGKDKDVQVYDHLTKGQNYH 353

Query: 430 EMMRKSRFCLCPSGYEVASPRIVEAIYTGCVPVLISDHYVPPFSDVLNWKSFSVEVSVND 489
           E++  S+FCLCPSGYEVASPR VEAIY+GCVPV+ISD+Y  PF+DVL+W  FSVE+ V+ 
Sbjct: 354 ELIGHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLPFNDVLDWSKFSVEIPVDK 413

Query: 490 IPNLKKILAGISTRQYLRMYRRVVNVRRHFVVNFPPKRFDVYHMILHSVWLRRLNLGL 535
           IP++KKIL  I   +YLRMYR V+ VRRHFVVN P + FDV HMILHSVWLRRLN+ L
Sbjct: 414 IPDIKKILQEIPHDKYLRMYRNVMKVRRHFVVNRPAQPFDVIHMILHSVWLRRLNIRL 468

BLAST of Cp4.1LG10g04320 vs. TrEMBL
Match: A0A0A0KKC9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G449280 PE=4 SV=1)

HSP 1 Score: 805.4 bits (2079), Expect = 4.1e-230
Identity = 412/542 (76.01%), Postives = 445/542 (82.10%), Query Frame = 1

Query: 1   MKLPLFMIPLIA-ACLFVILGPPFLSDWLTFLDRYASFRSSLSSSSSEMNSHSNATAALP 60
           MK P+F+IPLIA +C   +LGP F +DW  F D+YAS + SLSSS      HSN +    
Sbjct: 1   MKFPVFVIPLIAVSCFLAVLGPRFNTDWTIFFDKYASTQPSLSSSFKREGFHSNTS---- 60

Query: 61  VPVTVDGQTVEQQQQQQQQQQQPLFNHSVSNHSVLASSSSSSPPPPPIDEMQILP--PNR 120
           VPV  +                     + +N SVL   S SSPPPP  D  Q L   PNR
Sbjct: 61  VPVATE-------------------EAAAANVSVL---SFSSPPPPVDDGKQSLQLHPNR 120

Query: 121 TRVNEEFLETA--INEVTKNASAGSNYEPAAK--ARKQRRYTKLERIEAELRGARAAIRE 180
           TRVNE+  ETA  INEV +  S  S+YE A K  AR+QR YTKLERIEA LR ARAAIRE
Sbjct: 121 TRVNEDLGETATTINEVIRKVSNESSYESAVKVRARRQREYTKLERIEAGLRRARAAIRE 180

Query: 181 AMFQNQTQDSDFVPSGPMYWNAKTFHRSYQEMEKELKIFVYEEGEPPLFHNGPCKNIYST 240
           A F NQTQD DFVPSGPMYWN+K FHRSY EMEKE+KIFVYEEGEPPLFHNGPCK+IYST
Sbjct: 181 AKFLNQTQDPDFVPSGPMYWNSKAFHRSYLEMEKEMKIFVYEEGEPPLFHNGPCKSIYST 240

Query: 241 EGNFIHAIEMDSRFRTNDPNKAHVFFLPLSVVMLVRFVYVRDSHDFTPLRRTVADYVNVI 300
           EGNFIHAIEMDS+FRT DPNKAHVFFLPLSV MLVRFVYV DSHDFTP+R TV DY+NVI
Sbjct: 241 EGNFIHAIEMDSQFRTKDPNKAHVFFLPLSVAMLVRFVYVHDSHDFTPIRHTVVDYINVI 300

Query: 301 GTKYPFWNRTLGADHFMLSCHDWGPEVSKSEPHLYKNSIRVLCNANTSEGFNPSKDVSLP 360
           GTKYPFWNR+LGADHFMLSCHDWGPE SKS P+LYKNSIRVLCNANTSEGFNPSKDVS P
Sbjct: 301 GTKYPFWNRSLGADHFMLSCHDWGPEASKSVPNLYKNSIRVLCNANTSEGFNPSKDVSFP 360

Query: 361 EINLQTGVLTGFLGGPSPSRRPILAFFAGGLHGPIRPILIQQWENKDQDIRVHQYLPEGV 420
           EINLQTG LTGFLGGPSPS RPI+AFFAGGLHGPIRPILIQ+WEN+DQDI+VHQYLP+GV
Sbjct: 361 EINLQTGHLTGFLGGPSPSHRPIMAFFAGGLHGPIRPILIQRWENQDQDIQVHQYLPKGV 420

Query: 421 SYIEMMRKSRFCLCPSGYEVASPRIVEAIYTGCVPVLISDHYVPPFSDVLNWKSFSVEVS 480
           SYI+MMRKS+FCLCPSGYEVASPRIVEAIYTGCVPVLISDHYVPPFSDV+NWKSFSVEVS
Sbjct: 421 SYIDMMRKSKFCLCPSGYEVASPRIVEAIYTGCVPVLISDHYVPPFSDVINWKSFSVEVS 480

Query: 481 VNDIPNLKKILAGISTRQYLRMYRRVVNVRRHFVVNFPPKRFDVYHMILHSVWLRRLNLG 536
           V+DIPNLK IL GISTRQYLRMYRRVV VRRHF VN PPKR+DVYHMILHSVWLRRLNL 
Sbjct: 481 VDDIPNLKTILTGISTRQYLRMYRRVVKVRRHFEVNSPPKRYDVYHMILHSVWLRRLNLR 516

BLAST of Cp4.1LG10g04320 vs. TrEMBL
Match: M5VQP8_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006842mg PE=4 SV=1)

HSP 1 Score: 655.6 bits (1690), Expect = 5.2e-185
Identity = 300/390 (76.92%), Postives = 338/390 (86.67%), Query Frame = 1

Query: 147 KARKQRRYTKLERIEAELRGARAAIREAMFQNQTQDSDFVPSGPMYWNAKTFHRSYQEME 206
           KA  QR+ T L  +EA LR ARAAIREA F NQTQD D++P+GPMYWNA  F RSY EME
Sbjct: 2   KAEGQRKNTNLGWLEARLRRARAAIREAKFGNQTQDVDYIPNGPMYWNANAFQRSYLEME 61

Query: 207 KELKIFVYEEGEPPLFHNGPCKNIYSTEGNFIHAIEMDSRFRTNDPNKAHVFFLPLSVVM 266
           K  K+FVY EGEPPLFHNGPCK+IYS EGNFIH IE++ +FRT DP KAHV+FLP SV M
Sbjct: 62  KRFKVFVYGEGEPPLFHNGPCKSIYSMEGNFIHEIEVNKQFRTRDPEKAHVYFLPFSVTM 121

Query: 267 LVRFVYVRDSHDFTPLRRTVADYVNVIGTKYPFWNRTLGADHFMLSCHDWGPEVSKSEPH 326
           LVRFVYVRDSHDF P+R+TV DYVN++  KYP+WNR+LGADHFML+CHDWGPE S S+PH
Sbjct: 122 LVRFVYVRDSHDFGPIRQTVRDYVNIVSGKYPYWNRSLGADHFMLACHDWGPETSNSDPH 181

Query: 327 LYKNSIRVLCNANTSEGFNPSKDVSLPEINLQTGVLTGFLGGPSPSRRPILAFFAGGLHG 386
           L KNSIRVLCNANTSEGFNPSKDVS PEINLQTG   GFLGGPSP  R ILAFFAGG+HG
Sbjct: 182 LRKNSIRVLCNANTSEGFNPSKDVSFPEINLQTGDTHGFLGGPSPRLRSILAFFAGGVHG 241

Query: 387 PIRPILIQQWENKDQDIRVHQYLPEGVSYIEMMRKSRFCLCPSGYEVASPRIVEAIYTGC 446
           PIRP+L++ WENKD+D+RVHQYLP+G+SY +MMR S+FCLCPSGYEVASPR+VEAIYTGC
Sbjct: 242 PIRPVLLEHWENKDEDLRVHQYLPKGISYYDMMRHSKFCLCPSGYEVASPRVVEAIYTGC 301

Query: 447 VPVLISDHYVPPFSDVLNWKSFSVEVSVNDIPNLKKILAGISTRQYLRMYRRVVNVRRHF 506
           VPVLISDHYVPPFSDVLNWKSFSVEV V++IPNLK IL  IST+QY+RM RRVV VRRHF
Sbjct: 302 VPVLISDHYVPPFSDVLNWKSFSVEVKVSEIPNLKNILMSISTKQYIRMQRRVVQVRRHF 361

Query: 507 VVNFPPKRFDVYHMILHSVWLRRLNLGLRD 537
            VN PPKRFDV+HMILHS+WLRRLN+ + D
Sbjct: 362 EVNSPPKRFDVFHMILHSIWLRRLNVRVHD 391

BLAST of Cp4.1LG10g04320 vs. TrEMBL
Match: W9RJ02_9ROSA (Putative glycosyltransferase OS=Morus notabilis GN=L484_022635 PE=4 SV=1)

HSP 1 Score: 651.0 bits (1678), Expect = 1.3e-183
Identity = 311/454 (68.50%), Postives = 364/454 (80.18%), Query Frame = 1

Query: 87  SVSNHSVLASSSSSSPPPPPIDEMQILPP----NRTRVNEEFLETAINEVTKNASAGSNY 146
           ++S+ S+   SSS   PP  I+ +   P     N T+ N   +  +  ++       S  
Sbjct: 13  AISDDSLFNRSSS---PPLAIEAISPSPSQEHSNDTKENSFNISASEPQIVVPLVNESRV 72

Query: 147 EPAAKARKQRRYTKLERIEAELRGARAAIREAMFQNQTQDSDFVPSGPMYWNAKTFHRSY 206
            P  KAR+ R Y+ LER+EA L  AR+AI EA   N+TQD ++VP+GPMYWN+K FHRSY
Sbjct: 73  VPLVKARRHREYSDLERLEARLMKARSAISEAKVGNETQDPEYVPTGPMYWNSKAFHRSY 132

Query: 207 QEMEKELKIFVYEEGEPPLFHNGPCKNIYSTEGNFIHAIEMDSRFRTNDPNKAHVFFLPL 266
            EMEKE K+FVYEEGEPPLFHNGPCK+IYS EGNFIH IEM+S+FRT DPNKAHVFFLP 
Sbjct: 133 IEMEKEFKVFVYEEGEPPLFHNGPCKSIYSMEGNFIHEIEMNSQFRTKDPNKAHVFFLPF 192

Query: 267 SVVMLVRFVYVRDSHDFTPLRRTVADYVNVIGTKYPFWNRTLGADHFMLSCHDWGPEVSK 326
           SV MLVR+VYVRDSHDF P+++TV DYVNV+  K+PFWNR+L ADHFMLSCHDWGPE S+
Sbjct: 193 SVTMLVRYVYVRDSHDFHPIKQTVIDYVNVVSEKHPFWNRSLAADHFMLSCHDWGPEASR 252

Query: 327 SEPHLYKNSIRVLCNANTSEGFNPSKDVSLPEINLQTGVLTGFLGGPSPSRRPILAFFAG 386
           S   L++NSIRVLCNANTSEGFNPSKDVS PEINL +G   G LGGPS SRRP LAFFAG
Sbjct: 253 SNRFLHQNSIRVLCNANTSEGFNPSKDVSFPEINLLSGATDGLLGGPSASRRPNLAFFAG 312

Query: 387 GLHGPIRPILIQQWENKDQDIRVHQYLPEGVSYIEMMRKSRFCLCPSGYEVASPRIVEAI 446
           G+HGPIRPIL++ WENKD+D+++ QYLP+GVSY +M+RKS++C+CPSGYEVASPRIVEAI
Sbjct: 313 GVHGPIRPILLEHWENKDEDMKIQQYLPKGVSYYDMLRKSKYCICPSGYEVASPRIVEAI 372

Query: 447 YTGCVPVLISDHYVPPFSDVLNWKSFSVEVSVNDIPNLKKILAGISTRQYLRMYRRVVNV 506
           YTGCVPVLISDHYVPPFSDVLNWKSFSVEV V DIPNLK IL  IS RQY+RM RRV+ V
Sbjct: 373 YTGCVPVLISDHYVPPFSDVLNWKSFSVEVPVKDIPNLKTILMSISPRQYIRMQRRVLKV 432

Query: 507 RRHFVVNFPPKRFDVYHMILHSVWLRRLNLGLRD 537
           RRHF VN PPKRFDV+HMILHSVWLRRLN+ + D
Sbjct: 433 RRHFEVNSPPKRFDVFHMILHSVWLRRLNIRIHD 463

BLAST of Cp4.1LG10g04320 vs. TrEMBL
Match: I1NAG9_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_19G189300 PE=4 SV=1)

HSP 1 Score: 647.5 bits (1669), Expect = 1.4e-182
Identity = 331/543 (60.96%), Postives = 394/543 (72.56%), Query Frame = 1

Query: 1   MKLPLFMIPLIAACLFVILGPPFLSDWL-------TFLDRYASFRSSLSSSSSEMNSHSN 60
           MKL L M+PL+     V +  P  S WL        +L+   +  SS SS +  + + S 
Sbjct: 15  MKLFLLMVPLVLVAGLVSVLGPNPSSWLFSANAPVLYLEGSVTSSSSTSSGAVTVTNPSE 74

Query: 61  ATAALPVPVTVDGQTVEQQQQQQQQQQQPLFNHSVSNHSVLASSSSSSPPPPPIDEMQIL 120
                 + V      VE +  ++       FNHS +               PP     I 
Sbjct: 75  VKQREGLVVVA----VENRGGEKVISDDTDFNHSST---------------PPFSVQAIQ 134

Query: 121 PPNRTRVNEEFLETAINEVTKNASAGSNYEPAAKARKQRRYTKLERIEAELRGARAAIRE 180
            P +   +E+ +      VT       +Y P  + + QR+++ L+R EA LR ARAAIRE
Sbjct: 135 TPQQPNKDEQNVSQLWANVT---GVNESYLPPERPKLQRKFSILDRTEAGLRQARAAIRE 194

Query: 181 AMFQNQTQDSDFVPSGPMYWNAKTFHRSYQEMEKELKIFVYEEGEPPLFHNGPCKNIYST 240
           A   NQTQD D+VP GPMY NA  FHRSY EMEK+ K+FVYEEGEPP+FHNGPCK+IYS 
Sbjct: 195 ARNGNQTQDIDYVPVGPMYNNANAFHRSYLEMEKQFKVFVYEEGEPPVFHNGPCKSIYSM 254

Query: 241 EGNFIHAIEMDSRFRTNDPNKAHVFFLPLSVVMLVRFVYVRDSHDFTPLRRTVADYVNVI 300
           EGNFIHAIEM+ +FRT DP +AHVFFLP SV MLV+FVYVRDSHDF P+++TV DYVNVI
Sbjct: 255 EGNFIHAIEMNDQFRTRDPEEAHVFFLPFSVAMLVQFVYVRDSHDFGPIKKTVTDYVNVI 314

Query: 301 GTKYPFWNRTLGADHFMLSCHDWGPEVSKSEPHLYKNSIRVLCNANTSEGFNPSKDVSLP 360
           G +YP+WNR+LGADHF L+CHDWGPE S+S P+L KNSIRVLCNANTSEGF PSKDVS P
Sbjct: 315 GGRYPYWNRSLGADHFYLACHDWGPETSRSIPNLNKNSIRVLCNANTSEGFKPSKDVSFP 374

Query: 361 EINLQTGVLTGFLGGPSPSRRPILAFFAGGLHGPIRPILIQQWENKDQDIRVHQYLPEGV 420
           EINLQTG + GF+GGPS SRRP+LAFFAGGLHGPIRP+L++ WENKD+DI+VH+YLP+GV
Sbjct: 375 EINLQTGSINGFIGGPSASRRPLLAFFAGGLHGPIRPVLLEHWENKDEDIQVHKYLPKGV 434

Query: 421 SYIEMMRKSRFCLCPSGYEVASPRIVEAIYTGCVPVLISDHYVPPFSDVLNWKSFSVEVS 480
           SY EM+RKS+FCLCPSGYEVASPR+VEAIYTGCVPVLISDHYVPPF+DVLNWKSFSVEVS
Sbjct: 435 SYYEMLRKSKFCLCPSGYEVASPRVVEAIYTGCVPVLISDHYVPPFNDVLNWKSFSVEVS 494

Query: 481 VNDIPNLKKILAGISTRQYLRMYRRVVNVRRHFVVNFPPKRFDVYHMILHSVWLRRLNLG 537
           V DIP LK+IL  IS RQY+RM RRV  VRRHF V+ PPKR+DV+HMILHSVWLRRLN  
Sbjct: 495 VKDIPRLKEILLSISPRQYIRMQRRVGQVRRHFEVHSPPKRYDVFHMILHSVWLRRLNFR 535

BLAST of Cp4.1LG10g04320 vs. TrEMBL
Match: A0A0B2NZ15_GLYSO (Putative glycosyltransferase OS=Glycine soja GN=glysoja_002299 PE=4 SV=1)

HSP 1 Score: 647.1 bits (1668), Expect = 1.9e-182
Identity = 330/543 (60.77%), Postives = 394/543 (72.56%), Query Frame = 1

Query: 1   MKLPLFMIPLIAACLFVILGPPFLSDWL-------TFLDRYASFRSSLSSSSSEMNSHSN 60
           MKL L M+PL+     V +  P  S WL        +L+   +  SS SS +  + + S 
Sbjct: 15  MKLFLLMVPLVLVAGLVSVLGPNPSSWLFSANAPVLYLEGSVTSSSSTSSGAVTVTNPSE 74

Query: 61  ATAALPVPVTVDGQTVEQQQQQQQQQQQPLFNHSVSNHSVLASSSSSSPPPPPIDEMQIL 120
                 + V      VE +  ++       FNHS +               PP     I 
Sbjct: 75  VKQREGLVVVA----VENRGGEKVISDDTDFNHSST---------------PPFSVQAIQ 134

Query: 121 PPNRTRVNEEFLETAINEVTKNASAGSNYEPAAKARKQRRYTKLERIEAELRGARAAIRE 180
            P +   +E+ +      VT       +Y P  + + QR+++ L+R EA LR ARAAIRE
Sbjct: 135 TPQQPNKDEQNVSQLWANVT---GVNESYLPPERPKLQRKFSILDRTEAGLRQARAAIRE 194

Query: 181 AMFQNQTQDSDFVPSGPMYWNAKTFHRSYQEMEKELKIFVYEEGEPPLFHNGPCKNIYST 240
           A   NQTQD D+VP GPMY NA  FHRSY EMEK+ K+FVYEEGEPP+FHNGPCK+IYS 
Sbjct: 195 ARNGNQTQDIDYVPVGPMYNNANAFHRSYLEMEKQFKVFVYEEGEPPVFHNGPCKSIYSM 254

Query: 241 EGNFIHAIEMDSRFRTNDPNKAHVFFLPLSVVMLVRFVYVRDSHDFTPLRRTVADYVNVI 300
           EGNF+HAIEM+ +FRT DP +AHVFFLP SV MLV+FVYVRDSHDF P+++TV DYVNVI
Sbjct: 255 EGNFVHAIEMNDQFRTRDPEEAHVFFLPFSVAMLVQFVYVRDSHDFGPIKKTVTDYVNVI 314

Query: 301 GTKYPFWNRTLGADHFMLSCHDWGPEVSKSEPHLYKNSIRVLCNANTSEGFNPSKDVSLP 360
           G +YP+WNR+LGADHF L+CHDWGPE S+S P+L KNSIRVLCNANTSEGF PSKDVS P
Sbjct: 315 GGRYPYWNRSLGADHFYLACHDWGPETSRSIPNLNKNSIRVLCNANTSEGFKPSKDVSFP 374

Query: 361 EINLQTGVLTGFLGGPSPSRRPILAFFAGGLHGPIRPILIQQWENKDQDIRVHQYLPEGV 420
           EINLQTG + GF+GGPS SRRP+LAFFAGGLHGPIRP+L++ WENKD+DI+VH+YLP+GV
Sbjct: 375 EINLQTGSINGFIGGPSASRRPLLAFFAGGLHGPIRPVLLEHWENKDEDIQVHKYLPKGV 434

Query: 421 SYIEMMRKSRFCLCPSGYEVASPRIVEAIYTGCVPVLISDHYVPPFSDVLNWKSFSVEVS 480
           SY EM+RKS+FCLCPSGYEVASPR+VEAIYTGCVPVLISDHYVPPF+DVLNWKSFSVEVS
Sbjct: 435 SYYEMLRKSKFCLCPSGYEVASPRVVEAIYTGCVPVLISDHYVPPFNDVLNWKSFSVEVS 494

Query: 481 VNDIPNLKKILAGISTRQYLRMYRRVVNVRRHFVVNFPPKRFDVYHMILHSVWLRRLNLG 537
           V DIP LK+IL  IS RQY+RM RRV  VRRHF V+ PPKR+DV+HMILHSVWLRRLN  
Sbjct: 495 VKDIPRLKEILLSISPRQYIRMQRRVGQVRRHFEVHSPPKRYDVFHMILHSVWLRRLNFR 535

BLAST of Cp4.1LG10g04320 vs. TAIR10
Match: AT5G03795.1 (AT5G03795.1 Exostosin family protein)

HSP 1 Score: 609.8 bits (1571), Expect = 1.7e-174
Identity = 287/449 (63.92%), Postives = 355/449 (79.06%), Query Frame = 1

Query: 97  SSSSSPPPPPI--DEMQILPPNRTRVNEEFLETAINEVTK----NASAGSNYEPAAKA-- 156
           S++ +P P P+  + +  LP +      E ++   N   +    N +A SN   +  +  
Sbjct: 69  STAPAPAPSPLLPEILPSLPASSLSTKVESIQGDYNRTIQLNMINVTATSNNVSSTASLE 128

Query: 157 -RKQRRYTKLERIEAELRGARAAIREAMFQNQTQDSDFVPSGPMYWNAKTFHRSYQEMEK 216
            +K+R  + LE+IE +L+ ARA+I+ A   +   D D+VP GPMYWNAK FHRSY EMEK
Sbjct: 129 PKKRRVLSNLEKIEFKLQKARASIKAASMDDPVDDPDYVPLGPMYWNAKVFHRSYLEMEK 188

Query: 217 ELKIFVYEEGEPPLFHNGPCKNIYSTEGNFIHAIEMDSRFRTNDPNKAHVFFLPLSVVML 276
           + KI+VY+EGEPPLFH+GPCK+IYS EG+FI+ IE D+RFRTN+P+KAHVF+LP SVV +
Sbjct: 189 QFKIYVYKEGEPPLFHDGPCKSIYSMEGSFIYEIETDTRFRTNNPDKAHVFYLPFSVVKM 248

Query: 277 VRFVYVRDSHDFTPLRRTVADYVNVIGTKYPFWNRTLGADHFMLSCHDWGPEVSKSEPHL 336
           VR+VY R+S DF+P+R TV DY+N++G KYP+WNR++GADHF+LSCHDWGPE S S PHL
Sbjct: 249 VRYVYERNSRDFSPIRNTVKDYINLVGDKYPYWNRSIGADHFILSCHDWGPEASFSHPHL 308

Query: 337 YKNSIRVLCNANTSEGFNPSKDVSLPEINLQTGVLTGFLGGPSPSRRPILAFFAGGLHGP 396
             NSIR LCNANTSE F P KDVS+PEINL+TG LTG +GGPSPS RPILAFFAGG+HGP
Sbjct: 309 GHNSIRALCNANTSERFKPRKDVSIPEINLRTGSLTGLVGGPSPSSRPILAFFAGGVHGP 368

Query: 397 IRPILIQQWENKDQDIRVHQYLPEGVSYIEMMRKSRFCLCPSGYEVASPRIVEAIYTGCV 456
           +RP+L+Q WENKD DIRVH+YLP G SY +MMR S+FC+CPSGYEVASPRIVEA+Y+GCV
Sbjct: 369 VRPVLLQHWENKDNDIRVHKYLPRGTSYSDMMRNSKFCICPSGYEVASPRIVEALYSGCV 428

Query: 457 PVLISDHYVPPFSDVLNWKSFSVEVSVNDIPNLKKILAGISTRQYLRMYRRVVNVRRHFV 516
           PVLI+  YVPPFSDVLNW+SFSV VSV DIPNLK IL  IS RQYLRMYRRV+ VRRHF 
Sbjct: 429 PVLINSGYVPPFSDVLNWRSFSVIVSVEDIPNLKTILTSISPRQYLRMYRRVLKVRRHFE 488

Query: 517 VNFPPKRFDVYHMILHSVWLRRLNLGLRD 537
           VN P KRFDV+HMILHS+W+RRLN+ +R+
Sbjct: 489 VNSPAKRFDVFHMILHSIWVRRLNVKIRE 517

BLAST of Cp4.1LG10g04320 vs. TAIR10
Match: AT3G07620.1 (AT3G07620.1 Exostosin family protein)

HSP 1 Score: 456.4 bits (1173), Expect = 2.4e-128
Identity = 218/383 (56.92%), Postives = 273/383 (71.28%), Query Frame = 1

Query: 159 RIEAELRGARAAIREAMFQNQTQ------DSDFVPSGPMYWNAKTFHRSYQEMEKELKIF 218
           ++EAEL  AR  IREA     +       D D+VP G +Y N   FHRSY  MEK  KI+
Sbjct: 87  KVEAELATARVLIREAQLNYSSTTSSPLGDEDYVPHGDIYRNPYAFHRSYLLMEKMFKIY 146

Query: 219 VYEEGEPPLFHNGPCKNIYSTEGNFIHAIEMDS-RFRTNDPNKAHVFFLPLSVVMLVRFV 278
           VYEEG+PP+FH G CK+IYS EG F++ +E D  ++RT DP+KAHV+FLP SVVM++  +
Sbjct: 147 VYEEGDPPIFHYGLCKDIYSMEGLFLNFMENDVLKYRTRDPDKAHVYFLPFSVVMILHHL 206

Query: 279 YVRDSHDFTPLRRTVADYVNVIGTKYPFWNRTLGADHFMLSCHDWGPEVSKSEPHLYKNS 338
           +     D   L R +ADYV +I  KYP+WN + G DHFMLSCHDWG   +     L+ NS
Sbjct: 207 FDPVVRDKAVLERVIADYVQIISKKYPYWNTSDGFDHFMLSCHDWGHRATWYVKKLFFNS 266

Query: 339 IRVLCNANTSEGFNPSKDVSLPEINLQTGVLTGFLGGPSPSRRPILAFFAGGLHGPIRPI 398
           IRVLCNAN SE FNP KD   PEINL TG +    GG  P  R  LAFFAG  HG IRP+
Sbjct: 267 IRVLCNANISEYFNPEKDAPFPEINLLTGDINNLTGGLDPISRTTLAFFAGKSHGKIRPV 326

Query: 399 LIQQWENKDQDIRVHQYLPEGVSYIEMMRKSRFCLCPSGYEVASPRIVEAIYTGCVPVLI 458
           L+  W+ KD+DI V++ LP+G+ Y EMMRKSRFC+CPSG+EVASPR+ EAIY+GCVPVLI
Sbjct: 327 LLNHWKEKDKDILVYENLPDGLDYTEMMRKSRFCICPSGHEVASPRVPEAIYSGCVPVLI 386

Query: 459 SDHYVPPFSDVLNWKSFSVEVSVNDIPNLKKILAGISTRQYLRMYRRVVNVRRHFVVNFP 518
           S++YV PFSDVLNW+ FSV VSV +IP LK+IL  I   +Y+R+Y  V  V+RH +VN P
Sbjct: 387 SENYVLPFSDVLNWEKFSVSVSVKEIPELKRILMDIPEERYMRLYEGVKKVKRHILVNDP 446

Query: 519 PKRFDVYHMILHSVWLRRLNLGL 535
           PKR+DV++MI+HS+WLRRLN+ L
Sbjct: 447 PKRYDVFNMIIHSIWLRRLNVKL 469

BLAST of Cp4.1LG10g04320 vs. TAIR10
Match: AT5G11130.1 (AT5G11130.1 Exostosin family protein)

HSP 1 Score: 435.3 bits (1118), Expect = 5.6e-122
Identity = 219/460 (47.61%), Postives = 307/460 (66.74%), Query Frame = 1

Query: 83  LFNHSVSNHSVLASSSSSSPPPPPIDEMQILPPNRTRVNEEFLETAINEVTKNASAGSNY 142
           LF +S+++H+ + SS     P   +        +  R+      ++   +T N ++ S  
Sbjct: 20  LFFYSINHHNQIFSSVVDDDPSCRLSSSPQAVFSSFRIFPFRSSSSCLNITSNNNSTSEV 79

Query: 143 EPAAKARKQRRYTKLERIEAELRGARAAIREAMFQNQTQDSD--------FVPSGPMYWN 202
               +  +      +ERIE  L  ARAAIR+A  +N  +D D         V +G +Y N
Sbjct: 80  VVVEEVDEA-----VERIEEGLAMARAAIRKAGEKNLRRDRDRTNNSDVGVVSNGSVYLN 139

Query: 203 AKTFHRSYQEMEKELKIFVYEEGEPPLFHNGPCKNIYSTEGNFIHAIEM-DSRFRTNDPN 262
           A TFH+S++EMEK  KI+ Y EGE PLFH GP  NIY+ EG F+  IE  +SRF+   P 
Sbjct: 140 AFTFHQSHKEMEKRFKIWTYREGEAPLFHKGPLNNIYAIEGQFMDEIENGNSRFKAASPE 199

Query: 263 KAHVFFLPLSVVMLVRFVY-VRDSHDFTPLRRTVADYVNVIGTKYPFWNRTLGADHFMLS 322
           +A VF++P+ +V ++RFVY    S+    L+  V DY+++I  +YP+WNR+ GADHF LS
Sbjct: 200 EATVFYIPVGIVNIIRFVYRPYTSYARDRLQNIVKDYISLISNRYPYWNRSRGADHFFLS 259

Query: 323 CHDWGPEVSKSEPHLYKNSIRVLCNANTSEGFNPSKDVSLPEINLQTGVLTGFLGGPSPS 382
           CHDW P+VS  +P LYK+ IR LCNAN+SEGF P +DVSLPEIN+    L     G  P 
Sbjct: 260 CHDWAPDVSAVDPELYKHFIRALCNANSSEGFTPMRDVSLPEINIPHSQLGFVHTGEPPQ 319

Query: 383 RRPILAFFAGGLHGPIRPILIQQWENKDQDIRVHQYLPEGVSYIEMMRKSRFCLCPSGYE 442
            R +LAFFAGG HG +R IL Q W+ KD+D+ V++ LP+ ++Y +MM K++FCLCPSG+E
Sbjct: 320 NRKLLAFFAGGSHGDVRKILFQHWKEKDKDVLVYENLPKTMNYTKMMDKAKFCLCPSGWE 379

Query: 443 VASPRIVEAIYTGCVPVLISDHYVPPFSDVLNWKSFSVEVSVNDIPNLKKILAGISTRQY 502
           VASPRIVE++Y+GCVPV+I+D+YV PFSDVLNWK+FSV + ++ +P++KKIL  I+  +Y
Sbjct: 380 VASPRIVESLYSGCVPVIIADYYVLPFSDVLNWKTFSVHIPISKMPDIKKILEAITEEEY 439

Query: 503 LRMYRRVVNVRRHFVVNFPPKRFDVYHMILHSVWLRRLNL 533
           L M RRV+ VR+HFV+N P K +D+ HMI+HS+WLRRLN+
Sbjct: 440 LNMQRRVLEVRKHFVINRPSKPYDMLHMIMHSIWLRRLNV 474

BLAST of Cp4.1LG10g04320 vs. TAIR10
Match: AT5G25310.1 (AT5G25310.1 Exostosin family protein)

HSP 1 Score: 430.3 bits (1105), Expect = 1.8e-120
Identity = 205/390 (52.56%), Postives = 279/390 (71.54%), Query Frame = 1

Query: 150 KQRRYTKLERIEAELRGARAAIREAMFQ-NQTQDSDFVPSGPMYWNAKTFHRSYQEMEKE 209
           K  +  +   +E  L  ARA+I EA    N T     +P+  +Y N    +RSY EMEK 
Sbjct: 91  KPEKLNRRNLVEQGLAKARASILEASSNVNTTLFKSDLPNSEIYRNPSALYRSYLEMEKR 150

Query: 210 LKIFVYEEGEPPLFHNGPCKNIYSTEGNFIHAIEMD-SRFRTNDPNKAHVFFLPLSVVML 269
            K++VYEEGEPPL H+GPCK++Y+ EG FI  +E   ++FRT DPN+A+V+FLP SV  L
Sbjct: 151 FKVYVYEEGEPPLVHDGPCKSVYAVEGRFITEMEKRRTKFRTYDPNQAYVYFLPFSVTWL 210

Query: 270 VRFVYVRDSHDFTPLRRTVADYVNVIGTKYPFWNRTLGADHFMLSCHDWGPEVSKSEPHL 329
           VR++Y  +S D  PL+  V+DY+ ++ T +PFWNRT GADHFML+CHDWGP  S++   L
Sbjct: 211 VRYLYEGNS-DAKPLKTFVSDYIRLVSTNHPFWNRTNGADHFMLTCHDWGPLTSQANRDL 270

Query: 330 YKNSIRVLCNANTSEGFNPSKDVSLPEINLQTGVLTGFLGGP---SPSRRPILAFFAGGL 389
           +  SIRV+CNAN+SEGFNP+KDV+LPEI L  G +   L      S S RP L FFAGG+
Sbjct: 271 FNTSIRVMCNANSSEGFNPTKDVTLPEIKLYGGEVDHKLRLSKTLSASPRPYLGFFAGGV 330

Query: 390 HGPIRPILIQQWENKDQDIRVHQYLPEGVSYIEMMRKSRFCLCPSGYEVASPRIVEAIYT 449
           HGP+RPIL++ W+ +D D+ V++YLP+ ++Y + MR S+FC CPSGYEVASPR++EAIY+
Sbjct: 331 HGPVRPILLKHWKQRDLDMPVYEYLPKHLNYYDFMRSSKFCFCPSGYEVASPRVIEAIYS 390

Query: 450 GCVPVLISDHYVPPFSDVLNWKSFSVEVSVNDIPNLKKILAGISTRQYLRMYRRVVNVRR 509
            C+PV++S ++V PF+DVL W++FSV V V++IP LK+IL  IS  +Y  +   +  VRR
Sbjct: 391 ECIPVILSVNFVLPFTDVLRWETFSVLVDVSEIPRLKEILMSISNEKYEWLKSNLRYVRR 450

Query: 510 HFVVNFPPKRFDVYHMILHSVWLRRLNLGL 535
           HF +N PP+RFD +H+ LHS+WLRRLNL L
Sbjct: 451 HFELNDPPQRFDAFHLTLHSIWLRRLNLKL 479

BLAST of Cp4.1LG10g04320 vs. TAIR10
Match: AT3G42180.1 (AT3G42180.1 Exostosin family protein)

HSP 1 Score: 422.5 bits (1085), Expect = 3.8e-118
Identity = 214/418 (51.20%), Postives = 284/418 (67.94%), Query Frame = 1

Query: 130 NEVTKNASAGSNYEPAAKARKQRRYTKLERIEAELRGARAAIREAM-FQNQTQDSD---F 189
           N +  ++S+ S Y P    +++   + LE+ E ELR ARAAIR A+ F+N T + +   +
Sbjct: 54  NALQSSSSSSSLYSPPITVKRR---SNLEKREEELRKARAAIRRAVRFKNCTSNEEVITY 113

Query: 190 VPSGPMYWNAKTFHRSYQEMEKELKIFVYEEGEPPLFHNGPCKNIYSTEGNFIHAIEM-- 249
           +P+G +Y N+  FH+S+ EM K  K++ Y+EGE PL H+GP  +IY  EG FI  +    
Sbjct: 114 IPTGQIYRNSFAFHQSHIEMMKTFKVWSYKEGEQPLVHDGPVNDIYGIEGQFIDELSYVM 173

Query: 250 ---DSRFRTNDPNKAHVFFLPLSVVMLVRFVY--VRDSHDFTPLR--RTVADYVNVIGTK 309
                RFR + P +AH FFLP SV  +V +VY  +    DF   R  R   DYV+V+  K
Sbjct: 174 GGPSGRFRASRPEEAHAFFLPFSVANIVHYVYQPITSPADFNRARLHRIFNDYVDVVAHK 233

Query: 310 YPFWNRTLGADHFMLSCHDWGPEVSKSEPHLYKNSIRVLCNANTSEGFNPSKDVSLPEIN 369
           +PFWN++ GADHFM+SCHDW P+V  S+P  +KN +R LCNANTSEGF  + D S+PEIN
Sbjct: 234 HPFWNQSNGADHFMVSCHDWAPDVPDSKPEFFKNFMRGLCNANTSEGFRRNIDFSIPEIN 293

Query: 370 LQTGVLTGFLGGPSPSRRPILAFFAGGLHGPIRPILIQQWENKDQDIRVHQYLPEGVSYI 429
           +    L     G +P  R ILAFFAG  HG IR +L   W+ KD+D++V+ +L +G +Y 
Sbjct: 294 IPKRKLKPPFMGQNPENRTILAFFAGRAHGYIREVLFSHWKGKDKDVQVYDHLTKGQNYH 353

Query: 430 EMMRKSRFCLCPSGYEVASPRIVEAIYTGCVPVLISDHYVPPFSDVLNWKSFSVEVSVND 489
           E++  S+FCLCPSGYEVASPR VEAIY+GCVPV+ISD+Y  PF+DVL+W  FSVE+ V+ 
Sbjct: 354 ELIGHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLPFNDVLDWSKFSVEIPVDK 413

Query: 490 IPNLKKILAGISTRQYLRMYRRVVNVRRHFVVNFPPKRFDVYHMILHSVWLRRLNLGL 535
           IP++KKIL  I   +YLRMYR V+ VRRHFVVN P + FDV HMILHSVWLRRLN+ L
Sbjct: 414 IPDIKKILQEIPHDKYLRMYRNVMKVRRHFVVNRPAQPFDVIHMILHSVWLRRLNIRL 468

BLAST of Cp4.1LG10g04320 vs. NCBI nr
Match: gi|449451243|ref|XP_004143371.1| (PREDICTED: probable glycosyltransferase At5g03795 [Cucumis sativus])

HSP 1 Score: 805.4 bits (2079), Expect = 5.8e-230
Identity = 412/542 (76.01%), Postives = 445/542 (82.10%), Query Frame = 1

Query: 1   MKLPLFMIPLIA-ACLFVILGPPFLSDWLTFLDRYASFRSSLSSSSSEMNSHSNATAALP 60
           MK P+F+IPLIA +C   +LGP F +DW  F D+YAS + SLSSS      HSN +    
Sbjct: 1   MKFPVFVIPLIAVSCFLAVLGPRFNTDWTIFFDKYASTQPSLSSSFKREGFHSNTS---- 60

Query: 61  VPVTVDGQTVEQQQQQQQQQQQPLFNHSVSNHSVLASSSSSSPPPPPIDEMQILP--PNR 120
           VPV  +                     + +N SVL   S SSPPPP  D  Q L   PNR
Sbjct: 61  VPVATE-------------------EAAAANVSVL---SFSSPPPPVDDGKQSLQLHPNR 120

Query: 121 TRVNEEFLETA--INEVTKNASAGSNYEPAAK--ARKQRRYTKLERIEAELRGARAAIRE 180
           TRVNE+  ETA  INEV +  S  S+YE A K  AR+QR YTKLERIEA LR ARAAIRE
Sbjct: 121 TRVNEDLGETATTINEVIRKVSNESSYESAVKVRARRQREYTKLERIEAGLRRARAAIRE 180

Query: 181 AMFQNQTQDSDFVPSGPMYWNAKTFHRSYQEMEKELKIFVYEEGEPPLFHNGPCKNIYST 240
           A F NQTQD DFVPSGPMYWN+K FHRSY EMEKE+KIFVYEEGEPPLFHNGPCK+IYST
Sbjct: 181 AKFLNQTQDPDFVPSGPMYWNSKAFHRSYLEMEKEMKIFVYEEGEPPLFHNGPCKSIYST 240

Query: 241 EGNFIHAIEMDSRFRTNDPNKAHVFFLPLSVVMLVRFVYVRDSHDFTPLRRTVADYVNVI 300
           EGNFIHAIEMDS+FRT DPNKAHVFFLPLSV MLVRFVYV DSHDFTP+R TV DY+NVI
Sbjct: 241 EGNFIHAIEMDSQFRTKDPNKAHVFFLPLSVAMLVRFVYVHDSHDFTPIRHTVVDYINVI 300

Query: 301 GTKYPFWNRTLGADHFMLSCHDWGPEVSKSEPHLYKNSIRVLCNANTSEGFNPSKDVSLP 360
           GTKYPFWNR+LGADHFMLSCHDWGPE SKS P+LYKNSIRVLCNANTSEGFNPSKDVS P
Sbjct: 301 GTKYPFWNRSLGADHFMLSCHDWGPEASKSVPNLYKNSIRVLCNANTSEGFNPSKDVSFP 360

Query: 361 EINLQTGVLTGFLGGPSPSRRPILAFFAGGLHGPIRPILIQQWENKDQDIRVHQYLPEGV 420
           EINLQTG LTGFLGGPSPS RPI+AFFAGGLHGPIRPILIQ+WEN+DQDI+VHQYLP+GV
Sbjct: 361 EINLQTGHLTGFLGGPSPSHRPIMAFFAGGLHGPIRPILIQRWENQDQDIQVHQYLPKGV 420

Query: 421 SYIEMMRKSRFCLCPSGYEVASPRIVEAIYTGCVPVLISDHYVPPFSDVLNWKSFSVEVS 480
           SYI+MMRKS+FCLCPSGYEVASPRIVEAIYTGCVPVLISDHYVPPFSDV+NWKSFSVEVS
Sbjct: 421 SYIDMMRKSKFCLCPSGYEVASPRIVEAIYTGCVPVLISDHYVPPFSDVINWKSFSVEVS 480

Query: 481 VNDIPNLKKILAGISTRQYLRMYRRVVNVRRHFVVNFPPKRFDVYHMILHSVWLRRLNLG 536
           V+DIPNLK IL GISTRQYLRMYRRVV VRRHF VN PPKR+DVYHMILHSVWLRRLNL 
Sbjct: 481 VDDIPNLKTILTGISTRQYLRMYRRVVKVRRHFEVNSPPKRYDVYHMILHSVWLRRLNLR 516

BLAST of Cp4.1LG10g04320 vs. NCBI nr
Match: gi|659125122|ref|XP_008462519.1| (PREDICTED: probable glycosyltransferase At5g03795 [Cucumis melo])

HSP 1 Score: 775.4 bits (2001), Expect = 6.5e-221
Identity = 402/543 (74.03%), Postives = 434/543 (79.93%), Query Frame = 1

Query: 1   MKLPLFMIPLIA-ACLFVILGPPFLSDWLTFLDRYASFRSSLSSSSSEMNSHSNATAALP 60
           MK P+F+IPLIA +C   +  P F +DW  F  + A     +  S S         +   
Sbjct: 1   MKFPVFVIPLIAISCFLALFAPQFNTDWTIFFHKSA----PIHPSLSYSFKREEFHSNTS 60

Query: 61  VPVTVDGQTVEQQQQQQQQQQQPLFNHSVSNHSVLASSSSSSPPPPPIDEMQILP--PNR 120
           V V  D                     + +N SVL   S SSPPPP  DE Q L   PNR
Sbjct: 61  VSVAADEAA------------------AAANVSVL---SFSSPPPPVNDEKQSLQLHPNR 120

Query: 121 TRVNEEFLETA--INEVTKNASAGSNYEPAA--KARKQRRYTKLERIEAELRGARAAIRE 180
           TRVNE+  ETA  IN V +N S  S+YE A   +AR+QR YTKLERIEA LR ARAAIRE
Sbjct: 121 TRVNEDLGETATAINGVIRNVSNDSSYESAVKVRARRQREYTKLERIEAGLRRARAAIRE 180

Query: 181 AMFQNQTQDSDFVPSGPMYWNAKTFHRSYQEMEKELKIFVYEEGEPPLFHNGPCKNIYST 240
           A F NQTQD DFVPSGPMYWN+K FHRSY EMEKE+KIFVYEEGEPPLFHNGPCK+IYST
Sbjct: 181 AKFLNQTQDPDFVPSGPMYWNSKAFHRSYLEMEKEMKIFVYEEGEPPLFHNGPCKSIYST 240

Query: 241 EGNFIHAIEMDSRFRTNDPNKAHVFFLPLSVVMLVRFVYVRDSHDFTPLRRTVADYVNVI 300
           EGNFIHAIEMDS+FRT DPNKAHVFFLPLSV MLVRFVYV DSHDFTP+R TV DY+NVI
Sbjct: 241 EGNFIHAIEMDSQFRTKDPNKAHVFFLPLSVAMLVRFVYVHDSHDFTPIRHTVIDYINVI 300

Query: 301 GTKYPFWNRTLGADHFMLSCHDWGPEVSKSEPHLYKNSIRVLCNANTSEGFNPSKDVSLP 360
           GTKYPFWNR+LGADHFMLSCHDWGPE SKS P+LYKNSIRVLCNANTSEGFNPSKDVS P
Sbjct: 301 GTKYPFWNRSLGADHFMLSCHDWGPEASKSVPNLYKNSIRVLCNANTSEGFNPSKDVSFP 360

Query: 361 EINLQTGVLTGFLGGPSPSRRPILAFFAGGLHGPIRPILIQQWENKDQDIRVHQYLPEG- 420
           EINLQTG LTGFLGGPSPS RPILAFFAGGLHGPIRPILIQQWEN+DQDI+VHQYLP+G 
Sbjct: 361 EINLQTGYLTGFLGGPSPSHRPILAFFAGGLHGPIRPILIQQWENQDQDIQVHQYLPKGV 420

Query: 421 VSYIEMMRKSRFCLCPSGYEVASPRIVEAIYTGCVPVLISDHYVPPFSDVLNWKSFSVEV 480
           VSYI+MMRKS+FCLCPSGYEVASPRIVEAIYTGCVPVLISDHYVPPFSDV+NWKSFSVEV
Sbjct: 421 VSYIDMMRKSKFCLCPSGYEVASPRIVEAIYTGCVPVLISDHYVPPFSDVINWKSFSVEV 480

Query: 481 SVNDIPNLKKILAGISTRQYLRMYRRVVNVRRHFVVNFPPKRFDVYHMILHSVWLRRLNL 536
           SV++IPNLK IL GISTRQYLRMYRRVV VRRHF VN PPKR+DVYHMILHSVWLRRLNL
Sbjct: 481 SVDEIPNLKTILTGISTRQYLRMYRRVVKVRRHFEVNSPPKRYDVYHMILHSVWLRRLNL 518

BLAST of Cp4.1LG10g04320 vs. NCBI nr
Match: gi|645272313|ref|XP_008241335.1| (PREDICTED: probable glycosyltransferase At5g03795 [Prunus mume])

HSP 1 Score: 659.1 bits (1699), Expect = 6.8e-186
Identity = 338/541 (62.48%), Postives = 399/541 (73.75%), Query Frame = 1

Query: 1   MKLPLFMIPLIAACLFVILGPPFLSDWLTFLDRYASFRSSLSSSSS-EMNSHSNATAALP 60
           M + LF++PL+     V L  P  S+W+   + Y    +S SSS S  +   SN+++  P
Sbjct: 21  MAILLFVVPLVVVFGLVSLLGPKTSNWVLISNSYPWLWNSQSSSPSLNLTGASNSSSEFP 80

Query: 61  VPVTVDGQTVEQQQQQQQQQQQPLFNHSVSN-HSVLASSSSSSPPPPPIDEM---QILPP 120
            P+  D   +                HS+   HS  +  + SS PP  I+E     +  P
Sbjct: 81  -PLNDDVLGLRSSVVVVDM-------HSIEEAHSDDSLQNRSSSPPLSIEEAVPPTLEQP 140

Query: 121 NRTRVNEEFLETAINEVTKNASAGSNYEPAAKARKQRRYTKLERIEAELRGARAAIREAM 180
           N TR ++   ET I+ V              KA  QR+ T L  +EA LR ARAAIREA 
Sbjct: 141 NGTRQDDYANETQISIV--------------KAEGQRKNTNLGWLEARLRRARAAIREAK 200

Query: 181 FQNQTQDSDFVPSGPMYWNAKTFHRSYQEMEKELKIFVYEEGEPPLFHNGPCKNIYSTEG 240
           F NQTQD D++P+GPMYW A  F RSY EMEK  K+FVY EGEPPLFHNGPCK+IYS EG
Sbjct: 201 FGNQTQDVDYIPNGPMYWKANAFQRSYLEMEKRFKVFVYGEGEPPLFHNGPCKSIYSMEG 260

Query: 241 NFIHAIEMDSRFRTNDPNKAHVFFLPLSVVMLVRFVYVRDSHDFTPLRRTVADYVNVIGT 300
           NFIH IE++ +FRT+DP KAHV+FLP SV MLVRFVYVRDSHDF P+R+TV DYVN++  
Sbjct: 261 NFIHEIEVNKQFRTHDPEKAHVYFLPFSVTMLVRFVYVRDSHDFGPIRQTVRDYVNIVSG 320

Query: 301 KYPFWNRTLGADHFMLSCHDWGPEVSKSEPHLYKNSIRVLCNANTSEGFNPSKDVSLPEI 360
           KYP+WNR+LGADHFML+CHDWGPE S S+PHL KNSIRVLCNANTSEGFNPSKDVS PEI
Sbjct: 321 KYPYWNRSLGADHFMLACHDWGPETSNSDPHLRKNSIRVLCNANTSEGFNPSKDVSFPEI 380

Query: 361 NLQTGVLTGFLGGPSPSRRPILAFFAGGLHGPIRPILIQQWENKDQDIRVHQYLPEGVSY 420
           NLQTG   GFLGGPSP  R ILAFFAGG+HGPIRP+L++ WENKD+D+RVHQYLP+G+SY
Sbjct: 381 NLQTGDTHGFLGGPSPRLRSILAFFAGGVHGPIRPVLLEHWENKDEDLRVHQYLPKGISY 440

Query: 421 IEMMRKSRFCLCPSGYEVASPRIVEAIYTGCVPVLISDHYVPPFSDVLNWKSFSVEVSVN 480
            +MMR S+FCLCPSGYEVASPR+VEAIYTGCVPVLISDHYVPPFSDVLNWKSFSVEV V+
Sbjct: 441 YDMMRHSKFCLCPSGYEVASPRVVEAIYTGCVPVLISDHYVPPFSDVLNWKSFSVEVKVS 500

Query: 481 DIPNLKKILAGISTRQYLRMYRRVVNVRRHFVVNFPPKRFDVYHMILHSVWLRRLNLGLR 537
           +IPNLK IL  IST+QY+RM RRVV VRRHF VN PPKRFDV+HMILHS+WLRRLN+ + 
Sbjct: 501 EIPNLKNILMSISTKQYIRMQRRVVQVRRHFEVNSPPKRFDVFHMILHSIWLRRLNVRVH 539

BLAST of Cp4.1LG10g04320 vs. NCBI nr
Match: gi|502111648|ref|XP_004494122.1| (PREDICTED: probable glycosyltransferase At5g03795 [Cicer arietinum])

HSP 1 Score: 658.7 bits (1698), Expect = 8.8e-186
Identity = 333/536 (62.13%), Postives = 400/536 (74.63%), Query Frame = 1

Query: 1   MKLPLFMIPLIAACLFVILGPPFLSDWLTFLDRYASFRSSLSSSSSEMNSHSNATAALPV 60
           MKL LFM+PLI     V +  P  S+W+  +       SS+++SSSE  S      A+  
Sbjct: 13  MKLLLFMVPLIIVAGLVSVLGPNPSNWV-LIPNQPLLWSSVTTSSSESGSLMEKVQAVSF 72

Query: 61  PVTVDGQTVEQQQQQQQQQQQPLFNHSVSNHSVLASSSSSSPPPPPIDEMQILPPNRTRV 120
               + + V+                ++S+     + SS+  PP  I  +Q L  N+   
Sbjct: 73  DFHNNNKEVK----------------AISDDDTFFNQSST--PPLSIQTLQKL--NKDEE 132

Query: 121 NEEFLETAINEVTKNASAGSNYEPAAKARKQRRYTKLERIEAELRGARAAIREAMFQNQT 180
           ++   +   N  + N S    Y P  K    R+++ L+R EA L  ARAAIR+A   NQT
Sbjct: 133 SDNASQIWTNTTSMNESY---YIPLEKPNLPRKFSILDRTEAGLLQARAAIRKARNGNQT 192

Query: 181 QDSDFVPSGPMYWNAKTFHRSYQEMEKELKIFVYEEGEPPLFHNGPCKNIYSTEGNFIHA 240
           QD D+VP GPMY+N   FHRSY EMEK+ K+FVYEEGEPP+FHNGPCK+IYS EGNFIH 
Sbjct: 193 QDIDYVPIGPMYYNPNAFHRSYLEMEKQFKVFVYEEGEPPVFHNGPCKSIYSMEGNFIHT 252

Query: 241 IEMDSRFRTNDPNKAHVFFLPLSVVMLVRFVYVRDSHDFTPLRRTVADYVNVIGTKYPFW 300
           IEM+ +FRT DP+KAHVFFLP SV M+V+FVYVRDSHDF+P+++TVADY+NVI  +YPFW
Sbjct: 253 IEMNDQFRTRDPDKAHVFFLPFSVAMMVQFVYVRDSHDFSPIKKTVADYINVISERYPFW 312

Query: 301 NRTLGADHFMLSCHDWGPEVSKSEPHLYKNSIRVLCNANTSEGFNPSKDVSLPEINLQTG 360
           NR+LGADHFMLSCHDWGPEVS S P+LYKNSIR LCNANTSEGF P+KDVS PEINLQTG
Sbjct: 313 NRSLGADHFMLSCHDWGPEVSNSVPNLYKNSIRALCNANTSEGFKPAKDVSFPEINLQTG 372

Query: 361 VLTGFLGGPSPSRRPILAFFAGGLHGPIRPILIQQWENKDQDIRVHQYLPEGVSYIEMMR 420
            + GF+GGPSPS+R +LAFFAGGLHGPIRPIL+  WENKD+DI+VH+YLP+GVSY +M+R
Sbjct: 373 TIHGFVGGPSPSKRSVLAFFAGGLHGPIRPILLDHWENKDEDIQVHKYLPKGVSYYDMLR 432

Query: 421 KSRFCLCPSGYEVASPRIVEAIYTGCVPVLISDHYVPPFSDVLNWKSFSVEVSVNDIPNL 480
           KS+FCLCPSGYEVASPR+VEAIYTGCVPVLISDHYVPPFSDVLNWKSFSVEVSV+DIPNL
Sbjct: 433 KSKFCLCPSGYEVASPRVVEAIYTGCVPVLISDHYVPPFSDVLNWKSFSVEVSVDDIPNL 492

Query: 481 KKILAGISTRQYLRMYRRVVNVRRHFVVNFPPKRFDVYHMILHSVWLRRLNLGLRD 537
           KKIL  IS RQY+RM RRV  VRRHF V+ PPKRFDV+HMILHS+WLRRLN  + D
Sbjct: 493 KKILLSISPRQYIRMQRRVGQVRRHFEVHSPPKRFDVFHMILHSIWLRRLNFRVHD 524

BLAST of Cp4.1LG10g04320 vs. NCBI nr
Match: gi|595804161|ref|XP_007202112.1| (hypothetical protein PRUPE_ppa006842mg [Prunus persica])

HSP 1 Score: 655.6 bits (1690), Expect = 7.5e-185
Identity = 300/390 (76.92%), Postives = 338/390 (86.67%), Query Frame = 1

Query: 147 KARKQRRYTKLERIEAELRGARAAIREAMFQNQTQDSDFVPSGPMYWNAKTFHRSYQEME 206
           KA  QR+ T L  +EA LR ARAAIREA F NQTQD D++P+GPMYWNA  F RSY EME
Sbjct: 2   KAEGQRKNTNLGWLEARLRRARAAIREAKFGNQTQDVDYIPNGPMYWNANAFQRSYLEME 61

Query: 207 KELKIFVYEEGEPPLFHNGPCKNIYSTEGNFIHAIEMDSRFRTNDPNKAHVFFLPLSVVM 266
           K  K+FVY EGEPPLFHNGPCK+IYS EGNFIH IE++ +FRT DP KAHV+FLP SV M
Sbjct: 62  KRFKVFVYGEGEPPLFHNGPCKSIYSMEGNFIHEIEVNKQFRTRDPEKAHVYFLPFSVTM 121

Query: 267 LVRFVYVRDSHDFTPLRRTVADYVNVIGTKYPFWNRTLGADHFMLSCHDWGPEVSKSEPH 326
           LVRFVYVRDSHDF P+R+TV DYVN++  KYP+WNR+LGADHFML+CHDWGPE S S+PH
Sbjct: 122 LVRFVYVRDSHDFGPIRQTVRDYVNIVSGKYPYWNRSLGADHFMLACHDWGPETSNSDPH 181

Query: 327 LYKNSIRVLCNANTSEGFNPSKDVSLPEINLQTGVLTGFLGGPSPSRRPILAFFAGGLHG 386
           L KNSIRVLCNANTSEGFNPSKDVS PEINLQTG   GFLGGPSP  R ILAFFAGG+HG
Sbjct: 182 LRKNSIRVLCNANTSEGFNPSKDVSFPEINLQTGDTHGFLGGPSPRLRSILAFFAGGVHG 241

Query: 387 PIRPILIQQWENKDQDIRVHQYLPEGVSYIEMMRKSRFCLCPSGYEVASPRIVEAIYTGC 446
           PIRP+L++ WENKD+D+RVHQYLP+G+SY +MMR S+FCLCPSGYEVASPR+VEAIYTGC
Sbjct: 242 PIRPVLLEHWENKDEDLRVHQYLPKGISYYDMMRHSKFCLCPSGYEVASPRVVEAIYTGC 301

Query: 447 VPVLISDHYVPPFSDVLNWKSFSVEVSVNDIPNLKKILAGISTRQYLRMYRRVVNVRRHF 506
           VPVLISDHYVPPFSDVLNWKSFSVEV V++IPNLK IL  IST+QY+RM RRVV VRRHF
Sbjct: 302 VPVLISDHYVPPFSDVLNWKSFSVEVKVSEIPNLKNILMSISTKQYIRMQRRVVQVRRHF 361

Query: 507 VVNFPPKRFDVYHMILHSVWLRRLNLGLRD 537
            VN PPKRFDV+HMILHS+WLRRLN+ + D
Sbjct: 362 EVNSPPKRFDVFHMILHSIWLRRLNVRVHD 391

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
GLYT3_ARATH2.9e-17363.92Probable glycosyltransferase At5g03795 OS=Arabidopsis thaliana GN=At5g03795 PE=3... [more]
GLYT1_ARATH4.2e-12756.92Probable glycosyltransferase At3g07620 OS=Arabidopsis thaliana GN=At3g07620 PE=3... [more]
GLYT4_ARATH1.0e-12047.61Probable glycosyltransferase At5g11130 OS=Arabidopsis thaliana GN=At5g11120/At5g... [more]
GLYT6_ARATH3.2e-11952.56Probable glycosyltransferase At5g25310 OS=Arabidopsis thaliana GN=At5g25310 PE=3... [more]
GLYT2_ARATH6.7e-11751.20Probable glycosyltransferase At3g42180 OS=Arabidopsis thaliana GN=At3g42180 PE=2... [more]
Match NameE-valueIdentityDescription
A0A0A0KKC9_CUCSA4.1e-23076.01Uncharacterized protein OS=Cucumis sativus GN=Csa_6G449280 PE=4 SV=1[more]
M5VQP8_PRUPE5.2e-18576.92Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006842mg PE=4 SV=1[more]
W9RJ02_9ROSA1.3e-18368.50Putative glycosyltransferase OS=Morus notabilis GN=L484_022635 PE=4 SV=1[more]
I1NAG9_SOYBN1.4e-18260.96Uncharacterized protein OS=Glycine max GN=GLYMA_19G189300 PE=4 SV=1[more]
A0A0B2NZ15_GLYSO1.9e-18260.77Putative glycosyltransferase OS=Glycine soja GN=glysoja_002299 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G03795.11.7e-17463.92 Exostosin family protein[more]
AT3G07620.12.4e-12856.92 Exostosin family protein[more]
AT5G11130.15.6e-12247.61 Exostosin family protein[more]
AT5G25310.11.8e-12052.56 Exostosin family protein[more]
AT3G42180.13.8e-11851.20 Exostosin family protein[more]
Match NameE-valueIdentityDescription
gi|449451243|ref|XP_004143371.1|5.8e-23076.01PREDICTED: probable glycosyltransferase At5g03795 [Cucumis sativus][more]
gi|659125122|ref|XP_008462519.1|6.5e-22174.03PREDICTED: probable glycosyltransferase At5g03795 [Cucumis melo][more]
gi|645272313|ref|XP_008241335.1|6.8e-18662.48PREDICTED: probable glycosyltransferase At5g03795 [Prunus mume][more]
gi|502111648|ref|XP_004494122.1|8.8e-18662.13PREDICTED: probable glycosyltransferase At5g03795 [Cicer arietinum][more]
gi|595804161|ref|XP_007202112.1|7.5e-18576.92hypothetical protein PRUPE_ppa006842mg [Prunus persica][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR004263Exostosin
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
biological_process GO:0006486 protein glycosylation
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0050508 glucuronosyl-N-acetylglucosaminyl-proteoglycan 4-alpha-N-acetylglucosaminyltransferase activity
molecular_function GO:0016757 transferase activity, transferring glycosyl groups

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG10g04320.1Cp4.1LG10g04320.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR004263Exostosin-likePFAMPF03016Exostosincoord: 206..485
score: 2.2
NoneNo IPR availableunknownCoilCoilcoord: 157..177
scor
NoneNo IPR availablePANTHERPTHR11062EXOSTOSIN HEPARAN SULFATE GLYCOSYLTRANSFERASE -RELATEDcoord: 152..537
score: 4.3E
NoneNo IPR availablePANTHERPTHR11062:SF66SUBFAMILY NOT NAMEDcoord: 152..537
score: 4.3E
NoneNo IPR availableunknownSSF53756UDP-Glycosyltransferase/glycogen phosphorylasecoord: 223..455
score: 3.1

The following gene(s) are paralogous to this gene:
GeneParalogueOrganismBlock
Cp4.1LG10g04320Cp4.1LG19g03760Cucurbita pepo (Zucchini)cpecpeB076
The following block(s) are covering this gene:
GeneOrganismBlock
Cp4.1LG10g04320Cucurbita pepo (Zucchini)cpecpeB092
Cp4.1LG10g04320Cucurbita maxima (Rimu)cmacpeB254
Cp4.1LG10g04320Cucurbita maxima (Rimu)cmacpeB656
Cp4.1LG10g04320Cucurbita moschata (Rifu)cmocpeB218
Cp4.1LG10g04320Melon (DHL92) v3.6.1cpemedB069
Cp4.1LG10g04320Silver-seed gourdcarcpeB0348