Cp4.1LG18g04040 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG18g04040
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Exostosin domain-containing protein
Location: Cp4.1LG18: 5271945 .. 5277046 (-)
RNA-Seq Expression: Cp4.1LG18g04040
Synteny: Cp4.1LG18g04040
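
The Location field can be split into scaffold, start, end and strand, and the genomic span derived from it. The sketch below is illustrative only: `parse_location` is a hypothetical helper, and it assumes the coordinates are 1-based and inclusive (as in GFF3), which this record does not state.

```python
import re

def parse_location(text):
    """Split a 'scaffold: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
    return {
        "scaffold": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG18: 5271945 .. 5277046 (-)")
print(loc["scaffold"], loc["strand"], loc["length"])  # Cp4.1LG18 - 5102
```

Under that assumption the gene spans 5102 bp on the minus strand of Cp4.1LG18.
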
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TAATCAATGTCTGCAACTGCAACCTTTTTGCTTCCAATCGCCATTGCCGGGAGCGGAGGGAGAAATGTAATGAGGAACATTGGAACTAAAGATTGTATCTCTAGGGCTTGAAGACGACCGCAAACAGAAGCTTTCTTGAAGCTCAATCACTCCATCTTCTTCGAGCAAGGTTAGTAAAGAAAGATTAACTTCGACTTCAGCGCTTTTCTTTACTTTTAGGATCCAAATGTTCGTACTTCGACTTCATCTTCTTCGAACTTTTCAACTTCTCTTTAGCTGAGTTCAGATTCGTTTACAGATTCACTCCTACTAACTAAAGAGAAATGGTTCTGGATTCAATCTCTTATTTTCCTTTTTAAGGTAATGAATCCCATCTTCTCCACTTCACTTCAGAAGCTAAAGCAATTTTGTTTTTGTTTTTGTTTCTGTTTCTGTTTTTGCAAGATTATGATAATTCTACTATTTCCTAATTGTAGGTTGTGAATTTCATTAGTTCTCATCGTCTGTGTTCTCATTGAATTGCTTGTTATGGAGTATCTGTTACCTCTCTGCAAGCTATGTCACATTGAAACTCGAAGATGGTTTCTTGTGATGGGAGTAGTGGCTTTTACTTATGTACTATTTCAATCTCTTTTACTTCCCTATGGGGATGCTCTCCGGTCTCTACTTCCTGAGGATGGTATTCAAAAATTTGATCAATATAGCATTCCGATGGGGCATACTTCAGCCAAATCAACAACGGTTCGCAACCCTCTTACGGTTCTGGATTTGGCTAATACTTCAGCTCCCGTTGGGAAAACTGATAATTATGTTCTTGAGAAAGGATCTCAACGTGATAGCACGCTGAATGCCAAAGGGAAGTATGTAAAAGATGAGGAAAGCCCTAGAGATGGTTACGAACTGTCTCTTGTTCGAAATCATGATATTGGTTTTGAATCTGGAAAGATGGTTGATACAAATGGCAATTTGGAATCAGATGGCACTAAGAATGGTGGAAATAATTCTATTGTTCACATGGATGGCCAACCAAGTTTTGAGTTCCCCTTGGCACTAAGTGATAAGGTCACTTCAGAGAATGAGTTAGAAGAAATTGGCGAAATGGATTTGGATTTTGGTGAGTTAGAAGAATTTAAAAACTCATCATTACGAAAGCCTGGTGATACAGATGTGACCTTCAATTCCTCGACCTTCATGCTACAGATCCCAACTTCACCAGTTAACACACCTCATTCACAGCACTTGATATCAAATATAAGCTCACCAGTCTCTGAAACTAATTCTAAAAGCATAGGTAAAAGGAAGAAGATGAAGAGTGAAATGCCACCAAAGTCAGTAACTTCACTAGAACAGATGAACAGCATTTTATTGCGTCACCGCAGGTCATCACGAGCGATGGTATATGTGTTTGAGTTAGTTTTAAGTTATTGGTTTGATGATCATATATTATGCTAGGTTTTAACCTCGTTCTGTTTCATGTAGAGACCAAGGAGATCGTCTTTGCGTGATCTGGAAATTTTTTCTGCCAAGTCCCAGATTGAGCATGCTTCTGCCATAAATGACCCTGAACTATACGCTCCGTTGTTTCGTAATGTTTCAATGTTTAAAAGGTAAAGCCAACACTAGTATTCTATTGCATTATTCACATTGTTAAATAGCATTTCCATGTACATCCATGGCCCTTGCGTTTTGGTTGGTTGGTATTAAAGAAAATGAGTTGGAAAATACAATCCTGTGGGATTTTCGCTATCTGCTCAATCGCCCTGTTACGTTTTAAAAATGATTATAAGGATGCCTCTTTTTTTTGTTGCCTCATTGTATCTGATCTTCAACCAAATTGTGCCATTAGGTTTGGTGGAATTTATACATGTGGCACACATTTCGAGCCTCATGTGTTAGTTTTGTTCAATTAACAACCAAAACTTAATTCATGGGCTGAAAGAGGCAAAAAATAAACCAAAGACTCCTTAAAACTATATTTAACGCTTGTATATTTTGCAATC
CACCAAGTATACTTATGCCTTCTGGAAGTTATTTCCACGCTTTCAAACACTGCCGTTCTTTCCATTCATGATTGGGAACAATGCTCTGCCATGTTCATAACATTAATGTAAAAAGAGAAGTGTTAAGATGAACAGAATAATTATTTAAAAAATGAAGTACACAGTGGCATATTCCCTTCAAATCTAAATAATTTGTATCATATAAGTATATATCGTTATGTTTCTTATTTAAAAAAATAATTTGTATCATATCTAAAATTTAGATAGAATACAATATGAGTCTGCTGAAAAATAGTGCACACAATTTGTTTATTGTGCTGATTTATTGCTTGCTAAGTTACCTGATCATATGGCTGCTTGCGTTTTTCCTATATTTTCAAATCTAATAGTAGTTTTGACCAAAAATAGGAAGTCAATGACGCAGTTGTGTAGGTCCAATTCAAACCTTTTCCCTGTGATTTCAGGAGTTATGAACTCATGGAGCGCACACTCAAAATCTATGTCTATAGGGATGGAAAGAAGCCCATCTTTCATCAACCAATAATGAAGGGGTTATATGCCTCAGAAGGATGGTTTATGAAACTGATGGAGAGAAACAAGCATTTTGTTGTAAAGGATCCTCGAAAGGCTCACTTGTTTTATATGCCGTTTAGTTCTCGGATGTTGGAGTACACACTCTACGTGCGGAATTCTCATAACAGGACAAATTTACGTCAATTTTTGAAGGAATACTCTGAAAATATTGCAGCCAAATATCCATACTGGAACAGAACTGGTGGAGCAGATCATTTTCTTGTTGCATGCCATGATTGGGTACATTCGATGAAACTGTTGATCTTTTAACATTTTTTTTTTCTTTCTTTCAATTTCTAATCAAATTGGACTGCATTTAGATAAATGGTCGTAAGTTCTAGAAAAATAAGCGTGTTTGATGACCTGCATTTGCATATGAATTCACCAAGTCACTCCAGTAACATACTTATGATGCAATTGCTCTGGTCATTTGAATTAATGGTCAAATGGATCATCATTTTGTGCTTAGTTAGATTGTTCCTTCTCATCATATAGATCATATGAAGCACCTCTGTTGACACAAAAGATTAAGCAACACGCTAAAGTTAACTTAATGTCTTGGACTCTTATAGGCTCCTTATGAAACAAGGCACCACATGGAGCAGTGCATAAAAGCACTTTGCAATGCTGATGTAACCGTTGGCTTCAAAATTGGGAGAGACGTGTCTCTTCCAGAAACTTATGTACGATCCGCGAGGAATCCTCTTAGAGATCTTGGAGGAAAACCTGCATCACAGAGGCACATTCTTGCCTTTTATGCTGGAAATATGCATGGTTATGTACGTCCAATCCTACTGAAGTACTGGAAAGACAAAAACCCTGATATGAAGATCTTTGGTCCAATGCCTCCTGGTGTTGCAAGCAAAATGAACTACATTCAGCATATGAAGAGCAGCAAATACTGTATCTGTCCAAAGGGTTACGAGGTCAATAGTCCCCGAGTCGTGGAAGCCATCTTTTACGAGTGTGTACCTGTGATCATATCAGACAATTTTGTGCCACCATTTTTTGAGGTGTTGGATTGGGAAGCATTCTCAGTGATTGTTGCAGAAAAGGACATTCCCCACCTACAAGACATACTGCTTTCAATACCAAAAGACAGATATCTCGAGATGCAACTCCGAGTCAGGAAAGTACAGAAGCACTTCCTCTGGCATGCCAAGCCCTTGAAGTATGACCTGTTCCACATGACCCTCCATTCAATCTGGTATAACAGAGTTTTCCAGATAAAACTCAGATAATATTCTTGAAAGTGCCTCATTGAAGATTCAGAGGAAATCCATGGAACTGTTAAATAACTTCACAGAACAGAATAACTCACCTTCTCTGCCTTCGCAGCTTCTCTCTGGAAAGGATTCTTTCAACCTCAGGGGGCATCATGTGAAGGACCTTGAAGAAGGGATTGCAACTGTGGCAGTGCCCTTGTA
TATACAACGCTTTATCCTTGTTCCACTTCTGTATAGTAGACCAATGAAACAAAACTACGATTATTTGTCAATTTTGCAGTAGAAACTGTGATTTCATTTGCCCCAACCTCTTGTTGTGAGACTCGAGATTTCAAATCATACCGCCCCGCCGTGCATCGCTGTTCATCAACATAGTAAGCCATTTCCAGGACCACAATGGAAGGTATTCCTCTTTCTTTCGGTCATCATCCACATCCATCCAAAGCAATAAATGTTTTCACTCACTGGGTCCATCCAGTCCTTCAACTTGAAGAGCCCCACCTTCAAAAACCGTATTCTCAGGGATCGGAATCTGATTCCTATGCCATATATCATATCTATATGGTGCCACAACATGAAGCTTCCTATGAAAAACAGAGGCAATGTGCAATGACTGGTGTTAATGATCTTGGTAGGTTACTATCAAGAAAGGGACCCTGATAATCAGATATTTAAGATCTCCTACAGGGTCTACTTCACTAATGAGAGTTCAGATGATCATGGAGAGGTACCCAAAAGAAACTAGAACCAAAATGTAAGTGATTCTGGCCCCTCCTGAACTGAGTGATGTTTACTAACACAATGGTAACATTTCTTTTCCCCTCTCTCTCTCTCTTTTGGGGGTGGATGGTGCAAAATTAGAGATTTTTCACTCTGTAATGAGTGAGTGAGAGCATAGCTCCTCCCTTCTGGGTCCTTCCTAGCTCGTTGTCTTCAATACCCATCACTTATTTTTTTCTTCTCACAACTCCAATAACCACAATAAATTTTGACAGATTTTGAAGGTGTCCCCATGGCATTAAAAGATGAGAAATTTGCCAGGCTACCAGGATGAAGTCTCAGCCCCATATCCCACCATTATTGAAGGAGGAGGGAGGGGGAAGTGAATTTCAATTCCAATCAAACTCGTTTACTTTGTTATCAATGATTCTTATCAAAATCTTTCTCAATCTTTTCCTTAAACCACCAAGGACCAAATCTAAGCTGCCGCTTTTGATGCTTCCTTGCAGGTCCATTTTGAGGTAACTAAACGCATTATAGAGCTTCCATGTGCACTCACATTTACTTCTCATAATATATATGC

mRNA sequence

TAATCAATGTCTGCAACTGCAACCTTTTTGCTTCCAATCGCCATTGCCGGGAGCGGAGGGAGAAATGTAATGAGGAACATTGGAACTAAAGATTGTATCTCTAGGGCTTGAAGACGACCGCAAACAGAAGCTTTCTTGAAGCTCAATCACTCCATCTTCTTCGAGCAAGATTCACTCCTACTAACTAAAGAGAAATGGTTCTGGATTCAATCTCTTATTTTCCTTTTTAAGGTTGTGAATTTCATTAGTTCTCATCGTCTGTGTTCTCATTGAATTGCTTGTTATGGAGTATCTGTTACCTCTCTGCAAGCTATGTCACATTGAAACTCGAAGATGGTTTCTTGTGATGGGAGTAGTGGCTTTTACTTATGTACTATTTCAATCTCTTTTACTTCCCTATGGGGATGCTCTCCGGTCTCTACTTCCTGAGGATGGTATTCAAAAATTTGATCAATATAGCATTCCGATGGGGCATACTTCAGCCAAATCAACAACGGTTCGCAACCCTCTTACGGTTCTGGATTTGGCTAATACTTCAGCTCCCGTTGGGAAAACTGATAATTATGTTCTTGAGAAAGGATCTCAACGTGATAGCACGCTGAATGCCAAAGGGAAGTATGTAAAAGATGAGGAAAGCCCTAGAGATGGTTACGAACTGTCTCTTGTTCGAAATCATGATATTGGTTTTGAATCTGGAAAGATGGTTGATACAAATGGCAATTTGGAATCAGATGGCACTAAGAATGGTGGAAATAATTCTATTGTTCACATGGATGGCCAACCAAGTTTTGAGTTCCCCTTGGCACTAAGTGATAAGGTCACTTCAGAGAATGAGTTAGAAGAAATTGGCGAAATGGATTTGGATTTTGGTGAGTTAGAAGAATTTAAAAACTCATCATTACGAAAGCCTGGTGATACAGATGTGACCTTCAATTCCTCGACCTTCATGCTACAGATCCCAACTTCACCAGTTAACACACCTCATTCACAGCACTTGATATCAAATATAAGCTCACCAGTCTCTGAAACTAATTCTAAAAGCATAGGTAAAAGGAAGAAGATGAAGAGTGAAATGCCACCAAAGTCAGTAACTTCACTAGAACAGATGAACAGCATTTTATTGCGTCACCGCAGGTCATCACGAGCGATGAGACCAAGGAGATCGTCTTTGCGTGATCTGGAAATTTTTTCTGCCAAGTCCCAGATTGAGCATGCTTCTGCCATAAATGACCCTGAACTATACGCTCCGTTGTTTCGTAATGTTTCAATGTTTAAAAGGAGTTATGAACTCATGGAGCGCACACTCAAAATCTATGTCTATAGGGATGGAAAGAAGCCCATCTTTCATCAACCAATAATGAAGGGGTTATATGCCTCAGAAGGATGGTTTATGAAACTGATGGAGAGAAACAAGCATTTTGTTGTAAAGGATCCTCGAAAGGCTCACTTGTTTTATATGCCGTTTAGTTCTCGGATGTTGGAGTACACACTCTACGTGCGGAATTCTCATAACAGGACAAATTTACGTCAATTTTTGAAGGAATACTCTGAAAATATTGCAGCCAAATATCCATACTGGAACAGAACTGGTGGAGCAGATCATTTTCTTGTTGCATGCCATGATTGGGCTCCTTATGAAACAAGGCACCACATGGAGCAGTGCATAAAAGCACTTTGCAATGCTGATGTAACCGTTGGCTTCAAAATTGGGAGAGACGTGTCTCTTCCAGAAACTTATGTACGATCCGCGAGGAATCCTCTTAGAGATCTTGGAGGAAAACCTGCATCACAGAGGCACATTCTTGCCTTTTATGCTGGAAATATGCATGGTTATGTACGTCCAATCCTACTGAAGTACTGGAAAGACAAAAACCCTGATATGAAGATCTTTGGTCCAATGCCTCCTGGTGTTGCAAGCAAAATGAACTACATTCAGCATATGAAGAGCAGCAAATACTGTATCTGTCCAAAGGGTTACGAGGTCAATAGTCCCCGAGTCG
TGGAAGCCATCTTTTACGAGTGTGTACCTGTGATCATATCAGACAATTTTGTGCCACCATTTTTTGAGGTGTTGGATTGGGAAGCATTCTCAGTGATTGTTGCAGAAAAGGACATTCCCCACCTACAAGACATACTGCTTTCAATACCAAAAGACAGATATCTCGAGATGCAACTCCGAGTCAGGAAAGTACAGAAGCACTTCCTCTGGCATGCCAAGCCCTTGAAGTATGACCTGTTCCACATGACCCTCCATTCAATCTGGTATAACAGAGTTTTCCAGATAAAACTCAGATAATATTCTTGAAAGTGCCTCATTGAAGATTCAGAGGAAATCCATGGAACTGTTAAATAACTTCACAGAACAGAATAACTCACCTTCTCTGCCTTCGCAGCTTCTCTCTGGAAAGGATTCTTTCAACCTCAGGGGGCATCATGTGAAGGACCTTGAAGAAGGGATTGCAACTGTGGCAGTGCCCTTGTATATACAACGCTTTATCCTTGTTCCACTTCTGTATAGTAGACCAATGAAACAAAACTACGATTATTTGTCAATTTTGCAGTAGAAACTGTGATTTCATTTGCCCCAACCTCTTGTTGTGAGACTCGAGATTTCAAATCATACCGCCCCGCCGTGCATCGCTGTTCATCAACATAGTAAGCCATTTCCAGGACCACAATGGAAGGTATTCCTCTTTCTTTCGGTCATCATCCACATCCATCCAAAGCAATAAATGTTTTCACTCACTGGGTCCATCCAGTCCTTCAACTTGAAGAGCCCCACCTTCAAAAACCGTATTCTCAGGGATCGGAATCTGATTCCTATGCCATATATCATATCTATATGGTGCCACAACATGAAGCTTCCTATGAAAAACAGAGGCAATGTGCAATGACTGGTGTTAATGATCTTGGTAGGTTACTATCAAGAAAGGGACCCTGATAATCAGATATTTAAGATCTCCTACAGGGTCTACTTCACTAATGAGAGTTCAGATGATCATGGAGAGGTACCCAAAAGAAACTAGAACCAAAATATTTTGAAGGTGTCCCCATGGCATTAAAAGATGAGAAATTTGCCAGGCTACCAGGATGAAGTCTCAGCCCCATATCCCACCATTATTGAAGGAGGAGGGAGGGGGAAGTGAATTTCAATTCCAATCAAACTCGTTTACTTTGTTATCAATGATTCTTATCAAAATCTTTCTCAATCTTTTCCTTAAACCACCAAGGACCAAATCTAAGCTGCCGCTTTTGATGCTTCCTTGCAGGTCCATTTTGAGGTAACTAAACGCATTATAGAGCTTCCATGTGCACTCACATTTACTTCTCATAATATATATGC

Coding sequence (CDS)

ATGGAGTATCTGTTACCTCTCTGCAAGCTATGTCACATTGAAACTCGAAGATGGTTTCTTGTGATGGGAGTAGTGGCTTTTACTTATGTACTATTTCAATCTCTTTTACTTCCCTATGGGGATGCTCTCCGGTCTCTACTTCCTGAGGATGGTATTCAAAAATTTGATCAATATAGCATTCCGATGGGGCATACTTCAGCCAAATCAACAACGGTTCGCAACCCTCTTACGGTTCTGGATTTGGCTAATACTTCAGCTCCCGTTGGGAAAACTGATAATTATGTTCTTGAGAAAGGATCTCAACGTGATAGCACGCTGAATGCCAAAGGGAAGTATGTAAAAGATGAGGAAAGCCCTAGAGATGGTTACGAACTGTCTCTTGTTCGAAATCATGATATTGGTTTTGAATCTGGAAAGATGGTTGATACAAATGGCAATTTGGAATCAGATGGCACTAAGAATGGTGGAAATAATTCTATTGTTCACATGGATGGCCAACCAAGTTTTGAGTTCCCCTTGGCACTAAGTGATAAGGTCACTTCAGAGAATGAGTTAGAAGAAATTGGCGAAATGGATTTGGATTTTGGTGAGTTAGAAGAATTTAAAAACTCATCATTACGAAAGCCTGGTGATACAGATGTGACCTTCAATTCCTCGACCTTCATGCTACAGATCCCAACTTCACCAGTTAACACACCTCATTCACAGCACTTGATATCAAATATAAGCTCACCAGTCTCTGAAACTAATTCTAAAAGCATAGGTAAAAGGAAGAAGATGAAGAGTGAAATGCCACCAAAGTCAGTAACTTCACTAGAACAGATGAACAGCATTTTATTGCGTCACCGCAGGTCATCACGAGCGATGAGACCAAGGAGATCGTCTTTGCGTGATCTGGAAATTTTTTCTGCCAAGTCCCAGATTGAGCATGCTTCTGCCATAAATGACCCTGAACTATACGCTCCGTTGTTTCGTAATGTTTCAATGTTTAAAAGGAGTTATGAACTCATGGAGCGCACACTCAAAATCTATGTCTATAGGGATGGAAAGAAGCCCATCTTTCATCAACCAATAATGAAGGGGTTATATGCCTCAGAAGGATGGTTTATGAAACTGATGGAGAGAAACAAGCATTTTGTTGTAAAGGATCCTCGAAAGGCTCACTTGTTTTATATGCCGTTTAGTTCTCGGATGTTGGAGTACACACTCTACGTGCGGAATTCTCATAACAGGACAAATTTACGTCAATTTTTGAAGGAATACTCTGAAAATATTGCAGCCAAATATCCATACTGGAACAGAACTGGTGGAGCAGATCATTTTCTTGTTGCATGCCATGATTGGGCTCCTTATGAAACAAGGCACCACATGGAGCAGTGCATAAAAGCACTTTGCAATGCTGATGTAACCGTTGGCTTCAAAATTGGGAGAGACGTGTCTCTTCCAGAAACTTATGTACGATCCGCGAGGAATCCTCTTAGAGATCTTGGAGGAAAACCTGCATCACAGAGGCACATTCTTGCCTTTTATGCTGGAAATATGCATGGTTATGTACGTCCAATCCTACTGAAGTACTGGAAAGACAAAAACCCTGATATGAAGATCTTTGGTCCAATGCCTCCTGGTGTTGCAAGCAAAATGAACTACATTCAGCATATGAAGAGCAGCAAATACTGTATCTGTCCAAAGGGTTACGAGGTCAATAGTCCCCGAGTCGTGGAAGCCATCTTTTACGAGTGTGTACCTGTGATCATATCAGACAATTTTGTGCCACCATTTTTTGAGGTGTTGGATTGGGAAGCATTCTCAGTGATTGTTGCAGAAAAGGACATTCCCCACCTACAAGACATACTGCTTTCAATACCAAAAGACAGATATCTCGAGATGCAACTCCGAGTCAGGAAAGTACAGAAGCACTTCCTCTGGCATGCCAAGCCCTTGAAGTATGACCTGTTCCACATGACCCTCCATTCAATCTGGTATAACAGAGTTTTCCAGAT
AAAACTCAGATAA

Protein sequence

MEYLLPLCKLCHIETRRWFLVMGVVAFTYVLFQSLLLPYGDALRSLLPEDGIQKFDQYSIPMGHTSAKSTTVRNPLTVLDLANTSAPVGKTDNYVLEKGSQRDSTLNAKGKYVKDEESPRDGYELSLVRNHDIGFESGKMVDTNGNLESDGTKNGGNNSIVHMDGQPSFEFPLALSDKVTSENELEEIGEMDLDFGELEEFKNSSLRKPGDTDVTFNSSTFMLQIPTSPVNTPHSQHLISNISSPVSETNSKSIGKRKKMKSEMPPKSVTSLEQMNSILLRHRRSSRAMRPRRSSLRDLEIFSAKSQIEHASAINDPELYAPLFRNVSMFKRSYELMERTLKIYVYRDGKKPIFHQPIMKGLYASEGWFMKLMERNKHFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRTNLRQFLKEYSENIAAKYPYWNRTGGADHFLVACHDWAPYETRHHMEQCIKALCNADVTVGFKIGRDVSLPETYVRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPDMKIFGPMPPGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLDWEAFSVIVAEKDIPHLQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHMTLHSIWYNRVFQIKLR
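
The protein sequence above corresponds to a translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code (the record does not name the translation table), the first and last codons of the CDS can be translated and compared with the two ends of the protein. `translate` is a hypothetical helper, not part of any database tool.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (an assumption here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 54 nt and last 24 nt of the CDS listed above.
cds_start = "ATGGAGTATCTGTTACCTCTCTGCAAGCTATGTCACATTGAAACTCGAAGATGG"
cds_end = "GTTTTCCAGATAAAACTCAGATAA"

print(translate(cds_start))  # MEYLLPLCKLCHIETRRW
print(translate(cds_end))    # VFQIKLR
```

Both fragments match the corresponding ends of the protein sequence, and the CDS terminates in a TAA stop codon.
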
Homology
BLAST of Cp4.1LG18g04040 vs. ExPASy Swiss-Prot
Match: Q9FFN2 (Probable glycosyltransferase At5g03795 OS=Arabidopsis thaliana OX=3702 GN=At5g03795 PE=3 SV=2)

HSP 1 Score: 325.9 bits (834), Expect = 1.1e-87
Identity = 169/400 (42.25%), Positives = 261/400 (65.25%), Query Frame = 0

Query: 285 SSRAMRPRR----SSLRDLE--IFSAKSQIEHAS---AINDPEL--YAPLFRNVSMFKRS 344
           S+ ++ P++    S+L  +E  +  A++ I+ AS    ++DP+     P++ N  +F RS
Sbjct: 123 STASLEPKKRRVLSNLEKIEFKLQKARASIKAASMDDPVDDPDYVPLGPMYWNAKVFHRS 182

Query: 345 YELMERTLKIYVYRDGKKPIFHQPIMKGLYASEGWFMKLMERNKHFVVKDPRKAHLFYMP 404
           Y  ME+  KIYVY++G+ P+FH    K +Y+ EG F+  +E +  F   +P KAH+FY+P
Sbjct: 183 YLEMEKQFKIYVYKEGEPPLFHDGPCKSIYSMEGSFIYEIETDTRFRTNNPDKAHVFYLP 242

Query: 405 FSSRMLEYTLYVRNSHNRTNLRQFLKEYSENIAAKYPYWNRTGGADHFLVACHDWAP--- 464
           FS   +   +Y RNS + + +R  +K+Y   +  KYPYWNR+ GADHF+++CHDW P   
Sbjct: 243 FSVVKMVRYVYERNSRDFSPIRNTVKDYINLVGDKYPYWNRSIGADHFILSCHDWGPEAS 302

Query: 465 YETRHHMEQCIKALCNADVTVGFKIGRDVSLPETYVRSARNPLRDLGGKPASQRHILAFY 524
           +   H     I+ALCNA+ +  FK  +DVS+PE  +R+  +    +GG   S R ILAF+
Sbjct: 303 FSHPHLGHNSIRALCNANTSERFKPRKDVSIPEINLRTG-SLTGLVGGPSPSSRPILAFF 362

Query: 525 AGNMHGYVRPILLKYWKDKNPDMKIFGPMPPGVASKMNYIQHMKSSKYCICPKGYEVNSP 584
           AG +HG VRP+LL++W++K+ D+++   +P G     +Y   M++SK+CICP GYEV SP
Sbjct: 363 AGGVHGPVRPVLLQHWENKDNDIRVHKYLPRGT----SYSDMMRNSKFCICPSGYEVASP 422

Query: 585 RVVEAIFYECVPVIISDNFVPPFFEVLDWEAFSVIVAEKDIPHLQDILLSIPKDRYLEMQ 644
           R+VEA++  CVPV+I+  +VPPF +VL+W +FSVIV+ +DIP+L+ IL SI   +YL M 
Sbjct: 423 RIVEALYSGCVPVLINSGYVPPFSDVLNWRSFSVIVSVEDIPNLKTILTSISPRQYLRMY 482

Query: 645 LRVRKVQKHFLWHAKPLKYDLFHMTLHSIWYNRVFQIKLR 671
            RV KV++HF  ++   ++D+FHM LHSIW  R+  +K+R
Sbjct: 483 RRVLKVRRHFEVNSPAKRFDVFHMILHSIWVRRL-NVKIR 516
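
The percentages in each HSP header are the match counts divided by the alignment length (which includes gap columns). A small sketch that recomputes the figures for the Q9FFN2 hit, with `percent` as a hypothetical helper name:

```python
def percent(count, aligned_length):
    """Return count as a percentage of the alignment length, to 2 decimals."""
    return round(100 * count / aligned_length, 2)

# Counts taken from the Q9FFN2 HSP header: 169 identical and 261 similar
# positions over 400 aligned columns.
print(percent(169, 400))  # 42.25
print(percent(261, 400))  # 65.25
```

The same calculation applies to the other hits, for example 670/670 giving 100.0 for the self-match in NCBI nr.
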

BLAST of Cp4.1LG18g04040 vs. ExPASy Swiss-Prot
Match: Q9SSE8 (Probable glycosyltransferase At3g07620 OS=Arabidopsis thaliana OX=3702 GN=At3g07620 PE=3 SV=1)

HSP 1 Score: 300.8 bits (769), Expect = 3.8e-80
Identity = 151/382 (39.53%), Positives = 238/382 (62.30%), Query Frame = 0

Query: 294 SSLRDLEIFSAKSQIEHASAINDP---ELYAP---LFRNVSMFKRSYELMERTLKIYVYR 353
           + L    +   ++Q+ ++S  + P   E Y P   ++RN   F RSY LME+  KIYVY 
Sbjct: 90  AELATARVLIREAQLNYSSTTSSPLGDEDYVPHGDIYRNPYAFHRSYLLMEKMFKIYVYE 149

Query: 354 DGKKPIFHQPIMKGLYASEGWFMKLMERN-KHFVVKDPRKAHLFYMPFSSRMLEYTLYVR 413
           +G  PIFH  + K +Y+ EG F+  ME +   +  +DP KAH++++PFS  M+ + L+  
Sbjct: 150 EGDPPIFHYGLCKDIYSMEGLFLNFMENDVLKYRTRDPDKAHVYFLPFSVVMILHHLFDP 209

Query: 414 NSHNRTNLRQFLKEYSENIAAKYPYWNRTGGADHFLVACHDW---APYETRHHMEQCIKA 473
              ++  L + + +Y + I+ KYPYWN + G DHF+++CHDW   A +  +      I+ 
Sbjct: 210 VVRDKAVLERVIADYVQIISKKYPYWNTSDGFDHFMLSCHDWGHRATWYVKKLFFNSIRV 269

Query: 474 LCNADVTVGFKIGRDVSLPETYVRSARNPLRDL-GGKPASQRHILAFYAGNMHGYVRPIL 533
           LCNA+++  F   +D   PE  +      + +L GG     R  LAF+AG  HG +RP+L
Sbjct: 270 LCNANISEYFNPEKDAPFPE--INLLTGDINNLTGGLDPISRTTLAFFAGKSHGKIRPVL 329

Query: 534 LKYWKDKNPDMKIFGPMPPGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVP 593
           L +WK+K+ D+ ++  +P G    ++Y + M+ S++CICP G+EV SPRV EAI+  CVP
Sbjct: 330 LNHWKEKDKDILVYENLPDG----LDYTEMMRKSRFCICPSGHEVASPRVPEAIYSGCVP 389

Query: 594 VIISDNFVPPFFEVLDWEAFSVIVAEKDIPHLQDILLSIPKDRYLEMQLRVRKVQKHFLW 653
           V+IS+N+V PF +VL+WE FSV V+ K+IP L+ IL+ IP++RY+ +   V+KV++H L 
Sbjct: 390 VLISENYVLPFSDVLNWEKFSVSVSVKEIPELKRILMDIPEERYMRLYEGVKKVKRHILV 449

Query: 654 HAKPLKYDLFHMTLHSIWYNRV 665
           +  P +YD+F+M +HSIW  R+
Sbjct: 450 NDPPKRYDVFNMIIHSIWLRRL 465

BLAST of Cp4.1LG18g04040 vs. ExPASy Swiss-Prot
Match: Q3E7Q9 (Probable glycosyltransferase At5g25310 OS=Arabidopsis thaliana OX=3702 GN=At5g25310 PE=3 SV=2)

HSP 1 Score: 298.1 bits (762), Expect = 2.5e-79
Identity = 167/428 (39.02%), Positives = 250/428 (58.41%), Query Frame = 0

Query: 249 TNSKSIGKRKKMKSEMPPKSVTSLEQMNSILLRHRRSSRAMRPRRSSLRDLEIFSAKSQI 308
           T+S     R  + S    + + ++   NS L      S+  +  R +L +  +  A++ I
Sbjct: 58  TSSSGEENRVVVDSRHVSQQILTVRSTNSTL-----QSKPEKLNRRNLVEQGLAKARASI 117

Query: 309 EHASAINDPELY------APLFRNVSMFKRSYELMERTLKIYVYRDGKKPIFHQPIMKGL 368
             AS+  +  L+      + ++RN S   RSY  ME+  K+YVY +G+ P+ H    K +
Sbjct: 118 LEASSNVNTTLFKSDLPNSEIYRNPSALYRSYLEMEKRFKVYVYEEGEPPLVHDGPCKSV 177

Query: 369 YASEGWFMKLME-RNKHFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRTNLRQFLKEY 428
           YA EG F+  ME R   F   DP +A+++++PFS   L   LY  NS  +  L+ F+ +Y
Sbjct: 178 YAVEGRFITEMEKRRTKFRTYDPNQAYVYFLPFSVTWLVRYLYEGNSDAKP-LKTFVSDY 237

Query: 429 SENIAAKYPYWNRTGGADHFLVACHDWAPYET---RHHMEQCIKALCNADVTVGFKIGRD 488
              ++  +P+WNRT GADHF++ CHDW P  +   R      I+ +CNA+ + GF   +D
Sbjct: 238 IRLVSTNHPFWNRTNGADHFMLTCHDWGPLTSQANRDLFNTSIRVMCNANSSEGFNPTKD 297

Query: 489 VSLPE--TYVRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPDMKIF 548
           V+LPE   Y     + LR      AS R  L F+AG +HG VRPILLK+WK ++ DM ++
Sbjct: 298 VTLPEIKLYGGEVDHKLRLSKTLSASPRPYLGFFAGGVHGPVRPILLKHWKQRDLDMPVY 357

Query: 549 GPMPPGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEV 608
             +P      +NY   M+SSK+C CP GYEV SPRV+EAI+ EC+PVI+S NFV PF +V
Sbjct: 358 EYLP----KHLNYYDFMRSSKFCFCPSGYEVASPRVIEAIYSECIPVILSVNFVLPFTDV 417

Query: 609 LDWEAFSVIVAEKDIPHLQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHMTL 665
           L WE FSV+V   +IP L++IL+SI  ++Y  ++  +R V++HF  +  P ++D FH+TL
Sbjct: 418 LRWETFSVLVDVSEIPRLKEILMSISNEKYEWLKSNLRYVRRHFELNDPPQRFDAFHLTL 475

BLAST of Cp4.1LG18g04040 vs. ExPASy Swiss-Prot
Match: Q9LFP3 (Probable glycosyltransferase At5g11130 OS=Arabidopsis thaliana OX=3702 GN=At5g11130/At5g11120 PE=3 SV=2)

HSP 1 Score: 279.6 bits (714), Expect = 9.0e-74
Identity = 143/351 (40.74%), Positives = 216/351 (61.54%), Query Frame = 0

Query: 323 LFRNVSMFKRSYELMERTLKIYVYRDGKKPIFHQPIMKGLYASEGWFMKLMER-NKHFVV 382
           ++ N   F +S++ ME+  KI+ YR+G+ P+FH+  +  +YA EG FM  +E  N  F  
Sbjct: 131 VYLNAFTFHQSHKEMEKRFKIWTYREGEAPLFHKGPLNNIYAIEGQFMDEIENGNSRFKA 190

Query: 383 KDPRKAHLFYMPFS-SRMLEYTLYVRNSHNRTNLRQFLKEYSENIAAKYPYWNRTGGADH 442
             P +A +FY+P     ++ +      S+ R  L+  +K+Y   I+ +YPYWNR+ GADH
Sbjct: 191 ASPEEATVFYIPVGIVNIIRFVYRPYTSYARDRLQNIVKDYISLISNRYPYWNRSRGADH 250

Query: 443 FLVACHDWAPYETRHHME---QCIKALCNADVTVGFKIGRDVSLPETYVRSARNPLRDLG 502
           F ++CHDWAP  +    E     I+ALCNA+ + GF   RDVSLPE  +     P   LG
Sbjct: 251 FFLSCHDWAPDVSAVDPELYKHFIRALCNANSSEGFTPMRDVSLPEINI-----PHSQLG 310

Query: 503 ----GKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPDMKIFGPMPPGVASKMNYIQHM 562
               G+P   R +LAF+AG  HG VR IL ++WK+K+ D+ ++  +P      MNY + M
Sbjct: 311 FVHTGEPPQNRKLLAFFAGGSHGDVRKILFQHWKEKDKDVLVYENLP----KTMNYTKMM 370

Query: 563 KSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLDWEAFSVIVAEKDIPH 622
             +K+C+CP G+EV SPR+VE+++  CVPVII+D +V PF +VL+W+ FSV +    +P 
Sbjct: 371 DKAKFCLCPSGWEVASPRIVESLYSGCVPVIIADYYVLPFSDVLNWKTFSVHIPISKMPD 430

Query: 623 LQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHMTLHSIWYNRV 665
           ++ IL +I ++ YL MQ RV +V+KHF+ +     YD+ HM +HSIW  R+
Sbjct: 431 IKKILEAITEEEYLNMQRRVLEVRKHFVINRPSKPYDMLHMIMHSIWLRRL 472

BLAST of Cp4.1LG18g04040 vs. ExPASy Swiss-Prot
Match: Q3E9A4 (Probable glycosyltransferase At5g20260 OS=Arabidopsis thaliana OX=3702 GN=At5g20260 PE=3 SV=3)

HSP 1 Score: 273.5 bits (698), Expect = 6.5e-72
Identity = 140/347 (40.35%), Positives = 209/347 (60.23%), Query Frame = 0

Query: 323 LFRNVSMFKRSYELMERTLKIYVYRDGKKPIFHQPIMKGLYASEGWFMKLMERN-KHFVV 382
           ++RN   F +S+  ME+  K++VYR+G+ P+ H   M  +Y+ EG FM  +E     F  
Sbjct: 119 VYRNAFAFHQSHIEMEKKFKVWVYREGETPLVHMGPMNNIYSIEGQFMDEIETGMSPFAA 178

Query: 383 KDPRKAHLFYMPFS-SRMLEYTLYVRNSHNRTNLRQFLKEYSENIAAKYPYWNRTGGADH 442
            +P +AH F +P S + ++ Y      +++R  L +   +Y + +A KYPYWNR+ GADH
Sbjct: 179 NNPEEAHAFLLPVSVANIVHYLYRPLVTYSREQLHKVFLDYVDVVAHKYPYWNRSLGADH 238

Query: 443 FLVACHDWAPYETRHH---MEQCIKALCNADVTVGFKIGRDVSLPETYVRSARNPLRDLG 502
           F V+CHDWAP  +  +   M+  I+ LCNA+ + GF   RDVS+PE  +         L 
Sbjct: 239 FYVSCHDWAPDVSGSNPELMKNLIRVLCNANTSEGFMPQRDVSIPEINIPGGHLGPPRLS 298

Query: 503 GKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPDMKIFGPMPPGVASKMNYIQHMKSSK 562
                 R ILAF+AG  HGY+R ILL++WKDK+ ++++       +A   +Y + M +++
Sbjct: 299 RSSGHDRPILAFFAGGSHGYIRRILLQHWKDKDEEVQVH----EYLAKNKDYFKLMATAR 358

Query: 563 YCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLDWEAFSVIVAEKDIPHLQDI 622
           +C+CP GYEV SPRVV AI   CVPVIISD++  PF +VLDW  F++ V  K IP ++ I
Sbjct: 359 FCLCPSGYEVASPRVVAAINLGCVPVIISDHYALPFSDVLDWTKFTIHVPSKKIPEIKTI 418

Query: 623 LLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHMTLHSIWYNRV 665
           L SI   RY  +Q RV +VQ+HF+ +     +D+  M LHS+W  R+
Sbjct: 419 LKSISWRRYRVLQRRVLQVQRHFVINRPSQPFDMLRMLLHSVWLRRL 461

BLAST of Cp4.1LG18g04040 vs. NCBI nr
Match: XP_023516188.1 (probable glycosyltransferase At5g03795 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1354 bits (3505), Expect = 0.0
Identity = 670/670 (100.00%), Positives = 670/670 (100.00%), Query Frame = 0

Query: 1   MEYLLPLCKLCHIETRRWFLVMGVVAFTYVLFQSLLLPYGDALRSLLPEDGIQKFDQYSI 60
           MEYLLPLCKLCHIETRRWFLVMGVVAFTYVLFQSLLLPYGDALRSLLPEDGIQKFDQYSI
Sbjct: 1   MEYLLPLCKLCHIETRRWFLVMGVVAFTYVLFQSLLLPYGDALRSLLPEDGIQKFDQYSI 60

Query: 61  PMGHTSAKSTTVRNPLTVLDLANTSAPVGKTDNYVLEKGSQRDSTLNAKGKYVKDEESPR 120
           PMGHTSAKSTTVRNPLTVLDLANTSAPVGKTDNYVLEKGSQRDSTLNAKGKYVKDEESPR
Sbjct: 61  PMGHTSAKSTTVRNPLTVLDLANTSAPVGKTDNYVLEKGSQRDSTLNAKGKYVKDEESPR 120

Query: 121 DGYELSLVRNHDIGFESGKMVDTNGNLESDGTKNGGNNSIVHMDGQPSFEFPLALSDKVT 180
           DGYELSLVRNHDIGFESGKMVDTNGNLESDGTKNGGNNSIVHMDGQPSFEFPLALSDKVT
Sbjct: 121 DGYELSLVRNHDIGFESGKMVDTNGNLESDGTKNGGNNSIVHMDGQPSFEFPLALSDKVT 180

Query: 181 SENELEEIGEMDLDFGELEEFKNSSLRKPGDTDVTFNSSTFMLQIPTSPVNTPHSQHLIS 240
           SENELEEIGEMDLDFGELEEFKNSSLRKPGDTDVTFNSSTFMLQIPTSPVNTPHSQHLIS
Sbjct: 181 SENELEEIGEMDLDFGELEEFKNSSLRKPGDTDVTFNSSTFMLQIPTSPVNTPHSQHLIS 240

Query: 241 NISSPVSETNSKSIGKRKKMKSEMPPKSVTSLEQMNSILLRHRRSSRAMRPRRSSLRDLE 300
           NISSPVSETNSKSIGKRKKMKSEMPPKSVTSLEQMNSILLRHRRSSRAMRPRRSSLRDLE
Sbjct: 241 NISSPVSETNSKSIGKRKKMKSEMPPKSVTSLEQMNSILLRHRRSSRAMRPRRSSLRDLE 300

Query: 301 IFSAKSQIEHASAINDPELYAPLFRNVSMFKRSYELMERTLKIYVYRDGKKPIFHQPIMK 360
           IFSAKSQIEHASAINDPELYAPLFRNVSMFKRSYELMERTLKIYVYRDGKKPIFHQPIMK
Sbjct: 301 IFSAKSQIEHASAINDPELYAPLFRNVSMFKRSYELMERTLKIYVYRDGKKPIFHQPIMK 360

Query: 361 GLYASEGWFMKLMERNKHFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRTNLRQFLKE 420
           GLYASEGWFMKLMERNKHFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRTNLRQFLKE
Sbjct: 361 GLYASEGWFMKLMERNKHFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRTNLRQFLKE 420

Query: 421 YSENIAAKYPYWNRTGGADHFLVACHDWAPYETRHHMEQCIKALCNADVTVGFKIGRDVS 480
           YSENIAAKYPYWNRTGGADHFLVACHDWAPYETRHHMEQCIKALCNADVTVGFKIGRDVS
Sbjct: 421 YSENIAAKYPYWNRTGGADHFLVACHDWAPYETRHHMEQCIKALCNADVTVGFKIGRDVS 480

Query: 481 LPETYVRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPDMKIFGPMP 540
           LPETYVRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPDMKIFGPMP
Sbjct: 481 LPETYVRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPDMKIFGPMP 540

Query: 541 PGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLDWE 600
           PGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLDWE
Sbjct: 541 PGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLDWE 600

Query: 601 AFSVIVAEKDIPHLQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHMTLHSIW 660
           AFSVIVAEKDIPHLQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHMTLHSIW
Sbjct: 601 AFSVIVAEKDIPHLQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHMTLHSIW 660

Query: 661 YNRVFQIKLR 670
           YNRVFQIKLR
Sbjct: 661 YNRVFQIKLR 670

BLAST of Cp4.1LG18g04040 vs. NCBI nr
Match: XP_022961026.1 (probable glycosyltransferase At5g03795 [Cucurbita moschata])

HSP 1 Score: 1331 bits (3445), Expect = 0.0
Identity = 656/670 (97.91%), Positives = 665/670 (99.25%), Query Frame = 0

Query: 1   MEYLLPLCKLCHIETRRWFLVMGVVAFTYVLFQSLLLPYGDALRSLLPEDGIQKFDQYSI 60
           MEYLLPLCKLCHIETRRWF+VMGVVAFTYVLFQSLLLPYGDALRSLLPEDG QKFDQYSI
Sbjct: 1   MEYLLPLCKLCHIETRRWFIVMGVVAFTYVLFQSLLLPYGDALRSLLPEDGTQKFDQYSI 60

Query: 61  PMGHTSAKSTTVRNPLTVLDLANTSAPVGKTDNYVLEKGSQRDSTLNAKGKYVKDEESPR 120
            MGHTSAKSTTVRNPLTVLDLAN SAP+GKTDNY+LEKGSQRDS+LNA+GKYVKDEESPR
Sbjct: 61  QMGHTSAKSTTVRNPLTVLDLANISAPIGKTDNYILEKGSQRDSSLNARGKYVKDEESPR 120

Query: 121 DGYELSLVRNHDIGFESGKMVDTNGNLESDGTKNGGNNSIVHMDGQPSFEFPLALSDKVT 180
           DGY+LSL RNHDIGF+SGKMVDTNGNLESDGTKNGGNNSIVHMDGQPSFEFPLA+SDKVT
Sbjct: 121 DGYKLSLNRNHDIGFDSGKMVDTNGNLESDGTKNGGNNSIVHMDGQPSFEFPLAVSDKVT 180

Query: 181 SENELEEIGEMDLDFGELEEFKNSSLRKPGDTDVTFNSSTFMLQIPTSPVNTPHSQHLIS 240
           SENELEEIGEMDLDFGELEEFKNSSLRKPGDTDVTFNSSTFMLQIPTSPVNTPHSQHLIS
Sbjct: 181 SENELEEIGEMDLDFGELEEFKNSSLRKPGDTDVTFNSSTFMLQIPTSPVNTPHSQHLIS 240

Query: 241 NISSPVSETNSKSIGKRKKMKSEMPPKSVTSLEQMNSILLRHRRSSRAMRPRRSSLRDLE 300
           NISSPVSETNSKSIGKRKKMKSEMPPKSVTSLEQMNSILLRHRRSSRAMRPRRSSLRDLE
Sbjct: 241 NISSPVSETNSKSIGKRKKMKSEMPPKSVTSLEQMNSILLRHRRSSRAMRPRRSSLRDLE 300

Query: 301 IFSAKSQIEHASAINDPELYAPLFRNVSMFKRSYELMERTLKIYVYRDGKKPIFHQPIMK 360
           IFSAKSQIE ASAINDPELYAPLFRNVSMFKRSYELMERTLKIYVYRDGKKPIFHQPIMK
Sbjct: 301 IFSAKSQIEQASAINDPELYAPLFRNVSMFKRSYELMERTLKIYVYRDGKKPIFHQPIMK 360

Query: 361 GLYASEGWFMKLMERNKHFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRTNLRQFLKE 420
           GLYASEGWFMKLMERNKHFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRTNLRQFLKE
Sbjct: 361 GLYASEGWFMKLMERNKHFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRTNLRQFLKE 420

Query: 421 YSENIAAKYPYWNRTGGADHFLVACHDWAPYETRHHMEQCIKALCNADVTVGFKIGRDVS 480
           YSE+IAAKYPYWNRTGGADHFLVACHDWAPYETRHHMEQCIKALCNADVTVGFKIGRDVS
Sbjct: 421 YSESIAAKYPYWNRTGGADHFLVACHDWAPYETRHHMEQCIKALCNADVTVGFKIGRDVS 480

Query: 481 LPETYVRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPDMKIFGPMP 540
           LPETYVRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPDMKIFGPMP
Sbjct: 481 LPETYVRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPDMKIFGPMP 540

Query: 541 PGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLDWE 600
           PGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLDWE
Sbjct: 541 PGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLDWE 600

Query: 601 AFSVIVAEKDIPHLQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHMTLHSIW 660
           AFSVIVAEKDIPHLQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHMTLHSIW
Sbjct: 601 AFSVIVAEKDIPHLQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHMTLHSIW 660

Query: 661 YNRVFQIKLR 670
           YNRVFQIKLR
Sbjct: 661 YNRVFQIKLR 670

BLAST of Cp4.1LG18g04040 vs. NCBI nr
Match: XP_022987636.1 (probable glycosyltransferase At5g03795 [Cucurbita maxima])

HSP 1 Score: 1308 bits (3385), Expect = 0.0
Identity = 648/675 (96.00%), Positives = 658/675 (97.48%), Query Frame = 0

Query: 1   MEYLLPLCKLCHIETRRWFLVMGVVAFTYVLFQSLLLPYGDALRSLLPEDGIQKFDQYSI 60
           MEYLLPLCKLCHIETRRWFLVMGVVAFTYVLFQSLLLPYGDALRSLLPEDGI+KFDQYSI
Sbjct: 1   MEYLLPLCKLCHIETRRWFLVMGVVAFTYVLFQSLLLPYGDALRSLLPEDGIKKFDQYSI 60

Query: 61  PMGHTSAKSTTVRNPLTVLDLANTSAPVGKTDNYVLEKGSQRDSTLNAKGKYVKDEESPR 120
            MGHTSAKSTTVRNPLTVLDLANTSAP+GKTDNY+LEKGSQRDSTLNA+GKYVKDEESPR
Sbjct: 61  QMGHTSAKSTTVRNPLTVLDLANTSAPIGKTDNYILEKGSQRDSTLNARGKYVKDEESPR 120

Query: 121 DGYELSLVRNHDIGFESGKMVDTNGNLESDGTKNGGNNSIVHMDGQPSFEFPLAL----- 180
           DGY+LSL RNHDIGFESGKMVDTNGNLESDGTKN GNNSI+HMDG+ SFEFPL       
Sbjct: 121 DGYKLSLNRNHDIGFESGKMVDTNGNLESDGTKNRGNNSILHMDGEASFEFPLGQQFVKQ 180

Query: 181 SDKVTSENELEEIGEMDLDFGELEEFKNSSLRKPGDTDVTFNSSTFMLQIPTSPVNTPHS 240
           SD V SENELEE GEMDLDFGELEEFKNS LRKPGDTDVTFNSSTFMLQIPTSPVNTPHS
Sbjct: 181 SDTVASENELEEFGEMDLDFGELEEFKNSLLRKPGDTDVTFNSSTFMLQIPTSPVNTPHS 240

Query: 241 QHLISNISSPVSETNSKSIGKRKKMKSEMPPKSVTSLEQMNSILLRHRRSSRAMRPRRSS 300
           QHLISNISSPVSETNSKSIGKRKKMK+EMPPKSVTSLE+MNSILLRHRRSSRAMRPRRSS
Sbjct: 241 QHLISNISSPVSETNSKSIGKRKKMKNEMPPKSVTSLEEMNSILLRHRRSSRAMRPRRSS 300

Query: 301 LRDLEIFSAKSQIEHASAINDPELYAPLFRNVSMFKRSYELMERTLKIYVYRDGKKPIFH 360
           LRDLEIFSAKSQIE ASAINDPELY PLFRNVSMFKRSYELMERTLKIYVYRDGKKPIFH
Sbjct: 301 LRDLEIFSAKSQIEQASAINDPELYTPLFRNVSMFKRSYELMERTLKIYVYRDGKKPIFH 360

Query: 361 QPIMKGLYASEGWFMKLMERNKHFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRTNLR 420
           QPIMKGLYASEGWFMKLMERNKHFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRTNLR
Sbjct: 361 QPIMKGLYASEGWFMKLMERNKHFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRTNLR 420

Query: 421 QFLKEYSENIAAKYPYWNRTGGADHFLVACHDWAPYETRHHMEQCIKALCNADVTVGFKI 480
           QFLKEYSE+IAAKYPYWNRTGGADHFLVACHDWAPYETRHHMEQCIKALCNADVTVGFKI
Sbjct: 421 QFLKEYSESIAAKYPYWNRTGGADHFLVACHDWAPYETRHHMEQCIKALCNADVTVGFKI 480

Query: 481 GRDVSLPETYVRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPDMKI 540
           GRDVSLPETYVRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPDMKI
Sbjct: 481 GRDVSLPETYVRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPDMKI 540

Query: 541 FGPMPPGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFE 600
           FGPMPPGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFE
Sbjct: 541 FGPMPPGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFE 600

Query: 601 VLDWEAFSVIVAEKDIPHLQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHMT 660
           VLDWEAFSVIVAEKDIPHLQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHMT
Sbjct: 601 VLDWEAFSVIVAEKDIPHLQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHMT 660

Query: 661 LHSIWYNRVFQIKLR 670
           LHSIWYNRVFQIKLR
Sbjct: 661 LHSIWYNRVFQIKLR 675

BLAST of Cp4.1LG18g04040 vs. NCBI nr
Match: KAG6589943.1 (putative glycosyltransferase, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1269 bits (3284), Expect = 0.0
Identity = 628/639 (98.28%), Positives = 634/639 (99.22%), Query Frame = 0

Query: 22  MGVVAFTYVLFQSLLLPYGDALRSLLPEDGIQKFDQYSIPMGHTSAKSTTVRNPLTVLDL 81
           MGVVAFTYVLFQSLLLPYGDALRSLLPEDG QKFDQYSI MGHTSAKSTTVRNPLTVLDL
Sbjct: 1   MGVVAFTYVLFQSLLLPYGDALRSLLPEDGTQKFDQYSIQMGHTSAKSTTVRNPLTVLDL 60

Query: 82  ANTSAPVGKTDNYVLEKGSQRDSTLNAKGKYVKDEESPRDGYELSLVRNHDIGFESGKMV 141
           AN SAP+GKTDNY+LEKGSQRDSTLNA+GKYVKDEESPRDGY+LSL RNHDIGFESGKMV
Sbjct: 61  ANISAPIGKTDNYILEKGSQRDSTLNARGKYVKDEESPRDGYKLSLNRNHDIGFESGKMV 120

Query: 142 DTNGNLESDGTKNGGNNSIVHMDGQPSFEFPLALSDKVTSENELEEIGEMDLDFGELEEF 201
           DTNGNLESDGTKNGGNNSIVHMDGQPSFEFPLA+SDKVTSENELEEIGEMDLDFGELEEF
Sbjct: 121 DTNGNLESDGTKNGGNNSIVHMDGQPSFEFPLAVSDKVTSENELEEIGEMDLDFGELEEF 180

Query: 202 KNSSLRKPGDTDVTFNSSTFMLQIPTSPVNTPHSQHLISNISSPVSETNSKSIGKRKKMK 261
           KNSSLRKPGDTDVTFNSSTFMLQIPTSPVNTPHSQHLISNISSPVSETNSKSIGKRKKMK
Sbjct: 181 KNSSLRKPGDTDVTFNSSTFMLQIPTSPVNTPHSQHLISNISSPVSETNSKSIGKRKKMK 240

Query: 262 SEMPPKSVTSLEQMNSILLRHRRSSRAMRPRRSSLRDLEIFSAKSQIEHASAINDPELYA 321
           SEMPPKSVTSLEQMNSILLRHRRSSRAMRPRRSSLRDLEIFSAKSQIE ASAINDPELYA
Sbjct: 241 SEMPPKSVTSLEQMNSILLRHRRSSRAMRPRRSSLRDLEIFSAKSQIEQASAINDPELYA 300

Query: 322 PLFRNVSMFKRSYELMERTLKIYVYRDGKKPIFHQPIMKGLYASEGWFMKLMERNKHFVV 381
           PLFRNVSMFKRSYELMERTLKIYVYRDGKKPIFHQPIMKGLYASEGWFMKLMERNKHFVV
Sbjct: 301 PLFRNVSMFKRSYELMERTLKIYVYRDGKKPIFHQPIMKGLYASEGWFMKLMERNKHFVV 360

Query: 382 KDPRKAHLFYMPFSSRMLEYTLYVRNSHNRTNLRQFLKEYSENIAAKYPYWNRTGGADHF 441
           KDPRKAHLFYMPFSSRMLEYTLYVRNSHNRTNLRQFLKEYSE+IAAKYPYWNRTGGADHF
Sbjct: 361 KDPRKAHLFYMPFSSRMLEYTLYVRNSHNRTNLRQFLKEYSESIAAKYPYWNRTGGADHF 420

Query: 442 LVACHDWAPYETRHHMEQCIKALCNADVTVGFKIGRDVSLPETYVRSARNPLRDLGGKPA 501
           LVACHDWAPYETRHHMEQCIKALCNADVTVGFKIGRDVSLPETYVRSARNPLRDLGGKPA
Sbjct: 421 LVACHDWAPYETRHHMEQCIKALCNADVTVGFKIGRDVSLPETYVRSARNPLRDLGGKPA 480

Query: 502 SQRHILAFYAGNMHGYVRPILLKYWKDKNPDMKIFGPMPPGVASKMNYIQHMKSSKYCIC 561
           SQRHILAFYAGNMHGYVRPILLKYWKDKNPDMKIFGPMPPGVASKMNYIQHMKSSKYCIC
Sbjct: 481 SQRHILAFYAGNMHGYVRPILLKYWKDKNPDMKIFGPMPPGVASKMNYIQHMKSSKYCIC 540

Query: 562 PKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLDWEAFSVIVAEKDIPHLQDILLSI 621
           PKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLDWEAFSVIVAEKDIPHLQDILLSI
Sbjct: 541 PKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLDWEAFSVIVAEKDIPHLQDILLSI 600

Query: 622 PKDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHMTLHSIW 660
           PKDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHMTLHSIW
Sbjct: 601 PKDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHMTLHSIW 639

BLAST of Cp4.1LG18g04040 vs. NCBI nr
Match: XP_038879145.1 (probable glycosyltransferase At5g03795 [Benincasa hispida] >XP_038879147.1 probable glycosyltransferase At5g03795 [Benincasa hispida] >XP_038879148.1 probable glycosyltransferase At5g03795 [Benincasa hispida])

HSP 1 Score: 1214 bits (3140), Expect = 0.0
Identity = 605/676 (89.50%), Positives = 628/676 (92.90%), Query Frame = 0

Query: 1   MEYLLPLCKLCHIETRRWFLVMGVVAFTYVLFQSLLLPYGDALRSLLPEDGIQKFDQYSI 60
           MEYLLPLCKLCHIETRRW  ++GVVAFTYV+FQSLLLPYGDALRSLLPEDGIQK+DQY+I
Sbjct: 1   MEYLLPLCKLCHIETRRWLFLVGVVAFTYVIFQSLLLPYGDALRSLLPEDGIQKYDQYNI 60

Query: 61  PMGHTSAKSTTVRNPLTVLDLANTSA-PVGKTDNYVLEKGSQRDSTLNAKGKYVKDEESP 120
            MG TSAK TTVRNPLTVLDLAN S  P+G TDN++LE+G QRDSTLN KGKYVK++ S 
Sbjct: 61  YMGSTSAKLTTVRNPLTVLDLANVSTTPIGNTDNFILERGFQRDSTLNGKGKYVKEKGSS 120

Query: 121 RDGYELSLVRNHDIGFESGKMVDTNGNLESDGTKNGGNNSIVHMDGQPSFEFPLAL---- 180
           RDGYELSL  NHDIGFESG  VDTNGNLES GTKN  NNSI+H+DG+ SFEFPL      
Sbjct: 121 RDGYELSLNGNHDIGFESGNNVDTNGNLESYGTKNRVNNSILHVDGETSFEFPLEQQVVK 180

Query: 181 -SDKVTSENELEEIGEMDLDFGELEEFKNSSLRKPGDTDVTFNSSTFMLQIPTSPVNTPH 240
            SD +TSENELEE G+MD DFGELEEFK SSL KP D DV FNSSTFMLQI TSPVNT H
Sbjct: 181 PSDTITSENELEEFGQMDSDFGELEEFKTSSLEKPEDADVAFNSSTFMLQISTSPVNTSH 240

Query: 241 SQHLISNISSPVSETNSKSIGKRKKMKSEMPPKSVTSLEQMNSILLRHRRSSRAMRPRRS 300
           SQHLISNISS VSETNSKS+GKRKKMKSEMPPKSVTSLE+MN ILLRHRRSSRAMRPRRS
Sbjct: 241 SQHLISNISSSVSETNSKSVGKRKKMKSEMPPKSVTSLEEMNRILLRHRRSSRAMRPRRS 300

Query: 301 SLRDLEIFSAKSQIEHASAINDPELYAPLFRNVSMFKRSYELMERTLKIYVYRDGKKPIF 360
           SLRD EIFSA+SQIE ASAINDPELY PLFRNVSMFKRSYELMERTLKIYVYRDGKKPIF
Sbjct: 301 SLRDQEIFSARSQIEQASAINDPELYTPLFRNVSMFKRSYELMERTLKIYVYRDGKKPIF 360

Query: 361 HQPIMKGLYASEGWFMKLMERNKHFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRTNL 420
           HQPI+KGLYASEGWFMKLME NK FVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRTNL
Sbjct: 361 HQPILKGLYASEGWFMKLMEGNKRFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRTNL 420

Query: 421 RQFLKEYSENIAAKYPYWNRTGGADHFLVACHDWAPYETRHHMEQCIKALCNADVTVGFK 480
           RQFLKEY+E+IAAKYPYWNRTGGADHFLV CHDWAPYETRHHME CIKALCNADVTVGFK
Sbjct: 421 RQFLKEYAEHIAAKYPYWNRTGGADHFLVGCHDWAPYETRHHMEHCIKALCNADVTVGFK 480

Query: 481 IGRDVSLPETYVRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPDMK 540
           IGRDVSLPETYVRS RNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPDMK
Sbjct: 481 IGRDVSLPETYVRSMRNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPDMK 540

Query: 541 IFGPMPPGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFF 600
           IFGPMPPGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFF
Sbjct: 541 IFGPMPPGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFF 600

Query: 601 EVLDWEAFSVIVAEKDIPHLQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHM 660
           EVLDWEAFSVIVAEKDIP+LQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHM
Sbjct: 601 EVLDWEAFSVIVAEKDIPNLQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHM 660

Query: 661 TLHSIWYNRVFQIKLR 670
           TLHSIWYNRVFQIKLR
Sbjct: 661 TLHSIWYNRVFQIKLR 676
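
The percentages in each HSP statistics line follow from the counts beside them: identical (or positive) positions divided by the alignment length. The sketch below is a minimal illustration, not part of the database's tooling. It parses one such line and recomputes both percentages. The function name `parse_hsp_stats` and the returned field names are hypothetical, and the pattern assumes the line layout shown on this page (it accepts both the "Postives" and "Positives" spellings).

```python
import re

# Pattern for the per-HSP statistics line as printed on this page.
# "Posi?tives" accepts both the "Postives" and "Positives" spellings.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP statistics line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    ident, ident_len, pos, pos_len = map(int, (ident, ident_len, pos, pos_len))
    return {
        "identical": ident,
        "positives": pos,
        "alignment_length": ident_len,
        # Percentages recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100.0 * ident / ident_len, 2),
        "positives_pct": round(100.0 * pos / pos_len, 2),
        # Percentages as reported in the line itself, for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


# Example: the Benincasa hispida hit (XP_038879145.1) above.
stats = parse_hsp_stats(
    "Identity = 605/676 (89.50%), Postives = 628/676 (92.90%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed and reported values is a cheap consistency check when these lines are scraped from a page rather than read from structured BLAST output.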

BLAST of Cp4.1LG18g04040 vs. ExPASy TrEMBL
Match: A0A6J1H980 (probable glycosyltransferase At5g03795 OS=Cucurbita moschata OX=3662 GN=LOC111461655 PE=3 SV=1)

HSP 1 Score: 1331 bits (3445), Expect = 0.0
Identity = 656/670 (97.91%), Positives = 665/670 (99.25%), Query Frame = 0

Query: 1   MEYLLPLCKLCHIETRRWFLVMGVVAFTYVLFQSLLLPYGDALRSLLPEDGIQKFDQYSI 60
           MEYLLPLCKLCHIETRRWF+VMGVVAFTYVLFQSLLLPYGDALRSLLPEDG QKFDQYSI
Sbjct: 1   MEYLLPLCKLCHIETRRWFIVMGVVAFTYVLFQSLLLPYGDALRSLLPEDGTQKFDQYSI 60

Query: 61  PMGHTSAKSTTVRNPLTVLDLANTSAPVGKTDNYVLEKGSQRDSTLNAKGKYVKDEESPR 120
            MGHTSAKSTTVRNPLTVLDLAN SAP+GKTDNY+LEKGSQRDS+LNA+GKYVKDEESPR
Sbjct: 61  QMGHTSAKSTTVRNPLTVLDLANISAPIGKTDNYILEKGSQRDSSLNARGKYVKDEESPR 120

Query: 121 DGYELSLVRNHDIGFESGKMVDTNGNLESDGTKNGGNNSIVHMDGQPSFEFPLALSDKVT 180
           DGY+LSL RNHDIGF+SGKMVDTNGNLESDGTKNGGNNSIVHMDGQPSFEFPLA+SDKVT
Sbjct: 121 DGYKLSLNRNHDIGFDSGKMVDTNGNLESDGTKNGGNNSIVHMDGQPSFEFPLAVSDKVT 180

Query: 181 SENELEEIGEMDLDFGELEEFKNSSLRKPGDTDVTFNSSTFMLQIPTSPVNTPHSQHLIS 240
           SENELEEIGEMDLDFGELEEFKNSSLRKPGDTDVTFNSSTFMLQIPTSPVNTPHSQHLIS
Sbjct: 181 SENELEEIGEMDLDFGELEEFKNSSLRKPGDTDVTFNSSTFMLQIPTSPVNTPHSQHLIS 240

Query: 241 NISSPVSETNSKSIGKRKKMKSEMPPKSVTSLEQMNSILLRHRRSSRAMRPRRSSLRDLE 300
           NISSPVSETNSKSIGKRKKMKSEMPPKSVTSLEQMNSILLRHRRSSRAMRPRRSSLRDLE
Sbjct: 241 NISSPVSETNSKSIGKRKKMKSEMPPKSVTSLEQMNSILLRHRRSSRAMRPRRSSLRDLE 300

Query: 301 IFSAKSQIEHASAINDPELYAPLFRNVSMFKRSYELMERTLKIYVYRDGKKPIFHQPIMK 360
           IFSAKSQIE ASAINDPELYAPLFRNVSMFKRSYELMERTLKIYVYRDGKKPIFHQPIMK
Sbjct: 301 IFSAKSQIEQASAINDPELYAPLFRNVSMFKRSYELMERTLKIYVYRDGKKPIFHQPIMK 360

Query: 361 GLYASEGWFMKLMERNKHFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRTNLRQFLKE 420
           GLYASEGWFMKLMERNKHFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRTNLRQFLKE
Sbjct: 361 GLYASEGWFMKLMERNKHFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRTNLRQFLKE 420

Query: 421 YSENIAAKYPYWNRTGGADHFLVACHDWAPYETRHHMEQCIKALCNADVTVGFKIGRDVS 480
           YSE+IAAKYPYWNRTGGADHFLVACHDWAPYETRHHMEQCIKALCNADVTVGFKIGRDVS
Sbjct: 421 YSESIAAKYPYWNRTGGADHFLVACHDWAPYETRHHMEQCIKALCNADVTVGFKIGRDVS 480

Query: 481 LPETYVRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPDMKIFGPMP 540
           LPETYVRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPDMKIFGPMP
Sbjct: 481 LPETYVRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPDMKIFGPMP 540

Query: 541 PGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLDWE 600
           PGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLDWE
Sbjct: 541 PGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLDWE 600

Query: 601 AFSVIVAEKDIPHLQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHMTLHSIW 660
           AFSVIVAEKDIPHLQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHMTLHSIW
Sbjct: 601 AFSVIVAEKDIPHLQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHMTLHSIW 660

Query: 661 YNRVFQIKLR 670
           YNRVFQIKLR
Sbjct: 661 YNRVFQIKLR 670

BLAST of Cp4.1LG18g04040 vs. ExPASy TrEMBL
Match: A0A6J1JEU9 (probable glycosyltransferase At5g03795 OS=Cucurbita maxima OX=3661 GN=LOC111485134 PE=3 SV=1)

HSP 1 Score: 1308 bits (3385), Expect = 0.0
Identity = 648/675 (96.00%), Positives = 658/675 (97.48%), Query Frame = 0

Query: 1   MEYLLPLCKLCHIETRRWFLVMGVVAFTYVLFQSLLLPYGDALRSLLPEDGIQKFDQYSI 60
           MEYLLPLCKLCHIETRRWFLVMGVVAFTYVLFQSLLLPYGDALRSLLPEDGI+KFDQYSI
Sbjct: 1   MEYLLPLCKLCHIETRRWFLVMGVVAFTYVLFQSLLLPYGDALRSLLPEDGIKKFDQYSI 60

Query: 61  PMGHTSAKSTTVRNPLTVLDLANTSAPVGKTDNYVLEKGSQRDSTLNAKGKYVKDEESPR 120
            MGHTSAKSTTVRNPLTVLDLANTSAP+GKTDNY+LEKGSQRDSTLNA+GKYVKDEESPR
Sbjct: 61  QMGHTSAKSTTVRNPLTVLDLANTSAPIGKTDNYILEKGSQRDSTLNARGKYVKDEESPR 120

Query: 121 DGYELSLVRNHDIGFESGKMVDTNGNLESDGTKNGGNNSIVHMDGQPSFEFPLAL----- 180
           DGY+LSL RNHDIGFESGKMVDTNGNLESDGTKN GNNSI+HMDG+ SFEFPL       
Sbjct: 121 DGYKLSLNRNHDIGFESGKMVDTNGNLESDGTKNRGNNSILHMDGEASFEFPLGQQFVKQ 180

Query: 181 SDKVTSENELEEIGEMDLDFGELEEFKNSSLRKPGDTDVTFNSSTFMLQIPTSPVNTPHS 240
           SD V SENELEE GEMDLDFGELEEFKNS LRKPGDTDVTFNSSTFMLQIPTSPVNTPHS
Sbjct: 181 SDTVASENELEEFGEMDLDFGELEEFKNSLLRKPGDTDVTFNSSTFMLQIPTSPVNTPHS 240

Query: 241 QHLISNISSPVSETNSKSIGKRKKMKSEMPPKSVTSLEQMNSILLRHRRSSRAMRPRRSS 300
           QHLISNISSPVSETNSKSIGKRKKMK+EMPPKSVTSLE+MNSILLRHRRSSRAMRPRRSS
Sbjct: 241 QHLISNISSPVSETNSKSIGKRKKMKNEMPPKSVTSLEEMNSILLRHRRSSRAMRPRRSS 300

Query: 301 LRDLEIFSAKSQIEHASAINDPELYAPLFRNVSMFKRSYELMERTLKIYVYRDGKKPIFH 360
           LRDLEIFSAKSQIE ASAINDPELY PLFRNVSMFKRSYELMERTLKIYVYRDGKKPIFH
Sbjct: 301 LRDLEIFSAKSQIEQASAINDPELYTPLFRNVSMFKRSYELMERTLKIYVYRDGKKPIFH 360

Query: 361 QPIMKGLYASEGWFMKLMERNKHFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRTNLR 420
           QPIMKGLYASEGWFMKLMERNKHFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRTNLR
Sbjct: 361 QPIMKGLYASEGWFMKLMERNKHFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRTNLR 420

Query: 421 QFLKEYSENIAAKYPYWNRTGGADHFLVACHDWAPYETRHHMEQCIKALCNADVTVGFKI 480
           QFLKEYSE+IAAKYPYWNRTGGADHFLVACHDWAPYETRHHMEQCIKALCNADVTVGFKI
Sbjct: 421 QFLKEYSESIAAKYPYWNRTGGADHFLVACHDWAPYETRHHMEQCIKALCNADVTVGFKI 480

Query: 481 GRDVSLPETYVRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPDMKI 540
           GRDVSLPETYVRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPDMKI
Sbjct: 481 GRDVSLPETYVRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPDMKI 540

Query: 541 FGPMPPGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFE 600
           FGPMPPGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFE
Sbjct: 541 FGPMPPGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFE 600

Query: 601 VLDWEAFSVIVAEKDIPHLQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHMT 660
           VLDWEAFSVIVAEKDIPHLQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHMT
Sbjct: 601 VLDWEAFSVIVAEKDIPHLQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHMT 660

Query: 661 LHSIWYNRVFQIKLR 670
           LHSIWYNRVFQIKLR
Sbjct: 661 LHSIWYNRVFQIKLR 675

BLAST of Cp4.1LG18g04040 vs. ExPASy TrEMBL
Match: A0A1S3CJA3 (probable glycosyltransferase At5g03795 OS=Cucumis melo OX=3656 GN=LOC103501046 PE=3 SV=1)

HSP 1 Score: 1116 bits (2887), Expect = 0.0
Identity = 564/678 (83.19%), Positives = 596/678 (87.91%), Query Frame = 0

Query: 1   MEYLLPLCKLCHIETRRWFLVMGVVAFTYVLFQSLLLPYGDALRSLLPEDGIQKFDQYSI 60
           M+YLLPLC LCH++TRR   ++GVVAFTY++FQ LLLPYGDALRSLLPED I ++D YSI
Sbjct: 1   MDYLLPLCNLCHVQTRRCLFLVGVVAFTYLIFQFLLLPYGDALRSLLPEDAIHRYDHYSI 60

Query: 61  PMGHTSAKSTTVRNPLTVLDLANTSA-PVGKTDNYVLEKGSQRDSTLNAKGKYVKDEESP 120
             G TS K TTVRNPLTVLDLAN S  P+G      +EKG QRD+ LNAKGKYVK EE P
Sbjct: 61  QFGPTSPKLTTVRNPLTVLDLANVSTTPIGN-----IEKGFQRDNLLNAKGKYVKGEEIP 120

Query: 121 RDGYELSLVRNHDIGFESGKMVDTNGNLESDGTKNGGNNSIVHMDGQPSFEFPLAL---- 180
           R+          DIGFESG  VD NGN ESDGTKN  N+SI+H+ G+ SF FPL      
Sbjct: 121 REV---------DIGFESGNNVDANGNSESDGTKNRANDSILHVVGKTSFGFPLKQQVVK 180

Query: 181 ---SDKVTSENELEEIGEMDLDFGELEEFKNSSLRKPGDTDVTFNSSTFMLQIPTSPVNT 240
              ++ +TSENELE+ G+MDLDFGELEEFKNSSL+K  DTD+ FNSSTFMLQ  TS VNT
Sbjct: 181 PSDTNTITSENELEDFGQMDLDFGELEEFKNSSLQKLEDTDMAFNSSTFMLQFSTSTVNT 240

Query: 241 PHSQHLISNISSPVSETNSKSIGKRKKMKSEMPPKSVTSLEQMNSILLRHRRSSRAMRPR 300
            H  HL SN+ S  SETNS S+GKRKKMKSE+PPK+VT+LE+MN IL RH RSSRAMRPR
Sbjct: 241 THPHHLTSNLRSSASETNSTSVGKRKKMKSELPPKTVTTLEEMNRILFRHCRSSRAMRPR 300

Query: 301 RSSLRDLEIFSAKSQIEHASAINDPELYAPLFRNVSMFKRSYELMERTLKIYVYRDGKKP 360
           RSSLRD EIFSAKS I  ASAINDPELYAPLFRNVSMFKRSYELME TLKIYVYRDGKKP
Sbjct: 301 RSSLRDQEIFSAKSLIMQASAINDPELYAPLFRNVSMFKRSYELMEHTLKIYVYRDGKKP 360

Query: 361 IFHQPIMKGLYASEGWFMKLMERNKHFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRT 420
           IFHQPI+KGLYASEGWFMKLME NK FVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRT
Sbjct: 361 IFHQPILKGLYASEGWFMKLMEGNKRFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRT 420

Query: 421 NLRQFLKEYSENIAAKYPYWNRTGGADHFLVACHDWAPYETRHHMEQCIKALCNADVTVG 480
           NLRQFLKEYSENIAAKYPYWNRTGGADHFLV CHDWAPYETRHHME CIKALCNADVTVG
Sbjct: 421 NLRQFLKEYSENIAAKYPYWNRTGGADHFLVGCHDWAPYETRHHMEHCIKALCNADVTVG 480

Query: 481 FKIGRDVSLPETYVRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPD 540
           FKIGRDVSLPETYVRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPD
Sbjct: 481 FKIGRDVSLPETYVRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPD 540

Query: 541 MKIFGPMPPGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPP 600
           MKIFGPMPPGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPP
Sbjct: 541 MKIFGPMPPGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPP 600

Query: 601 FFEVLDWEAFSVIVAEKDIPHLQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLF 660
           FFEVLDWEAFSVIVAEKDIP+LQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLF
Sbjct: 601 FFEVLDWEAFSVIVAEKDIPNLQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLF 660

Query: 661 HMTLHSIWYNRVFQIKLR 670
           HMTLHSIWYNRVFQIKLR
Sbjct: 661 HMTLHSIWYNRVFQIKLR 664

BLAST of Cp4.1LG18g04040 vs. ExPASy TrEMBL
Match: A0A0A0LU64 (Exostosin domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G364960 PE=3 SV=1)

HSP 1 Score: 1106 bits (2861), Expect = 0.0
Identity = 561/678 (82.74%), Positives = 595/678 (87.76%), Query Frame = 0

Query: 1   MEYLLPLCKLCHIETRRWFLVMGVVAFTYVLFQSLLLPYGDALRSLLPEDGIQKFDQYSI 60
           M YLL  C LCHI+TRR  L++GVVAFTY++FQSLLLPYGDALRSLLPED I K+D Y+I
Sbjct: 1   MGYLLLPCNLCHIQTRRCLLLVGVVAFTYLIFQSLLLPYGDALRSLLPEDAIHKYDHYNI 60

Query: 61  PMGHTSAKSTTVRNPLTVLDLANTSA-PVGKTDNYVLEKGSQRDSTLNAKGKYVKDEESP 120
             G  S K  TVRNPLTVLDLAN S  P+GK D     KG QRD+ LN+KG+YVK+EE P
Sbjct: 61  QFGPNSPKLATVRNPLTVLDLANVSTTPIGKID-----KGFQRDNLLNSKGEYVKEEEIP 120

Query: 121 RDGYELSLVRNHDIGFESGKMVDTNGNLESDGTKNGGNNSIVHMDGQPSFEFPLAL---- 180
           R+          D G ESG  VD NGNLESDGTKN  N+SI+ +DG+ SF FPL      
Sbjct: 121 REV---------DFGSESGNNVDANGNLESDGTKNRANDSILPVDGETSFGFPLKQQVVK 180

Query: 181 ---SDKVTSENELEEIGEMDLDFGELEEFKNSSLRKPGDTDVTFNSSTFMLQIPTSPVNT 240
              ++ +T ENELE+ G+MDLDFGELEEFKNSSL+K  DTD+ FNSSTFMLQ  TS VNT
Sbjct: 181 PSDTNTITLENELEDFGQMDLDFGELEEFKNSSLQKLEDTDMPFNSSTFMLQTSTSTVNT 240

Query: 241 PHSQHLISNISSPVSETNSKSIGKRKKMKSEMPPKSVTSLEQMNSILLRHRRSSRAMRPR 300
            HS  L+SN+SS  SETNS SIGKRKKMKSE+PPK+VT+LE+MN IL RHRRSSRAMRPR
Sbjct: 241 IHSHQLLSNLSSSASETNSTSIGKRKKMKSELPPKTVTTLEEMNRILFRHRRSSRAMRPR 300

Query: 301 RSSLRDLEIFSAKSQIEHASAINDPELYAPLFRNVSMFKRSYELMERTLKIYVYRDGKKP 360
           RSSLRD EIFSAKS I  ASA+NDPELYAPLFRNVSMFKRSYELMERTLKIYVYRDGKKP
Sbjct: 301 RSSLRDQEIFSAKSLIVQASAVNDPELYAPLFRNVSMFKRSYELMERTLKIYVYRDGKKP 360

Query: 361 IFHQPIMKGLYASEGWFMKLMERNKHFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRT 420
           IFHQPI+KGLYASEGWFMKLME NK FVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRT
Sbjct: 361 IFHQPILKGLYASEGWFMKLMEGNKRFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRT 420

Query: 421 NLRQFLKEYSENIAAKYPYWNRTGGADHFLVACHDWAPYETRHHMEQCIKALCNADVTVG 480
           NLRQFLKEY+ENIAAKYPYWNRTGGADHFL  CHDWAPYETRHHME CIKALCNADVTVG
Sbjct: 421 NLRQFLKEYAENIAAKYPYWNRTGGADHFLAGCHDWAPYETRHHMEHCIKALCNADVTVG 480

Query: 481 FKIGRDVSLPETYVRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPD 540
           FKIGRDVSLPETYVRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPD
Sbjct: 481 FKIGRDVSLPETYVRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPD 540

Query: 541 MKIFGPMPPGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPP 600
           MKIFGPMPPGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPP
Sbjct: 541 MKIFGPMPPGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPP 600

Query: 601 FFEVLDWEAFSVIVAEKDIPHLQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLF 660
           FFEVLDWEAFSVIVAEKDIP+LQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLF
Sbjct: 601 FFEVLDWEAFSVIVAEKDIPNLQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLF 660

Query: 661 HMTLHSIWYNRVFQIKLR 670
           HMTLHSIWYNRVFQIKLR
Sbjct: 661 HMTLHSIWYNRVFQIKLR 664

BLAST of Cp4.1LG18g04040 vs. ExPASy TrEMBL
Match: A0A5A7V6N9 (Putative glycosyltransferase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold79G001640 PE=3 SV=1)

HSP 1 Score: 1037 bits (2681), Expect = 0.0
Identity = 527/641 (82.22%), Positives = 559/641 (87.21%), Query Frame = 0

Query: 1   MEYLLPLCKLCHIETRRWFLVMGVVAFTYVLFQSLLLPYGDALRSLLPEDGIQKFDQYSI 60
           M+YLLPLC LCH++TRR   ++GVVAFTY++FQ LLLPYGDALRSLLPED I ++D YSI
Sbjct: 1   MDYLLPLCNLCHVQTRRCLFLVGVVAFTYLIFQFLLLPYGDALRSLLPEDAIHRYDHYSI 60

Query: 61  PMGHTSAKSTTVRNPLTVLDLANTSA-PVGKTDNYVLEKGSQRDSTLNAKGKYVKDEESP 120
             G TS K TTVRNPLTVLDLAN S  P+G      +EKG QRD+ LNAKGKYVK EE P
Sbjct: 61  QFGPTSPKLTTVRNPLTVLDLANVSTTPIGN-----IEKGFQRDNLLNAKGKYVKGEEIP 120

Query: 121 RDGYELSLVRNHDIGFESGKMVDTNGNLESDGTKNGGNNSIVHMDGQPSFEFPLAL---- 180
           R+          DIGFESG  VD NGN ESDGTKN  N+SI+H+ G+ SF FPL      
Sbjct: 121 REV---------DIGFESGNNVDANGNSESDGTKNRANDSILHVVGKTSFGFPLKQQVVK 180

Query: 181 ---SDKVTSENELEEIGEMDLDFGELEEFKNSSLRKPGDTDVTFNSSTFMLQIPTSPVNT 240
              ++ +TSENELE+ G+MDLDFGELEEFKNSSL+K  DTD+ FNSSTFMLQ  TS VNT
Sbjct: 181 PSDTNTITSENELEDFGQMDLDFGELEEFKNSSLQKLEDTDMAFNSSTFMLQFSTSTVNT 240

Query: 241 PHSQHLISNISSPVSETNSKSIGKRKKMKSEMPPKSVTSLEQMNSILLRHRRSSRAMRPR 300
            H  HL SN+ S  SETNS S+GKRKKMKSE+PPK+VT+LE+MN IL RH RSSRAMRPR
Sbjct: 241 THPHHLTSNLRSSASETNSTSVGKRKKMKSELPPKTVTTLEEMNRILFRHCRSSRAMRPR 300

Query: 301 RSSLRDLEIFSAKSQIEHASAINDPELYAPLFRNVSMFKRSYELMERTLKIYVYRDGKKP 360
           RSSLRD EIFSAKS I  ASAINDPELYAPLFRNVSMFKRSYELME TLKIYVYRDGKKP
Sbjct: 301 RSSLRDQEIFSAKSLIMQASAINDPELYAPLFRNVSMFKRSYELMEHTLKIYVYRDGKKP 360

Query: 361 IFHQPIMKGLYASEGWFMKLMERNKHFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRT 420
           IFHQPI+KGLYASEGWFMKLME NK FVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRT
Sbjct: 361 IFHQPILKGLYASEGWFMKLMEGNKRFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRT 420

Query: 421 NLRQFLKEYSENIAAKYPYWNRTGGADHFLVACHDWAPYETRHHMEQCIKALCNADVTVG 480
           NLRQFLKEYSENIAAKYPYWNRTGGADHFLV CHDWAPYETRHHME CIKALCNADVTVG
Sbjct: 421 NLRQFLKEYSENIAAKYPYWNRTGGADHFLVGCHDWAPYETRHHMEHCIKALCNADVTVG 480

Query: 481 FKIGRDVSLPETYVRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPD 540
           FKIGRDVSLPETYVRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPD
Sbjct: 481 FKIGRDVSLPETYVRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPD 540

Query: 541 MKIFGPMPPGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPP 600
           MKIFGPMPPGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPP
Sbjct: 541 MKIFGPMPPGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPP 600

Query: 601 FFEVLDWEAFSVIVAEKDIPHLQDILLSIPKDRYLEMQLRV 633
           FFEVLDWEAFSVIVAEKDIP+LQDILLSIPKDRYLEMQLRV
Sbjct: 601 FFEVLDWEAFSVIVAEKDIPNLQDILLSIPKDRYLEMQLRV 627

BLAST of Cp4.1LG18g04040 vs. TAIR 10
Match: AT5G19670.1 (Exostosin family protein)

HSP 1 Score: 719.9 bits (1857), Expect = 1.9e-207
Identity = 395/696 (56.75%), Positives = 475/696 (68.25%), Query Frame = 0

Query: 1   MEYLLPLCKLCHIETRRWFLVMGVVAFTYVLFQSLLLPYGDALRSLLPEDGIQKFDQYSI 60
           ME    L K      R+W +++G+VA T++L   LLL YGDALR LLP DG     +  +
Sbjct: 1   MEVRSELRKQSRSGKRKWAILVGIVALTHIL---LLLSYGDALRYLLP-DG----RRLKL 60

Query: 61  PMGHTSAKSTTVRNPLTVLDLANTSAPVGKTDNYVLEKGSQRDSTLNAKGKYVKDEESPR 120
           P  + +   T  RN L V    N S     +  +VLEK                      
Sbjct: 61  PNENNALLMTPSRNTLAV----NVSEDSAVSGIHVLEK---------------------- 120

Query: 121 DGYELSLVRNHDIGFESGKMVDTNGNLESDGTKNGGNNSIVHMDGQPSFEFPLALSDKVT 180
                                  NG +   G +N                          
Sbjct: 121 -----------------------NGYVSGFGLRN-------------------------- 180

Query: 181 SENELEEIGEMDLDFGELEEFKNSSLRK--PGDTDVTFNSSTFMLQIPTSPVNTPHSQHL 240
            E+E +E    ++DF   E+ K+S + K   G +D  F S T ++Q     V+T ++ + 
Sbjct: 181 -ESEDDEGFVGNVDFESFEDVKDSIIIKEVAGSSDNLFPSETTVMQ--KESVSTSNNGYQ 240

Query: 241 ISN-------------------ISSPVSETN----SKSIGKRKKMKSEMPPKSVTSLEQM 300
           + N                   I+SP S  +    SK + K+KKM+ ++PPKSVT++++M
Sbjct: 241 VQNVTVQSQKNVKSSILSGGSSIASPASGNSSLLVSKKVSKKKKMRCDLPPKSVTTIDEM 300

Query: 301 NSILLRHRRSSRAMRPRRSSLRDLEIFSAKSQIEHASAIN-DPELYAPLFRNVSMFKRSY 360
           N IL RHRR+SRAMRPR SS RD EI +A+ +IE+A     + ELY P+FRNVS+FKRSY
Sbjct: 301 NRILARHRRTSRAMRPRWSSRRDEEILTARKEIENAPVAKLERELYPPIFRNVSLFKRSY 360

Query: 361 ELMERTLKIYVYRDGKKPIFHQPIMKGLYASEGWFMKLMERNKHFVVKDPRKAHLFYMPF 420
           ELMER LK+YVY++G +PIFH PI+KGLYASEGWFMKLME NK + VKDPRKAHL+YMPF
Sbjct: 361 ELMERILKVYVYKEGNRPIFHTPILKGLYASEGWFMKLMEGNKQYTVKDPRKAHLYYMPF 420

Query: 421 SSRMLEYTLYVRNSHNRTNLRQFLKEYSENIAAKYPYWNRTGGADHFLVACHDWAPYETR 480
           S+RMLEYTLYVRNSHNRTNLRQFLKEY+E+I++KYP++NRT GADHFLVACHDWAPYETR
Sbjct: 421 SARMLEYTLYVRNSHNRTNLRQFLKEYTEHISSKYPFFNRTDGADHFLVACHDWAPYETR 480

Query: 481 HHMEQCIKALCNADVTVGFKIGRDVSLPETYVRSARNPLRDLGGKPASQRHILAFYAGNM 540
           HHME CIKALCNADVT GFKIGRD+SLPETYVR+A+NPLRDLGGKP SQR  LAFYAG+M
Sbjct: 481 HHMEHCIKALCNADVTAGFKIGRDISLPETYVRAAKNPLRDLGGKPPSQRRTLAFYAGSM 540

Query: 541 HGYVRPILLKYWKDKNPDMKIFGPMPPGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVE 600
           HGY+R ILL++WKDK+PDMKIFG MP GVASKMNYI+ MKSSKYCICPKGYEVNSPRVVE
Sbjct: 541 HGYLRQILLQHWKDKDPDMKIFGRMPFGVASKMNYIEQMKSSKYCICPKGYEVNSPRVVE 600

Query: 601 AIFYECVPVIISDNFVPPFFEVLDWEAFSVIVAEKDIPHLQDILLSIPKDRYLEMQLRVR 660
           +IFYECVPVIISDNFVPPFFEVLDW AFSVIVAEKDIP L+DILLSIP+D+Y++MQ+ VR
Sbjct: 601 SIFYECVPVIISDNFVPPFFEVLDWSAFSVIVAEKDIPRLKDILLSIPEDKYVKMQMAVR 610

Query: 661 KVQKHFLWHAKPLKYDLFHMTLHSIWYNRVFQIKLR 671
           K Q+HFLWHAKP KYDLFHM LHSIWYNRVFQ K R
Sbjct: 661 KAQRHFLWHAKPEKYDLFHMVLHSIWYNRVFQAKRR 610

BLAST of Cp4.1LG18g04040 vs. TAIR 10
Match: AT5G25820.1 (Exostosin family protein)

HSP 1 Score: 545.0 bits (1403), Expect = 8.2e-155
Identity = 264/417 (63.31%), Positives = 331/417 (79.38%), Query Frame = 0

Query: 259 KMKSEMPPKSVTSLEQMNSILLRHRRSSR--AMRPRRSSLRDLEIFSAKSQIEHASAIN- 318
           K  ++MP   V S+ +M+  L ++R S    A +P+  +  DLE+  AK  IE+A   + 
Sbjct: 239 KENAKMPGFGVMSISEMSKQLRQNRISHNRLAKKPKWVTKPDLELLQAKYDIENAPIDDK 298

Query: 319 DPELYAPLFRNVSMFKRSYELMERTLKIYVYRDGKKPIFHQPIMKGLYASEGWFMKLMER 378
           DP LYAPL+RNVSMFKRSYELME+ LK+Y Y++G KPI H PI++G+YASEGWFM ++E 
Sbjct: 299 DPFLYAPLYRNVSMFKRSYELMEKILKVYAYKEGNKPIMHSPILRGIYASEGWFMNIIES 358

Query: 379 NKH-FVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRTNLRQFLKEYSENIAAKYPYWNR 438
           N + FV KDP KAHLFY+PFSSRMLE TLYV++SH+  NL ++LK+Y + I+AKYP+WNR
Sbjct: 359 NNNKFVTKDPAKAHLFYLPFSSRMLEVTLYVQDSHSHRNLIKYLKDYIDFISAKYPFWNR 418

Query: 439 TGGADHFLVACHDWAPYETRHHMEQCIKALCNADVTVGFKIGRDVSLPETYVRSARNPLR 498
           T GADHFL ACHDWAP ETR HM + I+ALCN+DV  GF  G+D SLPET+VR  + PL 
Sbjct: 419 TSGADHFLAACHDWAPSETRKHMAKSIRALCNSDVKEGFVFGKDTSLPETFVRDPKKPLS 478

Query: 499 DLGGKPASQRHILAFYAGNM-HGYVRPILLKYW-KDKNPDMKIFGPMPPGVASKMNYIQH 558
           ++GGK A+QR ILAF+AG   HGY+RPILL YW  +K+PD+KIFG +P    +K NY+Q 
Sbjct: 479 NMGGKSANQRPILAFFAGKPDHGYLRPILLSYWGNNKDPDLKIFGKLPRTKGNK-NYLQF 538

Query: 559 MKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLDWEAFSVIVAEKDIP 618
           MK+SKYCIC KG+EVNSPRVVEAIFY+CVPVIISDNFVPPFFEVL+WE+F++ + EKDIP
Sbjct: 539 MKTSKYCICAKGFEVNSPRVVEAIFYDCVPVIISDNFVPPFFEVLNWESFAIFIPEKDIP 598

Query: 619 HLQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHMTLHSIWYNRVFQIKL 670
           +L+ IL+SIP+ RY  MQ+RV+KVQKHFLWHAKP KYD+FHM LHSIWYNRVFQI +
Sbjct: 599 NLKKILMSIPESRYRSMQMRVKKVQKHFLWHAKPEKYDMFHMILHSIWYNRVFQISV 654

BLAST of Cp4.1LG18g04040 vs. TAIR 10
Match: AT4G32790.1 (Exostosin family protein)

HSP 1 Score: 543.9 bits (1400), Expect = 1.8e-154
Identity = 258/423 (60.99%), Positives = 337/423 (79.67%), Query Frame = 0

Query: 247 SETNSKSIGKRKKMKSEMPPKSVTSLEQMNSILLRHRRSSRAMRPRRSSLRDLEIFSAKS 306
           S+ +  ++    K    +    V S+ +M ++L + R S  +++ +RSS  D E+  A++
Sbjct: 172 SDPSVDNLSSEVKKFMNVSNSGVVSITEMMNLLHQSRTSHVSLKVKRSSTIDHELLYART 231

Query: 307 QIEHASAI-NDPELYAPLFRNVSMFKRSYELMERTLKIYVYRDGKKPIFHQPIMKGLYAS 366
           QIE+   I NDP L+ PL+ N+SMFKRSYELME+ LK+YVYR+GK+P+ H+P++KG+YAS
Sbjct: 232 QIENPPLIENDPLLHTPLYWNLSMFKRSYELMEKKLKVYVYREGKRPVLHKPVLKGIYAS 291

Query: 367 EGWFMKLMERNKHFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRTNLRQFLKEYSENI 426
           EGWFMK ++ ++ FV KDPRKAHLFY+PFSS+MLE TLYV  SH+  NL QFLK Y + I
Sbjct: 292 EGWFMKQLKSSRTFVTKDPRKAHLFYLPFSSKMLEETLYVPGSHSDKNLIQFLKNYLDMI 351

Query: 427 AAKYPYWNRTGGADHFLVACHDWAPYETRHHMEQCIKALCNADVTVGFKIGRDVSLPETY 486
           ++KY +WN+TGG+DHFLVACHDWAP ETR +M +CI+ALCN+DV+ GF  G+DV+LPET 
Sbjct: 352 SSKYSFWNKTGGSDHFLVACHDWAPSETRQYMAKCIRALCNSDVSEGFVFGKDVALPETT 411

Query: 487 VRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYW-KDKNPDMKIFGPMPPGVA 546
           +   R PLR LGGKP SQR ILAF+AG MHGY+RP+LL+ W  +++PDMKIF  +P    
Sbjct: 412 ILVPRRPLRALGGKPVSQRQILAFFAGGMHGYLRPLLLQNWGGNRDPDMKIFSEIPKS-K 471

Query: 547 SKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLDWEAFSV 606
            K +Y+++MKSSKYCICPKG+EVNSPRVVEA+FYECVPVIISDNFVPPFFEVL+WE+F+V
Sbjct: 472 GKKSYMEYMKSSKYCICPKGHEVNSPRVVEALFYECVPVIISDNFVPPFFEVLNWESFAV 531

Query: 607 IVAEKDIPHLQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHMTLHSIWYNRV 666
            V EKDIP L++IL+SI ++RY EMQ+RV+ VQKHFLWH+KP ++D+FHM LHSIWYNRV
Sbjct: 532 FVLEKDIPDLKNILVSITEERYREMQMRVKMVQKHFLWHSKPERFDIFHMILHSIWYNRV 591

Query: 667 FQI 668
           FQI
Sbjct: 592 FQI 593

BLAST of Cp4.1LG18g04040 vs. TAIR 10
Match: AT4G16745.1 (Exostosin family protein)

HSP 1 Score: 492.7 bits (1267), Expect = 4.8e-139
Identity = 249/450 (55.33%), Positives = 319/450 (70.89%), Query Frame = 0

Query: 227 TSPVNTPHSQHLISNISSPVSETNSKSIGKRKKMKSEM----PPKSVTSLEQMNSILLRH 286
           T    TP  Q   S  S  V     +   KRKK K ++    PP +         +L   
Sbjct: 93  TLRTRTPIVQLNASEASEAVLSRKRRKRKKRKKTKDDLILTDPPPA------PRHVLSSS 152

Query: 287 RRSSRAMRPRRSSLRDLEIFSAKSQIEHA-SAINDPELYAPLFRNVSMFKRSYELMERTL 346
            R + ++ P+++      +  AK +I+ A   IND +L+APLFRN+S+FKRSYELME  L
Sbjct: 153 ERRALSLPPKKA------LTYAKLEIQRAPEVINDTDLFAPLFRNLSVFKRSYELMELIL 212

Query: 347 KIYVYRDGKKPIFHQPIMKGLYASEGWFMKLMERNKHFVVKDPRKAHLFYMPFSSRMLEY 406
           K+Y+Y DG KPIFH+P + G+YASEGWFMKLME NK FV K+P +AHLFYMP+S + L+ 
Sbjct: 213 KVYIYPDGDKPIFHEPHLNGIYASEGWFMKLMESNKQFVTKNPERAHLFYMPYSVKQLQK 272

Query: 407 TLYVRNSHNRTNLRQFLKEYSENIAAKYPYWNRTGGADHFLVACHDWAPYETRHHME--- 466
           +++V  SHN   L  FL++Y   ++ KYP+WNRT G+DHFLVACHDW PY    H E   
Sbjct: 273 SIFVPGSHNIKPLSIFLRDYVNMLSIKYPFWNRTHGSDHFLVACHDWGPYTVNEHPELKR 332

Query: 467 QCIKALCNADVTVG-FKIGRDVSLPETYVRSARNPLRDLG-GKPASQRHILAFYAGNMHG 526
             IKALCNAD++ G F  G+DVSLPET +R+A  PLR++G G   SQR ILAF+AGN+HG
Sbjct: 333 NAIKALCNADLSDGIFVPGKDVSLPETSIRNAGRPLRNIGNGNRVSQRPILAFFAGNLHG 392

Query: 527 YVRPILLKYWKDKNPDMKIFGPMPPGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAI 586
            VRP LLK+W++K+ DMKI+GP+P  VA KM Y+QHMKSSKYC+CP GYEVNSPR+VEAI
Sbjct: 393 RVRPKLLKHWRNKDEDMKIYGPLPHNVARKMTYVQHMKSSKYCLCPMGYEVNSPRIVEAI 452

Query: 587 FYECVPVIISDNFVPPFFEVLDWEAFSVIVAEKDIPHLQDILLSIPKDRYLEMQLRVRKV 646
           +YECVPV+I+DNF+ PF +VLDW AFSV+V EK+IP L++ILL IP  RYL+MQ  V+ V
Sbjct: 453 YYECVPVVIADNFMLPFSDVLDWSAFSVVVPEKEIPRLKEILLEIPMRRYLKMQSNVKMV 512

Query: 647 QKHFLWHAKPLKYDLFHMTLHSIWYNRVFQ 667
           Q+HFLW  KP KYD+FHM LHSIW+N + Q
Sbjct: 513 QRHFLWSPKPRKYDVFHMILHSIWFNLLNQ 530

BLAST of Cp4.1LG18g04040 vs. TAIR 10
Match: AT5G11610.1 (Exostosin family protein)

HSP 1 Score: 488.0 bits (1255), Expect = 1.2e-137
Identity = 259/497 (52.11%), Positives = 337/497 (67.81%), Query Frame = 0

Query: 204 SSLRKPGDTDVTFNSSTFMLQIPTSPVNTPH---------SQHLISNISSPV-------- 263
           S  RK  DT  +  + TF+     S    P+         S+H   N S  +        
Sbjct: 53  SEFRKSNDTTKSAENETFLASQEASTGLKPYNRTTEVLKSSEHKFLNDSHKIEASGQRRR 112

Query: 264 -SETNSK---------SIGKRKKMKS-EMPPKSVTSLEQMNSILL-RHRRSSRAMRPRRS 323
            +ET S           I K+   +S   PP  V S++QMN+++L RH     ++ P   
Sbjct: 113 SNETASSLHPLQPKIPQIRKKYPHRSITKPPSIVISIKQMNNMILKRHNDPKNSLAPLWG 172

Query: 324 SLRDLEIFSAKSQIEHASAI-NDPELYAPLFRNVSMFKRSYELMERTLKIYVYRDGKKPI 383
           S  D E+ +A+ +I+ A+ +  D  LYAPL+ N+S+FKRSYELME+TLK+YVY +G +PI
Sbjct: 173 SKVDQELKTARDKIKKAALVKKDDTLYAPLYHNISIFKRSYELMEQTLKVYVYSEGDRPI 232

Query: 384 FHQP--IMKGLYASEGWFMKLMERNKHFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNR 443
           FHQP  IM+G+YASEGWFMKLME +  F+ KDP KAHLFY+PFSSR+L+  LYV +SH+R
Sbjct: 233 FHQPEAIMEGIYASEGWFMKLMESSHRFLTKDPTKAHLFYIPFSSRILQQKLYVHDSHSR 292

Query: 444 TNLRQFLKEYSENIAAKYPYWNRTGGADHFLVACHDWAPYETRHHMEQCIKALCNADVTV 503
            NL ++L  Y + IA+ YP WNRT G+DHF  ACHDWAP ETR     CI+ALCNADV +
Sbjct: 293 NNLVKYLGNYIDLIASNYPSWNRTCGSDHFFTACHDWAPTETRGPYINCIRALCNADVGI 352

Query: 504 GFKIGRDVSLPETYVRSARNPLRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDK-N 563
            F +G+DVSLPET V S +NP   +GG   S+R ILAF+AG++HGYVRPILL  W  +  
Sbjct: 353 DFVVGKDVSLPETKVSSLQNPNGKIGGSRPSKRTILAFFAGSLHGYVRPILLNQWSSRPE 412

Query: 564 PDMKIFGPMPPGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFV 623
            DMKIF  +        +YI++MK S++C+C KGYEVNSPRVVE+I Y CVPVIISDNFV
Sbjct: 413 QDMKIFNRI-----DHKSYIRYMKRSRFCVCAKGYEVNSPRVVESILYGCVPVIISDNFV 472

Query: 624 PPFFEVLDWEAFSVIVAEKDIPHLQDILLSIPKDRYLEMQLRVRKVQKHFLWH-AKPLKY 667
           PPF E+L+WE+F+V V EK+IP+L+ IL+SIP  RY+EMQ RV KVQKHF+WH  +P++Y
Sbjct: 473 PPFLEILNWESFAVFVPEKEIPNLRKILISIPVRRYVEMQKRVLKVQKHFMWHDGEPVRY 532

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
Q9FFN2	1.1e-87	42.25	Probable glycosyltransferase At5g03795 OS=Arabidopsis thaliana OX=3702 GN=At5g03...
Q9SSE8	3.8e-80	39.53	Probable glycosyltransferase At3g07620 OS=Arabidopsis thaliana OX=3702 GN=At3g07...
Q3E7Q9	2.5e-79	39.02	Probable glycosyltransferase At5g25310 OS=Arabidopsis thaliana OX=3702 GN=At5g25...
Q9LFP3	9.0e-74	40.74	Probable glycosyltransferase At5g11130 OS=Arabidopsis thaliana OX=3702 GN=At5g11...
Q3E9A4	6.5e-72	40.35	Probable glycosyltransferase At5g20260 OS=Arabidopsis thaliana OX=3702 GN=At5g20...

Match Name	E-value	Identity	Description
XP_023516188.1	0.0	100.00	probable glycosyltransferase At5g03795 [Cucurbita pepo subsp. pepo]
XP_022961026.1	0.0	97.91	probable glycosyltransferase At5g03795 [Cucurbita moschata]
XP_022987636.1	0.0	96.00	probable glycosyltransferase At5g03795 [Cucurbita maxima]
KAG6589943.1	0.0	98.28	putative glycosyltransferase, partial [Cucurbita argyrosperma subsp. sororia]
XP_038879145.1	0.0	89.50	probable glycosyltransferase At5g03795 [Benincasa hispida] >XP_038879147.1 proba...

Match Name	E-value	Identity	Description
A0A6J1H980	0.0	97.91	probable glycosyltransferase At5g03795 OS=Cucurbita moschata OX=3662 GN=LOC11146...
A0A6J1JEU9	0.0	96.00	probable glycosyltransferase At5g03795 OS=Cucurbita maxima OX=3661 GN=LOC1114851...
A0A1S3CJA3	0.0	83.19	probable glycosyltransferase At5g03795 OS=Cucumis melo OX=3656 GN=LOC103501046 P...
A0A0A0LU64	0.0	82.74	Exostosin domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G364960 P...
A0A5A7V6N9	0.0	82.22	Putative glycosyltransferase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_sca...

Match Name	E-value	Identity	Description
AT5G19670.1	1.9e-207	56.75	Exostosin family protein
AT5G25820.1	8.2e-155	63.31	Exostosin family protein
AT4G32790.1	1.8e-154	60.99	Exostosin family protein
AT4G16745.1	4.8e-139	55.33	Exostosin family protein
AT5G11610.1	1.2e-137	52.11	Exostosin family protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR040911	Exostosin, GT47 domain	PFAM	PF03016	Exostosin	coord: 338..619; e-value: 2.9E-60; score: 204.0
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 139..165
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 244..268
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 142..164
None	No IPR available	PANTHER	PTHR11062:SF210	EXOSTOSIN FAMILY PROTEIN	coord: 221..668
IPR004263	Exostosin-like	PANTHER	PTHR11062	EXOSTOSIN HEPARAN SULFATE GLYCOSYLTRANSFERASE -RELATED	coord: 221..668
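
The `coord:` values in the InterPro table can be compared directly once parsed. The sketch below is a minimal illustration, assuming the coordinates are 1-based, inclusive residue positions on the protein (the usual InterProScan convention, though this page does not state it). It computes the length of the Pfam PF03016 match and checks that it lies inside the PANTHER PTHR11062 match. The helper names `parse_coord`, `span_length`, and `contains` are hypothetical.

```python
import re

# "coord: START..END" as it appears in the Alignment column.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Extract (start, end) from a 'coord: START..END' string."""
    m = COORD.search(text)
    if m is None:
        raise ValueError("no coordinate found in %r" % text)
    return int(m.group(1)), int(m.group(2))


def span_length(start, end):
    """Number of residues covered, assuming 1-based inclusive coordinates."""
    return end - start + 1


def contains(outer, inner):
    """True if the inner interval lies entirely within the outer interval."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


pfam = parse_coord("coord: 338..619")     # PF03016, Exostosin
panther = parse_coord("coord: 221..668")  # PTHR11062

print(span_length(*pfam))       # residues covered by the Pfam match
print(contains(panther, pfam))  # whether the Pfam match is nested in the PANTHER match
```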

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG18g04040.1	Cp4.1LG18g04040.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006486	protein glycosylation
cellular_component	GO:0000139	Golgi membrane
cellular_component	GO:0016021	integral component of membrane
molecular_function	GO:0016757	glycosyltransferase activity