Cp4.1LG14g01440 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG14g01440
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionGlycosyltransferase
LocationCp4.1LG14: 3606104 .. 3607847 (-)
RNA-Seq ExpressionCp4.1LG14g01440
SyntenyCp4.1LG14g01440
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTCATATTCATGAGTGAACGAAGTCTCAGCCATAGCCTGTTAGTTTCTGCACTTACTGTCTTCACCTTCTTCAGCTCGCCATGGAATCCCAACCTCATGTCGCTCTTGTTTCTAGCCCTGGAATGGGCCATCTCTTCCCCTCTCTGGAGCTCGCCACGCGCCTCTCCATGCGCCACCACCTCTCCGTCACCGTTTTCATCGTCCCTTCCCGCTCCTCCTCCGTCGAAAACAAAGTCATCGCCGCCGCTCAAGCTGCCGGTCTCTTCACTGTCATTGAACTTCCTCCCGCTGACATGTCCGACGTCACTGACTCCACTGTCGTCGGTCGCCTCTGCATCACCATGCGTCGCCACGTCCCGGCTCTCCGCTCCGCCGTCTCCGCTCTTACCACTCTCCCCTCCGTCCTCATCGCCGACATCTTCGCCACCGAGTCGTTCGCTGTCGCCGACGAGTTCCACATGGCCAAATACGTCTTCGTCGCCTCTAACGCATGGTTCTTAGCCTTAACCATCTGCCTCCCGGTTCTCGATAAGCAAATCACCGGTCAGTACGTGGACCAGAATGAACCGCTCCATATCCCTGGATGCGAACCGGTTCGACCTTGCGACGTTATCGACCCGCTTTTGGACCGGACCGAATCACAGTATTTCGAATACGTCAAAATCGGGATGGAGATACCATCGAGCGACGGCGTTTTGGTTAACACGTGGGATGATTTGCAAGGTCGCACGCTTGCATCTTTCAGAGATCGGAATTTGTTGGGCCGTATTATGAAGTCGCCGGTTTATTCTATTGGACCGATCGTACGGCAGACCGGTGGGAAGAAAGGCGGTTCGAGTGAGCTGTTCAATTGGCTGAGTAAGCAACCCGGTGAGTCAGTTATATACGTGTCGTTTGGGAGTGGTGGAACGCTGTCGTCTGAGCAAATGACGGAAGTGGCTCACGGATTGGAAATGAGCGGGCAGAGATTCGTTTGGGTGGTACGCGCCCCAAAGGTAATCATTTCTTTAATTTTTAATTTTATTTTTTAATTTAAAAATAAACTAATAATAATCCAACATTTAATTTTACAGGTAAGATCGGACGCGACGTTTTTCACGACCGGCGACGGGAGTGAGGACCAATCAGAGGCGAAGTTTCTGCCAGATGGGTTTTTGGAGCGGACGTCGGAGGTGGGGTTTGTGGTGTCGATGTGGGCGGATCAGACGGCGGTGTTGGGGAGTCCGGCGGTGGGGGGTTTTTTCACGCACGGCGGATGGAACTCAGCGTTGGAAGGGATTACGAACGGAGTTCCAATGGTTGTGTGGCCGTTATATGCGGAGCAGCGGATGAACGCCACGATGCTGGCGGAAGAGGTGCGGGTGGCCGTCCGACCAAAGGAGCTGCCGACGAAGGCGGTGATCGGAAGGGAGGAGATTGCAGCGATGGTGAGGAAGATAATGGCGGAGGAGGATGAAGAAGGGAAAGCCATTAGAGCAAAGGCTAAGGAACTTCAACGGAGTGCGGAGAAGTCCACGGCGGAAGGTGGGTCGTCGTTCGAGAACTTCGCTCGAGTGGTGAAGTCATGGCGTGAAGCACGAGTTTGAATTTCACTTTATTCAACTAAATTTGTATTTAGTAATATTAGAATTATATTATAAGATATCCCAATAATTATCACTTTTATTTTAATTATTATTCGATATTAATTAATATCTTTTTAATTTTGATAATTTATTTAATACAAAAATAACTAACTAATTGA

mRNA sequence

TTCATATTCATGAGTGAACGAAGTCTCAGCCATAGCCTGTTAGTTTCTGCACTTACTGTCTTCACCTTCTTCAGCTCGCCATGGAATCCCAACCTCATGTCGCTCTTGTTTCTAGCCCTGGAATGGGCCATCTCTTCCCCTCTCTGGAGCTCGCCACGCGCCTCTCCATGCGCCACCACCTCTCCGTCACCGTTTTCATCGTCCCTTCCCGCTCCTCCTCCGTCGAAAACAAAGTCATCGCCGCCGCTCAAGCTGCCGGTCTCTTCACTGTCATTGAACTTCCTCCCGCTGACATGTCCGACGTCACTGACTCCACTGTCGTCGGTCGCCTCTGCATCACCATGCGTCGCCACGTCCCGGCTCTCCGCTCCGCCGTCTCCGCTCTTACCACTCTCCCCTCCGTCCTCATCGCCGACATCTTCGCCACCGAGTCGTTCGCTGTCGCCGACGAGTTCCACATGGCCAAATACGTCTTCGTCGCCTCTAACGCATGGTTCTTAGCCTTAACCATCTGCCTCCCGGTTCTCGATAAGCAAATCACCGGTCAGTACGTGGACCAGAATGAACCGCTCCATATCCCTGGATGCGAACCGGTTCGACCTTGCGACGTTATCGACCCGCTTTTGGACCGGACCGAATCACAGTATTTCGAATACGTCAAAATCGGGATGGAGATACCATCGAGCGACGGCGTTTTGGTTAACACGTGGGATGATTTGCAAGGTCGCACGCTTGCATCTTTCAGAGATCGGAATTTGTTGGGCCGTATTATGAAGTCGCCGGTTTATTCTATTGGACCGATCGTACGGCAGACCGGTGGGAAGAAAGGCGGTTCGAGTGAGCTGTTCAATTGGCTGAGTAAGCAACCCGGTGAGTCAGTTATATACGTGTCGTTTGGGAGTGGTGGAACGCTGTCGTCTGAGCAAATGACGGAAGTGGCTCACGGATTGGAAATGAGCGGGCAGAGATTCGTTTGGGTGGTACGCGCCCCAAAGGTAAGATCGGACGCGACGTTTTTCACGACCGGCGACGGGAGTGAGGACCAATCAGAGGCGAAGTTTCTGCCAGATGGGTTTTTGGAGCGGACGTCGGAGGTGGGGTTTGTGGTGTCGATGTGGGCGGATCAGACGGCGGTGTTGGGGAGTCCGGCGGTGGGGGGTTTTTTCACGCACGGCGGATGGAACTCAGCGTTGGAAGGGATTACGAACGGAGTTCCAATGGTTGTGTGGCCGTTATATGCGGAGCAGCGGATGAACGCCACGATGCTGGCGGAAGAGGTGCGGGTGGCCGTCCGACCAAAGGAGCTGCCGACGAAGGCGGTGATCGGAAGGGAGGAGATTGCAGCGATGGTGAGGAAGATAATGGCGGAGGAGGATGAAGAAGGGAAAGCCATTAGAGCAAAGGCTAAGGAACTTCAACGGAGTGCGGAGAAGTCCACGGCGGAAGGTGGGTCGTCGTTCGAGAACTTCGCTCGAGTGGTGAAGTCATGGCGTGAAGCACGAGTTTGAATTTCACTTTATTCAACTAAATTTGTATTTAGTAATATTAGAATTATATTATAAGATATCCCAATAATTATCACTTTTATTTTAATTATTATTCGATATTAATTAATATCTTTTTAATTTTGATAATTTATTTAATACAAAAATAACTAACTAATTGA

Coding sequence (CDS)

ATGGAATCCCAACCTCATGTCGCTCTTGTTTCTAGCCCTGGAATGGGCCATCTCTTCCCCTCTCTGGAGCTCGCCACGCGCCTCTCCATGCGCCACCACCTCTCCGTCACCGTTTTCATCGTCCCTTCCCGCTCCTCCTCCGTCGAAAACAAAGTCATCGCCGCCGCTCAAGCTGCCGGTCTCTTCACTGTCATTGAACTTCCTCCCGCTGACATGTCCGACGTCACTGACTCCACTGTCGTCGGTCGCCTCTGCATCACCATGCGTCGCCACGTCCCGGCTCTCCGCTCCGCCGTCTCCGCTCTTACCACTCTCCCCTCCGTCCTCATCGCCGACATCTTCGCCACCGAGTCGTTCGCTGTCGCCGACGAGTTCCACATGGCCAAATACGTCTTCGTCGCCTCTAACGCATGGTTCTTAGCCTTAACCATCTGCCTCCCGGTTCTCGATAAGCAAATCACCGGTCAGTACGTGGACCAGAATGAACCGCTCCATATCCCTGGATGCGAACCGGTTCGACCTTGCGACGTTATCGACCCGCTTTTGGACCGGACCGAATCACAGTATTTCGAATACGTCAAAATCGGGATGGAGATACCATCGAGCGACGGCGTTTTGGTTAACACGTGGGATGATTTGCAAGGTCGCACGCTTGCATCTTTCAGAGATCGGAATTTGTTGGGCCGTATTATGAAGTCGCCGGTTTATTCTATTGGACCGATCGTACGGCAGACCGGTGGGAAGAAAGGCGGTTCGAGTGAGCTGTTCAATTGGCTGAGTAAGCAACCCGGTGAGTCAGTTATATACGTGTCGTTTGGGAGTGGTGGAACGCTGTCGTCTGAGCAAATGACGGAAGTGGCTCACGGATTGGAAATGAGCGGGCAGAGATTCGTTTGGGTGGTACGCGCCCCAAAGGTAAGATCGGACGCGACGTTTTTCACGACCGGCGACGGGAGTGAGGACCAATCAGAGGCGAAGTTTCTGCCAGATGGGTTTTTGGAGCGGACGTCGGAGGTGGGGTTTGTGGTGTCGATGTGGGCGGATCAGACGGCGGTGTTGGGGAGTCCGGCGGTGGGGGGTTTTTTCACGCACGGCGGATGGAACTCAGCGTTGGAAGGGATTACGAACGGAGTTCCAATGGTTGTGTGGCCGTTATATGCGGAGCAGCGGATGAACGCCACGATGCTGGCGGAAGAGGTGCGGGTGGCCGTCCGACCAAAGGAGCTGCCGACGAAGGCGGTGATCGGAAGGGAGGAGATTGCAGCGATGGTGAGGAAGATAATGGCGGAGGAGGATGAAGAAGGGAAAGCCATTAGAGCAAAGGCTAAGGAACTTCAACGGAGTGCGGAGAAGTCCACGGCGGAAGGTGGGTCGTCGTTCGAGAACTTCGCTCGAGTGGTGAAGTCATGGCGTGAAGCACGAGTTTGA

Protein sequence

MESQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSVENKVIAAAQAAGLFTVIELPPADMSDVTDSTVVGRLCITMRRHVPALRSAVSALTTLPSVLIADIFATESFAVADEFHMAKYVFVASNAWFLALTICLPVLDKQITGQYVDQNEPLHIPGCEPVRPCDVIDPLLDRTESQYFEYVKIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMKSPVYSIGPIVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWVVRAPKVRSDATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGGFFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEIAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVKSWREARV
Homology
BLAST of Cp4.1LG14g01440 vs. ExPASy Swiss-Prot
Match: Q40287 (Anthocyanidin 3-O-glucosyltransferase 5 OS=Manihot esculenta OX=3983 GN=GT5 PE=2 SV=1)

HSP 1 Score: 487.3 bits (1253), Expect = 2.0e-136
Identity = 250/476 (52.52%), Postives = 334/476 (70.17%), Query Frame = 0

Query: 1   MESQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSVENKVIAAAQAAG 60
           + S+PH+ L+SSPG+GHL P LEL  R+    +  VT+F+V S +S+ E +V+ +A    
Sbjct: 6   LNSKPHIVLLSSPGLGHLIPVLELGKRIVTLCNFDVTIFMVGSDTSAAEPQVLRSAMTPK 65

Query: 61  LFTVIELPPADMSDVTD--STVVGRLCITMRRHVPALRSAVSALTTLPSVLIADIFATES 120
           L  +I+LPP ++S + D  +TV  RL + MR   PA R+AVSAL   P+ +I D+F TES
Sbjct: 66  LCEIIQLPPPNISCLIDPEATVCTRLFVLMREIRPAFRAAVSALKFRPAAIIVDLFGTES 125

Query: 121 FAVADEFHMAKYVFVASNAWFLALTICLPVLDKQITGQYVDQNEPLHIPGCEPVRPCDVI 180
             VA E  +AKYV++ASNAWFLALTI +P+LDK++ G++V Q EP+ IPGC PVR  +V+
Sbjct: 126 LEVAKELGIAKYVYIASNAWFLALTIYVPILDKEVEGEFVLQKEPMKIPGCRPVRTEEVV 185

Query: 181 DPLLDRTESQYFEYVKIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMKSPVYSI 240
           DP+LDRT  QY EY ++G+EIP++DG+L+NTW+ L+  T  + RD   LGR+ K PV+ I
Sbjct: 186 DPMLDRTNQQYSEYFRLGIEIPTADGILMNTWEALEPTTFGALRDVKFLGRVAKVPVFPI 245

Query: 241 GPIVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFV 300
           GP+ RQ  G  G + EL +WL +QP ESV+YVSFGSGGTLS EQM E+A GLE S QRF+
Sbjct: 246 GPLRRQ-AGPCGSNCELLDWLDQQPKESVVYVSFGSGGTLSLEQMIELAWGLERSQQRFI 305

Query: 301 WVVRAPKVRS-DATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPA 360
           WVVR P V++ DA FFT GDG++D S   + P+GFL R   VG VV  W+ Q  ++  P+
Sbjct: 306 WVVRQPTVKTGDAAFFTQGDGADDMS--GYFPEGFLTRIQNVGLVVPQWSPQIHIMSHPS 365

Query: 361 VGGFFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGR 420
           VG F +H GWNS LE IT GVP++ WP+YAEQRMNAT+L EE+ VAVRPK LP K V+ R
Sbjct: 366 VGVFLSHCGWNSVLESITAGVPIIAWPIYAEQRMNATLLTEELGVAVRPKNLPAKEVVKR 425

Query: 421 EEIAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVKSWREA 474
           EEI  M+R+IM   DEEG  IR + +EL+ S EK+  EGGSSF   + +   W ++
Sbjct: 426 EEIERMIRRIMV--DEEGSEIRKRVRELKDSGEKALNEGGSSFNYMSALGNEWEKS 476

BLAST of Cp4.1LG14g01440 vs. ExPASy Swiss-Prot
Match: Q9ZU72 (UDP-glycosyltransferase 72D1 OS=Arabidopsis thaliana OX=3702 GN=UGT72D1 PE=2 SV=1)

HSP 1 Score: 440.7 bits (1132), Expect = 2.2e-122
Identity = 233/463 (50.32%), Postives = 320/463 (69.11%), Query Frame = 0

Query: 4   QPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSS-VENKVIAAAQAAGLF 63
           QPH  LV+SPG+GHL P LEL  RLS   ++ VT+  V S SSS  E + I AA A  + 
Sbjct: 3   QPHALLVASPGLGHLIPILELGNRLSSVLNIHVTILAVTSGSSSPTETEAIHAAAARTIC 62

Query: 64  TVIELPPADMSDVT--DSTVVGRLCITMRRHVPALRSAVSALTTLPSVLIADIFATESFA 123
            + E+P  D+ ++   D+T+  ++ + MR   PA+R AV  +   P+V+I D   TE  +
Sbjct: 63  QITEIPSVDVDNLVEPDATIFTKMVVKMRAMKPAVRDAVKLMKRKPTVMIVDFLGTELMS 122

Query: 124 VADEFHM-AKYVFVASNAWFLALTICLPVLDKQITGQYVDQNEPLHIPGCEPVRPCDVID 183
           VAD+  M AKYV+V ++AWFLA+ + LPVLD  + G+YVD  EPL IPGC+PV P ++++
Sbjct: 123 VADDVGMTAKYVYVPTHAWFLAVMVYLPVLDTVVEGEYVDIKEPLKIPGCKPVGPKELME 182

Query: 184 PLLDRTESQYFEYVKIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMKSPVYSIG 243
            +LDR+  QY E V+ G+E+P SDGVLVNTW++LQG TLA+ R+   L R+MK PVY IG
Sbjct: 183 TMLDRSGQQYKECVRAGLEVPMSDGVLVNTWEELQGNTLAALREDEELSRVMKVPVYPIG 242

Query: 244 PIVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVW 303
           PIVR T       + +F WL +Q   SV++V  GSGGTL+ EQ  E+A GLE+SGQRFVW
Sbjct: 243 PIVR-TNQHVDKPNSIFEWLDEQRERSVVFVCLGSGGTLTFEQTVELALGLELSGQRFVW 302

Query: 304 VVRAPKVRSDATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVG 363
           V+R P        +     S+D+  +  LP+GFL+RT  VG VV+ WA Q  +L   ++G
Sbjct: 303 VLRRP------ASYLGAISSDDEQVSASLPEGFLDRTRGVGIVVTQWAPQVEILSHRSIG 362

Query: 364 GFFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREE 423
           GF +H GW+SALE +T GVP++ WPLYAEQ MNAT+L EE+ VAVR  ELP++ VIGREE
Sbjct: 363 GFLSHCGWSSALESLTKGVPIIAWPLYAEQWMNATLLTEEIGVAVRTSELPSERVIGREE 422

Query: 424 IAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFEN 463
           +A++VRKIMAEEDEEG+ IRAKA+E++ S+E++ ++ GSS+ +
Sbjct: 423 VASLVRKIMAEEDEEGQKIRAKAEEVRVSSERAWSKDGSSYNS 458

BLAST of Cp4.1LG14g01440 vs. ExPASy Swiss-Prot
Match: Q94A84 (UDP-glycosyltransferase 72E1 OS=Arabidopsis thaliana OX=3702 GN=UGT72E1 PE=1 SV=1)

HSP 1 Score: 400.2 bits (1027), Expect = 3.2e-110
Identity = 211/469 (44.99%), Postives = 307/469 (65.46%), Query Frame = 0

Query: 3   SQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSVENKVIAAAQA-AGL 62
           ++PHVA+ +SPGMGH+ P +EL  RL+  H   VT+F++ + ++S +++ + +    A L
Sbjct: 4   TKPHVAMFASPGMGHIIPVIELGKRLAGSHGFDVTIFVLETDAASAQSQFLNSPGCDAAL 63

Query: 63  FTVIELPPADMSDVTD-STVVG-RLCITMRRHVPALRSAVSALTTLPSVLIADIFATESF 122
             ++ LP  D+S + D S   G +L + MR  +P +RS +  +   P+ LI D+F  ++ 
Sbjct: 64  VDIVGLPTPDISGLVDPSAFFGIKLLVMMRETIPTIRSKIEEMQHKPTALIVDLFGLDAI 123

Query: 123 AVADEFHMAKYVFVASNAWFLALTICLPVLDKQITGQYVDQNEPLHIPGCEPVRPCDVID 182
            +  EF+M  Y+F+ASNA FLA+ +  P LDK +  +++ + +P+ +PGCEPVR  D ++
Sbjct: 124 PLGGEFNMLTYIFIASNARFLAVALFFPTLDKDMEEEHIIKKQPMVMPGCEPVRFEDTLE 183

Query: 183 PLLDRTESQYFEYVKIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMKSPVYSIG 242
             LD     Y E+V  G   P+ DG++VNTWDD++ +TL S +D  LLGRI   PVY IG
Sbjct: 184 TFLDPNSQLYREFVPFGSVFPTCDGIIVNTWDDMEPKTLKSLQDPKLLGRIAGVPVYPIG 243

Query: 243 PIVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVW 302
           P+ R     K  +  + +WL+KQP ESV+Y+SFGSGG+LS++Q+TE+A GLEMS QRFVW
Sbjct: 244 PLSRPVDPSK-TNHPVLDWLNKQPDESVLYISFGSGGSLSAKQLTELAWGLEMSQQRFVW 303

Query: 303 VVRAPKVRSD-ATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAV 362
           VVR P   S  + + +   G        +LP+GF+ RT E GF+VS WA Q  +L   AV
Sbjct: 304 VVRPPVDGSACSAYLSANSGKIRDGTPDYLPEGFVSRTHERGFMVSSWAPQAEILAHQAV 363

Query: 363 GGFFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGRE 422
           GGF TH GWNS LE +  GVPM+ WPL+AEQ MNAT+L EE+ VAVR K+LP++ VI R 
Sbjct: 364 GGFLTHCGWNSILESVVGGVPMIAWPLFAEQMMNATLLNEELGVAVRSKKLPSEGVITRA 423

Query: 423 EIAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKS-TAEGGSSFENFARV 467
           EI A+VRKIM E  EEG  +R K K+L+ +A +S + +GG + E+ +R+
Sbjct: 424 EIEALVRKIMVE--EEGAEMRKKIKKLKETAAESLSCDGGVAHESLSRI 469

BLAST of Cp4.1LG14g01440 vs. ExPASy Swiss-Prot
Match: O81498 (UDP-glycosyltransferase 72E3 OS=Arabidopsis thaliana OX=3702 GN=UGT72E3 PE=1 SV=1)

HSP 1 Score: 384.0 bits (985), Expect = 2.4e-105
Identity = 206/471 (43.74%), Postives = 310/471 (65.82%), Query Frame = 0

Query: 3   SQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSVENKVIAAAQAAGLF 62
           ++PH A+ SSPGMGH+ P +ELA RLS  H   VTVF++ + ++SV++K++    + G+ 
Sbjct: 4   TKPHAAMFSSPGMGHVLPVIELAKRLSANHGFHVTVFVLETDAASVQSKLL---NSTGV- 63

Query: 63  TVIELPPADMSDVTD--STVVGRLCITMRRHVPALRSAVSALTTLPSVLIADIFATESFA 122
            ++ LP  D+S + D  + VV ++ + MR  VP LRS + A+   P+ LI D+F T++  
Sbjct: 64  DIVNLPSPDISGLVDPNAHVVTKIGVIMREAVPTLRSKIVAMHQNPTALIIDLFGTDALC 123

Query: 123 VADEFHMAKYVFVASNAWFLALTICLPVLDKQITGQYVDQNEPLHIPGCEPVRPCDVIDP 182
           +A E +M  YVF+ASNA +L ++I  P LD+ I  ++  Q +PL IPGCEPVR  D++D 
Sbjct: 124 LAAELNMLTYVFIASNARYLGVSIYYPTLDEVIKEEHTVQRKPLTIPGCEPVRFEDIMDA 183

Query: 183 LLDRTESQYFEYVKIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMKSPVYSIGP 242
            L   E  Y + V+  +  P +DG+LVNTW++++ ++L S +D  LLGR+ + PVY +GP
Sbjct: 184 YLVPDEPVYHDLVRHCLAYPKADGILVNTWEEMEPKSLKSLQDPKLLGRVARVPVYPVGP 243

Query: 243 IVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 302
           + R           +F+WL+KQP ESV+Y+SFGSGG+L+++Q+TE+A GLE S QRF+WV
Sbjct: 244 LCRPIQSST-TDHPVFDWLNKQPNESVLYISFGSGGSLTAQQLTELAWGLEESQQRFIWV 303

Query: 303 VRAPKVRSDAT-FFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVG 362
           VR P   S  + +F+   G    +  ++LP+GF+ RT + GF++  WA Q  +L   AVG
Sbjct: 304 VRPPVDGSSCSDYFSAKGGVTKDNTPEYLPEGFVTRTCDRGFMIPSWAPQAEILAHQAVG 363

Query: 363 GFFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREE 422
           GF TH GW+S LE +  GVPM+ WPL+AEQ MNA +L++E+ ++VR  +   K  I R +
Sbjct: 364 GFLTHCGWSSTLESVLCGVPMIAWPLFAEQNMNAALLSDELGISVRVDD--PKEAISRSK 423

Query: 423 IAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTA--EGGSSFENFARVVK 469
           I AMVRK+MAE  +EG+ +R K K+L+ +AE S +   GGS+ E+  RV K
Sbjct: 424 IEAMVRKVMAE--DEGEEMRRKVKKLRDTAEMSLSIHGGGSAHESLCRVTK 465

BLAST of Cp4.1LG14g01440 vs. ExPASy Swiss-Prot
Match: Q9LVR1 (UDP-glycosyltransferase 72E2 OS=Arabidopsis thaliana OX=3702 GN=UGT72E2 PE=1 SV=1)

HSP 1 Score: 380.2 bits (975), Expect = 3.5e-104
Identity = 210/472 (44.49%), Postives = 309/472 (65.47%), Query Frame = 0

Query: 3   SQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSVENKVIAAAQAAGLF 62
           ++PH A+ SSPGMGH+ P +EL  RLS  +   VTVF++ + ++S ++K +    + G+ 
Sbjct: 4   TKPHAAMFSSPGMGHVIPVIELGKRLSANNGFHVTVFVLETDAASAQSKFL---NSTGV- 63

Query: 63  TVIELPPADMSDVT--DSTVVGRLCITMRRHVPALRSAVSALTTLPSVLIADIFATESFA 122
            +++LP  D+  +   D  VV ++ + MR  VPALRS ++A+   P+ LI D+F T++  
Sbjct: 64  DIVKLPSPDIYGLVDPDDHVVTKIGVIMRAAVPALRSKIAAMHQKPTALIVDLFGTDALC 123

Query: 123 VADEFHMAKYVFVASNAWFLALTICLPVLDKQITGQYVDQNEPLHIPGCEPVRPCDVIDP 182
           +A EF+M  YVF+ +NA FL ++I  P LDK I  ++  Q  PL IPGCEPVR  D +D 
Sbjct: 124 LAKEFNMLSYVFIPTNARFLGVSIYYPNLDKDIKEEHTVQRNPLAIPGCEPVRFEDTLDA 183

Query: 183 LLDRTESQYFEYVKIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMKSPVYSIGP 242
            L   E  Y ++V+ G+  P +DG+LVNTW++++ ++L S  +  LLGR+ + PVY IGP
Sbjct: 184 YLVPDEPVYRDFVRHGLAYPKADGILVNTWEEMEPKSLKSLLNPKLLGRVARVPVYPIGP 243

Query: 243 IVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 302
           + R     +     + +WL++QP ESV+Y+SFGSGG LS++Q+TE+A GLE S QRFVWV
Sbjct: 244 LCRPIQSSE-TDHPVLDWLNEQPNESVLYISFGSGGCLSAKQLTELAWGLEQSQQRFVWV 303

Query: 303 VRAPKVRSDATFFTT--GDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAV 362
           VR P   S  + + +  G G+ED +  ++LP+GF+ RTS+ GFVV  WA Q  +L   AV
Sbjct: 304 VRPPVDGSCCSEYVSANGGGTEDNT-PEYLPEGFVSRTSDRGFVVPSWAPQAEILSHRAV 363

Query: 363 GGFFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGRE 422
           GGF TH GW+S LE +  GVPM+ WPL+AEQ MNA +L++E+ +AVR  +   K  I R 
Sbjct: 364 GGFLTHCGWSSTLESVVGGVPMIAWPLFAEQNMNAALLSDELGIAVRLDD--PKEDISRW 423

Query: 423 EIAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTA--EGGSSFENFARVVK 469
           +I A+VRK+M E  +EG+A+R K K+L+ SAE S +   GG + E+  RV K
Sbjct: 424 KIEALVRKVMTE--KEGEAMRRKVKKLRDSAEMSLSIDGGGLAHESLCRVTK 465

BLAST of Cp4.1LG14g01440 vs. NCBI nr
Match: XP_023551829.1 (anthocyanidin 3-O-glucosyltransferase 5-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 929 bits (2402), Expect = 0.0
Identity = 475/475 (100.00%), Postives = 475/475 (100.00%), Query Frame = 0

Query: 1   MESQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSVENKVIAAAQAAG 60
           MESQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSVENKVIAAAQAAG
Sbjct: 1   MESQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSVENKVIAAAQAAG 60

Query: 61  LFTVIELPPADMSDVTDSTVVGRLCITMRRHVPALRSAVSALTTLPSVLIADIFATESFA 120
           LFTVIELPPADMSDVTDSTVVGRLCITMRRHVPALRSAVSALTTLPSVLIADIFATESFA
Sbjct: 61  LFTVIELPPADMSDVTDSTVVGRLCITMRRHVPALRSAVSALTTLPSVLIADIFATESFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALTICLPVLDKQITGQYVDQNEPLHIPGCEPVRPCDVIDP 180
           VADEFHMAKYVFVASNAWFLALTICLPVLDKQITGQYVDQNEPLHIPGCEPVRPCDVIDP
Sbjct: 121 VADEFHMAKYVFVASNAWFLALTICLPVLDKQITGQYVDQNEPLHIPGCEPVRPCDVIDP 180

Query: 181 LLDRTESQYFEYVKIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMKSPVYSIGP 240
           LLDRTESQYFEYVKIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMKSPVYSIGP
Sbjct: 181 LLDRTESQYFEYVKIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMKSPVYSIGP 240

Query: 241 IVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 300
           IVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV
Sbjct: 241 IVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 300

Query: 301 VRAPKVRSDATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360
           VRAPKVRSDATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG
Sbjct: 301 VRAPKVRSDATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360

Query: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI 420
           FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI
Sbjct: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI 420

Query: 421 AAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVKSWREARV 475
           AAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVKSWREARV
Sbjct: 421 AAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVKSWREARV 475

BLAST of Cp4.1LG14g01440 vs. NCBI nr
Match: XP_022929544.1 (anthocyanidin 3-O-glucosyltransferase 5-like [Cucurbita moschata])

HSP 1 Score: 906 bits (2342), Expect = 0.0
Identity = 463/475 (97.47%), Postives = 467/475 (98.32%), Query Frame = 0

Query: 1   MESQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSVENKVIAAAQAAG 60
           MESQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSS ENKVIAAAQAAG
Sbjct: 1   MESQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSAENKVIAAAQAAG 60

Query: 61  LFTVIELPPADMSDVTDSTVVGRLCITMRRHVPALRSAVSALTTLPSVLIADIFATESFA 120
           LFTVIELPPADMSDVT+S VVGRLCITMRRHVPALRSAVS LTTLPSVLIADIFATESFA
Sbjct: 61  LFTVIELPPADMSDVTESAVVGRLCITMRRHVPALRSAVSTLTTLPSVLIADIFATESFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALTICLPVLDKQITGQYVDQNEPLHIPGCEPVRPCDVIDP 180
           VADEFHMAKYVFVASNAWFLA TI +PVLDKQITGQYVDQ EPL+IPGCEPVRPCDVIDP
Sbjct: 121 VADEFHMAKYVFVASNAWFLAFTIYVPVLDKQITGQYVDQKEPLYIPGCEPVRPCDVIDP 180

Query: 181 LLDRTESQYFEYVKIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMKSPVYSIGP 240
           LLDRTE QYFEYV+IGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIM SPVYSIGP
Sbjct: 181 LLDRTEPQYFEYVRIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMNSPVYSIGP 240

Query: 241 IVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 300
           IVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV
Sbjct: 241 IVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 300

Query: 301 VRAPKVRSDATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360
           VRAPKVRSDATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG
Sbjct: 301 VRAPKVRSDATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360

Query: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI 420
           FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI
Sbjct: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI 420

Query: 421 AAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVKSWREARV 475
           AAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVKSWREARV
Sbjct: 421 AAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVKSWREARV 475

BLAST of Cp4.1LG14g01440 vs. NCBI nr
Match: KAG7015349.1 (Anthocyanidin 3-O-glucosyltransferase 5 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 905 bits (2340), Expect = 0.0
Identity = 463/475 (97.47%), Postives = 467/475 (98.32%), Query Frame = 0

Query: 1   MESQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSVENKVIAAAQAAG 60
           MESQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSS ENKVIAAAQAAG
Sbjct: 1   MESQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSAENKVIAAAQAAG 60

Query: 61  LFTVIELPPADMSDVTDSTVVGRLCITMRRHVPALRSAVSALTTLPSVLIADIFATESFA 120
           LFTVIELPPADMSDVT+S VVGRLCITMRRHVPALRSAVS LTTLPSVLIADIFATESFA
Sbjct: 61  LFTVIELPPADMSDVTESAVVGRLCITMRRHVPALRSAVSTLTTLPSVLIADIFATESFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALTICLPVLDKQITGQYVDQNEPLHIPGCEPVRPCDVIDP 180
           VADEFHMAKYVFVASNAWFLA TI +PVLDKQITGQYVDQ EPL+IPGCEPVRPCDVIDP
Sbjct: 121 VADEFHMAKYVFVASNAWFLAFTIYVPVLDKQITGQYVDQKEPLYIPGCEPVRPCDVIDP 180

Query: 181 LLDRTESQYFEYVKIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMKSPVYSIGP 240
           LLDRTE QYFEYV+IGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMKSPVYSIGP
Sbjct: 181 LLDRTEPQYFEYVRIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMKSPVYSIGP 240

Query: 241 IVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 300
           IVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV
Sbjct: 241 IVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 300

Query: 301 VRAPKVRSDATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360
           VRAPKVRSDATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG
Sbjct: 301 VRAPKVRSDATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360

Query: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI 420
           FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI
Sbjct: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI 420

Query: 421 AAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVKSWREARV 475
           AAMVRKIMA EDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVKSWREARV
Sbjct: 421 AAMVRKIMAVEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVKSWREARV 475

BLAST of Cp4.1LG14g01440 vs. NCBI nr
Match: XP_022985065.1 (anthocyanidin 3-O-glucosyltransferase 5-like [Cucurbita maxima] >XP_022985066.1 anthocyanidin 3-O-glucosyltransferase 5-like [Cucurbita maxima])

HSP 1 Score: 889 bits (2296), Expect = 0.0
Identity = 454/475 (95.58%), Postives = 462/475 (97.26%), Query Frame = 0

Query: 1   MESQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSVENKVIAAAQAAG 60
           MESQPHVAL+SSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSS E KVIAAAQAAG
Sbjct: 1   MESQPHVALISSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSAEYKVIAAAQAAG 60

Query: 61  LFTVIELPPADMSDVTDSTVVGRLCITMRRHVPALRSAVSALTTLPSVLIADIFATESFA 120
           LFTVIELPPADMSDVTDSTVVGRL ITMRRHVPALRSAVSALT+LPSVLIADIFATESFA
Sbjct: 61  LFTVIELPPADMSDVTDSTVVGRLSITMRRHVPALRSAVSALTSLPSVLIADIFATESFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALTICLPVLDKQITGQYVDQNEPLHIPGCEPVRPCDVIDP 180
           VADEFHMAKYVFVASNAWF ALTI +PVLDKQI GQYVDQ EP HIPGCEPVRPCDV+DP
Sbjct: 121 VADEFHMAKYVFVASNAWFFALTIYVPVLDKQINGQYVDQKEPFHIPGCEPVRPCDVMDP 180

Query: 181 LLDRTESQYFEYVKIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMKSPVYSIGP 240
           LLDRTE QYFEYV+IG EIPSSDGVLVNTWDDL+GRTLASFRD NLLGRIMKSPVYSIGP
Sbjct: 181 LLDRTEPQYFEYVRIGTEIPSSDGVLVNTWDDLEGRTLASFRDWNLLGRIMKSPVYSIGP 240

Query: 241 IVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 300
           IVRQTGGKKGG+SELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV
Sbjct: 241 IVRQTGGKKGGASELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 300

Query: 301 VRAPKVRSDATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360
           VRAPKVRSDATFFTTGDG+EDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG
Sbjct: 301 VRAPKVRSDATFFTTGDGTEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360

Query: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI 420
           FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI
Sbjct: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI 420

Query: 421 AAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVKSWREARV 475
           AAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVK WRE RV
Sbjct: 421 AAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVKLWRENRV 475

BLAST of Cp4.1LG14g01440 vs. NCBI nr
Match: XP_038880693.1 (anthocyanidin 3-O-glucosyltransferase 5-like [Benincasa hispida])

HSP 1 Score: 822 bits (2122), Expect = 4.09e-298
Identity = 419/468 (89.53%), Postives = 439/468 (93.80%), Query Frame = 0

Query: 1   MESQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSVENKVIAAAQAAG 60
           M+SQ HVAL+SSPGMGHLFPSLELATRLS RHHL+VTVFIVPS SS+ ENKVIAAA+AAG
Sbjct: 1   MDSQTHVALISSPGMGHLFPSLELATRLSTRHHLTVTVFIVPSHSSNAENKVIAAAEAAG 60

Query: 61  LFTVIELPPADMSDVTDSTVVGRLCITMRRHVPALRSAVSALTTLPSVLIADIFATESFA 120
           LFTV+ELPPADMSDVTDS+VVGRL ITMRRHVP LRSAVSALT+LPSVLIADIFATESFA
Sbjct: 61  LFTVVELPPADMSDVTDSSVVGRLAITMRRHVPILRSAVSALTSLPSVLIADIFATESFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALTICLPVLDKQITGQYVDQNEPLHIPGCEPVRPCDVIDP 180
           VADEFHMAKYVFVASNAWFLALT+   V DKQI GQYVDQ EPL IPGCEPVRPCDVIDP
Sbjct: 121 VADEFHMAKYVFVASNAWFLALTVYAQVWDKQIVGQYVDQKEPLQIPGCEPVRPCDVIDP 180

Query: 181 LLDRTESQYFEYVKIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMKSPVYSIGP 240
           LLDRT+ QYFE VK+GM I S DGVLVNTWDDLQGRTLASFRDRNLLG+IMK PVYSIGP
Sbjct: 181 LLDRTQPQYFEIVKVGMGIASCDGVLVNTWDDLQGRTLASFRDRNLLGKIMKPPVYSIGP 240

Query: 241 IVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 300
           IVRQ+G KKGGSSELFNWLSKQP ESVIYVSFGSGGTLS EQMTEVAHGLEMS QRFVWV
Sbjct: 241 IVRQSGSKKGGSSELFNWLSKQPTESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWV 300

Query: 301 VRAPKVRSDATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360
           VRAPKVRSD  +FTTGDGSE+QS  KFLP+GFLERTSEVGFVVSMWADQTAVLGSPAVGG
Sbjct: 301 VRAPKVRSDGAYFTTGDGSEEQSAGKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360

Query: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI 420
           FFTHGGWNSALEGITNGVPMVVWPLYAEQR+NATMLAEEVRVAVRPKELPTKAVIGREEI
Sbjct: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRLNATMLAEEVRVAVRPKELPTKAVIGREEI 420

Query: 421 AAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVK 468
           AAMVRKIMAEEDEEGKAIRAKAKELQRSAE ++AE GSS+ENFARVVK
Sbjct: 421 AAMVRKIMAEEDEEGKAIRAKAKELQRSAENASAEDGSSYENFARVVK 468

BLAST of Cp4.1LG14g01440 vs. ExPASy TrEMBL
Match: A0A6J1EP22 (Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111436081 PE=3 SV=1)

HSP 1 Score: 906 bits (2342), Expect = 0.0
Identity = 463/475 (97.47%), Postives = 467/475 (98.32%), Query Frame = 0

Query: 1   MESQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSVENKVIAAAQAAG 60
           MESQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSS ENKVIAAAQAAG
Sbjct: 1   MESQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSAENKVIAAAQAAG 60

Query: 61  LFTVIELPPADMSDVTDSTVVGRLCITMRRHVPALRSAVSALTTLPSVLIADIFATESFA 120
           LFTVIELPPADMSDVT+S VVGRLCITMRRHVPALRSAVS LTTLPSVLIADIFATESFA
Sbjct: 61  LFTVIELPPADMSDVTESAVVGRLCITMRRHVPALRSAVSTLTTLPSVLIADIFATESFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALTICLPVLDKQITGQYVDQNEPLHIPGCEPVRPCDVIDP 180
           VADEFHMAKYVFVASNAWFLA TI +PVLDKQITGQYVDQ EPL+IPGCEPVRPCDVIDP
Sbjct: 121 VADEFHMAKYVFVASNAWFLAFTIYVPVLDKQITGQYVDQKEPLYIPGCEPVRPCDVIDP 180

Query: 181 LLDRTESQYFEYVKIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMKSPVYSIGP 240
           LLDRTE QYFEYV+IGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIM SPVYSIGP
Sbjct: 181 LLDRTEPQYFEYVRIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMNSPVYSIGP 240

Query: 241 IVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 300
           IVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV
Sbjct: 241 IVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 300

Query: 301 VRAPKVRSDATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360
           VRAPKVRSDATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG
Sbjct: 301 VRAPKVRSDATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360

Query: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI 420
           FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI
Sbjct: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI 420

Query: 421 AAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVKSWREARV 475
           AAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVKSWREARV
Sbjct: 421 AAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVKSWREARV 475

BLAST of Cp4.1LG14g01440 vs. ExPASy TrEMBL
Match: A0A6J1J726 (Glycosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111483150 PE=3 SV=1)

HSP 1 Score: 889 bits (2296), Expect = 0.0
Identity = 454/475 (95.58%), Postives = 462/475 (97.26%), Query Frame = 0

Query: 1   MESQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSVENKVIAAAQAAG 60
           MESQPHVAL+SSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSS E KVIAAAQAAG
Sbjct: 1   MESQPHVALISSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSAEYKVIAAAQAAG 60

Query: 61  LFTVIELPPADMSDVTDSTVVGRLCITMRRHVPALRSAVSALTTLPSVLIADIFATESFA 120
           LFTVIELPPADMSDVTDSTVVGRL ITMRRHVPALRSAVSALT+LPSVLIADIFATESFA
Sbjct: 61  LFTVIELPPADMSDVTDSTVVGRLSITMRRHVPALRSAVSALTSLPSVLIADIFATESFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALTICLPVLDKQITGQYVDQNEPLHIPGCEPVRPCDVIDP 180
           VADEFHMAKYVFVASNAWF ALTI +PVLDKQI GQYVDQ EP HIPGCEPVRPCDV+DP
Sbjct: 121 VADEFHMAKYVFVASNAWFFALTIYVPVLDKQINGQYVDQKEPFHIPGCEPVRPCDVMDP 180

Query: 181 LLDRTESQYFEYVKIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMKSPVYSIGP 240
           LLDRTE QYFEYV+IG EIPSSDGVLVNTWDDL+GRTLASFRD NLLGRIMKSPVYSIGP
Sbjct: 181 LLDRTEPQYFEYVRIGTEIPSSDGVLVNTWDDLEGRTLASFRDWNLLGRIMKSPVYSIGP 240

Query: 241 IVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 300
           IVRQTGGKKGG+SELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV
Sbjct: 241 IVRQTGGKKGGASELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 300

Query: 301 VRAPKVRSDATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360
           VRAPKVRSDATFFTTGDG+EDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG
Sbjct: 301 VRAPKVRSDATFFTTGDGTEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360

Query: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI 420
           FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI
Sbjct: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI 420

Query: 421 AAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVKSWREARV 475
           AAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVK WRE RV
Sbjct: 421 AAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVKLWRENRV 475

BLAST of Cp4.1LG14g01440 vs. ExPASy TrEMBL
Match: A0A6J1FPT8 (Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111447126 PE=3 SV=1)

HSP 1 Score: 779 bits (2012), Expect = 1.00e-281
Identity = 391/468 (83.55%), Postives = 429/468 (91.67%), Query Frame = 0

Query: 1   MESQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSVENKVIAAAQAAG 60
           M+S  HVAL+SSPGMGHLFPSLELATRLS RHHL++TVF+V S SSS EN V+AAA+A G
Sbjct: 1   MDSPTHVALISSPGMGHLFPSLELATRLSTRHHLTLTVFLVTSHSSSAENNVVAAAEATG 60

Query: 61  LFTVIELPPADMSDVTDSTVVGRLCITMRRHVPALRSAVSALTTLPSVLIADIFATESFA 120
           LFTV+ELPPADMSDVTDSTVVGRL ITMRRHVPALRSA+SALT+ PS LIADIF+TE+FA
Sbjct: 61  LFTVVELPPADMSDVTDSTVVGRLAITMRRHVPALRSAISALTSRPSALIADIFSTEAFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALTICLPVLDKQITGQYVDQNEPLHIPGCEPVRPCDVIDP 180
           VADEFHMAKYVFVASNAWFLALTI   VLDKQI GQYVDQ EPL IPGCEPVRPCDV+DP
Sbjct: 121 VADEFHMAKYVFVASNAWFLALTIYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDP 180

Query: 181 LLDRTESQYFEYVKIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMKSPVYSIGP 240
           +LDRTESQY+EYVK+G  I SS GVLVN+WD+LQGRTLASF+DR+LLGR+M +PVYSIGP
Sbjct: 181 MLDRTESQYYEYVKMGRAIASSHGVLVNSWDELQGRTLASFKDRSLLGRVMNAPVYSIGP 240

Query: 241 IVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 300
           IVR  G  K GSSELFNWL KQPG+SVIYVSFGSGGTLS EQMTE+AHGLE+S QRFVWV
Sbjct: 241 IVRHFGSGKDGSSELFNWLRKQPGKSVIYVSFGSGGTLSFEQMTEMAHGLELSRQRFVWV 300

Query: 301 VRAPKVRSDATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360
           VR P VRSDA FFTTGDGSEDQSEA++LP+GFLERTSEVGF+VSMWA+QTAVLGSPAVGG
Sbjct: 301 VRPPTVRSDAMFFTTGDGSEDQSEARYLPEGFLERTSEVGFLVSMWAEQTAVLGSPAVGG 360

Query: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI 420
           FFTHGGWNS+LEGIT GVPM+VWPLYAEQRMNATMLA+E+ VAVRPKELP  AVIGREEI
Sbjct: 361 FFTHGGWNSSLEGITKGVPMIVWPLYAEQRMNATMLADEMGVAVRPKELPGNAVIGREEI 420

Query: 421 AAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVK 468
           AAMVRKIMAEEDEEG+AIRAKA ELQRSAEK+ A+GGSS+ENFARVVK
Sbjct: 421 AAMVRKIMAEEDEEGRAIRAKAMELQRSAEKACAQGGSSYENFARVVK 468

BLAST of Cp4.1LG14g01440 vs. ExPASy TrEMBL
Match: A0A6J1FTD7 (Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111447126 PE=3 SV=1)

HSP 1 Score: 779 bits (2012), Expect = 1.08e-281
Identity = 391/468 (83.55%), Postives = 429/468 (91.67%), Query Frame = 0

Query: 1   MESQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSVENKVIAAAQAAG 60
           M+S  HVAL+SSPGMGHLFPSLELATRLS RHHL++TVF+V S SSS EN V+AAA+A G
Sbjct: 1   MDSPTHVALISSPGMGHLFPSLELATRLSTRHHLTLTVFLVTSHSSSAENNVVAAAEATG 60

Query: 61  LFTVIELPPADMSDVTDSTVVGRLCITMRRHVPALRSAVSALTTLPSVLIADIFATESFA 120
           LFTV+ELPPADMSDVTDSTVVGRL ITMRRHVPALRSA+SALT+ PS LIADIF+TE+FA
Sbjct: 61  LFTVVELPPADMSDVTDSTVVGRLAITMRRHVPALRSAISALTSRPSALIADIFSTEAFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALTICLPVLDKQITGQYVDQNEPLHIPGCEPVRPCDVIDP 180
           VADEFHMAKYVFVASNAWFLALTI   VLDKQI GQYVDQ EPL IPGCEPVRPCDV+DP
Sbjct: 121 VADEFHMAKYVFVASNAWFLALTIYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDP 180

Query: 181 LLDRTESQYFEYVKIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMKSPVYSIGP 240
           +LDRTESQY+EYVK+G  I SS GVLVN+WD+LQGRTLASF+DR+LLGR+M +PVYSIGP
Sbjct: 181 MLDRTESQYYEYVKMGRAIASSHGVLVNSWDELQGRTLASFKDRSLLGRVMNAPVYSIGP 240

Query: 241 IVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 300
           IVR  G  K GSSELFNWL KQPG+SVIYVSFGSGGTLS EQMTE+AHGLE+S QRFVWV
Sbjct: 241 IVRHFGSGKDGSSELFNWLRKQPGKSVIYVSFGSGGTLSFEQMTEMAHGLELSRQRFVWV 300

Query: 301 VRAPKVRSDATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360
           VR P VRSDA FFTTGDGSEDQSEA++LP+GFLERTSEVGF+VSMWA+QTAVLGSPAVGG
Sbjct: 301 VRPPTVRSDAMFFTTGDGSEDQSEARYLPEGFLERTSEVGFLVSMWAEQTAVLGSPAVGG 360

Query: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI 420
           FFTHGGWNS+LEGIT GVPM+VWPLYAEQRMNATMLA+E+ VAVRPKELP  AVIGREEI
Sbjct: 361 FFTHGGWNSSLEGITKGVPMIVWPLYAEQRMNATMLADEMGVAVRPKELPGNAVIGREEI 420

Query: 421 AAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVK 468
           AAMVRKIMAEEDEEG+AIRAKA ELQRSAEK+ A+GGSS+ENFARVVK
Sbjct: 421 AAMVRKIMAEEDEEGRAIRAKAMELQRSAEKACAQGGSSYENFARVVK 468

BLAST of Cp4.1LG14g01440 vs. ExPASy TrEMBL
Match: A0A6J1IW98 (Glycosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111479797 PE=3 SV=1)

HSP 1 Score: 771 bits (1990), Expect = 5.28e-278
Identity = 386/468 (82.48%), Postives = 427/468 (91.24%), Query Frame = 0

Query: 1   MESQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSVENKVIAAAQAAG 60
           M+S  HVAL+SSPGMGHLFPSLELATRLS RHHL++TVF+V S SSS EN V+AAA+AAG
Sbjct: 1   MDSPTHVALISSPGMGHLFPSLELATRLSTRHHLTLTVFLVTSHSSSAENNVVAAAEAAG 60

Query: 61  LFTVIELPPADMSDVTDSTVVGRLCITMRRHVPALRSAVSALTTLPSVLIADIFATESFA 120
           LFTV+ELPPADMSDVTD TVVGRL ITMRRHVPALRSA+SALT+ PS LIADIF+TE+FA
Sbjct: 61  LFTVVELPPADMSDVTDFTVVGRLAITMRRHVPALRSAISALTSRPSALIADIFSTEAFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALTICLPVLDKQITGQYVDQNEPLHIPGCEPVRPCDVIDP 180
           VADEFHMAKYVFVASNAWFLALTI   VLDKQI GQYVDQ +PL IPGCEPVRPCDV+DP
Sbjct: 121 VADEFHMAKYVFVASNAWFLALTIYAQVLDKQIVGQYVDQKKPLQIPGCEPVRPCDVVDP 180

Query: 181 LLDRTESQYFEYVKIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMKSPVYSIGP 240
           +LDRTE QY+EYVK+GM I SS GVLVN+WD+LQGR LASF+DR+LLGR+MK+PVYSIGP
Sbjct: 181 MLDRTEFQYYEYVKVGMAIASSHGVLVNSWDELQGRALASFKDRSLLGRVMKAPVYSIGP 240

Query: 241 IVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 300
           IVR  G  K GSSELFNWL KQPG+SVIYVSFGSGGTLS EQMTE+AHGLE+S QRFVWV
Sbjct: 241 IVRHFGSGKDGSSELFNWLRKQPGKSVIYVSFGSGGTLSFEQMTEMAHGLELSRQRFVWV 300

Query: 301 VRAPKVRSDATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360
           VR P VRSDA FFT GDGS+DQS+A+FLP+GFLERTSEVGF+VSMWA+QTAVLGSPAVGG
Sbjct: 301 VRPPTVRSDAMFFTIGDGSDDQSKARFLPEGFLERTSEVGFLVSMWAEQTAVLGSPAVGG 360

Query: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI 420
           FFTHGGWNS+LEGIT GVPM+VWPLYAEQRMNATMLA+E+ VAVRPKELP  AVIGREEI
Sbjct: 361 FFTHGGWNSSLEGITKGVPMIVWPLYAEQRMNATMLADEMGVAVRPKELPGNAVIGREEI 420

Query: 421 AAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVK 468
           AAMVRKIMA EDEEGKAIRAK +ELQRSAEK+ A+GGSS++NFARVVK
Sbjct: 421 AAMVRKIMAAEDEEGKAIRAKVEELQRSAEKACAQGGSSYQNFARVVK 468

BLAST of Cp4.1LG14g01440 vs. TAIR 10
Match: AT2G18570.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 440.7 bits (1132), Expect = 1.5e-123
Identity = 233/463 (50.32%), Postives = 320/463 (69.11%), Query Frame = 0

Query: 4   QPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSS-VENKVIAAAQAAGLF 63
           QPH  LV+SPG+GHL P LEL  RLS   ++ VT+  V S SSS  E + I AA A  + 
Sbjct: 3   QPHALLVASPGLGHLIPILELGNRLSSVLNIHVTILAVTSGSSSPTETEAIHAAAARTIC 62

Query: 64  TVIELPPADMSDVT--DSTVVGRLCITMRRHVPALRSAVSALTTLPSVLIADIFATESFA 123
            + E+P  D+ ++   D+T+  ++ + MR   PA+R AV  +   P+V+I D   TE  +
Sbjct: 63  QITEIPSVDVDNLVEPDATIFTKMVVKMRAMKPAVRDAVKLMKRKPTVMIVDFLGTELMS 122

Query: 124 VADEFHM-AKYVFVASNAWFLALTICLPVLDKQITGQYVDQNEPLHIPGCEPVRPCDVID 183
           VAD+  M AKYV+V ++AWFLA+ + LPVLD  + G+YVD  EPL IPGC+PV P ++++
Sbjct: 123 VADDVGMTAKYVYVPTHAWFLAVMVYLPVLDTVVEGEYVDIKEPLKIPGCKPVGPKELME 182

Query: 184 PLLDRTESQYFEYVKIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMKSPVYSIG 243
            +LDR+  QY E V+ G+E+P SDGVLVNTW++LQG TLA+ R+   L R+MK PVY IG
Sbjct: 183 TMLDRSGQQYKECVRAGLEVPMSDGVLVNTWEELQGNTLAALREDEELSRVMKVPVYPIG 242

Query: 244 PIVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVW 303
           PIVR T       + +F WL +Q   SV++V  GSGGTL+ EQ  E+A GLE+SGQRFVW
Sbjct: 243 PIVR-TNQHVDKPNSIFEWLDEQRERSVVFVCLGSGGTLTFEQTVELALGLELSGQRFVW 302

Query: 304 VVRAPKVRSDATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVG 363
           V+R P        +     S+D+  +  LP+GFL+RT  VG VV+ WA Q  +L   ++G
Sbjct: 303 VLRRP------ASYLGAISSDDEQVSASLPEGFLDRTRGVGIVVTQWAPQVEILSHRSIG 362

Query: 364 GFFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREE 423
           GF +H GW+SALE +T GVP++ WPLYAEQ MNAT+L EE+ VAVR  ELP++ VIGREE
Sbjct: 363 GFLSHCGWSSALESLTKGVPIIAWPLYAEQWMNATLLTEEIGVAVRTSELPSERVIGREE 422

Query: 424 IAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFEN 463
           +A++VRKIMAEEDEEG+ IRAKA+E++ S+E++ ++ GSS+ +
Sbjct: 423 VASLVRKIMAEEDEEGQKIRAKAEEVRVSSERAWSKDGSSYNS 458

BLAST of Cp4.1LG14g01440 vs. TAIR 10
Match: AT3G50740.1 (UDP-glucosyl transferase 72E1 )

HSP 1 Score: 400.2 bits (1027), Expect = 2.3e-111
Identity = 211/469 (44.99%), Postives = 307/469 (65.46%), Query Frame = 0

Query: 3   SQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSVENKVIAAAQA-AGL 62
           ++PHVA+ +SPGMGH+ P +EL  RL+  H   VT+F++ + ++S +++ + +    A L
Sbjct: 4   TKPHVAMFASPGMGHIIPVIELGKRLAGSHGFDVTIFVLETDAASAQSQFLNSPGCDAAL 63

Query: 63  FTVIELPPADMSDVTD-STVVG-RLCITMRRHVPALRSAVSALTTLPSVLIADIFATESF 122
             ++ LP  D+S + D S   G +L + MR  +P +RS +  +   P+ LI D+F  ++ 
Sbjct: 64  VDIVGLPTPDISGLVDPSAFFGIKLLVMMRETIPTIRSKIEEMQHKPTALIVDLFGLDAI 123

Query: 123 AVADEFHMAKYVFVASNAWFLALTICLPVLDKQITGQYVDQNEPLHIPGCEPVRPCDVID 182
            +  EF+M  Y+F+ASNA FLA+ +  P LDK +  +++ + +P+ +PGCEPVR  D ++
Sbjct: 124 PLGGEFNMLTYIFIASNARFLAVALFFPTLDKDMEEEHIIKKQPMVMPGCEPVRFEDTLE 183

Query: 183 PLLDRTESQYFEYVKIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMKSPVYSIG 242
             LD     Y E+V  G   P+ DG++VNTWDD++ +TL S +D  LLGRI   PVY IG
Sbjct: 184 TFLDPNSQLYREFVPFGSVFPTCDGIIVNTWDDMEPKTLKSLQDPKLLGRIAGVPVYPIG 243

Query: 243 PIVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVW 302
           P+ R     K  +  + +WL+KQP ESV+Y+SFGSGG+LS++Q+TE+A GLEMS QRFVW
Sbjct: 244 PLSRPVDPSK-TNHPVLDWLNKQPDESVLYISFGSGGSLSAKQLTELAWGLEMSQQRFVW 303

Query: 303 VVRAPKVRSD-ATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAV 362
           VVR P   S  + + +   G        +LP+GF+ RT E GF+VS WA Q  +L   AV
Sbjct: 304 VVRPPVDGSACSAYLSANSGKIRDGTPDYLPEGFVSRTHERGFMVSSWAPQAEILAHQAV 363

Query: 363 GGFFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGRE 422
           GGF TH GWNS LE +  GVPM+ WPL+AEQ MNAT+L EE+ VAVR K+LP++ VI R 
Sbjct: 364 GGFLTHCGWNSILESVVGGVPMIAWPLFAEQMMNATLLNEELGVAVRSKKLPSEGVITRA 423

Query: 423 EIAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKS-TAEGGSSFENFARV 467
           EI A+VRKIM E  EEG  +R K K+L+ +A +S + +GG + E+ +R+
Sbjct: 424 EIEALVRKIMVE--EEGAEMRKKIKKLKETAAESLSCDGGVAHESLSRI 469

BLAST of Cp4.1LG14g01440 vs. TAIR 10
Match: AT5G26310.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 384.0 bits (985), Expect = 1.7e-106
Identity = 206/471 (43.74%), Postives = 310/471 (65.82%), Query Frame = 0

Query: 3   SQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSVENKVIAAAQAAGLF 62
           ++PH A+ SSPGMGH+ P +ELA RLS  H   VTVF++ + ++SV++K++    + G+ 
Sbjct: 4   TKPHAAMFSSPGMGHVLPVIELAKRLSANHGFHVTVFVLETDAASVQSKLL---NSTGV- 63

Query: 63  TVIELPPADMSDVTD--STVVGRLCITMRRHVPALRSAVSALTTLPSVLIADIFATESFA 122
            ++ LP  D+S + D  + VV ++ + MR  VP LRS + A+   P+ LI D+F T++  
Sbjct: 64  DIVNLPSPDISGLVDPNAHVVTKIGVIMREAVPTLRSKIVAMHQNPTALIIDLFGTDALC 123

Query: 123 VADEFHMAKYVFVASNAWFLALTICLPVLDKQITGQYVDQNEPLHIPGCEPVRPCDVIDP 182
           +A E +M  YVF+ASNA +L ++I  P LD+ I  ++  Q +PL IPGCEPVR  D++D 
Sbjct: 124 LAAELNMLTYVFIASNARYLGVSIYYPTLDEVIKEEHTVQRKPLTIPGCEPVRFEDIMDA 183

Query: 183 LLDRTESQYFEYVKIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMKSPVYSIGP 242
            L   E  Y + V+  +  P +DG+LVNTW++++ ++L S +D  LLGR+ + PVY +GP
Sbjct: 184 YLVPDEPVYHDLVRHCLAYPKADGILVNTWEEMEPKSLKSLQDPKLLGRVARVPVYPVGP 243

Query: 243 IVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 302
           + R           +F+WL+KQP ESV+Y+SFGSGG+L+++Q+TE+A GLE S QRF+WV
Sbjct: 244 LCRPIQSST-TDHPVFDWLNKQPNESVLYISFGSGGSLTAQQLTELAWGLEESQQRFIWV 303

Query: 303 VRAPKVRSDAT-FFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVG 362
           VR P   S  + +F+   G    +  ++LP+GF+ RT + GF++  WA Q  +L   AVG
Sbjct: 304 VRPPVDGSSCSDYFSAKGGVTKDNTPEYLPEGFVTRTCDRGFMIPSWAPQAEILAHQAVG 363

Query: 363 GFFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREE 422
           GF TH GW+S LE +  GVPM+ WPL+AEQ MNA +L++E+ ++VR  +   K  I R +
Sbjct: 364 GFLTHCGWSSTLESVLCGVPMIAWPLFAEQNMNAALLSDELGISVRVDD--PKEAISRSK 423

Query: 423 IAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTA--EGGSSFENFARVVK 469
           I AMVRK+MAE  +EG+ +R K K+L+ +AE S +   GGS+ E+  RV K
Sbjct: 424 IEAMVRKVMAE--DEGEEMRRKVKKLRDTAEMSLSIHGGGSAHESLCRVTK 465

BLAST of Cp4.1LG14g01440 vs. TAIR 10
Match: AT5G66690.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 380.2 bits (975), Expect = 2.5e-105
Identity = 210/472 (44.49%), Postives = 309/472 (65.47%), Query Frame = 0

Query: 3   SQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSVENKVIAAAQAAGLF 62
           ++PH A+ SSPGMGH+ P +EL  RLS  +   VTVF++ + ++S ++K +    + G+ 
Sbjct: 4   TKPHAAMFSSPGMGHVIPVIELGKRLSANNGFHVTVFVLETDAASAQSKFL---NSTGV- 63

Query: 63  TVIELPPADMSDVT--DSTVVGRLCITMRRHVPALRSAVSALTTLPSVLIADIFATESFA 122
            +++LP  D+  +   D  VV ++ + MR  VPALRS ++A+   P+ LI D+F T++  
Sbjct: 64  DIVKLPSPDIYGLVDPDDHVVTKIGVIMRAAVPALRSKIAAMHQKPTALIVDLFGTDALC 123

Query: 123 VADEFHMAKYVFVASNAWFLALTICLPVLDKQITGQYVDQNEPLHIPGCEPVRPCDVIDP 182
           +A EF+M  YVF+ +NA FL ++I  P LDK I  ++  Q  PL IPGCEPVR  D +D 
Sbjct: 124 LAKEFNMLSYVFIPTNARFLGVSIYYPNLDKDIKEEHTVQRNPLAIPGCEPVRFEDTLDA 183

Query: 183 LLDRTESQYFEYVKIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMKSPVYSIGP 242
            L   E  Y ++V+ G+  P +DG+LVNTW++++ ++L S  +  LLGR+ + PVY IGP
Sbjct: 184 YLVPDEPVYRDFVRHGLAYPKADGILVNTWEEMEPKSLKSLLNPKLLGRVARVPVYPIGP 243

Query: 243 IVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 302
           + R     +     + +WL++QP ESV+Y+SFGSGG LS++Q+TE+A GLE S QRFVWV
Sbjct: 244 LCRPIQSSE-TDHPVLDWLNEQPNESVLYISFGSGGCLSAKQLTELAWGLEQSQQRFVWV 303

Query: 303 VRAPKVRSDATFFTT--GDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAV 362
           VR P   S  + + +  G G+ED +  ++LP+GF+ RTS+ GFVV  WA Q  +L   AV
Sbjct: 304 VRPPVDGSCCSEYVSANGGGTEDNT-PEYLPEGFVSRTSDRGFVVPSWAPQAEILSHRAV 363

Query: 363 GGFFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGRE 422
           GGF TH GW+S LE +  GVPM+ WPL+AEQ MNA +L++E+ +AVR  +   K  I R 
Sbjct: 364 GGFLTHCGWSSTLESVVGGVPMIAWPLFAEQNMNAALLSDELGIAVRLDD--PKEDISRW 423

Query: 423 EIAAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTA--EGGSSFENFARVVK 469
           +I A+VRK+M E  +EG+A+R K K+L+ SAE S +   GG + E+  RV K
Sbjct: 424 KIEALVRKVMTE--KEGEAMRRKVKKLRDSAEMSLSIDGGGLAHESLCRVTK 465

BLAST of Cp4.1LG14g01440 vs. TAIR 10
Match: AT2G18560.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 365.2 bits (936), Expect = 8.2e-101
Identity = 186/372 (50.00%), Postives = 256/372 (68.82%), Query Frame = 0

Query: 88  MRRHVPALRSAVSALTTLPSVLIADIFATESFAVADEFHMAKYVFVASNAWFLALTICLP 147
           MR     +R AV ++   P+V+I D F T   ++ D    +KYV++ S+AWFLAL + LP
Sbjct: 1   MREMKSTVRDAVKSMKQKPTVMIVDFFGTALLSITDVGVTSKYVYIPSHAWFLALIVYLP 60

Query: 148 VLDKQITGQYVDQNEPLHIPGCEPVRPCDVIDPLLDRTESQYFEYVKIGMEIPSSDGVLV 207
           VLDK + G+YVD  EP+ IPGC+PV P +++D +LDR++ QY + V+IG+EIP SDGVLV
Sbjct: 61  VLDKVMEGEYVDIKEPMKIPGCKPVGPKELLDTMLDRSDQQYRDCVQIGLEIPMSDGVLV 120

Query: 208 NTWDDLQGRTLASFRDRNLLGRIMKSPVYSIGPIVRQTGGKKGGSSELFNWLSKQPGESV 267
           NTW +LQG+TLA+ R+   L R++K PVY IGPIVR T       +  F WL KQ   SV
Sbjct: 121 NTWGELQGKTLAALREDIDLNRVIKVPVYPIGPIVR-TNVLIEKPNSTFEWLDKQEERSV 180

Query: 268 IYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWVVRAPKVRSDATFFTTGDGSEDQSEAKF 327
           +YV  GSGGTLS EQ  E+A GLE+S Q F+WV+R P        +      +D   +  
Sbjct: 181 VYVCLGSGGTLSFEQTMELAWGLELSCQSFLWVLRKP------PSYLGASSKDDDQVSDG 240

Query: 328 LPDGFLERTSEVGFVVSMWADQTAVLGSPAVGGFFTHGGWNSALEGITNGVPMVVWPLYA 387
           LP+GFL+RT  VG VV+ WA Q  +L   ++GGF +H GW+S LE +T GVP++ WPLYA
Sbjct: 241 LPEGFLDRTRGVGLVVTQWAPQVEILSHRSIGGFLSHCGWSSVLESLTKGVPIIAWPLYA 300

Query: 388 EQRMNATMLAEEVRVAVRPKELPTKAVIGREEIAAMVRKIMAEEDEEGKAIRAKAKELQR 447
           EQ MNAT+L EE+ +A+R  ELP+K VI REE+A++V+KI+AEED+EG+ I+ KA+E++ 
Sbjct: 301 EQWMNATLLTEEIGMAIRTSELPSKKVISREEVASLVKKIVAEEDKEGRKIKTKAEEVRV 360

Query: 448 SAEKSTAEGGSS 460
           S+E++   GGSS
Sbjct: 361 SSERAWTHGGSS 365

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q402872.0e-13652.52Anthocyanidin 3-O-glucosyltransferase 5 OS=Manihot esculenta OX=3983 GN=GT5 PE=2... [more]
Q9ZU722.2e-12250.32UDP-glycosyltransferase 72D1 OS=Arabidopsis thaliana OX=3702 GN=UGT72D1 PE=2 SV=... [more]
Q94A843.2e-11044.99UDP-glycosyltransferase 72E1 OS=Arabidopsis thaliana OX=3702 GN=UGT72E1 PE=1 SV=... [more]
O814982.4e-10543.74UDP-glycosyltransferase 72E3 OS=Arabidopsis thaliana OX=3702 GN=UGT72E3 PE=1 SV=... [more]
Q9LVR13.5e-10444.49UDP-glycosyltransferase 72E2 OS=Arabidopsis thaliana OX=3702 GN=UGT72E2 PE=1 SV=... [more]
Match NameE-valueIdentityDescription
XP_023551829.10.0100.00anthocyanidin 3-O-glucosyltransferase 5-like [Cucurbita pepo subsp. pepo][more]
XP_022929544.10.097.47anthocyanidin 3-O-glucosyltransferase 5-like [Cucurbita moschata][more]
KAG7015349.10.097.47Anthocyanidin 3-O-glucosyltransferase 5 [Cucurbita argyrosperma subsp. argyrospe... [more]
XP_022985065.10.095.58anthocyanidin 3-O-glucosyltransferase 5-like [Cucurbita maxima] >XP_022985066.1 ... [more]
XP_038880693.14.09e-29889.53anthocyanidin 3-O-glucosyltransferase 5-like [Benincasa hispida][more]
Match NameE-valueIdentityDescription
A0A6J1EP220.097.47Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111436081 PE=3 SV=1[more]
A0A6J1J7260.095.58Glycosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111483150 PE=3 SV=1[more]
A0A6J1FPT81.00e-28183.55Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111447126 PE=3 SV=1[more]
A0A6J1FTD71.08e-28183.55Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111447126 PE=3 SV=1[more]
A0A6J1IW985.28e-27882.48Glycosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111479797 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT2G18570.11.5e-12350.32UDP-Glycosyltransferase superfamily protein [more]
AT3G50740.12.3e-11144.99UDP-glucosyl transferase 72E1 [more]
AT5G26310.11.7e-10643.74UDP-Glycosyltransferase superfamily protein [more]
AT5G66690.12.5e-10544.49UDP-Glycosyltransferase superfamily protein [more]
AT2G18560.18.2e-10150.00UDP-Glycosyltransferase superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 428..448
NoneNo IPR availableGENE3D3.40.50.2000Glycogen Phosphorylase B;coord: 6..459
e-value: 2.8E-135
score: 453.7
NoneNo IPR availableGENE3D3.40.50.2000Glycogen Phosphorylase B;coord: 253..454
e-value: 2.8E-135
score: 453.7
NoneNo IPR availablePANTHERPTHR48049GLYCOSYLTRANSFERASEcoord: 2..470
NoneNo IPR availablePANTHERPTHR48049:SF63UDP-GLYCOSYLTRANSFERASE 72C1coord: 2..470
NoneNo IPR availableSUPERFAMILY53756UDP-Glycosyltransferase/glycogen phosphorylasecoord: 5..470
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePFAMPF00201UDPGTcoord: 263..398
e-value: 8.6E-22
score: 77.6
IPR002213UDP-glucuronosyl/UDP-glucosyltransferaseCDDcd03784GT1_Gtf-likecoord: 5..454
e-value: 9.00279E-67
score: 218.189
IPR035595UDP-glycosyltransferase family, conserved sitePROSITEPS00375UDPGTcoord: 346..389

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG14g01440.1Cp4.1LG14g01440.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0008194 UDP-glycosyltransferase activity