CmaCh14G020520 (gene) Cucurbita maxima (Rimu)

Name: CmaCh14G020520
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: UDP-Glycosyltransferase superfamily protein, putative
Location: Cma_Chr14 : 14283580 .. 14284980 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGGAAAAGAAAGCTGATCATGTCGTTCTTCTTCCATGGTCGGCCTTCGGCCATTTAATGCCCCATTTTCAACTCTCCTTAGCCTTAGCCAAAGCTGGCGTCCATGTCTCCTTCATCTCCACCCCCAAGAATCTCAACAGACTCCCCGGAATTCCTCCATCTCTGTCGCCATTCATAACTCTGGTGCCCATTCCACTCCCCAAACTCCCCGGCGACCCCTTGCCGGAAGGTGCAGAGGCCACTGTCGACATTCCGTTCGACAAAATTCCGTTTCTGAAACTATCTCTAGATCTCGCTGAGCCGTCGGTTCGAAAATTCGTCGCCGATCATCCTAATCCGCCGGATTGGATGATCGTTGATTTCAATGCTACTTGGATCTGTGACATTTCTCGAGAATTTCGAATTCCGATTGTTTTCTTTCGCGTTCTCTCGCCTGGATTTCTTGCTTTCTTCGCTCATGTTCTTGGGAGTGGTCTGCCTCTGTCGGAAATCGGAAGCCTGATGTCGCCGCCGATAATCAGCGGATCCACGGTGGCGTTCAGGCGGTATGAAGCTGCCAAAATTCATGCTGATTTGTTTGAGAAGAACGATTCTGGTATGAGCGATCGCGAAAGGGTAACGAAGATTATTTCCGGTAGTCGAGCAATTGCAGTTCGTAGTTGTTACGAATTTGATGTTGATTATTTGAAGTTATACTCGATTTTTTGTGGAAAGAGAGTGATTCCTCTAGGGTTTCTTCCTCCAGAAAAGCCCCAAAAATCAGAGTTCGAGGCCGATTCGCCATGGAAATCGACCTTCGAGTGGCTCGATCATCAAAGCCCCCAATCCGTGGTGTTCGTCGGATTCGGAAGCGAGTGCAAGCTCACAAAGGATCAAATACACAAAATAGCCCGCGGCTTGGAGCTGTCGGAGCTGCCATTTCTATGGTCTCTGAGGAAACCGGACTGGGCGGGGGACTCCGACGCGCTGCCGGCCGGTTTCCAGGATCGGACGGCGGAGAGAGGGATTGTGAGAATGGGGTGGGCCCCACAGATGGAGATTTTAGGGCATCCGGCGATCGGAGGGTGCTTCTTTCACGGAGGTTGGGGATCCGCCATTGAAGCTCTGCAATTCGGGCATCGTCTAGTTCTGTTGCCGTTCATCGTGGATCAGCCGCTGAATGCGAGGCTTTTGGTGGAGAAGGGAGTGGCAGTTGAAGTTGAAAGAAAGGAAGAGGATGGATCTTTCAGTGGAGAAGACATAGCCAAAGCTTTGAAAGAAGCTATGGCTTCCGAAGAAGGGGAGAAGATTAGAAGGCGAGCTACTGAAATGTCCGCCATTTTTGGGGACACGAAGCTTCATCAGCGATACATAGAGGAATTTGTAGAATTCCTGAAAAATGGGGATTCAAATCAGTAG

mRNA sequence

ATGGCGGAAAAGAAAGCTGATCATGTCGTTCTTCTTCCATGGTCGGCCTTCGGCCATTTAATGCCCCATTTTCAACTCTCCTTAGCCTTAGCCAAAGCTGGCGTCCATGTCTCCTTCATCTCCACCCCCAAGAATCTCAACAGACTCCCCGGAATTCCTCCATCTCTGTCGCCATTCATAACTCTGGTGCCCATTCCACTCCCCAAACTCCCCGGCGACCCCTTGCCGGAAGGTGCAGAGGCCACTGTCGACATTCCGTTCGACAAAATTCCGTTTCTGAAACTATCTCTAGATCTCGCTGAGCCGTCGGTTCGAAAATTCGTCGCCGATCATCCTAATCCGCCGGATTGGATGATCGTTGATTTCAATGCTACTTGGATCTGTGACATTTCTCGAGAATTTCGAATTCCGATTGTTTTCTTTCGCGTTCTCTCGCCTGGATTTCTTGCTTTCTTCGCTCATGTTCTTGGGAGTGGTCTGCCTCTGTCGGAAATCGGAAGCCTGATGTCGCCGCCGATAATCAGCGGATCCACGGTGGCGTTCAGGCGGTATGAAGCTGCCAAAATTCATGCTGATTTGTTTGAGAAGAACGATTCTGGTATGAGCGATCGCGAAAGGGTAACGAAGATTATTTCCGGTAGTCGAGCAATTGCAGTTCGTAGTTGTTACGAATTTGATGTTGATTATTTGAAGTTATACTCGATTTTTTGTGGAAAGAGAGTGATTCCTCTAGGGTTTCTTCCTCCAGAAAAGCCCCAAAAATCAGAGTTCGAGGCCGATTCGCCATGGAAATCGACCTTCGAGTGGCTCGATCATCAAAGCCCCCAATCCGTGGTGTTCGTCGGATTCGGAAGCGAGTGCAAGCTCACAAAGGATCAAATACACAAAATAGCCCGCGGCTTGGAGCTGTCGGAGCTGCCATTTCTATGGTCTCTGAGGAAACCGGACTGGGCGGGGGACTCCGACGCGCTGCCGGCCGGTTTCCAGGATCGGACGGCGGAGAGAGGGATTGTGAGAATGGGGTGGGCCCCACAGATGGAGATTTTAGGGCATCCGGCGATCGGAGGGTGCTTCTTTCACGGAGGTTGGGGATCCGCCATTGAAGCTCTGCAATTCGGGCATCGTCTAGTTCTGTTGCCGTTCATCGTGGATCAGCCGCTGAATGCGAGGCTTTTGGTGGAGAAGGGAGTGGCAGTTGAAGTTGAAAGAAAGGAAGAGGATGGATCTTTCAGTGGAGAAGACATAGCCAAAGCTTTGAAAGAAGCTATGGCTTCCGAAGAAGGGGAGAAGATTAGAAGGCGAGCTACTGAAATGTCCGCCATTTTTGGGGACACGAAGCTTCATCAGCGATACATAGAGGAATTTGTAGAATTCCTGAAAAATGGGGATTCAAATCAGTAG

Coding sequence (CDS)

ATGGCGGAAAAGAAAGCTGATCATGTCGTTCTTCTTCCATGGTCGGCCTTCGGCCATTTAATGCCCCATTTTCAACTCTCCTTAGCCTTAGCCAAAGCTGGCGTCCATGTCTCCTTCATCTCCACCCCCAAGAATCTCAACAGACTCCCCGGAATTCCTCCATCTCTGTCGCCATTCATAACTCTGGTGCCCATTCCACTCCCCAAACTCCCCGGCGACCCCTTGCCGGAAGGTGCAGAGGCCACTGTCGACATTCCGTTCGACAAAATTCCGTTTCTGAAACTATCTCTAGATCTCGCTGAGCCGTCGGTTCGAAAATTCGTCGCCGATCATCCTAATCCGCCGGATTGGATGATCGTTGATTTCAATGCTACTTGGATCTGTGACATTTCTCGAGAATTTCGAATTCCGATTGTTTTCTTTCGCGTTCTCTCGCCTGGATTTCTTGCTTTCTTCGCTCATGTTCTTGGGAGTGGTCTGCCTCTGTCGGAAATCGGAAGCCTGATGTCGCCGCCGATAATCAGCGGATCCACGGTGGCGTTCAGGCGGTATGAAGCTGCCAAAATTCATGCTGATTTGTTTGAGAAGAACGATTCTGGTATGAGCGATCGCGAAAGGGTAACGAAGATTATTTCCGGTAGTCGAGCAATTGCAGTTCGTAGTTGTTACGAATTTGATGTTGATTATTTGAAGTTATACTCGATTTTTTGTGGAAAGAGAGTGATTCCTCTAGGGTTTCTTCCTCCAGAAAAGCCCCAAAAATCAGAGTTCGAGGCCGATTCGCCATGGAAATCGACCTTCGAGTGGCTCGATCATCAAAGCCCCCAATCCGTGGTGTTCGTCGGATTCGGAAGCGAGTGCAAGCTCACAAAGGATCAAATACACAAAATAGCCCGCGGCTTGGAGCTGTCGGAGCTGCCATTTCTATGGTCTCTGAGGAAACCGGACTGGGCGGGGGACTCCGACGCGCTGCCGGCCGGTTTCCAGGATCGGACGGCGGAGAGAGGGATTGTGAGAATGGGGTGGGCCCCACAGATGGAGATTTTAGGGCATCCGGCGATCGGAGGGTGCTTCTTTCACGGAGGTTGGGGATCCGCCATTGAAGCTCTGCAATTCGGGCATCGTCTAGTTCTGTTGCCGTTCATCGTGGATCAGCCGCTGAATGCGAGGCTTTTGGTGGAGAAGGGAGTGGCAGTTGAAGTTGAAAGAAAGGAAGAGGATGGATCTTTCAGTGGAGAAGACATAGCCAAAGCTTTGAAAGAAGCTATGGCTTCCGAAGAAGGGGAGAAGATTAGAAGGCGAGCTACTGAAATGTCCGCCATTTTTGGGGACACGAAGCTTCATCAGCGATACATAGAGGAATTTGTAGAATTCCTGAAAAATGGGGATTCAAATCAGTAG

Protein sequence

MAEKKADHVVLLPWSAFGHLMPHFQLSLALAKAGVHVSFISTPKNLNRLPGIPPSLSPFITLVPIPLPKLPGDPLPEGAEATVDIPFDKIPFLKLSLDLAEPSVRKFVADHPNPPDWMIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGSGLPLSEIGSLMSPPIISGSTVAFRRYEAAKIHADLFEKNDSGMSDRERVTKIISGSRAIAVRSCYEFDVDYLKLYSIFCGKRVIPLGFLPPEKPQKSEFEADSPWKSTFEWLDHQSPQSVVFVGFGSECKLTKDQIHKIARGLELSELPFLWSLRKPDWAGDSDALPAGFQDRTAERGIVRMGWAPQMEILGHPAIGGCFFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVEKGVAVEVERKEEDGSFSGEDIAKALKEAMASEEGEKIRRRATEMSAIFGDTKLHQRYIEEFVEFLKNGDSNQ
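As a quick consistency check between the coding sequence and the protein sequence, the first and last codons of the CDS can be translated by hand. The sketch below is illustrative only: the codon table is deliberately partial (just the codons in the two excerpts, taken from the standard genetic code), and the names CODONS and translate are hypothetical.

```python
# Partial codon table: only the codons that occur in the two excerpts below.
CODONS = {
    "ATG": "M", "GCG": "A", "GAA": "E", "AAG": "K", "AAA": "K", "GCT": "A",
    "AAT": "N", "GGG": "G", "GAT": "D", "TCA": "S", "CAG": "Q", "TAG": "*",
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon ('*' marks a stop)."""
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, len(dna), 3))


cds_start = "ATGGCGGAAAAGAAAGCT"     # first 18 bases of the CDS above
cds_end = "AATGGGGATTCAAATCAGTAG"    # last 21 bases of the CDS above

print(translate(cds_start))  # should match the start of the protein
print(translate(cds_end))    # should match the end of the protein plus a stop
```

The two excerpts translate to MAEKKA and NGDSNQ followed by a stop, which agrees with the beginning and end of the protein sequence listed above.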
BLAST of CmaCh14G020520 vs. Swiss-Prot
Match: URT1_FRAAN (Putative UDP-rhamnose:rhamnosyltransferase 1 OS=Fragaria ananassa GN=GT4 PE=2 SV=1)

HSP 1 Score: 335.9 bits (860), Expect = 7.1e-91
Identity = 189/469 (40.30%), Positives = 271/469 (57.78%), Query Frame = 1

Query: 3   EKKADHVVLLPWSAFGHLMPHFQLSLALAKAGVHVSFISTPKNLNRLPGIPPSLSPFITL 62
           ++K  H+ L PW AFGH++P  +++  +A+ G  VSFISTP+N+ RLP IP +L+P I L
Sbjct: 8   KRKKLHIALFPWLAFGHIIPFLEVAKHIARKGHKVSFISTPRNIQRLPKIPETLTPLINL 67

Query: 63  VPIPLPKLPGDPLPEGAEATVDIPFDKIPFLKLSLDLAEPSVRKFVADHPNPPDWMIVDF 122
           V IPLP +  + LPE AEAT+D+P D IP+LK++ D  E  + +F+      PDW+I DF
Sbjct: 68  VQIPLPHV--ENLPENAEATMDVPHDVIPYLKIAHDGLEQGISEFL--QAQSPDWIIHDF 127

Query: 123 NATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGSGL----PLSEIGSLMSPP--IISG 182
              W+  I+ +  I    F + +   + FF     + +    P  ++    SPP  I   
Sbjct: 128 APHWLPPIATKLGISNAHFSIFNASSMCFFGSTSPNRVSRYAPRKKLEQFTSPPEWIPFP 187

Query: 183 STVAFRRYEAAKIHADLFEKNDSGMSDRERVTKIISGSRAIAVRSCYEFDVDYLKLYSIF 242
           S +  R +EA ++       N SG++DR R+   I G +   +RSC E + ++L L    
Sbjct: 188 SKIYHRPFEAKRLMDGTLTPNASGVTDRFRLESTIQGCQVYFIRSCREIEGEWLDLLEDL 247

Query: 243 CGKRVI-PLGFLPPEKPQKSEFEA-DSPWKSTFEWLDHQSPQSVVFVGFGSECKLTKDQI 302
             K ++ P G LPP  P+  E    DS W     WLD Q    VV+  FGSE  L+++  
Sbjct: 248 HEKPIVLPTGLLPPSLPRSDEDGGKDSNWSKIAVWLDKQEKGKVVYAAFGSELNLSQEVF 307

Query: 303 HKIARGLELSELPFLWSLRKPDWA---GDSDALPAGFQDRTAERGIVRMGWAPQMEILGH 362
           +++A GLELS LPF W LRKP      GDS  LP GF+DR   RG+V   WAPQ++IL H
Sbjct: 308 NELALGLELSGLPFFWVLRKPSHGSGDGDSVKLPDGFEDRVKGRGLVWTTWAPQLKILSH 367

Query: 363 PAIGGCFFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVEKGVAVEVERKEEDGSFS 422
            ++GG   H GW S IE+LQ+G  L++LPF+ DQ L AR    K +  EV R EE G F+
Sbjct: 368 ESVGGFLTHCGWSSIIESLQYGCPLIMLPFMYDQGLIARFWDNK-IGAEVPRDEETGWFT 427

Query: 423 GEDIAKALKEAMASEEGEKIRRRATEMSAIFGDTKLHQRYIEEFVEFLK 461
             ++A +LK  +  EEG++ R  A E S +F D +LH RY++E VE+L+
Sbjct: 428 RNELANSLKLIVVDEEGKQYRDGANEYSKLFRDKELHDRYMDECVEYLE 471
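The percentages in an HSP summary are the match counts divided by the alignment length. A minimal sketch that reproduces the figures for the URT1_FRAAN hit above; the helper name percent is hypothetical and the two-decimal rounding is assumed from the way the values are printed.

```python
def percent(matches: int, aligned: int) -> str:
    """Share of aligned columns, as a percentage with two decimals."""
    return f"{100 * matches / aligned:.2f}"


# Counts taken from the HSP summary of the URT1_FRAAN hit.
identity = percent(189, 469)   # identical residues / alignment length
positives = percent(271, 469)  # identical or conservatively substituted residues
print(identity, positives)
```

The same calculation applies to every HSP summary in this report.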

BLAST of CmaCh14G020520 vs. Swiss-Prot
Match: SGT3_SOYBN (Soyasaponin III rhamnosyltransferase OS=Glycine max GN=GmSGT3 PE=1 SV=1)

HSP 1 Score: 327.8 bits (839), Expect = 1.9e-88
Identity = 181/457 (39.61%), Positives = 273/457 (59.74%), Query Frame = 1

Query: 8   HVVLLPWSAFGHLMPHFQLSLALAKAGVHVSFISTPKNLNRLPGIPPSLSPFITLVPIPL 67
           HV +LPW A GH+ P+F+++  LA+ G  V+FI++PKN++R+P  P  L PFI LV +PL
Sbjct: 16  HVAMLPWLAMGHIYPYFEVAKILAQKGHFVTFINSPKNIDRMPKTPKHLEPFIKLVKLPL 75

Query: 68  PKLPGDPLPEGAEATVDIPFDKIPFLKLSLDLAEPSVRKFVADHPNPPDWMIVDFNATWI 127
           PK+  + LPEGAE+T+DIP  K  FLK + +  + +V K +    + PDW++ DF A W+
Sbjct: 76  PKI--EHLPEGAESTMDIPSKKNCFLKKAYEGLQYAVSKLLKT--SNPDWVLYDFAAAWV 135

Query: 128 CDISREFRIPIVFFRVLSPGFLAFFAHVLGSGLPLSEIGSLMSPP--IISGSTVAFRRYE 187
             I++ + IP   + + +P F   F       +    + S+  PP  +   +T+  R YE
Sbjct: 136 IPIAKSYNIPCAHYNI-TPAFNKVFFDPPKDKMKDYSLASICGPPTWLPFTTTIHIRPYE 195

Query: 188 AAKIHADLFEKNDSGMSDRERVTKIISGSRAIAVRSCYEFDVDYLKLYSIFCGKRVIPLG 247
             + +    ++ ++G      + K  S      +R+  E + D+L   +      V+P+G
Sbjct: 196 FLRAYEGTKDE-ETGERASFDLNKAYSSCDLFLLRTSRELEGDWLDYLAGNYKVPVVPVG 255

Query: 248 FLPPEKPQKSEFEADS--PWKSTFEWLDHQSPQSVVFVGFGSECKLTKDQIHKIARGLEL 307
            LPP    +   E D+   W    +WLD Q   SVV++GFGSE KL+++ + ++A G+EL
Sbjct: 256 LLPPSMQIRDVEEEDNNPDWVRIKDWLDTQESSSVVYIGFGSELKLSQEDLTELAHGIEL 315

Query: 308 SELPFLWSLRKPDWAGDSDALPAGFQDRTAERGIVRMGWAPQMEILGHPAIGGCFFHGGW 367
           S LPF W+L+  +       LP GF++RT ERGIV   WAPQ++IL H AIGGC  H G 
Sbjct: 316 SNLPFFWALK--NLKEGVLELPEGFEERTKERGIVWKTWAPQLKILAHGAIGGCMSHCGS 375

Query: 368 GSAIEALQFGHRLVLLPFIVDQPLNARLLVEKGVAVEVERKEEDGSFSGEDIAKALKEAM 427
           GS IE + FGH LV LP+++DQ L +R+L EK VAVEV R E+DGSF+  D+AK L+ A+
Sbjct: 376 GSVIEKVHFGHVLVTLPYLLDQCLFSRVLEEKQVAVEVPRSEKDGSFTRVDVAKTLRFAI 435

Query: 428 ASEEGEKIRRRATEMSAIFGDTKLHQRYIEEFVEFLK 461
             EEG  +R  A EM  +F   +LH +YI++F++ L+
Sbjct: 436 VDEEGSALRENAKEMGKVFSSEELHNKYIQDFIDALQ 464

BLAST of CmaCh14G020520 vs. Swiss-Prot
Match: U91A1_ARATH (UDP-glycosyltransferase 91A1 OS=Arabidopsis thaliana GN=UGT91A1 PE=2 SV=1)

HSP 1 Score: 322.8 bits (826), Expect = 6.2e-87
Identity = 180/458 (39.30%), Positives = 264/458 (57.64%), Query Frame = 1

Query: 8   HVVLLPWSAFGHLMPHFQLSLALAKAGVHVSFISTPKNLNRL-PGIPPSLSPFITLVPIP 67
           HVV+ PW AFGH++P+ +LS  +A+ G  VSFISTP+N++RL P +P +LS  I  V + 
Sbjct: 15  HVVMFPWLAFGHMVPYLELSKLIAQKGHKVSFISTPRNIDRLLPRLPENLSSVINFVKLS 74

Query: 68  LPKLPGDPLPEGAEATVDIPFDKIPFLKLSLDLAEPSVRKFVADHPNPPDWMIVDFNATW 127
           LP +  + LPE  EAT D+PF+ IP+LK++ D  +  V +F+    + PDW++ DF   W
Sbjct: 75  LP-VGDNKLPEDGEATTDVPFELIPYLKIAYDGLKVPVTEFLES--SKPDWVLQDFAGFW 134

Query: 128 ICDISREFRIPIVFFRVLSPGFLAFFAHVLGSGLPLSEIGSLMSPP--IISGSTVAFRRY 187
           +  ISR   I   FF   +   L       G     +     M PP  +   ++VAF+ +
Sbjct: 135 LPPISRRLGIKTGFFSAFNGATLGILKPP-GFEEYRTSPADFMKPPKWVPFETSVAFKLF 194

Query: 188 EAAKIHADLF-EKNDSGMSDRERVTKIISGSRAIAVRSCYEFDVDYLKLYSIFCGKRVIP 247
           E   I      E  +  + D  RV  +I G   I VRSCYE++ ++L L      K VIP
Sbjct: 195 ECRFIFKGFMAETTEGNVPDIHRVGGVIDGCDVIFVRSCYEYEAEWLGLTQELHRKPVIP 254

Query: 248 LGFLPPEKPQKSEFEADSPWKSTFEWLDHQSPQSVVFVGFGSECKLTKDQIHKIARGLEL 307
           +G LPP+  +K  FE    W S  +WLD +  +S+V+V FGSE K ++ ++++IA GLEL
Sbjct: 255 VGVLPPKPDEK--FEDTDTWLSVKKWLDSRKSKSIVYVAFGSEAKPSQTELNEIALGLEL 314

Query: 308 SELPFLWSL--RKPDWAGDSDALPAGFQDRTAERGIVRMGWAPQMEILGHPAIGGCFFHG 367
           S LPF W L  R+  W  +   LP GF++RTA+RG+V  GW  Q+  L H +IG    H 
Sbjct: 315 SGLPFFWVLKTRRGPWDTEPVELPEGFEERTADRGMVWRGWVEQLRTLSHDSIGLVLTHP 374

Query: 368 GWGSAIEALQFGHRLVLLPFIVDQPLNARLLVEKGVAVEVERKEEDGSFSGEDIAKALKE 427
           GWG+ IEA++F   + +L F+ DQ LNAR++ EK +   + R E +G F+ E +A +L+ 
Sbjct: 375 GWGTIIEAIRFAKPMAMLVFVYDQGLNARVIEEKKIGYMIPRDETEGFFTKESVANSLRL 434

Query: 428 AMASEEGEKIRRRATEMSAIFGDTKLHQRYIEEFVEFL 460
            M  EEG+  R    EM  +FGD     RY++ F+E+L
Sbjct: 435 VMVEEEGKVYRENVKEMKGVFGDMDRQDRYVDSFLEYL 466

BLAST of CmaCh14G020520 vs. Swiss-Prot
Match: U91D1_STERE (UDP-glycosyltransferase 91D1 OS=Stevia rebaudiana GN=UGT91D1 PE=2 SV=1)

HSP 1 Score: 315.8 bits (808), Expect = 7.6e-85
Identity = 176/466 (37.77%), Positives = 269/466 (57.73%), Query Frame = 1

Query: 3   EKKADHVVLLPWSAFGHLMPHFQLSLALAKAGVHVSFISTPKNLNRLPGIPPSLSPFITL 62
           ++K  HV   PW AFGH++P  QLS  +A+ G  VSF+ST +N+ RL      +SP I +
Sbjct: 22  DRKQLHVATFPWLAFGHILPFLQLSKLIAEKGHKVSFLSTTRNIQRLSS---HISPLINV 81

Query: 63  VPIPLPKLPGDPLPEGAEATVDIPFDKIPFLKLSLDLAEPSVRKFVADHPNPPDWMIVDF 122
           V + LP++    LPE AEAT D+  + I +LK ++D  +P V +F+  H   PDW+I DF
Sbjct: 82  VQLTLPRV--QELPEDAEATTDVHPEDIQYLKKAVDGLQPEVTRFLEQHS--PDWIIYDF 141

Query: 123 NATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGSGLPLSE----IGSLMSPP--IISG 182
              W+  I+    I   +F V++P  +A+ A    + +  S+    +  L +PP      
Sbjct: 142 THYWLPSIAASLGISRAYFCVITPWTIAYLAPSSDAMINDSDGRTTVEDLTTPPKWFPFP 201

Query: 183 STVAFRRYEAAKIHADLFEKNDSGMSDRERVTKIISGSRAIAVRSCYEFDVDYLKLYSIF 242
           + V +R+++ A++    +E    G+SD  R+  +  GS  +  +  +EF   +L L    
Sbjct: 202 TKVCWRKHDLARMEP--YEA--PGISDGYRMGMVFKGSDCLLFKCYHEFGTQWLPLLETL 261

Query: 243 CGKRVIPLGFLPPEKPQKSEFEADSPWKSTFEWLDHQSPQSVVFVGFGSECKLTKDQIHK 302
               V+P+G LPPE P     E D  W S  +WLD +   SVV+V  GSE  +++ ++ +
Sbjct: 262 HQVPVVPVGLLPPEIPGD---EKDETWVSIKKWLDGKQKGSVVYVALGSEALVSQTEVVE 321

Query: 303 IARGLELSELPFLWSLRKPDWAGDSDA--LPAGFQDRTAERGIVRMGWAPQMEILGHPAI 362
           +A GLELS LPF+W+ RKP     SD+  LP GF +RT +RG+V   WAPQ+ IL H ++
Sbjct: 322 LALGLELSGLPFVWAYRKPKGPAKSDSVELPDGFVERTRDRGLVWTSWAPQLRILSHESV 381

Query: 363 GGCFFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVEKGVAVEVERKEEDGSFSGED 422
            G   H G GS +E L FGH L++LP   DQPLNARLL +K V +E+ R EEDG  + E 
Sbjct: 382 CGFLTHCGSGSIVEGLMFGHPLIMLPIFCDQPLNARLLEDKQVGIEIPRNEEDGCLTKES 441

Query: 423 IAKALKEAMASEEGEKIRRRATEMSAIFGDTKLHQRYIEEFVEFLK 461
           +A++L+  +   EGE  +  A  +S I+ DTK+ + Y+ +FV++L+
Sbjct: 442 VARSLRSVVVENEGEIYKANARALSKIYNDTKVEKEYVSQFVDYLE 473

BLAST of CmaCh14G020520 vs. Swiss-Prot
Match: U91C1_ARATH (UDP-glycosyltransferase 91C1 OS=Arabidopsis thaliana GN=UGT91C1 PE=2 SV=1)

HSP 1 Score: 308.9 bits (790), Expect = 9.3e-83
Identity = 173/473 (36.58%), Positives = 268/473 (56.66%), Query Frame = 1

Query: 1   MAEKKAD--HVVLLPWSAFGHLMPHFQLSLALAKAGVHVSFISTPKNLNRLPGIPPSLSP 60
           M +K+ +  HV + PW A GHL+P  +LS  LA+ G  +SFISTP+N+ RLP +  +L+ 
Sbjct: 1   MVDKREEVMHVAMFPWLAMGHLLPFLRLSKLLAQKGHKISFISTPRNIERLPKLQSNLAS 60

Query: 61  FITLVPIPLPKLPGDPLPEGAEATVDIPFDKIPFLKLSLDLAEPSVRKFVADHPNPPDWM 120
            IT V  PLP + G  LP  +E+++D+P++K   LK + DL +P +++F+    + PDW+
Sbjct: 61  SITFVSFPLPPISG--LPPSSESSMDVPYNKQQSLKAAFDLLQPPLKEFL--RRSSPDWI 120

Query: 121 IVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGSGLPLSEIGS------LMSPP 180
           I D+ + W+  I+ E  I   FF + +   L F      S   + EI S      ++ P 
Sbjct: 121 IYDYASHWLPSIAAELGISKAFFSLFNAATLCFMGP---SSSLIEEIRSTPEDFTVVPPW 180

Query: 181 IISGSTVAFRRYEAAKIHADLFEKNDSGMSDRERVTKIISGSRAIAVRSCYEFDVDYLKL 240
           +   S + FR +E  + + +  E++ +G+SD  R    I  S A+ VRSC EF+ ++  L
Sbjct: 181 VPFKSNIVFRYHEVTR-YVEKTEEDVTGVSDSVRFGYSIDESDAVFVRSCPEFEPEWFGL 240

Query: 241 YSIFCGKRVIPLGFLPPEKPQKSEFEADSPWKSTFEWLDHQSPQSVVFVGFGSECKLTKD 300
                 K V P+GFLPP    + +   D+ W    +WLD Q   SVV+V  G+E  L  +
Sbjct: 241 LKDLYRKPVFPIGFLPPVI--EDDDAVDTTWVRIKKWLDKQRLNSVVYVSLGTEASLRHE 300

Query: 301 QIHKIARGLELSELPFLWSLRKPDWAGDSDALPAGFQDRTAERGIVRMGWAPQMEILGHP 360
           ++ ++A GLE SE PF W LR      +   +P GF+ R   RG+V +GW PQ++IL H 
Sbjct: 301 EVTELALGLEKSETPFFWVLR------NEPKIPDGFKTRVKGRGMVHVGWVPQVKILSHE 360

Query: 361 AIGGCFFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVEKGVAVEVERKEEDGSFSG 420
           ++GG   H GW S +E L FG   +  P + +Q LN RLL  KG+ VEV R E DGSF  
Sbjct: 361 SVGGFLTHCGWNSVVEGLGFGKVPIFFPVLNEQGLNTRLLHGKGLGVEVSRDERDGSFDS 420

Query: 421 EDIAKALKEAMASEEGEKIRRRATEMSAIFGDTKLHQRYIEEFVEFLKNGDSN 466
           + +A +++  M  + GE+IR +A  M  +FG+   + RY++E V F+++  S+
Sbjct: 421 DSVADSIRLVMIDDAGEEIRAKAKVMKDLFGNMDENIRYVDELVRFMRSKGSS 457

BLAST of CmaCh14G020520 vs. TrEMBL
Match: A0A0A0L7F1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G121730 PE=4 SV=1)

HSP 1 Score: 788.9 bits (2036), Expect = 3.4e-225
Identity = 379/461 (82.21%), Positives = 421/461 (91.32%), Query Frame = 1

Query: 1   MAEKKADHVVLLPWSAFGHLMPHFQLSLALAKAGVHVSFISTPKNLNRLPGIPPSLSPFI 60
           MAE K  HVV+ PWSAFGHL+PHFQLS+ALAKAGVHVSFISTPKNL RLP IPPSLS FI
Sbjct: 1   MAENKGLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSSFI 60

Query: 61  TLVPIPLPKLPGDPLPEGAEATVDIPFDKIPFLKLSLDLAEPSVRKFVADHPNPPDWMIV 120
           TLVPIPLPKLPGDPLPEGAEATVDIPFDKIPFLK++LDL EP  RKF+ADH +PPDW IV
Sbjct: 61  TLVPIPLPKLPGDPLPEGAEATVDIPFDKIPFLKVALDLTEPPFRKFIADHAHPPDWFIV 120

Query: 121 DFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGSGLPLSEIGSLMSPPIISGSTVA 180
           DFN +WI DISREFRIPIVFFRVLSPGFLAF+AH+LG+ LP++EIGSL+SPP I GSTVA
Sbjct: 121 DFNVSWIGDISREFRIPIVFFRVLSPGFLAFYAHLLGNRLPMTEIGSLISPPPIEGSTVA 180

Query: 181 FRRYEAAKIHADLFEKNDSGMSDRERVTKIISGSRAIAVRSCYEFDVDYLKLYSIFCGKR 240
           +RR+EA  IHA  FEKNDSG+SD ERVTKI +  R IAVR+CYEFDVDYLKLYS +CGK+
Sbjct: 181 YRRHEAVGIHAGFFEKNDSGLSDYERVTKINTACRVIAVRTCYEFDVDYLKLYSNYCGKK 240

Query: 241 VIPLGFLPPEKPQKSEFEADSPWKSTFEWLDHQSPQSVVFVGFGSECKLTKDQIHKIARG 300
           VIPLGFLPPEKP K+EFEA+SPWKSTFEWLD Q+P+SVVFVGFGSECKLTKDQIH+IARG
Sbjct: 241 VIPLGFLPPEKPPKTEFEANSPWKSTFEWLDQQNPKSVVFVGFGSECKLTKDQIHEIARG 300

Query: 301 LELSELPFLWSLRKPDWAGDSDALPAGFQDRTAERGIVRMGWAPQMEILGHPAIGGCFFH 360
           +ELSELPF+W+LR+PDWA DSD LPAGF+DRTAERGIV MGWAPQM+ILGHPAIGG FFH
Sbjct: 301 VELSELPFMWALRQPDWAEDSDVLPAGFRDRTAERGIVSMGWAPQMQILGHPAIGGSFFH 360

Query: 361 GGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVEKGVAVEVERKEEDGSFSGEDIAKALK 420
           GGWGSAIEAL+FG+ L+LLPFIVDQPLNARLLVEKGVA+EVER E+DG  SGE IAKAL+
Sbjct: 361 GGWGSAIEALEFGNCLILLPFIVDQPLNARLLVEKGVAIEVERNEDDGCSSGEAIAKALR 420

Query: 421 EAMASEEGEKIRRRATEMSAIFGDTKLHQRYIEEFVEFLKN 462
           EAM SEEGEKIR+RA E++AIFGDTKLHQRYIEEFVEFLK+
Sbjct: 421 EAMVSEEGEKIRKRAKEVAAIFGDTKLHQRYIEEFVEFLKH 461

BLAST of CmaCh14G020520 vs. TrEMBL
Match: M5XKS9_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025376mg PE=4 SV=1)

HSP 1 Score: 504.6 bits (1298), Expect = 1.3e-139
Identity = 258/465 (55.48%), Positives = 331/465 (71.18%), Query Frame = 1

Query: 9   VVLLPWSAFGHLMPHFQLSLALAKAGVHVSFISTPKNLNRLPGIPPSLSPFITLVPIPLP 68
           VV+LPWSAFGH MP FQLS+ALAKA VHV +ISTPKN+ RLP I P L PFI LV IP P
Sbjct: 7   VVMLPWSAFGHTMPFFQLSMALAKAEVHVFYISTPKNIQRLPKISPDLQPFIHLVSIPFP 66

Query: 69  KLPGDPLPEGAEATVDIPFDKIPFLKLSLDLAEPSVRKFVADHPNPPDWMIVDFNATWIC 128
            L    LPEGAEATVDIPF+K+   K++ DL +  +++F+ D    PDW+IVDF+A W  
Sbjct: 67  ALASGFLPEGAEATVDIPFEKMDNFKIAYDLLQQPIKQFIGDQL--PDWIIVDFSAHWAV 126

Query: 129 DISREFRIPIVFFRVLSPGFLAFFAHVLGSGLPLSEIGSLMSPPIISG--------STVA 188
           +I +EF +P+V+F         F   +       ++   L SP  ++         ST+A
Sbjct: 127 EIGKEFGVPLVYFSAFCAATCVFLTSLENISKANTDHDVLSSPESLTSPRDFGTFRSTIA 186

Query: 189 FRRYEAAKIHADLFEKNDSGMSDRERVTKIISGSRAIAVRSCYEFDVDYLKLYSIFCGKR 248
           +R++EA  I+A  +E NDSG+SD +R  KI+   +A+AVRSC EF+ +YL+ Y    G+ 
Sbjct: 187 YRKHEAVDIYAGFYELNDSGISDSDRHNKILLACQAVAVRSCNEFEGEYLEAYKNKTGQL 246

Query: 249 VIPLGFLPPEKPQ-KSEFEAD-SPWKSTFEWLDHQSPQSVVFVGFGSECKLTKDQIHKIA 308
           VIP G LPPE+P  K E  +D SP    F+WLD Q P+SVVFVGFGSECKL+K+Q+ +IA
Sbjct: 247 VIPTGLLPPEQPSAKREISSDGSPNNVIFDWLDKQKPKSVVFVGFGSECKLSKEQVFEIA 306

Query: 309 RGLELSELPFLWSLRKPDWA-GDSDALPAGFQDRTAERGIVRMGWAPQMEILGHPAIGGC 368
            GL LSELPFLW+LRKP+WA  ++DALP GF +RT+E+G+V +GW PQMEILGHP++GG 
Sbjct: 307 HGLGLSELPFLWALRKPNWADSEADALPPGFVERTSEKGLVCLGWVPQMEILGHPSVGGS 366

Query: 369 FFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVEKGVAVEVERKEEDGSFSGEDIAK 428
            FH GWGS IE LQFGH LV+LPFI+DQPLNARLLVEK +AVEV+R  EDGSF  +DIAK
Sbjct: 367 LFHSGWGSVIETLQFGHVLVVLPFIIDQPLNARLLVEKDLAVEVKR-TEDGSFCKDDIAK 426

Query: 429 ALKEAMASEEGEKIRRRATEMSAIFGDTKLHQ-RYIEEFVEFLKN 462
            L+ AM +EEGEK+R  A + + +FGD KLHQ  Y+ +FV +LKN
Sbjct: 427 TLRHAMVAEEGEKLRSNARKAAKVFGDHKLHQDHYLGQFVHYLKN 468

BLAST of CmaCh14G020520 vs. TrEMBL
Match: B9GQB9_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0002s16380g PE=4 SV=1)

HSP 1 Score: 503.8 bits (1296), Expect = 2.2e-139
Identity = 255/466 (54.72%), Positives = 337/466 (72.32%), Query Frame = 1

Query: 8   HVVLLPWSAFGHLMPHFQLSLALAKAGVHVSFISTPKNLNRLPGIPPSLSPFITLVPIPL 67
           H+V+ PWSAFGH++P F  S ALA+AGVHVSF+STP+N+ RLP I P+L+P I LV +P 
Sbjct: 6   HIVIFPWSAFGHILPFFHFSKALAEAGVHVSFVSTPRNIQRLPAISPTLAPLINLVELPF 65

Query: 68  PKLPGD-PLPEGAEATVDIPFDKIPFLKLSLDLAEPSVRKFVADHPNPPDWMIVDFNATW 127
           P L     LPEGAEAT DIP +KI +LK++ DL +   ++FVA+    P+W+IVDF + W
Sbjct: 66  PALDVKYGLPEGAEATADIPAEKIQYLKIAYDLLQHPFKQFVAE--KSPNWIIVDFCSHW 125

Query: 128 ICDISREFRIPIVFFRVLSPGFLAFFAH---VLGSGLPL--SEIGSLMSPP--IISGSTV 187
             DI++E+ IP+++  + S    AF  H    +G G         SL SPP  I   S+V
Sbjct: 126 AVDIAKEYGIPLIYLSIFSGVMGAFMGHPGYFVGDGQKRYWGSPESLTSPPEWITFPSSV 185

Query: 188 AFRRYEAAKIHADLFEKNDSGMSDRERVTKIISGSRAIAVRSCYEFDVDYLKLYSIFCGK 247
           AFR YEA  ++  ++ +N SG+ D ERV K +SG +AIAVRSC EF+ +Y+ +Y     K
Sbjct: 186 AFRSYEAKNMYPGIYGENASGIRDAERVAKTVSGCQAIAVRSCIEFEGEYMDVYQKIMSK 245

Query: 248 RVIPLGFLPPEKPQKSEFEADSPWKSTFEWLDHQSPQSVVFVGFGSECKLTKDQIHKIAR 307
           +VIP+G LPPEKP++ E   D  W + FEWLD+Q  +SVVFVGFGSECKLTKD++++IA 
Sbjct: 246 QVIPIGLLPPEKPEEREI-TDGTWNTIFEWLDNQEHESVVFVGFGSECKLTKDEVYEIAY 305

Query: 308 GLELSELPFLWSLRKPDWAG-DSDALPAGFQDRTAERGIVRMGWAPQMEILGHPAIGGCF 367
           GLELS+LPFLW+LRKP+WA  D D LP  F ++T+E+GIV +GWAPQ+E+L HP+IGG  
Sbjct: 306 GLELSKLPFLWALRKPNWAATDLDVLPPEFNNKTSEKGIVSIGWAPQLELLSHPSIGGSL 365

Query: 368 FHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVEKGVAVEVERKEEDGSFSGEDIAKA 427
           FH GWGS IE LQ+GH L++LPFI DQ LNARLLVEKG+AVEV+RK EDGSF+  DIAK+
Sbjct: 366 FHSGWGSVIETLQYGHCLIVLPFIADQGLNARLLVEKGLAVEVDRK-EDGSFTRHDIAKS 425

Query: 428 LKEAMASEEGEKIRRRATEMSAIFGDTKLHQRYIEEFVEFLKNGDS 465
           L+ AM SEEG +++ RA + + IF + KLHQ YI  FV++LK+G S
Sbjct: 426 LRLAMVSEEGSQLKTRAKDAATIFQNRKLHQDYINRFVKYLKDGVS 467

BLAST of CmaCh14G020520 vs. TrEMBL
Match: B9I8N6_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0014s08440g PE=4 SV=1)

HSP 1 Score: 503.1 bits (1294), Expect = 3.8e-139
Identity = 260/470 (55.32%), Positives = 340/470 (72.34%), Query Frame = 1

Query: 1   MAEKKADHVVLLPWSAFGHLMPHFQLSLALAKAGVHVSFISTPKNLNRLPGIPPSLSPFI 60
           MAEK   H+V+LPW AFGH++P FQLS+ LAKAG+ VSF+STP+N+ RLP IPPSL+  +
Sbjct: 1   MAEKL--HIVMLPWIAFGHMIPFFQLSIDLAKAGIKVSFVSTPRNIKRLPKIPPSLADLV 60

Query: 61  TLVPIPLPKLPGDPLPEGAEATVDIPFDKIPFLKLSLDLAEPSVRKFVADHPNPPDWMIV 120
             V  PLP L  D LPE  EATVDIP +KI +LK++ DL +  +++F+AD    PDW+I+
Sbjct: 61  KFVEFPLPSLDNDILPEDGEATVDIPAEKIEYLKIAYDLLQHPLKQFIADQL--PDWIII 120

Query: 121 DFNATWICDISREFRIPIVFFRVLSPGFLAFFAH----VLGSGLPLSEIG--SLMSPP-- 180
           D    W+ +I+R+ ++P++ F V S     F  H    ++G G         S+ S P  
Sbjct: 121 DMIPYWMVEIARDKKVPLIHFSVFSAVAYVFLGHPECLLVGDGQKRLRPSWTSMTSKPEW 180

Query: 181 IISGSTVAFRRYEAAKIHADLFEKNDSGMSDRERVTKIISGSRAIAVRSCYEFDVDYLKL 240
           +   S+VA+R +EA  +   +++ N SG++D ERV+KI+ G +A+AVRSC EF+ DYL L
Sbjct: 181 VDFPSSVAYRNHEAVGVFEWIYKGNASGITDGERVSKILHGCQALAVRSCAEFEGDYLNL 240

Query: 241 YSIFCGKRVIPLGFLPPEKPQKSEFEADSPWKSTFEWLDHQSPQSVVFVGFGSECKLTKD 300
           +    GK VIP+G LP EKP++ EF  D  W   F+WLD Q P+SVVFVGFGSE KLT+D
Sbjct: 241 FERVIGKPVIPVGLLPQEKPERKEF-TDGRWGEIFKWLDDQKPKSVVFVGFGSEYKLTRD 300

Query: 301 QIHKIARGLELSELPFLWSLRKPDWAGDS-DALPAGFQDRTAERGIVRMGWAPQMEILGH 360
           Q+++IA GLELS LPFLW+LRKP WA D  DALP+GF +RT++RGIV MGWAPQMEILGH
Sbjct: 301 QVYEIAHGLELSGLPFLWALRKPGWANDDLDALPSGFGERTSDRGIVCMGWAPQMEILGH 360

Query: 361 PAIGGCFFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVEKGVAVEVERKEEDGSFS 420
           P+IGG  FH GWGS IE+LQFGH L+LLPFI+DQPLNAR LVEKG+ VEV+R E DGSF+
Sbjct: 361 PSIGGSLFHSGWGSIIESLQFGHTLILLPFIIDQPLNARYLVEKGLGVEVQRGE-DGSFT 420

Query: 421 GEDIAKALKEAMASEEGEKIRRRATEMSAIFGDTKLHQ-RYIEEFVEFLK 461
            + +AKAL  AM S EG+ +R +A+E +AIFG+ KLHQ  YI +FV+FLK
Sbjct: 421 RDGVAKALNLAMISAEGKGLREKASEAAAIFGNQKLHQDYYIGKFVDFLK 464

BLAST of CmaCh14G020520 vs. TrEMBL
Match: F6I663_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_15s0046g01950 PE=4 SV=1)

HSP 1 Score: 501.1 bits (1289), Expect = 1.4e-138
Identity = 270/469 (57.57%), Positives = 330/469 (70.36%), Query Frame = 1

Query: 4   KKAD-HVVLLPWSAFGHLMPHFQLSLALAKAGVHVSFISTPKNLNRLPGIPPSLSPFITL 63
           K AD HVV++PW AFGH++P  QLS+ALAKAGV VSF+STP+N+ RLP +PP L P I+ 
Sbjct: 21  KMADLHVVMVPWLAFGHMIPFLQLSIALAKAGVRVSFVSTPRNIRRLPKLPPDLEPLISF 80

Query: 64  VPIPLPKLPGDPLPEGAEATVDIPFDKIPFLKLSLDLAEPSVRKFVADHPNPPDWMIVDF 123
           V +PLP + G  LPE AEATVD+P +KI +LKL+ DL +   +KFVAD    PDW+I D 
Sbjct: 81  VELPLPAVDGGLLPEDAEATVDVPTEKIQYLKLAYDLLQHPFKKFVADQS--PDWIISDT 140

Query: 124 NATWICDISREFRIPIVFFRVLSPGFLAFFAH---VLGSGL----PLSEIGSLMSPP--I 183
            A W+ + + E RIP + F + S     F      ++G G     P  E  SL S P  +
Sbjct: 141 MAHWVVETAEEHRIPSMAFILFSSAAAVFVGPNECLIGEGRRRVRPSPE--SLTSSPEWV 200

Query: 184 ISGSTVAFRRYEAAKIHADLFEKNDSGMSDRERVTKIISGSRAIAVRSCYEFDVDYLKLY 243
              S+VAFR YEA   +A  F +N SG++D  RV K+    +A+AVRSC EF+ +YL ++
Sbjct: 201 SFPSSVAFRGYEARTCYAGFFGENVSGITDAHRVAKVCHACKAVAVRSCIEFEGEYLNIH 260

Query: 244 SIFCGKRVIPLGFLPPEKPQKSEFEADSPWKSTFEWLDHQSPQSVVFVGFGSECKLTKDQ 303
               GK VIP+GFLPPEK Q      +  W   F+WLD Q P+SVVFVGFGSECKLTKDQ
Sbjct: 261 EKIMGKPVIPVGFLPPEK-QGGRETTEGSWSEIFKWLDEQKPKSVVFVGFGSECKLTKDQ 320

Query: 304 IHKIARGLELSELPFLWSLRKPDWA-GDSDALPAGFQDRTAERGIVRMGWAPQMEILGHP 363
           +H+IA GLELSELPFLW+LRKP+W   D DALP+ F DRT+ +GIV MGWAPQMEIL HP
Sbjct: 321 VHEIAYGLELSELPFLWALRKPNWTMEDIDALPSCFSDRTSGKGIVWMGWAPQMEILAHP 380

Query: 364 AIGGCFFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVEKGVAVEVERKEEDGSFSG 423
           +IGG  FH GWGS IE LQFGH LVLLPFIVDQ LNARLLVEKG+AVE+ER E DGSFS 
Sbjct: 381 SIGGSLFHSGWGSVIETLQFGHCLVLLPFIVDQGLNARLLVEKGLAVEIERSE-DGSFSR 440

Query: 424 EDIAKALKEAMASEEGEKIRRRATEMSAIFGDTKLHQ-RYIEEFVEFLK 461
           EDIAK+L+ AM SEEGEK+R RA E +AIF D +L Q  YI   V++LK
Sbjct: 441 EDIAKSLRVAMVSEEGEKLRARAREAAAIFIDKRLQQEHYIGGLVKYLK 483

BLAST of CmaCh14G020520 vs. TAIR10
Match: AT2G22590.1 (AT2G22590.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 322.8 bits (826), Expect = 3.5e-88
Identity = 180/458 (39.30%), Positives = 264/458 (57.64%), Query Frame = 1

Query: 8   HVVLLPWSAFGHLMPHFQLSLALAKAGVHVSFISTPKNLNRL-PGIPPSLSPFITLVPIP 67
           HVV+ PW AFGH++P+ +LS  +A+ G  VSFISTP+N++RL P +P +LS  I  V + 
Sbjct: 15  HVVMFPWLAFGHMVPYLELSKLIAQKGHKVSFISTPRNIDRLLPRLPENLSSVINFVKLS 74

Query: 68  LPKLPGDPLPEGAEATVDIPFDKIPFLKLSLDLAEPSVRKFVADHPNPPDWMIVDFNATW 127
           LP +  + LPE  EAT D+PF+ IP+LK++ D  +  V +F+    + PDW++ DF   W
Sbjct: 75  LP-VGDNKLPEDGEATTDVPFELIPYLKIAYDGLKVPVTEFLES--SKPDWVLQDFAGFW 134

Query: 128 ICDISREFRIPIVFFRVLSPGFLAFFAHVLGSGLPLSEIGSLMSPP--IISGSTVAFRRY 187
           +  ISR   I   FF   +   L       G     +     M PP  +   ++VAF+ +
Sbjct: 135 LPPISRRLGIKTGFFSAFNGATLGILKPP-GFEEYRTSPADFMKPPKWVPFETSVAFKLF 194

Query: 188 EAAKIHADLF-EKNDSGMSDRERVTKIISGSRAIAVRSCYEFDVDYLKLYSIFCGKRVIP 247
           E   I      E  +  + D  RV  +I G   I VRSCYE++ ++L L      K VIP
Sbjct: 195 ECRFIFKGFMAETTEGNVPDIHRVGGVIDGCDVIFVRSCYEYEAEWLGLTQELHRKPVIP 254

Query: 248 LGFLPPEKPQKSEFEADSPWKSTFEWLDHQSPQSVVFVGFGSECKLTKDQIHKIARGLEL 307
           +G LPP+  +K  FE    W S  +WLD +  +S+V+V FGSE K ++ ++++IA GLEL
Sbjct: 255 VGVLPPKPDEK--FEDTDTWLSVKKWLDSRKSKSIVYVAFGSEAKPSQTELNEIALGLEL 314

Query: 308 SELPFLWSL--RKPDWAGDSDALPAGFQDRTAERGIVRMGWAPQMEILGHPAIGGCFFHG 367
           S LPF W L  R+  W  +   LP GF++RTA+RG+V  GW  Q+  L H +IG    H 
Sbjct: 315 SGLPFFWVLKTRRGPWDTEPVELPEGFEERTADRGMVWRGWVEQLRTLSHDSIGLVLTHP 374

Query: 368 GWGSAIEALQFGHRLVLLPFIVDQPLNARLLVEKGVAVEVERKEEDGSFSGEDIAKALKE 427
           GWG+ IEA++F   + +L F+ DQ LNAR++ EK +   + R E +G F+ E +A +L+ 
Sbjct: 375 GWGTIIEAIRFAKPMAMLVFVYDQGLNARVIEEKKIGYMIPRDETEGFFTKESVANSLRL 434

Query: 428 AMASEEGEKIRRRATEMSAIFGDTKLHQRYIEEFVEFL 460
            M  EEG+  R    EM  +FGD     RY++ F+E+L
Sbjct: 435 VMVEEEGKVYRENVKEMKGVFGDMDRQDRYVDSFLEYL 466

BLAST of CmaCh14G020520 vs. TAIR10
Match: AT5G49690.1 (AT5G49690.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 308.9 bits (790), Expect = 5.3e-84
Identity = 173/473 (36.58%), Positives = 268/473 (56.66%), Query Frame = 1

Query: 1   MAEKKAD--HVVLLPWSAFGHLMPHFQLSLALAKAGVHVSFISTPKNLNRLPGIPPSLSP 60
           M +K+ +  HV + PW A GHL+P  +LS  LA+ G  +SFISTP+N+ RLP +  +L+ 
Sbjct: 1   MVDKREEVMHVAMFPWLAMGHLLPFLRLSKLLAQKGHKISFISTPRNIERLPKLQSNLAS 60

Query: 61  FITLVPIPLPKLPGDPLPEGAEATVDIPFDKIPFLKLSLDLAEPSVRKFVADHPNPPDWM 120
            IT V  PLP + G  LP  +E+++D+P++K   LK + DL +P +++F+    + PDW+
Sbjct: 61  SITFVSFPLPPISG--LPPSSESSMDVPYNKQQSLKAAFDLLQPPLKEFL--RRSSPDWI 120

Query: 121 IVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGSGLPLSEIGS------LMSPP 180
           I D+ + W+  I+ E  I   FF + +   L F      S   + EI S      ++ P 
Sbjct: 121 IYDYASHWLPSIAAELGISKAFFSLFNAATLCFMGP---SSSLIEEIRSTPEDFTVVPPW 180

Query: 181 IISGSTVAFRRYEAAKIHADLFEKNDSGMSDRERVTKIISGSRAIAVRSCYEFDVDYLKL 240
           +   S + FR +E  + + +  E++ +G+SD  R    I  S A+ VRSC EF+ ++  L
Sbjct: 181 VPFKSNIVFRYHEVTR-YVEKTEEDVTGVSDSVRFGYSIDESDAVFVRSCPEFEPEWFGL 240

Query: 241 YSIFCGKRVIPLGFLPPEKPQKSEFEADSPWKSTFEWLDHQSPQSVVFVGFGSECKLTKD 300
                 K V P+GFLPP    + +   D+ W    +WLD Q   SVV+V  G+E  L  +
Sbjct: 241 LKDLYRKPVFPIGFLPPVI--EDDDAVDTTWVRIKKWLDKQRLNSVVYVSLGTEASLRHE 300

Query: 301 QIHKIARGLELSELPFLWSLRKPDWAGDSDALPAGFQDRTAERGIVRMGWAPQMEILGHP 360
           ++ ++A GLE SE PF W LR      +   +P GF+ R   RG+V +GW PQ++IL H 
Sbjct: 301 EVTELALGLEKSETPFFWVLR------NEPKIPDGFKTRVKGRGMVHVGWVPQVKILSHE 360

Query: 361 AIGGCFFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVEKGVAVEVERKEEDGSFSG 420
           ++GG   H GW S +E L FG   +  P + +Q LN RLL  KG+ VEV R E DGSF  
Sbjct: 361 SVGGFLTHCGWNSVVEGLGFGKVPIFFPVLNEQGLNTRLLHGKGLGVEVSRDERDGSFDS 420

Query: 421 EDIAKALKEAMASEEGEKIRRRATEMSAIFGDTKLHQRYIEEFVEFLKNGDSN 466
           + +A +++  M  + GE+IR +A  M  +FG+   + RY++E V F+++  S+
Sbjct: 421 DSVADSIRLVMIDDAGEEIRAKAKVMKDLFGNMDENIRYVDELVRFMRSKGSS 457

BLAST of CmaCh14G020520 vs. TAIR10
Match: AT5G65550.1 (AT5G65550.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 284.3 bits (726), Expect = 1.4e-76
Identity = 171/474 (36.08%), Positives = 258/474 (54.43%), Query Frame = 1

Query: 1   MAEKKAD-HVVLLPWSAFGHLMPHFQLSLALAKAGVHVSFISTPKNLNRLPGIPPSLSPF 60
           MAE K   HV + PW A GH++P+ QLS  +A+ G  VSFIST +N++RLP I   LS  
Sbjct: 1   MAEPKPKLHVAVFPWLALGHMIPYLQLSKLIARKGHTVSFISTARNISRLPNISSDLS-- 60

Query: 61  ITLVPIPLPKLPGDPLPEGAEATVDIPFDKIPFLKLSLDLAEPSVRKFVADHPNPPDWMI 120
           +  V +PL +   D LPE AEAT D+P   I +LK + D    +  +F+    + P+W++
Sbjct: 61  VNFVSLPLSQTV-DHLPENAEATTDVPETHIAYLKKAFDGLSEAFTEFL--EASKPNWIV 120

Query: 121 VDFNATWICDISREFRIPIVFFRVLSPGFLAFF---AHVLGSGL-PLSEIGSLMSPP--I 180
            D    W+  I+ +  +    F   +   +      A V+  G  P      L+ PP  +
Sbjct: 121 YDILHHWVPPIAEKLGVRRAIFCTFNAASIIIIGGPASVMIQGHDPRKTAEDLIVPPPWV 180

Query: 181 ISGSTVAFRRYEAAKIHADLFEKNDSG-----MSDRERVTKIISGSRAIAVRSCYEFDVD 240
              + + +R +EA +I     E   +G     ++D  R+     GS  I +RSC E + +
Sbjct: 181 PFETNIVYRLFEAKRI----MEYPTAGVTGVELNDNCRLGLAYVGSEVIVIRSCMELEPE 240

Query: 241 YLKLYSIFCGKRVIPLGFLPPEKPQKSEFEADSPWKSTFEWLDHQSPQSVVFVGFGSECK 300
           +++L S   GK VIP+G LP      ++ E    W    EWLD    +SVV+V  G+E  
Sbjct: 241 WIQLLSKLQGKPVIPIGLLPATPMDDADDEGT--WLDIREWLDRHQAKSVVYVALGTEVT 300

Query: 301 LTKDQIHKIARGLELSELPFLWSLRKPDWAGDSDALPAGFQDRTAERGIVRMGWAPQMEI 360
           ++ ++I  +A GLEL  LPF W+LRK   A  S  LP GF++R  ERG++   W PQ +I
Sbjct: 301 ISNEEIQGLAHGLELCRLPFFWTLRKRTRA--SMLLPDGFKERVKERGVIWTEWVPQTKI 360

Query: 361 LGHPAIGGCFFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVEKGVAVEVERKEEDG 420
           L H ++GG   H GWGSA+E L FG  L++ P  +DQPL ARLL    + +E+ R E DG
Sbjct: 361 LSHGSVGGFVTHCGWGSAVEGLSFGVPLIMFPCNLDQPLVARLLSGMNIGLEIPRNERDG 420

Query: 421 SFSGEDIAKALKEAMASEEGEKIRRR-ATEMSAIFGDTKLHQRYIEEFVEFLKN 462
            F+   +A+ ++  +  EEG+  R   A++   IFG+ +L  +Y + F+EFL+N
Sbjct: 421 LFTSASVAETIRHVVVEEEGKIYRNNAASQQKKIFGNKRLQDQYADGFIEFLEN 461

BLAST of CmaCh14G020520 vs. TAIR10
Match: AT1G64910.1 (AT1G64910.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 198.4 bits (503), Expect = 1.0e-50
Identity = 146/462 (31.60%), Positives = 235/462 (50.87%), Query Frame = 1

Query: 8   HVVLLPWSAFGHLMPHFQLSLALAKAGVHVSFI---STPKNLNRLPGIPPSLSPFITLVP 67
           H  + PW AFGH+ P+  L+  LA+ G  ++F+      K L  L   P S    I    
Sbjct: 6   HAFMFPWFAFGHMTPYLHLANKLAERGHRITFLIPKKAQKQLEHLNLFPDS----IVFHS 65

Query: 68  IPLPKLPGDPLPEGAEATVDIPFDKIPFLKLSLDLAEPSVRKFVADHPNPPDWMIVDFNA 127
           + +P + G  LP GAE   DIP     FL  ++DL    V   V+     PD ++ D  A
Sbjct: 66  LTIPHVDG--LPAGAETFSDIPMPLWKFLPPAIDLTRDQVEAAVS--ALSPDLILFDI-A 125

Query: 128 TWICDISREFRIPIVFFRVLSPGFLAFFAHVLGSGLPLSEIGSLMSPPIISGSTVAFRRY 187
           +W+ ++++E+R+  + + ++S   +A         +P  E+G  + PP    S + +R++
Sbjct: 126 SWVPEVAKEYRVKSMLYNIISATSIAH------DFVPGGELG--VPPPGYPSSKLLYRKH 185

Query: 188 EA-AKIHADLFEKNDSGMSDRERVTKIISGSRAIAVRSCYEFDVDYLKLYSIFCGKRVIP 247
           +A A +   ++ K  S      R+   +     I++R+C E +  + +       K+V  
Sbjct: 186 DAHALLSFSVYYKRFS-----HRLITGLMNCDFISIRTCKEIEGKFCEYLERQYHKKVFL 245

Query: 248 LGFLPPEKPQKSEFEADSPWKSTFEWLDHQSPQSVVFVGFGSECKLTKDQIHKIARGLEL 307
            G + PE P K +   +  W     WL+     SVVF   GS+  L KDQ  ++  G+EL
Sbjct: 246 TGPMLPE-PNKGK-PLEDRWS---HWLNGFEQGSVVFCALGSQVTLEKDQFQELCLGIEL 305

Query: 308 SELPFLWSLRKPDWAGD-SDALPAGFQDRTAERGIVRMGWAPQMEILGHPAIGGCFFHGG 367
           + LPF  ++  P  A    DALP GF++R  +RG+V   W  Q  +L HP++G    H G
Sbjct: 306 TGLPFFVAVTPPKGAKTIQDALPEGFEERVKDRGVVLGEWVQQPLLLAHPSVGCFLSHCG 365

Query: 368 WGSAIEALQFGHRLVLLPFIVDQPLNARLLVEK-GVAVEVERKEEDGSFSGEDIAKALKE 427
           +GS  E++    ++VLLPF+ DQ LN RL+ E+  V+VEV+R EE G FS E ++ A+  
Sbjct: 366 FGSMWESIMSDCQIVLLPFLADQVLNTRLMTEELKVSVEVQR-EETGWFSKESLSVAITS 425

Query: 428 AM--ASEEGEKIRRRATEMSAIFGDTKLHQRYIEEFVEFLKN 462
            M  ASE G  +RR  +++  +     L   Y ++FV+ L+N
Sbjct: 426 VMDQASEIGNLVRRNHSKLKEVLVSDGLLTGYTDKFVDTLEN 439

BLAST of CmaCh14G020520 vs. TAIR10
Match: AT3G29630.1 (AT3G29630.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 197.2 bits (500), Expect = 2.2e-50
Identity = 155/461 (33.62%), Positives = 231/461 (50.11%), Query Frame = 1

Query: 8   HVVLLPWSAFGHLMPHFQLSLALAKAGVHVSFISTPKNLNRLPGIPPSLSP-FITLVPIP 67
           H  L PW  FGH++P+  L+  LA+ G  V+F++  K   +L   P +L P  I    + 
Sbjct: 6   HAFLYPWFGFGHMIPYLHLANKLAEKGHRVTFLAPKKAQKQLE--PLNLFPNSIHFENVT 65

Query: 68  LPKLPGDPLPEGAEATVDIPFDKIPFLKLSLDLAEPSVRKFVADHPNPPDWMIVDFNATW 127
           LP + G  LP GAE T D+P      L  ++DL    +   V      PD +  DF   W
Sbjct: 66  LPHVDG--LPVGAETTADLPNSSKRVLADAMDLLREQIE--VKIRSLKPDLIFFDF-VDW 125

Query: 128 ICDISREFRIPIVFFRVLSPGFLAFFAHVLGSGLPLSEIGSLMSPPIISGSTVAFRRYEA 187
           I  +++E  I  V ++++S  F+A F        P +E+GS   PP    S VA R ++A
Sbjct: 126 IPQMAKELGIKSVSYQIISAAFIAMFF------APRAELGS--PPPGFPSSKVALRGHDA 185

Query: 188 AKIHADLFEKNDSGMSDRERVTKIISGSRAIAVRSCYEFDVDYLKLYSIFCGKRVIPLG- 247
             I++ LF      + DR  VT  +     IA+R+C E + +        C ++V+  G 
Sbjct: 186 -NIYS-LFANTRKFLFDR--VTTGLKNCDVIAIRTCAEIEGNLCDFIERQCQRKVLLTGP 245

Query: 248 -FLPPEKPQKSEFEADSPWKSTFEWLDHQSPQSVVFVGFGSECKLTKDQIHKIARGLELS 307
            FL P+   KS    +  W +   WL+   P SVV+  FG+      DQ  ++  G+EL+
Sbjct: 246 MFLDPQG--KSGKPLEDRWNN---WLNGFEPSSVVYCAFGTHFFFEIDQFQELCLGMELT 305

Query: 308 ELPFLWSLRKPDWAGD-SDALPAGFQDRTAERGIVRMGWAPQMEILGHPAIGGCFFHGGW 367
            LPFL ++  P  +    +ALP GF++R   RGIV  GW  Q  IL HP+IG    H G+
Sbjct: 306 GLPFLVAVMPPRGSSTIQEALPEGFEERIKGRGIVWGGWVEQPLILSHPSIGCFVNHCGF 365

Query: 368 GSAIEALQFGHRLVLLPFIVDQPLNARLLVEK-GVAVEVERKEEDGSFSGEDIAKALKEA 427
           GS  E+L    ++V +P +VDQ L  RLL E+  V+V+V+R E  G FS E +   +K  
Sbjct: 366 GSMWESLVSDCQIVFIPQLVDQVLTTRLLTEELEVSVKVKRDEITGWFSKESLRDTVKSV 425

Query: 428 M--ASEEGEKIRRRATEMSAIFGDTKLHQRYIEEFVEFLKN 462
           M   SE G  +RR   ++        L   Y ++FV+ L+N
Sbjct: 426 MDKNSEIGNLVRRNHKKLKETLVSPGLLSSYADKFVDELEN 442

BLAST of CmaCh14G020520 vs. NCBI nr
Match: gi|659075186|ref|XP_008438010.1| (PREDICTED: UDP-glycosyltransferase 91A1-like [Cucumis melo])

HSP 1 Score: 795.4 bits (2053), Expect = 5.3e-227
Identity = 386/463 (83.37%), Positives = 421/463 (90.93%), Query Frame = 1

Query: 1   MAEKKADHVVLLPWSAFGHLMPHFQLSLALAKAGVHVSFISTPKNLNRLPGIPPSLSPFI 60
           MAE K  HVV+ PWSAFGHL+PHFQLS+ALAKAGVHVSFISTPKNL RLP IPPSLS FI
Sbjct: 25  MAENKGLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSSFI 84

Query: 61  TLVPIPLPKLPGDPLPEGAEATVDIPFDKIPFLKLSLDLAEPSVRKFVADHPNPPDWMIV 120
           T VPIPLPKLPGDPLPEGAEATVDIPF+KIPFLK++LDLAEP  RKF+ADHP+PPDW IV
Sbjct: 85  TPVPIPLPKLPGDPLPEGAEATVDIPFEKIPFLKVALDLAEPPFRKFIADHPHPPDWFIV 144

Query: 121 DFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGSGLPLSEIGSLMSPPIISGSTVA 180
           DFN +WI DISREFR+PIVFFRVLSPGFLAF+AHVLG+ LPLSEIGSL+SPP I GSTVA
Sbjct: 145 DFNVSWISDISREFRVPIVFFRVLSPGFLAFYAHVLGARLPLSEIGSLISPPPIEGSTVA 204

Query: 181 FRRYEAAKIHADLFEKNDSGMSDRERVTKIISGSRAIAVRSCYEFDVDYLKLYSIFCGKR 240
           +RR+EA  I A  FEKNDSGMSD ERVTKIIS  +AIAVR+CYEFDVDYLKLYS +CGK+
Sbjct: 205 YRRHEAVGIRAGFFEKNDSGMSDYERVTKIISACQAIAVRTCYEFDVDYLKLYSNYCGKK 264

Query: 241 VIPLGFLPPEKPQKSEFEADSPWKSTFEWLDHQSPQSVVFVGFGSECKLTKDQIHKIARG 300
           VIPLG LPPEKP K+EFEA+SPWKSTFEWLD Q+P+SVVFVGFGSECKLTKDQIH+IARG
Sbjct: 265 VIPLGLLPPEKPPKTEFEANSPWKSTFEWLDQQNPKSVVFVGFGSECKLTKDQIHEIARG 324

Query: 301 LELSELPFLWSLRKPDWAGDSDALPAGFQDRTAERGIVRMGWAPQMEILGHPAIGGCFFH 360
           +ELSELPFLW+LRKPDWA DSD LPAGF DRTA RG+V MGWAPQMEILGHPAIGG FFH
Sbjct: 325 VELSELPFLWALRKPDWAEDSDVLPAGFPDRTAGRGMVSMGWAPQMEILGHPAIGGSFFH 384

Query: 361 GGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVEKGVAVEVERKEEDGSFSGEDIAKALK 420
           GGWGSAIEAL+FG+ L+LLPFIVDQPLNARLLVEKGVAVEVER E+DG FSGE IAKAL+
Sbjct: 385 GGWGSAIEALEFGNCLILLPFIVDQPLNARLLVEKGVAVEVERNEDDGCFSGEAIAKALR 444

Query: 421 EAMASEEGEKIRRRATEMSAIFGDTKLHQRYIEEFVEFLKNGD 464
           EAM S EGEKIR+RA E++AIFGDTKLHQRYIEEFVEFLKN D
Sbjct: 445 EAMVSGEGEKIRKRAEEVAAIFGDTKLHQRYIEEFVEFLKNRD 487

BLAST of CmaCh14G020520 vs. NCBI nr
Match: gi|449433069|ref|XP_004134320.1| (PREDICTED: UDP-glycosyltransferase 91A1-like [Cucumis sativus])

HSP 1 Score: 788.9 bits (2036), Expect = 4.9e-225
Identity = 379/461 (82.21%), Positives = 421/461 (91.32%), Query Frame = 1

Query: 1   MAEKKADHVVLLPWSAFGHLMPHFQLSLALAKAGVHVSFISTPKNLNRLPGIPPSLSPFI 60
           MAE K  HVV+ PWSAFGHL+PHFQLS+ALAKAGVHVSFISTPKNL RLP IPPSLS FI
Sbjct: 1   MAENKGLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSSFI 60

Query: 61  TLVPIPLPKLPGDPLPEGAEATVDIPFDKIPFLKLSLDLAEPSVRKFVADHPNPPDWMIV 120
           TLVPIPLPKLPGDPLPEGAEATVDIPFDKIPFLK++LDL EP  RKF+ADH +PPDW IV
Sbjct: 61  TLVPIPLPKLPGDPLPEGAEATVDIPFDKIPFLKVALDLTEPPFRKFIADHAHPPDWFIV 120

Query: 121 DFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGSGLPLSEIGSLMSPPIISGSTVA 180
           DFN +WI DISREFRIPIVFFRVLSPGFLAF+AH+LG+ LP++EIGSL+SPP I GSTVA
Sbjct: 121 DFNVSWIGDISREFRIPIVFFRVLSPGFLAFYAHLLGNRLPMTEIGSLISPPPIEGSTVA 180

Query: 181 FRRYEAAKIHADLFEKNDSGMSDRERVTKIISGSRAIAVRSCYEFDVDYLKLYSIFCGKR 240
           +RR+EA  IHA  FEKNDSG+SD ERVTKI +  R IAVR+CYEFDVDYLKLYS +CGK+
Sbjct: 181 YRRHEAVGIHAGFFEKNDSGLSDYERVTKINTACRVIAVRTCYEFDVDYLKLYSNYCGKK 240

Query: 241 VIPLGFLPPEKPQKSEFEADSPWKSTFEWLDHQSPQSVVFVGFGSECKLTKDQIHKIARG 300
           VIPLGFLPPEKP K+EFEA+SPWKSTFEWLD Q+P+SVVFVGFGSECKLTKDQIH+IARG
Sbjct: 241 VIPLGFLPPEKPPKTEFEANSPWKSTFEWLDQQNPKSVVFVGFGSECKLTKDQIHEIARG 300

Query: 301 LELSELPFLWSLRKPDWAGDSDALPAGFQDRTAERGIVRMGWAPQMEILGHPAIGGCFFH 360
           +ELSELPF+W+LR+PDWA DSD LPAGF+DRTAERGIV MGWAPQM+ILGHPAIGG FFH
Sbjct: 301 VELSELPFMWALRQPDWAEDSDVLPAGFRDRTAERGIVSMGWAPQMQILGHPAIGGSFFH 360

Query: 361 GGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVEKGVAVEVERKEEDGSFSGEDIAKALK 420
           GGWGSAIEAL+FG+ L+LLPFIVDQPLNARLLVEKGVA+EVER E+DG  SGE IAKAL+
Sbjct: 361 GGWGSAIEALEFGNCLILLPFIVDQPLNARLLVEKGVAIEVERNEDDGCSSGEAIAKALR 420

Query: 421 EAMASEEGEKIRRRATEMSAIFGDTKLHQRYIEEFVEFLKN 462
           EAM SEEGEKIR+RA E++AIFGDTKLHQRYIEEFVEFLK+
Sbjct: 421 EAMVSEEGEKIRKRAKEVAAIFGDTKLHQRYIEEFVEFLKH 461

BLAST of CmaCh14G020520 vs. NCBI nr
Match: gi|747091051|ref|XP_011093242.1| (PREDICTED: putative UDP-rhamnose:rhamnosyltransferase 1 [Sesamum indicum])

HSP 1 Score: 535.8 bits (1379), Expect = 7.5e-149
Identity = 273/461 (59.22%), Positives = 338/461 (73.32%), Query Frame = 1

Query: 8   HVVLLPWSAFGHLMPHFQLSLALAKAGVHVSFISTPKNLNRLPGIPPSLSPFITLVPIPL 67
           HV +LPWSAFGHL+P FQLS+ALAK+G+HVSF++TP+N+ RLP IPP+LS  I  VP+PL
Sbjct: 8   HVAMLPWSAFGHLIPFFQLSIALAKSGIHVSFLATPRNILRLPKIPPNLSNLIHFVPLPL 67

Query: 68  PKLPGDPLPEGAEATVDIPFDKIPFLKLSLDLAEPSVRKFVADHPNPPDWMIVDFNATWI 127
           PK    PLPE AEATVDIP DKI +LKL+ DL +  VR FV++  NPP W+IVDF   W 
Sbjct: 68  PKPESSPLPESAEATVDIPADKIQYLKLACDLLQEPVRGFVSE--NPPHWIIVDFFHHWA 127

Query: 128 CDISREFRIPIVFFRVLSPGFLAFFA-HVL---GSGLPLSEIGSLMSPPIISGSTVAFRR 187
            DI+++F IPIV F V+S   + FF  H +   G    L +  +L    I   S VA+++
Sbjct: 128 VDIAQDFNIPIVIFWVVSAATVDFFGVHKVPLDGDKRMLPQSWTLPPEYIDFSSKVAYKK 187

Query: 188 YEAAKIHADLFEKNDSGMSDRERVTKIISGSRAIAVRSCYEFDVDYLKLYSIFCGKRVIP 247
           +EA ++HA  F +N SGM D  RV KII    AIA+R+C EF+ DYLKL+    GK V P
Sbjct: 188 HEAEEMHAGYFGQNASGMPDSSRVAKIIQACNAIALRTCPEFEADYLKLHEKLTGKPVFP 247

Query: 248 LGFLPPEK-PQKSEFEADSPWKSTFEWLDHQSPQSVVFVGFGSECKLTKDQIHKIARGLE 307
           +GFLPPEK  ++S    + PW   F+WLD Q+P+SVVFVGFGSECKL KDQIH+IA G+E
Sbjct: 248 VGFLPPEKMKRRSPTSNEEPWSGIFQWLDKQNPRSVVFVGFGSECKLNKDQIHEIAHGVE 307

Query: 308 LSELPFLWSLRKPDWAGDS-DALPAGFQDRTAERGIVRMGWAPQMEILGHPAIGGCFFHG 367
           LS LPFLW+LRKPDWA D  DA PAGF++RTA RG+  +GWAPQ E+L HP++GGC FH 
Sbjct: 308 LSGLPFLWALRKPDWADDDEDAFPAGFRERTAARGVAHVGWAPQREVLSHPSVGGCLFHA 367

Query: 368 GWGSAIEALQFGHRLVLLPFIVDQPLNARLLVEKGVAVEVERKEEDGSFSGEDIAKALKE 427
           GWGS IE+LQ+GH LV LPFI+DQPLNARLLVEKG+  EVER  EDGSFS  DIAKAL++
Sbjct: 368 GWGSIIESLQYGHCLVFLPFIIDQPLNARLLVEKGLGWEVER-GEDGSFSRYDIAKALEK 427

Query: 428 AMASEEGEKIRRRATEM-SAIFGDTKLHQRYIEEFVEFLKN 462
           AM  +EGE++R R  E    IFGD KLH  Y+E+FVE+LKN
Sbjct: 428 AMVLKEGEEVRVRGREAGDGIFGDEKLHHSYVEKFVEYLKN 465

BLAST of CmaCh14G020520 vs. NCBI nr
Match: gi|747091053|ref|XP_011093243.1| (PREDICTED: putative UDP-rhamnose:rhamnosyltransferase 1 [Sesamum indicum])

HSP 1 Score: 531.2 bits (1367), Expect = 1.8e-147
Identity = 271/461 (58.79%), Positives = 336/461 (72.89%), Query Frame = 1

Query: 8   HVVLLPWSAFGHLMPHFQLSLALAKAGVHVSFISTPKNLNRLPGIPPSLSPFITLVPIPL 67
           HV +LPWSAFGHL+P  QLS+ALAK+G+HVSF++TP+N+ RLP IPP+LS  I  VP+PL
Sbjct: 8   HVAMLPWSAFGHLIPFLQLSIALAKSGIHVSFLATPRNILRLPKIPPNLSNLIDFVPLPL 67

Query: 68  PKLPGDPLPEGAEATVDIPFDKIPFLKLSLDLAEPSVRKFVADHPNPPDWMIVDFNATWI 127
           PK    PLPE AEATVDIP DKI +LKL+ DL +  VR FV++  NPP W+IVDF   W 
Sbjct: 68  PKPESSPLPESAEATVDIPADKIQYLKLACDLLQEPVRGFVSE--NPPHWIIVDFFHHWA 127

Query: 128 CDISREFRIPIVFFRVLSPGFLAFFA-HVL---GSGLPLSEIGSLMSPPIISGSTVAFRR 187
            DI+++F IPIV F V S   + FF  H +   G    L +  +L    I   S VA+++
Sbjct: 128 VDIAQDFNIPIVTFWVFSAATVDFFGVHKVPLDGDKRMLPQSWTLPPEYIDFSSKVAYKK 187

Query: 188 YEAAKIHADLFEKNDSGMSDRERVTKIISGSRAIAVRSCYEFDVDYLKLYSIFCGKRVIP 247
           +EA ++HA  F +N SGM D  RV KII  + AIA+R+C EF+ DYLKL+    GK V P
Sbjct: 188 HEAEEMHAGYFGQNASGMPDSSRVAKIIQATNAIAIRTCPEFEADYLKLHEKLTGKPVFP 247

Query: 248 LGFLPPEK-PQKSEFEADSPWKSTFEWLDHQSPQSVVFVGFGSECKLTKDQIHKIARGLE 307
           +GFLPPEK  ++S    + PW   F+WLD Q+P+SVVFVGFGSECKL KDQIH+IA G+E
Sbjct: 248 VGFLPPEKMKRRSPTSNEEPWSGIFQWLDKQNPRSVVFVGFGSECKLNKDQIHEIAHGVE 307

Query: 308 LSELPFLWSLRKPDWAGDS-DALPAGFQDRTAERGIVRMGWAPQMEILGHPAIGGCFFHG 367
           LS LPFLW+LRKPDWA D  DA PAGF++RTA RG+  +GWAPQ E+L HP++GGC FH 
Sbjct: 308 LSGLPFLWALRKPDWADDDEDAFPAGFRERTATRGVAHVGWAPQREVLSHPSVGGCLFHA 367

Query: 368 GWGSAIEALQFGHRLVLLPFIVDQPLNARLLVEKGVAVEVERKEEDGSFSGEDIAKALKE 427
           GWGS IE+LQ+GH LV LPFI+DQPLNARLLVEKG+  EVER  EDGSFS  DIAKAL++
Sbjct: 368 GWGSIIESLQYGHCLVFLPFIIDQPLNARLLVEKGLGWEVER-GEDGSFSRYDIAKALEK 427

Query: 428 AMASEEGEKIRRRATEM-SAIFGDTKLHQRYIEEFVEFLKN 462
           AM  +EGE++  R  E    IFGD KLH  Y+E+FVE+LKN
Sbjct: 428 AMVLKEGEEVMVRGREAGDGIFGDEKLHLSYVEKFVEYLKN 465

BLAST of CmaCh14G020520 vs. NCBI nr
Match: gi|657942934|ref|XP_008346533.1| (PREDICTED: putative UDP-rhamnose:rhamnosyltransferase 1 [Malus domestica])

HSP 1 Score: 517.3 bits (1331), Expect = 2.8e-143
Identity = 266/474 (56.12%), Positives = 335/474 (70.68%), Query Frame = 1

Query: 4   KKADH--VVLLPWSAFGHLMPHFQLSLALAKAGVHVSFISTPKNLNRLPGIPPSLSPFIT 63
           +K DH  VV+LPWSAFGH+MP FQLS+ALAKA VHVSFISTPKN+ RLP I P L PF+ 
Sbjct: 2   EKDDHLSVVMLPWSAFGHMMPFFQLSIALAKAKVHVSFISTPKNIQRLPKILPDLQPFVQ 61

Query: 64  LVPIPLPKLPGDPLPEGAEATVDIPFDKIPFLKLSLDLAEPSVRKFVADHPNPPDWMIVD 123
           LV IP P L  D LPEGAEA+VD+PF+    LK++ DL +  +++F+ D    PDW+I D
Sbjct: 62  LVXIPFPXLDPDLLPEGAEASVDLPFENTDNLKIAYDLLQTPIKQFIGDRL--PDWIITD 121

Query: 124 FNATWICDISREFRIPIVFFRVLSPGFLAFFAHV-----LGSGLPLSEIGSLMSPP--II 183
           F A W+ +I +E+ +P+V+F V+S   +  F+         +   L    SL SPP  + 
Sbjct: 122 FAAHWVVEIGKEYGVPLVYFTVVSAATVVVFSSAENPSSXNTDXTLPSPESLTSPPDLVT 181

Query: 184 SGSTVAFRRYEAAKIHADLFEKNDSGMSDRERVTKIISGSRAIAVRSCYEFDVDYLKLYS 243
           S STVA+R +EA  +H  L+  NDSG+SD ER  KI+S  R  A+RSCYEF+ +YL+ Y 
Sbjct: 182 SQSTVAYREHEAVDMHEGLYGVNDSGISDAERHAKILSACRVFAIRSCYEFEGEYLEAYK 241

Query: 244 IFCGKRVIPLGFLPPEKPQKSEFEADSPWKSTFEWLDHQSPQSVVFVGFGSECKLTKDQI 303
              GK VIP G LPPE P K      S     FEWLD Q  +SVVFVGFGSECKL+K+Q+
Sbjct: 242 NXSGKLVIPTGLLPPEIPXKGVKGEISSDDGIFEWLDKQKTRSVVFVGFGSECKLSKEQV 301

Query: 304 HKIARGLELSELPFLWSLRKPDWA-GDSDALPAGFQDRTAERGIVRMGWAPQMEILGHPA 363
            +IA GLELSELPFLW+LRKP+WA  ++DALP GF DR +E+G+V +GW PQMEIL HP+
Sbjct: 302 FEIAHGLELSELPFLWALRKPNWANSEADALPLGFVDRVSEKGLVCIGWVPQMEILAHPS 361

Query: 364 IGGCFFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVEKGVAVEVERKEEDGSFSGE 423
           +GG  FH GWGS IE LQFGH LV+LPFI+DQPLNARLL EKG+AVEV+R+  DGSFS +
Sbjct: 362 VGGSLFHSGWGSIIETLQFGHVLVVLPFIIDQPLNARLLEEKGMAVEVKRR-GDGSFSRD 421

Query: 424 DIAKALKEAMASEEGEKIRRRATEMSAIFGDTKLHQ-RYIEEFVEFLKNGDSNQ 467
           DIAK L+ AM  EEGE++R  A + + +FGD KLHQ  YI +FV FLKN  + +
Sbjct: 422 DIAKTLRHAMVEEEGERLRSNARKAATVFGDHKLHQDHYIGKFVNFLKNNTTKR 472
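
The percentages in each HSP summary line are the matched-column counts divided by the alignment length (for example, 266/474 for the Malus domestica hit). The following is a minimal Python sketch for re-deriving them, assuming the summary-line layout shown in the alignments above. The names `parse_hsp_stats` and `percent` are hypothetical helpers introduced here for illustration and are not part of any BLAST toolkit.

```python
import re

# Pattern for the summary line printed under each HSP score line.
# "Posi?tives" also tolerates the "Postives" spelling seen in some reports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, length):
    """Return count/length as a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, length, ident_pct, pos, pos_length, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(length),
        "positives_length": int(pos_length),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    line = "Identity = 266/474 (56.12%), Positives = 335/474 (70.68%), Query Frame = 1"
    stats = parse_hsp_stats(line)
    # Recompute the percentages from the raw counts and compare with the report.
    print(percent(stats["identities"], stats["length"]), stats["identity_pct"])
    print(percent(stats["positives"], stats["length"]), stats["positives_pct"])
```

Recomputing from the raw counts is a cheap consistency check when summary tables and alignment blocks are copied separately.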

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
URT1_FRAAN	7.1e-91	40.30	Putative UDP-rhamnose:rhamnosyltransferase 1 OS=Fragaria ananassa GN=GT4 PE=2 SV=...
SGT3_SOYBN	1.9e-88	39.61	Soyasaponin III rhamnosyltransferase OS=Glycine max GN=GmSGT3 PE=1 SV=1
U91A1_ARATH	6.2e-87	39.30	UDP-glycosyltransferase 91A1 OS=Arabidopsis thaliana GN=UGT91A1 PE=2 SV=1
U91D1_STERE	7.6e-85	37.77	UDP-glycosyltransferase 91D1 OS=Stevia rebaudiana GN=UGT91D1 PE=2 SV=1
U91C1_ARATH	9.3e-83	36.58	UDP-glycosyltransferase 91C1 OS=Arabidopsis thaliana GN=UGT91C1 PE=2 SV=1

Match Name	E-value	Identity (%)	Description
A0A0A0L7F1_CUCSA	3.4e-225	82.21	Uncharacterized protein OS=Cucumis sativus GN=Csa_3G121730 PE=4 SV=1
M5XKS9_PRUPE	1.3e-139	55.48	Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025376mg PE=4 SV=1
B9GQB9_POPTR	2.2e-139	54.72	Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0002s16380g PE=4 SV=1
B9I8N6_POPTR	3.8e-139	55.32	Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0014s08440g PE=4 SV=1
F6I663_VITVI	1.4e-138	57.57	Putative uncharacterized protein OS=Vitis vinifera GN=VIT_15s0046g01950 PE=4 SV=...

Match Name	E-value	Identity (%)	Description
AT2G22590.1	3.5e-88	39.30	UDP-Glycosyltransferase superfamily protein
AT5G49690.1	5.3e-84	36.58	UDP-Glycosyltransferase superfamily protein
AT5G65550.1	1.4e-76	36.08	UDP-Glycosyltransferase superfamily protein
AT1G64910.1	1.0e-50	31.60	UDP-Glycosyltransferase superfamily protein
AT3G29630.1	2.2e-50	33.62	UDP-Glycosyltransferase superfamily protein

Match Name	E-value	Identity (%)	Description
gi|659075186|ref|XP_008438010.1|	5.3e-227	83.37	PREDICTED: UDP-glycosyltransferase 91A1-like [Cucumis melo]
gi|449433069|ref|XP_004134320.1|	4.9e-225	82.21	PREDICTED: UDP-glycosyltransferase 91A1-like [Cucumis sativus]
gi|747091051|ref|XP_011093242.1|	7.5e-149	59.22	PREDICTED: putative UDP-rhamnose:rhamnosyltransferase 1 [Sesamum indicum]
gi|747091053|ref|XP_011093243.1|	1.8e-147	58.79	PREDICTED: putative UDP-rhamnose:rhamnosyltransferase 1 [Sesamum indicum]
gi|657942934|ref|XP_008346533.1|	2.8e-143	56.12	PREDICTED: putative UDP-rhamnose:rhamnosyltransferase 1 [Malus domestica]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR002213	UDP_glucos_trans
Vocabulary: Molecular Function
Term	Definition
GO:0016758	transferase activity, transferring hexosyl groups
Vocabulary: Biological Process
Term	Definition
GO:0008152	metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016758 transferase activity, transferring hexosyl groups

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CmaCh14G020520.1	CmaCh14G020520.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR002213	UDP-glucuronosyl/UDP-glucosyltransferase	PANTHER	PTHR11926	GLUCOSYL/GLUCURONOSYL TRANSFERASES	coord: 2..461; score: 1.1E
IPR002213	UDP-glucuronosyl/UDP-glucosyltransferase	PFAM	PF00201	UDPGT	coord: 275..426; score: 9.6
None	No IPR available	unknown	Coil	Coil	coord: 459..466; scor
None	No IPR available	GENE3D	G3DSA:3.40.50.2000		coord: 267..438; score: 8.4
None	No IPR available	PANTHER	PTHR11926:SF310	SUBFAMILY NOT NAMED	coord: 2..461; score: 1.1E
None	No IPR available	unknown	SSF53756	UDP-Glycosyltransferase/glycogen phosphorylase	coord: 7..461; score: 2.34

The following gene(s) are paralogous to this gene:

None