Cp4.1LG06g05470.1 (mRNA) Cucurbita pepo (Zucchini)

NameCp4.1LG06g05470.1
TypemRNA
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionGlycosyltransferase
LocationCp4.1LG06 : 3350292 .. 3353275 (+)
Sequence length1600
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TGGAGTCATTGTGCCAAACATGAAACATGTCTCAAACAAACCAAGGGTGCTATTCGTGCCATACCCGGCTCAAGGTCACGTCACACCGATGCTCATGCTCGCCGCCGTTTTCCAGCGCCGTGGTTTCCTTCCTATATTTCTCACGCCTAGCTACATCCACCGTCATATCTCGTCGCAAATTTCACTCACCAATGAAATCCTCTTCATTTCCATGCCAGACACTGTCGATGACAACACGCCACATGACTTTTTCACCATTGAGACTGCCTTAGAGACCACCATGCCGAGCTACGTTCGACGAGTGTTGGGCGAGTACAACTCGAACGAAAGTGGCGTTGTTTGTATGGTGGTCGACTTGCTTGCTTCTTCGGCCATCGAAGTCGGAAAGGAGTATGGCGTAGCCGTGGCTGGGTTTTGGCCAGCCATGTTCGCAACTTATAACCTCATTTCAGCTGTTCCTGACATGGTCAAGAACAACCTTATTTCTTCCGATACAGGTTACCACCCTTTTTATTTTTCATTTTTTTTTGACAAATAAAATAATAACCCATAGTTTTTGGGTCAACCATATTCTGCTCCATTTTTTTATATTATAATTAATTTAATAATCATCATATCTCTCTTTTAAAAAAATGCTTTAATTACCTAAAATAAATAAATAAATAAATATATATTTTAATCTTCATTCCCATTTTATTTTTAAATTTTAGTCGGAGAGATAAGAAAAAAAAACCCTCTATTACTGCCACGTGTCTTCCTCCCATTGGGCTGCTTTTTTCTTCTATCTAAATCTTATATTCAAACATGTAAAATAAATACGGTATTTTAAAAATAAAATCCTATATTTTTTAAGTTTCATTCCAATTTTATTTTAAAATAAATAAATATATATTTTTTAAGTTTCATTCCCATTTTATTTTAAAATTTTAGTCAAAGAGATAAAGAAAAAGAAAATACCGTATTTTTCTATTCCTGCCACGTGTCTTACTCCCATTGGTTGATTTTTCTTCAATCTAAATCCTATATTTAAACATGTAAATTTATCATTAGTCTTTAAACTTTGTCTAATTAAGCTTAATTAACAATCTAAAAGATTATAATGAAGAATGCAAATCTTGAATTTTGAACAGGATGTCCAGGAGAAGGATCAAAACGGTGCGTTCCGAACCAGCCGCTGCTGTCTACAGAAGAACTGCCGTGGCTGATCGGAACTTCGTCGGCAAGGAAAGCGAGGTTCAAATTCTGGACAAGAACCATGGTTCGAGCCAAATCTCTTCAATGGATTCTCGTTAATTCCTTCCCGGAAGAGCTTCCCCTTGAAAACCCCATCCCCAAAAGCTCCGCCGCCGTTTTCCTCGTTGGACCTTTGAGCCGCCACTCGAATCCGGCAAAAACACCCACTTTCTGGGAAGAAGACGACGGCTGCCTGCAGTGGCTGGAGAAACAGAGCCCTAATTCCGTCGTGTACATCTCGTTTGGGAGTTGGGTTAGCCCCATTAACGAATCGAAGGTGAGGAGCTTGGCCGTGGCGCTTTTGGGTCTTAGGAAACCGTTCATTTGGGTGCTGAAAAGTAATTGGCGAGATGGGTTACCAATCGGGTTCACACAAAAGGTACGGTTAAAACATATAGTTTATATTTAATCATAGTTAATATATTTCTTTTCTTTTTTTAATCGGTTAAATTAATAAAAATGCTCTTTAACCTTTTATTTTATCCCTTAAAATTTAAAAAACATAATATATATATATATATTAATTTTTTTTAATGTTAAAAATACTCTTATTATTAGTATATAAATGAAACGTAAAATAAATTTTTTTAAAAAATATTTAAAAATATTTAAGGATCTTAATTTTATTTTTAAAAATTTATGGGTATTTTTGAAACATGATACTAATAATATTTTTGAATTTTTTTTAAATTTTAAGTTATATTTAAAAATATATGAAAATCTTAATTTTATTTTGTAAAATTTATGGATATTTTAGAAACATGATATTAATAATATTCTTGAATTTTAATAAAAAATTTAAAATGTTTTTGAAGTTTTAAGGTATATTTTTAAATATATTTAAAACATATAAGGATCTTAATTTTATTTTTTAAAATTTATGGGTATTTTTGAACTTTAATAAAAATTTAAAATGTTTTTAAAGTTTTAAAGTAAAAATATTTTTAAATTTTTTTAAAATTTAAGAATATGGAAACTTTTAAAAAATTAAGAATTTTTTTAGATAAAATATAGATAAAGATTTTTATTTATTTATTAATTAAGCATTTTTAATTCATAATTTGGAAGCCAGGTAGTAATTGAGTTAGGTTCAAATTATTACTAATCTTATTTATTCATATTTGTTTCAGATTCAAAGATACGGGAGGCTCGTTTCATGGGCTCCACAGATGGAGATTTTAAAGCATAGAGCAGTCGGTTGTTATCTAACTCACTGTGGATGGAACTCGATCATGGAAGCGATTCAGTGCCGAAAGCGACTACTTTGTTTCCCGGTGGCGGGCGACCAATTTTTGAATTGTGGGTATGTAGTGAAAGTGTGGAGAATTGGGTTGAAACTCAATGGTTTTGGAGAGAAAGAGGTTGAAGAAGGAGTGAGGAAAGTGATGGAGGATGGAGAGATGAAGGCGAGGATGATGAAGCTTCATGAGAGGATAATGGGGGAAGATGCCAATTCTAGAGTCAACTCTAGCTTCACGACCTTCATCAAAGACATCAATAAGCTTTCGTTCGATAAATTTCTGTAGCGTTTAAGCTGTCGGGTTTAGTTTTGGTGAAAAAAATGTGACCTAATTTGATTGTACGACCCGATAATAGGAAGAAATTGATGTTGTTTGGTTTAGGTTCCATCTTAAAACTGGTTAACTGGTCTGTATGTCTATATATCGTATAAAATTGTGAGATATCTTCATGTTACGATTTATCCTTAATAGGATTCATTGAAAAAAATATTTAAAGATATATTATAATTT

mRNA sequence

TGGAGTCATTGTGCCAAACATGAAACATGTCTCAAACAAACCAAGGGTGCTATTCGTGCCATACCCGGCTCAAGGTCACGTCACACCGATGCTCATGCTCGCCGCCGTTTTCCAGCGCCGTGGTTTCCTTCCTATATTTCTCACGCCTAGCTACATCCACCGTCATATCTCGTCGCAAATTTCACTCACCAATGAAATCCTCTTCATTTCCATGCCAGACACTGTCGATGACAACACGCCACATGACTTTTTCACCATTGAGACTGCCTTAGAGACCACCATGCCGAGCTACGTTCGACGAGTGTTGGGCGAGTACAACTCGAACGAAAGTGGCGTTGTTTGTATGGTGGTCGACTTGCTTGCTTCTTCGGCCATCGAAGTCGGAAAGGAGTATGGCGTAGCCGTGGCTGGGTTTTGGCCAGCCATGTTCGCAACTTATAACCTCATTTCAGCTGTTCCTGACATGGTCAAGAACAACCTTATTTCTTCCGATACAGGATGTCCAGGAGAAGGATCAAAACGGTGCGTTCCGAACCAGCCGCTGCTGTCTACAGAAGAACTGCCGTGGCTGATCGGAACTTCGTCGGCAAGGAAAGCGAGGTTCAAATTCTGGACAAGAACCATGGTTCGAGCCAAATCTCTTCAATGGATTCTCGTTAATTCCTTCCCGGAAGAGCTTCCCCTTGAAAACCCCATCCCCAAAAGCTCCGCCGCCGTTTTCCTCGTTGGACCTTTGAGCCGCCACTCGAATCCGGCAAAAACACCCACTTTCTGGGAAGAAGACGACGGCTGCCTGCAGTGGCTGGAGAAACAGAGCCCTAATTCCGTCGTGTACATCTCGTTTGGGAGTTGGGTTAGCCCCATTAACGAATCGAAGGTGAGGAGCTTGGCCGTGGCGCTTTTGGGTCTTAGGAAACCGTTCATTTGGGTGCTGAAAAGTAATTGGCGAGATGGGTTACCAATCGGGTTCACACAAAAGATTCAAAGATACGGGAGGCTCGTTTCATGGGCTCCACAGATGGAGATTTTAAAGCATAGAGCAGTCGGTTGTTATCTAACTCACTGTGGATGGAACTCGATCATGGAAGCGATTCAGTGCCGAAAGCGACTACTTTGTTTCCCGGTGGCGGGCGACCAATTTTTGAATTGTGGGTATGTAGTGAAAGTGTGGAGAATTGGGTTGAAACTCAATGGTTTTGGAGAGAAAGAGGTTGAAGAAGGAGTGAGGAAAGTGATGGAGGATGGAGAGATGAAGGCGAGGATGATGAAGCTTCATGAGAGGATAATGGGGGAAGATGCCAATTCTAGAGTCAACTCTAGCTTCACGACCTTCATCAAAGACATCAATAAGCTTTCGTTCGATAAATTTCTGTAGCGTTTAAGCTGTCGGGTTTAGTTTTGGTGAAAAAAATGTGACCTAATTTGATTGTACGACCCGATAATAGGAAGAAATTGATGTTGTTTGGTTTAGGTTCCATCTTAAAACTGGTTAACTGGTCTGTATGTCTATATATCGTATAAAATTGTGAGATATCTTCATGTTACGATTTATCCTTAATAGGATTCATTGAAAAAAATATTTAAAGATATATTATAATTT

Coding sequence (CDS)

ATGAAACATGTCTCAAACAAACCAAGGGTGCTATTCGTGCCATACCCGGCTCAAGGTCACGTCACACCGATGCTCATGCTCGCCGCCGTTTTCCAGCGCCGTGGTTTCCTTCCTATATTTCTCACGCCTAGCTACATCCACCGTCATATCTCGTCGCAAATTTCACTCACCAATGAAATCCTCTTCATTTCCATGCCAGACACTGTCGATGACAACACGCCACATGACTTTTTCACCATTGAGACTGCCTTAGAGACCACCATGCCGAGCTACGTTCGACGAGTGTTGGGCGAGTACAACTCGAACGAAAGTGGCGTTGTTTGTATGGTGGTCGACTTGCTTGCTTCTTCGGCCATCGAAGTCGGAAAGGAGTATGGCGTAGCCGTGGCTGGGTTTTGGCCAGCCATGTTCGCAACTTATAACCTCATTTCAGCTGTTCCTGACATGGTCAAGAACAACCTTATTTCTTCCGATACAGGATGTCCAGGAGAAGGATCAAAACGGTGCGTTCCGAACCAGCCGCTGCTGTCTACAGAAGAACTGCCGTGGCTGATCGGAACTTCGTCGGCAAGGAAAGCGAGGTTCAAATTCTGGACAAGAACCATGGTTCGAGCCAAATCTCTTCAATGGATTCTCGTTAATTCCTTCCCGGAAGAGCTTCCCCTTGAAAACCCCATCCCCAAAAGCTCCGCCGCCGTTTTCCTCGTTGGACCTTTGAGCCGCCACTCGAATCCGGCAAAAACACCCACTTTCTGGGAAGAAGACGACGGCTGCCTGCAGTGGCTGGAGAAACAGAGCCCTAATTCCGTCGTGTACATCTCGTTTGGGAGTTGGGTTAGCCCCATTAACGAATCGAAGGTGAGGAGCTTGGCCGTGGCGCTTTTGGGTCTTAGGAAACCGTTCATTTGGGTGCTGAAAAGTAATTGGCGAGATGGGTTACCAATCGGGTTCACACAAAAGATTCAAAGATACGGGAGGCTCGTTTCATGGGCTCCACAGATGGAGATTTTAAAGCATAGAGCAGTCGGTTGTTATCTAACTCACTGTGGATGGAACTCGATCATGGAAGCGATTCAGTGCCGAAAGCGACTACTTTGTTTCCCGGTGGCGGGCGACCAATTTTTGAATTGTGGGTATGTAGTGAAAGTGTGGAGAATTGGGTTGAAACTCAATGGTTTTGGAGAGAAAGAGGTTGAAGAAGGAGTGAGGAAAGTGATGGAGGATGGAGAGATGAAGGCGAGGATGATGAAGCTTCATGAGAGGATAATGGGGGAAGATGCCAATTCTAGAGTCAACTCTAGCTTCACGACCTTCATCAAAGACATCAATAAGCTTTCGTTCGATAAATTTCTGTAG

Protein sequence

MKHVSNKPRVLFVPYPAQGHVTPMLMLAAVFQRRGFLPIFLTPSYIHRHISSQISLTNEILFISMPDTVDDNTPHDFFTIETALETTMPSYVRRVLGEYNSNESGVVCMVVDLLASSAIEVGKEYGVAVAGFWPAMFATYNLISAVPDMVKNNLISSDTGCPGEGSKRCVPNQPLLSTEELPWLIGTSSARKARFKFWTRTMVRAKSLQWILVNSFPEELPLENPIPKSSAAVFLVGPLSRHSNPAKTPTFWEEDDGCLQWLEKQSPNSVVYISFGSWVSPINESKVRSLAVALLGLRKPFIWVLKSNWRDGLPIGFTQKIQRYGRLVSWAPQMEILKHRAVGCYLTHCGWNSIMEAIQCRKRLLCFPVAGDQFLNCGYVVKVWRIGLKLNGFGEKEVEEGVRKVMEDGEMKARMMKLHERIMGEDANSRVNSSFTTFIKDINKLSFDKFL
BLAST of Cp4.1LG06g05470.1 vs. Swiss-Prot
Match: U82A1_ARATH (UDP-glycosyltransferase 82A1 OS=Arabidopsis thaliana GN=UGT82A1 PE=2 SV=1)

HSP 1 Score: 463.8 bits (1192), Expect = 2.2e-129
Identity = 238/467 (50.96%), Postives = 314/467 (67.24%), Query Frame = 1

Query: 4   VSNKPRVLFVPYPAQGHVTPMLMLAAVFQRRGFLPIFLTPSYIHRHISSQISLTNE---I 63
           V+ KP+++F+PYPAQGHVTPML LA+ F  RGF P+ +TP  IHR IS+    TNE   I
Sbjct: 3   VTQKPKIIFIPYPAQGHVTPMLHLASAFLSRGFSPVVMTPESIHRRISA----TNEDLGI 62

Query: 64  LFISMPDTVD--DNTPHDFFTIETALETTMPSYVRRVLGEYNSNESGVVCMVVDLLASSA 123
            F+++ D  D  D  P DFF+IE ++E  MP  + R+L E    +  V C+VVDLLAS A
Sbjct: 63  TFLALSDGQDRPDAPPSDFFSIENSMENIMPPQLERLLLE---EDLDVACVVVDLLASWA 122

Query: 124 IEVGKEYGVAVAGFWPAMFATYNLISAVPDMVKNNLISSDTGCPGEGSKRCV-PNQPLLS 183
           I V    GV VAGFWP MFA Y LI A+P++V+  L+S   GCP +  K  V P QPLLS
Sbjct: 123 IGVADRCGVPVAGFWPVMFAAYRLIQAIPELVRTGLVSQK-GCPRQLEKTIVQPEQPLLS 182

Query: 184 TEELPWLIGTSSARKARFKFWTRTMVRAKSLQWILVNSFPEELP----------LENPIP 243
            E+LPWLIGT  A+K RFKFW RT+ R KSL+WIL +SF +E              N + 
Sbjct: 183 AEDLPWLIGTPKAQKKRFKFWQRTLERTKSLRWILTSSFKDEYEDVDNHKASYKKSNDLN 242

Query: 244 KSSAA----VFLVGPLSRH---SNPAKTPT-FWEEDDGCLQWLEKQSPNSVVYISFGSWV 303
           K +      +  +GPL      +N   T T FWEED  CL WL++Q+PNSV+YISFGSWV
Sbjct: 243 KENNGQNPQILHLGPLHNQEATNNITITKTSFWEEDMSCLGWLQEQNPNSVIYISFGSWV 302

Query: 304 SPINESKVRSLAVALLGLRKPFIWVLKSNWRDGLPIGFTQKI---QRYGRLVSWAPQMEI 363
           SPI ES +++LA+AL    +PF+W L   W++GLP GF  ++   +  GR+VSWAPQ+E+
Sbjct: 303 SPIGESNIQTLALALEASGRPFLWALNRVWQEGLPPGFVHRVTITKNQGRIVSWAPQLEV 362

Query: 364 LKHRAVGCYLTHCGWNSIMEAIQCRKRLLCFPVAGDQFLNCGYVVKVWRIGLKLNGFGEK 423
           L++ +VGCY+THCGWNS MEA+   +RLLC+PVAGDQF+NC Y+V VW+IG++L+GFGEK
Sbjct: 363 LRNDSVGCYVTHCGWNSTMEAVASSRRLLCYPVAGDQFVNCKYIVDVWKIGVRLSGFGEK 422

Query: 424 EVEEGVRKVMEDGEMKARMMKLHERIMGEDANSRVNSSFTTFIKDIN 444
           EVE+G+RKVMED +M  R+ KL +R MG +A      +FT    ++N
Sbjct: 423 EVEDGLRKVMEDQDMGERLRKLRDRAMGNEARLSSEMNFTFLKNELN 461

BLAST of Cp4.1LG06g05470.1 vs. Swiss-Prot
Match: U83A1_ARATH (UDP-glycosyltransferase 83A1 OS=Arabidopsis thaliana GN=UGT83A1 PE=2 SV=1)

HSP 1 Score: 197.6 bits (501), Expect = 2.9e-49
Identity = 128/461 (27.77%), Postives = 221/461 (47.94%), Query Frame = 1

Query: 7   KPRVLFVPYPAQGHVTPMLMLAAVFQRRGFLPIFLTPSYIHRHISSQIS-------LTNE 66
           +P V+ +PYPAQGHV P++  +    ++G    F+   + H  I S +        + ++
Sbjct: 11  RPHVVVIPYPAQGHVLPLISFSRYLAKQGIQITFINTEFNHNRIISSLPNSPHEDYVGDQ 70

Query: 67  ILFISMPDTVDDNTPHDFFT--IETALETTMPSYVRRVLGEYNSNESG---VVCMVVDLL 126
           I  +S+PD ++D+         +  ++   MP  V  ++    +  SG   + C+V D  
Sbjct: 71  INLVSIPDGLEDSPEERNIPGKLSESVLRFMPKKVEELIERMMAETSGGTIISCVVADQS 130

Query: 127 ASSAIEVGKEYGVAVAGFWPAMFATYNLISAVPDMVKNNLISSDTGCPGEGSKRCVPNQP 186
              AIEV  ++G+    F PA  A+  L  ++  ++ + LI SD       + +  P  P
Sbjct: 131 LGWAIEVAAKFGIRRTAFCPAAAASMVLGFSIQKLIDDGLIDSDGTVRVNKTIQLSPGMP 190

Query: 187 LLSTEELPWL-IGTSSARKARFKFWTRTMVRAKSLQWILVNSFPEELPLENPIPKSSAAV 246
            + T++  W+ +    ++K  F+   +     +S  W+L NS  E   LE         +
Sbjct: 191 KMETDKFVWVCLKNKESQKNIFQLMLQNNNSIESTDWLLCNSVHE---LETAAFGLGPNI 250

Query: 247 FLVGPL----SRHSNPAKTPTFWEEDDGCLQWLEKQSPNSVVYISFGSWVSPINESKVRS 306
             +GP+    S         +F   D  CL WL++Q P SV+Y++FGS+   +   ++  
Sbjct: 251 VPIGPIGWAHSLEEGSTSLGSFLPHDRDCLDWLDRQIPGSVIYVAFGSF-GVMGNPQLEE 310

Query: 307 LAVALLGLRKPFIWVLKSNWRDGLPIGFTQKIQRYGRLVSWAPQMEILKHRAVGCYLTHC 366
           LA+ L   ++P +WV     +  + +G  +      ++V WAPQ E+L   A+GC+++HC
Sbjct: 311 LAIGLELTKRPVLWVTGD--QQPIKLGSDRV-----KVVRWAPQREVLSSGAIGCFVSHC 370

Query: 367 GWNSIMEAIQCRKRLLCFPVAGDQFLNCGYVVKVWRIGLKLNG-----FGEKEVEEGVRK 426
           GWNS +E  Q     LC P   DQF+N  Y+  VW+IGL L           EV++ + +
Sbjct: 371 GWNSTLEGAQNGIPFLCIPYFADQFINKAYICDVWKIGLGLERDARGVVPRLEVKKKIDE 430

Query: 427 VMED-GEMKARMMKLHERIMGEDANSRVN----SSFTTFIK 441
           +M D GE + R MK+ E +M   A   ++    + F  +IK
Sbjct: 431 IMRDGGEYEERAMKVKEIVMKSVAKDGISCENLNKFVNWIK 460

BLAST of Cp4.1LG06g05470.1 vs. Swiss-Prot
Match: U85A1_ARATH (UDP-glycosyltransferase 85A1 OS=Arabidopsis thaliana GN=UGT85A1 PE=2 SV=1)

HSP 1 Score: 193.0 bits (489), Expect = 7.2e-48
Identity = 131/475 (27.58%), Postives = 225/475 (47.37%), Query Frame = 1

Query: 3   HVSNKPRVLFVPYPAQGHVTPMLMLAAVFQRRGFLPIFLTPSYIHRHI-----SSQISLT 62
           H S KP V+ VPYPAQGH+ PM+ +A +   RGF   F+   Y H        S+ +   
Sbjct: 7   HNSQKPHVVCVPYPAQGHINPMMRVAKLLHARGFYVTFVNTVYNHNRFLRSRGSNALDGL 66

Query: 63  NEILFISMPDTVDDNTPHDFFTIETALETTMPSYV---RRVLGEYNSNES--GVVCMVVD 122
               F S+ D + +        I    E+TM + +   R +L   N+ ++   V C+V D
Sbjct: 67  PSFRFESIADGLPETDMDATQDITALCESTMKNCLAPFRELLQRINAGDNVPPVSCIVSD 126

Query: 123 LLASSAIEVGKEYGVAVAGFWPAMFATYNLISAVPDMVKNNL--ISSDTGCPGEGSKRCV 182
              S  ++V +E GV    FW      +         ++  L  +  ++    E  +  V
Sbjct: 127 GCMSFTLDVAEELGVPEVLFWTTSGCAFLAYLHFYLFIEKGLCPLKDESYLTKEYLEDTV 186

Query: 183 ----PNQPLLSTEELPWLIGTSSARKARFKFWTRTMVRAKSLQWILVNSFPE-ELPLENP 242
               P    +  +++P  I T++       F  R   RAK    I++N+F + E  + + 
Sbjct: 187 IDFIPTMKNVKLKDIPSFIRTTNPDDVMISFALRETERAKRASAIILNTFDDLEHDVVHA 246

Query: 243 IPKSSAAVFLVGPLSRHSNPA---------KTPTFWEEDDGCLQWLEKQSPNSVVYISFG 302
           +      V+ VGPL   +N            +   W+E+  CL WL+ ++ NSV+YI+FG
Sbjct: 247 MQSILPPVYSVGPLHLLANREIEEGSEIGMMSSNLWKEEMECLDWLDTKTQNSVIYINFG 306

Query: 303 SWVSPINESKVRSLAVALLGLRKPFIWVLKSNWRDG----LPIGFTQKIQRYGRLVSWAP 362
           S ++ ++  ++   A  L G  K F+WV++ +   G    +P  F  + +    L SW P
Sbjct: 307 S-ITVLSVKQLVEFAWGLAGSGKEFLWVIRPDLVAGEEAMVPPDFLMETKDRSMLASWCP 366

Query: 363 QMEILKHRAVGCYLTHCGWNSIMEAIQCRKRLLCFPVAGDQFLNCGYVVKVWRIGLKLNG 422
           Q ++L H A+G +LTHCGWNSI+E++ C   ++C+P   DQ +NC +    W +G+++ G
Sbjct: 367 QEKVLSHPAIGGFLTHCGWNSILESLSCGVPMVCWPFFADQQMNCKFCCDEWDVGIEIGG 426

Query: 423 FGEKEVEEGVRKVMEDGEMKARMMKL---HERIMGEDANSRVNSSFTTFIKDINK 445
             ++E  E V + + DGE   +M +     +R+  +    ++ SS   F   ++K
Sbjct: 427 DVKREEVEAVVRELMDGEKGKKMREKAVEWQRLAEKATEHKLGSSVMNFETVVSK 480

BLAST of Cp4.1LG06g05470.1 vs. Swiss-Prot
Match: U85A5_ARATH (UDP-glycosyltransferase 85A5 OS=Arabidopsis thaliana GN=UGT85A5 PE=2 SV=1)

HSP 1 Score: 186.8 bits (473), Expect = 5.2e-46
Identity = 126/457 (27.57%), Postives = 216/457 (47.26%), Query Frame = 1

Query: 7   KPRVLFVPYPAQGHVTPMLMLAAVFQRRGFLPIFLTPSYIHRHI-----SSQISLTNEIL 66
           KP V+ +P+PAQGH+ PML +A +   RGF   F+  +Y H  +      + +       
Sbjct: 11  KPHVVCIPFPAQGHINPMLKVAKLLYARGFHVTFVNTNYNHNRLIRSRGPNSLDGLPSFR 70

Query: 67  FISMPDTVDDNTPHDFFTIETALETTMPSYV---RRVLGEYNSNES--GVVCMVVDLLAS 126
           F S+PD + +        + T  E+TM + +   + +L   N+ +    V C+V D + S
Sbjct: 71  FESIPDGLPEENKDVMQDVPTLCESTMKNCLAPFKELLRRINTTKDVPPVSCIVSDGVMS 130

Query: 127 SAIEVGKEYGVAVAGFWPAMFATYNLISAVPDMVKNNLISSDTGCPGEGSKRCVPNQPLL 186
             ++  +E GV    FW      +         ++  L         +     +P+   L
Sbjct: 131 FTLDAAEELGVPDVLFWTPSACGFLAYLHFYRFIEKGLSPIKDESSLDTKINWIPSMKNL 190

Query: 187 STEELPWLIGTSSARKARFKFWTRTMVRAKSLQWILVNSFPEELPLENPIPKSSAA---- 246
             +++P  I  ++       F+     RAK    I++N+F     LE+ + +S  +    
Sbjct: 191 GLKDIPSFIRATNTEDIMLNFFVHEADRAKRASAIILNTFDS---LEHDVVRSIQSIIPQ 250

Query: 247 VFLVGPL--------SRHSNPAKTPT-FWEEDDGCLQWLEKQSPNSVVYISFGSWVSPIN 306
           V+ +GPL           S+  +  T  W E+  CL WL+ +SPNSVVY++FGS ++ ++
Sbjct: 251 VYTIGPLHLFVNRDIDEESDIGQIGTNMWREEMECLDWLDTKSPNSVVYVNFGS-ITVMS 310

Query: 307 ESKVRSLAVALLGLRKPFIWVLKSNWRDG----LPIGFTQKIQRYGRLVSWAPQMEILKH 366
             ++   A  L   +K F+WV++ +   G    LP  F  +      L SW PQ ++L H
Sbjct: 311 AKQLVEFAWGLAATKKDFLWVIRPDLVAGDVPMLPPDFLIETANRRMLASWCPQEKVLSH 370

Query: 367 RAVGCYLTHCGWNSIMEAIQCRKRLLCFPVAGDQFLNCGYVVKVWRIGLKLNG-FGEKEV 426
            AVG +LTH GWNS +E++     ++C+P   +Q  NC Y    W +G+++ G    +EV
Sbjct: 371 PAVGGFLTHSGWNSTLESLSGGVPMVCWPFFAEQQTNCKYCCDEWEVGMEIGGDVRREEV 430

Query: 427 EEGVRKVMEDGEMKARMMKLHE-RIMGEDANSRVNSS 435
           EE VR++M+  + K    K  E + + E+A   +  S
Sbjct: 431 EELVRELMDGDKGKKMRQKAEEWQRLAEEATKPIYGS 463

BLAST of Cp4.1LG06g05470.1 vs. Swiss-Prot
Match: U76E2_ARATH (UDP-glycosyltransferase 76E2 OS=Arabidopsis thaliana GN=UGT76E2 PE=2 SV=1)

HSP 1 Score: 180.3 bits (456), Expect = 4.8e-44
Identity = 131/433 (30.25%), Postives = 215/433 (49.65%), Query Frame = 1

Query: 9   RVLFVPYPAQGHVTPMLMLAAVFQRRGF-LPIFLTPSYIHRHISSQISLTNEILFISMPD 68
           R++ VP PAQGHVTPM+ L      +GF + + LT S     +SS    + +  F+++P 
Sbjct: 10  RIVLVPVPAQGHVTPMMQLGKALHSKGFSITVVLTQS---NRVSSSKDFS-DFHFLTIPG 69

Query: 69  TVDDNT-----PHDF-FTIETALETTMPSYVRRVLGEYNSNESGVVCMVVDLLASSAIEV 128
           ++ ++      P  F   +    E +    + ++L E  +N+  + C+V D     +   
Sbjct: 70  SLTESDLQNLGPQKFVLKLNQICEASFKQCIGQLLHEQCNND--IACVVYDEYMYFSHAA 129

Query: 129 GKEYGVAVAGFWPAMFATYNLISAVPDMVKNNLISSDTGCPGEGSKRCVPNQPLLSTEEL 188
            KE+ +    F     AT  +  +V   V       D   P E   +  P    L  ++L
Sbjct: 130 VKEFQLPSVVF-STTSATAFVCRSVLSRVNAESFLIDMKDP-ETQDKVFPGLHPLRYKDL 189

Query: 189 PWLIGTSSARKARFKFWTRTMVRAKSLQWILVNSFP--EELPLENPIPKSSAAVFLVGPL 248
           P  +      ++  K ++ T V  ++   +++NS    E   L     +    V+ +GPL
Sbjct: 190 PTSV--FGPIESTLKVYSET-VNTRTASAVIINSASCLESSSLARLQQQLQVPVYPIGPL 249

Query: 249 SRHSNPAKTPTFWEEDDGCLQWLEKQSPNSVVYISFGSWVSPINESKVRSLAVALLGLRK 308
             H   +   +  EED  C++WL KQ  NSV+YIS GS ++ ++   +  +A  L    +
Sbjct: 250 --HITASAPSSLLEEDRSCVEWLNKQKSNSVIYISLGS-LALMDTKDMLEMAWGLSNSNQ 309

Query: 309 PFIWVLK------SNWRDGLPIGFTQKIQRYGRLVSWAPQMEILKHRAVGCYLTHCGWNS 368
           PF+WV++      S W + LP  F + +   G +V WAPQME+L+H AVG + +HCGWNS
Sbjct: 310 PFLWVVRPGSIPGSEWTESLPEEFNRLVSERGYIVKWAPQMEVLRHPAVGGFWSHCGWNS 369

Query: 369 IMEAIQCRKRLLCFPVAGDQFLNCGYVVKVWRIGLKLNGFGEKE-VEEGVRKVM---EDG 423
            +E+I     ++C P  GDQ +N  Y+ +VWRIG++L G  +KE VE  V  ++   E  
Sbjct: 370 TVESIGEGVPMICRPFTGDQKVNARYLERVWRIGVQLEGDLDKETVERAVEWLLVDEEGA 428

BLAST of Cp4.1LG06g05470.1 vs. TrEMBL
Match: A0A0A0L1R0_CUCSA (Glycosyltransferase OS=Cucumis sativus GN=Csa_4G617410 PE=3 SV=1)

HSP 1 Score: 747.7 bits (1929), Expect = 8.5e-213
Identity = 360/449 (80.18%), Postives = 398/449 (88.64%), Query Frame = 1

Query: 1   MKHVSNKPRVLFVPYPAQGHVTPMLMLAAVFQRRGFLPIFLTPSYIHRHISSQISLTNEI 60
           MK+   KP+V+ VPYPAQGHVTPMLMLAAVF RRGFLPIFLTPSYIH HISSQ+S ++ I
Sbjct: 1   MKYALKKPKVILVPYPAQGHVTPMLMLAAVFHRRGFLPIFLTPSYIHCHISSQVSSSDGI 60

Query: 61  LFISMPDTVDDNTPHDFFTIETALETTMPSYVRRVLGEYNSNES----GVVCMVVDLLAS 120
           +F+SM D +DDN P DFFTIE A+ETTMP  +R+VL E+NS ES    GVVCMVVDLLAS
Sbjct: 61  IFVSMSDGLDDNMPRDFFTIEAAIETTMPVCLRQVLSEHNSKESSGGTGVVCMVVDLLAS 120

Query: 121 SAIEVGKEYGVAVAGFWPAMFATYNLISAVPDMVKNNLISSDTGCPGEGSKRCVPNQPLL 180
           SAIEVG E+GV V GFWPAMFATY L+S +P+M++NN ISSDTGCP EGSKRCVP+QPLL
Sbjct: 121 SAIEVGNEFGVTVVGFWPAMFATYKLMSTIPEMIQNNFISSDTGCPEEGSKRCVPSQPLL 180

Query: 181 STEELPWLIGTSSARKARFKFWTRTMVRAKSLQWILVNSFPEEL-PLENPIPKSSAA-VF 240
           S EELPWL+GTSSA K RFKFW RTM RA+S+  +LVNSFPEEL PL+  I KSSAA VF
Sbjct: 181 SAEELPWLVGTSSAIKGRFKFWKRTMARARSVHCLLVNSFPEELLPLQKLITKSSAASVF 240

Query: 241 LVGPLSRHSNPAKTPTFWEEDDGCLQWLEKQSPNSVVYISFGSWVSPINESKVRSLAVAL 300
           LVGPLSRHSNPAKTPTFWEEDDGC++WLEKQ PNSV+YISFGSWVSPINESKVRSLA+ L
Sbjct: 241 LVGPLSRHSNPAKTPTFWEEDDGCVKWLEKQRPNSVIYISFGSWVSPINESKVRSLAMTL 300

Query: 301 LGLRKPFIWVLKSNWRDGLPIGFTQKIQRYGRLVSWAPQMEILKHRAVGCYLTHCGWNSI 360
           LGL+ PFIWVLK+NWRDGLPIGF QKIQ YGRLVSWAPQ+EILKHRAVGCYLTHCGWNSI
Sbjct: 301 LGLKNPFIWVLKNNWRDGLPIGFQQKIQSYGRLVSWAPQIEILKHRAVGCYLTHCGWNSI 360

Query: 361 MEAIQCRKRLLCFPVAGDQFLNCGYVVKVWRIGLKLNGFGEKEVEEGVRKVMEDGEMKAR 420
           MEAIQ  KRLLCFPVAGDQFLNCGYVVKVWRIG++LNGFGEKEVEEG+RKVMEDGEMK R
Sbjct: 361 MEAIQYGKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNGFGEKEVEEGMRKVMEDGEMKGR 420

Query: 421 MMKLHERIMGEDANSRVNSSFTTFIKDIN 444
            MKLHERIMGE+AN RVNS+FTTFI +IN
Sbjct: 421 FMKLHERIMGEEANCRVNSNFTTFINEIN 449

BLAST of Cp4.1LG06g05470.1 vs. TrEMBL
Match: F6HGE7_VITVI (Glycosyltransferase OS=Vitis vinifera GN=VIT_01s0010g00530 PE=3 SV=1)

HSP 1 Score: 544.3 bits (1401), Expect = 1.4e-151
Identity = 257/451 (56.98%), Postives = 343/451 (76.05%), Query Frame = 1

Query: 1   MKHVSNKPRVLFVPYPAQGHVTPMLMLAAVFQRRGFLPIFLTPSYIHRHISSQISLTNEI 60
           MK++  +P +L VPYPAQGHVTP+L LA+    +GF+P+ +TP +IHR I+ ++   + I
Sbjct: 1   MKYMK-RPMILLVPYPAQGHVTPLLKLASCLVTQGFMPVMITPEFIHRQIAPRVDAKDGI 60

Query: 61  LFISMPDTVDDNTPHDFFTIETALETTMPSYVRRVLGEYNSNESGVVCMVVDLLASSAIE 120
           L +S+PD VD++ P DFFTIE  +E TMP Y+ R++ + +  +  VVCMVVDLLAS AI+
Sbjct: 61  LCMSIPDGVDEDLPRDFFTIEMTMENTMPVYLERLIRKLDE-DGRVVCMVVDLLASWAIK 120

Query: 121 VGKEYGVAVAGFWPAMFATYNLISAVPDMVKNNLISSDTGCPGEGSKRC-VPNQPLLSTE 180
           V    GV  AGFWPAM ATY LISA+P++++  LIS +TG P E  K C +P QP LSTE
Sbjct: 121 VADHCGVPAAGFWPAMLATYGLISAIPELIRTGLIS-ETGIPEEQRKICFLPCQPELSTE 180

Query: 181 ELPWLIGTSSARKARFKFWTRTMVRAKSLQWILVNSFPEELP---LENPI---PKSSAAV 240
           +LPWLIGT +A++ARF+FWTRT  RAK+L WILVNSFPEE     L+N +   P     +
Sbjct: 181 DLPWLIGTFTAKRARFEFWTRTFARAKTLPWILVNSFPEECSDGKLQNQLIYSPGDGPRL 240

Query: 241 FLVGPLSRHSNPAKTPTFWEEDDGCLQWLEKQSPNSVVYISFGSWVSPINESKVRSLAVA 300
             +GPL RH+   +TP+ WEED  CL WLE+Q P +VVYISFGSWVSPI E +VR LA+A
Sbjct: 241 LQIGPLIRHA-AIRTPSLWEEDFNCLDWLEQQKPCTVVYISFGSWVSPIGEPRVRDLALA 300

Query: 301 LLGLRKPFIWVLKSNWRDGLPIGFTQKIQRYGRLVSWAPQMEILKHRAVGCYLTHCGWNS 360
           L    +PFIWVL+ NWR+GLP+G+ +++ + G++VSWAPQME+L+H AVGCYLTHCGWNS
Sbjct: 301 LEASGRPFIWVLRPNWREGLPVGYLERVSKQGKVVSWAPQMELLQHEAVGCYLTHCGWNS 360

Query: 361 IMEAIQCRKRLLCFPVAGDQFLNCGYVVKVWRIGLKLNGFGEKEVEEGVRKVMEDGEMKA 420
            +EAIQC+KRLLC+PVAGDQF+NC Y+V VW+IG++++GFG++++EEG+RKVMED EM  
Sbjct: 361 TLEAIQCQKRLLCYPVAGDQFVNCAYIVNVWQIGVRIHGFGQRDLEEGMRKVMEDSEMNK 420

Query: 421 RMMKLHERIMGEDANSRVNSSFTTFIKDINK 445
           R+ KL+ERIMGE+A  RV ++ TTF  ++ K
Sbjct: 421 RLSKLNERIMGEEAGLRVMTNITTFTDNLKK 447

BLAST of Cp4.1LG06g05470.1 vs. TrEMBL
Match: A0A061DMA7_THECC (Glycosyltransferase OS=Theobroma cacao GN=TCM_002013 PE=3 SV=1)

HSP 1 Score: 527.3 bits (1357), Expect = 1.8e-146
Identity = 243/449 (54.12%), Postives = 333/449 (74.16%), Query Frame = 1

Query: 8   PRVLFVPYPAQGHVTPMLMLAAVFQRRGFLPIFLTPSYIHRHISSQISLTNEILFISMPD 67
           P+++ VPYPAQGHVTPML L + F  +GF PI +TP +IH  I++ +   +EI F+S+PD
Sbjct: 10  PKIILVPYPAQGHVTPMLKLGSAFLGQGFQPIIVTPEFIHHRITANMDPIDEIRFLSIPD 69

Query: 68  TVDDNTPHDFFTIETALETTMPSYVRRVLGEYNSNESG--VVCMVVDLLASSAIEVGKEY 127
            + +  PHDFF IE A+E TMP+++  ++   +  E    V C+V+DLLAS AI+V    
Sbjct: 70  GLSEEGPHDFFAIEKAMENTMPTHLEGLIHRVDEEEEDGRVACVVIDLLASWAIQVAYRC 129

Query: 128 GVAVAGFWPAMFATYNLISAVPDMVKNNLISSD---TGCPG-EGSKRCVPNQPLLSTEEL 187
            +  AGFWP M  TY LI+A+PDM++++LIS      GCP  +G+  C+P QP+LSTE+L
Sbjct: 130 RIPAAGFWPNMQITYRLITAIPDMLRSSLISKTGHVAGCPQRQGTVCCLPGQPMLSTEDL 189

Query: 188 PWLIGTSSARKARFKFWTRTMVRAKSLQWILVNSFPEELPLE--NPIPKSSAAVFLVGPL 247
           PWLIGT +AR ARFKFWTRT+ R++SL+W+LVNSFP E   +  N     +  VF VGPL
Sbjct: 190 PWLIGTQAARNARFKFWTRTLERSRSLRWLLVNSFPHEFTGDDHNSTDHDNPIVFPVGPL 249

Query: 248 SRHSNPAKTPTFWEEDDGCLQWLEKQSPNSVVYISFGSWVSPINESKVRSLAVALLGLRK 307
           S+ +   K P+FWEED  C+ WL+K+ PNSV+YISFGSWVSPI ++K+++LA+ L  LR+
Sbjct: 250 SKPAI-VKNPSFWEEDSSCIDWLDKRKPNSVLYISFGSWVSPIGDAKIKTLALTLEALRR 309

Query: 308 PFIWVLKSNWRDGLPIGFTQKIQRYGRLVSWAPQMEILKHRAVGCYLTHCGWNSIMEAIQ 367
           PFIWVL   WR GLP  + +++ + G++VSWAPQ+++L+H+AVG YLTHCGWNS +EAIQ
Sbjct: 310 PFIWVLAHAWRQGLPNRYLERVSKQGKVVSWAPQLQVLQHKAVGLYLTHCGWNSTVEAIQ 369

Query: 368 CRKRLLCFPVAGDQFLNCGYVVKVWRIGLKLNGFGEKEVEEGVRKVMEDGEMKARMMKLH 427
           C+KRLLCFP+AGDQF+NC Y+VKVW+IG+K+NGFG+K+VE+ +RKV EDGEMK R+MKL+
Sbjct: 370 CQKRLLCFPIAGDQFVNCKYIVKVWKIGVKINGFGQKDVEDALRKVTEDGEMKERLMKLY 429

Query: 428 ERIMGEDANSRVNSSFTTFIKDINKLSFD 449
           ER MGE+A SR  ++   F+ D     F+
Sbjct: 430 ERTMGEEATSRAVANLKAFLLDSTTKQFN 457

BLAST of Cp4.1LG06g05470.1 vs. TrEMBL
Match: M5WYR7_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa016262mg PE=4 SV=1)

HSP 1 Score: 526.6 bits (1355), Expect = 3.1e-146
Identity = 252/450 (56.00%), Postives = 332/450 (73.78%), Query Frame = 1

Query: 10  VLFVPYPAQGHVTPMLMLAAVFQRRGFLPIFLTPSYIHRHISSQISLTNEILFISMPDTV 69
           ++ VPYPAQGHVTPML LA+ F   GF  + +TP +IH  I  ++   ++IL + +PD +
Sbjct: 14  IILVPYPAQGHVTPMLKLASAFLSHGFKSVLVTPDHIHNQIVPKVEQNDKILCMPIPDGL 73

Query: 70  DDNTPHDFFTIETALETTMPSYVRRVLGEYNSNESG--VVCMVVDLLASSAIEVGKEYGV 129
           D + P DFF IE A+E TMP ++  ++ + + ++ G  VVC+V DLLAS AI+V    GV
Sbjct: 74  DKDAPRDFFAIEKAMENTMPGHLESLVHQLDHHQDGDQVVCIVADLLASWAIDVANRCGV 133

Query: 130 AVAGFWPAMFATYNLISAVPDMVKNNLISSDTGCPGEGSKRCVPNQPLLSTEELPWLIGT 189
             AGFWPAM ATY LI+A+PDMV+  LIS DTG P +    C+PNQP+LS+E+LPWLIGT
Sbjct: 134 PSAGFWPAMLATYRLITAIPDMVRTGLIS-DTGFPKQLGGVCLPNQPMLSSEDLPWLIGT 193

Query: 190 SSARKARFKFWTRTMVRAKSLQWILVNSFPEE--------LPLENPIPKSSAA----VFL 249
            ++RKARFKFW RT+ R+K+L W+LVNSFP E        L     +  ++ A    VF 
Sbjct: 194 PASRKARFKFWKRTLDRSKTLPWLLVNSFPNEYCTNGEQQLDHHQLVKMNTQAQQPLVFP 253

Query: 250 VGPLSRHSNPAKTPTFWEEDDGCLQWLEKQSPNSVVYISFGSWVSPINESKVRSLAVALL 309
           +GPLS+H+   K P+FWEED  CL WL+KQ+PNSV+YISFGSWVSPI E+KVRSLA+AL 
Sbjct: 254 IGPLSKHTT-IKNPSFWEEDTSCLTWLDKQNPNSVIYISFGSWVSPIGEAKVRSLALALE 313

Query: 310 GLRKPFIWVLKSNWRDGLPIGFTQKIQRYGRLVSWAPQMEILKHRAVGCYLTHCGWNSIM 369
            L KPF+WVL S+W  GLP G+ +++ R G++VSWAPQ+E+L+H+AVG YL HCGWNS M
Sbjct: 314 ALGKPFLWVLGSSWLGGLPNGYLERVSRQGKVVSWAPQLEVLQHKAVGFYLAHCGWNSTM 373

Query: 370 EAIQCRKRLLCFPVAGDQFLNCGYVVKVWRIGLKLNGFGEKEVEEGVRKVMEDGEMKARM 429
           EAIQC+K LLC+PVAGDQF+NC Y+VKVWRIG+KL GFG+K+VEEG++KV ED EM  R+
Sbjct: 374 EAIQCQKPLLCYPVAGDQFVNCAYIVKVWRIGVKLIGFGQKDVEEGLKKVAEDAEMSNRL 433

Query: 430 MKLHERIMGEDANSRVNSSFTTFIKDINKL 446
            KL+ER MG++AN R  ++ + FI D  K+
Sbjct: 434 RKLNERTMGDEANLRAVANLSAFIDDQLKI 461

BLAST of Cp4.1LG06g05470.1 vs. TrEMBL
Match: B9IHC9_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0016s02070g PE=4 SV=2)

HSP 1 Score: 518.1 bits (1333), Expect = 1.1e-143
Identity = 246/454 (54.19%), Postives = 331/454 (72.91%), Query Frame = 1

Query: 1   MKHVSNKPRVLFVPYPAQGHVTPMLMLAAVFQRRGFLPIFLTPSYIHRHISSQISLTNEI 60
           MK +   P+++ VPYPAQGHVTP+L LA+ F   GF P+ +TP +IHR I S I   + I
Sbjct: 4   MKRIQT-PKIILVPYPAQGHVTPLLKLASAFLDHGFEPVMVTPEFIHRRIISNIDPKSHI 63

Query: 61  LFISMPDTVDDNTPHDFFTIETALETTMPSYVRRVLGEYNSNESGVVCMVVDLLASSAIE 120
             IS+PD ++ + P DFF  E A+E  MPS++  ++ ++N +   V CM+VDLLAS AIE
Sbjct: 64  SCISIPDGLEMDMPRDFFANEKAMEINMPSHLEGLVRKFNEDGEVVACMIVDLLASWAIE 123

Query: 121 VGKEYGVAVAGFWPAMFATYNLISAVPDMVKNNLISSDTGCPGEGSKRC-VPNQPLLSTE 180
           VG   GV VAGFWPAM ATY LI+A+PDMV+  LIS +TG P      C +PNQPLLSTE
Sbjct: 124 VGHRCGVPVAGFWPAMLATYQLIAAIPDMVRTGLIS-ETGSPQHLGPLCFLPNQPLLSTE 183

Query: 181 ELPWLIGTSSARKARFKFWTRTMVRAKSLQWILVNSFPEELPLENPIPKSSAA------- 240
           +LPWLIGT ++RKARF+FWTRT+ R++ L W+LVNSFPEE  +++  P + A        
Sbjct: 184 DLPWLIGTPASRKARFEFWTRTLDRSRKLSWLLVNSFPEEC-IDHDKPHNGALLENSMDQ 243

Query: 241 --VFLVGPLSRHSNPAKTPTFWEEDDGCLQWLEKQSPNSVVYISFGSWVSPINESKVRSL 300
             +  VG LS+H    K P+FWEED  CLQWL+KQ P+SV+YISFGSWVSPI E KV+ L
Sbjct: 244 PLICQVGALSKHPL-VKNPSFWEEDMSCLQWLDKQKPSSVLYISFGSWVSPIGEGKVKKL 303

Query: 301 AVALLGLRKPFIWVLKSNWRDGLPIGFTQKIQRYGRLVSWAPQMEILKHRAVGCYLTHCG 360
           A+ L  L +PFIWVL   W+ GLP G+ +++ + G++VSWAPQ+++L+H+AV CYLTHCG
Sbjct: 304 ALTLEALGQPFIWVLGPTWQGGLPFGYIERVSKQGKVVSWAPQLKVLQHKAVMCYLTHCG 363

Query: 361 WNSIMEAIQCRKRLLCFPVAGDQFLNCGYVVKVWRIGLKLNGFGEKEVEEGVRKVMEDGE 420
           WNS MEAIQC+K L+C+PVAGDQF+NC Y+ + W+IG+K+NGFGEKE+E+G+R+V+ED  
Sbjct: 364 WNSTMEAIQCQKCLICYPVAGDQFVNCAYITEKWKIGVKINGFGEKEMEQGLRRVVEDHT 423

Query: 421 MKARMMKLHERIMGEDANSRVNSSFTTFIKDINK 445
           M  ++ +LHE  MGE+A+  + S+ TTF+ D  +
Sbjct: 424 MNDKLTRLHEITMGEEASVVMMSNLTTFVNDFKE 453

BLAST of Cp4.1LG06g05470.1 vs. TAIR10
Match: AT3G22250.1 (AT3G22250.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 463.8 bits (1192), Expect = 1.2e-130
Identity = 238/467 (50.96%), Postives = 314/467 (67.24%), Query Frame = 1

Query: 4   VSNKPRVLFVPYPAQGHVTPMLMLAAVFQRRGFLPIFLTPSYIHRHISSQISLTNE---I 63
           V+ KP+++F+PYPAQGHVTPML LA+ F  RGF P+ +TP  IHR IS+    TNE   I
Sbjct: 3   VTQKPKIIFIPYPAQGHVTPMLHLASAFLSRGFSPVVMTPESIHRRISA----TNEDLGI 62

Query: 64  LFISMPDTVD--DNTPHDFFTIETALETTMPSYVRRVLGEYNSNESGVVCMVVDLLASSA 123
            F+++ D  D  D  P DFF+IE ++E  MP  + R+L E    +  V C+VVDLLAS A
Sbjct: 63  TFLALSDGQDRPDAPPSDFFSIENSMENIMPPQLERLLLE---EDLDVACVVVDLLASWA 122

Query: 124 IEVGKEYGVAVAGFWPAMFATYNLISAVPDMVKNNLISSDTGCPGEGSKRCV-PNQPLLS 183
           I V    GV VAGFWP MFA Y LI A+P++V+  L+S   GCP +  K  V P QPLLS
Sbjct: 123 IGVADRCGVPVAGFWPVMFAAYRLIQAIPELVRTGLVSQK-GCPRQLEKTIVQPEQPLLS 182

Query: 184 TEELPWLIGTSSARKARFKFWTRTMVRAKSLQWILVNSFPEELP----------LENPIP 243
            E+LPWLIGT  A+K RFKFW RT+ R KSL+WIL +SF +E              N + 
Sbjct: 183 AEDLPWLIGTPKAQKKRFKFWQRTLERTKSLRWILTSSFKDEYEDVDNHKASYKKSNDLN 242

Query: 244 KSSAA----VFLVGPLSRH---SNPAKTPT-FWEEDDGCLQWLEKQSPNSVVYISFGSWV 303
           K +      +  +GPL      +N   T T FWEED  CL WL++Q+PNSV+YISFGSWV
Sbjct: 243 KENNGQNPQILHLGPLHNQEATNNITITKTSFWEEDMSCLGWLQEQNPNSVIYISFGSWV 302

Query: 304 SPINESKVRSLAVALLGLRKPFIWVLKSNWRDGLPIGFTQKI---QRYGRLVSWAPQMEI 363
           SPI ES +++LA+AL    +PF+W L   W++GLP GF  ++   +  GR+VSWAPQ+E+
Sbjct: 303 SPIGESNIQTLALALEASGRPFLWALNRVWQEGLPPGFVHRVTITKNQGRIVSWAPQLEV 362

Query: 364 LKHRAVGCYLTHCGWNSIMEAIQCRKRLLCFPVAGDQFLNCGYVVKVWRIGLKLNGFGEK 423
           L++ +VGCY+THCGWNS MEA+   +RLLC+PVAGDQF+NC Y+V VW+IG++L+GFGEK
Sbjct: 363 LRNDSVGCYVTHCGWNSTMEAVASSRRLLCYPVAGDQFVNCKYIVDVWKIGVRLSGFGEK 422

Query: 424 EVEEGVRKVMEDGEMKARMMKLHERIMGEDANSRVNSSFTTFIKDIN 444
           EVE+G+RKVMED +M  R+ KL +R MG +A      +FT    ++N
Sbjct: 423 EVEDGLRKVMEDQDMGERLRKLRDRAMGNEARLSSEMNFTFLKNELN 461

BLAST of Cp4.1LG06g05470.1 vs. TAIR10
Match: AT3G02100.1 (AT3G02100.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 197.6 bits (501), Expect = 1.7e-50
Identity = 128/461 (27.77%), Postives = 221/461 (47.94%), Query Frame = 1

Query: 7   KPRVLFVPYPAQGHVTPMLMLAAVFQRRGFLPIFLTPSYIHRHISSQIS-------LTNE 66
           +P V+ +PYPAQGHV P++  +    ++G    F+   + H  I S +        + ++
Sbjct: 11  RPHVVVIPYPAQGHVLPLISFSRYLAKQGIQITFINTEFNHNRIISSLPNSPHEDYVGDQ 70

Query: 67  ILFISMPDTVDDNTPHDFFT--IETALETTMPSYVRRVLGEYNSNESG---VVCMVVDLL 126
           I  +S+PD ++D+         +  ++   MP  V  ++    +  SG   + C+V D  
Sbjct: 71  INLVSIPDGLEDSPEERNIPGKLSESVLRFMPKKVEELIERMMAETSGGTIISCVVADQS 130

Query: 127 ASSAIEVGKEYGVAVAGFWPAMFATYNLISAVPDMVKNNLISSDTGCPGEGSKRCVPNQP 186
              AIEV  ++G+    F PA  A+  L  ++  ++ + LI SD       + +  P  P
Sbjct: 131 LGWAIEVAAKFGIRRTAFCPAAAASMVLGFSIQKLIDDGLIDSDGTVRVNKTIQLSPGMP 190

Query: 187 LLSTEELPWL-IGTSSARKARFKFWTRTMVRAKSLQWILVNSFPEELPLENPIPKSSAAV 246
            + T++  W+ +    ++K  F+   +     +S  W+L NS  E   LE         +
Sbjct: 191 KMETDKFVWVCLKNKESQKNIFQLMLQNNNSIESTDWLLCNSVHE---LETAAFGLGPNI 250

Query: 247 FLVGPL----SRHSNPAKTPTFWEEDDGCLQWLEKQSPNSVVYISFGSWVSPINESKVRS 306
             +GP+    S         +F   D  CL WL++Q P SV+Y++FGS+   +   ++  
Sbjct: 251 VPIGPIGWAHSLEEGSTSLGSFLPHDRDCLDWLDRQIPGSVIYVAFGSF-GVMGNPQLEE 310

Query: 307 LAVALLGLRKPFIWVLKSNWRDGLPIGFTQKIQRYGRLVSWAPQMEILKHRAVGCYLTHC 366
           LA+ L   ++P +WV     +  + +G  +      ++V WAPQ E+L   A+GC+++HC
Sbjct: 311 LAIGLELTKRPVLWVTGD--QQPIKLGSDRV-----KVVRWAPQREVLSSGAIGCFVSHC 370

Query: 367 GWNSIMEAIQCRKRLLCFPVAGDQFLNCGYVVKVWRIGLKLNG-----FGEKEVEEGVRK 426
           GWNS +E  Q     LC P   DQF+N  Y+  VW+IGL L           EV++ + +
Sbjct: 371 GWNSTLEGAQNGIPFLCIPYFADQFINKAYICDVWKIGLGLERDARGVVPRLEVKKKIDE 430

Query: 427 VMED-GEMKARMMKLHERIMGEDANSRVN----SSFTTFIK 441
           +M D GE + R MK+ E +M   A   ++    + F  +IK
Sbjct: 431 IMRDGGEYEERAMKVKEIVMKSVAKDGISCENLNKFVNWIK 460

BLAST of Cp4.1LG06g05470.1 vs. TAIR10
Match: AT1G22400.1 (AT1G22400.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 193.0 bits (489), Expect = 4.1e-49
Identity = 131/475 (27.58%), Postives = 225/475 (47.37%), Query Frame = 1

Query: 3   HVSNKPRVLFVPYPAQGHVTPMLMLAAVFQRRGFLPIFLTPSYIHRHI-----SSQISLT 62
           H S KP V+ VPYPAQGH+ PM+ +A +   RGF   F+   Y H        S+ +   
Sbjct: 7   HNSQKPHVVCVPYPAQGHINPMMRVAKLLHARGFYVTFVNTVYNHNRFLRSRGSNALDGL 66

Query: 63  NEILFISMPDTVDDNTPHDFFTIETALETTMPSYV---RRVLGEYNSNES--GVVCMVVD 122
               F S+ D + +        I    E+TM + +   R +L   N+ ++   V C+V D
Sbjct: 67  PSFRFESIADGLPETDMDATQDITALCESTMKNCLAPFRELLQRINAGDNVPPVSCIVSD 126

Query: 123 LLASSAIEVGKEYGVAVAGFWPAMFATYNLISAVPDMVKNNL--ISSDTGCPGEGSKRCV 182
              S  ++V +E GV    FW      +         ++  L  +  ++    E  +  V
Sbjct: 127 GCMSFTLDVAEELGVPEVLFWTTSGCAFLAYLHFYLFIEKGLCPLKDESYLTKEYLEDTV 186

Query: 183 ----PNQPLLSTEELPWLIGTSSARKARFKFWTRTMVRAKSLQWILVNSFPE-ELPLENP 242
               P    +  +++P  I T++       F  R   RAK    I++N+F + E  + + 
Sbjct: 187 IDFIPTMKNVKLKDIPSFIRTTNPDDVMISFALRETERAKRASAIILNTFDDLEHDVVHA 246

Query: 243 IPKSSAAVFLVGPLSRHSNPA---------KTPTFWEEDDGCLQWLEKQSPNSVVYISFG 302
           +      V+ VGPL   +N            +   W+E+  CL WL+ ++ NSV+YI+FG
Sbjct: 247 MQSILPPVYSVGPLHLLANREIEEGSEIGMMSSNLWKEEMECLDWLDTKTQNSVIYINFG 306

Query: 303 SWVSPINESKVRSLAVALLGLRKPFIWVLKSNWRDG----LPIGFTQKIQRYGRLVSWAP 362
           S ++ ++  ++   A  L G  K F+WV++ +   G    +P  F  + +    L SW P
Sbjct: 307 S-ITVLSVKQLVEFAWGLAGSGKEFLWVIRPDLVAGEEAMVPPDFLMETKDRSMLASWCP 366

Query: 363 QMEILKHRAVGCYLTHCGWNSIMEAIQCRKRLLCFPVAGDQFLNCGYVVKVWRIGLKLNG 422
           Q ++L H A+G +LTHCGWNSI+E++ C   ++C+P   DQ +NC +    W +G+++ G
Sbjct: 367 QEKVLSHPAIGGFLTHCGWNSILESLSCGVPMVCWPFFADQQMNCKFCCDEWDVGIEIGG 426

Query: 423 FGEKEVEEGVRKVMEDGEMKARMMKL---HERIMGEDANSRVNSSFTTFIKDINK 445
             ++E  E V + + DGE   +M +     +R+  +    ++ SS   F   ++K
Sbjct: 427 DVKREEVEAVVRELMDGEKGKKMREKAVEWQRLAEKATEHKLGSSVMNFETVVSK 480

BLAST of Cp4.1LG06g05470.1 vs. TAIR10
Match: AT3G11340.1 (AT3G11340.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 192.6 bits (488), Expect = 5.3e-49
Identity = 139/443 (31.38%), Postives = 216/443 (48.76%), Query Frame = 1

Query: 1   MKHVSNKPRVLFVPYPAQGHVTPMLMLAAVFQRRGFLPIFLTPSYIHRHISSQISLT-NE 60
           M+    KP +   P+P QGH+ PM  LA +F  RGF     + + IH   +S  S     
Sbjct: 1   METRETKPVIFLFPFPLQGHLNPMFQLANIFFNRGF-----SITVIHTEFNSPNSSNFPH 60

Query: 61  ILFISMPDTVDDNTPH-DFFTIETALETTMPSYVRRVLGEYNSNESGVVCMVVDLLASSA 120
             F+S+PD++ +   + D   I   L +   +     L +  S E    C++VD L    
Sbjct: 61  FTFVSIPDSLSEPESYPDVIEILHDLNSKCVAPFGDCLKKLISEEPTAACVIVDALWYFT 120

Query: 121 IEVGKEYGVAVAGFWPAMFATYNLISAVPDMVKNNLISSDTGCPGEGSKRC--VPNQPLL 180
            ++ +++      F   +  T NL SA     K +++        + +K    VP  P L
Sbjct: 121 HDLTEKFN-----FPRIVLRTVNL-SAFVAFSKFHVLREKGYLSLQETKADSPVPELPYL 180

Query: 181 STEELPWLIGTSSARKARFKFWTRTMVRAKSLQWILVNSFPE-------ELPLENPIPKS 240
             ++LPW   T   R    K     M   KS   I+ N+  +       E  +E P+P  
Sbjct: 181 RMKDLPWF-QTEDPRSGD-KLQIGVMKSLKSSSGIIFNAIEDLETDQLDEARIEFPVP-- 240

Query: 241 SAAVFLVGPLSRHSNPAKTPTFWEEDDGCLQWLEKQSPNSVVYISFGSWVSPINESKVRS 300
              +F +GP  R+ + A + +    D  CL WL+KQ+ NSV+Y S GS ++ I+ES+   
Sbjct: 241 ---LFCIGPFHRYVS-ASSSSLLAHDMTCLSWLDKQATNSVIYASLGS-IASIDESEFLE 300

Query: 301 LAVALLGLRKPFIWVLK------SNWRDGLPIGFTQKIQRYGRLVSWAPQMEILKHRAVG 360
           +A  L    +PF+WV++        W + LP GF + ++  G++V WAPQ E+L HRA G
Sbjct: 301 IAWGLRNSNQPFLWVVRPGLIHGKEWIEILPKGFIENLEGRGKIVKWAPQPEVLAHRATG 360

Query: 361 CYLTHCGWNSIMEAIQCRKRLLCFPVAGDQFLNCGYVVKVWRIGLKLNGFGEK-EVEEGV 420
            +LTHCGWNS +E I     ++C P  GDQ +N  Y+  VW+IGL L    E+  +E  V
Sbjct: 361 GFLTHCGWNSTLEGICEAIPMICRPSFGDQRVNARYINDVWKIGLHLENKVERLVIENAV 420

Query: 421 RKVM---EDGEMKARMMKLHERI 423
           R +M   E  E++ R+M + E +
Sbjct: 421 RTLMTSSEGEEIRKRIMPMKETV 423

BLAST of Cp4.1LG06g05470.1 vs. TAIR10
Match: AT1G22370.2 (AT1G22370.2 UDP-glucosyl transferase 85A5)

HSP 1 Score: 186.8 bits (473), Expect = 2.9e-47
Identity = 126/457 (27.57%), Postives = 216/457 (47.26%), Query Frame = 1

Query: 7   KPRVLFVPYPAQGHVTPMLMLAAVFQRRGFLPIFLTPSYIHRHI-----SSQISLTNEIL 66
           KP V+ +P+PAQGH+ PML +A +   RGF   F+  +Y H  +      + +       
Sbjct: 11  KPHVVCIPFPAQGHINPMLKVAKLLYARGFHVTFVNTNYNHNRLIRSRGPNSLDGLPSFR 70

Query: 67  FISMPDTVDDNTPHDFFTIETALETTMPSYV---RRVLGEYNSNES--GVVCMVVDLLAS 126
           F S+PD + +        + T  E+TM + +   + +L   N+ +    V C+V D + S
Sbjct: 71  FESIPDGLPEENKDVMQDVPTLCESTMKNCLAPFKELLRRINTTKDVPPVSCIVSDGVMS 130

Query: 127 SAIEVGKEYGVAVAGFWPAMFATYNLISAVPDMVKNNLISSDTGCPGEGSKRCVPNQPLL 186
             ++  +E GV    FW      +         ++  L         +     +P+   L
Sbjct: 131 FTLDAAEELGVPDVLFWTPSACGFLAYLHFYRFIEKGLSPIKDESSLDTKINWIPSMKNL 190

Query: 187 STEELPWLIGTSSARKARFKFWTRTMVRAKSLQWILVNSFPEELPLENPIPKSSAA---- 246
             +++P  I  ++       F+     RAK    I++N+F     LE+ + +S  +    
Sbjct: 191 GLKDIPSFIRATNTEDIMLNFFVHEADRAKRASAIILNTFDS---LEHDVVRSIQSIIPQ 250

Query: 247 VFLVGPL--------SRHSNPAKTPT-FWEEDDGCLQWLEKQSPNSVVYISFGSWVSPIN 306
           V+ +GPL           S+  +  T  W E+  CL WL+ +SPNSVVY++FGS ++ ++
Sbjct: 251 VYTIGPLHLFVNRDIDEESDIGQIGTNMWREEMECLDWLDTKSPNSVVYVNFGS-ITVMS 310

Query: 307 ESKVRSLAVALLGLRKPFIWVLKSNWRDG----LPIGFTQKIQRYGRLVSWAPQMEILKH 366
             ++   A  L   +K F+WV++ +   G    LP  F  +      L SW PQ ++L H
Sbjct: 311 AKQLVEFAWGLAATKKDFLWVIRPDLVAGDVPMLPPDFLIETANRRMLASWCPQEKVLSH 370

Query: 367 RAVGCYLTHCGWNSIMEAIQCRKRLLCFPVAGDQFLNCGYVVKVWRIGLKLNG-FGEKEV 426
            AVG +LTH GWNS +E++     ++C+P   +Q  NC Y    W +G+++ G    +EV
Sbjct: 371 PAVGGFLTHSGWNSTLESLSGGVPMVCWPFFAEQQTNCKYCCDEWEVGMEIGGDVRREEV 430

Query: 427 EEGVRKVMEDGEMKARMMKLHE-RIMGEDANSRVNSS 435
           EE VR++M+  + K    K  E + + E+A   +  S
Sbjct: 431 EELVRELMDGDKGKKMRQKAEEWQRLAEEATKPIYGS 463

BLAST of Cp4.1LG06g05470.1 vs. NCBI nr
Match: gi|659129416|ref|XP_008464676.1| (PREDICTED: UDP-glycosyltransferase 82A1 [Cucumis melo])

HSP 1 Score: 765.8 bits (1976), Expect = 4.3e-218
Identity = 367/450 (81.56%), Postives = 405/450 (90.00%), Query Frame = 1

Query: 1   MKHVSNKPRVLFVPYPAQGHVTPMLMLAAVFQRRGFLPIFLTPSYIHRHISSQISLTNEI 60
           MK+ S KP+VL VPYPAQGHVTPMLMLAAVF RRGFLPIFLTPSYIHRHISSQIS ++EI
Sbjct: 1   MKYTSKKPKVLLVPYPAQGHVTPMLMLAAVFHRRGFLPIFLTPSYIHRHISSQISSSDEI 60

Query: 61  LFISMPDTVDDNTPHDFFTIETALETTMPSYVRRVLGEYNSNESG-----VVCMVVDLLA 120
           LF+SM D +DDN P DFFT+E  +ETTMP Y+R+VL E+NS ES      VVCMVVDLLA
Sbjct: 61  LFVSMSDGLDDNMPRDFFTVEAVMETTMPIYLRQVLSEHNSKESSDSSGSVVCMVVDLLA 120

Query: 121 SSAIEVGKEYGVAVAGFWPAMFATYNLISAVPDMVKNNLISSDTGCPGEGSKRCVPNQPL 180
           SSAIEVGKE+GV V GFWPAM ATY LIS +P+MV++NLISSDTGCP EGSKRCVP+QPL
Sbjct: 121 SSAIEVGKEFGVTVVGFWPAMLATYKLISTIPEMVQSNLISSDTGCPEEGSKRCVPSQPL 180

Query: 181 LSTEELPWLIGTSSARKARFKFWTRTMVRAKSLQWILVNSFPEEL-PLENPIPKSSAA-V 240
           LSTEELPWLIGT SARK RFKFW RTM RAKS+Q +LVNSFPEEL PL+ P PKSSAA V
Sbjct: 181 LSTEELPWLIGTPSARKGRFKFWKRTMARAKSVQCLLVNSFPEELLPLQKPTPKSSAASV 240

Query: 241 FLVGPLSRHSNPAKTPTFWEEDDGCLQWLEKQSPNSVVYISFGSWVSPINESKVRSLAVA 300
           FLVGPL++HSNPAKTPTFWEEDDGC++WLEKQ PNSV+YISFGSWVSPINESKVRSLA+A
Sbjct: 241 FLVGPLTQHSNPAKTPTFWEEDDGCVKWLEKQRPNSVIYISFGSWVSPINESKVRSLAMA 300

Query: 301 LLGLRKPFIWVLKSNWRDGLPIGFTQKIQRYGRLVSWAPQMEILKHRAVGCYLTHCGWNS 360
           LLGL+ PFIWVLK+NWRDGLPIGF QK Q YGRLVSWAPQ+EILKH+AVGCYLTHCGWNS
Sbjct: 301 LLGLKNPFIWVLKNNWRDGLPIGFQQKSQSYGRLVSWAPQIEILKHKAVGCYLTHCGWNS 360

Query: 361 IMEAIQCRKRLLCFPVAGDQFLNCGYVVKVWRIGLKLNGFGEKEVEEGVRKVMEDGEMKA 420
           IMEAIQC KRLLCFPVAGDQFLNCGYVVKVWRIG++LNGFGEKEVEEG++KVMEDGEMK 
Sbjct: 361 IMEAIQCGKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNGFGEKEVEEGMKKVMEDGEMKG 420

Query: 421 RMMKLHERIMGEDANSRVNSSFTTFIKDIN 444
           R+MKLHERIMGE+AN RVNS+FT FI +IN
Sbjct: 421 RLMKLHERIMGEEANYRVNSNFTAFINEIN 450

BLAST of Cp4.1LG06g05470.1 vs. NCBI nr
Match: gi|449463617|ref|XP_004149528.1| (PREDICTED: UDP-glycosyltransferase 82A1 [Cucumis sativus])

HSP 1 Score: 747.7 bits (1929), Expect = 1.2e-212
Identity = 360/449 (80.18%), Postives = 398/449 (88.64%), Query Frame = 1

Query: 1   MKHVSNKPRVLFVPYPAQGHVTPMLMLAAVFQRRGFLPIFLTPSYIHRHISSQISLTNEI 60
           MK+   KP+V+ VPYPAQGHVTPMLMLAAVF RRGFLPIFLTPSYIH HISSQ+S ++ I
Sbjct: 1   MKYALKKPKVILVPYPAQGHVTPMLMLAAVFHRRGFLPIFLTPSYIHCHISSQVSSSDGI 60

Query: 61  LFISMPDTVDDNTPHDFFTIETALETTMPSYVRRVLGEYNSNES----GVVCMVVDLLAS 120
           +F+SM D +DDN P DFFTIE A+ETTMP  +R+VL E+NS ES    GVVCMVVDLLAS
Sbjct: 61  IFVSMSDGLDDNMPRDFFTIEAAIETTMPVCLRQVLSEHNSKESSGGTGVVCMVVDLLAS 120

Query: 121 SAIEVGKEYGVAVAGFWPAMFATYNLISAVPDMVKNNLISSDTGCPGEGSKRCVPNQPLL 180
           SAIEVG E+GV V GFWPAMFATY L+S +P+M++NN ISSDTGCP EGSKRCVP+QPLL
Sbjct: 121 SAIEVGNEFGVTVVGFWPAMFATYKLMSTIPEMIQNNFISSDTGCPEEGSKRCVPSQPLL 180

Query: 181 STEELPWLIGTSSARKARFKFWTRTMVRAKSLQWILVNSFPEEL-PLENPIPKSSAA-VF 240
           S EELPWL+GTSSA K RFKFW RTM RA+S+  +LVNSFPEEL PL+  I KSSAA VF
Sbjct: 181 SAEELPWLVGTSSAIKGRFKFWKRTMARARSVHCLLVNSFPEELLPLQKLITKSSAASVF 240

Query: 241 LVGPLSRHSNPAKTPTFWEEDDGCLQWLEKQSPNSVVYISFGSWVSPINESKVRSLAVAL 300
           LVGPLSRHSNPAKTPTFWEEDDGC++WLEKQ PNSV+YISFGSWVSPINESKVRSLA+ L
Sbjct: 241 LVGPLSRHSNPAKTPTFWEEDDGCVKWLEKQRPNSVIYISFGSWVSPINESKVRSLAMTL 300

Query: 301 LGLRKPFIWVLKSNWRDGLPIGFTQKIQRYGRLVSWAPQMEILKHRAVGCYLTHCGWNSI 360
           LGL+ PFIWVLK+NWRDGLPIGF QKIQ YGRLVSWAPQ+EILKHRAVGCYLTHCGWNSI
Sbjct: 301 LGLKNPFIWVLKNNWRDGLPIGFQQKIQSYGRLVSWAPQIEILKHRAVGCYLTHCGWNSI 360

Query: 361 MEAIQCRKRLLCFPVAGDQFLNCGYVVKVWRIGLKLNGFGEKEVEEGVRKVMEDGEMKAR 420
           MEAIQ  KRLLCFPVAGDQFLNCGYVVKVWRIG++LNGFGEKEVEEG+RKVMEDGEMK R
Sbjct: 361 MEAIQYGKRLLCFPVAGDQFLNCGYVVKVWRIGVRLNGFGEKEVEEGMRKVMEDGEMKGR 420

Query: 421 MMKLHERIMGEDANSRVNSSFTTFIKDIN 444
            MKLHERIMGE+AN RVNS+FTTFI +IN
Sbjct: 421 FMKLHERIMGEEANCRVNSNFTTFINEIN 449

BLAST of Cp4.1LG06g05470.1 vs. NCBI nr
Match: gi|657973141|ref|XP_008378366.1| (PREDICTED: UDP-glycosyltransferase 82A1-like [Malus domestica])

HSP 1 Score: 552.4 bits (1422), Expect = 7.5e-154
Identity = 261/458 (56.99%), Postives = 347/458 (75.76%), Query Frame = 1

Query: 1   MKHVSNKPRVLFVPYPAQGHVTPMLMLAAVFQRRGFLPIFLTPSYIHRHISSQISLTNEI 60
           +K  ++ P ++ VPYPAQGHVTPM  LA+ F  +GF P+ +TP YIH  I  ++   ++I
Sbjct: 4   IKRSNSNPIIILVPYPAQGHVTPMFKLASAFLSQGFKPVMVTPDYIHHQIVRKVEPKDKI 63

Query: 61  LFISMPDTVDDNTPHDFFTIETALETTMPSYVRRVLGEYNSNESG-VVCMVVDLLASSAI 120
           L + +PD +D +TP DFF IE A+E  M + + R++ + +  +   VVC+VVDLLAS AI
Sbjct: 64  LCMPIPDGLDKDTPRDFFAIEKAMENNMANPLERLIHQLDDKDGDEVVCVVVDLLASWAI 123

Query: 121 EVGKEYGVAVAGFWPAMFATYNLISAVPDMVKNNLISSDTGCPGEGSKRCVPNQPLLSTE 180
           +V    GVA AGFWPAM ATY LI+A+PDM++  LIS+DTG P + S  C+PNQP+LSTE
Sbjct: 124 DVANRCGVACAGFWPAMHATYRLITAIPDMLRTGLISADTGFPKQLSGICLPNQPVLSTE 183

Query: 181 ELPWLIGTSSARKARFKFWTRTMVRAKSLQWILVNSFPEELPLENP--------IPKSSA 240
           ELPWLIGT +ARKARF+FWTRT+ R+K+LQWILV+SFP E  + +         + KS+ 
Sbjct: 184 ELPWLIGTPAARKARFRFWTRTLERSKTLQWILVHSFPNEYTISDEQHQQLGDQLFKSTT 243

Query: 241 A----VFLVGPLSRHSNPAKTPTFWEEDDGCLQWLEKQSPNSVVYISFGSWVSPINESKV 300
                VF +GPLS+H+   K P+FWEED  CL WL+KQ+PN+V YISFGSWVSPI E+KV
Sbjct: 244 TQQPLVFPIGPLSKHTT-TKNPSFWEEDTSCLNWLDKQNPNTVAYISFGSWVSPIGEAKV 303

Query: 301 RSLAVALLGLRKPFIWVLKSNWRDGLPIGFTQKIQRYGRLVSWAPQMEILKHRAVGCYLT 360
           RSLA+AL  L KPF+WVL S+W  GLPIG+ +++ + G++VSWAPQM++L+H+AVGCYLT
Sbjct: 304 RSLALALEALGKPFLWVLGSSWLGGLPIGYLERVAKQGKVVSWAPQMDVLQHKAVGCYLT 363

Query: 361 HCGWNSIMEAIQCRKRLLCFPVAGDQFLNCGYVVKVWRIGLKLNGFGEKEVEEGVRKVME 420
           HCGWNS MEAIQC+K LLC+PVAGDQF+NC Y+VKVWRIG++L+GFG+++VEEG+R++ME
Sbjct: 364 HCGWNSTMEAIQCQKPLLCYPVAGDQFVNCAYIVKVWRIGVRLSGFGQRDVEEGLRRMME 423

Query: 421 DGEMKARMMKLHERIMGEDANSRVNSSFTTFIKDINKL 446
           + EM  RM KL+ER MG++AN R  S+ T F  D NK+
Sbjct: 424 EDEMSKRMRKLNERTMGDEANLRAVSNLTAF-TDQNKM 459

BLAST of Cp4.1LG06g05470.1 vs. NCBI nr
Match: gi|694400527|ref|XP_009375352.1| (PREDICTED: UDP-glycosyltransferase 82A1-like [Pyrus x bretschneideri])

HSP 1 Score: 551.2 bits (1419), Expect = 1.7e-153
Identity = 257/451 (56.98%), Postives = 339/451 (75.17%), Query Frame = 1

Query: 1   MKHVSNKPRVLFVPYPAQGHVTPMLMLAAVFQRRGFLPIFLTPSYIHRHISSQISLTNEI 60
           +K  ++ P ++ VPYPAQGHVTPML LA+ F  + F P+ +TP YIH  I  ++   ++I
Sbjct: 7   IKRSNSNPIIILVPYPAQGHVTPMLKLASAFLSQDFKPVMVTPDYIHHQIVRKVEPKDKI 66

Query: 61  LFISMPDTVDDNTPHDFFTIETALETTMPSYVRRVLGEYNSNESG-VVCMVVDLLASSAI 120
           L I +PD +  +TP DFF IE A+E  M + + R++ + +  +   VVC+VVDLLAS AI
Sbjct: 67  LCIPIPDELAKDTPRDFFAIEKAMENKMANPLERLIHQLDDKDGDEVVCVVVDLLASWAI 126

Query: 121 EVGKEYGVAVAGFWPAMFATYNLISAVPDMVKNNLISSDTGCPGEGSKRCVPNQPLLSTE 180
           +V    GVA AGFWPAM ATY LI+A+PDM++  LIS+DTG P + S  C+PNQP+LSTE
Sbjct: 127 DVANRCGVACAGFWPAMHATYRLITAIPDMLRTGLISADTGFPKQLSGICLPNQPVLSTE 186

Query: 181 ELPWLIGTSSARKARFKFWTRTMVRAKSLQWILVNSFPEELPLE------------NPIP 240
           ELPWLIGT +ARKARF+FWTRT+ R+K+LQWIL+NSFP E  +             N   
Sbjct: 187 ELPWLIGTPAARKARFRFWTRTLERSKTLQWILINSFPNEYTISDEQHQQLGDQLFNSTR 246

Query: 241 KSSAAVFLVGPLSRHSNPAKTPTFWEEDDGCLQWLEKQSPNSVVYISFGSWVSPINESKV 300
                VF +GPLS+H+   K P+FWEED  CL WL+KQ+PN+VVYISFGSWVSPI E+KV
Sbjct: 247 TQQPLVFPIGPLSKHTT-TKNPSFWEEDTSCLNWLDKQNPNTVVYISFGSWVSPIGEAKV 306

Query: 301 RSLAVALLGLRKPFIWVLKSNWRDGLPIGFTQKIQRYGRLVSWAPQMEILKHRAVGCYLT 360
           RSLA+AL  L KPF+WVL S+W  GLPIG+ +++ + G++VSWAPQM++L+H+AVGCYLT
Sbjct: 307 RSLALALEALGKPFLWVLGSSWLGGLPIGYLERVAKQGKVVSWAPQMDVLQHKAVGCYLT 366

Query: 361 HCGWNSIMEAIQCRKRLLCFPVAGDQFLNCGYVVKVWRIGLKLNGFGEKEVEEGVRKVME 420
           HCGWNS MEAIQC+K LLC+PVAGDQF+NC Y+VKVWRIG++L+GFG+++VEEG+R++ME
Sbjct: 367 HCGWNSTMEAIQCQKPLLCYPVAGDQFVNCAYIVKVWRIGVRLSGFGQRDVEEGLRRMME 426

Query: 421 DGEMKARMMKLHERIMGEDANSRVNSSFTTF 439
           + EM  RM KL+ER MG++AN +  S+ T F
Sbjct: 427 EDEMSKRMRKLNERTMGDEANLKAVSNLTAF 456

BLAST of Cp4.1LG06g05470.1 vs. NCBI nr
Match: gi|694406288|ref|XP_009377960.1| (PREDICTED: UDP-glycosyltransferase 82A1-like [Pyrus x bretschneideri])

HSP 1 Score: 547.7 bits (1410), Expect = 1.8e-152
Identity = 262/458 (57.21%), Postives = 338/458 (73.80%), Query Frame = 1

Query: 1   MKHVSNKPRVLF-VPYPAQGHVTPMLMLAAVFQRRGFLPIFLTPSYIHRHISSQISLTNE 60
           MK V N   ++  VPYPAQGHVTPML LA+ F  +GF P+ +TP YIH  I  ++    +
Sbjct: 4   MKCVKNSNSIIILVPYPAQGHVTPMLKLASAFLTQGFKPVMVTPDYIHHQIVRKVEPKEK 63

Query: 61  ILFISMPDTVDDNTPHDFFTIETALETTMPSYVRRVLGEYNSNESGVVCMVVDLLASSAI 120
           IL + + D +D +TP DFF +E A+E  MPS++  ++ + + +   VVC+VVDLLAS AI
Sbjct: 64  ILCMPISDGLDKDTPRDFFAVEKAMEDNMPSHLESLVHQLDKDGDEVVCVVVDLLASWAI 123

Query: 121 EVGKEYGVAVAGFWPAMFATYNLISAVPDMVKNNLISSDTGCPGEGSKRCVPNQPLLSTE 180
           +V    GVA AGFWPAM ATY LI+A+PDM++  LI +DTG P +    C+PN P+L TE
Sbjct: 124 DVANRCGVACAGFWPAMHATYRLITAIPDMIRTGLICADTGFPKQLGGICLPNLPVLFTE 183

Query: 181 ELPWLIGTSSARKARFKFWTRTMVRAKSLQWILVNSFPEELPLENP-------IPKSSAA 240
           ELPWLIGT +ARK RFKFWTRT+ R+K+LQ ILVNSFP E  + +        + KS+  
Sbjct: 184 ELPWLIGTPAARKGRFKFWTRTLERSKTLQRILVNSFPNEYSINDEQQLLGDQLVKSTKT 243

Query: 241 ----VFLVGPLSRHSNPAKTPTFWEEDDGCLQWLEKQSPNSVVYISFGSWVSPINESKVR 300
               VF +GPLS+H+   K P+FWEED  CL WL+KQ+PN+VVYISFGSWVSPI E KVR
Sbjct: 244 QQPLVFPIGPLSKHTT-TKNPSFWEEDTSCLNWLDKQNPNTVVYISFGSWVSPIGEGKVR 303

Query: 301 SLAVALLGLRKPFIWVLKSNWRDGLPIGFTQKIQRYGRLVSWAPQMEILKHRAVGCYLTH 360
           SLA+AL  LRKPF+WVL S+W  GLPIG+ +++ + GR+VSWAPQM++L+H+AVGCYLTH
Sbjct: 304 SLALALEALRKPFLWVLGSSWLGGLPIGYLERVAKQGRVVSWAPQMDVLQHKAVGCYLTH 363

Query: 361 CGWNSIMEAIQCRKRLLCFPVAGDQFLNCGYVVKVWRIGLKLNGFGEKEVEEGVRKVMED 420
           CGWNS MEAIQC K LLC+PVAGDQF+NC Y+V VWRIG+KL+GFG+++VEEG+R+VME+
Sbjct: 364 CGWNSTMEAIQCEKPLLCYPVAGDQFVNCSYIVNVWRIGVKLSGFGQRDVEEGLRRVMEE 423

Query: 421 GEMKARMMKLHERIMGEDANSRVNSSFTTFIKDINKLS 447
            EM  RM KL+ER MG+DAN RV S+   F   +  L+
Sbjct: 424 DEMSNRMRKLNERSMGDDANLRVVSNLIAFTDQVKVLA 460

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
U82A1_ARATH2.2e-12950.96UDP-glycosyltransferase 82A1 OS=Arabidopsis thaliana GN=UGT82A1 PE=2 SV=1[more]
U83A1_ARATH2.9e-4927.77UDP-glycosyltransferase 83A1 OS=Arabidopsis thaliana GN=UGT83A1 PE=2 SV=1[more]
U85A1_ARATH7.2e-4827.58UDP-glycosyltransferase 85A1 OS=Arabidopsis thaliana GN=UGT85A1 PE=2 SV=1[more]
U85A5_ARATH5.2e-4627.57UDP-glycosyltransferase 85A5 OS=Arabidopsis thaliana GN=UGT85A5 PE=2 SV=1[more]
U76E2_ARATH4.8e-4430.25UDP-glycosyltransferase 76E2 OS=Arabidopsis thaliana GN=UGT76E2 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0L1R0_CUCSA8.5e-21380.18Glycosyltransferase OS=Cucumis sativus GN=Csa_4G617410 PE=3 SV=1[more]
F6HGE7_VITVI1.4e-15156.98Glycosyltransferase OS=Vitis vinifera GN=VIT_01s0010g00530 PE=3 SV=1[more]
A0A061DMA7_THECC1.8e-14654.12Glycosyltransferase OS=Theobroma cacao GN=TCM_002013 PE=3 SV=1[more]
M5WYR7_PRUPE3.1e-14656.00Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa016262mg PE=4 SV=1[more]
B9IHC9_POPTR1.1e-14354.19Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0016s02070g PE=4 SV=2[more]
Match NameE-valueIdentityDescription
AT3G22250.11.2e-13050.96 UDP-Glycosyltransferase superfamily protein[more]
AT3G02100.11.7e-5027.77 UDP-Glycosyltransferase superfamily protein[more]
AT1G22400.14.1e-4927.58 UDP-Glycosyltransferase superfamily protein[more]
AT3G11340.15.3e-4931.38 UDP-Glycosyltransferase superfamily protein[more]
AT1G22370.22.9e-4727.57 UDP-glucosyl transferase 85A5[more]
Match NameE-valueIdentityDescription
gi|659129416|ref|XP_008464676.1|4.3e-21881.56PREDICTED: UDP-glycosyltransferase 82A1 [Cucumis melo][more]
gi|449463617|ref|XP_004149528.1|1.2e-21280.18PREDICTED: UDP-glycosyltransferase 82A1 [Cucumis sativus][more]
gi|657973141|ref|XP_008378366.1|7.5e-15456.99PREDICTED: UDP-glycosyltransferase 82A1-like [Malus domestica][more]
gi|694400527|ref|XP_009375352.1|1.7e-15356.98PREDICTED: UDP-glycosyltransferase 82A1-like [Pyrus x bretschneideri][more]
gi|694406288|ref|XP_009377960.1|1.8e-15257.21PREDICTED: UDP-glycosyltransferase 82A1-like [Pyrus x bretschneideri][more]
The following terms have been associated with this mRNA:
Vocabulary: Biological Process
TermDefinition
GO:0008152metabolic process
Vocabulary: Molecular Function
TermDefinition
GO:0016758transferase activity, transferring hexosyl groups
Vocabulary: INTERPRO
TermDefinition
IPR002213UDP_glucos_trans
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016758 transferase activity, transferring hexosyl groups

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cp4.1LG06g05470Cp4.1LG06g05470gene


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG06g05470.1:five_prime_utr:001Cp4.1LG06g05470.1:five_prime_utr:001five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG06g05470.1:cds:001Cp4.1LG06g05470.1:cds:001CDS
Cp4.1LG06g05470.1:cds:002Cp4.1LG06g05470.1:cds:002CDS
Cp4.1LG06g05470.1:cds:003Cp4.1LG06g05470.1:cds:003CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG06g05470.1:three_prime_utr:001Cp4.1LG06g05470.1:three_prime_utr:001three_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cp4.1LG06g05470.1Cp4.1LG06g05470.1-proteinpolypeptide


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePANTHERPTHR11926GLUCOSYL/GLUCURONOSYL TRANSFERASEScoord: 5..444
score: 3.7E
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePFAMPF00201UDPGTcoord: 257..419
score: 5.9
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePROSITEPS00375UDPGTcoord: 330..373
scor
NoneNo IPR availableGENE3DG3DSA:3.40.50.2000coord: 252..422
score: 9.6
NoneNo IPR availablePANTHERPTHR11926:SF149UDP-GLYCOSYLTRANSFERASE 82A1coord: 5..444
score: 3.7E
NoneNo IPR availableunknownSSF53756UDP-Glycosyltransferase/glycogen phosphorylasecoord: 3..424
score: 7.14