Tan0003759 (gene) Snake gourd v1

Overview
Name: Tan0003759
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Glycosyltransferase
Location: LG01: 106890023 .. 106892058 (-)
RNA-Seq Expression: Tan0003759
Synteny: Tan0003759
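
The Location field above packs a sequence ID, a coordinate range, and a strand into one string. A minimal sketch of splitting it into its parts follows. It assumes the coordinates are 1-based and inclusive, which is a common genome-browser convention but is not stated on this page; `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'SEQID: start .. end (strand)' string into its fields."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: 1-based, inclusive coordinates, so length = end - start + 1.
    return {"seqid": seqid, "start": start, "end": end,
            "strand": strand, "length": end - start + 1}

loc = parse_location("LG01: 106890023 .. 106892058 (-)")
print(loc)
```

Under that assumption the feature spans 2036 bp on LG01, and the "(-)" marks it as lying on the reverse strand.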
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GATAGATGTGGCTAAAGAATTTGAGGTTCCTTGTTATTTGTTTTACACATCCAATGCTGCTTTTCTTGCTCAGCTTTATGATCAAAATAATAGCAAAAAATTTCAAATTTGGCTAAAAACTAAAAAAAGATGGAAATCATGGCAGTGGAAGAGAAATGATAGAACAAGGATAATTTTCAAAAATCAAAAACAAAAAATCAGTAATCAAATATGACTTTAACGTCTATATTTATTTTACTCGAGCATATGTATAACTTTGGTCAGATATTCGAATTGGTCTCCACAGATATATTTTTTTAAAAAAAAAGTATTACGATAAGGAAATGGTATCCCACTCCCAAAAAGAAAAAGCTTCAATCGCACTATTCTTCACTAAATTCCAATCTCTAATAAGTACCTGCTTTGATGTGATGCGAATGCGACCTTGTCGAATTATGAACTCCGGCGAGCTTTCAGCGCAGACTAGAATTCAGATGAAGAAAATGGAGCTAGTTTTCATCCCATGGCCGGGCATTAGCCATTTCTCCTCCACTCTCCAACTTGCCCAACTCCTCCTCCGCCGCGACCACCGCCTCTCCGTCACTGTCCTCCTCATCCCGTCGCCGTGGGAGCCCATTCCCGCCGCCTCCCTCCAATCCCTCCACCCCTCTCGGATCCGATTCCTCACCCTCCCCGAACACCCCCTTCCCCCTGACGCTGATATTACCTCTCTGTTCAAATCCATCGTAGAAACCCAGAAGCAAAATGTCAGAGACGCCCTTGCGAAGCTTCCCGATTCCCCCATCATCGCCGGCTTCGTCGTCGACATGTTCTGCACCTCCATGATAGATGTGGCCGACGAACTTGGGGTTCCCTCTTATGTATTCCTCACTTCAAGTGCTGGGTATCTCTCTTTCACCTCCCATCTTCAGGAGCTTTACGATCGGCACAACAAACAGAGCGACCAATTCCTCCGGTCGGATTTCCAGTTCGCCGTCCCGGGTTTCACCAATCCGGTTCCTGGCAAGGTCATTCCAAGCTCTTATTTCAACAAAAATTCAGAGTGGATTATACACGAGAGCACCCGGAGGTTAAGGGAGAGCAGTGGGATTTTGATAAACACCTTTTCCGAACTTGAATCGAAGATTCTCGACTCCTTTTCCTCCTCCTCCTCCTACCAATTTCCGCCCGTCTACACCGTGGGGCCGATTCTGAGTTTGCATTCGAACAACAACAATGGCAGCTCCGGTGAATCGAGTGAGCGTTTGGAGATTATGAAATGGCTGGACCAACAACCTCCATTATCGGTAGTATTCCTCTGCTTTGGGAACTGGGGGAGCTTCAATCGGGATCAAGTGAGAGAGATTGCAAATGCTTTAGAGCGGAGTGGGTACCGATTCGTGTGGTCGCTGCGGCAGCCATCGCGGGAAGGAAAAATTGAAAACCCAAACGAATACGATTACACTAAAGATGTTGTTCCAGAGGGGTTTTTGGATCGGACGGCGGAGATTGGGAGGGTGATTGGGTGGGCGCCGCAAGTGGAGATTCTTGGGCATCCGGCGACGGGAGGGTTTGTATCGCATTGCGGGTGGAATTCGACGCTGGAGAGTGTTTGGTTTGGAGTGCCGATAGGGACATGGGCGATGTATGCAGAGCAGCAGTTGAATGCGGTGGAGATGGAAGTGGAGTTGGGTTTGGCGGTGGAGATCTCGTCGGAGACCGGCGTGGTAAGTGCAGAGAAGATTGAGAGTGGAATCAGAGAACTGATGGCCGGCGATGGGGAGGTTCGGAAATTGATGAAGATGAAGAGTGAAGAGAGCCGGAGAAGTGTAATGGAGGGAGGATCTTCTTTCAATGCTCTGAATCGTTTTATTGAGGAGGTAATGGCGAAGACGGCCATTTCATGAATGAATTGGTTTTTGTTCAAATATTTTACGTTTTAAACTAAGTTTATGTTAATAATAATAACCAGTCTATTTGAACTCAAAAATTAATAATGTAGAGCTGAGTTGACTCATTTC
CTAGTCCACTATCATAAATAGATTAGAAAATTTCTA

mRNA sequence

GATAGATGTGGCTAAAGAATTTGAGGTTCCTTGTTATTTGTTTTACACATCCAATGCTGCTTTTCTTGCTCAGCTTTATGATCAAAATAATAGCAAAAAATTTCAAATTTGGCTAAAAACTAAAAAAAGATGGAAATCATGGCAGTGGAAGAGAAATGATAGAACAAGGATAATTTTCAAAAATCAAAAACAAAAAATCAGTAATCAAATATGACTTTAACGTCTATATTTATTTTACTCGAGCATATGTATAACTTTGGTCAGATATTCGAATTGGTCTCCACAGATATATTTTTTTAAAAAAAAAGTATTACGATAAGGAAATGGTATCCCACTCCCAAAAAGAAAAAGCTTCAATCGCACTATTCTTCACTAAATTCCAATCTCTAATAAGTACCTGCTTTGATGTGATGCGAATGCGACCTTGTCGAATTATGAACTCCGGCGAGCTTTCAGCGCAGACTAGAATTCAGATGAAGAAAATGGAGCTAGTTTTCATCCCATGGCCGGGCATTAGCCATTTCTCCTCCACTCTCCAACTTGCCCAACTCCTCCTCCGCCGCGACCACCGCCTCTCCGTCACTGTCCTCCTCATCCCGTCGCCGTGGGAGCCCATTCCCGCCGCCTCCCTCCAATCCCTCCACCCCTCTCGGATCCGATTCCTCACCCTCCCCGAACACCCCCTTCCCCCTGACGCTGATATTACCTCTCTGTTCAAATCCATCGTAGAAACCCAGAAGCAAAATGTCAGAGACGCCCTTGCGAAGCTTCCCGATTCCCCCATCATCGCCGGCTTCGTCGTCGACATGTTCTGCACCTCCATGATAGATGTGGCCGACGAACTTGGGGTTCCCTCTTATGTATTCCTCACTTCAAGTGCTGGGTATCTCTCTTTCACCTCCCATCTTCAGGAGCTTTACGATCGGCACAACAAACAGAGCGACCAATTCCTCCGGTCGGATTTCCAGTTCGCCGTCCCGGGTTTCACCAATCCGGTTCCTGGCAAGGTCATTCCAAGCTCTTATTTCAACAAAAATTCAGAGTGGATTATACACGAGAGCACCCGGAGGTTAAGGGAGAGCAGTGGGATTTTGATAAACACCTTTTCCGAACTTGAATCGAAGATTCTCGACTCCTTTTCCTCCTCCTCCTCCTACCAATTTCCGCCCGTCTACACCGTGGGGCCGATTCTGAGTTTGCATTCGAACAACAACAATGGCAGCTCCGGTGAATCGAGTGAGCGTTTGGAGATTATGAAATGGCTGGACCAACAACCTCCATTATCGGTAGTATTCCTCTGCTTTGGGAACTGGGGGAGCTTCAATCGGGATCAAGTGAGAGAGATTGCAAATGCTTTAGAGCGGAGTGGGTACCGATTCGTGTGGTCGCTGCGGCAGCCATCGCGGGAAGGAAAAATTGAAAACCCAAACGAATACGATTACACTAAAGATGTTGTTCCAGAGGGGTTTTTGGATCGGACGGCGGAGATTGGGAGGGTGATTGGGTGGGCGCCGCAAGTGGAGATTCTTGGGCATCCGGCGACGGGAGGGTTTGTATCGCATTGCGGGTGGAATTCGACGCTGGAGAGTGTTTGGTTTGGAGTGCCGATAGGGACATGGGCGATGTATGCAGAGCAGCAGTTGAATGCGGTGGAGATGGAAGTGGAGTTGGGTTTGGCGGTGGAGATCTCGTCGGAGACCGGCGTGGTAAGTGCAGAGAAGATTGAGAGTGGAATCAGAGAACTGATGGCCGGCGATGGGGAGGTTCGGAAATTGATGAAGATGAAGAGTGAAGAGAGCCGGAGAAGTGTAATGGAGGGAGGATCTTCTTTCAATGCTCTGAATCGTTTTATTGAGGAGGTAATGGCGAAGACGGCCATTTCATGAATGAATTGGTTTTTGTTCAAATATTTTACGTTTTAAACTAAGTTTATGTTAATAATAATAACCAGTCTATTTGAACTCAAAAATTAATAATGTAGAGCTGAGTTGACTCATTTC
CTAGTCCACTATCATAAATAGATTAGAAAATTTCTA

Coding sequence (CDS)

ATGGTATCCCACTCCCAAAAAGAAAAAGCTTCAATCGCACTATTCTTCACTAAATTCCAATCTCTAATAAGTACCTGCTTTGATGTGATGCGAATGCGACCTTGTCGAATTATGAACTCCGGCGAGCTTTCAGCGCAGACTAGAATTCAGATGAAGAAAATGGAGCTAGTTTTCATCCCATGGCCGGGCATTAGCCATTTCTCCTCCACTCTCCAACTTGCCCAACTCCTCCTCCGCCGCGACCACCGCCTCTCCGTCACTGTCCTCCTCATCCCGTCGCCGTGGGAGCCCATTCCCGCCGCCTCCCTCCAATCCCTCCACCCCTCTCGGATCCGATTCCTCACCCTCCCCGAACACCCCCTTCCCCCTGACGCTGATATTACCTCTCTGTTCAAATCCATCGTAGAAACCCAGAAGCAAAATGTCAGAGACGCCCTTGCGAAGCTTCCCGATTCCCCCATCATCGCCGGCTTCGTCGTCGACATGTTCTGCACCTCCATGATAGATGTGGCCGACGAACTTGGGGTTCCCTCTTATGTATTCCTCACTTCAAGTGCTGGGTATCTCTCTTTCACCTCCCATCTTCAGGAGCTTTACGATCGGCACAACAAACAGAGCGACCAATTCCTCCGGTCGGATTTCCAGTTCGCCGTCCCGGGTTTCACCAATCCGGTTCCTGGCAAGGTCATTCCAAGCTCTTATTTCAACAAAAATTCAGAGTGGATTATACACGAGAGCACCCGGAGGTTAAGGGAGAGCAGTGGGATTTTGATAAACACCTTTTCCGAACTTGAATCGAAGATTCTCGACTCCTTTTCCTCCTCCTCCTCCTACCAATTTCCGCCCGTCTACACCGTGGGGCCGATTCTGAGTTTGCATTCGAACAACAACAATGGCAGCTCCGGTGAATCGAGTGAGCGTTTGGAGATTATGAAATGGCTGGACCAACAACCTCCATTATCGGTAGTATTCCTCTGCTTTGGGAACTGGGGGAGCTTCAATCGGGATCAAGTGAGAGAGATTGCAAATGCTTTAGAGCGGAGTGGGTACCGATTCGTGTGGTCGCTGCGGCAGCCATCGCGGGAAGGAAAAATTGAAAACCCAAACGAATACGATTACACTAAAGATGTTGTTCCAGAGGGGTTTTTGGATCGGACGGCGGAGATTGGGAGGGTGATTGGGTGGGCGCCGCAAGTGGAGATTCTTGGGCATCCGGCGACGGGAGGGTTTGTATCGCATTGCGGGTGGAATTCGACGCTGGAGAGTGTTTGGTTTGGAGTGCCGATAGGGACATGGGCGATGTATGCAGAGCAGCAGTTGAATGCGGTGGAGATGGAAGTGGAGTTGGGTTTGGCGGTGGAGATCTCGTCGGAGACCGGCGTGGTAAGTGCAGAGAAGATTGAGAGTGGAATCAGAGAACTGATGGCCGGCGATGGGGAGGTTCGGAAATTGATGAAGATGAAGAGTGAAGAGAGCCGGAGAAGTGTAATGGAGGGAGGATCTTCTTTCAATGCTCTGAATCGTTTTATTGAGGAGGTAATGGCGAAGACGGCCATTTCATGA

Protein sequence

MVSHSQKEKASIALFFTKFQSLISTCFDVMRMRPCRIMNSGELSAQTRIQMKKMELVFIPWPGISHFSSTLQLAQLLLRRDHRLSVTVLLIPSPWEPIPAASLQSLHPSRIRFLTLPEHPLPPDADITSLFKSIVETQKQNVRDALAKLPDSPIIAGFVVDMFCTSMIDVADELGVPSYVFLTSSAGYLSFTSHLQELYDRHNKQSDQFLRSDFQFAVPGFTNPVPGKVIPSSYFNKNSEWIIHESTRRLRESSGILINTFSELESKILDSFSSSSSYQFPPVYTVGPILSLHSNNNNGSSGESSERLEIMKWLDQQPPLSVVFLCFGNWGSFNRDQVREIANALERSGYRFVWSLRQPSREGKIENPNEYDYTKDVVPEGFLDRTAEIGRVIGWAPQVEILGHPATGGFVSHCGWNSTLESVWFGVPIGTWAMYAEQQLNAVEMEVELGLAVEISSETGVVSAEKIESGIRELMAGDGEVRKLMKMKSEESRRSVMEGGSSFNALNRFIEEVMAKTAIS
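
The protein sequence above should follow from the CDS by ordinary codon translation. The sketch below illustrates this on the first nine and last four codons of the CDS. It assumes the standard nuclear genetic code (NCBI translation table 1); `translate` is a hypothetical helper, not a function of any particular library.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1), amino acids in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First nine codons and last four codons of the CDS listed above.
print(translate("ATGGTATCCCACTCCCAAAAAGAAAAA"))  # MVSHSQKEK
print(translate("GCCATTTCATGA"))                 # AIS
```

Both fragments agree with the start (MVSHSQKEK) and end (AIS) of the listed protein sequence. Passing the full CDS string to the same function would give the complete translation.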
Homology
BLAST of Tan0003759 vs. ExPASy Swiss-Prot
Match: Q66PF3 (Putative UDP-glucose flavonoid 3-O-glucosyltransferase 3 OS=Fragaria ananassa OX=3747 GN=GT3 PE=2 SV=1)

HSP 1 Score: 453.8 bits (1166), Expect = 2.7e-126
Identity = 243/478 (50.84%), Positives = 326/478 (68.20%), Query Frame = 0

Query: 52  KKMELVFIPWPGISHFSSTLQLAQLLLRRDHRLSVTVLLIPSP-WEPIPAASLQSLHPS- 111
           K  ELV IP PGI H  STL++A+LL+ RD +L +TVL++  P       A +QSL  S 
Sbjct: 3   KPAELVLIPSPGIGHLVSTLEIAKLLVSRDDKLFITVLIMHFPAVSKGTDAYVQSLADSS 62

Query: 112 -----RIRFLTLPEHPLP-PDADITSLFKSIVETQKQNVRDALAKLPDSPI--IAGFVVD 171
                RI F+ LP   +   +  + +     VE+Q+ +V+DA+A L DS    +AGFVVD
Sbjct: 63  SPISQRINFINLPHTNMDHTEGSVRNSLVGFVESQQPHVKDAVANLRDSKTTRLAGFVVD 122

Query: 172 MFCTSMIDVADELGVPSYVFLTSSAGYLSFTSHLQELYDRHNKQSDQFLRSDFQFAVPGF 231
           MFCT+MI+VA++LGVPSYVF TS A  L    HLQEL D++NK   +F  SD +  +P F
Sbjct: 123 MFCTTMINVANQLGVPSYVFFTSGAATLGLLFHLQELRDQYNKDCTEFKDSDAELIIPSF 182

Query: 232 TNPVPGKVIPSSYFNKNSEWIIHESTRRLRESSGILINTFSELESKILDSFSSSSSYQFP 291
            NP+P KV+P     K+S        +R RE+ GIL+NTF++LES  L + SS +  + P
Sbjct: 183 FNPLPAKVLPGRMLVKDSAEPFLNVIKRFRETKGILVNTFTDLESHALHALSSDA--EIP 242

Query: 292 PVYTVGPILSLHSNNNNGSSGESSERLEIMKWLDQQPPLSVVFLCFGNWGSFNRDQVREI 351
           PVY VGP+L+L+SN +   S E  ++ +I+KWLD QPPLSVVFLCFG+ GSF+  QVREI
Sbjct: 243 PVYPVGPLLNLNSNESRVDSDEVKKKNDILKWLDDQPPLSVVFLCFGSMGSFDESQVREI 302

Query: 352 ANALERSGYRFVWSLRQPSREGKIENPNEYDYTKDVVPEGFLDRTAEIGRVIGWAPQVEI 411
           ANALE +G+RF+WSLR+    GK+  P++YD    V+PEGFLDRT  IG+VIGWAPQV +
Sbjct: 303 ANALEHAGHRFLWSLRRSPPTGKVAFPSDYDDHTGVLPEGFLDRTGGIGKVIGWAPQVAV 362

Query: 412 LGHPATGGFVSHCGWNSTLESVWFGVPIGTWAMYAEQQLNAVEMEVELGLAVEI-----S 471
           L HP+ GGFVSHCGWNSTLES+W GVP+ TW +YAEQQLNA +   EL LAVEI     S
Sbjct: 363 LAHPSVGGFVSHCGWNSTLESLWHGVPVATWPLYAEQQLNAFQPVKELELAVEIDMSYRS 422

Query: 472 SETGVVSAEKIESGIRELMAGD-GEVRKLMKMKSEESRRSVMEGGSSFNALNRFIEEV 514
               +VSA++IE GIRE+M  D  ++RK +K  SE+ ++++M+GGSS+ +L  FI+++
Sbjct: 423 KSPVLVSAKEIERGIREVMELDSSDIRKRVKEMSEKGKKALMDGGSSYTSLGHFIDQI 478
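
The percentages in each HSP summary are the ratio of matching positions to alignment length. The sketch below reads fields of the form "Label = matches/length (percent%)" from a summary line and recomputes the percentage. `parse_fractions` is a hypothetical helper, and the regular expression assumes the exact spacing used on this page.

```python
import re

def parse_fractions(line):
    """Return {label: (matches, aligned_length, percent)} for 'Label = a/b (p%)' fields."""
    stats = {}
    for label, num, den, pct in re.findall(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)", line):
        stats[label] = (int(num), int(den), float(pct))
    return stats

line = "Identity = 243/478 (50.84%)"
for label, (num, den, pct) in parse_fractions(line).items():
    # Recompute the percentage from the raw counts as a consistency check.
    recomputed = round(100 * num / den, 2)
    print(label, num, den, pct, recomputed)
```

For the first Swiss-Prot hit, 243 identical positions over an alignment length of 478 gives 50.84%, matching the reported value.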

BLAST of Tan0003759 vs. ExPASy Swiss-Prot
Match: Q2V6K0 (UDP-glucose flavonoid 3-O-glucosyltransferase 6 OS=Fragaria ananassa OX=3747 GN=GT6 PE=1 SV=1)

HSP 1 Score: 437.2 bits (1123), Expect = 2.6e-121
Identity = 237/479 (49.48%), Positives = 318/479 (66.39%), Query Frame = 0

Query: 52  KKMELVFIPWPGISHFSSTLQLAQLLLRRDHRLSVTVLLIPSPW----EPIPAASL---Q 111
           K  EL+FIP PGI H  ST+++A+LLL RD  L +T+L++  P+      +   SL    
Sbjct: 3   KASELIFIPIPGIGHIVSTVEIAKLLLCRDDNLFITILIMKFPFTADGSDVYIKSLAVDP 62

Query: 112 SLHPSRIRFLTLPEHPLPPDADITSLFKSIVETQKQNVRDALAKL----PDSPIIAGFVV 171
           SL   RIRF+ LP+          + F + +++ K +V+DA+ +L     ++  IAGFV+
Sbjct: 63  SLKTQRIRFVNLPQEHFQGTG--ATGFFTFIDSHKSHVKDAVTRLMETKSETTRIAGFVI 122

Query: 172 DMFCTSMIDVADELGVPSYVFLTSSAGYLSFTSHLQELYDRHNKQSDQFLRSDFQFAVPG 231
           DMFCT MID+A+E G+PSYVF TS A  L    HLQ L D  NK   +F  SD +  V  
Sbjct: 123 DMFCTGMIDLANEFGLPSYVFYTSGAADLGLMFHLQALRDEENKDCTEFKDSDAELVVSS 182

Query: 232 FTNPVP-GKVIPSSYFNKNSEWIIHESTRRLRESSGILINTFSELESKILDSFSSSSSYQ 291
           F NP+P  +V+PS  F K          +R RE+ GIL+NTF ELE   + S SS    +
Sbjct: 183 FVNPLPAARVLPSVVFEKEGGNFFLNFAKRYRETKGILVNTFLELEPHAIQSLSSDG--K 242

Query: 292 FPPVYTVGPILSLHSNNNNGSSGESSERLEIMKWLDQQPPLSVVFLCFGNWGSFNRDQVR 351
             PVY VGPIL++ S  N  SS +S ++ +I++WLD QPP SVVFLCFG+ G F  DQV+
Sbjct: 243 ILPVYPVGPILNVKSEGNQVSSEKSKQKSDILEWLDDQPPSSVVFLCFGSMGCFGEDQVK 302

Query: 352 EIANALERSGYRFVWSLRQPSREGKIENPNEYDYTKDVVPEGFLDRTAEIGRVIGWAPQV 411
           EIA+ALE+ G RF+WSLRQPS+E KI  P++Y   K V+PEGFLDRT ++G+VIGWAPQ+
Sbjct: 303 EIAHALEQGGIRFLWSLRQPSKE-KIGFPSDYTDYKAVLPEGFLDRTTDLGKVIGWAPQL 362

Query: 412 EILGHPATGGFVSHCGWNSTLESVWFGVPIGTWAMYAEQQLNAVEMEVELGLAVEIS--- 471
            IL HPA GGFVSHCGWNSTLES+W+GVPI TW  YAEQQ+NA E+  EL LAVEI    
Sbjct: 363 AILAHPAVGGFVSHCGWNSTLESIWYGVPIATWPFYAEQQVNAFELVKELKLAVEIDMGY 422

Query: 472 -SETGV-VSAEKIESGIRELMAGDGEVRKLMKMKSEESRRSVMEGGSSFNALNRFIEEV 514
             ++GV VS E IE GI+E+M  + E+RK +K  S+ SR+++ E GSS+++L RF++++
Sbjct: 423 RKDSGVIVSRENIEKGIKEVMEQESELRKRVKEMSQMSRKALEEDGSSYSSLGRFLDQI 476

BLAST of Tan0003759 vs. ExPASy Swiss-Prot
Match: D3UAG1 (UDP-glycosyltransferase 71A16 OS=Pyrus communis OX=23211 GN=UGT71A16 PE=1 SV=1)

HSP 1 Score: 432.6 bits (1111), Expect = 6.5e-120
Identity = 235/479 (49.06%), Positives = 313/479 (65.34%), Query Frame = 0

Query: 52  KKMELVFIPWPGISHFSSTLQLAQLLLRRDHRLSVTVLLIPSPWEPIPAASLQSLHPSRI 111
           +  +LVF+P PGI H  ST+++A+ L+ RD +L +TVL++  P++  P  +  S    RI
Sbjct: 3   RSAQLVFVPAPGIGHIVSTVEMAKQLVARDDQLFITVLVMKLPYDQ-PFTNTDSSISHRI 62

Query: 112 RFLTLPEHPLPPDADIT---SLFKSIVETQKQNVRDALAK-LPDS--------PIIAGFV 171
            F+ LPE  L     +    S F+  VE  K +VRDA+   LP+S        P +AGFV
Sbjct: 63  NFVNLPEAQLDKQDTVPNPGSFFRMFVENHKTHVRDAVINLLPESDQSESTSKPRLAGFV 122

Query: 172 VDMFCTSMIDVADELGVPSYVFLTSSAGYLSFTSHLQELYDRHNKQSDQFLRSDFQFAVP 231
           +DMF  S+IDVA+E  VPSYVF TS++  L+  SH Q L D       +   S  + AVP
Sbjct: 123 LDMFSASLIDVANEFEVPSYVFFTSNSSTLALLSHFQSLRDEGGIDITELTSSTAELAVP 182

Query: 232 GFTNPVPGKVIPSSYFNKNSEWIIHESTRRLRESSGILINTFSELESKILDSFSSSSSYQ 291
            F NP P  V+P S+ +K S      +  R +++ GIL+NTF ELES  L      S  +
Sbjct: 183 SFINPYPVAVLPGSFLDKESTKSTLNNVGRYKQTKGILVNTFLELESHALHYL--DSGVK 242

Query: 292 FPPVYTVGPILSLHSNNNNGSSGESSERLEIMKWLDQQPPLSVVFLCFGNWGSFNRDQVR 351
            PPVY VGP+L+L S++ +  S       +I++WLD QPPLSVVFLCFG+ GSF   QV+
Sbjct: 243 IPPVYPVGPLLNLKSSHEDKGS-------DILRWLDDQPPLSVVFLCFGSMGSFGDAQVK 302

Query: 352 EIANALERSGYRFVWSLRQPSREGKIENPNEYDYTKDVVPEGFLDRTAEIGRVIGWAPQV 411
           EIA  LE SG+RF+WSLRQP  +GK   P++Y   K V+PEGFLDRTA +GRVIGWAPQ 
Sbjct: 303 EIACTLEHSGHRFLWSLRQPPSKGKRALPSDYADLKTVLPEGFLDRTATVGRVIGWAPQA 362

Query: 412 EILGHPATGGFVSHCGWNSTLESVWFGVPIGTWAMYAEQQLNAVEMEVELGLAVEISSE- 471
            ILGHPA GGFVSHCGWNSTLES+W GVPI  W MYAEQ +NA ++ VELGLAVEI  + 
Sbjct: 363 AILGHPAIGGFVSHCGWNSTLESIWNGVPIAAWPMYAEQNMNAFQLVVELGLAVEIKMDY 422

Query: 472 ----TGVVSAEKIESGIRELMAGDGEVRKLMKMKSEESRRSVMEGGSSFNALNRFIEEV 514
                 VVSAE IE GIR++M  D +VRK +K  SE+S++++++GGSS+++L RFI+++
Sbjct: 423 RKDSDVVVSAEDIERGIRQVMELDSDVRKRVKEMSEKSKKALVDGGSSYSSLGRFIDQI 471

BLAST of Tan0003759 vs. ExPASy Swiss-Prot
Match: D3THI6 (UDP-glycosyltransferase 71A15 OS=Malus domestica OX=3750 GN=UGT71A15 PE=1 SV=1)

HSP 1 Score: 427.6 bits (1098), Expect = 2.1e-118
Identity = 234/477 (49.06%), Positives = 310/477 (64.99%), Query Frame = 0

Query: 55  ELVFIPWPGISHFSSTLQLAQLLLRRDHRLSVTVLLIPSPWEPIPAASLQSLHPSRIRFL 114
           +LVF+P PGI H  ST+++A+ L  RD +L +TVL++  P+   P  +  S    RI F+
Sbjct: 6   QLVFVPAPGIGHIVSTVEMAKQLAARDDQLFITVLVMKLPYAQ-PFTNTDSSISHRINFV 65

Query: 115 TLPEHPLPPDADIT----SLFKSIVETQKQNVRDALAK-LPDS--------PIIAGFVVD 174
            LPE   P   DI     S F+  VE  K +VRDA+   LP+S        P +AGFV+D
Sbjct: 66  NLPE-AQPDKQDIVPNPGSFFRMFVENHKSHVRDAVINVLPESDQSESTSKPRLAGFVLD 125

Query: 175 MFCTSMIDVADELGVPSYVFLTSSAGYLSFTSHLQELYDRHNKQSDQFLRSDFQFAVPGF 234
           MF  S+IDVA+E  VPSY+F TS+A  L+  SH Q L D       +   S  + AVP F
Sbjct: 126 MFSASLIDVANEFKVPSYLFFTSNASALALMSHFQSLRDEGGIDITELTSSTAELAVPSF 185

Query: 235 TNPVPGKVIPSSYFNKNSEWIIHESTRRLRESSGILINTFSELESKILDSFSSSSSYQFP 294
            NP P  V+P S  +  S         + +++ GIL+NTF ELES  L    S    + P
Sbjct: 186 INPYPAAVLPGSLLDMESTKSTLNHVSKYKQTKGILVNTFMELESHALHYLDSGD--KIP 245

Query: 295 PVYTVGPILSLHSNNNNGSSGESSERLEIMKWLDQQPPLSVVFLCFGNWGSFNRDQVREI 354
           PVY VGP+L+L S++ + +S       +I++WLD QPP SVVFLCFG+ GSF   QV+EI
Sbjct: 246 PVYPVGPLLNLKSSDEDKAS-------DILRWLDDQPPFSVVFLCFGSMGSFGEAQVKEI 305

Query: 355 ANALERSGYRFVWSLRQPSREGKIENPNEYDYTKDVVPEGFLDRTAEIGRVIGWAPQVEI 414
           A ALE SG+RF+WSLR+P  +GK   P++Y+  K V+PEGFLDRTA +G+VIGWAPQ  I
Sbjct: 306 ACALEHSGHRFLWSLRRPPPQGKRAMPSDYEDLKTVLPEGFLDRTATVGKVIGWAPQAAI 365

Query: 415 LGHPATGGFVSHCGWNSTLESVWFGVPIGTWAMYAEQQLNAVEMEVELGLAVEISSE--- 474
           LGHPATGGFVSHCGWNSTLES+W GVPI  W +YAEQ LNA ++ VELGLAVEI  +   
Sbjct: 366 LGHPATGGFVSHCGWNSTLESLWNGVPIAAWPLYAEQNLNAFQLVVELGLAVEIKMDYRR 425

Query: 475 --TGVVSAEKIESGIRELMAGDGEVRKLMKMKSEESRRSVMEGGSSFNALNRFIEEV 514
               VVSAE IE GIR +M  D +VRK +K  SE+S++++++GGSS+++L RFI+++
Sbjct: 426 DSDVVVSAEDIERGIRRVMELDSDVRKRVKEMSEKSKKALVDGGSSYSSLGRFIDKI 471

BLAST of Tan0003759 vs. ExPASy Swiss-Prot
Match: Q40284 (Anthocyanidin 3-O-glucosyltransferase 1 OS=Manihot esculenta OX=3983 GN=GT1 PE=2 SV=1)

HSP 1 Score: 409.8 bits (1052), Expect = 4.5e-113
Identity = 221/467 (47.32%), Positives = 309/467 (66.17%), Query Frame = 0

Query: 64  ISHFSSTLQLAQLLLRRDHRLSVTVLL-----IPSPWEPIPAASLQSLHPSRIRFLTLPE 123
           + H  S ++ A+LLL R H LS+TVL+     + S       + + S   +R+RF+ LP 
Sbjct: 1   MGHLVSAVETAKLLLSRCHSLSITVLIFNNSVVTSKVHNYVDSQIAS-SSNRLRFIYLPR 60

Query: 124 HPLPPDADITSLFKSIVETQKQNVRDALAKLP------DSPIIAGFVVDMFCTSMIDVAD 183
                D    S F S++E QK +V++++ K+       +SP + GF+VDMFCT+MIDVA+
Sbjct: 61  -----DETGISSFSSLIEKQKPHVKESVMKITEFGSSVESPRLVGFIVDMFCTAMIDVAN 120

Query: 184 ELGVPSYVFLTSSAGYLSFTSHLQELYDRHNKQSDQFLRSDFQFAVPGFTNPVPGKVIPS 243
           E GVPSY+F TS A +L+F  H+Q+++D  N    +F  SD +  VPG  N  P K +P+
Sbjct: 121 EFGVPSYIFYTSGAAFLNFMLHVQKIHDEENFNPTEFNASDGELQVPGLVNSFPSKAMPT 180

Query: 244 SYFNKNSEWIIHESTRRLRESSGILINTFSELESKILDSFSSSSSYQFPPVYTVGPILSL 303
           +  +K     + E+TRR  E+ G++INTF ELES  ++SF        PP+Y VGPIL +
Sbjct: 181 AILSKQWFPPLLENTRRYGEAKGVIINTFFELESHAIESFKD------PPIYPVGPILDV 240

Query: 304 HSNNNNGSSGESSERLEIMKWLDQQPPLSVVFLCFGNWGSFNRDQVREIANALERSGYRF 363
            SN  N +        EIM+WLD QPP SVVFLCFG+ GSF++DQV+EIA ALE SG+RF
Sbjct: 241 RSNGRNTNQ-------EIMQWLDDQPPSSVVFLCFGSNGSFSKDQVKEIACALEDSGHRF 300

Query: 364 VWSLRQPSREGKIENPNEYDYTKDVVPEGFLDRTAEIGRVIGWAPQVEILGHPATGGFVS 423
           +WSL      G +E+P++Y+  ++V+PEGFL+RT+ I +VIGWAPQV +L HPATGG VS
Sbjct: 301 LWSLADHRAPGFLESPSDYEDLQEVLPEGFLERTSGIEKVIGWAPQVAVLAHPATGGLVS 360

Query: 424 HCGWNSTLESVWFGVPIGTWAMYAEQQLNAVEMEVELGLAVEIS----SETG-VVSAEKI 483
           H GWNS LES+WFGVP+ TW MYAEQQ NA +M +ELGLAVEI     +++G +V  ++I
Sbjct: 361 HSGWNSILESIWFGVPVATWPMYAEQQFNAFQMVIELGLAVEIKMDYRNDSGEIVKCDQI 420

Query: 484 ESGIRELMAGDGEVRKLMKMKSEESRRSVMEGGSSFNALNRFIEEVM 515
           E GIR LM  D + RK +K  SE+SR ++MEGGSS+  L+  I++++
Sbjct: 421 ERGIRCLMKHDSDRRKKVKEMSEKSRGALMEGGSSYCWLDNLIKDMI 448

BLAST of Tan0003759 vs. NCBI nr
Match: AXK92493.1 (flavonoids UDP-glycosyltransferase [Siraitia grosvenorii])

HSP 1 Score: 642.1 bits (1655), Expect = 4.1e-180
Identity = 331/483 (68.53%), Positives = 387/483 (80.12%), Query Frame = 0

Query: 51  MKKMELVFIPWPGISHFSSTLQLAQLLLRRDHRLSVTVLLIPSPWEPIPAASLQSLHP-- 110
           MKK+ELVF+P PGI H S+ LQ+A LLLRRDHRLSVTVL IP PWE       +SL P  
Sbjct: 1   MKKVELVFVPGPGIGHLSTALQIADLLLRRDHRLSVTVLSIPLPWEAKTTTQPESLFPSS 60

Query: 111 -----SRIRFLTLPEHPLPPDADITSLFKSIVETQKQNVRDALAKLPDSPIIAGFVVDMF 170
                SRIRF++LP+ PLP DA     F+++ ETQKQNV++A+AKL DS I+AG V+DMF
Sbjct: 61  TTTTTSRIRFISLPQRPLPDDAKGPFQFQAVFETQKQNVKEAVAKLSDSSILAGLVLDMF 120

Query: 171 CTSMIDVADELGVPSYVFLTSSAGYLSFTSHLQELYDRHNKQSDQFLRSDFQFAVPGFTN 230
           C +M+DVA +LGVPSYVF TSSAGYLSFTSHLQ+L DRH K++ Q +RSD + AVPGFTN
Sbjct: 121 CVTMVDVAKQLGVPSYVFFTSSAGYLSFTSHLQDLSDRHGKETQQLMRSDVEIAVPGFTN 180

Query: 231 PVPGKVIPSSYFNKN-SEWIIHESTRRLRESSGILINTFSELESKILDSFS-SSSSYQFP 290
           PVPGKVIP  YFNKN +EW +H+  RR RE++GIL+NTFSELES+++DSFS ++++ QFP
Sbjct: 181 PVPGKVIPGVYFNKNMAEW-LHDCARRFRETNGILVNTFSELESQVMDSFSDATAASQFP 240

Query: 291 PVYTVGPILSLHSNNNNGSSGESSERLEIMKWLDQQPPLSVVFLCFGNWGSFNRDQVREI 350
            VY VGPILSL+ N +  SS ES    EI+KWLDQQPP SVVFLCFG+ GS N DQ REI
Sbjct: 241 AVYAVGPILSLNKNTSAASS-ESQSGDEILKWLDQQPPSSVVFLCFGSKGSLNPDQAREI 300

Query: 351 ANALERSGYRFVWSLRQPSREGKIENPNEYDYTKDVVPEGFLDRTAEIGRVIGWAPQVEI 410
           A+ALERSG+RFVWSLRQPS +GK E P EYD  +DV+PEGFLDRTAE+GRVIGWAPQVEI
Sbjct: 301 AHALERSGHRFVWSLRQPSPKGKFEKPIEYDNIEDVLPEGFLDRTAEMGRVIGWAPQVEI 360

Query: 411 LGHPATGGFVSHCGWNSTLESVWFGVPIGTWAMYAEQQLNAVEMEVELGLAVEISSETG- 470
           LGHPATGGFVSHCGWNSTLES+W+GVPI TW MYAEQ  NA EM VELGLAV ISSE+  
Sbjct: 361 LGHPATGGFVSHCGWNSTLESLWYGVPIATWPMYAEQHFNAFEMGVELGLAVGISSESSI 420

Query: 471 ----VVSAEKIESGIRELM-----AGDGEVRKLMKMKSEESRRSVMEGGSSFNALNRFIE 515
               +VSAEKIE GIR+LM      G GEVRKL+K KSEESR+SVMEGGSSF +LNRFI+
Sbjct: 421 EEGVIVSAEKIEEGIRKLMGGGGGGGGGEVRKLVKAKSEESRKSVMEGGSSFTSLNRFID 480

BLAST of Tan0003759 vs. NCBI nr
Match: 7BV3_A (Chain A, Glycosyltransferase [Siraitia grosvenorii] >7BV3_B Chain B, Glycosyltransferase [Siraitia grosvenorii])

HSP 1 Score: 638.3 bits (1645), Expect = 5.9e-179
Identity = 329/481 (68.40%), Positives = 385/481 (80.04%), Query Frame = 0

Query: 53  KMELVFIPWPGISHFSSTLQLAQLLLRRDHRLSVTVLLIPSPWEPIPAASLQSLHP---- 112
           K+ELVF+P PGI H S+ LQ+A LLLRRDHRLSVTVL IP PWE       +SL P    
Sbjct: 1   KVELVFVPGPGIGHLSTALQIADLLLRRDHRLSVTVLSIPLPWEAKTTTQPESLFPSSTT 60

Query: 113 ---SRIRFLTLPEHPLPPDADITSLFKSIVETQKQNVRDALAKLPDSPIIAGFVVDMFCT 172
              SRIRF++LP+ PLP DA     F+++ ETQKQNV++A+AKL DS I+AG V+DMFC 
Sbjct: 61  TTTSRIRFISLPQRPLPDDAKGPFQFQAVFETQKQNVKEAVAKLSDSSILAGLVLDMFCV 120

Query: 173 SMIDVADELGVPSYVFLTSSAGYLSFTSHLQELYDRHNKQSDQFLRSDFQFAVPGFTNPV 232
           +M+DVA +LGVPSYVF TSSAGYLSFTSHLQ+L DRH K++ Q +RSD + AVPGFTNPV
Sbjct: 121 TMVDVAKQLGVPSYVFFTSSAGYLSFTSHLQDLSDRHGKETQQLMRSDVEIAVPGFTNPV 180

Query: 233 PGKVIPSSYFNKN-SEWIIHESTRRLRESSGILINTFSELESKILDSFS-SSSSYQFPPV 292
           PGKVIP  YFNKN +EW +H+  RR RE++GIL+NTFSELES+++DSFS ++++ QFP V
Sbjct: 181 PGKVIPGVYFNKNMAEW-LHDCARRFRETNGILVNTFSELESQVMDSFSDATAASQFPAV 240

Query: 293 YTVGPILSLHSNNNNGSSGESSERLEIMKWLDQQPPLSVVFLCFGNWGSFNRDQVREIAN 352
           Y VGPILSL+ N +  SS ES    EI+KWLDQQPP SVVFLCFG+ GS N DQ REIA+
Sbjct: 241 YAVGPILSLNKNTSAASS-ESQSGDEILKWLDQQPPSSVVFLCFGSKGSLNPDQAREIAH 300

Query: 353 ALERSGYRFVWSLRQPSREGKIENPNEYDYTKDVVPEGFLDRTAEIGRVIGWAPQVEILG 412
           ALERSG+RFVWSLRQPS +GK E P EYD  +DV+PEGFLDRTAE+GRVIGWAPQVEILG
Sbjct: 301 ALERSGHRFVWSLRQPSPKGKFEKPIEYDNIEDVLPEGFLDRTAEMGRVIGWAPQVEILG 360

Query: 413 HPATGGFVSHCGWNSTLESVWFGVPIGTWAMYAEQQLNAVEMEVELGLAVEISSETG--- 472
           HPATGGFVSHCGWNSTLES+W+GVPI TW MYAEQ  NA EM VELGLAV ISSE+    
Sbjct: 361 HPATGGFVSHCGWNSTLESLWYGVPIATWPMYAEQHFNAFEMGVELGLAVGISSESSIEE 420

Query: 473 --VVSAEKIESGIRELM-----AGDGEVRKLMKMKSEESRRSVMEGGSSFNALNRFIEEV 515
             +VSAEKIE GIR+LM      G GEVRKL+K KSEESR+SVMEGGSSF +LNRFI+EV
Sbjct: 421 GVIVSAEKIEEGIRKLMGGGGGGGGGEVRKLVKAKSEESRKSVMEGGSSFTSLNRFIDEV 479

BLAST of Tan0003759 vs. NCBI nr
Match: XP_038896118.1 (anthocyanidin 3-O-glucosyltransferase 2-like isoform X1 [Benincasa hispida])

HSP 1 Score: 615.5 bits (1586), Expect = 4.1e-172
Identity = 317/479 (66.18%), Positives = 380/479 (79.33%), Query Frame = 0

Query: 51  MKKMELVFIPWPGISHFSSTLQLAQLLLRRDHRLSVTVLLIPSPWEPIPAASLQSLHPSR 110
           M K+ELVFI WP I H S+ L LA LLLRR+H LS+TVL+IP PWE I    LQSL PS 
Sbjct: 1   MNKIELVFIAWPDIGHLSAALHLAHLLLRRNHHLSITVLIIPPPWETITTTQLQSLLPSS 60

Query: 111 ------IRFLTLPEHPLPPDADITSLFKSIVETQKQNVRDALAKL-PDSPIIAGFVVDMF 170
                 I  + LP+ PLP +A+  SL K+ +ETQKQNV D +AKL  +S ++AGF++D+F
Sbjct: 61  TADPIPIPIIILPQIPLPQNAEFISLIKTTIETQKQNVIDTVAKLISNSTVLAGFILDIF 120

Query: 171 CTSMIDVADELGVPSYVFLTSSAGYLSFTSHLQELYDR-HNKQSDQFLRSDFQFAVPGFT 230
           CT MIDVA++LGVP+Y+F TSSA  LS T HLQ LYD   N+++ Q L  + +  +PGF 
Sbjct: 121 CTDMIDVANQLGVPTYLFSTSSAANLSLTLHLQHLYDHPQNEETHQSLNPNVEIPIPGFA 180

Query: 231 NPVPGKVIPSSYFNKNSEWIIHESTRRLRESSGILINTFSELESKILDSFSSSS-SYQFP 290
           NP+PGK IPS+YF++N++W IHESTRR RES+GILINTFSELES +L++FS +S S   P
Sbjct: 181 NPIPGKAIPSAYFDENAKW-IHESTRRFRESNGILINTFSELESNVLEAFSEASISSHLP 240

Query: 291 PVYTVGPILSLHSNNNNGSSGESSERLEIMKWLDQQPPLSVVFLCFGNWGSFNRDQVREI 350
           PVY VGPIL+L+ N+       S E  EI+KWLD+QP  SVVFLCFG+ GSFN+DQV EI
Sbjct: 241 PVYAVGPILNLNKNS-------SGEGFEILKWLDRQPFQSVVFLCFGSRGSFNQDQVNEI 300

Query: 351 ANALERSGYRFVWSLRQPSREGKIENPNEYDYTKDVVPEGFLDRTAEIGRVIGWAPQVEI 410
           A ALERSGY+FVWSLRQPS EG+ +NP   DY K+VVPEGFLDRTAEIGRVIGWAPQVEI
Sbjct: 301 AEALERSGYQFVWSLRQPSSEGEFQNP---DYVKEVVPEGFLDRTAEIGRVIGWAPQVEI 360

Query: 411 LGHPATGGFVSHCGWNSTLESVWFGVPIGTWAMYAEQQLNAVEMEVELGLAVEI-SSETG 470
           LGHPATGGFVSHCGWNS LES+WFGVPIGTWAMYAEQ LNA EMEVELGLAV I SSE+G
Sbjct: 361 LGHPATGGFVSHCGWNSILESLWFGVPIGTWAMYAEQGLNAAEMEVELGLAVGISSSESG 420

Query: 471 VVSAEKIESGIRELMAGDGEVRKLMKMKSEESRRSVMEGGSSFNALNRFIEEVMAKTAI 520
           VV AEKIESGI+ELM GDGE+RK++KMKSEESR++VME GSSF ALNRFIE+ + ++ +
Sbjct: 421 VVKAEKIESGIKELMGGDGEIRKMVKMKSEESRKTVMENGSSFIALNRFIEKSLQESFV 468

BLAST of Tan0003759 vs. NCBI nr
Match: XP_038896119.1 (anthocyanidin 3-O-glucosyltransferase 2-like isoform X2 [Benincasa hispida])

HSP 1 Score: 614.8 bits (1584), Expect = 7.0e-172
Identity = 317/472 (67.16%), Positives = 376/472 (79.66%), Query Frame = 0

Query: 51  MKKMELVFIPWPGISHFSSTLQLAQLLLRRDHRLSVTVLLIPSPWEPIPAASLQSLHPSR 110
           M K+ELVFI WP I H S+ L LA LLLRR+H LS+TVL+IP PWE I    LQSL PS 
Sbjct: 1   MNKIELVFIAWPDIGHLSAALHLAHLLLRRNHHLSITVLIIPPPWETITTTQLQSLLPSS 60

Query: 111 ------IRFLTLPEHPLPPDADITSLFKSIVETQKQNVRDALAKL-PDSPIIAGFVVDMF 170
                 I  + LP+ PLP +A+  SL K+ +ETQKQNV D +AKL  +S ++AGF++D+F
Sbjct: 61  TADPIPIPIIILPQIPLPQNAEFISLIKTTIETQKQNVIDTVAKLISNSTVLAGFILDIF 120

Query: 171 CTSMIDVADELGVPSYVFLTSSAGYLSFTSHLQELYDR-HNKQSDQFLRSDFQFAVPGFT 230
           CT MIDVA++LGVP+Y+F TSSA  LS T HLQ LYD   N+++ Q L  + +  +PGF 
Sbjct: 121 CTDMIDVANQLGVPTYLFSTSSAANLSLTLHLQHLYDHPQNEETHQSLNPNVEIPIPGFA 180

Query: 231 NPVPGKVIPSSYFNKNSEWIIHESTRRLRESSGILINTFSELESKILDSFSSSS-SYQFP 290
           NP+PGK IPS+YF++N++W IHESTRR RES+GILINTFSELES +L++FS +S S   P
Sbjct: 181 NPIPGKAIPSAYFDENAKW-IHESTRRFRESNGILINTFSELESNVLEAFSEASISSHLP 240

Query: 291 PVYTVGPILSLHSNNNNGSSGESSERLEIMKWLDQQPPLSVVFLCFGNWGSFNRDQVREI 350
           PVY VGPIL+L+ N+       S E  EI+KWLD+QP  SVVFLCFG+ GSFN+DQV EI
Sbjct: 241 PVYAVGPILNLNKNS-------SGEGFEILKWLDRQPFQSVVFLCFGSRGSFNQDQVNEI 300

Query: 351 ANALERSGYRFVWSLRQPSREGKIENPNEYDYTKDVVPEGFLDRTAEIGRVIGWAPQVEI 410
           A ALERSGY+FVWSLRQPS EG+ +NP   DY K+VVPEGFLDRTAEIGRVIGWAPQVEI
Sbjct: 301 AEALERSGYQFVWSLRQPSSEGEFQNP---DYVKEVVPEGFLDRTAEIGRVIGWAPQVEI 360

Query: 411 LGHPATGGFVSHCGWNSTLESVWFGVPIGTWAMYAEQQLNAVEMEVELGLAVEI-SSETG 470
           LGHPATGGFVSHCGWNS LES+WFGVPIGTWAMYAEQ LNA EMEVELGLAV I SSE+G
Sbjct: 361 LGHPATGGFVSHCGWNSILESLWFGVPIGTWAMYAEQGLNAAEMEVELGLAVGISSSESG 420

Query: 471 VVSAEKIESGIRELMAGDGEVRKLMKMKSEESRRSVMEGGSSFNALNRFIEE 513
           VV AEKIESGI+ELM GDGE+RK++KMKSEESR++VME GSSF ALNRFIE+
Sbjct: 421 VVKAEKIESGIKELMGGDGEIRKMVKMKSEESRKTVMENGSSFIALNRFIEK 461

BLAST of Tan0003759 vs. NCBI nr
Match: XP_004146062.2 (anthocyanidin 3-O-glucosyltransferase 2 [Cucumis sativus] >KGN54986.1 hypothetical protein Csa_012670 [Cucumis sativus])

HSP 1 Score: 586.3 bits (1510), Expect = 2.7e-163
Identity = 313/477 (65.62%), Positives = 370/477 (77.57%), Query Frame = 0

Query: 53  KMELVFIPWPGISHFSSTLQLAQLLLRRDHRLSVTVLLIPSPWEPIPAASLQSLHP-SRI 112
           KMEL+FI WP I H S+TL LA LL+RR+HRLSVT  +IP P   I +  L SL P S I
Sbjct: 25  KMELIFIAWPDIGHLSATLHLADLLIRRNHRLSVTFFIIPPPSHTITSTKLHSLLPSSTI 84

Query: 113 RFLTLPE-HPLPPDADITSLFKSIVETQKQNVRDALAKL----PDSP-IIAGFVVDMFCT 172
             + LP+  PLP      SL K+ ++TQKQNV  A+A L    PDSP ++AGFV+DMFCT
Sbjct: 85  PIIILPQIPPLPHHPQFISLIKTTIQTQKQNVFHAVADLISNSPDSPTVLAGFVLDMFCT 144

Query: 173 SMIDVADELGVPSYVFLTSSAGYLSFTSHLQELYDRHNKQSDQFLRSDFQFAVPGFTNPV 232
            MIDVA++LGVPSY+F TSSA  LS T HLQ LYDR    + Q L  D Q  +PGF NPV
Sbjct: 145 PMIDVANQLGVPSYLFSTSSAANLSLTLHLQHLYDR----THQSLNPDVQIPIPGFVNPV 204

Query: 233 PGKVIPSSYFNKNSEWIIHESTRRLRESSGILINTFSELESKILDSFS-SSSSYQFPPVY 292
             K IP++YF++N++W IHES RR  ES+GILINTFSELES ++++F+ SSSS  FPPVY
Sbjct: 205 TAKAIPTAYFDENAKW-IHESVRRFGESNGILINTFSELESNVIEAFADSSSSSTFPPVY 264

Query: 293 TVGPILSLHSNNNNGSSGESSERLEIMKWLDQQPPLSVVFLCFGNWGSFNRDQVREIANA 352
            VGPIL+L+ N+       SSE  EI+KWLD+QP  SVVFLCFG+ GSF RDQV+EIA A
Sbjct: 265 AVGPILNLNKNS-------SSEGYEILKWLDEQPFQSVVFLCFGSRGSFGRDQVKEIAEA 324

Query: 353 LERSGYRFVWSLRQPSREGKIENPNEYDYTKDVVPEGFLDRTAEIGRVIGWAPQVEILGH 412
           LERSGYRFVWSLR+PS EG+I+N    DY K+VVPEGFLDRTA +GRVIGWAPQ++IL H
Sbjct: 325 LERSGYRFVWSLREPSSEGEIQNT---DYIKEVVPEGFLDRTAGMGRVIGWAPQMKILEH 384

Query: 413 PATGGFVSHCGWNSTLESVWFGVPIGTWAMYAEQQLNAVEMEVELGLAVEISSET--GVV 472
           PATGGFVSHCGWNS LES+WFGVPIG WAMYAEQ LNAVEM VELGLAVEIS+ET  G+V
Sbjct: 385 PATGGFVSHCGWNSILESLWFGVPIGAWAMYAEQGLNAVEMGVELGLAVEISTETGQGIV 444

Query: 473 SAEKIESGIRELMAGDGEVRKLMKMKSEESRRSVMEGGSSFNALNRFIEEVMAKTAI 520
            AEKIESGI+E+M GDGE+RK++KMKSEESR+SVME GSSF ALNRFIE V+AK  +
Sbjct: 445 RAEKIESGIKEVMKGDGEIRKMVKMKSEESRKSVMENGSSFTALNRFIEVVIAKAKL 486

BLAST of Tan0003759 vs. ExPASy TrEMBL
Match: A0A346A6C4 (Glycosyltransferase OS=Siraitia grosvenorii OX=190515 GN=UGT74AC2 PE=2 SV=1)

HSP 1 Score: 642.1 bits (1655), Expect = 2.0e-180
Identity = 331/483 (68.53%), Positives = 387/483 (80.12%), Query Frame = 0

Query: 51  MKKMELVFIPWPGISHFSSTLQLAQLLLRRDHRLSVTVLLIPSPWEPIPAASLQSLHP-- 110
           MKK+ELVF+P PGI H S+ LQ+A LLLRRDHRLSVTVL IP PWE       +SL P  
Sbjct: 1   MKKVELVFVPGPGIGHLSTALQIADLLLRRDHRLSVTVLSIPLPWEAKTTTQPESLFPSS 60

Query: 111 -----SRIRFLTLPEHPLPPDADITSLFKSIVETQKQNVRDALAKLPDSPIIAGFVVDMF 170
                SRIRF++LP+ PLP DA     F+++ ETQKQNV++A+AKL DS I+AG V+DMF
Sbjct: 61  TTTTTSRIRFISLPQRPLPDDAKGPFQFQAVFETQKQNVKEAVAKLSDSSILAGLVLDMF 120

Query: 171 CTSMIDVADELGVPSYVFLTSSAGYLSFTSHLQELYDRHNKQSDQFLRSDFQFAVPGFTN 230
           C +M+DVA +LGVPSYVF TSSAGYLSFTSHLQ+L DRH K++ Q +RSD + AVPGFTN
Sbjct: 121 CVTMVDVAKQLGVPSYVFFTSSAGYLSFTSHLQDLSDRHGKETQQLMRSDVEIAVPGFTN 180

Query: 231 PVPGKVIPSSYFNKN-SEWIIHESTRRLRESSGILINTFSELESKILDSFS-SSSSYQFP 290
           PVPGKVIP  YFNKN +EW +H+  RR RE++GIL+NTFSELES+++DSFS ++++ QFP
Sbjct: 181 PVPGKVIPGVYFNKNMAEW-LHDCARRFRETNGILVNTFSELESQVMDSFSDATAASQFP 240

Query: 291 PVYTVGPILSLHSNNNNGSSGESSERLEIMKWLDQQPPLSVVFLCFGNWGSFNRDQVREI 350
            VY VGPILSL+ N +  SS ES    EI+KWLDQQPP SVVFLCFG+ GS N DQ REI
Sbjct: 241 AVYAVGPILSLNKNTSAASS-ESQSGDEILKWLDQQPPSSVVFLCFGSKGSLNPDQAREI 300

Query: 351 ANALERSGYRFVWSLRQPSREGKIENPNEYDYTKDVVPEGFLDRTAEIGRVIGWAPQVEI 410
           A+ALERSG+RFVWSLRQPS +GK E P EYD  +DV+PEGFLDRTAE+GRVIGWAPQVEI
Sbjct: 301 AHALERSGHRFVWSLRQPSPKGKFEKPIEYDNIEDVLPEGFLDRTAEMGRVIGWAPQVEI 360

Query: 411 LGHPATGGFVSHCGWNSTLESVWFGVPIGTWAMYAEQQLNAVEMEVELGLAVEISSETG- 470
           LGHPATGGFVSHCGWNSTLES+W+GVPI TW MYAEQ  NA EM VELGLAV ISSE+  
Sbjct: 361 LGHPATGGFVSHCGWNSTLESLWYGVPIATWPMYAEQHFNAFEMGVELGLAVGISSESSI 420

Query: 471 ----VVSAEKIESGIRELM-----AGDGEVRKLMKMKSEESRRSVMEGGSSFNALNRFIE 515
               +VSAEKIE GIR+LM      G GEVRKL+K KSEESR+SVMEGGSSF +LNRFI+
Sbjct: 421 EEGVIVSAEKIEEGIRKLMGGGGGGGGGEVRKLVKAKSEESRKSVMEGGSSFTSLNRFID 480

BLAST of Tan0003759 vs. ExPASy TrEMBL
Match: A0A0A0L4D9 (Glycosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_4G618510 PE=3 SV=1)

HSP 1 Score: 586.3 bits (1510), Expect = 1.3e-163
Identity = 313/477 (65.62%), Positives = 370/477 (77.57%), Query Frame = 0

Query: 53  KMELVFIPWPGISHFSSTLQLAQLLLRRDHRLSVTVLLIPSPWEPIPAASLQSLHP-SRI 112
           KMEL+FI WP I H S+TL LA LL+RR+HRLSVT  +IP P   I +  L SL P S I
Sbjct: 25  KMELIFIAWPDIGHLSATLHLADLLIRRNHRLSVTFFIIPPPSHTITSTKLHSLLPSSTI 84

Query: 113 RFLTLPE-HPLPPDADITSLFKSIVETQKQNVRDALAKL----PDSP-IIAGFVVDMFCT 172
             + LP+  PLP      SL K+ ++TQKQNV  A+A L    PDSP ++AGFV+DMFCT
Sbjct: 85  PIIILPQIPPLPHHPQFISLIKTTIQTQKQNVFHAVADLISNSPDSPTVLAGFVLDMFCT 144

Query: 173 SMIDVADELGVPSYVFLTSSAGYLSFTSHLQELYDRHNKQSDQFLRSDFQFAVPGFTNPV 232
            MIDVA++LGVPSY+F TSSA  LS T HLQ LYDR    + Q L  D Q  +PGF NPV
Sbjct: 145 PMIDVANQLGVPSYLFSTSSAANLSLTLHLQHLYDR----THQSLNPDVQIPIPGFVNPV 204

Query: 233 PGKVIPSSYFNKNSEWIIHESTRRLRESSGILINTFSELESKILDSFS-SSSSYQFPPVY 292
             K IP++YF++N++W IHES RR  ES+GILINTFSELES ++++F+ SSSS  FPPVY
Sbjct: 205 TAKAIPTAYFDENAKW-IHESVRRFGESNGILINTFSELESNVIEAFADSSSSSTFPPVY 264

Query: 293 TVGPILSLHSNNNNGSSGESSERLEIMKWLDQQPPLSVVFLCFGNWGSFNRDQVREIANA 352
            VGPIL+L+ N+       SSE  EI+KWLD+QP  SVVFLCFG+ GSF RDQV+EIA A
Sbjct: 265 AVGPILNLNKNS-------SSEGYEILKWLDEQPFQSVVFLCFGSRGSFGRDQVKEIAEA 324

Query: 353 LERSGYRFVWSLRQPSREGKIENPNEYDYTKDVVPEGFLDRTAEIGRVIGWAPQVEILGH 412
           LERSGYRFVWSLR+PS EG+I+N    DY K+VVPEGFLDRTA +GRVIGWAPQ++IL H
Sbjct: 325 LERSGYRFVWSLREPSSEGEIQNT---DYIKEVVPEGFLDRTAGMGRVIGWAPQMKILEH 384

Query: 413 PATGGFVSHCGWNSTLESVWFGVPIGTWAMYAEQQLNAVEMEVELGLAVEISSET--GVV 472
           PATGGFVSHCGWNS LES+WFGVPIG WAMYAEQ LNAVEM VELGLAVEIS+ET  G+V
Sbjct: 385 PATGGFVSHCGWNSILESLWFGVPIGAWAMYAEQGLNAVEMGVELGLAVEISTETGQGIV 444

Query: 473 SAEKIESGIRELMAGDGEVRKLMKMKSEESRRSVMEGGSSFNALNRFIEEVMAKTAI 520
            AEKIESGI+E+M GDGE+RK++KMKSEESR+SVME GSSF ALNRFIE V+AK  +
Sbjct: 445 RAEKIESGIKEVMKGDGEIRKMVKMKSEESRKSVMENGSSFTALNRFIEVVIAKAKL 486

BLAST of Tan0003759 vs. ExPASy TrEMBL
Match: A0A5A7VHR6 (Glycosyltransferase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold384G002810 PE=3 SV=1)

HSP 1 Score: 574.7 bits (1480), Expect = 3.9e-160
Identity = 312/476 (65.55%), Positives = 365/476 (76.68%), Query Frame = 0

Query: 51  MKKMELVFIPWPGISHFSSTLQLAQLLLRRDHRLSVTVLLIPSPWEPIPAASLQSLHP-S 110
           M K+EL+FI WP I H S+TL LA LLLRR+ RLSVT  +IP P + I +  L SL P S
Sbjct: 1   MDKIELIFIAWPDIGHLSATLHLADLLLRRNQRLSVTFFIIPPPSQTITSTQLHSLLPSS 60

Query: 111 RIRFLTLPE-HPLPPDADITSLFKSIVETQKQN----VRDALAKLPDS-PIIAGFVVDMF 170
            I  + LP+  PLP      SL K+ ++TQKQN    V D L+  PDS  ++AGFV+DMF
Sbjct: 61  TIPIIVLPQIPPLPHHPQFISLIKTTIQTQKQNVLRAVADHLSNSPDSNTVLAGFVLDMF 120

Query: 171 CTSMIDVADELGVPSYVFLTSSAGYLSFTSHLQELYDRHNKQSDQFLRSDFQFAVPGFTN 230
           CT MIDVA++LGVPSY+F TSSA  LS   HLQ LYD H  QS   L  D Q  +PGF N
Sbjct: 121 CTPMIDVANQLGVPSYLFSTSSAANLSLALHLQHLYD-HTHQS---LNPDVQIPIPGFAN 180

Query: 231 PVPGKVIPSSYFNKNSEWIIHESTRRLRESSGILINTFSELESKILDSFS-SSSSYQFPP 290
           PV  K IP++YF++N++W IHESTRR  ES+GILINTFSELES +LD+FS SSSS  FPP
Sbjct: 181 PVTAKAIPTAYFDENAKW-IHESTRRFGESNGILINTFSELESNVLDAFSDSSSSSTFPP 240

Query: 291 VYTVGPILSLHSNNNNGSSGESSERLEIMKWLDQQPPLSVVFLCFGNWGSFNRDQVREIA 350
           VY VGPIL+++ ++       SSE  EI+KWLDQQP  SVVFLCFG+ GSF RDQV+EIA
Sbjct: 241 VYAVGPILNMNKDS-------SSEGYEILKWLDQQPFQSVVFLCFGSRGSFGRDQVKEIA 300

Query: 351 NALERSGYRFVWSLRQPSREGKIENPNEYDYTKDVVPEGFLDRTAEIGRVIGWAPQVEIL 410
            ALE+SGYRFVWSLRQPS EG+I+   + DY K+VVPEGFLDRTA IGRVIGWAPQ++IL
Sbjct: 301 EALEQSGYRFVWSLRQPSSEGEIQ---KTDYIKEVVPEGFLDRTAGIGRVIGWAPQMKIL 360

Query: 411 GHPATGGFVSHCGWNSTLESVWFGVPIGTWAMYAEQQLNAVEMEVELGLAVEISSET--G 470
            HPATGGFVSHCGWNS LES+WFGVPIG WAMY EQ LNAVEM VELGLAVEI++ET  G
Sbjct: 361 EHPATGGFVSHCGWNSILESLWFGVPIGAWAMYGEQGLNAVEMGVELGLAVEITAETGHG 420

Query: 471 VVSAEKIESGIRELMAGDGEVRKLMKMKSEESRRSVMEGGSSFNALNRFIEEVMAK 517
           VV AEKIESGI+E+M GDGE+RK +KMK EESR+SVME GSSF ALNRFIE V+AK
Sbjct: 421 VVRAEKIESGIKEVMKGDGEIRKTVKMKREESRKSVMENGSSFTALNRFIEVVIAK 461

BLAST of Tan0003759 vs. ExPASy TrEMBL
Match: A0A1S3CNM1 (Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103502510 PE=3 SV=1)

HSP 1 Score: 574.7 bits (1480), Expect = 3.9e-160
Identity = 312/476 (65.55%), Positives = 365/476 (76.68%), Query Frame = 0

Query: 51  MKKMELVFIPWPGISHFSSTLQLAQLLLRRDHRLSVTVLLIPSPWEPIPAASLQSLHP-S 110
           M K+EL+FI WP I H S+TL LA LLLRR+ RLSVT  +IP P + I +  L SL P S
Sbjct: 1   MDKIELIFIAWPDIGHLSATLHLADLLLRRNQRLSVTFFIIPPPSQTITSTQLHSLLPSS 60

Query: 111 RIRFLTLPE-HPLPPDADITSLFKSIVETQKQN----VRDALAKLPDS-PIIAGFVVDMF 170
            I  + LP+  PLP      SL K+ ++TQKQN    V D L+  PDS  ++AGFV+DMF
Sbjct: 61  TIPIIVLPQIPPLPHHPQFISLIKTTIQTQKQNVLRAVADHLSNSPDSNTVLAGFVLDMF 120

Query: 171 CTSMIDVADELGVPSYVFLTSSAGYLSFTSHLQELYDRHNKQSDQFLRSDFQFAVPGFTN 230
           CT MIDVA++LGVPSY+F TSSA  LS   HLQ LYD H  QS   L  D Q  +PGF N
Sbjct: 121 CTPMIDVANQLGVPSYLFSTSSAANLSLALHLQHLYD-HTHQS---LNPDVQIPIPGFAN 180

Query: 231 PVPGKVIPSSYFNKNSEWIIHESTRRLRESSGILINTFSELESKILDSFS-SSSSYQFPP 290
           PV  K IP++YF++N++W IHESTRR  ES+GILINTFSELES +LD+FS SSSS  FPP
Sbjct: 181 PVTAKAIPTAYFDENAKW-IHESTRRFGESNGILINTFSELESNVLDAFSDSSSSSTFPP 240

Query: 291 VYTVGPILSLHSNNNNGSSGESSERLEIMKWLDQQPPLSVVFLCFGNWGSFNRDQVREIA 350
           VY VGPIL+++ ++       SSE  EI+KWLDQQP  SVVFLCFG+ GSF RDQV+EIA
Sbjct: 241 VYAVGPILNMNKDS-------SSEGYEILKWLDQQPFQSVVFLCFGSRGSFGRDQVKEIA 300

Query: 351 NALERSGYRFVWSLRQPSREGKIENPNEYDYTKDVVPEGFLDRTAEIGRVIGWAPQVEIL 410
            ALE+SGYRFVWSLRQPS EG+I+   + DY K+VVPEGFLDRTA IGRVIGWAPQ++IL
Sbjct: 301 EALEQSGYRFVWSLRQPSSEGEIQ---KTDYIKEVVPEGFLDRTAGIGRVIGWAPQMKIL 360

Query: 411 GHPATGGFVSHCGWNSTLESVWFGVPIGTWAMYAEQQLNAVEMEVELGLAVEISSET--G 470
            HPATGGFVSHCGWNS LES+WFGVPIG WAMY EQ LNAVEM VELGLAVEI++ET  G
Sbjct: 361 EHPATGGFVSHCGWNSILESLWFGVPIGAWAMYGEQGLNAVEMGVELGLAVEITAETGHG 420

Query: 471 VVSAEKIESGIRELMAGDGEVRKLMKMKSEESRRSVMEGGSSFNALNRFIEEVMAK 517
           VV AEKIESGI+E+M GDGE+RK +KMK EESR+SVME GSSF ALNRFIE V+AK
Sbjct: 421 VVRAEKIESGIKEVMKGDGEIRKTVKMKREESRKSVMENGSSFTALNRFIEVVIAK 461

BLAST of Tan0003759 vs. ExPASy TrEMBL
Match: A0A1S4E4T0 (Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103502510 PE=3 SV=1)

HSP 1 Score: 555.1 bits (1429), Expect = 3.2e-154
Identity = 302/474 (63.71%), Positives = 354/474 (74.68%), Query Frame = 0

Query: 51  MKKMELVFIPWPGISHFSSTLQLAQLLLRRDHRLSVTVLLIPSPWEPIPAASLQSLHP-S 110
           M K+EL+FI WP I H S+TL LA LLLRR+ RLSVT  +IP P + I +  L SL P S
Sbjct: 1   MDKIELIFIAWPDIGHLSATLHLADLLLRRNQRLSVTFFIIPPPSQTITSTQLHSLLPSS 60

Query: 111 RIRFLTLPE-HPLPPDADITSLFKSIVETQKQN----VRDALAKLPDS-PIIAGFVVDMF 170
            I  + LP+  PLP      SL K+ ++TQKQN    V D L+  PDS  ++AGFV+DMF
Sbjct: 61  TIPIIVLPQIPPLPHHPQFISLIKTTIQTQKQNVLRAVADHLSNSPDSNTVLAGFVLDMF 120

Query: 171 CTSMIDVADELGVPSYVFLTSSAGYLSFTSHLQELYDRHNKQSDQFLRSDFQFAVPGFTN 230
           CT MIDVA++LGVPSY+F TSSA  LS   HLQ LYD H  QS   L  D Q  +PGF N
Sbjct: 121 CTPMIDVANQLGVPSYLFSTSSAANLSLALHLQHLYD-HTHQS---LNPDVQIPIPGFAN 180

Query: 231 PVPGKVIPSSYFNKNSEWIIHESTRRLRESSGILINTFSELESKILDSFS-SSSSYQFPP 290
           PV  K IP++YF++N++W IHESTRR  ES+GILINTFSELES +LD+FS SSSS  FPP
Sbjct: 181 PVTAKAIPTAYFDENAKW-IHESTRRFGESNGILINTFSELESNVLDAFSDSSSSSTFPP 240

Query: 291 VYTVGPILSLHSNNNNGSSGESSERLEIMKWLDQQPPLSVVFLCFGNWGSFNRDQVREIA 350
           VY VGPIL+++ ++       SSE  EI+KWLDQQP  SVVFLCFG+ GSF RDQV+EIA
Sbjct: 241 VYAVGPILNMNKDS-------SSEGYEILKWLDQQPFQSVVFLCFGSRGSFGRDQVKEIA 300

Query: 351 NALERSGYRFVWSLRQPSREGKIENPNEYDYTKDVVPEGFLDRTAEIGRVIGWAPQVEIL 410
            ALE+SGYRFVWSLRQPS EG+I+   + DY K+VVPEGFLDRTA IGRVIGWAPQ++IL
Sbjct: 301 EALEQSGYRFVWSLRQPSSEGEIQ---KTDYIKEVVPEGFLDRTAGIGRVIGWAPQMKIL 360

Query: 411 GHPATGGFVSHCGWNSTLESVWFGVPIGTWAMYAEQQLNAVEMEVELGLAVEISSETGVV 470
            HPATGGFVSHCGWNS LES+WFGVPIG WAMY EQ LNAVE+  E G         GVV
Sbjct: 361 EHPATGGFVSHCGWNSILESLWFGVPIGAWAMYGEQGLNAVEITAETG--------HGVV 420

Query: 471 SAEKIESGIRELMAGDGEVRKLMKMKSEESRRSVMEGGSSFNALNRFIEEVMAK 517
            AEKIESGI+E+M GDGE+RK +KMK EESR+SVME GSSF ALNRFIE V+AK
Sbjct: 421 RAEKIESGIKEVMKGDGEIRKTVKMKREESRKSVMENGSSFTALNRFIEVVIAK 451

BLAST of Tan0003759 vs. TAIR 10
Match: AT3G21760.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 408.7 bits (1049), Expect = 7.1e-114
Identity = 229/483 (47.41%), Positives = 304/483 (62.94%), Query Frame = 0

Query: 53  KMELVFIPWPGISHFSSTLQLAQLLLRRDHRLSVTVLLIP------SPWEPIPAASLQSL 112
           K+ELVFIP PG  H    +++A+L + RD  LS+T+++IP      S       ASL S 
Sbjct: 2   KLELVFIPSPGDGHLRPLVEVAKLHVDRDDHLSITIIIIPQMHGFSSSNSSSYIASLSSD 61

Query: 113 HPSRIRFLTLPEHPLPPDADITSLFKSIVETQKQNVRDALAKL-----PDSPI-IAGFVV 172
              R+ +  L     P   D    F   ++  K  V+  + KL     PDSP  +AGFVV
Sbjct: 62  SEERLSYNVLSVPDKPDSDDTKPHFFDYIDNFKPQVKATVEKLTDPGPPDSPSRLAGFVV 121

Query: 173 DMFCTSMIDVADELGVPSYVFLTSSAGYLSFTSHLQELYDRHNKQSDQFLRSD-FQFAVP 232
           DMFC  MIDVA+E GVPSY+F TS+A +L    H++ LYD  N        SD  +  VP
Sbjct: 122 DMFCMMMIDVANEFGVPSYMFYTSNATFLGLQVHVEYLYDVKNYDVSDLKDSDTTELEVP 181

Query: 233 GFTNPVPGKVIPSSYFNKNSEWIIHESTRRLRESSGILINTFSELESKILDSFSSSSSYQ 292
             T P+P K  PS    K    ++   TRR RE+ GIL+NTF+ELE + +  FS   S  
Sbjct: 182 CLTRPLPVKCFPSVLLTKEWLPVMFRQTRRFRETKGILVNTFAELEPQAMKFFSGVDS-P 241

Query: 293 FPPVYTVGPILSLHSNNNNGSSGESSERLEIMKWLDQQPPLSVVFLCFGNWGSFNRDQVR 352
            P VYTVGP+++L  N  N S  + S   EI++WLD+QP  SVVFLCFG+ G F   Q +
Sbjct: 242 LPTVYTVGPVMNLKINGPNSSDDKQS---EILRWLDEQPRKSVVFLCFGSMGGFREGQAK 301

Query: 353 EIANALERSGYRFVWSLRQPSREGKIENPNEYDYTKDVVPEGFLDRTAEIGRVIGWAPQV 412
           EIA ALERSG+RFVWSLR+   +G I  P E+   ++++PEGFL+RTAEIG+++GWAPQ 
Sbjct: 302 EIAIALERSGHRFVWSLRRAQPKGSIGPPEEFTNLEEILPEGFLERTAEIGKIVGWAPQS 361

Query: 413 EILGHPATGGFVSHCGWNSTLESVWFGVPIGTWAMYAEQQLNAVEMEVELGLAVEI---- 472
            IL +PA GGFVSHCGWNSTLES+WFGVP+ TW +YAEQQ+NA EM  ELGLAVE+    
Sbjct: 362 AILANPAIGGFVSHCGWNSTLESLWFGVPMATWPLYAEQQVNAFEMVEELGLAVEVRNSF 421

Query: 473 -----SSETGVVSAEKIESGIRELMAGDGEVRKLMKMKSEESRRSVMEGGSSFNALNRFI 514
                +++  +++AE+IE GIR LM  D +VR  +K  SE+S  ++M+GGSS  AL +FI
Sbjct: 422 RGDFMAADDELMTAEEIERGIRCLMEQDSDVRSRVKEMSEKSHVALMDGGSSHVALLKFI 480

BLAST of Tan0003759 vs. TAIR 10
Match: AT3G21790.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 407.9 bits (1047), Expect = 1.2e-113
Identity = 234/492 (47.56%), Positives = 312/492 (63.41%), Query Frame = 0

Query: 53  KMELVFIPWPGISHFSSTLQLAQLLLRRDHRLSVTVLLIPSPWEPIPAAS-----LQSLH 112
           K ELVFIP+PGI H  ST+++A+LL+ R+ RLS++V+++P   E    AS     L +  
Sbjct: 2   KFELVFIPYPGIGHLRSTVEMAKLLVDRETRLSISVIILPFISEGEVGASDYIAALSASS 61

Query: 113 PSRIRFLTLPEHPLPPDADITSLFKSIVETQKQNVRDALAKL-------PDSPIIAGFVV 172
            +R+R+  +      P  ++T++ +  ++ Q+  VR  +AKL       PDSP IAGFV+
Sbjct: 62  NNRLRYEVISAVD-QPTIEMTTI-EIHMKNQEPKVRSTVAKLLEDYSSKPDSPKIAGFVL 121

Query: 173 DMFCTSMIDVADELGVPSYVFLTSSAGYLSFTSHLQELYD--RHNKQSDQFLRSDFQFAV 232
           DMFCTSM+DVA+E G PSY+F TSSAG LS T H+Q L D  +++   + +  S+     
Sbjct: 122 DMFCTSMVDVANEFGFPSYMFYTSSAGILSVTYHVQMLCDENKYDVSENDYADSEAVLNF 181

Query: 233 PGFTNPVPGKVIPSSYFNKNSEWIIHESTRRLRESSGILINTFSELESKILDSFSSSSSY 292
           P  + P P K +P +        +     R+ RE  GIL+NT +ELE  +L   SSS + 
Sbjct: 182 PSLSRPYPVKCLPHALAANMWLPVFVNQARKFREMKGILVNTVAELEPYVLKFLSSSDT- 241

Query: 293 QFPPVYTVGPILSLHSNNNNGSSGESSERLEIMKWLDQQPPLSVVFLCFGNWGSFNRDQV 352
             PPVY VGP+L L    N     +  +RLEI++WLDQQPP SVVFLCFG+ G F  +QV
Sbjct: 242 --PPVYPVGPLLHL---ENQRDDSKDEKRLEIIRWLDQQPPSSVVFLCFGSMGGFGEEQV 301

Query: 353 REIANALERSGYRFVWSLRQPSREGKIENPNEYDYTKDVVPEGFLDRTAEIGRVIGWAPQ 412
           REIA ALERSG+RF+WSLR+ S     E P E+   ++V+PEGF DRT +IG+VIGWAPQ
Sbjct: 302 REIAIALERSGHRFLWSLRRASPNIFKELPGEFTNLEEVLPEGFFDRTKDIGKVIGWAPQ 361

Query: 413 VEILGHPATGGFVSHCGWNSTLESVWFGVPIGTWAMYAEQQLNAVEMEVELGLAVEISS- 472
           V +L +PA GGFV+HCGWNSTLES+WFGVP   W +YAEQ+ NA  M  ELGLAVEI   
Sbjct: 362 VAVLANPAIGGFVTHCGWNSTLESLWFGVPTAAWPLYAEQKFNAFLMVEELGLAVEIRKY 421

Query: 473 ---------ETGVVSAEKIESGIRELMAGDGEVRKLMKMKSEESRRSVMEGGSSFNALNR 521
                     T  V+AE+IE  I  LM  D +VRK +K  SE+   ++M+GGSS  AL +
Sbjct: 422 WRGEHLAGLPTATVTAEEIEKAIMCLMEQDSDVRKRVKDMSEKCHVALMDGGSSRTALQK 481

BLAST of Tan0003759 vs. TAIR 10
Match: AT3G21750.1 (UDP-glucosyl transferase 71B1 )

HSP 1 Score: 391.0 bits (1003), Expect = 1.5e-108
Identity = 208/480 (43.33%), Positives = 298/480 (62.08%), Query Frame = 0

Query: 53  KMELVFIPWPGISHFSSTLQLAQLLLRRDHRLSVTVLLIPSPWEPIPAASLQSLHPSRIR 112
           K+ELVFIP PG+ H  +T  LA+LL+  D+RLSVT+++IPS      ++S+ +    R+R
Sbjct: 2   KVELVFIPSPGVGHIRATTALAKLLVASDNRLSVTLIVIPSRVSDDASSSVYTNSEDRLR 61

Query: 113 FLTLPEHPLPPDADITSLFKSIVETQKQNVRDALAKLP------DSPIIAGFVVDMFCTS 172
           ++ LP        D T+   S +++QK  VR  ++K+           +AG VVDMFCTS
Sbjct: 62  YILLPAR------DQTTDLVSYIDSQKPQVRAVVSKVAGDVSTRSDSRLAGIVVDMFCTS 121

Query: 173 MIDVADELGVPSYVFLTSSAGYLSFTSHLQELYDRHNKQSDQFLRSDFQFAVPGFTNPVP 232
           MID+ADE  + +Y+F TS+A YL    H+Q LYD       +F  ++ +F VP  T P P
Sbjct: 122 MIDIADEFNLSAYIFYTSNASYLGLQFHVQSLYDEKELDVSEFKDTEMKFDVPTLTQPFP 181

Query: 233 GKVIPSSYFNKNSEWIIHESTRRLRESSGILINTFSELESKILDSFS-SSSSYQFPPVYT 292
            K +PS   NK     +    R  R + GIL+N+ +++E + L  FS  + +   PPVY 
Sbjct: 182 AKCLPSVMLNKKWFPYVLGRARSFRATKGILVNSVADMEPQALSFFSGGNGNTNIPPVYA 241

Query: 293 VGPILSLHSNNNNGSSGESSERLEIMKWLDQQPPLSVVFLCFGNWGSFNRDQVREIANAL 352
           VGPI+ L       SSG+  +R EI+ WL +QP  SVVFLCFG+ G F+ +Q REIA AL
Sbjct: 242 VGPIMDLE------SSGDEEKRKEILHWLKEQPTKSVVFLCFGSMGGFSEEQAREIAVAL 301

Query: 353 ERSGYRFVWSLRQPSREGKIEN--PNEYDYTKDVVPEGFLDRTAEIGRVIGWAPQVEILG 412
           ERSG+RF+WSLR+ S  G   N  P E+   ++++P+GFLDRT EIG++I WAPQV++L 
Sbjct: 302 ERSGHRFLWSLRRASPVGNKSNPPPGEFTNLEEILPKGFLDRTVEIGKIISWAPQVDVLN 361

Query: 413 HPATGGFVSHCGWNSTLESVWFGVPIGTWAMYAEQQLNAVEMEVELGLAVEIS------- 472
            PA G FV+HCGWNS LES+WFGVP+  W +YAEQQ NA  M  ELGLA E+        
Sbjct: 362 SPAIGAFVTHCGWNSILESLWFGVPMAAWPIYAEQQFNAFHMVDELGLAAEVKKEYRRDF 421

Query: 473 --SETGVVSAEKIESGIRELMAGDGEVRKLMKMKSEESRRSVMEGGSSFNALNRFIEEVM 515
              E  +V+A++IE GI+  M  D ++RK +    ++   ++++GGSS  AL +F+++V+
Sbjct: 422 LVEEPEIVTADEIERGIKCAMEQDSKMRKRVMEMKDKLHVALVDGGSSNCALKKFVQDVV 469

BLAST of Tan0003759 vs. TAIR 10
Match: AT3G21780.1 (UDP-glucosyl transferase 71B6 )

HSP 1 Score: 386.7 bits (992), Expect = 2.9e-107
Identity = 227/486 (46.71%), Positives = 306/486 (62.96%), Query Frame = 0

Query: 53  KMELVFIPWPGISHFSSTLQLAQLLLRRDHRLSVTVLLIP-SPWEPIPAASLQSLHPSRI 112
           K+ELVFIP P ISH  +T+++A+ L+ ++  LS+TV++I  S        SL S +  R 
Sbjct: 2   KIELVFIPSPAISHLMATVEMAEQLVDKNDNLSITVIIISFSSKNTSMITSLTSNNRLRY 61

Query: 113 RFLTLPEHPLPPDADITSLFKSIVETQKQNVRDALAK-----LPDSPIIAGFVVDMFCTS 172
             ++  +   P +   T    S +++ K  VRDA+AK     LPD+P +AGFVVDM+CTS
Sbjct: 62  EIISGGDQQ-PTELKATD---SHIQSLKPLVRDAVAKLVDSTLPDAPRLAGFVVDMYCTS 121

Query: 173 MIDVADELGVPSYVFLTSSAGYLSFTSHLQELYDRHN-KQSDQFLRSDFQFAVPGFTNPV 232
           MIDVA+E GVPSY+F TS+AG+L    H+Q +YD  +     +   SD +  VP  T+P 
Sbjct: 122 MIDVANEFGVPSYLFYTSNAGFLGLLLHIQFMYDAEDIYDMSELEDSDVELVVPSLTSPY 181

Query: 233 PGKVIPSSYFNKNSEWIIHEST--RRLRESSGILINTFSELESKILDSFSSSSSYQFPPV 292
           P K +P  Y  K+ EW+    T  RR RE+ GIL+NT  +LE + L   S+ +    P  
Sbjct: 182 PLKCLP--YIFKSKEWLTFFVTQARRFRETKGILVNTVPDLEPQALTFLSNGN---IPRA 241

Query: 293 YTVGPILSLHSNNNNGSSGESSERLEIMKWLDQQPPLSVVFLCFGNWGSFNRDQVREIAN 352
           Y VGP+L L + N +    + S   EI++WLD+QPP SVVFLCFG+ G F+ +QVRE A 
Sbjct: 242 YPVGPLLHLKNVNCDYVDKKQS---EILRWLDEQPPRSVVFLCFGSMGGFSEEQVRETAL 301

Query: 353 ALERSGYRFVWSLRQPSREGKIENPNEYDYTKDVVPEGFLDRTAEIGRVIGWAPQVEILG 412
           AL+RSG+RF+WSLR+ S     E P E+   ++++PEGF DRTA  G+VIGWA QV IL 
Sbjct: 302 ALDRSGHRFLWSLRRASPNILREPPGEFTNLEEILPEGFFDRTANRGKVIGWAEQVAILA 361

Query: 413 HPATGGFVSHCGWNSTLESVWFGVPIGTWAMYAEQQLNAVEMEVELGLAVEIS------- 472
            PA GGFVSH GWNSTLES+WFGVP+  W +YAEQ+ NA EM  ELGLAVEI        
Sbjct: 362 KPAIGGFVSHGGWNSTLESLWFGVPMAIWPLYAEQKFNAFEMVEELGLAVEIKKHWRGDL 421

Query: 473 --SETGVVSAEKIESGIRELMAGDGEVRKLMKMKSEESRRSVMEGGSSFNALNRFIEEVM 521
               + +V+AE+IE GI  LM  D +VRK +   SE+   ++M+GGSS  AL RFI++V 
Sbjct: 422 LLGRSEIVTAEEIEKGIICLMEQDSDVRKRVNEISEKCHVALMDGGSSETALKRFIQDVT 475

BLAST of Tan0003759 vs. TAIR 10
Match: AT4G15280.1 (UDP-glucosyl transferase 71B5 )

HSP 1 Score: 379.0 bits (972), Expect = 6.0e-105
Identity = 212/479 (44.26%), Positives = 297/479 (62.00%), Query Frame = 0

Query: 53  KMELVFIPWPGISHFSSTLQLAQLLLRRDHRLSVTVLLIPSPWEPIPA----ASLQSL-H 112
           K+ELVFIP PGI H   T++LA+ L+  ++RLS+T+++IPS ++   A    ASL +L  
Sbjct: 2   KIELVFIPLPGIGHLRPTVKLAKQLIGSENRLSITIIIIPSRFDAGDASACIASLTTLSQ 61

Query: 113 PSRIRFLTLPEHPLPPDADITSLFKSI-VETQKQNVRDALAKLPDSPI--IAGFVVDMFC 172
             R+ + ++     PP +D   +   + +E QK  VRDA+A     P   +AGFVVDMFC
Sbjct: 62  DDRLHYESISVAKQPPTSDPDPVPAQVYIEKQKTKVRDAVAARIVDPTRKLAGFVVDMFC 121

Query: 173 TSMIDVADELGVPSYVFLTSSAGYLSFTSHLQELYDRHNKQSDQFLRSDFQFAVPGFTNP 232
           +SMIDVA+E GVP Y+  TS+A +L    H+Q++YD+      +   S  +   P  T P
Sbjct: 122 SSMIDVANEFGVPCYMVYTSNATFLGTMLHVQQMYDQKKYDVSELENSVTELEFPSLTRP 181

Query: 233 VPGKVIPSSYFNKNSEWIIHESTRRLRESSGILINTFSELESKILDSFSSSSSYQFPPVY 292
            P K +P    +K    +     R  R+  GIL+NT +ELE   L  F+ +     P VY
Sbjct: 182 YPVKCLPHILTSKEWLPLSLAQARCFRKMKGILVNTVAELEPHALKMFNINGD-DLPQVY 241

Query: 293 TVGPILSLHSNNNNGSSGESSERLEIMKWLDQQPPLSVVFLCFGNWGSFNRDQVREIANA 352
            VGP+L L + N+     +  ++ EI++WLD+QP  SVVFLCFG+ G F  +Q RE A A
Sbjct: 242 PVGPVLHLENGND-----DDEKQSEILRWLDEQPSKSVVFLCFGSLGGFTEEQTRETAVA 301

Query: 353 LERSGYRFVWSLRQPSREGKIENPNEYDYTKDVVPEGFLDRTAEIGRVIGWAPQVEILGH 412
           L+RSG RF+W LR  S   K + P +Y   ++V+PEGFL+RT + G+VIGWAPQV +L  
Sbjct: 302 LDRSGQRFLWCLRHASPNIKTDRPRDYTNLEEVLPEGFLERTLDRGKVIGWAPQVAVLEK 361

Query: 413 PATGGFVSHCGWNSTLESVWFGVPIGTWAMYAEQQLNAVEMEVELGLAVEI--------- 472
           PA GGFV+HCGWNS LES+WFGVP+ TW +YAEQ++NA EM  ELGLAVEI         
Sbjct: 362 PAIGGFVTHCGWNSILESLWFGVPMVTWPLYAEQKVNAFEMVEELGLAVEIRKYLKGDLF 421

Query: 473 SSETGVVSAEKIESGIRELMAGDGEVRKLMKMKSEESRRSVMEGGSSFNALNRFIEEVM 515
           + E   V+AE IE  IR +M  D +VR  +K  +E+   ++M+GGSS  AL +FI++V+
Sbjct: 422 AGEMETVTAEDIERAIRRVMEQDSDVRNNVKEMAEKCHFALMDGGSSKAALEKFIQDVI 474

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q66PF3 | 2.7e-126 | 50.84 | Putative UDP-glucose flavonoid 3-O-glucosyltransferase 3 OS=Fragaria ananassa OX...
Q2V6K0 | 2.6e-121 | 49.48 | UDP-glucose flavonoid 3-O-glucosyltransferase 6 OS=Fragaria ananassa OX=3747 GN=...
D3UAG1 | 6.5e-120 | 49.06 | UDP-glycosyltransferase 71A16 OS=Pyrus communis OX=23211 GN=UGT71A16 PE=1 SV=1
D3THI6 | 2.1e-118 | 49.06 | UDP-glycosyltransferase 71A15 OS=Malus domestica OX=3750 GN=UGT71A15 PE=1 SV=1
Q40284 | 4.5e-113 | 47.32 | Anthocyanidin 3-O-glucosyltransferase 1 OS=Manihot esculenta OX=3983 GN=GT1 PE=2...
Match Name | E-value | Identity (%) | Description
AXK92493.1 | 4.1e-180 | 68.53 | flavonoids UDP-glycosyltransferase [Siraitia grosvenorii]
7BV3_A | 5.9e-179 | 68.40 | Chain A, Glycosyltransferase [Siraitia grosvenorii] >7BV3_B Chain B, Glycosyltra...
XP_038896118.1 | 4.1e-172 | 66.18 | anthocyanidin 3-O-glucosyltransferase 2-like isoform X1 [Benincasa hispida]
XP_038896119.1 | 7.0e-172 | 67.16 | anthocyanidin 3-O-glucosyltransferase 2-like isoform X2 [Benincasa hispida]
XP_004146062.2 | 2.7e-163 | 65.62 | anthocyanidin 3-O-glucosyltransferase 2 [Cucumis sativus] >KGN54986.1 hypothetic...
Match Name | E-value | Identity (%) | Description
A0A346A6C4 | 2.0e-180 | 68.53 | Glycosyltransferase OS=Siraitia grosvenorii OX=190515 GN=UGT74AC2 PE=2 SV=1
A0A0A0L4D9 | 1.3e-163 | 65.62 | Glycosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_4G618510 PE=3 SV=1
A0A5A7VHR6 | 3.9e-160 | 65.55 | Glycosyltransferase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold384G...
A0A1S3CNM1 | 3.9e-160 | 65.55 | Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103502510 PE=3 SV=1
A0A1S4E4T0 | 3.2e-154 | 63.71 | Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103502510 PE=3 SV=1
Match Name | E-value | Identity (%) | Description
AT3G21760.1 | 7.1e-114 | 47.41 | UDP-Glycosyltransferase superfamily protein
AT3G21790.1 | 1.2e-113 | 47.56 | UDP-Glycosyltransferase superfamily protein
AT3G21750.1 | 1.5e-108 | 43.33 | UDP-glucosyl transferase 71B1
AT3G21780.1 | 2.9e-107 | 46.71 | UDP-glucosyl transferase 71B6
AT4G15280.1 | 6.0e-105 | 44.26 | UDP-glucosyl transferase 71B5
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | GENE3D | 3.40.50.2000 | Glycogen Phosphorylase B; | coord: 57..506; e-value: 1.7E-135; score: 454.6
None | No IPR available | GENE3D | 3.40.50.2000 | Glycogen Phosphorylase B; | coord: 297..494; e-value: 1.7E-135; score: 454.6
None | No IPR available | PANTHER | PTHR48048 | GLYCOSYLTRANSFERASE | coord: 51..516
None | No IPR available | PANTHER | PTHR48048:SF57 | GLYCOSYLTRANSFERASE | coord: 51..516
None | No IPR available | SUPERFAMILY | 53756 | UDP-Glycosyltransferase/glycogen phosphorylase | coord: 54..514
IPR002213 | UDP-glucuronosyl/UDP-glucosyltransferase | PFAM | PF00201 | UDPGT | coord: 322..486; e-value: 1.6E-21; score: 76.7
IPR002213 | UDP-glucuronosyl/UDP-glucosyltransferase | CDD | cd03784 | GT1_Gtf-like | coord: 54..512; e-value: 1.66678E-63; score: 210.871
IPR035595 | UDP-glycosyltransferase family, conserved site | PROSITE | PS00375 | UDPGT | coord: 395..438

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Tan0003759.1 | Tan0003759.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
cellular_component | GO:0009507 | chloroplast
molecular_function | GO:0008194 | UDP-glycosyltransferase activity