Cla97C02G045320 (gene) Watermelon (97103) v2

NameCla97C02G045320
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionUDP-glycosyltransferase
LocationCla97Chr02 : 33376124 .. 33377620 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAACTTATCCCTATCCCATCATTACTCGTTTCTTCTTCTTCTTCAACAAATTCTCTTAAAATGGATTCCACCATTTCTGAAACTCCTCAACTCACCCACCTCGCCGCACTACCCTACCCCGGCAGAGGCCATATCAACGCCCTTATGAACTTCTGCAAGCTCCTCTCTCTCAAAAATCCCAACATTCTCATCTCCTTCATCGTCTCCGACGAGTGGTTCACCTTCCTCGCCGCCGATCCCAAACCCCCAAATCTCCATTTCGCCACTTTCCCAAATCTCATCCCCTCCGAGCTCGGCCGCGCCAAGGACTTCCCCGGTTTCTTCCGATCAGTCAACACCGTAATGGAGCCTCCGATTCAGACTCTGCTCACCCACCTCCACCCCCCGCCGTCCATTGTCGTCGCTGATTCCTTCCTCACGTGGGCCGTCCGGTTGGGCAACCGCCTCAACATTCCGGTGGCTTCCTTCTGGCCAATGTCCGCTACAGTTCTCTCCATTTATTATCATTTCGATTTTCTTAAAGAAAATGGACATTTCCCCGCCGATCTATCAGGTACTGCAAAAATTTCCCCTGTTCTGAGTTCTTTAATTTATGGGAAAAAATCTTTGACGGGAAATCTGTTGTTTCGTTTTCTTTAAGAGTGTGGTGAAGAGATTGTGGATTACATTCCCGGAGTTTCCGAGACACGTCTTGCAGATCTTCCCACTTTCTTCTCCGGCGATGGCTATGAAGTCATCGATTTAACATTGGAAGCTGCGCGATCTATTGACAAAGCTCAATTTCTCATCTCCACCTCTGTTTACGAGCTCGAATCCTCAGTAATCGATGTTTTGAAACTGAAATTTCCCTTTCCAGCTTACACAATCGGGCCCTGTACACCGTATTTCGACGCCCCAAACGGCTGCACCGACGACTATCTCCGATGGCTGGATTCCCAAACAGAGGGCTCTGTTTTGTACATTTCGCAGGGGAGTTTTCTTTCAGTTTCAAGCGCCCAAATGGACGAGATCGTAGCCGGCGTGAAAGCTAGTGGCGTTCGATTCTTGTGGGTGGCGCGTGGGAACGACGGCCAGTTGAAGGGTGTGGATAGAGAAACGGGAATGGTGGTTGGATGGTGCGATCAATTGAGGGTTCTGTGCCATAGCGCCATCGGAGGGTTTTGGACTCACGCCGGTTGGAATTCGACTCTGGAAGGGGTTTTTGCCGGCGTTTCTATGCTGACGTGGCCGATATTCTGCGATCAAGTTCCGAACAGTAAGAAGATTGTGGAGGACTGGAAAGTTGGGGTCCGATTTAAAGCAGTTGGGGGTAGGGATTTGGTGAGGAGAGAGGAAATTGCAGAGTTTGTGAAGAGATTTATGAACTCAGAGAGCGTTGAAGGGAGGAAGATGAGGAACAGAGTGTCGGAATTGCGAGGGATTTGCCGGCGAGCGGTGGCGAAAGGTGGTTCCTCTCATTCCAATATTGATGCATTTCTCGACCACATAATATAA

mRNA sequence

ATGAAACTTATCCCTATCCCATCATTACTCGTTTCTTCTTCTTCTTCAACAAATTCTCTTAAAATGGATTCCACCATTTCTGAAACTCCTCAACTCACCCACCTCGCCGCACTACCCTACCCCGGCAGAGGCCATATCAACGCCCTTATGAACTTCTGCAAGCTCCTCTCTCTCAAAAATCCCAACATTCTCATCTCCTTCATCGTCTCCGACGAGTGGTTCACCTTCCTCGCCGCCGATCCCAAACCCCCAAATCTCCATTTCGCCACTTTCCCAAATCTCATCCCCTCCGAGCTCGGCCGCGCCAAGGACTTCCCCGGTTTCTTCCGATCAGTCAACACCGTAATGGAGCCTCCGATTCAGACTCTGCTCACCCACCTCCACCCCCCGCCGTCCATTGTCGTCGCTGATTCCTTCCTCACGTGGGCCGTCCGGTTGGGCAACCGCCTCAACATTCCGGTGGCTTCCTTCTGGCCAATGTCCGCTACAGTTCTCTCCATTTATTATCATTTCGATTTTCTTAAAGAAAATGGACATTTCCCCGCCGATCTATCAGAGTGTGGTGAAGAGATTGTGGATTACATTCCCGGAGTTTCCGAGACACGTCTTGCAGATCTTCCCACTTTCTTCTCCGGCGATGGCTATGAAGTCATCGATTTAACATTGGAAGCTGCGCGATCTATTGACAAAGCTCAATTTCTCATCTCCACCTCTGTTTACGAGCTCGAATCCTCAGTAATCGATGTTTTGAAACTGAAATTTCCCTTTCCAGCTTACACAATCGGGCCCTGTACACCGTATTTCGACGCCCCAAACGGCTGCACCGACGACTATCTCCGATGGCTGGATTCCCAAACAGAGGGCTCTGTTTTGTACATTTCGCAGGGGAGTTTTCTTTCAGTTTCAAGCGCCCAAATGGACGAGATCGTAGCCGGCGTGAAAGCTAGTGGCGTTCGATTCTTGTGGGTGGCGCGTGGGAACGACGGCCAGTTGAAGGGTGTGGATAGAGAAACGGGAATGGTGGTTGGATGGTGCGATCAATTGAGGGTTCTGTGCCATAGCGCCATCGGAGGGTTTTGGACTCACGCCGGTTGGAATTCGACTCTGGAAGGGGTTTTTGCCGGCGTTTCTATGCTGACGTGGCCGATATTCTGCGATCAAGTTCCGAACAGTAAGAAGATTGTGGAGGACTGGAAAGTTGGGGTCCGATTTAAAGCAGTTGGGGGTAGGGATTTGGTGAGGAGAGAGGAAATTGCAGAGTTTGTGAAGAGATTTATGAACTCAGAGAGCGTTGAAGGGAGGAAGATGAGGAACAGAGTGTCGGAATTGCGAGGGATTTGCCGGCGAGCGGTGGCGAAAGGTGGTTCCTCTCATTCCAATATTGATGCATTTCTCGACCACATAATATAA

Coding sequence (CDS)

ATGAAACTTATCCCTATCCCATCATTACTCGTTTCTTCTTCTTCTTCAACAAATTCTCTTAAAATGGATTCCACCATTTCTGAAACTCCTCAACTCACCCACCTCGCCGCACTACCCTACCCCGGCAGAGGCCATATCAACGCCCTTATGAACTTCTGCAAGCTCCTCTCTCTCAAAAATCCCAACATTCTCATCTCCTTCATCGTCTCCGACGAGTGGTTCACCTTCCTCGCCGCCGATCCCAAACCCCCAAATCTCCATTTCGCCACTTTCCCAAATCTCATCCCCTCCGAGCTCGGCCGCGCCAAGGACTTCCCCGGTTTCTTCCGATCAGTCAACACCGTAATGGAGCCTCCGATTCAGACTCTGCTCACCCACCTCCACCCCCCGCCGTCCATTGTCGTCGCTGATTCCTTCCTCACGTGGGCCGTCCGGTTGGGCAACCGCCTCAACATTCCGGTGGCTTCCTTCTGGCCAATGTCCGCTACAGTTCTCTCCATTTATTATCATTTCGATTTTCTTAAAGAAAATGGACATTTCCCCGCCGATCTATCAGAGTGTGGTGAAGAGATTGTGGATTACATTCCCGGAGTTTCCGAGACACGTCTTGCAGATCTTCCCACTTTCTTCTCCGGCGATGGCTATGAAGTCATCGATTTAACATTGGAAGCTGCGCGATCTATTGACAAAGCTCAATTTCTCATCTCCACCTCTGTTTACGAGCTCGAATCCTCAGTAATCGATGTTTTGAAACTGAAATTTCCCTTTCCAGCTTACACAATCGGGCCCTGTACACCGTATTTCGACGCCCCAAACGGCTGCACCGACGACTATCTCCGATGGCTGGATTCCCAAACAGAGGGCTCTGTTTTGTACATTTCGCAGGGGAGTTTTCTTTCAGTTTCAAGCGCCCAAATGGACGAGATCGTAGCCGGCGTGAAAGCTAGTGGCGTTCGATTCTTGTGGGTGGCGCGTGGGAACGACGGCCAGTTGAAGGGTGTGGATAGAGAAACGGGAATGGTGGTTGGATGGTGCGATCAATTGAGGGTTCTGTGCCATAGCGCCATCGGAGGGTTTTGGACTCACGCCGGTTGGAATTCGACTCTGGAAGGGGTTTTTGCCGGCGTTTCTATGCTGACGTGGCCGATATTCTGCGATCAAGTTCCGAACAGTAAGAAGATTGTGGAGGACTGGAAAGTTGGGGTCCGATTTAAAGCAGTTGGGGGTAGGGATTTGGTGAGGAGAGAGGAAATTGCAGAGTTTGTGAAGAGATTTATGAACTCAGAGAGCGTTGAAGGGAGGAAGATGAGGAACAGAGTGTCGGAATTGCGAGGGATTTGCCGGCGAGCGGTGGCGAAAGGTGGTTCCTCTCATTCCAATATTGATGCATTTCTCGACCACATAATATAA

Protein sequence

MKLIPIPSLLVSSSSSTNSLKMDSTISETPQLTHLAALPYPGRGHINALMNFCKLLSLKNPNILISFIVSDEWFTFLAADPKPPNLHFATFPNLIPSELGRAKDFPGFFRSVNTVMEPPIQTLLTHLHPPPSIVVADSFLTWAVRLGNRLNIPVASFWPMSATVLSIYYHFDFLKENGHFPADLSECGEEIVDYIPGVSETRLADLPTFFSGDGYEVIDLTLEAARSIDKAQFLISTSVYELESSVIDVLKLKFPFPAYTIGPCTPYFDAPNGCTDDYLRWLDSQTEGSVLYISQGSFLSVSSAQMDEIVAGVKASGVRFLWVARGNDGQLKGVDRETGMVVGWCDQLRVLCHSAIGGFWTHAGWNSTLEGVFAGVSMLTWPIFCDQVPNSKKIVEDWKVGVRFKAVGGRDLVRREEIAEFVKRFMNSESVEGRKMRNRVSELRGICRRAVAKGGSSHSNIDAFLDHII
BLAST of Cla97C02G045320 vs. NCBI nr
Match: XP_008465275.1 (PREDICTED: UDP-glycosyltransferase 87A1-like [Cucumis melo])

HSP 1 Score: 787.7 bits (2033), Expect = 2.1e-224
Identity = 376/448 (83.93%), Postives = 407/448 (90.85%), Query Frame = 0

Query: 22  MDSTISETPQL-THLAALPYPGRGHINALMNFCKLLSLKNPNILISFIVSDEWFTFLAAD 81
           MDSTIS+  QL THLAALPYPGRGHINALMNFCKLL LKNPNILISFIVS+EW +FLAAD
Sbjct: 1   MDSTISKASQLITHLAALPYPGRGHINALMNFCKLLFLKNPNILISFIVSEEWLSFLAAD 60

Query: 82  PKPPNLHFATFPNLIPSELGRAKDFPGFFRSVNTVMEPPIQTLLTHLHPPPSIVVADSFL 141
           PKPPNLHF+TFPN+IPSELGRA D+PGFFRSVNT++E PI TLLTHL+PPPSI+VAD FL
Sbjct: 61  PKPPNLHFSTFPNIIPSELGRANDYPGFFRSVNTILESPIHTLLTHLNPPPSIIVADPFL 120

Query: 142 TWAVRLGNRLNIPVASFWPMSATVLSIYYHFDFLKENGHFPADLSECGEEIVDYIPGVSE 201
           +WAV L NRLNIPVASFWPMS TVLS YYHF+ L+ENGHFPA+LSE GEEIVDYIPGVS 
Sbjct: 121 SWAVPLANRLNIPVASFWPMSVTVLSCYYHFNLLEENGHFPANLSERGEEIVDYIPGVSH 180

Query: 202 TRLADLPTFFSGDGYEVIDLTLEAARSIDKAQFLISTSVYELESSVIDVLKLKFPFPAYT 261
           TRLAD PTFFSGDG+E+ DLT EAARSIDKAQFLISTSVYELE SV+DV KLK PFP YT
Sbjct: 181 TRLADFPTFFSGDGHEIRDLTFEAARSIDKAQFLISTSVYELEPSVVDVFKLKCPFPVYT 240

Query: 262 IGPCTPYFDAPNGCTDDYLRWLDSQTEGSVLYISQGSFLSVSSAQMDEIVAGVKASGVRF 321
           IGPCTPYF+ PNGCTD+YL+WLDSQT+ SVLYIS GSFLSVSSAQMDEIVAGV+ASGVRF
Sbjct: 241 IGPCTPYFETPNGCTDEYLQWLDSQTDCSVLYISHGSFLSVSSAQMDEIVAGVEASGVRF 300

Query: 322 LWVARGNDGQLKGVDRETGMVVGWCDQLRVLCHSAIGGFWTHAGWNSTLEGVFAGVSMLT 381
           LWVARGND +LKGVDRE GMVV WCDQL+VLCHSA+GGFWTH GWNST+EGVFAGV MLT
Sbjct: 301 LWVARGNDSRLKGVDREMGMVVRWCDQLKVLCHSAVGGFWTHCGWNSTMEGVFAGVPMLT 360

Query: 382 WPIFCDQVPNSKKIVEDWKVGVRFKAVGGRDLVRREEIAEFVKRFMNSESVEGRKMRNRV 441
           WPIF DQVPN KKIVEDWKVGVR  AVGG DLVRREEIA+ VKRFMNSESVEGRKMRNRV
Sbjct: 361 WPIFLDQVPNRKKIVEDWKVGVRVNAVGGNDLVRREEIAKLVKRFMNSESVEGRKMRNRV 420

Query: 442 SELRGICRRAVAKGGSSHSNIDAFLDHI 469
           SEL+ ICRRAV +GGSSHSN+DAF+  I
Sbjct: 421 SELKDICRRAVVEGGSSHSNMDAFIGRI 448

BLAST of Cla97C02G045320 vs. NCBI nr
Match: XP_004143221.2 (PREDICTED: UDP-glycosyltransferase 87A1-like [Cucumis sativus] >KGN47052.1 hypothetical protein Csa_6G181570 [Cucumis sativus])

HSP 1 Score: 780.8 bits (2015), Expect = 2.6e-222
Identity = 370/447 (82.77%), Postives = 406/447 (90.83%), Query Frame = 0

Query: 22  MDSTISETPQLTHLAALPYPGRGHINALMNFCKLLSLKNPNILISFIVSDEWFTFLAADP 81
           MDSTI +  QLTHLAALPYPGRGHINAL+NFCKLLS KNPNILISFIVSDEW + LAADP
Sbjct: 1   MDSTIPKPSQLTHLAALPYPGRGHINALINFCKLLSRKNPNILISFIVSDEWLSLLAADP 60

Query: 82  KPPNLHFATFPNLIPSELGRAKDFPGFFRSVNTVMEPPIQTLLTHLHPPPSIVVADSFLT 141
           KPPNLHF+TFPN+IPSE GRA DFPGFFRSVNT+ME PI TLLTHL+PPPSI+VADSF++
Sbjct: 61  KPPNLHFSTFPNIIPSEHGRANDFPGFFRSVNTIMESPIHTLLTHLNPPPSIIVADSFVS 120

Query: 142 WAVRLGNRLNIPVASFWPMSATVLSIYYHFDFLKENGHFPADLSECGEEIVDYIPGVSET 201
           WAV L NRLNIPVASFWPMS TVLS+YYHF+ L+ENGHFPADLSE GEEIVDYIPGVS+T
Sbjct: 121 WAVPLANRLNIPVASFWPMSVTVLSMYYHFNLLQENGHFPADLSERGEEIVDYIPGVSDT 180

Query: 202 RLADLPTFFSGDGYEVIDLTLEAARSIDKAQFLISTSVYELESSVIDVLKLKFPFPAYTI 261
           RLADLPTFFSGDG+EV+DLT++AARSIDKAQFLISTSVYELE SVID  KLKFPFP YTI
Sbjct: 181 RLADLPTFFSGDGHEVVDLTVKAARSIDKAQFLISTSVYELEPSVIDAFKLKFPFPVYTI 240

Query: 262 GPCTPYFDAPNGCTDDYLRWLDSQTEGSVLYISQGSFLSVSSAQMDEIVAGVKASGVRFL 321
           GPCTPYF+  N CTD+Y +WLDSQTE SVLYISQGSFLSVSS+QM+EIVAGVKASGVRFL
Sbjct: 241 GPCTPYFETTNSCTDEYFQWLDSQTECSVLYISQGSFLSVSSSQMEEIVAGVKASGVRFL 300

Query: 322 WVARGNDGQLKGVDRETGMVVGWCDQLRVLCHSAIGGFWTHAGWNSTLEGVFAGVSMLTW 381
           WVARGNDG+LK VDRE G+VV WCDQL+VLCHSA+GGFWTH GWNST+EGVFAGV MLTW
Sbjct: 301 WVARGNDGRLKDVDREMGVVVRWCDQLKVLCHSAVGGFWTHCGWNSTMEGVFAGVPMLTW 360

Query: 382 PIFCDQVPNSKKIVEDWKVGVRFKAVGGRDLVRREEIAEFVKRFMNSESVEGRKMRNRVS 441
           PIFCDQVPN KKIVE+WKVGVR +AVGG+DLVRREEIA FVKRFM +ESVEGRKMR R S
Sbjct: 361 PIFCDQVPNRKKIVEEWKVGVRVEAVGGKDLVRREEIANFVKRFMKTESVEGRKMRKRAS 420

Query: 442 ELRGICRRAVAKGGSSHSNIDAFLDHI 469
           EL+ ICR AV +GGSS SN+DAF+  I
Sbjct: 421 ELQDICRGAVEEGGSSSSNMDAFIGRI 447

BLAST of Cla97C02G045320 vs. NCBI nr
Match: XP_022944194.1 (UDP-glycosyltransferase 87A1-like [Cucurbita moschata])

HSP 1 Score: 767.7 bits (1981), Expect = 2.3e-218
Identity = 374/454 (82.38%), Postives = 404/454 (88.99%), Query Frame = 0

Query: 22  MDSTISETPQLTHLAALPYPGRGHINALMNFCKLLSLKNPNILISFIVSDEWFTFLAADP 81
           M  T S T +LTHLAA+PYPGRGHINALMN CKLLSLKN  +LISFIV+DEW TFLAA P
Sbjct: 1   MAPTTSSTRRLTHLAAVPYPGRGHINALMNLCKLLSLKNSGVLISFIVTDEWRTFLAAVP 60

Query: 82  KPPNLHFATFPNLIPSELGRAKDFPGFFRSVNTVMEPPIQTLLTHLHPPPSIVVADSFLT 141
           KP N+ F TFPN+IPSELGRA DFPGFFRSVNTVME P+Q LLTHLHPPP ++VADSFLT
Sbjct: 61  KPQNIQFLTFPNVIPSELGRADDFPGFFRSVNTVMEAPVQNLLTHLHPPPVVIVADSFLT 120

Query: 142 WAVRLGNRLNIPVASFWPMSATVLSIYYHFDFLKENGHFPADLSECGEEIVDYIPGVSET 201
           WA RLGNRLNIPVASFWPMS TVLSIYYHFD LKENGHFPADLSE G EIVDYIPGVSET
Sbjct: 121 WATRLGNRLNIPVASFWPMSVTVLSIYYHFDLLKENGHFPADLSERGGEIVDYIPGVSET 180

Query: 202 RLADLPTFFSGDGYEVIDLTLEAARSIDKAQFLISTSVYELESSVIDVLKLKFPFPAYTI 261
           RL+DLPTFFSGDG +VIDLTLEAARSIDKAQFLISTSVYELESSVIDVLK KFPFP YT+
Sbjct: 181 RLSDLPTFFSGDGRKVIDLTLEAARSIDKAQFLISTSVYELESSVIDVLKQKFPFPVYTV 240

Query: 262 GPCTPYFD----APN---GCTDDYLRWLDSQTEGSVLYISQGSFLSVSSAQMDEIVAGVK 321
           GPCTPYF+    APN   G T++YLRWLDSQ+EGSVLY+SQGSFLSVSSAQMDEIVAGVK
Sbjct: 241 GPCTPYFELGTSAPNGGRGGTNNYLRWLDSQSEGSVLYVSQGSFLSVSSAQMDEIVAGVK 300

Query: 322 ASGVRFLWVARGNDGQLKGVDRETGMVVGWCDQLRVLCHSAIGGFWTHAGWNSTLEGVFA 381
           ASGVRFLWVAR +DG+LKGVD E GMVV WC+QLRVLCHSA+GGFWTH GWNSTLEGVFA
Sbjct: 301 ASGVRFLWVARRDDGRLKGVDEEAGMVVEWCEQLRVLCHSAVGGFWTHGGWNSTLEGVFA 360

Query: 382 GVSMLTWPIFCDQVPNSKKIVEDWKVGVRFKAVGGRDLVRREEIAEFVKRFMNSESVEGR 441
           GV MLTWPIFCDQVPNSKKIVEDWK+GVRF+AVGGR+LV + EIAEFVKRFMNSESVEGR
Sbjct: 361 GVPMLTWPIFCDQVPNSKKIVEDWKIGVRFEAVGGRNLVTKVEIAEFVKRFMNSESVEGR 420

Query: 442 KMRNRVSELRGICRRAVAKGGSSHSNIDAFLDHI 469
           +MR  V + R ICRRA A+ GSS +NIDAFL+ I
Sbjct: 421 EMRKNVMKFREICRRAAAEDGSSDANIDAFLNQI 454

BLAST of Cla97C02G045320 vs. NCBI nr
Match: XP_023512982.1 (UDP-glycosyltransferase 87A1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 764.2 bits (1972), Expect = 2.5e-217
Identity = 372/455 (81.76%), Postives = 399/455 (87.69%), Query Frame = 0

Query: 22  MDSTISETPQLTHLAALPYPGRGHINALMNFCKLLSLKNPNILISFIVSDEWFTFLAADP 81
           M  T S T +LTHLAA+PYPGRGHINALMN CKLLSLKNP +LISFIV+DEW TFLAA P
Sbjct: 1   MAPTTSSTRRLTHLAAVPYPGRGHINALMNLCKLLSLKNPGVLISFIVTDEWRTFLAAVP 60

Query: 82  KPPNLHFATFPNLIPSELGRAKDFPGFFRSVNTVMEPPIQTLLTHLHPPPSIVVADSFLT 141
           KP N+ F TFPN+IPSELGRA DFPGFFRSVNTVME P+Q LLTHLHPPP ++VADSFLT
Sbjct: 61  KPQNIQFLTFPNVIPSELGRADDFPGFFRSVNTVMEAPVQNLLTHLHPPPVVIVADSFLT 120

Query: 142 WAVRLGNRLNIPVASFWPMSATVLSIYYHFDFLKENGHFPADLSECGEEIVDYIPGVSET 201
           WA RLGNRLNIPVASFWPMSATVLSIYYHFD LKENGHFPADLSE G EIVDYIPGVSET
Sbjct: 121 WATRLGNRLNIPVASFWPMSATVLSIYYHFDLLKENGHFPADLSERGGEIVDYIPGVSET 180

Query: 202 RLADLPTFFSGDGYEVIDLTLEAARSIDKAQFLISTSVYELESSVIDVLKLKFPFPAYTI 261
           RLADLPTFFSGDG +VIDLTLEAARSIDKAQFLISTSVYELESSVIDVLK KFPFP YT+
Sbjct: 181 RLADLPTFFSGDGRKVIDLTLEAARSIDKAQFLISTSVYELESSVIDVLKQKFPFPVYTV 240

Query: 262 GPCTPYFDAPNGC--------TDDYLRWLDSQTEGSVLYISQGSFLSVSSAQMDEIVAGV 321
           GPCTPYF+  +             YLRWLDSQ+EGSVLY+SQGSFLSVSSAQMDEIVAGV
Sbjct: 241 GPCTPYFELGSSAPXXXXXXXXXXYLRWLDSQSEGSVLYVSQGSFLSVSSAQMDEIVAGV 300

Query: 322 KASGVRFLWVARGNDGQLKGVDRETGMVVGWCDQLRVLCHSAIGGFWTHAGWNSTLEGVF 381
           KASGVRFLWVAR +DG+LKGVD E GMVV WC+QLRVLCHSA+GGFWTH GWNSTLEGVF
Sbjct: 301 KASGVRFLWVARRDDGRLKGVDEEAGMVVEWCEQLRVLCHSAVGGFWTHGGWNSTLEGVF 360

Query: 382 AGVSMLTWPIFCDQVPNSKKIVEDWKVGVRFKAVGGRDLVRREEIAEFVKRFMNSESVEG 441
           AGV MLTWPIFCDQVPNSKKIVEDWK+GVRF+AVGGR+LV + EIAEFVKRFMNSESVEG
Sbjct: 361 AGVPMLTWPIFCDQVPNSKKIVEDWKIGVRFEAVGGRNLVTKVEIAEFVKRFMNSESVEG 420

Query: 442 RKMRNRVSELRGICRRAVAKGGSSHSNIDAFLDHI 469
           R MR  V + R ICRRA A+ GSS +NIDAFL+ I
Sbjct: 421 RAMRKNVMKFREICRRAAAEDGSSDANIDAFLNQI 455

BLAST of Cla97C02G045320 vs. NCBI nr
Match: XP_022986868.1 (UDP-glycosyltransferase 87A1-like [Cucurbita maxima])

HSP 1 Score: 763.8 bits (1971), Expect = 3.3e-217
Identity = 369/454 (81.28%), Postives = 404/454 (88.99%), Query Frame = 0

Query: 22  MDSTISETPQLTHLAALPYPGRGHINALMNFCKLLSLKNPNILISFIVSDEWFTFLAADP 81
           M    S T +LTHLAA+PYPGRGHINA MN CKLLSLKNP++LISFIV+DEW TFLAA P
Sbjct: 1   MAPATSSTRRLTHLAAVPYPGRGHINAFMNLCKLLSLKNPDLLISFIVTDEWRTFLAAVP 60

Query: 82  KPPNLHFATFPNLIPSELGRAKDFPGFFRSVNTVMEPPIQTLLTHLHPPPSIVVADSFLT 141
           KP N+ F TFPN+IPSELGRA DFPGFFR+VNTVME P+Q LLTHLHPPP ++VADSFLT
Sbjct: 61  KPQNIQFLTFPNVIPSELGRADDFPGFFRAVNTVMEAPVQNLLTHLHPPPVVIVADSFLT 120

Query: 142 WAVRLGNRLNIPVASFWPMSATVLSIYYHFDFLKENGHFPADLSECGEEIVDYIPGVSET 201
           WA RLGNR NIPVASFWPMS TVLSIYYHFD L+ENGHFPADLSE G EIVDYIPGVSET
Sbjct: 121 WATRLGNRFNIPVASFWPMSVTVLSIYYHFDLLEENGHFPADLSERGGEIVDYIPGVSET 180

Query: 202 RLADLPTFFSGDGYEVIDLTLEAARSIDKAQFLISTSVYELESSVIDVLKLKFPFPAYTI 261
           RLADLPTFFSGDG +VIDLTLEAARSIDKAQFLISTSVYELESSVIDVLK KFPFP YT+
Sbjct: 181 RLADLPTFFSGDGRKVIDLTLEAARSIDKAQFLISTSVYELESSVIDVLKQKFPFPVYTV 240

Query: 262 GPCTPYFD----APNGC---TDDYLRWLDSQTEGSVLYISQGSFLSVSSAQMDEIVAGVK 321
           GPCTPYF+    APNGC   T++YLRWLDSQ+EGSVLY+SQGSFLSVSSAQMDEIVAGVK
Sbjct: 241 GPCTPYFELGSSAPNGCGGGTNNYLRWLDSQSEGSVLYVSQGSFLSVSSAQMDEIVAGVK 300

Query: 322 ASGVRFLWVARGNDGQLKGVDRETGMVVGWCDQLRVLCHSAIGGFWTHAGWNSTLEGVFA 381
           ASGVRFLWVARG+D +LKGV+ ETGMVV WC+QLRVLCH+A+GGFWTH GWNSTLEGV+A
Sbjct: 301 ASGVRFLWVARGDDDRLKGVNGETGMVVEWCEQLRVLCHNAVGGFWTHGGWNSTLEGVYA 360

Query: 382 GVSMLTWPIFCDQVPNSKKIVEDWKVGVRFKAVGGRDLVRREEIAEFVKRFMNSESVEGR 441
            V MLTWPIFCDQVPNSKKIVEDWK+GVRF+AVGGR+LV + EIAEFVKRFMNSESVEGR
Sbjct: 361 AVPMLTWPIFCDQVPNSKKIVEDWKIGVRFEAVGGRNLVTKVEIAEFVKRFMNSESVEGR 420

Query: 442 KMRNRVSELRGICRRAVAKGGSSHSNIDAFLDHI 469
           +MR  V + R ICRRA A+ GSS +NIDAFL+ I
Sbjct: 421 EMRKNVMKFREICRRAAAEDGSSDANIDAFLNQI 454

BLAST of Cla97C02G045320 vs. TrEMBL
Match: tr|A0A1S3CNH7|A0A1S3CNH7_CUCME (UDP-glycosyltransferase 87A1-like OS=Cucumis melo OX=3656 GN=LOC103502931 PE=4 SV=1)

HSP 1 Score: 787.7 bits (2033), Expect = 1.4e-224
Identity = 376/448 (83.93%), Postives = 407/448 (90.85%), Query Frame = 0

Query: 22  MDSTISETPQL-THLAALPYPGRGHINALMNFCKLLSLKNPNILISFIVSDEWFTFLAAD 81
           MDSTIS+  QL THLAALPYPGRGHINALMNFCKLL LKNPNILISFIVS+EW +FLAAD
Sbjct: 1   MDSTISKASQLITHLAALPYPGRGHINALMNFCKLLFLKNPNILISFIVSEEWLSFLAAD 60

Query: 82  PKPPNLHFATFPNLIPSELGRAKDFPGFFRSVNTVMEPPIQTLLTHLHPPPSIVVADSFL 141
           PKPPNLHF+TFPN+IPSELGRA D+PGFFRSVNT++E PI TLLTHL+PPPSI+VAD FL
Sbjct: 61  PKPPNLHFSTFPNIIPSELGRANDYPGFFRSVNTILESPIHTLLTHLNPPPSIIVADPFL 120

Query: 142 TWAVRLGNRLNIPVASFWPMSATVLSIYYHFDFLKENGHFPADLSECGEEIVDYIPGVSE 201
           +WAV L NRLNIPVASFWPMS TVLS YYHF+ L+ENGHFPA+LSE GEEIVDYIPGVS 
Sbjct: 121 SWAVPLANRLNIPVASFWPMSVTVLSCYYHFNLLEENGHFPANLSERGEEIVDYIPGVSH 180

Query: 202 TRLADLPTFFSGDGYEVIDLTLEAARSIDKAQFLISTSVYELESSVIDVLKLKFPFPAYT 261
           TRLAD PTFFSGDG+E+ DLT EAARSIDKAQFLISTSVYELE SV+DV KLK PFP YT
Sbjct: 181 TRLADFPTFFSGDGHEIRDLTFEAARSIDKAQFLISTSVYELEPSVVDVFKLKCPFPVYT 240

Query: 262 IGPCTPYFDAPNGCTDDYLRWLDSQTEGSVLYISQGSFLSVSSAQMDEIVAGVKASGVRF 321
           IGPCTPYF+ PNGCTD+YL+WLDSQT+ SVLYIS GSFLSVSSAQMDEIVAGV+ASGVRF
Sbjct: 241 IGPCTPYFETPNGCTDEYLQWLDSQTDCSVLYISHGSFLSVSSAQMDEIVAGVEASGVRF 300

Query: 322 LWVARGNDGQLKGVDRETGMVVGWCDQLRVLCHSAIGGFWTHAGWNSTLEGVFAGVSMLT 381
           LWVARGND +LKGVDRE GMVV WCDQL+VLCHSA+GGFWTH GWNST+EGVFAGV MLT
Sbjct: 301 LWVARGNDSRLKGVDREMGMVVRWCDQLKVLCHSAVGGFWTHCGWNSTMEGVFAGVPMLT 360

Query: 382 WPIFCDQVPNSKKIVEDWKVGVRFKAVGGRDLVRREEIAEFVKRFMNSESVEGRKMRNRV 441
           WPIF DQVPN KKIVEDWKVGVR  AVGG DLVRREEIA+ VKRFMNSESVEGRKMRNRV
Sbjct: 361 WPIFLDQVPNRKKIVEDWKVGVRVNAVGGNDLVRREEIAKLVKRFMNSESVEGRKMRNRV 420

Query: 442 SELRGICRRAVAKGGSSHSNIDAFLDHI 469
           SEL+ ICRRAV +GGSSHSN+DAF+  I
Sbjct: 421 SELKDICRRAVVEGGSSHSNMDAFIGRI 448

BLAST of Cla97C02G045320 vs. TrEMBL
Match: tr|A0A0A0KF85|A0A0A0KF85_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G181570 PE=4 SV=1)

HSP 1 Score: 780.8 bits (2015), Expect = 1.7e-222
Identity = 370/447 (82.77%), Postives = 406/447 (90.83%), Query Frame = 0

Query: 22  MDSTISETPQLTHLAALPYPGRGHINALMNFCKLLSLKNPNILISFIVSDEWFTFLAADP 81
           MDSTI +  QLTHLAALPYPGRGHINAL+NFCKLLS KNPNILISFIVSDEW + LAADP
Sbjct: 1   MDSTIPKPSQLTHLAALPYPGRGHINALINFCKLLSRKNPNILISFIVSDEWLSLLAADP 60

Query: 82  KPPNLHFATFPNLIPSELGRAKDFPGFFRSVNTVMEPPIQTLLTHLHPPPSIVVADSFLT 141
           KPPNLHF+TFPN+IPSE GRA DFPGFFRSVNT+ME PI TLLTHL+PPPSI+VADSF++
Sbjct: 61  KPPNLHFSTFPNIIPSEHGRANDFPGFFRSVNTIMESPIHTLLTHLNPPPSIIVADSFVS 120

Query: 142 WAVRLGNRLNIPVASFWPMSATVLSIYYHFDFLKENGHFPADLSECGEEIVDYIPGVSET 201
           WAV L NRLNIPVASFWPMS TVLS+YYHF+ L+ENGHFPADLSE GEEIVDYIPGVS+T
Sbjct: 121 WAVPLANRLNIPVASFWPMSVTVLSMYYHFNLLQENGHFPADLSERGEEIVDYIPGVSDT 180

Query: 202 RLADLPTFFSGDGYEVIDLTLEAARSIDKAQFLISTSVYELESSVIDVLKLKFPFPAYTI 261
           RLADLPTFFSGDG+EV+DLT++AARSIDKAQFLISTSVYELE SVID  KLKFPFP YTI
Sbjct: 181 RLADLPTFFSGDGHEVVDLTVKAARSIDKAQFLISTSVYELEPSVIDAFKLKFPFPVYTI 240

Query: 262 GPCTPYFDAPNGCTDDYLRWLDSQTEGSVLYISQGSFLSVSSAQMDEIVAGVKASGVRFL 321
           GPCTPYF+  N CTD+Y +WLDSQTE SVLYISQGSFLSVSS+QM+EIVAGVKASGVRFL
Sbjct: 241 GPCTPYFETTNSCTDEYFQWLDSQTECSVLYISQGSFLSVSSSQMEEIVAGVKASGVRFL 300

Query: 322 WVARGNDGQLKGVDRETGMVVGWCDQLRVLCHSAIGGFWTHAGWNSTLEGVFAGVSMLTW 381
           WVARGNDG+LK VDRE G+VV WCDQL+VLCHSA+GGFWTH GWNST+EGVFAGV MLTW
Sbjct: 301 WVARGNDGRLKDVDREMGVVVRWCDQLKVLCHSAVGGFWTHCGWNSTMEGVFAGVPMLTW 360

Query: 382 PIFCDQVPNSKKIVEDWKVGVRFKAVGGRDLVRREEIAEFVKRFMNSESVEGRKMRNRVS 441
           PIFCDQVPN KKIVE+WKVGVR +AVGG+DLVRREEIA FVKRFM +ESVEGRKMR R S
Sbjct: 361 PIFCDQVPNRKKIVEEWKVGVRVEAVGGKDLVRREEIANFVKRFMKTESVEGRKMRKRAS 420

Query: 442 ELRGICRRAVAKGGSSHSNIDAFLDHI 469
           EL+ ICR AV +GGSS SN+DAF+  I
Sbjct: 421 ELQDICRGAVEEGGSSSSNMDAFIGRI 447

BLAST of Cla97C02G045320 vs. TrEMBL
Match: tr|A0A1S3CLR3|A0A1S3CLR3_CUCME (UDP-glycosyltransferase 87A1-like OS=Cucumis melo OX=3656 GN=LOC103501885 PE=4 SV=1)

HSP 1 Score: 726.9 bits (1875), Expect = 3.0e-206
Identity = 342/435 (78.62%), Postives = 383/435 (88.05%), Query Frame = 0

Query: 34  HLAALPYPGRGHINALMNFCKLLSLKNPNILISFIVSDEWFTFLAADPKPPNLHFATFPN 93
           HLAALPYPGRGHINALMNFCKLLSLKNPNI ISFIV++EW +FLAADPKPPN+HF T PN
Sbjct: 9   HLAALPYPGRGHINALMNFCKLLSLKNPNISISFIVTEEWLSFLAADPKPPNIHFVTIPN 68

Query: 94  LIPSELGRAKDFPGFFRSVNTVMEPPIQTLLTHLHPPPSIVVADSFLTWAVRLGNRLNIP 153
           +IPSEL RA DFPGF RSV T ME P++TLL  L PPP+ ++AD+F TWAV+LG RL++P
Sbjct: 69  VIPSELHRANDFPGFIRSVQTHMEAPVETLLRRLEPPPTAIIADTFGTWAVQLGKRLDVP 128

Query: 154 VASFWPMSATVLSIYYHFDFLKENGHFPADLSECGEEIVDYIPGVSETRLADLPTFFSGD 213
           VAS WPMSATV SI YHFD LKENGHFPADLSE GEEIVDY PGVS+ RLADLP+FFSG+
Sbjct: 129 VASLWPMSATVFSILYHFDLLKENGHFPADLSERGEEIVDYFPGVSKIRLADLPSFFSGN 188

Query: 214 GYEVIDLTLEAARSIDKAQFLISTSVYELESSVIDVLKLKFPFPAYTIGPCTPYFDAPNG 273
           G + I+  +++ARS+DKAQFLISTSVYELESSV+D LK KFPFP YTIGP TPYF+  + 
Sbjct: 189 GLQSIEFAVKSARSVDKAQFLISTSVYELESSVLDSLKAKFPFPVYTIGPSTPYFELESS 248

Query: 274 CTDDYLRWLDSQTEGSVLYISQGSFLSVSSAQMDEIVAGVKASGVRFLWVARGNDGQLKG 333
            ++DYLRWLDSQT+GSVLY+SQGSFLSVS+AQMDEI+AGVKASGVRFLWVARG+D + K 
Sbjct: 249 VSNDYLRWLDSQTDGSVLYVSQGSFLSVSNAQMDEIIAGVKASGVRFLWVARGDDDRWKD 308

Query: 334 VDRETGMVVGWCDQLRVLCHSAIGGFWTHAGWNSTLEGVFAGVSMLTWPIFCDQVPNSKK 393
           VDRETGMVVGWCDQLRVLCH A+GGFWTH GWNSTLEGVFAGV ML WPIF DQ PNSKK
Sbjct: 309 VDRETGMVVGWCDQLRVLCHRAVGGFWTHGGWNSTLEGVFAGVPMLVWPIFWDQFPNSKK 368

Query: 394 IVEDWKVGVRFKAVGGRDLVRREEIAEFVKRFMNSESVEGRKMRNRVSELRGICRRAVAK 453
           I EDWKVGVRFK  GG+DLVRREEIAEFVK+FMNSESVE ++MR RVSE + ICRRAVAK
Sbjct: 369 IAEDWKVGVRFKGAGGKDLVRREEIAEFVKKFMNSESVESKEMRKRVSEFQEICRRAVAK 428

Query: 454 GGSSHSNIDAFLDHI 469
           GGSS SNIDAFL+HI
Sbjct: 429 GGSSDSNIDAFLNHI 443

BLAST of Cla97C02G045320 vs. TrEMBL
Match: tr|A0A0A0KH18|A0A0A0KH18_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G181560 PE=4 SV=1)

HSP 1 Score: 724.2 bits (1868), Expect = 1.9e-205
Identity = 340/435 (78.16%), Postives = 382/435 (87.82%), Query Frame = 0

Query: 34  HLAALPYPGRGHINALMNFCKLLSLKNPNILISFIVSDEWFTFLAADPKPPNLHFATFPN 93
           HLAALPYPGRGHINAL+NFCK+LSLK+PNI ISFIV+DEW TFLAADPKPPN+HF TFPN
Sbjct: 9   HLAALPYPGRGHINALINFCKILSLKSPNISISFIVTDEWLTFLAADPKPPNIHFVTFPN 68

Query: 94  LIPSELGRAKDFPGFFRSVNTVMEPPIQTLLTHLHPPPSIVVADSFLTWAVRLGNRLNIP 153
           +IPSEL RA DFPGF RS+ T ME P++TLL  LHPPP+ ++AD+F+ WAV+LG RL++P
Sbjct: 69  VIPSELHRANDFPGFVRSIQTHMEAPVETLLRRLHPPPTAIIADTFVYWAVQLGKRLDVP 128

Query: 154 VASFWPMSATVLSIYYHFDFLKENGHFPADLSECGEEIVDYIPGVSETRLADLPTFFSGD 213
           VAS WPMSATV SI YHFD LKENGHFPADLSE GEEIVDY PGVS+ RLADLP+FFSG+
Sbjct: 129 VASLWPMSATVFSILYHFDLLKENGHFPADLSERGEEIVDYFPGVSKIRLADLPSFFSGN 188

Query: 214 GYEVIDLTLEAARSIDKAQFLISTSVYELESSVIDVLKLKFPFPAYTIGPCTPYFDAPNG 273
           G + +  ++++ARS+DKAQFLISTSVYELESSVID LK  FPFP YTIGP TPYF+  + 
Sbjct: 189 GLQTLGFSVKSARSVDKAQFLISTSVYELESSVIDSLKANFPFPVYTIGPSTPYFELESS 248

Query: 274 CTDDYLRWLDSQTEGSVLYISQGSFLSVSSAQMDEIVAGVKASGVRFLWVARGNDGQLKG 333
            ++DYL+WLDSQ EGSVLYISQGSFLSVS+ QMDEIVAGVKASGVRFLWVARG+D + K 
Sbjct: 249 ASNDYLQWLDSQAEGSVLYISQGSFLSVSNTQMDEIVAGVKASGVRFLWVARGDDDRWKD 308

Query: 334 VDRETGMVVGWCDQLRVLCHSAIGGFWTHAGWNSTLEGVFAGVSMLTWPIFCDQVPNSKK 393
           VDRETGMVVGWCDQLRVLCH A+GGFWTH GWNST+EGVFAGV ML WPIF DQ PNSKK
Sbjct: 309 VDRETGMVVGWCDQLRVLCHGAVGGFWTHGGWNSTVEGVFAGVPMLVWPIFWDQFPNSKK 368

Query: 394 IVEDWKVGVRFKAVGGRDLVRREEIAEFVKRFMNSESVEGRKMRNRVSELRGICRRAVAK 453
           I EDW+VGVRFK VGG+DLVRREEIAEFVKRFMNSESVEG++MR RVSE + ICR AVAK
Sbjct: 369 IAEDWQVGVRFKGVGGKDLVRREEIAEFVKRFMNSESVEGKEMRKRVSEFQEICRGAVAK 428

Query: 454 GGSSHSNIDAFLDHI 469
           GGSS SNIDAFL HI
Sbjct: 429 GGSSDSNIDAFLKHI 443

BLAST of Cla97C02G045320 vs. TrEMBL
Match: tr|A0A2P6QKD5|A0A2P6QKD5_ROSCH (Putative indole-3-acetate beta-glucosyltransferase OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr5g0071341 PE=4 SV=1)

HSP 1 Score: 553.9 bits (1426), Expect = 3.4e-154
Identity = 264/440 (60.00%), Postives = 332/440 (75.45%), Query Frame = 0

Query: 34  HLAALPYPGRGHINALMNFCKLLSLKNPNILISFIVSDEWFTFLAADPKPPNLHFATFPN 93
           HL ALPYPGRGHIN +MN CKLL+ K  ++LI+F+V++EW  F+ ADPKP N+ FA  PN
Sbjct: 13  HLVALPYPGRGHINPMMNLCKLLASKKHDLLITFVVTEEWHGFIGADPKPDNIRFAMVPN 72

Query: 94  LIPSELGRAKDFPGFFRSVNTVMEPPIQTLLTHLHPPPSIVVADSFLTWAVRLGNRLNIP 153
           +IPSELGRAKDFPGF  +V T ++ P + LL  L P  S +VAD F+ WAVR+GN+ NI 
Sbjct: 73  VIPSELGRAKDFPGFVEAVCTKLQEPFELLLDRLEPEVSFIVADPFVVWAVRVGNKRNIA 132

Query: 154 VASFWPMSATVLSIYYHFDFLKENGHFPADLSECGEEIVDYIPGVSETRLADLPTFFSGD 213
           VAS WPMSA+V S  +HF+ LKENGHFPAD+S  G+E++DYIPG+S TRLADLPT F  D
Sbjct: 133 VASLWPMSASVFSTLHHFELLKENGHFPADVSVRGDEVIDYIPGISTTRLADLPTVFYDD 192

Query: 214 GYEVIDLTLEAARSIDKAQFLISTSVYELESSVIDVLKLKFPFPAYTIGPCTPYFDAPNG 273
             +V++  LEA  S  KAQ+LISTS YELE    D L+ K   P Y IGP  P+F+   G
Sbjct: 193 VQQVLNRALEAVSSTYKAQYLISTSFYELEPHAFDTLRAKLSSPVYPIGPSIPHFEISKG 252

Query: 274 CTD---DYLRWLDSQTEGSVLYISQGSFLSVSSAQMDEIVAGVKASGVRFLWVARGNDGQ 333
             D   +YL WLDSQ + SV+Y+S GSFLSVS+AQMDEI+AGVK SGVRFLWVARG+  +
Sbjct: 253 AHDNDLNYLHWLDSQPKSSVMYVSLGSFLSVSNAQMDEIIAGVKNSGVRFLWVARGDASK 312

Query: 334 LKGVDRETGMVVGWCDQLRVLCHSAIGGFWTHAGWNSTLEGVFAGVSMLTWPIFCDQVPN 393
           LK    + G+VV WCDQL VLCHS+IGGFW+H GWNST E V+AG+ +LT+PIF DQ+PN
Sbjct: 313 LKDGVGDMGLVVPWCDQLGVLCHSSIGGFWSHCGWNSTQEAVYAGLPVLTFPIFFDQIPN 372

Query: 394 SKKIVEDWKVG--VRFKAVGGRDLVRREEIAEFVKRFMNSESVEGRKMRNRVSELRGICR 453
            K+IVEDWK+G  V+ K VGG D+V REEIAE ++RFM+ ES EG+++R R  +L+  C+
Sbjct: 373 GKQIVEDWKIGYRVKKKKVGGGDIVTREEIAELLQRFMDVESNEGKEVRKRAKQLQETCQ 432

Query: 454 RAVAKGGSSHSNIDAFLDHI 469
            A+AKGGSS SN+DAF+ +I
Sbjct: 433 GAIAKGGSSDSNLDAFIKNI 452

BLAST of Cla97C02G045320 vs. Swiss-Prot
Match: sp|O64733|U87A2_ARATH (UDP-glycosyltransferase 87A2 OS=Arabidopsis thaliana OX=3702 GN=UGT87A2 PE=1 SV=1)

HSP 1 Score: 475.7 bits (1223), Expect = 5.9e-133
Identity = 237/453 (52.32%), Postives = 308/453 (67.99%), Query Frame = 0

Query: 22  MDSTISETPQLTHLAALPYPGRGHINALMNFCKLLSLKNPNILISFIVSDEWFTFLAADP 81
           MD   S   Q  H+ A+PYPGRGHIN +MN CK L  + PN+ ++F+V++EW  F+  DP
Sbjct: 1   MDPNESPPNQFRHVVAMPYPGRGHINPMMNLCKRLVRRYPNLHVTFVVTEEWLGFIGPDP 60

Query: 82  KPPNLHFATFPNLIPSELGRAKDFPGFFRSVNTVMEPPIQTLLTHLH-PPPSIVVADSFL 141
           KP  +HF+T PNLIPSEL RAKDF GF  +V T +E P + LL  L+ PPPS++ AD+++
Sbjct: 61  KPDRIHFSTLPNLIPSELVRAKDFIGFIDAVYTRLEEPFEKLLDSLNSPPPSVIFADTYV 120

Query: 142 TWAVRLGNRLNIPVASFWPMSATVLSIYYHFDFLKENGHFPADLSECGEEIVDYIPGVSE 201
            WAVR+G + NIPV S W MSAT+LS + H D L  +GH   + SE  EE+VDY+PG+S 
Sbjct: 121 IWAVRVGRKRNIPVVSLWTMSATILSFFLHSDLLISHGHALFEPSE--EEVVDYVPGLSP 180

Query: 202 TRLADLPTFFSGDGYEVIDLTLEAARSIDKAQFLISTSVYELESSVIDVLKLKFPFPAYT 261
           T+L DLP  F G    V          +  A+ L+ T+ YELE   ID    K   P Y 
Sbjct: 181 TKLRDLPPIFDGYSDRVFKTAKLCFDELPGARSLLFTTAYELEHKAIDAFTSKLDIPVYA 240

Query: 262 IGPCTPYFDAP---NGCTDDYLRWLDSQTEGSVLYISQGSFLSVSSAQMDEIVAGVKASG 321
           IGP  P+ +     +    +Y++WL+ Q EGSVLYISQGSFLSVS AQM+EIV G++ SG
Sbjct: 241 IGPLIPFEELSVQNDNKEPNYIQWLEEQPEGSVLYISQGSFLSVSEAQMEEIVKGLRESG 300

Query: 322 VRFLWVARGNDGQLK-GVDRETGMVVGWCDQLRVLCHSAIGGFWTHAGWNSTLEGVFAGV 381
           VRFLWVARG + +LK  ++   G+VV WCDQLRVLCH A+GGFWTH G+NSTLEG+++GV
Sbjct: 301 VRFLWVARGGELKLKEALEGSLGVVVSWCDQLRVLCHKAVGGFWTHCGFNSTLEGIYSGV 360

Query: 382 SMLTWPIFCDQVPNSKKIVEDWKVGVRFKAVGGRD-LVRREEIAEFVKRFMNSESVEGRK 441
            ML +P+F DQ+ N+K IVEDW+VG+R +     + L+ REEI E VKRFM+ ES EG++
Sbjct: 361 PMLAFPLFWDQILNAKMIVEDWRVGMRIERTKKNELLIGREEIKEVVKRFMDRESEEGKE 420

Query: 442 MRNRVSELRGICRRAVAKGGSSHSNIDAFLDHI 469
           MR R  +L  I R AVAK GSS+ NID F+ HI
Sbjct: 421 MRRRACDLSEISRGAVAKSGSSNVNIDEFVRHI 451

BLAST of Cla97C02G045320 vs. Swiss-Prot
Match: sp|O64732|U87A1_ARATH (UDP-glycosyltransferase 87A1 OS=Arabidopsis thaliana OX=3702 GN=UGT87A1 PE=2 SV=1)

HSP 1 Score: 467.6 bits (1202), Expect = 1.6e-130
Identity = 228/437 (52.17%), Postives = 309/437 (70.71%), Query Frame = 0

Query: 38  LPYPGRGHINALMNFCKLLSLKNPNILISFIVSDEWFTFLAADPKPPNLHFATFPNLIPS 97
           +P+PGRGHIN ++N CK L  ++PN+ ++F+V++EW  F+ +DPKP  +HFAT PN+IPS
Sbjct: 1   MPWPGRGHINPMLNLCKSLVRRDPNLTVTFVVTEEWLGFIGSDPKPNRIHFATLPNIIPS 60

Query: 98  ELGRAKDFPGFFRSVNTVMEPPIQTLLTHLHPPPSIVVADSFLTWAVRLGNRLNIPVASF 157
           EL RA DF  F  +V T +E P + LL  L+ PP+ ++AD+++ WAVR+G + NIPVASF
Sbjct: 61  ELVRANDFIAFIDAVLTRLEEPFEQLLDRLNSPPTAIIADTYIIWAVRVGTKRNIPVASF 120

Query: 158 WPMSATVLSIYYHFDFLKENGHFPADLSECG-EEIVDYIPGVSETRLADLPTFFSGDGYE 217
           W  SAT+LS++ + D L  +GHFP + SE   +EIVDYIPG+S TRL+DL     G  ++
Sbjct: 121 WTTSATILSLFINSDLLASHGHFPIEPSESKLDEIVDYIPGLSPTRLSDL-QILHGYSHQ 180

Query: 218 VIDLTLEAARSIDKAQFLISTSVYELESSVIDVLKLKFPFPAYTIGPCTPYFDAPNGCTD 277
           V ++  ++   + KA++L+  S YELE   ID    KF FP Y+ GP  P  +   G  +
Sbjct: 181 VFNIFKKSFGELYKAKYLLFPSAYELEPKAIDFFTSKFDFPVYSTGPLIPLEELSVGNEN 240

Query: 278 ---DYLRWLDSQTEGSVLYISQGSFLSVSSAQMDEIVAGVKASGVRFLWVARGNDGQLK- 337
              DY +WLD Q E SVLYISQGSFLSVS AQM+EIV GV+ +GV+F WVARG + +LK 
Sbjct: 241 RELDYFKWLDEQPESSVLYISQGSFLSVSEAQMEEIVVGVREAGVKFFWVARGGELKLKE 300

Query: 338 GVDRETGMVVGWCDQLRVLCHSAIGGFWTHAGWNSTLEGVFAGVSMLTWPIFCDQVPNSK 397
            ++   G+VV WCDQLRVLCH+AIGGFWTH G+NSTLEG+ +GV +LT+P+F DQ  N+K
Sbjct: 301 ALEGSLGVVVSWCDQLRVLCHAAIGGFWTHCGYNSTLEGICSGVPLLTFPVFWDQFLNAK 360

Query: 398 KIVEDWKVGVRFKAVGGRD-LVRREEIAEFVKRFMNSESVEGRKMRNRVSELRGICRRAV 457
            IVE+W+VG+  +     + L+  +EI E VKRFM+ ES EG++MR R  +L  ICR AV
Sbjct: 361 MIVEEWRVGMGIERKKQMELLIVSDEIKELVKRFMDGESEEGKEMRRRTCDLSEICRGAV 420

Query: 458 AKGGSSHSNIDAFLDHI 469
           AKGGSS +NIDAF+  I
Sbjct: 421 AKGGSSDANIDAFIKDI 436

BLAST of Cla97C02G045320 vs. Swiss-Prot
Match: sp|Q9SJL0|U86A1_ARATH (UDP-glycosyltransferase 86A1 OS=Arabidopsis thaliana OX=3702 GN=UGT86A1 PE=2 SV=1)

HSP 1 Score: 224.9 bits (572), Expect = 1.8e-57
Identity = 139/476 (29.20%), Postives = 238/476 (50.00%), Query Frame = 0

Query: 34  HLAALPYPGRGHINALMNFCKLLSLKNPNILISFIVSDEWFTFL--------------AA 93
           H+  +PYP +GH+   ++    + L +    I+F+ +D     +              A 
Sbjct: 10  HIMMIPYPLQGHVIPFVHLA--IKLASHGFTITFVNTDSIHHHISTAHQDDAGDIFSAAR 69

Query: 94  DPKPPNLHFATFPNLIPSELGRAKDFPGFFRSVNTVMEPPIQTLLTHL----HPPPSIVV 153
                ++ + T  +  P +  R+ +   FF  +  V    +  L+  L     PP + ++
Sbjct: 70  SSGQHDIRYTTVSDGFPLDFDRSLNHDQFFEGILHVFSAHVDDLIAKLSRRDDPPVTCLI 129

Query: 154 ADSFLTWAVRLGNRLNIPVASFWPMSATVLSIYYHFDFLKENGHFPADLSECGEEIVDYI 213
           AD+F  W+  + ++ N+   SFW   A VL++YYH D L  NGHF +   +  ++++DY+
Sbjct: 130 ADTFYVWSSMICDKHNLVNVSFWTEPALVLNLYYHMDLLISNGHFKS--LDNRKDVIDYV 189

Query: 214 PGVSETRLADLPTFFSGDGYEVIDLTL------EAARSIDKAQFLISTSVYELESSVIDV 273
           PGV      DL ++      +V   T+      +A + + +A F++  +V ELE   +  
Sbjct: 190 PGVKAIEPKDLMSYLQVSDKDVDTNTVVYRILFKAFKDVKRADFVVCNTVQELEPDSLSA 249

Query: 274 LKLKFPFPAYTIGP-------CTPYFDAPNGCTDDYLRWLDSQTEGSVLYISQGSFLSVS 333
           L+ K   P Y IGP             A + CT+    WL  +  GSVLY+S GS+  V 
Sbjct: 250 LQAK--QPVYAIGPVFSTDSVVPTSLWAESDCTE----WLKGRPTGSVLYVSFGSYAHVG 309

Query: 334 SAQMDEIVAGVKASGVRFLWVARGN-------DGQLKG-VD--RETGMVVGWCDQLRVLC 393
             ++ EI  G+  SG+ F+WV R +       D    G VD  ++ G+VV WC Q+ V+ 
Sbjct: 310 KKEIVEIAHGLLLSGISFIWVLRPDIVGSNVPDFLPAGFVDQAQDRGLVVQWCCQMEVIS 369

Query: 394 HSAIGGFWTHAGWNSTLEGVFAGVSMLTWPIFCDQVPNSKKIVEDWKVGVRFKAVGGRDL 453
           + A+GGF+TH GWNS LE V+ G+ +L +P+  DQ  N K +V+DW +G+    +  +  
Sbjct: 370 NPAVGGFFTHCGWNSILESVWCGLPLLCYPLLTDQFTNRKLVVDDWCIGIN---LCEKKT 429

Query: 454 VRREEIAEFVKRFMNSESVEGRKMRNRVSELRGICRRAVAKGGSSHSNIDAFLDHI 469
           + R++++  VKR MN E+    ++RN V +++   + AV   GSS +N + F+  +
Sbjct: 430 ITRDQVSANVKRLMNGET--SSELRNNVEKVKRHLKDAVTTVGSSETNFNLFVSEV 470

BLAST of Cla97C02G045320 vs. Swiss-Prot
Match: sp|Q9M9E7|U85A4_ARATH (UDP-glycosyltransferase 85A4 OS=Arabidopsis thaliana OX=3702 GN=UGT85A4 PE=2 SV=1)

HSP 1 Score: 215.3 bits (547), Expect = 1.4e-54
Identity = 147/496 (29.64%), Postives = 237/496 (47.78%), Query Frame = 0

Query: 22  MDSTISETPQLTHLAALPYPGRGHINALMNFCKLLSLKNPNILISFIVSDEWFTFLAADP 81
           M+     + Q  H   +PYP +GHIN ++   KLL  +     ++F+ +D     +    
Sbjct: 1   MEQHGGSSSQKPHAMCIPYPAQGHINPMLKLAKLLHAR--GFHVTFVNTDYNHRRILQSR 60

Query: 82  KP------PNLHFATFPNLIP-SELGRAKDFPGFFRSVNTVMEPPIQTLLTHLH-----P 141
            P      P+  F T P+ +P +++   +D      S       P + L+  L+     P
Sbjct: 61  GPHALNGLPSFRFETIPDGLPWTDVDAKQDMLKLIDSTINNCLAPFKDLILRLNSGSDIP 120

Query: 142 PPSIVVADSFLTWAVRLGNRLNIPVASFWPMSATVLSIYYHFDFLKENGHFP----ADLS 201
           P S +++D+ +++ +     L IPV   W  SAT L +Y H+  L E    P    +DL 
Sbjct: 121 PVSCIISDASMSFTIDAAEELKIPVVLLWTNSATALILYLHYQKLIEKEIIPLKDSSDLK 180

Query: 202 ECGEEIVDYIPGVSETRLADLPTFFSGDGYE--VIDLTLEAARSIDKAQFLISTSVYELE 261
           +  E  +D+IP + + +L D P F +    +  +I   L     I +A  +   +  +LE
Sbjct: 181 KHLETEIDWIPSMKKIKLKDFPDFVTTTNPQDPMISFILHVTGRIKRASAIFINTFEKLE 240

Query: 262 SSVIDVLKLKFPFPAYTIGPCTPYFDAPNGCTD-----------------DYLRWLDSQT 321
            +V+  L+   P   Y++G   P+    N   D                 + L WLD++ 
Sbjct: 241 HNVLLSLRSLLP-QIYSVG---PFQILENREIDKNSEIRKLGLNLWEEETESLDWLDTKA 300

Query: 322 EGSVLYISQGSFLSVSSAQMDEIVAGVKASGVRFLWVARGNDGQLKGVD----------- 381
           E +V+Y++ GS   ++S Q+ E   G+  SG  FLWV R   G + G D           
Sbjct: 301 EKAVIYVNFGSLTVLTSEQILEFAWGLARSGKEFLWVVR--SGMVDGDDSILPAEFLSET 360

Query: 382 RETGMVV-GWCDQLRVLCHSAIGGFWTHAGWNSTLEGVFAGVSMLTWPIFCDQVPNSKKI 441
           +  GM++ GWC Q +VL H AIGGF TH GWNSTLE ++AGV M+ WP F DQ+ N K  
Sbjct: 361 KNRGMLIKGWCSQEKVLSHPAIGGFLTHCGWNSTLESLYAGVPMICWPFFADQLTNRKFC 420

Query: 442 VEDWKVGVRFKAVGGRDLVRREEIAEFVKRFMNSESVEGRKMRNRVSELRGICRRAVAKG 470
            EDW +G+        + V+RE +   VK  M+ E  +G+++R +V E R +   A A  
Sbjct: 421 CEDWGIGMEI-----GEEVKRERVETVVKELMDGE--KGKRLREKVVEWRRLAEEASAPP 480

BLAST of Cla97C02G045320 vs. Swiss-Prot
Match: sp|F8WLS6|UGT6_CATRO (7-deoxyloganetin glucosyltransferase OS=Catharanthus roseus OX=4058 GN=UGT85A23 PE=1 SV=1)

HSP 1 Score: 214.5 bits (545), Expect = 2.5e-54
Identity = 149/484 (30.79%), Postives = 231/484 (47.73%), Query Frame = 0

Query: 27  SETPQLTHLAALPYPGRGHINALMNFCKLLSLKNPNI-LISFIVSDEWFTFLAADPKPPN 86
           S+  +  H   +PYP +GHIN ++   KLL  K  +I  ++   + +             
Sbjct: 7   SDYSKKPHAVCIPYPAQGHINPMLKLAKLLHYKGFHITFVNTEFNHKRLLKSRGSDSLKG 66

Query: 87  LH---FATFPN-LIPSELGRAKDFPGFFRSVNTVMEPPIQTLLTHLH-------PPPSIV 146
           LH   F T P+ L PS++   +D P    S  T    P + LL  L+       PP S V
Sbjct: 67  LHSFQFKTIPDGLPPSDVDATQDIPSLCESTTTHCLVPFKQLLQKLNDTSSSEVPPVSCV 126

Query: 147 VADSFLTWAVRLGNRLNIPVASFWPMSATVLSIYYHFDFLKENGHFP---ADLSECG--E 206
           V+D+ +++ +     L+IP   FW  SA  +  Y H+  L + G  P   A     G  +
Sbjct: 127 VSDAVMSFTISAAQELDIPEVLFWTPSACGVLGYMHYAQLIDKGLTPLKDASYFSNGFLD 186

Query: 207 EIVDYIPGVSETRLADLPTFF---SGDGYEVIDLTLEAARSIDKAQFLISTSVYELESSV 266
           +++D+IPG+   RL DLPTF    + D Y +  +  E  RS  KA  ++  +  ELES V
Sbjct: 187 QVLDWIPGMEGIRLRDLPTFLRTTNPDEYMIKFILQETERS-KKASAIVLNTFQELESEV 246

Query: 267 IDVLKLKFPFPAYTIGPCTPYFDAPNGCTDDYLR---------------WLDSQTEGSVL 326
           ID L    P P Y IGP        N   D+ L+               WLD++   SV+
Sbjct: 247 IDSLSTLLP-PIYPIGPLQ---ILQNQVDDESLKVLGSNLWKEEPECLEWLDTKDPNSVV 306

Query: 327 YISQGSFLSVSSAQMDEIVAGVKASGVRFLWVAR----GNDGQLKGVD-----RETGMVV 386
           Y++ GS   +++ Q+ E   G+  S   FLW+ R      +  + G +     +E G++ 
Sbjct: 307 YVNFGSITVMTNDQLIEFAWGLANSKQNFLWIIRPDLISGESSILGEEFVEETKERGLIA 366

Query: 387 GWCDQLRVLCHSAIGGFWTHAGWNSTLEGVFAGVSMLTWPIFCDQVPNSKKIVEDWKVGV 446
            WC Q +V+ H AIGGF TH GWNST+E + +GV M+ WP F +Q  N +     W +G+
Sbjct: 367 SWCHQEQVINHPAIGGFLTHNGWNSTIESISSGVPMICWPFFAEQQTNCRFCCNKWGIGM 426

Query: 447 RFKAVGGRDLVRREEIAEFVKRFMNSESVEGRKMRNRVSELRGICRRAVAK-GGSSHSNI 466
              +      V+R+E+   VK  M  E  +G++M+ +  E + I      K  GSS+SN+
Sbjct: 427 EINSD-----VKRDEVESLVKELMVGE--KGKEMKKKALEWKNIAEVTTTKPDGSSYSNL 478

BLAST of Cla97C02G045320 vs. TAIR10
Match: AT2G30140.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 475.7 bits (1223), Expect = 3.3e-134
Identity = 237/453 (52.32%), Postives = 308/453 (67.99%), Query Frame = 0

Query: 22  MDSTISETPQLTHLAALPYPGRGHINALMNFCKLLSLKNPNILISFIVSDEWFTFLAADP 81
           MD   S   Q  H+ A+PYPGRGHIN +MN CK L  + PN+ ++F+V++EW  F+  DP
Sbjct: 1   MDPNESPPNQFRHVVAMPYPGRGHINPMMNLCKRLVRRYPNLHVTFVVTEEWLGFIGPDP 60

Query: 82  KPPNLHFATFPNLIPSELGRAKDFPGFFRSVNTVMEPPIQTLLTHLH-PPPSIVVADSFL 141
           KP  +HF+T PNLIPSEL RAKDF GF  +V T +E P + LL  L+ PPPS++ AD+++
Sbjct: 61  KPDRIHFSTLPNLIPSELVRAKDFIGFIDAVYTRLEEPFEKLLDSLNSPPPSVIFADTYV 120

Query: 142 TWAVRLGNRLNIPVASFWPMSATVLSIYYHFDFLKENGHFPADLSECGEEIVDYIPGVSE 201
            WAVR+G + NIPV S W MSAT+LS + H D L  +GH   + SE  EE+VDY+PG+S 
Sbjct: 121 IWAVRVGRKRNIPVVSLWTMSATILSFFLHSDLLISHGHALFEPSE--EEVVDYVPGLSP 180

Query: 202 TRLADLPTFFSGDGYEVIDLTLEAARSIDKAQFLISTSVYELESSVIDVLKLKFPFPAYT 261
           T+L DLP  F G    V          +  A+ L+ T+ YELE   ID    K   P Y 
Sbjct: 181 TKLRDLPPIFDGYSDRVFKTAKLCFDELPGARSLLFTTAYELEHKAIDAFTSKLDIPVYA 240

Query: 262 IGPCTPYFDAP---NGCTDDYLRWLDSQTEGSVLYISQGSFLSVSSAQMDEIVAGVKASG 321
           IGP  P+ +     +    +Y++WL+ Q EGSVLYISQGSFLSVS AQM+EIV G++ SG
Sbjct: 241 IGPLIPFEELSVQNDNKEPNYIQWLEEQPEGSVLYISQGSFLSVSEAQMEEIVKGLRESG 300

Query: 322 VRFLWVARGNDGQLK-GVDRETGMVVGWCDQLRVLCHSAIGGFWTHAGWNSTLEGVFAGV 381
           VRFLWVARG + +LK  ++   G+VV WCDQLRVLCH A+GGFWTH G+NSTLEG+++GV
Sbjct: 301 VRFLWVARGGELKLKEALEGSLGVVVSWCDQLRVLCHKAVGGFWTHCGFNSTLEGIYSGV 360

Query: 382 SMLTWPIFCDQVPNSKKIVEDWKVGVRFKAVGGRD-LVRREEIAEFVKRFMNSESVEGRK 441
            ML +P+F DQ+ N+K IVEDW+VG+R +     + L+ REEI E VKRFM+ ES EG++
Sbjct: 361 PMLAFPLFWDQILNAKMIVEDWRVGMRIERTKKNELLIGREEIKEVVKRFMDRESEEGKE 420

Query: 442 MRNRVSELRGICRRAVAKGGSSHSNIDAFLDHI 469
           MR R  +L  I R AVAK GSS+ NID F+ HI
Sbjct: 421 MRRRACDLSEISRGAVAKSGSSNVNIDEFVRHI 451

BLAST of Cla97C02G045320 vs. TAIR10
Match: AT2G30150.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 467.6 bits (1202), Expect = 8.9e-132
Identity = 228/437 (52.17%), Postives = 309/437 (70.71%), Query Frame = 0

Query: 38  LPYPGRGHINALMNFCKLLSLKNPNILISFIVSDEWFTFLAADPKPPNLHFATFPNLIPS 97
           +P+PGRGHIN ++N CK L  ++PN+ ++F+V++EW  F+ +DPKP  +HFAT PN+IPS
Sbjct: 1   MPWPGRGHINPMLNLCKSLVRRDPNLTVTFVVTEEWLGFIGSDPKPNRIHFATLPNIIPS 60

Query: 98  ELGRAKDFPGFFRSVNTVMEPPIQTLLTHLHPPPSIVVADSFLTWAVRLGNRLNIPVASF 157
           EL RA DF  F  +V T +E P + LL  L+ PP+ ++AD+++ WAVR+G + NIPVASF
Sbjct: 61  ELVRANDFIAFIDAVLTRLEEPFEQLLDRLNSPPTAIIADTYIIWAVRVGTKRNIPVASF 120

Query: 158 WPMSATVLSIYYHFDFLKENGHFPADLSECG-EEIVDYIPGVSETRLADLPTFFSGDGYE 217
           W  SAT+LS++ + D L  +GHFP + SE   +EIVDYIPG+S TRL+DL     G  ++
Sbjct: 121 WTTSATILSLFINSDLLASHGHFPIEPSESKLDEIVDYIPGLSPTRLSDL-QILHGYSHQ 180

Query: 218 VIDLTLEAARSIDKAQFLISTSVYELESSVIDVLKLKFPFPAYTIGPCTPYFDAPNGCTD 277
           V ++  ++   + KA++L+  S YELE   ID    KF FP Y+ GP  P  +   G  +
Sbjct: 181 VFNIFKKSFGELYKAKYLLFPSAYELEPKAIDFFTSKFDFPVYSTGPLIPLEELSVGNEN 240

Query: 278 ---DYLRWLDSQTEGSVLYISQGSFLSVSSAQMDEIVAGVKASGVRFLWVARGNDGQLK- 337
              DY +WLD Q E SVLYISQGSFLSVS AQM+EIV GV+ +GV+F WVARG + +LK 
Sbjct: 241 RELDYFKWLDEQPESSVLYISQGSFLSVSEAQMEEIVVGVREAGVKFFWVARGGELKLKE 300

Query: 338 GVDRETGMVVGWCDQLRVLCHSAIGGFWTHAGWNSTLEGVFAGVSMLTWPIFCDQVPNSK 397
            ++   G+VV WCDQLRVLCH+AIGGFWTH G+NSTLEG+ +GV +LT+P+F DQ  N+K
Sbjct: 301 ALEGSLGVVVSWCDQLRVLCHAAIGGFWTHCGYNSTLEGICSGVPLLTFPVFWDQFLNAK 360

Query: 398 KIVEDWKVGVRFKAVGGRD-LVRREEIAEFVKRFMNSESVEGRKMRNRVSELRGICRRAV 457
            IVE+W+VG+  +     + L+  +EI E VKRFM+ ES EG++MR R  +L  ICR AV
Sbjct: 361 MIVEEWRVGMGIERKKQMELLIVSDEIKELVKRFMDGESEEGKEMRRRTCDLSEICRGAV 420

Query: 458 AKGGSSHSNIDAFLDHI 469
           AKGGSS +NIDAF+  I
Sbjct: 421 AKGGSSDANIDAFIKDI 436

BLAST of Cla97C02G045320 vs. TAIR10
Match: AT2G36970.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 224.9 bits (572), Expect = 1.0e-58
Identity = 139/476 (29.20%), Postives = 238/476 (50.00%), Query Frame = 0

Query: 34  HLAALPYPGRGHINALMNFCKLLSLKNPNILISFIVSDEWFTFL--------------AA 93
           H+  +PYP +GH+   ++    + L +    I+F+ +D     +              A 
Sbjct: 10  HIMMIPYPLQGHVIPFVHLA--IKLASHGFTITFVNTDSIHHHISTAHQDDAGDIFSAAR 69

Query: 94  DPKPPNLHFATFPNLIPSELGRAKDFPGFFRSVNTVMEPPIQTLLTHL----HPPPSIVV 153
                ++ + T  +  P +  R+ +   FF  +  V    +  L+  L     PP + ++
Sbjct: 70  SSGQHDIRYTTVSDGFPLDFDRSLNHDQFFEGILHVFSAHVDDLIAKLSRRDDPPVTCLI 129

Query: 154 ADSFLTWAVRLGNRLNIPVASFWPMSATVLSIYYHFDFLKENGHFPADLSECGEEIVDYI 213
           AD+F  W+  + ++ N+   SFW   A VL++YYH D L  NGHF +   +  ++++DY+
Sbjct: 130 ADTFYVWSSMICDKHNLVNVSFWTEPALVLNLYYHMDLLISNGHFKS--LDNRKDVIDYV 189

Query: 214 PGVSETRLADLPTFFSGDGYEVIDLTL------EAARSIDKAQFLISTSVYELESSVIDV 273
           PGV      DL ++      +V   T+      +A + + +A F++  +V ELE   +  
Sbjct: 190 PGVKAIEPKDLMSYLQVSDKDVDTNTVVYRILFKAFKDVKRADFVVCNTVQELEPDSLSA 249

Query: 274 LKLKFPFPAYTIGP-------CTPYFDAPNGCTDDYLRWLDSQTEGSVLYISQGSFLSVS 333
           L+ K   P Y IGP             A + CT+    WL  +  GSVLY+S GS+  V 
Sbjct: 250 LQAK--QPVYAIGPVFSTDSVVPTSLWAESDCTE----WLKGRPTGSVLYVSFGSYAHVG 309

Query: 334 SAQMDEIVAGVKASGVRFLWVARGN-------DGQLKG-VD--RETGMVVGWCDQLRVLC 393
             ++ EI  G+  SG+ F+WV R +       D    G VD  ++ G+VV WC Q+ V+ 
Sbjct: 310 KKEIVEIAHGLLLSGISFIWVLRPDIVGSNVPDFLPAGFVDQAQDRGLVVQWCCQMEVIS 369

Query: 394 HSAIGGFWTHAGWNSTLEGVFAGVSMLTWPIFCDQVPNSKKIVEDWKVGVRFKAVGGRDL 453
           + A+GGF+TH GWNS LE V+ G+ +L +P+  DQ  N K +V+DW +G+    +  +  
Sbjct: 370 NPAVGGFFTHCGWNSILESVWCGLPLLCYPLLTDQFTNRKLVVDDWCIGIN---LCEKKT 429

Query: 454 VRREEIAEFVKRFMNSESVEGRKMRNRVSELRGICRRAVAKGGSSHSNIDAFLDHI 469
           + R++++  VKR MN E+    ++RN V +++   + AV   GSS +N + F+  +
Sbjct: 430 ITRDQVSANVKRLMNGET--SSELRNNVEKVKRHLKDAVTTVGSSETNFNLFVSEV 470

BLAST of Cla97C02G045320 vs. TAIR10
Match: AT1G78270.1 (UDP-glucosyl transferase 85A4)

HSP 1 Score: 215.3 bits (547), Expect = 8.0e-56
Identity = 147/496 (29.64%), Postives = 237/496 (47.78%), Query Frame = 0

Query: 22  MDSTISETPQLTHLAALPYPGRGHINALMNFCKLLSLKNPNILISFIVSDEWFTFLAADP 81
           M+     + Q  H   +PYP +GHIN ++   KLL  +     ++F+ +D     +    
Sbjct: 1   MEQHGGSSSQKPHAMCIPYPAQGHINPMLKLAKLLHAR--GFHVTFVNTDYNHRRILQSR 60

Query: 82  KP------PNLHFATFPNLIP-SELGRAKDFPGFFRSVNTVMEPPIQTLLTHLH-----P 141
            P      P+  F T P+ +P +++   +D      S       P + L+  L+     P
Sbjct: 61  GPHALNGLPSFRFETIPDGLPWTDVDAKQDMLKLIDSTINNCLAPFKDLILRLNSGSDIP 120

Query: 142 PPSIVVADSFLTWAVRLGNRLNIPVASFWPMSATVLSIYYHFDFLKENGHFP----ADLS 201
           P S +++D+ +++ +     L IPV   W  SAT L +Y H+  L E    P    +DL 
Sbjct: 121 PVSCIISDASMSFTIDAAEELKIPVVLLWTNSATALILYLHYQKLIEKEIIPLKDSSDLK 180

Query: 202 ECGEEIVDYIPGVSETRLADLPTFFSGDGYE--VIDLTLEAARSIDKAQFLISTSVYELE 261
           +  E  +D+IP + + +L D P F +    +  +I   L     I +A  +   +  +LE
Sbjct: 181 KHLETEIDWIPSMKKIKLKDFPDFVTTTNPQDPMISFILHVTGRIKRASAIFINTFEKLE 240

Query: 262 SSVIDVLKLKFPFPAYTIGPCTPYFDAPNGCTD-----------------DYLRWLDSQT 321
            +V+  L+   P   Y++G   P+    N   D                 + L WLD++ 
Sbjct: 241 HNVLLSLRSLLP-QIYSVG---PFQILENREIDKNSEIRKLGLNLWEEETESLDWLDTKA 300

Query: 322 EGSVLYISQGSFLSVSSAQMDEIVAGVKASGVRFLWVARGNDGQLKGVD----------- 381
           E +V+Y++ GS   ++S Q+ E   G+  SG  FLWV R   G + G D           
Sbjct: 301 EKAVIYVNFGSLTVLTSEQILEFAWGLARSGKEFLWVVR--SGMVDGDDSILPAEFLSET 360

Query: 382 RETGMVV-GWCDQLRVLCHSAIGGFWTHAGWNSTLEGVFAGVSMLTWPIFCDQVPNSKKI 441
           +  GM++ GWC Q +VL H AIGGF TH GWNSTLE ++AGV M+ WP F DQ+ N K  
Sbjct: 361 KNRGMLIKGWCSQEKVLSHPAIGGFLTHCGWNSTLESLYAGVPMICWPFFADQLTNRKFC 420

Query: 442 VEDWKVGVRFKAVGGRDLVRREEIAEFVKRFMNSESVEGRKMRNRVSELRGICRRAVAKG 470
            EDW +G+        + V+RE +   VK  M+ E  +G+++R +V E R +   A A  
Sbjct: 421 CEDWGIGMEI-----GEEVKRERVETVVKELMDGE--KGKRLREKVVEWRRLAEEASAPP 480

BLAST of Cla97C02G045320 vs. TAIR10
Match: AT2G28080.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 212.2 bits (539), Expect = 6.7e-55
Identity = 136/471 (28.87%), Postives = 225/471 (47.77%), Query Frame = 0

Query: 34  HLAALPYPGRGHINALMNFCKLLSLKNPNILISFI-----------VSDEWFTFLAADPK 93
           H   +PYP +GH+N  ++    + L +  I ++F+            SD           
Sbjct: 18  HALLIPYPFQGHVNPFVHLA--IKLASQGITVTFVNTHYIHHQITNGSDGDIFAGVRSES 77

Query: 94  PPNLHFATFPNLIPSELGRAKDFPGFFRSVNTVMEPPIQTLLTHL---HPPPSIVVADSF 153
             ++ +AT  + +P    R+ +   +  S+  V    ++ L+  L       ++++AD+F
Sbjct: 78  GLDIRYATVSDGLPVGFDRSLNHDTYQSSLLHVFYAHVEELVASLVGGDGGVNVMIADTF 137

Query: 154 LTWAVRLGNRLNIPVASFWPMSATVLSIYYHFDFLKENGHFPADLSECGEEIVDYIPGVS 213
             W   +  +  +   SFW  +A V S+YYH D L+ +GHF A   E   +++DYIPGV+
Sbjct: 138 FVWPSVVARKFGLVCVSFWTEAALVFSLYYHMDLLRIHGHFGA--QETRSDLIDYIPGVA 197

Query: 214 ETRLADLPTFF--SGDGYEVIDLTLEAARSIDKAQFLISTSVYELESSVIDVLKLKFPFP 273
                D  ++   +     V  +  +A   + K  F++  ++ + E   I  L  K PF 
Sbjct: 198 AINPKDTASYLQETDTSSVVHQIIFKAFEDVKKVDFVLCNTIQQFEDKTIKALNTKIPF- 257

Query: 274 AYTIGPCTPYFDAPNGCT------DDYLRWLDSQTEGSVLYISQGSFLSVSSAQMDEIVA 333
            Y IGP  P+ +     T       D  +WL+++ + SVLYIS GS+  V+   + EI  
Sbjct: 258 -YAIGPIIPFNNQTGSVTTSLWSESDCTQWLNTKPKSSVLYISFGSYAHVTKKDLVEIAH 317

Query: 334 GVKASGVRFLWVARGN-------DGQLKGVDRET---GMVVGWCDQLRVLCHSAIGGFWT 393
           G+  S V F+WV R +       +   +G + E    G+V+ WC Q+ VL H ++GGF T
Sbjct: 318 GILLSKVNFVWVVRPDIVSSDETNPLPEGFETEAGDRGIVIPWCCQMTVLSHESVGGFLT 377

Query: 394 HAGWNSTLEGVFAGVSMLTWPIFCDQVPNSKKIVEDWKVGVRF---KAVGGRDLVRREEI 453
           H GWNS LE ++  V +L +P+  DQV N K +V+DW++G+     K+  GRD     E+
Sbjct: 378 HCGWNSILETIWCEVPVLCFPLLTDQVTNRKLVVDDWEIGINLCEDKSDFGRD-----EV 437

Query: 454 AEFVKRFMNSESVEGRKMRNRVSELRGICRRAVAKGGSSHSNIDAFLDHII 470
              + R M   S E  K+      L G  R +   G SS  N+  F+D ++
Sbjct: 438 GRNINRLMCGVSKE--KIGRVKMSLEGAVRNS---GSSSEMNLGLFIDGLL 472

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008465275.12.1e-22483.93PREDICTED: UDP-glycosyltransferase 87A1-like [Cucumis melo][more]
XP_004143221.22.6e-22282.77PREDICTED: UDP-glycosyltransferase 87A1-like [Cucumis sativus] >KGN47052.1 hypot... [more]
XP_022944194.12.3e-21882.38UDP-glycosyltransferase 87A1-like [Cucurbita moschata][more]
XP_023512982.12.5e-21781.76UDP-glycosyltransferase 87A1-like [Cucurbita pepo subsp. pepo][more]
XP_022986868.13.3e-21781.28UDP-glycosyltransferase 87A1-like [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
tr|A0A1S3CNH7|A0A1S3CNH7_CUCME1.4e-22483.93UDP-glycosyltransferase 87A1-like OS=Cucumis melo OX=3656 GN=LOC103502931 PE=4 S... [more]
tr|A0A0A0KF85|A0A0A0KF85_CUCSA1.7e-22282.77Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G181570 PE=4 SV=1[more]
tr|A0A1S3CLR3|A0A1S3CLR3_CUCME3.0e-20678.62UDP-glycosyltransferase 87A1-like OS=Cucumis melo OX=3656 GN=LOC103501885 PE=4 S... [more]
tr|A0A0A0KH18|A0A0A0KH18_CUCSA1.9e-20578.16Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G181560 PE=4 SV=1[more]
tr|A0A2P6QKD5|A0A2P6QKD5_ROSCH3.4e-15460.00Putative indole-3-acetate beta-glucosyltransferase OS=Rosa chinensis OX=74649 GN... [more]
Match NameE-valueIdentityDescription
sp|O64733|U87A2_ARATH5.9e-13352.32UDP-glycosyltransferase 87A2 OS=Arabidopsis thaliana OX=3702 GN=UGT87A2 PE=1 SV=... [more]
sp|O64732|U87A1_ARATH1.6e-13052.17UDP-glycosyltransferase 87A1 OS=Arabidopsis thaliana OX=3702 GN=UGT87A1 PE=2 SV=... [more]
sp|Q9SJL0|U86A1_ARATH1.8e-5729.20UDP-glycosyltransferase 86A1 OS=Arabidopsis thaliana OX=3702 GN=UGT86A1 PE=2 SV=... [more]
sp|Q9M9E7|U85A4_ARATH1.4e-5429.64UDP-glycosyltransferase 85A4 OS=Arabidopsis thaliana OX=3702 GN=UGT85A4 PE=2 SV=... [more]
sp|F8WLS6|UGT6_CATRO2.5e-5430.797-deoxyloganetin glucosyltransferase OS=Catharanthus roseus OX=4058 GN=UGT85A23 ... [more]
Match NameE-valueIdentityDescription
AT2G30140.13.3e-13452.32UDP-Glycosyltransferase superfamily protein[more]
AT2G30150.18.9e-13252.17UDP-Glycosyltransferase superfamily protein[more]
AT2G36970.11.0e-5829.20UDP-Glycosyltransferase superfamily protein[more]
AT1G78270.18.0e-5629.64UDP-glucosyl transferase 85A4[more]
AT2G28080.16.7e-5528.87UDP-Glycosyltransferase superfamily protein[more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0016758transferase activity, transferring hexosyl groups
Vocabulary: Biological Process
TermDefinition
GO:0008152metabolic process
Vocabulary: INTERPRO
TermDefinition
IPR002213UDP_glucos_trans
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016740 transferase activity
molecular_function GO:0016758 transferase activity, transferring hexosyl groups

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C02G045320.1Cla97C02G045320.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3DG3DSA:3.40.50.2000coord: 34..276
e-value: 8.0E-121
score: 406.3
coord: 450..460
e-value: 8.0E-121
score: 406.3
NoneNo IPR availableGENE3DG3DSA:3.40.50.2000coord: 277..449
e-value: 8.0E-121
score: 406.3
NoneNo IPR availablePANTHERPTHR11926:SF176UDP-GLYCOSYLTRANSFERASE 87A1-RELATEDcoord: 32..468
NoneNo IPR availablePANTHERPTHR11926GLUCOSYL/GLUCURONOSYL TRANSFERASEScoord: 32..468
NoneNo IPR availableSUPERFAMILYSSF53756UDP-Glycosyltransferase/glycogen phosphorylasecoord: 34..469
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePFAMPF00201UDPGTcoord: 281..412
e-value: 6.0E-18
score: 64.8

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
Cla97C02G045320CmaCh13G000390Cucurbita maxima (Rimu)cmawmbB220
The following gene(s) are paralogous to this gene:

None