Clc10G12200 (gene) Watermelon (cordophanus) v2

Overview
NameClc10G12200
Typegene
OrganismCitrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
DescriptionGlycosyltransferase
LocationClcChr10: 25699001 .. 25702413 (+)
RNA-Seq ExpressionClc10G12200
SyntenyClc10G12200
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATCAGAACACCTGCAAAATTCCCATAATTTAGAACACATTATTATTCTCCACAATCAAAAGTGCTTCTAACAAACAAATACCTATATTTATTAAATTAATATGGGAGAATAAGTTCAGATTCATTTCAAAGTTGTCCTTATAATTTATGAATTATGAGGAATGGAACTACTCTCTCAGCGATTAGCCCCTAGTGTTTCTGCCCTCTAATAATTATTTCTCATTCTCACTTCTCTCTTCATGGAATCCCCAACTCATGTTGCTCTTCTTTCCAGCCCCGGAATGGGCCACCTCTTCCCCTCACTCGAGCTTGCCACGCGCCTCTCCACGCGCCACCACCTCACCGTCACCGTTTTCCTTGCCCCCTCCCACTCCTCCTCCGCTGAAAACAAAGTCATCGCTACTGCCGAAGCCGTCGGTCTCTTCACCGTCGTCAAACTCCCTCTGGTCGACATGTCGGACGTCACTGACTCCTCTGTCGTTGGCCGCCTCGCCACCACCATGCGCCGCCATGTCCCGGCTTTCCGCTCCGCTGTCTCCACCCTCACCTCTCCTCCCTCCGTCCTCATCGCCGACATCTTCGCTACCGAATCCTTTGCCGTTGCCGACGAGTTCCACATGGCAAAGTACGTCTTTGTTGCCTCTAATGCATGGTTCTTAGCCTTAATCGTTTACGCCCAGGTTTGGGAAACGCAAATCGTTGGGCAGTACGTGGACCAGAAAGAACCACTCCAAATACCGGGATGCGAACCGGTTCGCCCATGCGACGTTATTGACCCGCTTCTGGACCGGACCCATCCACAGTATTTGGAAATGGTCAAAGTGGGGATGGGAATAGCGTCGAGCGACGGCGTTTTGGTTAACACATGGGACGACTTGCAAGGTCGCACCCTCGCATCTTTCAGTGATCAGAATTTGTTGGGTGGAATTATGAAGCCGCCGGTTTACTCTATCGGACCGATTGTGCGGCAGTCCGGTTCAAGCGAGTTGTTCAATTGGTTGAGTAAGCAACCCAGGGAGTCGGTTATTTATGTGTCGTTTGGGAGCGGTGGAACGCTGTCGTTTAAGCAAATGATAGAAGTGGCTCACGGTTTGGAGATGAGTCAGGCGAGATTTGTTTGGGTAGTACGAGCCCCAAAGGTAAGATCAGACGGTGCGTATTTCACGACGGGGGATGGGAGTGAGGAGCAATCGGAGGCAAAGTTTTTGCCAGAGGGATTTTTGGAGCGTACGAGCGAGGTGGGATTTGTTGTACCGATGTGGGCGGACCAAACGGCGGTGTTGGGGAGTCCGGCGGTGGGGGGTTTTTTCACGCATGGTGGATGGAACTCGGCGTTGGAAGGGATTACGAATGGAGTTCCGATGGTTGTGTGGCCGTTGTACGCAGAGCAGCGGCTGAACGCAACGTTTCTGGCAGAGGAGGTGGGAGTGGCGGTCCGGCCAAAGGAGCTGCCAACGAAGGCAGTGATCGGAAGGGAGGAGATCGCGGCGATGGTGAGGAAGATAATGGCGGAGGAGGATGAAGAAGGAAAAGGCATTAGAGCAAAGGCTAAGGAACTTCAACGGAGTGCAGAGAAGGCATCGGCTGAAGGTGGGTCATCTTACGACAACTTTGCTCGAGTTGTGAAATTATTTGGTCGTAAAGGATAAAGTGCCCATGAATTTCACTGTCTTCAATTCAGTGTTTTTTCTCTCGTTCGGTTCAATAACGTGGTGGGTACAAATTTCATCACACAAATACATACTACTTTTTTTTTCCTTTTTTAATTATTATTATTAGTCAGTGTTTAGTAAATTTTAACTATCAACTTAAAAGAATTATGAAGAATGAAACTTACTTGGTAGGGAGAGGATACTGAGGGAACAAGCTTGCAAGAATATCAACCAATTCTCCACGATGAAGCATACTCTCAACACAAATATATCTTCCTGAAGCGGAAGGAGTTTCATAAACCAAAACATGGGCTTTTGCAACATCCTTGACATCAACATAACCTTGAACTGCATTTACATATGTTTTTGCTGATCCTGTTAAATATTTCATTATATGAACCACACTTGCATTCACACTTTCTTGCAACAAAGGCCCCAATACCAACATTGGATTTACGACCACAAGATCCACCCCTTTTTCTTTTGCCACTTCCCATGCTGCTTGTTCTGCCACTGTCTTTGCATAACAATACCAATTCTGTGCAGAAATTTAATCCCAACACTTCAAGATAGTACTTTTAGAATTAACAATTAAGTGTATAACAACATTTTTAAAAAATTGCAAACTTTGAGAGCTTGATTTAAATTTTGCTATATATCTACAAATTCTTTTGAATTATACTATATATGCTAATACTTTGGATCTGATTGTTATATTTACAATTACCTACTTATCTAAAGTAGCTAGTATCTCTTGTAAATTAATTATAGGGTTATTATAAGTCAATTACCTTGGTGTTCTTGCAAAACTCAAGGTCACTCCAACAAGACTCATCCACCACCACGTCGGGGCTTCGGTTAGGGTTCATGTAGACAGAGCCGATCGACGACGTGAACACCACTCGTCGAACATTTGCTTCAGCAGCCGCGGTCATGACATTCTTTGTTCCGATTATGGCCTGTTCCACCTTTTCCTGTAAAATTGTAGTGAGTTATTGTTATTGTTATTTTCCATATAAAATAAAATATAAAGTTCATAATATTAAAAAATCTAGAAAAGTAGAGAGAATGATGAAGTGGAGTTCTAACGGGATCATCGGTCACCGGAGACGCGGTGTGGAAAACACCATGGCAACCCATAATAGCTGCTTTGAGACTGTCAAAATCAAGAAGATCAGCACTAAACAAAGACAACCGCTCTTTAGCTCCTTCTAAGTTGTTCAAATGAGCATTCTTTTGGTCATCTGGGTTTCTAACGGTTCCTCTGACAGTGTAGCCCTTCTCAAGAAGAAGCTTTACAAGCCATGAAGCAATGAAGCCTCCGGCGCCGGTGACACAAACAATTTGGCCGGAAACTGCAGAGGTATCGATCGGCATAGTAGGGGATAAGGAAGAAGGATTTGTTTTGTTTGGAATAGAAAGAGAAGGAGAGAAAGTTGAAAGAAGAGGATGTAGGTTTAGATGGATGCCCACAAAAAGGAGTGGAACATTGGGTGCATTTAAAGGGGAGGAAATCGATCTACCTCAGCTTAAAAAATGAGCACATGTTAATATAAAACCGTTATAAACTTTGGCCTCCAATTATTGACTATTTATTTGGTTGGGTCAAAAGTGAGACCTCAACAACCACCATGGCTTAACCAAAGTTGCTTCATTTTCTGTATCTCTCAATATCACCAAACTTCCACGAATATTCTCCCTCTCAAAGTAGTTTTGTATAATTATATCCATATGAAAACATTACTGAATGTGGCCTTTCCGC

mRNA sequence

ATCAGAACACCTGCAAAATTCCCATAATTTAGAACACATTATTATTCTCCACAATCAAAAGTGCTTCTAACAAACAAATACCTATATTTATTAAATTAATATGGGAGAATAAGTTCAGATTCATTTCAAAGTTGTCCTTATAATTTATGAATTATGAGGAATGGAACTACTCTCTCAGCGATTAGCCCCTAGTGTTTCTGCCCTCTAATAATTATTTCTCATTCTCACTTCTCTCTTCATGGAATCCCCAACTCATGTTGCTCTTCTTTCCAGCCCCGGAATGGGCCACCTCTTCCCCTCACTCGAGCTTGCCACGCGCCTCTCCACGCGCCACCACCTCACCGTCACCGTTTTCCTTGCCCCCTCCCACTCCTCCTCCGCTGAAAACAAAGTCATCGCTACTGCCGAAGCCGTCGGTCTCTTCACCGTCGTCAAACTCCCTCTGGTCGACATGTCGGACGTCACTGACTCCTCTGTCGTTGGCCGCCTCGCCACCACCATGCGCCGCCATGTCCCGGCTTTCCGCTCCGCTGTCTCCACCCTCACCTCTCCTCCCTCCGTCCTCATCGCCGACATCTTCGCTACCGAATCCTTTGCCGTTGCCGACGAGTTCCACATGGCAAAGTACGTCTTTGTTGCCTCTAATGCATGGTTCTTAGCCTTAATCGTTTACGCCCAGGTTTGGGAAACGCAAATCGTTGGGCAGTACGTGGACCAGAAAGAACCACTCCAAATACCGGGATGCGAACCGGTTCGCCCATGCGACGTTATTGACCCGCTTCTGGACCGGACCCATCCACAGTATTTGGAAATGGTCAAAGTGGGGATGGGAATAGCGTCGAGCGACGGCGTTTTGGTTAACACATGGGACGACTTGCAAGGTCGCACCCTCGCATCTTTCAGTGATCAGAATTTGTTGGGTGGAATTATGAAGCCGCCGGTTTACTCTATCGGACCGATTGTGCGGCAGTCCGGTTCAAGCGAGTTGTTCAATTGGTTGAGTAAGCAACCCAGGGAGTCGGTTATTTATGTGTCGTTTGGGAGCGGTGGAACGCTGTCGTTTAAGCAAATGATAGAAGTGGCTCACGGTTTGGAGATGAGTCAGGCGAGATTTGTTTGGGTAGTACGAGCCCCAAAGGTAAGATCAGACGGTGCGTATTTCACGACGGGGGATGGGAGTGAGGAGCAATCGGAGGCAAAGTTTTTGCCAGAGGGATTTTTGGAGCGTACGAGCGAGGTGGGATTTGTTGTACCGATGTGGGCGGACCAAACGGCGGTGTTGGGGAGTCCGGCGGTGGGGGGTTTTTTCACGCATGGTGGATGGAACTCGGCGTTGGAAGGGATTACGAATGGAGTTCCGATGGTTGTGTGGCCGTTGTACGCAGAGCAGCGGCTGAACGCAACGTTTCTGGCAGAGGAGGTGGGAGTGGCGGTCCGGCCAAAGGAGCTGCCAACGAAGGCAGTGATCGGAAGGGAGGAGATCGCGGCGATGGTGAGGAAGATAATGGCGGAGGAGGATGAAGAAGGAAAAGGCATTAGAGCAAAGGCTAAGGAACTTCAACGGAGTGCAGAGAAGGCATCGGCTGAAGGTGGGTCATCTTACGACAACTTTGCTCGAGTTGTGAAATTATTTGGTCGTAAAGGATAAAGTGCCCATGAATTTCACTGTCTTCAATTCAGTGTTTTTTCTCTCGTTCGGTTCAATAACGTGGTCACTCCAACAAGACTCATCCACCACCACGTCGGGGCTTCGGTTAGGGTTCATGTAGACAGAGCCGATCGACGACGTGAACACCACTCGTCGAACATTTGCTTCAGCAGCCGCGGTCATGACATTCTTTGTTCCGATTATGGCCTGTTCCACCTTTTCCTGTAAAATTGTAGTGAGTTATTGTTATTGTTATTTTCCATATAAAATAAAATATAAAGTTCATAATATTAAAAAATCTAGAAAAGTAGAGAGAATGATGAAGTGGAGTTCTAACGGGATCATCGGTCACCGGAGACGCGGTGTGGAAAACACCATGGCAACCCATAATAGCTGCTTTGAGACTGTCAAAATCAAGAAGATCAGCACTAAACAAAGACAACCGCTCTTTAGCTCCTTCTAAGTTGTTCAAATGAGCATTCTTTTGGTCATCTGGGTTTCTAACGGTTCCTCTGACAGTGTAGCCCTTCTCAAGAAGAAGCTTTACAAGCCATGAAGCAATGAAGCCTCCGGCGCCGGTGACACAAACAATTTGGCCGGAAACTGCAGAGGTATCGATCGGCATAGTAGGGGATAAGGAAGAAGGATTTGTTTTGTTTGGAATAGAAAGAGAAGGAGAGAAAGTTGAAAGAAGAGGATGTAGGTTTAGATGGATGCCCACAAAAAGGAGTGGAACATTGGGTGCATTTAAAGGGGAGGAAATCGATCTACCTCAGCTTAAAAAATGAGCACATGTTAATATAAAACCGTTATAAACTTTGGCCTCCAATTATTGACTATTTATTTGGTTGGGTCAAAAGTGAGACCTCAACAACCACCATGGCTTAACCAAAGTTGCTTCATTTTCTGTATCTCTCAATATCACCAAACTTCCACGAATATTCTCCCTCTCAAAGTAGTTTTGTATAATTATATCCATATGAAAACATTACTGAATGTGGCCTTTCCGC

Coding sequence (CDS)

ATGGAATCCCCAACTCATGTTGCTCTTCTTTCCAGCCCCGGAATGGGCCACCTCTTCCCCTCACTCGAGCTTGCCACGCGCCTCTCCACGCGCCACCACCTCACCGTCACCGTTTTCCTTGCCCCCTCCCACTCCTCCTCCGCTGAAAACAAAGTCATCGCTACTGCCGAAGCCGTCGGTCTCTTCACCGTCGTCAAACTCCCTCTGGTCGACATGTCGGACGTCACTGACTCCTCTGTCGTTGGCCGCCTCGCCACCACCATGCGCCGCCATGTCCCGGCTTTCCGCTCCGCTGTCTCCACCCTCACCTCTCCTCCCTCCGTCCTCATCGCCGACATCTTCGCTACCGAATCCTTTGCCGTTGCCGACGAGTTCCACATGGCAAAGTACGTCTTTGTTGCCTCTAATGCATGGTTCTTAGCCTTAATCGTTTACGCCCAGGTTTGGGAAACGCAAATCGTTGGGCAGTACGTGGACCAGAAAGAACCACTCCAAATACCGGGATGCGAACCGGTTCGCCCATGCGACGTTATTGACCCGCTTCTGGACCGGACCCATCCACAGTATTTGGAAATGGTCAAAGTGGGGATGGGAATAGCGTCGAGCGACGGCGTTTTGGTTAACACATGGGACGACTTGCAAGGTCGCACCCTCGCATCTTTCAGTGATCAGAATTTGTTGGGTGGAATTATGAAGCCGCCGGTTTACTCTATCGGACCGATTGTGCGGCAGTCCGGTTCAAGCGAGTTGTTCAATTGGTTGAGTAAGCAACCCAGGGAGTCGGTTATTTATGTGTCGTTTGGGAGCGGTGGAACGCTGTCGTTTAAGCAAATGATAGAAGTGGCTCACGGTTTGGAGATGAGTCAGGCGAGATTTGTTTGGGTAGTACGAGCCCCAAAGGTAAGATCAGACGGTGCGTATTTCACGACGGGGGATGGGAGTGAGGAGCAATCGGAGGCAAAGTTTTTGCCAGAGGGATTTTTGGAGCGTACGAGCGAGGTGGGATTTGTTGTACCGATGTGGGCGGACCAAACGGCGGTGTTGGGGAGTCCGGCGGTGGGGGGTTTTTTCACGCATGGTGGATGGAACTCGGCGTTGGAAGGGATTACGAATGGAGTTCCGATGGTTGTGTGGCCGTTGTACGCAGAGCAGCGGCTGAACGCAACGTTTCTGGCAGAGGAGGTGGGAGTGGCGGTCCGGCCAAAGGAGCTGCCAACGAAGGCAGTGATCGGAAGGGAGGAGATCGCGGCGATGGTGAGGAAGATAATGGCGGAGGAGGATGAAGAAGGAAAAGGCATTAGAGCAAAGGCTAAGGAACTTCAACGGAGTGCAGAGAAGGCATCGGCTGAAGGTGGGTCATCTTACGACAACTTTGCTCGAGTTGTGAAATTATTTGGTCGTAAAGGATAA

Protein sequence

MESPTHVALLSSPGMGHLFPSLELATRLSTRHHLTVTVFLAPSHSSSAENKVIATAEAVGLFTVVKLPLVDMSDVTDSSVVGRLATTMRRHVPAFRSAVSTLTSPPSVLIADIFATESFAVADEFHMAKYVFVASNAWFLALIVYAQVWETQIVGQYVDQKEPLQIPGCEPVRPCDVIDPLLDRTHPQYLEMVKVGMGIASSDGVLVNTWDDLQGRTLASFSDQNLLGGIMKPPVYSIGPIVRQSGSSELFNWLSKQPRESVIYVSFGSGGTLSFKQMIEVAHGLEMSQARFVWVVRAPKVRSDGAYFTTGDGSEEQSEAKFLPEGFLERTSEVGFVVPMWADQTAVLGSPAVGGFFTHGGWNSALEGITNGVPMVVWPLYAEQRLNATFLAEEVGVAVRPKELPTKAVIGREEIAAMVRKIMAEEDEEGKGIRAKAKELQRSAEKASAEGGSSYDNFARVVKLFGRKG
Homology
BLAST of Clc10G12200 vs. NCBI nr
Match: XP_038880693.1 (anthocyanidin 3-O-glucosyltransferase 5-like [Benincasa hispida])

HSP 1 Score: 826.2 bits (2133), Expect = 1.4e-235
Identity = 426/471 (90.45%), Postives = 437/471 (92.78%), Query Frame = 0

Query: 1   MESPTHVALLSSPGMGHLFPSLELATRLSTRHHLTVTVFLAPSHSSSAENKVIATAEAVG 60
           M+S THVAL+SSPGMGHLFPSLELATRLSTRHHLTVTVF+ PSHSS+AENKVIA AEA G
Sbjct: 1   MDSQTHVALISSPGMGHLFPSLELATRLSTRHHLTVTVFIVPSHSSNAENKVIAAAEAAG 60

Query: 61  LFTVVKLPLVDMSDVTDSSVVGRLATTMRRHVPAFRSAVSTLTSPPSVLIADIFATESFA 120
           LFTVV+LP  DMSDVTDSSVVGRLA TMRRHVP  RSAVS LTS PSVLIADIFATESFA
Sbjct: 61  LFTVVELPPADMSDVTDSSVVGRLAITMRRHVPILRSAVSALTSLPSVLIADIFATESFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALIVYAQVWETQIVGQYVDQKEPLQIPGCEPVRPCDVIDP 180
           VADEFHMAKYVFVASNAWFLAL VYAQVW+ QIVGQYVDQKEPLQIPGCEPVRPCDVIDP
Sbjct: 121 VADEFHMAKYVFVASNAWFLALTVYAQVWDKQIVGQYVDQKEPLQIPGCEPVRPCDVIDP 180

Query: 181 LLDRTHPQYLEMVKVGMGIASSDGVLVNTWDDLQGRTLASFSDQNLLGGIMKPPVYSIGP 240
           LLDRT PQY E+VKVGMGIAS DGVLVNTWDDLQGRTLASF D+NLLG IMKPPVYSIGP
Sbjct: 181 LLDRTQPQYFEIVKVGMGIASCDGVLVNTWDDLQGRTLASFRDRNLLGKIMKPPVYSIGP 240

Query: 241 IVRQS-----GSSELFNWLSKQPRESVIYVSFGSGGTLSFKQMIEVAHGLEMSQARFVWV 300
           IVRQS     GSSELFNWLSKQP ESVIYVSFGSGGTLSF+QM EVAHGLEMS+ RFVWV
Sbjct: 241 IVRQSGSKKGGSSELFNWLSKQPTESVIYVSFGSGGTLSFEQMTEVAHGLEMSRQRFVWV 300

Query: 301 VRAPKVRSDGAYFTTGDGSEEQSEAKFLPEGFLERTSEVGFVVPMWADQTAVLGSPAVGG 360
           VRAPKVRSDGAYFTTGDGSEEQS  KFLPEGFLERTSEVGFVV MWADQTAVLGSPAVGG
Sbjct: 301 VRAPKVRSDGAYFTTGDGSEEQSAGKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360

Query: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRLNATFLAEEVGVAVRPKELPTKAVIGREEI 420
           FFTHGGWNSALEGITNGVPMVVWPLYAEQRLNAT LAEEV VAVRPKELPTKAVIGREEI
Sbjct: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRLNATMLAEEVRVAVRPKELPTKAVIGREEI 420

Query: 421 AAMVRKIMAEEDEEGKGIRAKAKELQRSAEKASAEGGSSYDNFARVVKLFG 467
           AAMVRKIMAEEDEEGK IRAKAKELQRSAE ASAE GSSY+NFARVVKLFG
Sbjct: 421 AAMVRKIMAEEDEEGKAIRAKAKELQRSAENASAEDGSSYENFARVVKLFG 471

BLAST of Clc10G12200 vs. NCBI nr
Match: TYK16721.1 (anthocyanidin 3-O-glucosyltransferase 5 [Cucumis melo var. makuwa])

HSP 1 Score: 785.4 bits (2027), Expect = 2.7e-223
Identity = 405/471 (85.99%), Postives = 428/471 (90.87%), Query Frame = 0

Query: 1   MESPTHVALLSSPGMGHLFPSLELATRLSTRHHLTVTVFLAPSHSSSAENKVIATAEAVG 60
           MES  HVAL+SSPGMGHLFP+LELATRLST H LTVTVF+ PSHSSSAENKVIATA+A G
Sbjct: 1   MESAAHVALISSPGMGHLFPALELATRLSTHHRLTVTVFIVPSHSSSAENKVIATAQAAG 60

Query: 61  LFTVVKLPLVDMSDVTDSSVVGRLATTMRRHVPAFRSAVSTLTSPPSVLIADIFATESFA 120
           LFTVV+LP  DMSDVT+SS+VGRLA TMRRHVP FRSAVS +TSPPSVLIADIFA ESFA
Sbjct: 61  LFTVVELPPADMSDVTESSIVGRLAITMRRHVPIFRSAVSAMTSPPSVLIADIFAVESFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALIVYAQVWETQIVGQYVDQKEPLQIPGCEPVRPCDVIDP 180
           VADEF MAKY FVASNAWFLA++VYAQVW+ +IVGQYVDQKEPLQIPGCEPVRPCDVIDP
Sbjct: 121 VADEFDMAKYTFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCEPVRPCDVIDP 180

Query: 181 LLDRTHPQYLEMVKVGMGIASSDGVLVNTWDDLQGRTLASFSDQNLLGGIMKPPVYSIGP 240
           LLDRT  QY E++K+GMGIASSDGVLVNTWD+LQ RTLAS +D+ LLG I  PPVYSIGP
Sbjct: 181 LLDRTELQYSEILKLGMGIASSDGVLVNTWDELQHRTLASLNDRYLLGKI-SPPVYSIGP 240

Query: 241 IVRQ-----SGSSELFNWLSKQPRESVIYVSFGSGGTLSFKQMIEVAHGLEMSQARFVWV 300
           IVRQ      GSSELFNWLSKQP ESVIYVSFGSGGTLSF+QM EVAHGLEMS+ RFVWV
Sbjct: 241 IVRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSKQRFVWV 300

Query: 301 VRAPKVRSDGAYFTTGDGSEEQSEAKFLPEGFLERTSEVGFVVPMWADQTAVLGSPAVGG 360
           VRAPKVRSDGAYFTTGDGSEEQS AKFLPEGFLERTSEVGFVV MWADQTAVLGSPAVGG
Sbjct: 301 VRAPKVRSDGAYFTTGDGSEEQSSAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360

Query: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRLNATFLAEEVGVAVRPKELPTKAVIGREEI 420
           FFTH GWNSALEGITNGVPMVVWPLYAEQRLNAT LAEE+GVAVR KELPTKA+I REEI
Sbjct: 361 FFTHSGWNSALEGITNGVPMVVWPLYAEQRLNATMLAEEIGVAVRSKELPTKALIEREEI 420

Query: 421 AAMVRKIMAEEDEEGKGIRAKAKELQRSAEKASAEGGSSYDNFARVVKLFG 467
           AAMVRKIM EED+EGK IRAKAKELQRSAEKA AEGGSSY NFARVVKLFG
Sbjct: 421 AAMVRKIMVEEDDEGKAIRAKAKELQRSAEKALAEGGSSYHNFARVVKLFG 470

BLAST of Clc10G12200 vs. NCBI nr
Match: XP_008453746.1 (PREDICTED: anthocyanidin 3-O-glucosyltransferase 5, partial [Cucumis melo])

HSP 1 Score: 780.0 bits (2013), Expect = 1.1e-221
Identity = 402/469 (85.71%), Postives = 426/469 (90.83%), Query Frame = 0

Query: 1   MESPTHVALLSSPGMGHLFPSLELATRLSTRHHLTVTVFLAPSHSSSAENKVIATAEAVG 60
           MES  HVAL+SSPGMGHLFP+LELATRLST H LTVTVF+ PSHSSSAENKVIATA+A G
Sbjct: 1   MESAAHVALISSPGMGHLFPALELATRLSTHHRLTVTVFIVPSHSSSAENKVIATAQAAG 60

Query: 61  LFTVVKLPLVDMSDVTDSSVVGRLATTMRRHVPAFRSAVSTLTSPPSVLIADIFATESFA 120
           LFTVV+LP  DMSDVT+SS+VGRLA TMRRHVP FRSAVS +TSPPSVLIADIFA ESFA
Sbjct: 61  LFTVVELPPADMSDVTESSIVGRLAITMRRHVPIFRSAVSAMTSPPSVLIADIFAVESFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALIVYAQVWETQIVGQYVDQKEPLQIPGCEPVRPCDVIDP 180
           VADEF MAKY FVASNAWFLA++VYAQVW+ +IVGQYVDQKEPLQIPGCEPVRPCDVIDP
Sbjct: 121 VADEFDMAKYTFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCEPVRPCDVIDP 180

Query: 181 LLDRTHPQYLEMVKVGMGIASSDGVLVNTWDDLQGRTLASFSDQNLLGGIMKPPVYSIGP 240
           LLDRT  QY E++K+GMGIASSDGVLVNTWD+LQ RTLAS +D+ LLG I  PPVYSIGP
Sbjct: 181 LLDRTELQYSEILKLGMGIASSDGVLVNTWDELQHRTLASLNDRYLLGKI-SPPVYSIGP 240

Query: 241 IVRQ-----SGSSELFNWLSKQPRESVIYVSFGSGGTLSFKQMIEVAHGLEMSQARFVWV 300
           IVRQ      GSSELFNWLSKQP ESVIYVSFGSGGTLSF+QM EVAHGLEMS+ RFVWV
Sbjct: 241 IVRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSKQRFVWV 300

Query: 301 VRAPKVRSDGAYFTTGDGSEEQSEAKFLPEGFLERTSEVGFVVPMWADQTAVLGSPAVGG 360
           VRAPKVRSDGAYFTTGDGSEEQS AKFLPEGFLERTSEVGFVV MWADQTAVLGSPAVGG
Sbjct: 301 VRAPKVRSDGAYFTTGDGSEEQSSAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360

Query: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRLNATFLAEEVGVAVRPKELPTKAVIGREEI 420
           FFTH GWNSALEGITNGVPMVVWPLYAEQRLNAT LAEE+GVAVR KELPTKA+I REEI
Sbjct: 361 FFTHSGWNSALEGITNGVPMVVWPLYAEQRLNATMLAEEIGVAVRSKELPTKALIEREEI 420

Query: 421 AAMVRKIMAEEDEEGKGIRAKAKELQRSAEKASAEGGSSYDNFARVVKL 465
           AAMVRKIM EED+EGK IRAKAKELQRSAEKA AEGGSSY NFARVVK+
Sbjct: 421 AAMVRKIMVEEDDEGKAIRAKAKELQRSAEKALAEGGSSYHNFARVVKI 468

BLAST of Clc10G12200 vs. NCBI nr
Match: XP_022929544.1 (anthocyanidin 3-O-glucosyltransferase 5-like [Cucurbita moschata])

HSP 1 Score: 774.6 bits (1999), Expect = 4.8e-220
Identity = 398/468 (85.04%), Postives = 422/468 (90.17%), Query Frame = 0

Query: 1   MESPTHVALLSSPGMGHLFPSLELATRLSTRHHLTVTVFLAPSHSSSAENKVIATAEAVG 60
           MES  HVAL+SSPGMGHLFPSLELATRLS RHHL+VTVF+ PS SSSAENKVIA A+A G
Sbjct: 1   MESQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSAENKVIAAAQAAG 60

Query: 61  LFTVVKLPLVDMSDVTDSSVVGRLATTMRRHVPAFRSAVSTLTSPPSVLIADIFATESFA 120
           LFTV++LP  DMSDVT+S+VVGRL  TMRRHVPA RSAVSTLT+ PSVLIADIFATESFA
Sbjct: 61  LFTVIELPPADMSDVTESAVVGRLCITMRRHVPALRSAVSTLTTLPSVLIADIFATESFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALIVYAQVWETQIVGQYVDQKEPLQIPGCEPVRPCDVIDP 180
           VADEFHMAKYVFVASNAWFLA  +Y  V + QI GQYVDQKEPL IPGCEPVRPCDVIDP
Sbjct: 121 VADEFHMAKYVFVASNAWFLAFTIYVPVLDKQITGQYVDQKEPLYIPGCEPVRPCDVIDP 180

Query: 181 LLDRTHPQYLEMVKVGMGIASSDGVLVNTWDDLQGRTLASFSDQNLLGGIMKPPVYSIGP 240
           LLDRT PQY E V++GM I SSDGVLVNTWDDLQGRTLASF D+NLLG IM  PVYSIGP
Sbjct: 181 LLDRTEPQYFEYVRIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMNSPVYSIGP 240

Query: 241 IVRQS-----GSSELFNWLSKQPRESVIYVSFGSGGTLSFKQMIEVAHGLEMSQARFVWV 300
           IVRQ+     GSSELFNWLSKQP ESVIYVSFGSGGTLS +QM EVAHGLEMS  RFVWV
Sbjct: 241 IVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 300

Query: 301 VRAPKVRSDGAYFTTGDGSEEQSEAKFLPEGFLERTSEVGFVVPMWADQTAVLGSPAVGG 360
           VRAPKVRSD  +FTTGDGSE+QSEAKFLP+GFLERTSEVGFVV MWADQTAVLGSPAVGG
Sbjct: 301 VRAPKVRSDATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360

Query: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRLNATFLAEEVGVAVRPKELPTKAVIGREEI 420
           FFTHGGWNSALEGITNGVPMVVWPLYAEQR+NAT LAEEV VAVRPKELPTKAVIGREEI
Sbjct: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI 420

Query: 421 AAMVRKIMAEEDEEGKGIRAKAKELQRSAEKASAEGGSSYDNFARVVK 464
           AAMVRKIMAEEDEEGK IRAKAKELQRSAEK++AEGGSS++NFARVVK
Sbjct: 421 AAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVK 468

BLAST of Clc10G12200 vs. NCBI nr
Match: KAG7015349.1 (Anthocyanidin 3-O-glucosyltransferase 5 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 773.9 bits (1997), Expect = 8.1e-220
Identity = 398/468 (85.04%), Postives = 422/468 (90.17%), Query Frame = 0

Query: 1   MESPTHVALLSSPGMGHLFPSLELATRLSTRHHLTVTVFLAPSHSSSAENKVIATAEAVG 60
           MES  HVAL+SSPGMGHLFPSLELATRLS RHHL+VTVF+ PS SSSAENKVIA A+A G
Sbjct: 1   MESQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSAENKVIAAAQAAG 60

Query: 61  LFTVVKLPLVDMSDVTDSSVVGRLATTMRRHVPAFRSAVSTLTSPPSVLIADIFATESFA 120
           LFTV++LP  DMSDVT+S+VVGRL  TMRRHVPA RSAVSTLT+ PSVLIADIFATESFA
Sbjct: 61  LFTVIELPPADMSDVTESAVVGRLCITMRRHVPALRSAVSTLTTLPSVLIADIFATESFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALIVYAQVWETQIVGQYVDQKEPLQIPGCEPVRPCDVIDP 180
           VADEFHMAKYVFVASNAWFLA  +Y  V + QI GQYVDQKEPL IPGCEPVRPCDVIDP
Sbjct: 121 VADEFHMAKYVFVASNAWFLAFTIYVPVLDKQITGQYVDQKEPLYIPGCEPVRPCDVIDP 180

Query: 181 LLDRTHPQYLEMVKVGMGIASSDGVLVNTWDDLQGRTLASFSDQNLLGGIMKPPVYSIGP 240
           LLDRT PQY E V++GM I SSDGVLVNTWDDLQGRTLASF D+NLLG IMK PVYSIGP
Sbjct: 181 LLDRTEPQYFEYVRIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMKSPVYSIGP 240

Query: 241 IVRQS-----GSSELFNWLSKQPRESVIYVSFGSGGTLSFKQMIEVAHGLEMSQARFVWV 300
           IVRQ+     GSSELFNWLSKQP ESVIYVSFGSGGTLS +QM EVAHGLEMS  RFVWV
Sbjct: 241 IVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 300

Query: 301 VRAPKVRSDGAYFTTGDGSEEQSEAKFLPEGFLERTSEVGFVVPMWADQTAVLGSPAVGG 360
           VRAPKVRSD  +FTTGDGSE+QSEAKFLP+GFLERTSEVGFVV MWADQTAVLGSPAVGG
Sbjct: 301 VRAPKVRSDATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360

Query: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRLNATFLAEEVGVAVRPKELPTKAVIGREEI 420
           FFTHGGWNSALEGITNGVPMVVWPLYAEQR+NAT LAEEV VAVRPKELPTKAVIGREEI
Sbjct: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI 420

Query: 421 AAMVRKIMAEEDEEGKGIRAKAKELQRSAEKASAEGGSSYDNFARVVK 464
           AAMVRKIMA EDEEGK IRAKAKELQRSAEK++AEGGSS++NFARVVK
Sbjct: 421 AAMVRKIMAVEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVK 468

BLAST of Clc10G12200 vs. ExPASy Swiss-Prot
Match: Q40287 (Anthocyanidin 3-O-glucosyltransferase 5 OS=Manihot esculenta OX=3983 GN=GT5 PE=2 SV=1)

HSP 1 Score: 459.1 bits (1180), Expect = 5.8e-128
Identity = 237/463 (51.19%), Postives = 324/463 (69.98%), Query Frame = 0

Query: 1   MESPTHVALLSSPGMGHLFPSLELATRLSTRHHLTVTVFLAPSHSSSAENKVIATAEAVG 60
           + S  H+ LLSSPG+GHL P LEL  R+ T  +  VT+F+  S +S+AE +V+ +A    
Sbjct: 6   LNSKPHIVLLSSPGLGHLIPVLELGKRIVTLCNFDVTIFMVGSDTSAAEPQVLRSAMTPK 65

Query: 61  LFTVVKLPLVDMSDVTD--SSVVGRLATTMRRHVPAFRSAVSTLTSPPSVLIADIFATES 120
           L  +++LP  ++S + D  ++V  RL   MR   PAFR+AVS L   P+ +I D+F TES
Sbjct: 66  LCEIIQLPPPNISCLIDPEATVCTRLFVLMREIRPAFRAAVSALKFRPAAIIVDLFGTES 125

Query: 121 FAVADEFHMAKYVFVASNAWFLALIVYAQVWETQIVGQYVDQKEPLQIPGCEPVRPCDVI 180
             VA E  +AKYV++ASNAWFLAL +Y  + + ++ G++V QKEP++IPGC PVR  +V+
Sbjct: 126 LEVAKELGIAKYVYIASNAWFLALTIYVPILDKEVEGEFVLQKEPMKIPGCRPVRTEEVV 185

Query: 181 DPLLDRTHPQYLEMVKVGMGIASSDGVLVNTWDDLQGRTLASFSDQNLLGGIMKPPVYSI 240
           DP+LDRT+ QY E  ++G+ I ++DG+L+NTW+ L+  T  +  D   LG + K PV+ I
Sbjct: 186 DPMLDRTNQQYSEYFRLGIEIPTADGILMNTWEALEPTTFGALRDVKFLGRVAKVPVFPI 245

Query: 241 GPIVRQSG----SSELFNWLSKQPRESVIYVSFGSGGTLSFKQMIEVAHGLEMSQARFVW 300
           GP+ RQ+G    + EL +WL +QP+ESV+YVSFGSGGTLS +QMIE+A GLE SQ RF+W
Sbjct: 246 GPLRRQAGPCGSNCELLDWLDQQPKESVVYVSFGSGGTLSLEQMIELAWGLERSQQRFIW 305

Query: 301 VVRAPKVRS-DGAYFTTGDGSEEQSEAKFLPEGFLERTSEVGFVVPMWADQTAVLGSPAV 360
           VVR P V++ D A+FT GDG+++ S   + PEGFL R   VG VVP W+ Q  ++  P+V
Sbjct: 306 VVRQPTVKTGDAAFFTQGDGADDMS--GYFPEGFLTRIQNVGLVVPQWSPQIHIMSHPSV 365

Query: 361 GGFFTHGGWNSALEGITNGVPMVVWPLYAEQRLNATFLAEEVGVAVRPKELPTKAVIGRE 420
           G F +H GWNS LE IT GVP++ WP+YAEQR+NAT L EE+GVAVRPK LP K V+ RE
Sbjct: 366 GVFLSHCGWNSVLESITAGVPIIAWPIYAEQRMNATLLTEELGVAVRPKNLPAKEVVKRE 425

Query: 421 EIAAMVRKIMAEEDEEGKGIRAKAKELQRSAEKASAEGGSSYD 457
           EI  M+R+IM   DEEG  IR + +EL+ S EKA  EGGSS++
Sbjct: 426 EIERMIRRIMV--DEEGSEIRKRVRELKDSGEKALNEGGSSFN 464

BLAST of Clc10G12200 vs. ExPASy Swiss-Prot
Match: Q9ZU72 (UDP-glycosyltransferase 72D1 OS=Arabidopsis thaliana OX=3702 GN=UGT72D1 PE=2 SV=1)

HSP 1 Score: 422.2 bits (1084), Expect = 7.9e-117
Identity = 225/465 (48.39%), Postives = 318/465 (68.39%), Query Frame = 0

Query: 1   MESPTHVALLSSPGMGHLFPSLELATRLSTRHHLTVTVFLAPSHSSS-AENKVIATAEAV 60
           M+ P H  L++SPG+GHL P LEL  RLS+  ++ VT+    S SSS  E + I  A A 
Sbjct: 1   MDQP-HALLVASPGLGHLIPILELGNRLSSVLNIHVTILAVTSGSSSPTETEAIHAAAAR 60

Query: 61  GLFTVVKLPLVDMSDVT--DSSVVGRLATTMRRHVPAFRSAVSTLTSPPSVLIADIFATE 120
            +  + ++P VD+ ++   D+++  ++   MR   PA R AV  +   P+V+I D   TE
Sbjct: 61  TICQITEIPSVDVDNLVEPDATIFTKMVVKMRAMKPAVRDAVKLMKRKPTVMIVDFLGTE 120

Query: 121 SFAVADEFHM-AKYVFVASNAWFLALIVYAQVWETQIVGQYVDQKEPLQIPGCEPVRPCD 180
             +VAD+  M AKYV+V ++AWFLA++VY  V +T + G+YVD KEPL+IPGC+PV P +
Sbjct: 121 LMSVADDVGMTAKYVYVPTHAWFLAVMVYLPVLDTVVEGEYVDIKEPLKIPGCKPVGPKE 180

Query: 181 VIDPLLDRTHPQYLEMVKVGMGIASSDGVLVNTWDDLQGRTLASFSDQNLLGGIMKPPVY 240
           +++ +LDR+  QY E V+ G+ +  SDGVLVNTW++LQG TLA+  +   L  +MK PVY
Sbjct: 181 LMETMLDRSGQQYKECVRAGLEVPMSDGVLVNTWEELQGNTLAALREDEELSRVMKVPVY 240

Query: 241 SIGPIVRQS----GSSELFNWLSKQPRESVIYVSFGSGGTLSFKQMIEVAHGLEMSQARF 300
            IGPIVR +      + +F WL +Q   SV++V  GSGGTL+F+Q +E+A GLE+S  RF
Sbjct: 241 PIGPIVRTNQHVDKPNSIFEWLDEQRERSVVFVCLGSGGTLTFEQTVELALGLELSGQRF 300

Query: 301 VWVVRAPKVRSDGAYFTTGDGSEEQSEAKFLPEGFLERTSEVGFVVPMWADQTAVLGSPA 360
           VWV+R P      A +     S+++  +  LPEGFL+RT  VG VV  WA Q  +L   +
Sbjct: 301 VWVLRRP------ASYLGAISSDDEQVSASLPEGFLDRTRGVGIVVTQWAPQVEILSHRS 360

Query: 361 VGGFFTHGGWNSALEGITNGVPMVVWPLYAEQRLNATFLAEEVGVAVRPKELPTKAVIGR 420
           +GGF +H GW+SALE +T GVP++ WPLYAEQ +NAT L EE+GVAVR  ELP++ VIGR
Sbjct: 361 IGGFLSHCGWSSALESLTKGVPIIAWPLYAEQWMNATLLTEEIGVAVRTSELPSERVIGR 420

Query: 421 EEIAAMVRKIMAEEDEEGKGIRAKAKELQRSAEKASAEGGSSYDN 458
           EE+A++VRKIMAEEDEEG+ IRAKA+E++ S+E+A ++ GSSY++
Sbjct: 421 EEVASLVRKIMAEEDEEGQKIRAKAEEVRVSSERAWSKDGSSYNS 458

BLAST of Clc10G12200 vs. ExPASy Swiss-Prot
Match: Q94A84 (UDP-glycosyltransferase 72E1 OS=Arabidopsis thaliana OX=3702 GN=UGT72E1 PE=1 SV=1)

HSP 1 Score: 382.9 bits (982), Expect = 5.3e-105
Identity = 207/465 (44.52%), Postives = 292/465 (62.80%), Query Frame = 0

Query: 6   HVALLSSPGMGHLFPSLELATRLSTRHHLTVTVFLAPSHSSSAENKVIATAEA-VGLFTV 65
           HVA+ +SPGMGH+ P +EL  RL+  H   VT+F+  + ++SA+++ + +      L  +
Sbjct: 7   HVAMFASPGMGHIIPVIELGKRLAGSHGFDVTIFVLETDAASAQSQFLNSPGCDAALVDI 66

Query: 66  VKLPLVDMSDVTDSSVVG--RLATTMRRHVPAFRSAVSTLTSPPSVLIADIFATESFAVA 125
           V LP  D+S + D S     +L   MR  +P  RS +  +   P+ LI D+F  ++  + 
Sbjct: 67  VGLPTPDISGLVDPSAFFGIKLLVMMRETIPTIRSKIEEMQHKPTALIVDLFGLDAIPLG 126

Query: 126 DEFHMAKYVFVASNAWFLALIVYAQVWETQIVGQYVDQKEPLQIPGCEPVRPCDVIDPLL 185
            EF+M  Y+F+ASNA FLA+ ++    +  +  +++ +K+P+ +PGCEPVR  D ++  L
Sbjct: 127 GEFNMLTYIFIASNARFLAVALFFPTLDKDMEEEHIIKKQPMVMPGCEPVRFEDTLETFL 186

Query: 186 DRTHPQYLEMVKVGMGIASSDGVLVNTWDDLQGRTLASFSDQNLLGGIMKPPVYSIGPIV 245
           D     Y E V  G    + DG++VNTWDD++ +TL S  D  LLG I   PVY IGP+ 
Sbjct: 187 DPNSQLYREFVPFGSVFPTCDGIIVNTWDDMEPKTLKSLQDPKLLGRIAGVPVYPIGPLS 246

Query: 246 RQSGSSE----LFNWLSKQPRESVIYVSFGSGGTLSFKQMIEVAHGLEMSQARFVWVVRA 305
           R    S+    + +WL+KQP ESV+Y+SFGSGG+LS KQ+ E+A GLEMSQ RFVWVVR 
Sbjct: 247 RPVDPSKTNHPVLDWLNKQPDESVLYISFGSGGSLSAKQLTELAWGLEMSQQRFVWVVRP 306

Query: 306 PKVRSD-GAYFTTGDGSEEQSEAKFLPEGFLERTSEVGFVVPMWADQTAVLGSPAVGGFF 365
           P   S   AY +   G        +LPEGF+ RT E GF+V  WA Q  +L   AVGGF 
Sbjct: 307 PVDGSACSAYLSANSGKIRDGTPDYLPEGFVSRTHERGFMVSSWAPQAEILAHQAVGGFL 366

Query: 366 THGGWNSALEGITNGVPMVVWPLYAEQRLNATFLAEEVGVAVRPKELPTKAVIGREEIAA 425
           TH GWNS LE +  GVPM+ WPL+AEQ +NAT L EE+GVAVR K+LP++ VI R EI A
Sbjct: 367 THCGWNSILESVVGGVPMIAWPLFAEQMMNATLLNEELGVAVRSKKLPSEGVITRAEIEA 426

Query: 426 MVRKIMAEEDEEGKGIRAKAKEL-QRSAEKASAEGGSSYDNFARV 462
           +VRKIM E  EEG  +R K K+L + +AE  S +GG ++++ +R+
Sbjct: 427 LVRKIMVE--EEGAEMRKKIKKLKETAAESLSCDGGVAHESLSRI 469

BLAST of Clc10G12200 vs. ExPASy Swiss-Prot
Match: Q9LVR1 (UDP-glycosyltransferase 72E2 OS=Arabidopsis thaliana OX=3702 GN=UGT72E2 PE=1 SV=1)

HSP 1 Score: 371.3 bits (952), Expect = 1.6e-101
Identity = 206/470 (43.83%), Postives = 297/470 (63.19%), Query Frame = 0

Query: 6   HVALLSSPGMGHLFPSLELATRLSTRHHLTVTVFLAPSHSSSAENKVIATAEAVGLFTVV 65
           H A+ SSPGMGH+ P +EL  RLS  +   VTVF+  + ++SA++K +    + G+  +V
Sbjct: 7   HAAMFSSPGMGHVIPVIELGKRLSANNGFHVTVFVLETDAASAQSKFL---NSTGV-DIV 66

Query: 66  KLPLVDMSDVT--DSSVVGRLATTMRRHVPAFRSAVSTLTSPPSVLIADIFATESFAVAD 125
           KLP  D+  +   D  VV ++   MR  VPA RS ++ +   P+ LI D+F T++  +A 
Sbjct: 67  KLPSPDIYGLVDPDDHVVTKIGVIMRAAVPALRSKIAAMHQKPTALIVDLFGTDALCLAK 126

Query: 126 EFHMAKYVFVASNAWFLALIVYAQVWETQIVGQYVDQKEPLQIPGCEPVRPCDVIDPLLD 185
           EF+M  YVF+ +NA FL + +Y    +  I  ++  Q+ PL IPGCEPVR  D +D  L 
Sbjct: 127 EFNMLSYVFIPTNARFLGVSIYYPNLDKDIKEEHTVQRNPLAIPGCEPVRFEDTLDAYLV 186

Query: 186 RTHPQYLEMVKVGMGIASSDGVLVNTWDDLQGRTLASFSDQNLLGGIMKPPVYSIGPIVR 245
              P Y + V+ G+    +DG+LVNTW++++ ++L S  +  LLG + + PVY IGP+ R
Sbjct: 187 PDEPVYRDFVRHGLAYPKADGILVNTWEEMEPKSLKSLLNPKLLGRVARVPVYPIGPLCR 246

Query: 246 QSGSSE----LFNWLSKQPRESVIYVSFGSGGTLSFKQMIEVAHGLEMSQARFVWVVRAP 305
              SSE    + +WL++QP ESV+Y+SFGSGG LS KQ+ E+A GLE SQ RFVWVVR P
Sbjct: 247 PIQSSETDHPVLDWLNEQPNESVLYISFGSGGCLSAKQLTELAWGLEQSQQRFVWVVRPP 306

Query: 306 KVRSDGA----YFTTGDGSEEQSEAKFLPEGFLERTSEVGFVVPMWADQTAVLGSPAVGG 365
               DG+    Y +   G  E +  ++LPEGF+ RTS+ GFVVP WA Q  +L   AVGG
Sbjct: 307 ---VDGSCCSEYVSANGGGTEDNTPEYLPEGFVSRTSDRGFVVPSWAPQAEILSHRAVGG 366

Query: 366 FFTHGGWNSALEGITNGVPMVVWPLYAEQRLNATFLAEEVGVAVRPKELPTKAVIGREEI 425
           F TH GW+S LE +  GVPM+ WPL+AEQ +NA  L++E+G+AVR  +   K  I R +I
Sbjct: 367 FLTHCGWSSTLESVVGGVPMIAWPLFAEQNMNAALLSDELGIAVRLDD--PKEDISRWKI 426

Query: 426 AAMVRKIMAEEDEEGKGIRAKAKELQRSAEKASA--EGGSSYDNFARVVK 464
            A+VRK+M E  +EG+ +R K K+L+ SAE + +   GG ++++  RV K
Sbjct: 427 EALVRKVMTE--KEGEAMRRKVKKLRDSAEMSLSIDGGGLAHESLCRVTK 465

BLAST of Clc10G12200 vs. ExPASy Swiss-Prot
Match: O81498 (UDP-glycosyltransferase 72E3 OS=Arabidopsis thaliana OX=3702 GN=UGT72E3 PE=1 SV=1)

HSP 1 Score: 370.9 bits (951), Expect = 2.1e-101
Identity = 200/467 (42.83%), Postives = 300/467 (64.24%), Query Frame = 0

Query: 6   HVALLSSPGMGHLFPSLELATRLSTRHHLTVTVFLAPSHSSSAENKVIATAEAVGLFTVV 65
           H A+ SSPGMGH+ P +ELA RLS  H   VTVF+  + ++S ++K++    + G+  +V
Sbjct: 7   HAAMFSSPGMGHVLPVIELAKRLSANHGFHVTVFVLETDAASVQSKLL---NSTGV-DIV 66

Query: 66  KLPLVDMSDVTD--SSVVGRLATTMRRHVPAFRSAVSTLTSPPSVLIADIFATESFAVAD 125
            LP  D+S + D  + VV ++   MR  VP  RS +  +   P+ LI D+F T++  +A 
Sbjct: 67  NLPSPDISGLVDPNAHVVTKIGVIMREAVPTLRSKIVAMHQNPTALIIDLFGTDALCLAA 126

Query: 126 EFHMAKYVFVASNAWFLALIVYAQVWETQIVGQYVDQKEPLQIPGCEPVRPCDVIDPLLD 185
           E +M  YVF+ASNA +L + +Y    +  I  ++  Q++PL IPGCEPVR  D++D  L 
Sbjct: 127 ELNMLTYVFIASNARYLGVSIYYPTLDEVIKEEHTVQRKPLTIPGCEPVRFEDIMDAYLV 186

Query: 186 RTHPQYLEMVKVGMGIASSDGVLVNTWDDLQGRTLASFSDQNLLGGIMKPPVYSIGPIVR 245
              P Y ++V+  +    +DG+LVNTW++++ ++L S  D  LLG + + PVY +GP+ R
Sbjct: 187 PDEPVYHDLVRHCLAYPKADGILVNTWEEMEPKSLKSLQDPKLLGRVARVPVYPVGPLCR 246

Query: 246 QSGSS----ELFNWLSKQPRESVIYVSFGSGGTLSFKQMIEVAHGLEMSQARFVWVVRAP 305
              SS     +F+WL+KQP ESV+Y+SFGSGG+L+ +Q+ E+A GLE SQ RF+WVVR P
Sbjct: 247 PIQSSTTDHPVFDWLNKQPNESVLYISFGSGGSLTAQQLTELAWGLEESQQRFIWVVRPP 306

Query: 306 KVRSD-GAYFTTGDGSEEQSEAKFLPEGFLERTSEVGFVVPMWADQTAVLGSPAVGGFFT 365
              S    YF+   G  + +  ++LPEGF+ RT + GF++P WA Q  +L   AVGGF T
Sbjct: 307 VDGSSCSDYFSAKGGVTKDNTPEYLPEGFVTRTCDRGFMIPSWAPQAEILAHQAVGGFLT 366

Query: 366 HGGWNSALEGITNGVPMVVWPLYAEQRLNATFLAEEVGVAVRPKELPTKAVIGREEIAAM 425
           H GW+S LE +  GVPM+ WPL+AEQ +NA  L++E+G++VR  +   K  I R +I AM
Sbjct: 367 HCGWSSTLESVLCGVPMIAWPLFAEQNMNAALLSDELGISVRVDD--PKEAISRSKIEAM 426

Query: 426 VRKIMAEEDEEGKGIRAKAKELQRSAEKASA--EGGSSYDNFARVVK 464
           VRK+MAE  +EG+ +R K K+L+ +AE + +   GGS++++  RV K
Sbjct: 427 VRKVMAE--DEGEEMRRKVKKLRDTAEMSLSIHGGGSAHESLCRVTK 465

BLAST of Clc10G12200 vs. ExPASy TrEMBL
Match: A0A5D3CXP2 (Glycosyltransferase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G005580 PE=3 SV=1)

HSP 1 Score: 785.4 bits (2027), Expect = 1.3e-223
Identity = 405/471 (85.99%), Postives = 428/471 (90.87%), Query Frame = 0

Query: 1   MESPTHVALLSSPGMGHLFPSLELATRLSTRHHLTVTVFLAPSHSSSAENKVIATAEAVG 60
           MES  HVAL+SSPGMGHLFP+LELATRLST H LTVTVF+ PSHSSSAENKVIATA+A G
Sbjct: 1   MESAAHVALISSPGMGHLFPALELATRLSTHHRLTVTVFIVPSHSSSAENKVIATAQAAG 60

Query: 61  LFTVVKLPLVDMSDVTDSSVVGRLATTMRRHVPAFRSAVSTLTSPPSVLIADIFATESFA 120
           LFTVV+LP  DMSDVT+SS+VGRLA TMRRHVP FRSAVS +TSPPSVLIADIFA ESFA
Sbjct: 61  LFTVVELPPADMSDVTESSIVGRLAITMRRHVPIFRSAVSAMTSPPSVLIADIFAVESFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALIVYAQVWETQIVGQYVDQKEPLQIPGCEPVRPCDVIDP 180
           VADEF MAKY FVASNAWFLA++VYAQVW+ +IVGQYVDQKEPLQIPGCEPVRPCDVIDP
Sbjct: 121 VADEFDMAKYTFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCEPVRPCDVIDP 180

Query: 181 LLDRTHPQYLEMVKVGMGIASSDGVLVNTWDDLQGRTLASFSDQNLLGGIMKPPVYSIGP 240
           LLDRT  QY E++K+GMGIASSDGVLVNTWD+LQ RTLAS +D+ LLG I  PPVYSIGP
Sbjct: 181 LLDRTELQYSEILKLGMGIASSDGVLVNTWDELQHRTLASLNDRYLLGKI-SPPVYSIGP 240

Query: 241 IVRQ-----SGSSELFNWLSKQPRESVIYVSFGSGGTLSFKQMIEVAHGLEMSQARFVWV 300
           IVRQ      GSSELFNWLSKQP ESVIYVSFGSGGTLSF+QM EVAHGLEMS+ RFVWV
Sbjct: 241 IVRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSKQRFVWV 300

Query: 301 VRAPKVRSDGAYFTTGDGSEEQSEAKFLPEGFLERTSEVGFVVPMWADQTAVLGSPAVGG 360
           VRAPKVRSDGAYFTTGDGSEEQS AKFLPEGFLERTSEVGFVV MWADQTAVLGSPAVGG
Sbjct: 301 VRAPKVRSDGAYFTTGDGSEEQSSAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360

Query: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRLNATFLAEEVGVAVRPKELPTKAVIGREEI 420
           FFTH GWNSALEGITNGVPMVVWPLYAEQRLNAT LAEE+GVAVR KELPTKA+I REEI
Sbjct: 361 FFTHSGWNSALEGITNGVPMVVWPLYAEQRLNATMLAEEIGVAVRSKELPTKALIEREEI 420

Query: 421 AAMVRKIMAEEDEEGKGIRAKAKELQRSAEKASAEGGSSYDNFARVVKLFG 467
           AAMVRKIM EED+EGK IRAKAKELQRSAEKA AEGGSSY NFARVVKLFG
Sbjct: 421 AAMVRKIMVEEDDEGKAIRAKAKELQRSAEKALAEGGSSYHNFARVVKLFG 470

BLAST of Clc10G12200 vs. ExPASy TrEMBL
Match: A0A1S3BX03 (Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103494390 PE=3 SV=1)

HSP 1 Score: 780.0 bits (2013), Expect = 5.5e-222
Identity = 402/469 (85.71%), Postives = 426/469 (90.83%), Query Frame = 0

Query: 1   MESPTHVALLSSPGMGHLFPSLELATRLSTRHHLTVTVFLAPSHSSSAENKVIATAEAVG 60
           MES  HVAL+SSPGMGHLFP+LELATRLST H LTVTVF+ PSHSSSAENKVIATA+A G
Sbjct: 1   MESAAHVALISSPGMGHLFPALELATRLSTHHRLTVTVFIVPSHSSSAENKVIATAQAAG 60

Query: 61  LFTVVKLPLVDMSDVTDSSVVGRLATTMRRHVPAFRSAVSTLTSPPSVLIADIFATESFA 120
           LFTVV+LP  DMSDVT+SS+VGRLA TMRRHVP FRSAVS +TSPPSVLIADIFA ESFA
Sbjct: 61  LFTVVELPPADMSDVTESSIVGRLAITMRRHVPIFRSAVSAMTSPPSVLIADIFAVESFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALIVYAQVWETQIVGQYVDQKEPLQIPGCEPVRPCDVIDP 180
           VADEF MAKY FVASNAWFLA++VYAQVW+ +IVGQYVDQKEPLQIPGCEPVRPCDVIDP
Sbjct: 121 VADEFDMAKYTFVASNAWFLAVMVYAQVWDREIVGQYVDQKEPLQIPGCEPVRPCDVIDP 180

Query: 181 LLDRTHPQYLEMVKVGMGIASSDGVLVNTWDDLQGRTLASFSDQNLLGGIMKPPVYSIGP 240
           LLDRT  QY E++K+GMGIASSDGVLVNTWD+LQ RTLAS +D+ LLG I  PPVYSIGP
Sbjct: 181 LLDRTELQYSEILKLGMGIASSDGVLVNTWDELQHRTLASLNDRYLLGKI-SPPVYSIGP 240

Query: 241 IVRQ-----SGSSELFNWLSKQPRESVIYVSFGSGGTLSFKQMIEVAHGLEMSQARFVWV 300
           IVRQ      GSSELFNWLSKQP ESVIYVSFGSGGTLSF+QM EVAHGLEMS+ RFVWV
Sbjct: 241 IVRQPGSKKGGSSELFNWLSKQPSESVIYVSFGSGGTLSFEQMTEVAHGLEMSKQRFVWV 300

Query: 301 VRAPKVRSDGAYFTTGDGSEEQSEAKFLPEGFLERTSEVGFVVPMWADQTAVLGSPAVGG 360
           VRAPKVRSDGAYFTTGDGSEEQS AKFLPEGFLERTSEVGFVV MWADQTAVLGSPAVGG
Sbjct: 301 VRAPKVRSDGAYFTTGDGSEEQSSAKFLPEGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360

Query: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRLNATFLAEEVGVAVRPKELPTKAVIGREEI 420
           FFTH GWNSALEGITNGVPMVVWPLYAEQRLNAT LAEE+GVAVR KELPTKA+I REEI
Sbjct: 361 FFTHSGWNSALEGITNGVPMVVWPLYAEQRLNATMLAEEIGVAVRSKELPTKALIEREEI 420

Query: 421 AAMVRKIMAEEDEEGKGIRAKAKELQRSAEKASAEGGSSYDNFARVVKL 465
           AAMVRKIM EED+EGK IRAKAKELQRSAEKA AEGGSSY NFARVVK+
Sbjct: 421 AAMVRKIMVEEDDEGKAIRAKAKELQRSAEKALAEGGSSYHNFARVVKI 468

BLAST of Clc10G12200 vs. ExPASy TrEMBL
Match: A0A6J1EP22 (Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111436081 PE=3 SV=1)

HSP 1 Score: 774.6 bits (1999), Expect = 2.3e-220
Identity = 398/468 (85.04%), Postives = 422/468 (90.17%), Query Frame = 0

Query: 1   MESPTHVALLSSPGMGHLFPSLELATRLSTRHHLTVTVFLAPSHSSSAENKVIATAEAVG 60
           MES  HVAL+SSPGMGHLFPSLELATRLS RHHL+VTVF+ PS SSSAENKVIA A+A G
Sbjct: 1   MESQPHVALVSSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSAENKVIAAAQAAG 60

Query: 61  LFTVVKLPLVDMSDVTDSSVVGRLATTMRRHVPAFRSAVSTLTSPPSVLIADIFATESFA 120
           LFTV++LP  DMSDVT+S+VVGRL  TMRRHVPA RSAVSTLT+ PSVLIADIFATESFA
Sbjct: 61  LFTVIELPPADMSDVTESAVVGRLCITMRRHVPALRSAVSTLTTLPSVLIADIFATESFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALIVYAQVWETQIVGQYVDQKEPLQIPGCEPVRPCDVIDP 180
           VADEFHMAKYVFVASNAWFLA  +Y  V + QI GQYVDQKEPL IPGCEPVRPCDVIDP
Sbjct: 121 VADEFHMAKYVFVASNAWFLAFTIYVPVLDKQITGQYVDQKEPLYIPGCEPVRPCDVIDP 180

Query: 181 LLDRTHPQYLEMVKVGMGIASSDGVLVNTWDDLQGRTLASFSDQNLLGGIMKPPVYSIGP 240
           LLDRT PQY E V++GM I SSDGVLVNTWDDLQGRTLASF D+NLLG IM  PVYSIGP
Sbjct: 181 LLDRTEPQYFEYVRIGMEIPSSDGVLVNTWDDLQGRTLASFRDRNLLGRIMNSPVYSIGP 240

Query: 241 IVRQS-----GSSELFNWLSKQPRESVIYVSFGSGGTLSFKQMIEVAHGLEMSQARFVWV 300
           IVRQ+     GSSELFNWLSKQP ESVIYVSFGSGGTLS +QM EVAHGLEMS  RFVWV
Sbjct: 241 IVRQTGGKKGGSSELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 300

Query: 301 VRAPKVRSDGAYFTTGDGSEEQSEAKFLPEGFLERTSEVGFVVPMWADQTAVLGSPAVGG 360
           VRAPKVRSD  +FTTGDGSE+QSEAKFLP+GFLERTSEVGFVV MWADQTAVLGSPAVGG
Sbjct: 301 VRAPKVRSDATFFTTGDGSEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360

Query: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRLNATFLAEEVGVAVRPKELPTKAVIGREEI 420
           FFTHGGWNSALEGITNGVPMVVWPLYAEQR+NAT LAEEV VAVRPKELPTKAVIGREEI
Sbjct: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI 420

Query: 421 AAMVRKIMAEEDEEGKGIRAKAKELQRSAEKASAEGGSSYDNFARVVK 464
           AAMVRKIMAEEDEEGK IRAKAKELQRSAEK++AEGGSS++NFARVVK
Sbjct: 421 AAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVK 468

BLAST of Clc10G12200 vs. ExPASy TrEMBL
Match: A0A6J1J726 (Glycosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111483150 PE=3 SV=1)

HSP 1 Score: 767.3 bits (1980), Expect = 3.7e-218
Identity = 394/470 (83.83%), Postives = 421/470 (89.57%), Query Frame = 0

Query: 1   MESPTHVALLSSPGMGHLFPSLELATRLSTRHHLTVTVFLAPSHSSSAENKVIATAEAVG 60
           MES  HVAL+SSPGMGHLFPSLELATRLS RHHL+VTVF+ PS SSSAE KVIA A+A G
Sbjct: 1   MESQPHVALISSPGMGHLFPSLELATRLSMRHHLSVTVFIVPSRSSSAEYKVIAAAQAAG 60

Query: 61  LFTVVKLPLVDMSDVTDSSVVGRLATTMRRHVPAFRSAVSTLTSPPSVLIADIFATESFA 120
           LFTV++LP  DMSDVTDS+VVGRL+ TMRRHVPA RSAVS LTS PSVLIADIFATESFA
Sbjct: 61  LFTVIELPPADMSDVTDSTVVGRLSITMRRHVPALRSAVSALTSLPSVLIADIFATESFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALIVYAQVWETQIVGQYVDQKEPLQIPGCEPVRPCDVIDP 180
           VADEFHMAKYVFVASNAWF AL +Y  V + QI GQYVDQKEP  IPGCEPVRPCDV+DP
Sbjct: 121 VADEFHMAKYVFVASNAWFFALTIYVPVLDKQINGQYVDQKEPFHIPGCEPVRPCDVMDP 180

Query: 181 LLDRTHPQYLEMVKVGMGIASSDGVLVNTWDDLQGRTLASFSDQNLLGGIMKPPVYSIGP 240
           LLDRT PQY E V++G  I SSDGVLVNTWDDL+GRTLASF D NLLG IMK PVYSIGP
Sbjct: 181 LLDRTEPQYFEYVRIGTEIPSSDGVLVNTWDDLEGRTLASFRDWNLLGRIMKSPVYSIGP 240

Query: 241 IVRQS-----GSSELFNWLSKQPRESVIYVSFGSGGTLSFKQMIEVAHGLEMSQARFVWV 300
           IVRQ+     G+SELFNWLSKQP ESVIYVSFGSGGTLS +QM EVAHGLEMS  RFVWV
Sbjct: 241 IVRQTGGKKGGASELFNWLSKQPGESVIYVSFGSGGTLSSEQMTEVAHGLEMSGQRFVWV 300

Query: 301 VRAPKVRSDGAYFTTGDGSEEQSEAKFLPEGFLERTSEVGFVVPMWADQTAVLGSPAVGG 360
           VRAPKVRSD  +FTTGDG+E+QSEAKFLP+GFLERTSEVGFVV MWADQTAVLGSPAVGG
Sbjct: 301 VRAPKVRSDATFFTTGDGTEDQSEAKFLPDGFLERTSEVGFVVSMWADQTAVLGSPAVGG 360

Query: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRLNATFLAEEVGVAVRPKELPTKAVIGREEI 420
           FFTHGGWNSALEGITNGVPMVVWPLYAEQR+NAT LAEEV VAVRPKELPTKAVIGREEI
Sbjct: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRMNATMLAEEVRVAVRPKELPTKAVIGREEI 420

Query: 421 AAMVRKIMAEEDEEGKGIRAKAKELQRSAEKASAEGGSSYDNFARVVKLF 466
           AAMVRKIMAEEDEEGK IRAKAKELQRSAEK++AEGGSS++NFARVVKL+
Sbjct: 421 AAMVRKIMAEEDEEGKAIRAKAKELQRSAEKSTAEGGSSFENFARVVKLW 470

BLAST of Clc10G12200 vs. ExPASy TrEMBL
Match: A0A6J1FTD7 (Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111447126 PE=3 SV=1)

HSP 1 Score: 765.8 bits (1976), Expect = 1.1e-217
Identity = 386/474 (81.43%), Postives = 425/474 (89.66%), Query Frame = 0

Query: 1   MESPTHVALLSSPGMGHLFPSLELATRLSTRHHLTVTVFLAPSHSSSAENKVIATAEAVG 60
           M+SPTHVAL+SSPGMGHLFPSLELATRLSTRHHLT+TVFL  SHSSSAEN V+A AEA G
Sbjct: 1   MDSPTHVALISSPGMGHLFPSLELATRLSTRHHLTLTVFLVTSHSSSAENNVVAAAEATG 60

Query: 61  LFTVVKLPLVDMSDVTDSSVVGRLATTMRRHVPAFRSAVSTLTSPPSVLIADIFATESFA 120
           LFTVV+LP  DMSDVTDS+VVGRLA TMRRHVPA RSA+S LTS PS LIADIF+TE+FA
Sbjct: 61  LFTVVELPPADMSDVTDSTVVGRLAITMRRHVPALRSAISALTSRPSALIADIFSTEAFA 120

Query: 121 VADEFHMAKYVFVASNAWFLALIVYAQVWETQIVGQYVDQKEPLQIPGCEPVRPCDVIDP 180
           VADEFHMAKYVFVASNAWFLAL +YAQV + QIVGQYVDQKEPLQIPGCEPVRPCDV+DP
Sbjct: 121 VADEFHMAKYVFVASNAWFLALTIYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDP 180

Query: 181 LLDRTHPQYLEMVKVGMGIASSDGVLVNTWDDLQGRTLASFSDQNLLGGIMKPPVYSIGP 240
           +LDRT  QY E VK+G  IASS GVLVN+WD+LQGRTLASF D++LLG +M  PVYSIGP
Sbjct: 181 MLDRTESQYYEYVKMGRAIASSHGVLVNSWDELQGRTLASFKDRSLLGRVMNAPVYSIGP 240

Query: 241 IVR-----QSGSSELFNWLSKQPRESVIYVSFGSGGTLSFKQMIEVAHGLEMSQARFVWV 300
           IVR     + GSSELFNWL KQP +SVIYVSFGSGGTLSF+QM E+AHGLE+S+ RFVWV
Sbjct: 241 IVRHFGSGKDGSSELFNWLRKQPGKSVIYVSFGSGGTLSFEQMTEMAHGLELSRQRFVWV 300

Query: 301 VRAPKVRSDGAYFTTGDGSEEQSEAKFLPEGFLERTSEVGFVVPMWADQTAVLGSPAVGG 360
           VR P VRSD  +FTTGDGSE+QSEA++LPEGFLERTSEVGF+V MWA+QTAVLGSPAVGG
Sbjct: 301 VRPPTVRSDAMFFTTGDGSEDQSEARYLPEGFLERTSEVGFLVSMWAEQTAVLGSPAVGG 360

Query: 361 FFTHGGWNSALEGITNGVPMVVWPLYAEQRLNATFLAEEVGVAVRPKELPTKAVIGREEI 420
           FFTHGGWNS+LEGIT GVPM+VWPLYAEQR+NAT LA+E+GVAVRPKELP  AVIGREEI
Sbjct: 361 FFTHGGWNSSLEGITKGVPMIVWPLYAEQRMNATMLADEMGVAVRPKELPGNAVIGREEI 420

Query: 421 AAMVRKIMAEEDEEGKGIRAKAKELQRSAEKASAEGGSSYDNFARVVKLFGRKG 470
           AAMVRKIMAEEDEEG+ IRAKA ELQRSAEKA A+GGSSY+NFARVVKLFGR G
Sbjct: 421 AAMVRKIMAEEDEEGRAIRAKAMELQRSAEKACAQGGSSYENFARVVKLFGRTG 474

BLAST of Clc10G12200 vs. TAIR 10
Match: AT2G18570.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 422.2 bits (1084), Expect = 5.6e-118
Identity = 225/465 (48.39%), Postives = 318/465 (68.39%), Query Frame = 0

Query: 1   MESPTHVALLSSPGMGHLFPSLELATRLSTRHHLTVTVFLAPSHSSS-AENKVIATAEAV 60
           M+ P H  L++SPG+GHL P LEL  RLS+  ++ VT+    S SSS  E + I  A A 
Sbjct: 1   MDQP-HALLVASPGLGHLIPILELGNRLSSVLNIHVTILAVTSGSSSPTETEAIHAAAAR 60

Query: 61  GLFTVVKLPLVDMSDVT--DSSVVGRLATTMRRHVPAFRSAVSTLTSPPSVLIADIFATE 120
            +  + ++P VD+ ++   D+++  ++   MR   PA R AV  +   P+V+I D   TE
Sbjct: 61  TICQITEIPSVDVDNLVEPDATIFTKMVVKMRAMKPAVRDAVKLMKRKPTVMIVDFLGTE 120

Query: 121 SFAVADEFHM-AKYVFVASNAWFLALIVYAQVWETQIVGQYVDQKEPLQIPGCEPVRPCD 180
             +VAD+  M AKYV+V ++AWFLA++VY  V +T + G+YVD KEPL+IPGC+PV P +
Sbjct: 121 LMSVADDVGMTAKYVYVPTHAWFLAVMVYLPVLDTVVEGEYVDIKEPLKIPGCKPVGPKE 180

Query: 181 VIDPLLDRTHPQYLEMVKVGMGIASSDGVLVNTWDDLQGRTLASFSDQNLLGGIMKPPVY 240
           +++ +LDR+  QY E V+ G+ +  SDGVLVNTW++LQG TLA+  +   L  +MK PVY
Sbjct: 181 LMETMLDRSGQQYKECVRAGLEVPMSDGVLVNTWEELQGNTLAALREDEELSRVMKVPVY 240

Query: 241 SIGPIVRQS----GSSELFNWLSKQPRESVIYVSFGSGGTLSFKQMIEVAHGLEMSQARF 300
            IGPIVR +      + +F WL +Q   SV++V  GSGGTL+F+Q +E+A GLE+S  RF
Sbjct: 241 PIGPIVRTNQHVDKPNSIFEWLDEQRERSVVFVCLGSGGTLTFEQTVELALGLELSGQRF 300

Query: 301 VWVVRAPKVRSDGAYFTTGDGSEEQSEAKFLPEGFLERTSEVGFVVPMWADQTAVLGSPA 360
           VWV+R P      A +     S+++  +  LPEGFL+RT  VG VV  WA Q  +L   +
Sbjct: 301 VWVLRRP------ASYLGAISSDDEQVSASLPEGFLDRTRGVGIVVTQWAPQVEILSHRS 360

Query: 361 VGGFFTHGGWNSALEGITNGVPMVVWPLYAEQRLNATFLAEEVGVAVRPKELPTKAVIGR 420
           +GGF +H GW+SALE +T GVP++ WPLYAEQ +NAT L EE+GVAVR  ELP++ VIGR
Sbjct: 361 IGGFLSHCGWSSALESLTKGVPIIAWPLYAEQWMNATLLTEEIGVAVRTSELPSERVIGR 420

Query: 421 EEIAAMVRKIMAEEDEEGKGIRAKAKELQRSAEKASAEGGSSYDN 458
           EE+A++VRKIMAEEDEEG+ IRAKA+E++ S+E+A ++ GSSY++
Sbjct: 421 EEVASLVRKIMAEEDEEGQKIRAKAEEVRVSSERAWSKDGSSYNS 458

BLAST of Clc10G12200 vs. TAIR 10
Match: AT3G50740.1 (UDP-glucosyl transferase 72E1 )

HSP 1 Score: 382.9 bits (982), Expect = 3.8e-106
Identity = 207/465 (44.52%), Postives = 292/465 (62.80%), Query Frame = 0

Query: 6   HVALLSSPGMGHLFPSLELATRLSTRHHLTVTVFLAPSHSSSAENKVIATAEA-VGLFTV 65
           HVA+ +SPGMGH+ P +EL  RL+  H   VT+F+  + ++SA+++ + +      L  +
Sbjct: 7   HVAMFASPGMGHIIPVIELGKRLAGSHGFDVTIFVLETDAASAQSQFLNSPGCDAALVDI 66

Query: 66  VKLPLVDMSDVTDSSVVG--RLATTMRRHVPAFRSAVSTLTSPPSVLIADIFATESFAVA 125
           V LP  D+S + D S     +L   MR  +P  RS +  +   P+ LI D+F  ++  + 
Sbjct: 67  VGLPTPDISGLVDPSAFFGIKLLVMMRETIPTIRSKIEEMQHKPTALIVDLFGLDAIPLG 126

Query: 126 DEFHMAKYVFVASNAWFLALIVYAQVWETQIVGQYVDQKEPLQIPGCEPVRPCDVIDPLL 185
            EF+M  Y+F+ASNA FLA+ ++    +  +  +++ +K+P+ +PGCEPVR  D ++  L
Sbjct: 127 GEFNMLTYIFIASNARFLAVALFFPTLDKDMEEEHIIKKQPMVMPGCEPVRFEDTLETFL 186

Query: 186 DRTHPQYLEMVKVGMGIASSDGVLVNTWDDLQGRTLASFSDQNLLGGIMKPPVYSIGPIV 245
           D     Y E V  G    + DG++VNTWDD++ +TL S  D  LLG I   PVY IGP+ 
Sbjct: 187 DPNSQLYREFVPFGSVFPTCDGIIVNTWDDMEPKTLKSLQDPKLLGRIAGVPVYPIGPLS 246

Query: 246 RQSGSSE----LFNWLSKQPRESVIYVSFGSGGTLSFKQMIEVAHGLEMSQARFVWVVRA 305
           R    S+    + +WL+KQP ESV+Y+SFGSGG+LS KQ+ E+A GLEMSQ RFVWVVR 
Sbjct: 247 RPVDPSKTNHPVLDWLNKQPDESVLYISFGSGGSLSAKQLTELAWGLEMSQQRFVWVVRP 306

Query: 306 PKVRSD-GAYFTTGDGSEEQSEAKFLPEGFLERTSEVGFVVPMWADQTAVLGSPAVGGFF 365
           P   S   AY +   G        +LPEGF+ RT E GF+V  WA Q  +L   AVGGF 
Sbjct: 307 PVDGSACSAYLSANSGKIRDGTPDYLPEGFVSRTHERGFMVSSWAPQAEILAHQAVGGFL 366

Query: 366 THGGWNSALEGITNGVPMVVWPLYAEQRLNATFLAEEVGVAVRPKELPTKAVIGREEIAA 425
           TH GWNS LE +  GVPM+ WPL+AEQ +NAT L EE+GVAVR K+LP++ VI R EI A
Sbjct: 367 THCGWNSILESVVGGVPMIAWPLFAEQMMNATLLNEELGVAVRSKKLPSEGVITRAEIEA 426

Query: 426 MVRKIMAEEDEEGKGIRAKAKEL-QRSAEKASAEGGSSYDNFARV 462
           +VRKIM E  EEG  +R K K+L + +AE  S +GG ++++ +R+
Sbjct: 427 LVRKIMVE--EEGAEMRKKIKKLKETAAESLSCDGGVAHESLSRI 469

BLAST of Clc10G12200 vs. TAIR 10
Match: AT5G66690.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 371.3 bits (952), Expect = 1.1e-102
Identity = 206/470 (43.83%), Postives = 297/470 (63.19%), Query Frame = 0

Query: 6   HVALLSSPGMGHLFPSLELATRLSTRHHLTVTVFLAPSHSSSAENKVIATAEAVGLFTVV 65
           H A+ SSPGMGH+ P +EL  RLS  +   VTVF+  + ++SA++K +    + G+  +V
Sbjct: 7   HAAMFSSPGMGHVIPVIELGKRLSANNGFHVTVFVLETDAASAQSKFL---NSTGV-DIV 66

Query: 66  KLPLVDMSDVT--DSSVVGRLATTMRRHVPAFRSAVSTLTSPPSVLIADIFATESFAVAD 125
           KLP  D+  +   D  VV ++   MR  VPA RS ++ +   P+ LI D+F T++  +A 
Sbjct: 67  KLPSPDIYGLVDPDDHVVTKIGVIMRAAVPALRSKIAAMHQKPTALIVDLFGTDALCLAK 126

Query: 126 EFHMAKYVFVASNAWFLALIVYAQVWETQIVGQYVDQKEPLQIPGCEPVRPCDVIDPLLD 185
           EF+M  YVF+ +NA FL + +Y    +  I  ++  Q+ PL IPGCEPVR  D +D  L 
Sbjct: 127 EFNMLSYVFIPTNARFLGVSIYYPNLDKDIKEEHTVQRNPLAIPGCEPVRFEDTLDAYLV 186

Query: 186 RTHPQYLEMVKVGMGIASSDGVLVNTWDDLQGRTLASFSDQNLLGGIMKPPVYSIGPIVR 245
              P Y + V+ G+    +DG+LVNTW++++ ++L S  +  LLG + + PVY IGP+ R
Sbjct: 187 PDEPVYRDFVRHGLAYPKADGILVNTWEEMEPKSLKSLLNPKLLGRVARVPVYPIGPLCR 246

Query: 246 QSGSSE----LFNWLSKQPRESVIYVSFGSGGTLSFKQMIEVAHGLEMSQARFVWVVRAP 305
              SSE    + +WL++QP ESV+Y+SFGSGG LS KQ+ E+A GLE SQ RFVWVVR P
Sbjct: 247 PIQSSETDHPVLDWLNEQPNESVLYISFGSGGCLSAKQLTELAWGLEQSQQRFVWVVRPP 306

Query: 306 KVRSDGA----YFTTGDGSEEQSEAKFLPEGFLERTSEVGFVVPMWADQTAVLGSPAVGG 365
               DG+    Y +   G  E +  ++LPEGF+ RTS+ GFVVP WA Q  +L   AVGG
Sbjct: 307 ---VDGSCCSEYVSANGGGTEDNTPEYLPEGFVSRTSDRGFVVPSWAPQAEILSHRAVGG 366

Query: 366 FFTHGGWNSALEGITNGVPMVVWPLYAEQRLNATFLAEEVGVAVRPKELPTKAVIGREEI 425
           F TH GW+S LE +  GVPM+ WPL+AEQ +NA  L++E+G+AVR  +   K  I R +I
Sbjct: 367 FLTHCGWSSTLESVVGGVPMIAWPLFAEQNMNAALLSDELGIAVRLDD--PKEDISRWKI 426

Query: 426 AAMVRKIMAEEDEEGKGIRAKAKELQRSAEKASA--EGGSSYDNFARVVK 464
            A+VRK+M E  +EG+ +R K K+L+ SAE + +   GG ++++  RV K
Sbjct: 427 EALVRKVMTE--KEGEAMRRKVKKLRDSAEMSLSIDGGGLAHESLCRVTK 465

BLAST of Clc10G12200 vs. TAIR 10
Match: AT5G26310.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 370.9 bits (951), Expect = 1.5e-102
Identity = 200/467 (42.83%), Postives = 300/467 (64.24%), Query Frame = 0

Query: 6   HVALLSSPGMGHLFPSLELATRLSTRHHLTVTVFLAPSHSSSAENKVIATAEAVGLFTVV 65
           H A+ SSPGMGH+ P +ELA RLS  H   VTVF+  + ++S ++K++    + G+  +V
Sbjct: 7   HAAMFSSPGMGHVLPVIELAKRLSANHGFHVTVFVLETDAASVQSKLL---NSTGV-DIV 66

Query: 66  KLPLVDMSDVTD--SSVVGRLATTMRRHVPAFRSAVSTLTSPPSVLIADIFATESFAVAD 125
            LP  D+S + D  + VV ++   MR  VP  RS +  +   P+ LI D+F T++  +A 
Sbjct: 67  NLPSPDISGLVDPNAHVVTKIGVIMREAVPTLRSKIVAMHQNPTALIIDLFGTDALCLAA 126

Query: 126 EFHMAKYVFVASNAWFLALIVYAQVWETQIVGQYVDQKEPLQIPGCEPVRPCDVIDPLLD 185
           E +M  YVF+ASNA +L + +Y    +  I  ++  Q++PL IPGCEPVR  D++D  L 
Sbjct: 127 ELNMLTYVFIASNARYLGVSIYYPTLDEVIKEEHTVQRKPLTIPGCEPVRFEDIMDAYLV 186

Query: 186 RTHPQYLEMVKVGMGIASSDGVLVNTWDDLQGRTLASFSDQNLLGGIMKPPVYSIGPIVR 245
              P Y ++V+  +    +DG+LVNTW++++ ++L S  D  LLG + + PVY +GP+ R
Sbjct: 187 PDEPVYHDLVRHCLAYPKADGILVNTWEEMEPKSLKSLQDPKLLGRVARVPVYPVGPLCR 246

Query: 246 QSGSS----ELFNWLSKQPRESVIYVSFGSGGTLSFKQMIEVAHGLEMSQARFVWVVRAP 305
              SS     +F+WL+KQP ESV+Y+SFGSGG+L+ +Q+ E+A GLE SQ RF+WVVR P
Sbjct: 247 PIQSSTTDHPVFDWLNKQPNESVLYISFGSGGSLTAQQLTELAWGLEESQQRFIWVVRPP 306

Query: 306 KVRSD-GAYFTTGDGSEEQSEAKFLPEGFLERTSEVGFVVPMWADQTAVLGSPAVGGFFT 365
              S    YF+   G  + +  ++LPEGF+ RT + GF++P WA Q  +L   AVGGF T
Sbjct: 307 VDGSSCSDYFSAKGGVTKDNTPEYLPEGFVTRTCDRGFMIPSWAPQAEILAHQAVGGFLT 366

Query: 366 HGGWNSALEGITNGVPMVVWPLYAEQRLNATFLAEEVGVAVRPKELPTKAVIGREEIAAM 425
           H GW+S LE +  GVPM+ WPL+AEQ +NA  L++E+G++VR  +   K  I R +I AM
Sbjct: 367 HCGWSSTLESVLCGVPMIAWPLFAEQNMNAALLSDELGISVRVDD--PKEAISRSKIEAM 426

Query: 426 VRKIMAEEDEEGKGIRAKAKELQRSAEKASA--EGGSSYDNFARVVK 464
           VRK+MAE  +EG+ +R K K+L+ +AE + +   GGS++++  RV K
Sbjct: 427 VRKVMAE--DEGEEMRRKVKKLRDTAEMSLSIHGGGSAHESLCRVTK 465

BLAST of Clc10G12200 vs. TAIR 10
Match: AT2G18560.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 356.7 bits (914), Expect = 2.9e-98
Identity = 181/383 (47.26%), Postives = 254/383 (66.32%), Query Frame = 0

Query: 88  MRRHVPAFRSAVSTLTSPPSVLIADIFATESFAVADEFHMAKYVFVASNAWFLALIVYAQ 147
           MR      R AV ++   P+V+I D F T   ++ D    +KYV++ S+AWFLALIVY  
Sbjct: 1   MREMKSTVRDAVKSMKQKPTVMIVDFFGTALLSITDVGVTSKYVYIPSHAWFLALIVYLP 60

Query: 148 VWETQIVGQYVDQKEPLQIPGCEPVRPCDVIDPLLDRTHPQYLEMVKVGMGIASSDGVLV 207
           V +  + G+YVD KEP++IPGC+PV P +++D +LDR+  QY + V++G+ I  SDGVLV
Sbjct: 61  VLDKVMEGEYVDIKEPMKIPGCKPVGPKELLDTMLDRSDQQYRDCVQIGLEIPMSDGVLV 120

Query: 208 NTWDDLQGRTLASFSDQNLLGGIMKPPVYSIGPIVRQS----GSSELFNWLSKQPRESVI 267
           NTW +LQG+TLA+  +   L  ++K PVY IGPIVR +      +  F WL KQ   SV+
Sbjct: 121 NTWGELQGKTLAALREDIDLNRVIKVPVYPIGPIVRTNVLIEKPNSTFEWLDKQEERSVV 180

Query: 268 YVSFGSGGTLSFKQMIEVAHGLEMSQARFVWVVRAPKVRSDGAYFTTGDGSEEQSEAKFL 327
           YV  GSGGTLSF+Q +E+A GLE+S   F+WV+R P        +      ++   +  L
Sbjct: 181 YVCLGSGGTLSFEQTMELAWGLELSCQSFLWVLRKP------PSYLGASSKDDDQVSDGL 240

Query: 328 PEGFLERTSEVGFVVPMWADQTAVLGSPAVGGFFTHGGWNSALEGITNGVPMVVWPLYAE 387
           PEGFL+RT  VG VV  WA Q  +L   ++GGF +H GW+S LE +T GVP++ WPLYAE
Sbjct: 241 PEGFLDRTRGVGLVVTQWAPQVEILSHRSIGGFLSHCGWSSVLESLTKGVPIIAWPLYAE 300

Query: 388 QRLNATFLAEEVGVAVRPKELPTKAVIGREEIAAMVRKIMAEEDEEGKGIRAKAKELQRS 447
           Q +NAT L EE+G+A+R  ELP+K VI REE+A++V+KI+AEED+EG+ I+ KA+E++ S
Sbjct: 301 QWMNATLLTEEIGMAIRTSELPSKKVISREEVASLVKKIVAEEDKEGRKIKTKAEEVRVS 360

Query: 448 AEKASAEGGSSYDNFARVVKLFG 467
           +E+A   GGSS+ +     K  G
Sbjct: 361 SERAWTHGGSSHSSLFEWAKRCG 377

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038880693.11.4e-23590.45anthocyanidin 3-O-glucosyltransferase 5-like [Benincasa hispida][more]
TYK16721.12.7e-22385.99anthocyanidin 3-O-glucosyltransferase 5 [Cucumis melo var. makuwa][more]
XP_008453746.11.1e-22185.71PREDICTED: anthocyanidin 3-O-glucosyltransferase 5, partial [Cucumis melo][more]
XP_022929544.14.8e-22085.04anthocyanidin 3-O-glucosyltransferase 5-like [Cucurbita moschata][more]
KAG7015349.18.1e-22085.04Anthocyanidin 3-O-glucosyltransferase 5 [Cucurbita argyrosperma subsp. argyrospe... [more]
Match NameE-valueIdentityDescription
Q402875.8e-12851.19Anthocyanidin 3-O-glucosyltransferase 5 OS=Manihot esculenta OX=3983 GN=GT5 PE=2... [more]
Q9ZU727.9e-11748.39UDP-glycosyltransferase 72D1 OS=Arabidopsis thaliana OX=3702 GN=UGT72D1 PE=2 SV=... [more]
Q94A845.3e-10544.52UDP-glycosyltransferase 72E1 OS=Arabidopsis thaliana OX=3702 GN=UGT72E1 PE=1 SV=... [more]
Q9LVR11.6e-10143.83UDP-glycosyltransferase 72E2 OS=Arabidopsis thaliana OX=3702 GN=UGT72E2 PE=1 SV=... [more]
O814982.1e-10142.83UDP-glycosyltransferase 72E3 OS=Arabidopsis thaliana OX=3702 GN=UGT72E3 PE=1 SV=... [more]
Match NameE-valueIdentityDescription
A0A5D3CXP21.3e-22385.99Glycosyltransferase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G0... [more]
A0A1S3BX035.5e-22285.71Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103494390 PE=3 SV=1[more]
A0A6J1EP222.3e-22085.04Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111436081 PE=3 SV=1[more]
A0A6J1J7263.7e-21883.83Glycosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111483150 PE=3 SV=1[more]
A0A6J1FTD71.1e-21781.43Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111447126 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT2G18570.15.6e-11848.39UDP-Glycosyltransferase superfamily protein [more]
AT3G50740.13.8e-10644.52UDP-glucosyl transferase 72E1 [more]
AT5G66690.11.1e-10243.83UDP-Glycosyltransferase superfamily protein [more]
AT5G26310.11.5e-10242.83UDP-Glycosyltransferase superfamily protein [more]
AT2G18560.12.9e-9847.26UDP-Glycosyltransferase superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePFAMPF00201UDPGTcoord: 256..395
e-value: 2.0E-18
score: 66.5
IPR002213UDP-glucuronosyl/UDP-glucosyltransferaseCDDcd03784GT1_Gtf-likecoord: 6..449
e-value: 2.07379E-63
score: 208.945
NoneNo IPR availableGENE3D3.40.50.2000Glycogen Phosphorylase B;coord: 6..454
e-value: 8.8E-133
score: 445.5
NoneNo IPR availableGENE3D3.40.50.2000Glycogen Phosphorylase B;coord: 248..449
e-value: 8.8E-133
score: 445.5
NoneNo IPR availablePANTHERPTHR48049:SF63UDP-GLYCOSYLTRANSFERASE 72C1coord: 2..465
NoneNo IPR availablePANTHERPTHR48049GLYCOSYLTRANSFERASEcoord: 2..465
NoneNo IPR availableSUPERFAMILY53756UDP-Glycosyltransferase/glycogen phosphorylasecoord: 5..465
IPR035595UDP-glycosyltransferase family, conserved sitePROSITEPS00375UDPGTcoord: 341..384

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Clc10G12200.1Clc10G12200.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0008194 UDP-glycosyltransferase activity