Bhi04G001143 (gene) Wax gourd

NameBhi04G001143
Typegene
OrganismBenincasa hispida (Wax gourd)
DescriptionGlycosyltransferase
Locationchr4 : 36301968 .. 36311462 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTGAACTACGTTATTATCCAGTATAAAGTTTAGCTTAGGGATTGAAAGTGTTTTTGAAAAATTTAAAAAATAATAATAATTAGATCTTGACGTTTATACATTCGCCGAATGCGCGCGTTTGAGCAAATCTTGGTGTACCATAACCCACTCTACTGCTGCAGCAGCACCAACACAGCCACATTCTCATCTCCGACAAAGAGCCGTCGGAAATGGAGGAGACGACGGGGAACGGAGTAGGAAGAAGAAGAATCGTGAAACAGAATCATGTAATCGTATTCCCTTTCCCAAGGCACGGCCACATCAGTCCAATGCTCCAATTCTCGAAGCGATTAATTTCCAAAGGCCTTCTCCTCACATTCCTCACCACTTCCTCTGCATCTCAATCCCTAATTCTCAATCTCCCTCCCTCTCCCTCTTTCCACCTCAAAATCATCTCCGATGTCTCTGAATCCAACGTACTCGCCTCTCTCGCCGCATATCTTCAGAGCTTCCGCGCCGCCGTCACCAAATCCTTGGCCAATTTCATCGATCAAGCCCTAATTTCAAGTTCCGATGAAGAAATTCCTCCCACTCTCATCGTTTACGATTCTGTTATGCCCTGGGTGCAGACTGTTGCTGCAGAGCGAGGTCTCGATACGGCTCCGTTTTTCACTCAATCTGCTGCCGTTAATCACGTTCTCCTGCTCGTCTATGGAGGATCTCTAAGTATTCCGCCACCGGAGAATGTGGCGGTTTCGCTTCCGGCAGAGATCGCTCTTCAGCCGGGAGATCTGCCGGCGTTTCCTGATGATTCTGAAGTGGTTTTGAAGTTCATGACCAGTCAGTTCTACAATCTGGAGAATGTGAAGTGGATTTTCATCAACACGTTTGATCGCCTCGAGTCCAAGGTAATTCATGCTTCTTCTCTTTGAATTGATTTTTTCTCTCTCGAAGATCAAAAGATTTTTAGATTAATGGGCGGATTTGAATCCCTAGCTTTCCACCCATACAATTTTTAAACGTTGATTGAAGTGAGCCATGGAAGGGACTAATATCAACTTAATAATGATACAGGTGTATTTGGAGATACACTTATTATTCTATTCTAATCACACACAAAAATACTTTTAAAAAGGATAATAAGAGTTTACTACAATGAATATGATCCAACTTGAATGAACTTATACTATCAACTTTGTTTGAGTACAAAGATGGATACATGTTACTTGTGATGTAAATATTTTTTTCTTAAGTAAATGATATTTATTAAATTTAAACTTTAAATTTAAAAATGGATTATAATATTTAATAAATAAATTGAGTACGGATTAAATTCGACATAATATTATATATATATATATATATATTTTTTTATATATTAATATGTTCTTTTATTTTCATGCATCTCTTTTATATTAAAAAAAATATGTATCTTTTTAAGAAAAGTTTATCATATATCTATAAAGTATGATTTAGACATTTAAATAAGATGAACAACATTTTCAAAATTTTGGTTTTGGAAAATGGAAAAATGTTTTAGGTTATGAAAATAGCAAAACAATTTTCTATTTAATTAATTAGTAAATTACAAAAATACCCCCAAAAAGCGCCTACGTTGAATTTGTTCCAAAAATACATTTTTTTTAAAAAAACATATCAACTCAGAATTTTATAGGGATTCGAAATCTAAATTATAATTTTCAAACTTTTATGAATAAAAATCGGATGTTCCAATTGTTGTAAGTGAAATTTAGATCATTTATTAAAATTAATTTTGAAAATTATTCGTGCTGTAATTCTTATAAGTATGTCGACCTTTTCAAATGGCGGATTAATAAAATTGATTAAAATTGATGCATAAACAATTTAAAAAAAATATTAATTTAGAGTCTAAATTGATTTGCTTATAAAATCTTATAGGTTTAATTAATTTAACTATAAGTCTAGAGGGTTTGAATTGATACATTTGAAATTTTTTATAGTTTAAATCAATACCATTGTTAATTTATAGTTTAAATTGATATATTCCTAGATTTTAGAGGTATAAACTGATATTTTCTTAGATATTTCTAATACAAGTTCAATGAATTCTAAAATATATATGAAATAAAACTTATCGAATTGTGTATTAGTAGGATAATAACACAACATAGCACGACTTTCTTTAGAAAGAGAATGTGAGATGTGGTAGGGTTATTTAGAGTCGGTTTGGTAGGCCATTTAAAAACAAATTATTTAATGAATGTAAAGTAGTATTTTCATATTTTTTCATGATAACTTTAGTAGCAAATTCAGAAATTAGATTTCAGCTAAAATAATTGTAAAAAATGCGTTTGATAATATATTCATAAAGAATAATACTAGTTGTAGTTCCTCACTGAATATTAAATTGAATATGAACTATTAATATTCATTTATGAGCATGTTTTATATTTTATATGTTAAAATTTTTGTATAATTATTATTTTGTAATATTATATAATTTTATGATACATTATATAATACATATTCTATTTTTTTTAAGAATTGATTTAATAATATGAAATATTACATGCACGGACATTTTTATCTTTTTTATTAAAATTAAGGCCTTTTAAAATTTCAAGTTCAAAATTTGAATATACTAAATACATAAAAATATTTTTAGATATTGAACTAATTAGATCATATAGAATCTAGATTATCTACCTATATATATTCATTTAATAAATCTGAAAATATAAAATGAAATCTAAATGACATACCAAACACACCATAATGGTTGATTTTGTACATGGTCATTATTTTCTTTTGTACAAAATTTTGATACTTACTTAAAGTAATAACCTTTTAGAGAATTATTAACTTTTTATTAGCGCATCTTCATGAAAAATATAAACTCTCACTTTAAATTACACACATGTCAATTACCACTAAATATGCTTATTTTTTTTTACAAAGAATTAATAATTTAATTTTGTTAAAGTAAGTGGTTCAATTGCCCAAATTTGAACATGAAAAACTGCAACATTTATCTAAATTTAGGCATCCAATTAGAGCAAAAAAATTACTTTCTTGTTATTATTTCATTTAACCTCTCTATTCCTTTACATATTTTCACTAATATTTCTTCATTTTTCTTTCAATTGTAAGTCACGTCAATATCATTTAAGAATGGTGTGAAATCTAGTAACATTAAAACATTAATTAATAAAGTAAACTTTATAATATGAATAGAAAGAGTATTTACTACGGAAGGCTAGGAATTTTAGAGAAAATTTGTACTACTATGAAACTATCATACTACCTCACACTAAAAATTTGTCTTCAAAAATATTTTTAATGGTAGTGGACCCCACACTATTCTTAAAAATATGCTGATATGACAAACAATAAATGAAGATGATATGAAAAATTTGAGTATAATAGAATATTATGCTTCCTTTGTTTGTACAAGAAAGTTTCCTTCATAAGATGTACTCGAAAGAAACGTAGTTCAATTGATTAAACTATTATATGTAAGTTCTAAGGTTTGAATCTTCACCCACATGTTTGATAAAAGAGAACAACTATGAATATACTTAAACAATGATAACCAAGGATAAATCGACAGAAATCGAGATTCTGTCAATGTGTGATGAAGATTTAAATCTCTTGACTATCATTATGGATCCATGATCTTGCCGATGAAACAGCTTTATATCTAGATTTAAATCGTGTTATATCCACAATTGTTCTTCTAGTTTCGCAATATTTTACGTAAACTATGCTAGAAGAATTTCAGTTATCCCTAATTATTTCAATAATATCAAACAAAATCAGGTTGTTAACTGGATGGCAAAAACATTGCCTATCAAGACAGTGGGACCAACCATTCCATCGGCATATCTAGACGGTCGGTTGGAAGATGACAAAGCCTACGGTTTGAATGTCTCAAAATCCAACGGCGGAAAGAGCCCCATCAAGTGGTTGGACTCAAAAGAAACTGCTTCAGTTGTTTATATTTCATTTGGAAGTTTGGTCATCTTATTAGAAGAACAAGTTAAAGAACTGACCAATTTACTTAGAGACACTGATTTTTCCTTCTTATGGGTCCTAAGAGAATCAGAATTGGAAAAGCTTCCTAACAACTTTCTACAAGACACATCAGAACGTGGCCTAATTGTGAACTGGTGCTGTCAACCACAAGTTCTATCTCATAAGGCTGTAAGTTGTTTTGTGACTCACTGTGGTTGGAACTCGACGCTCGAAGCATTGAGCTTGGGGGTGCCGATGGTTGCAATCCCACAGTGGGTTGATCAGACGACGAATGCTAAGTTCATCGCAGATGTTTGGGGAGTCGGAATTCGAGTGAAGAAGAATGAGAAAGGTATTGCTACAAAGGAAGAACTAGAAGCCTCCATCAGAAAAATTGTTCAAGGAGAAAGGGCAAATGAGTTTAAACAGAATTCAATCAAGTGGAAGAATTTGGCTAAAGAAGCTGTGGATGAAGGAGGCACTTCTGATAAACACATTGAAGAATTTGTCCAAGCAATTGTTGCATCAAACTAGGTATAACATTATAATTATATTTGTTTTTACACTCTTTTGTCTTCTAAATGACATAAACGGATGTGTAGAGCTTGTTTAGACTGTCTTTCCAAATGCTTAAAAGCATTTAAGCCGAGTCATTCTAAACAGGCTTAGTCATCCTTTGCAACAACAACAACTTGTTTACCTACATTGCGACCTGACAAGATGCCAATGAGAGTGGATGGACCATTCTCAAGGCCAAAAGCTAAATCTTCCAAATATACAATCTTTCCTTCCCTTATGGGAGGCAAAACAAAATCCATATACTTAGGATAGAGATGCAAGTAATCGTTCATAATAAACCCTTCCAAACGAACACGTTGTCCTATAGCGCAAATCAAATTATAAACACCTTCAGGATTGTCTTTTTGGAACTCAGATACCATCCCACACAAAGCAATACGGCCATTCCTCCTCATATTAACAAGAACAGCATCCAACATCTTCCCTCCAACATTGTCAAAGTAAATGTCTATTCCTTCAGGAAAGCATCTAAACAAACAAATATACAGAAGCCTAACATGATGATATCTTTCACAATCTTTATTAAAAGTACTGAAATATGAAAATGGATTATTGATTGCTAACCTTGTTAGAGTAGCTTCCAAGTTTGGCTCTTCTTTGTAGTTAAAAACGTCATCGAACCCGAGTTTGTTCTTCAGTAAATCAACCTGATTATGAACTGAGAGATGAGAATTCTTATACCAAAACAGGAAAAGATTAAAGGTTGTTCTAAGCTTTGTTTGTTTAATCAATCTTTTGTTTGCTACCAGCACAGCCAACAACATAACATCCCATTAACTTGGCGAATTGGCCAACAAGTTGACCAACTGCACCAGATGCAGCTGAGACGAACACATATTCTCCTTTTTTAGGACAACAAATCTCGAAGAAACCAGCATAAGCAGTCATCCCGCTTACACCTACCATTAACTTAGTCAAATTATATAGATATAATAATGTTACACCCCAATAAAAGGAAAGGAATTATGGTAACTAAAGGGTTAATAGAGCTTAAACTTCCATTAGGATTTTCTAGGAATCTCACCTGAGAAAGAATTCAACATATACCATCTTTATATTATCTCCCCTTTCCAAAATTCAAAGAAACCAACCTACCACTGAAAGTTGGTGGTTTTTTTTTTTTTTGAATGGATGACCTTAATTTTGCAGCATGATGTCATTTGCATCAAGTTTTGAAAATAATGAAGTATGACCCTCTACCATTGTCAAAACGTACTAAGAATAGGTTAATTATCCATACTGCCCCTCATGGTTTAAGTGAGTGTGGAAGTAAGATCTTTTGAACAATAGTCATTCGGACAATAGTGCTCTTACTCACTCTCTTACACTGCACTCCTCATCCTTCCTTGTGCTTGAGCTTGCATTCAATTCATCTAATACAATAGTTTTTCCACACAATAGGATTTTTCCTCCCTCACGCTCCTTCTCGCTCTGACTTACTCGTCTTCACACTCCTCACTTTCTCACCCTACTCCACACTCTCTCACTCGCGCTCACACTCATTGCTCCAAACACTCATTAATAGTATGCGAGAAGCAATATGGATAGTTAATATGCTCTTAATACACTCTGTCAATGACAAAAAATCATAATTCATCATTTTCAAAGCTTGGATCAAACGCCATTAAAAATCCACAAAATGGGTCACCCCTCACCCGTACAAAACTATTAAAAAAAGAAAAAGTATTCACCTCACCATTCAATTAACCATCTACTAAATTACAACTAAATATCTTCATGATTGTCAAGTCAGCTCATGGGCATATCCACTATCATAAAGACAACAAACACTCTGCACATGTATACTATGATTCTCCTCTAATGTTGTTCTCATTTATATTTACTAGCAAGTAAATAAAGGAGAGGAAATTTGAGAAAAAGAGATACCAAGAATTCCTGTGTAATAAGAGAGGGGAACATCAGTGTGATCAATTTTAAAGAGAGCTTCTACTTCTGAGGATGAAAGTAGAGAATATTCTTCCCAACCTGTCTGCCCCCACACAAAATCTCCTTTCTTAAACTTGGAATGATCAGAATCCACAACTTTCCCAACTCCATATCCCCTAATAGGCTGCAACAAAAGAAAAACAGAGCTCAGAATAAGCCAAAGTGATGAAAAAGACATAAAAAACGACGTCGTTTCGTGTGCCTACCAAACCAGGAGTGAAGGAAACAATATCAATGGAGGGATTTTCGAATTTGTTCATACAAGCATGCATATAAGGATCACAGGAAAGGTAGAGATTCTTTAGAAGAACTCCATTGAAGCCATGGGGAACCTTCAATTTAATGGTTCTATTGGAGATCACTTCGAGATCCGACTCTTTTGCAGTGGCACGGCCGGTGACATAGTCTTTTAAAATCACCTGTTTGTTGATTACTACTTCGCCATCACCGCCGTGGCTATGCATCATCATGAAAAGAGGAAGAGAAGGAATAATGATGGAAATGGAAAATGGGGTTTCTTTCTTTTTTTTTTTTTTTTTTCCCCTTCTTTTTCTTCATTGTTGGACTGGGATATATAGAAAAGCATAAAGGGTTTTGGGGAAGAAAAGTCTTGAATTTGAAAGAAACCACTGAAATTAAAATTTGAAAAAGGCAATCCGAAATCACTTTTGCCAAGCTAATCCCATTGGCTGGTTCAATTAATTGATACAAAAATGAAACTGTTGTATTAGAAAGACGTATAAACAAAGGTTAGTGGTGGGGAAAATTCTGGAATCTCAAACGAGATGAAATATTTAACTCTCGTTTATAAATTCTGGAAGGCATGTTTATAAGGTGTTGTTTAACTCTCGTTTTAAGGTCTACTATTAAATCTGTAATTGATAAATATAATCCGATTGCAGAATTAGATTACGTTTGAAAATTATATAGGACCTATAAAATTAGGTAGATGTAAGACCTATAAAATTAGGTAGATGTATTGATTACATAGGACCTATGAGATTAGGTAGATGTATTGATTGACCCTAAATATAGTAACTACAATTGTAAGATTATGATAGAACTCTGATTTATATATTTACTATTTAGGCAAAATTGCTAAGGGAGCCGAGACCCTCTTCCCTTACAAGCTCTATGAGCTCTACTCAATACAATGCAAAAAGATGCTCAACCTCTCTTCCTCACTATCTTAAATAACCAATTTGTCTAACTAACCGAATCTTTCCTCTTGGGCCCACACTTACACACCCCATCCTCAGTAACAAACTCAATTCATTCTTTACATTTAATTCACCATCTCTCTTTCTCTTCCTTATATATTGTCTAATGACCGGAGGTCTATTAGATTACGTTATTCATTCAAATACATTGCAAAGTAAACAACCTCAATCATAATCAGATTATGCTTGTAAAATGTCTATGTAAATGTCATCAAAAGCAATTTAATTACGTTTATAATTGAGGCCCAATTGATTCATCAATATAATCCGATTAGGATTGCACCAATTTTAAATTGCAGATTGGTAAACATGGCGTAAGTTCATCTCATGTAATATTTATCTTTGTGCATCCCATCAAATTATCTAATCATAATTATTCATGTAATATATATATATATATATATTTTTGTATGCAGTGAGAATGGAATGGATAACTATGGATGCACGTAAGTATTAGTATCTCAAAGTTATTTCATATCAAGGTGGAAGTTTTAGCTTGGAAAGGTGGTGTTTATATGCAAGAGAAGACCCTAAGATATTTGTCGTTTATATTTTATTTGTTTTATTTTTTGTGTGTTCTAACTTTTTAAGCATGGAATGGAATGATAGTAAGTCATTTATATTAATTCCCTTAATGTTGGCTACATCAAATTCTCTAGAGAGTCAAATATGATATTGTTGCTATTTTTAATTTTTAGATAATTATTATTTAAAAAAATTGTTAGAATTTATGTCCGAAATCTGATGTATCTCGTAGTTTGTAAAGACAAATTACATATCTAATAGATTAGGGATATTTTATTGACATTTAGACTACATTAATTCAATTCAATAAACTAAGATCCAAGGTTATCTAATTAAACTTAAATATGTATGTGGAGACATACATGTAGATCATGTTTAAGTGATAACCTAAATGGTCTGTAGTATATGGATAAGGTTGGGTACCTTATCTTAGTGACGCTACGAATACGGCCCACTTTATAGATGTTACAATTGTTGTAAAATGCTACAAATGATTTGATCCGAATCATTCATGTGAAGACATGCGAGCGAGGTATTCTATACAAAGAGTTTATATAAGATCGGACCATGAAATGAATAATCTCTCTTTATAATACCATTGCTAAAAAAGACTTATATTTCATTAGGATAATCATAAGTGATTTGACCTTAATCCTGAATGAGTTGTGAACTTCTGTTTATGAGGGCGACCCTTTGATTTGTATGGGTGAGAATGACCATATTAGCCGACTCAGCCTACCCTTTTGGGATTTGTCTGAGTAAAGAGTTGGGAACACAGTTACACAAGATGGAATTCACTTCTTCTCGTCTTTAGGAGTGGATAAATTGCTCCCTTAATAGCTGATTTCGGGTCTTGAACATTGAGACCCCAACCTCTCATTGGTCCAAGAGGTGTTAGTTTATAGTTGGACTATAAATTATTTGTTCATTATAGGGATCAGTGGTACTTAAGGGTGTTAGTTTATAGTTGGATTATAAATTGTTTGTTCATTATAGGGATCAGTGGTACTTAAAGAGTTAGATATAACTATAGAGGGAAAACAGTAATTTGATCTAGTTGTAGTTATGAGCGATTTGTGAAGGGTTGACTCACTGTTGATTGGTTATATCCATGGACACAGAAATATTTGGATTAAAGGAGTTTAATCAATTAATCTCATATCACTATAGCTTCTAATCTTAGGTCCATAAGATCCCCTTATTAGCTCACTAAAGGATAGTAATGAGGAATGATTTAAATTATTCAAAT

mRNA sequence

GTGAACTACGTTATTATCCAGTATAAAGTTTAGCTTAGGGATTGAAAGTGTTTTTGAAAAATTTAAAAAATAATAATAATTAGATCTTGACGTTTATACATTCGCCGAATGCGCGCGTTTGAGCAAATCTTGGTGTACCATAACCCACTCTACTGCTGCAGCAGCACCAACACAGCCACATTCTCATCTCCGACAAAGAGCCGTCGGAAATGGAGGAGACGACGGGGAACGGAGTAGGAAGAAGAAGAATCGTGAAACAGAATCATGTAATCGTATTCCCTTTCCCAAGGCACGGCCACATCAGTCCAATGCTCCAATTCTCGAAGCGATTAATTTCCAAAGGCCTTCTCCTCACATTCCTCACCACTTCCTCTGCATCTCAATCCCTAATTCTCAATCTCCCTCCCTCTCCCTCTTTCCACCTCAAAATCATCTCCGATGTCTCTGAATCCAACGTACTCGCCTCTCTCGCCGCATATCTTCAGAGCTTCCGCGCCGCCGTCACCAAATCCTTGGCCAATTTCATCGATCAAGCCCTAATTTCAAGTTCCGATGAAGAAATTCCTCCCACTCTCATCGTTTACGATTCTGTTATGCCCTGGGTGCAGACTGTTGCTGCAGAGCGAGGTCTCGATACGGCTCCGTTTTTCACTCAATCTGCTGCCGTTAATCACGTTCTCCTGCTCGTCTATGGAGGATCTCTAAGTATTCCGCCACCGGAGAATGTGGCGGTTTCGCTTCCGGCAGAGATCGCTCTTCAGCCGGGAGATCTGCCGGCGTTTCCTGATGATTCTGAAGTGGTTTTGAAGTTCATGACCAGTCAGTTCTACAATCTGGAGAATGTGAAGTGGATTTTCATCAACACGTTTGATCGCCTCGAGTCCAAGGTTGTTAACTGGATGGCAAAAACATTGCCTATCAAGACAGTGGGACCAACCATTCCATCGGCATATCTAGACGGTCGGTTGGAAGATGACAAAGCCTACGGTTTGAATGTCTCAAAATCCAACGGCGGAAAGAGCCCCATCAAGTGGTTGGACTCAAAAGAAACTGCTTCAGTTGTTTATATTTCATTTGGAAGTTTGGTCATCTTATTAGAAGAACAAGTTAAAGAACTGACCAATTTACTTAGAGACACTGATTTTTCCTTCTTATGGGTCCTAAGAGAATCAGAATTGGAAAAGCTTCCTAACAACTTTCTACAAGACACATCAGAACGTGGCCTAATTGTGAACTGGTGCTGTCAACCACAAGTTCTATCTCATAAGGCTGTAAGTTGTTTTGTGACTCACTGTGGTTGGAACTCGACGCTCGAAGCATTGAGCTTGGGGGTGCCGATGGTTGCAATCCCACAGTGGGTTGATCAGACGACGAATGCTAAGTTCATCGCAGATGTTTGGGGAGTCGGAATTCGAGTGAAGAAGAATGAGAAAGGTATTGCTACAAAGGAAGAACTAGAAGCCTCCATCAGAAAAATTGTTCAAGGAGAAAGGGCAAATGAGTTTAAACAGAATTCAATCAAGTGGAAGAATTTGGCTAAAGAAGCTGTGGATGAAGGAGGCACTTCTGATAAACACATTGAAGAATTTGTCCAAGCAATTGTTGCATCAAACTAGTGAGAATGGAATGGATAACTATGGATGCACGTAAGTATTAGTATCTCAAAGTTATTTCATATCAAGGTGGAAGTTTTAGCTTGGAAAGGTGGTGTTTATATGCAAGAGAAGACCCTAAGATATTTGTCGTTTATATTTTATTTGTTTTATTTTTTGTGTGTTCTAACTTTTTAAGCATGGAATGGAATGATAGTAAGTCATTTATATTAATTCCCTTAATGTTGGCTACATCAAATTCTCTAGAGAGTCAAATATGATATTGTTGCTATTTTTAATTTTTAGATAATTATTATTTAAAAAAATTGTTAGAATTTATGTCCGAAATCTGATGTATCTCGTAGTTTGTAAAGACAAATTACATATCTAATAGATTAGGGATATTTTATTGACATTTAGACTACATTAATTCAATTCAATAAACTAAGATCCAAGGTTATCTAATTAAACTTAAATATGTATGTGGAGACATACATGTAGATCATGTTTAAGTGATAACCTAAATGGTCTGTAGTATATGGATAAGGTTGGGTACCTTATCTTAGTGACGCTACGAATACGGCCCACTTTATAGATGTTACAATTGTTGTAAAATGCTACAAATGATTTGATCCGAATCATTCATGTGAAGACATGCGAGCGAGGTATTCTATACAAAGAGTTTATATAAGATCGGACCATGAAATGAATAATCTCTCTTTATAATACCATTGCTAAAAAAGACTTATATTTCATTAGGATAATCATAAGTGATTTGACCTTAATCCTGAATGAGTTGTGAACTTCTGTTTATGAGGGCGACCCTTTGATTTGTATGGGTGAGAATGACCATATTAGCCGACTCAGCCTACCCTTTTGGGATTTGTCTGAGTAAAGAGTTGGGAACACAGTTACACAAGATGGAATTCACTTCTTCTCGTCTTTAGGAGTGGATAAATTGCTCCCTTAATAGCTGATTTCGGGTCTTGAACATTGAGACCCCAACCTCTCATTGGTCCAAGAGGTGTTAGTTTATAGTTGGACTATAAATTATTTGTTCATTATAGGGATCAGTGGTACTTAAGGGTGTTAGTTTATAGTTGGATTATAAATTGTTTGTTCATTATAGGGATCAGTGGTACTTAAAGAGTTAGATATAACTATAGAGGGAAAACAGTAATTTGATCTAGTTGTAGTTATGAGCGATTTGTGAAGGGTTGACTCACTGTTGATTGGTTATATCCATGGACACAGAAATATTTGGATTAAAGGAGTTTAATCAATTAATCTCATATCACTATAGCTTCTAATCTTAGGTCCATAAGATCCCCTTATTAGCTCACTAAAGGATAGTAATGAGGAATGATTTAAATTATTCAAAT

Coding sequence (CDS)

ATGGAGGAGACGACGGGGAACGGAGTAGGAAGAAGAAGAATCGTGAAACAGAATCATGTAATCGTATTCCCTTTCCCAAGGCACGGCCACATCAGTCCAATGCTCCAATTCTCGAAGCGATTAATTTCCAAAGGCCTTCTCCTCACATTCCTCACCACTTCCTCTGCATCTCAATCCCTAATTCTCAATCTCCCTCCCTCTCCCTCTTTCCACCTCAAAATCATCTCCGATGTCTCTGAATCCAACGTACTCGCCTCTCTCGCCGCATATCTTCAGAGCTTCCGCGCCGCCGTCACCAAATCCTTGGCCAATTTCATCGATCAAGCCCTAATTTCAAGTTCCGATGAAGAAATTCCTCCCACTCTCATCGTTTACGATTCTGTTATGCCCTGGGTGCAGACTGTTGCTGCAGAGCGAGGTCTCGATACGGCTCCGTTTTTCACTCAATCTGCTGCCGTTAATCACGTTCTCCTGCTCGTCTATGGAGGATCTCTAAGTATTCCGCCACCGGAGAATGTGGCGGTTTCGCTTCCGGCAGAGATCGCTCTTCAGCCGGGAGATCTGCCGGCGTTTCCTGATGATTCTGAAGTGGTTTTGAAGTTCATGACCAGTCAGTTCTACAATCTGGAGAATGTGAAGTGGATTTTCATCAACACGTTTGATCGCCTCGAGTCCAAGGTTGTTAACTGGATGGCAAAAACATTGCCTATCAAGACAGTGGGACCAACCATTCCATCGGCATATCTAGACGGTCGGTTGGAAGATGACAAAGCCTACGGTTTGAATGTCTCAAAATCCAACGGCGGAAAGAGCCCCATCAAGTGGTTGGACTCAAAAGAAACTGCTTCAGTTGTTTATATTTCATTTGGAAGTTTGGTCATCTTATTAGAAGAACAAGTTAAAGAACTGACCAATTTACTTAGAGACACTGATTTTTCCTTCTTATGGGTCCTAAGAGAATCAGAATTGGAAAAGCTTCCTAACAACTTTCTACAAGACACATCAGAACGTGGCCTAATTGTGAACTGGTGCTGTCAACCACAAGTTCTATCTCATAAGGCTGTAAGTTGTTTTGTGACTCACTGTGGTTGGAACTCGACGCTCGAAGCATTGAGCTTGGGGGTGCCGATGGTTGCAATCCCACAGTGGGTTGATCAGACGACGAATGCTAAGTTCATCGCAGATGTTTGGGGAGTCGGAATTCGAGTGAAGAAGAATGAGAAAGGTATTGCTACAAAGGAAGAACTAGAAGCCTCCATCAGAAAAATTGTTCAAGGAGAAAGGGCAAATGAGTTTAAACAGAATTCAATCAAGTGGAAGAATTTGGCTAAAGAAGCTGTGGATGAAGGAGGCACTTCTGATAAACACATTGAAGAATTTGTCCAAGCAATTGTTGCATCAAACTAG

Protein sequence

MEETTGNGVGRRRIVKQNHVIVFPFPRHGHISPMLQFSKRLISKGLLLTFLTTSSASQSLILNLPPSPSFHLKIISDVSESNVLASLAAYLQSFRAAVTKSLANFIDQALISSSDEEIPPTLIVYDSVMPWVQTVAAERGLDTAPFFTQSAAVNHVLLLVYGGSLSIPPPENVAVSLPAEIALQPGDLPAFPDDSEVVLKFMTSQFYNLENVKWIFINTFDRLESKVVNWMAKTLPIKTVGPTIPSAYLDGRLEDDKAYGLNVSKSNGGKSPIKWLDSKETASVVYISFGSLVILLEEQVKELTNLLRDTDFSFLWVLRESELEKLPNNFLQDTSERGLIVNWCCQPQVLSHKAVSCFVTHCGWNSTLEALSLGVPMVAIPQWVDQTTNAKFIADVWGVGIRVKKNEKGIATKEELEASIRKIVQGERANEFKQNSIKWKNLAKEAVDEGGTSDKHIEEFVQAIVASN
BLAST of Bhi04G001143 vs. Swiss-Prot
Match: sp|Q9SYK9|U74E2_ARATH (UDP-glycosyltransferase 74E2 OS=Arabidopsis thaliana OX=3702 GN=UGT74E2 PE=1 SV=1)

HSP 1 Score: 372.5 bits (955), Expect = 7.0e-102
Identity = 199/456 (43.64%), Postives = 279/456 (61.18%), Query Frame = 0

Query: 18  NHVIVFPFPRHGHISPMLQFSKRLISKGLLLTFLTTSSASQSLILNLPPSPSFHLKI-IS 77
           +H+IV PFP  GHI+PM QF KRL SKGL LT +  S          PP  + H  I + 
Sbjct: 5   SHLIVLPFPGQGHITPMSQFCKRLASKGLKLTLVLVSDKPS------PPYKTEHDSITVF 64

Query: 78  DVSE-----SNVLASLAAYLQSFRAAVTKSLANFIDQALISSSDEEIPPTLIVYDSVMPW 137
            +S         L  L  Y++    ++  +L   ++   +S +    PP  IVYDS MPW
Sbjct: 65  PISNGFQEGEEPLQDLDDYMERVETSIKNTLPKLVEDMKLSGN----PPRAIVYDSTMPW 124

Query: 138 VQTVAAERGLDTAPFFTQSAAVNHVLLLVYGGSLSIPPPE---NVAVSLPAEIALQPGDL 197
           +  VA   GL  A FFTQ   V  +   V+ GS S+P  +   +   S P+   L   DL
Sbjct: 125 LLDVAHSYGLSGAVFFTQPWLVTAIYYHVFKGSFSVPSTKYGHSTLASFPSFPMLTANDL 184

Query: 198 PAFPDDSEV---VLKFMTSQFYNLENVKWIFINTFDRLESKVVNWMAKTLPIKTVGPTIP 257
           P+F  +S     +L+ +  Q  N++ V  +  NTFD+LE K++ W+    P+  +GPT+P
Sbjct: 185 PSFLCESSSYPNILRIVVDQLSNIDRVDIVLCNTFDKLEEKLLKWVQSLWPVLNIGPTVP 244

Query: 258 SAYLDGRLEDDKAYGLNVSKSNGGKSPIKWLDSKETASVVYISFGSLVILLEEQVKELTN 317
           S YLD RL +DK YG ++  +   +  ++WL+SKE  SVVY+SFGSLVIL E+Q+ EL  
Sbjct: 245 SMYLDKRLSEDKNYGFSLFNAKVAEC-MEWLNSKEPNSVVYLSFGSLVILKEDQMLELAA 304

Query: 318 LLRDTDFSFLWVLRESELEKLPNNFLQDTSERGLIVNWCCQPQVLSHKAVSCFVTHCGWN 377
            L+ +   FLWV+RE+E  KLP N++++  E+GLIV+W  Q  VL+HK++ CF+THCGWN
Sbjct: 305 GLKQSGRFFLWVVRETETHKLPRNYVEEIGEKGLIVSWSPQLDVLAHKSIGCFLTHCGWN 364

Query: 378 STLEALSLGVPMVAIPQWVDQTTNAKFIADVWGVGIRVKKNEKGIATKEELEASIRKIVQ 437
           STLE LSLGVPM+ +P W DQ TNAKF+ DVW VG+RVK    G   +EE+  S+ ++++
Sbjct: 365 STLEGLSLGVPMIGMPHWTDQPTNAKFMQDVWKVGVRVKAEGDGFVRREEIMRSVEEVME 424

Query: 438 GERANEFKQNSIKWKNLAKEAVDEGGTSDKHIEEFV 462
           GE+  E ++N+ KWK LA+EAV EGG+SDK I EFV
Sbjct: 425 GEKGKEIRKNAEKWKVLAQEAVSEGGSSDKSINEFV 449

BLAST of Bhi04G001143 vs. Swiss-Prot
Match: sp|Q6VAA6|U74G1_STERE (UDP-glycosyltransferase 74G1 OS=Stevia rebaudiana OX=55670 GN=UGT74G1 PE=1 SV=1)

HSP 1 Score: 359.4 bits (921), Expect = 6.1e-98
Identity = 199/462 (43.07%), Postives = 292/462 (63.20%), Query Frame = 0

Query: 11  RRRIVKQNHVIVFPFPRHGHISPMLQFSKRLISKGLLLTFLTT-SSASQSLILNLPPSPS 70
           +++I K  HV++ PFP  GHI+P +QF KRLISKG+  T +TT  + + +L  +   + S
Sbjct: 4   QQKIKKSPHVLLIPFPLQGHINPFIQFGKRLISKGVKTTLVTTIHTLNSTLNHSNTTTTS 63

Query: 71  FHLKIISD-VSESNVLASLAAYLQSFRAAVTKSLANFIDQALISSSDEEIPPTLIVYDSV 130
             ++ ISD   E   +++  +YL++F+   +KSLA+ I +       E      I+YDS+
Sbjct: 64  IEIQAISDGCDEGGFMSAGESYLETFKQVGSKSLADLIKKL----QSEGTTIDAIIYDSM 123

Query: 131 MPWVQTVAAERGLDTAPFFTQSAAVNHVLLLVYGGSLSIPPPENVAVSLPAEIALQPGDL 190
             WV  VA E G+D   FFTQ+  VN +   V+ G +S+P  E   VS+P    LQ  + 
Sbjct: 124 TEWVLDVAIEFGIDGGSFFTQACVVNSLYYHVHKGLISLPLGE--TVSVPGFPVLQRWET 183

Query: 191 PAFPDDSEVV----LKFMTSQFYNLENVKWIFINTFDRLESKVVNWMAKTLPIKTVGPTI 250
           P    + E +     + +  QF N++  +W+F N+F +LE +V+ W  K   +K +GPT+
Sbjct: 184 PLILQNHEQIQSPWSQMLFGQFANIDQARWVFTNSFYKLEEEVIEWTRKIWNLKVIGPTL 243

Query: 251 PSAYLDGRLEDDKAYGLNVSKSNGGKSPIKWLDSKETASVVYISFGSLVILLEEQVKELT 310
           PS YLD RL+DDK  G N+ K+N  +  + WLD K   SVVY++FGSLV    EQV+E+T
Sbjct: 244 PSMYLDKRLDDDKDNGFNLYKANHHEC-MNWLDDKPKESVVYVAFGSLVKHGPEQVEEIT 303

Query: 311 NLLRDTDFSFLWVLRESELEKLPNNFLQ-DTSERGLIVNWCCQPQVLSHKAVSCFVTHCG 370
             L D+D +FLWV++  E  KLP N  +   + +GLIV WC Q  VL+H++V CFVTHCG
Sbjct: 304 RALIDSDVNFLWVIKHKEEGKLPENLSEVIKTGKGLIVAWCKQLDVLAHESVGCFVTHCG 363

Query: 371 WNSTLEALSLGVPMVAIPQWVDQTTNAKFIADVWGVGIRVKKNEKGIATKEELEASIRKI 430
           +NSTLEA+SLGVP+VA+PQ+ DQTTNAK + ++ GVG+RVK +E GI  +  L + I+ I
Sbjct: 364 FNSTLEAISLGVPVVAMPQFSDQTTNAKLLDEILGVGVRVKADENGIVRRGNLASCIKMI 423

Query: 431 VQGERANEFKQNSIKWKNLAKEAVDEGGTSDKHIEEFVQAIV 466
           ++ ER    ++N++KWK+LAK AV EGG+SD  I EFV  ++
Sbjct: 424 MEEERGVIIRKNAVKWKDLAKVAVHEGGSSDNDIVEFVSELI 458

BLAST of Bhi04G001143 vs. Swiss-Prot
Match: sp|Q9SKC5|U74D1_ARATH (UDP-glycosyltransferase 74D1 OS=Arabidopsis thaliana OX=3702 GN=UGT74D1 PE=1 SV=1)

HSP 1 Score: 350.9 bits (899), Expect = 2.2e-95
Identity = 195/459 (42.48%), Postives = 291/459 (63.40%), Query Frame = 0

Query: 19  HVIVFPFPRHGHISPMLQFSKRLISKGLLLTFLTTSSASQSLILNLPPSPSFHLKI---- 78
           +V+VF FP  GHI+P+LQFSKRL+SK + +TFLTTSS   S++       +  L +    
Sbjct: 8   NVLVFSFPIQGHINPLLQFSKRLLSKNVNVTFLTTSSTHNSILRRAITGGATALPLSFVP 67

Query: 79  ISDVSESNVLASLAA--YLQSFRAAVTKSLANFIDQALISSSDEEIPPTLIVYDSVMPWV 138
           I D  E +  ++  +  Y   F+  V++SL+      LISS D +  P  +VYDS +P+V
Sbjct: 68  IDDGFEEDHPSTDTSPDYFAKFQENVSRSLSE-----LISSMDPK--PNAVVYDSCLPYV 127

Query: 139 QTVAAER-GLDTAPFFTQSAAVNHVLLLVYGGSLSIPPPENVAVSLPAEIALQPGDLPAF 198
             V  +  G+  A FFTQS+ VN   +    G       +   V LPA   L+  DLP F
Sbjct: 128 LDVCRKHPGVAAASFFTQSSTVNATYIHFLRGEFKEFQND---VVLPAMPPLKGNDLPVF 187

Query: 199 PDDSEV---VLKFMTSQFYNLENVKWIFINTFDRLESKVVNWMAKTLPIKTVGPTIPSAY 258
             D+ +   + + ++SQF N++++ +  +N+FD LE +V+ WM    P+K +GP IPS Y
Sbjct: 188 LYDNNLCRPLFELISSQFVNVDDIDFFLVNSFDELEVEVLQWMKNQWPVKNIGPMIPSMY 247

Query: 259 LDGRLEDDKAYGLNVSKSNGGKSPIKWLDSKETASVVYISFGSLVILLEEQVKELTNLLR 318
           LD RL  DK YG+N+  +   +  + WLDSK   SV+Y+SFGSL +L ++Q+ E+   L+
Sbjct: 248 LDKRLAGDKDYGINLFNAQVNEC-LDWLDSKPPGSVIYVSFGSLAVLKDDQMIEVAAGLK 307

Query: 319 DTDFSFLWVLRESELEKLPNNFLQDTSERGLIVNWCCQPQVLSHKAVSCFVTHCGWNSTL 378
            T  +FLWV+RE+E +KLP+N+++D  ++GLIVNW  Q QVL+HK++ CF+THCGWNSTL
Sbjct: 308 QTGHNFLWVVRETETKKLPSNYIEDICDKGLIVNWSPQLQVLAHKSIGCFMTHCGWNSTL 367

Query: 379 EALSLGVPMVAIPQWVDQTTNAKFIADVWGVGIRVKKNEKGIATKEELEASIRKIVQ--G 438
           EALSLGV ++ +P + DQ TNAKFI DVW VG+RVK ++ G   KEE+   + ++++   
Sbjct: 368 EALSLGVALIGMPAYSDQPTNAKFIEDVWKVGVRVKADQNGFVPKEEIVRCVGEVMEDMS 427

Query: 439 ERANEFKQNSIKWKNLAKEAVDEGGTSDKHIEEFVQAIV 466
           E+  E ++N+ +    A+EA+ +GG SDK+I+EFV  IV
Sbjct: 428 EKGKEIRKNARRLMEFAREALSDGGNSDKNIDEFVAKIV 455

BLAST of Bhi04G001143 vs. Swiss-Prot
Match: sp|P0C7P7|U74E1_ARATH (UDP-glycosyltransferase 74E1 OS=Arabidopsis thaliana OX=3702 GN=UGT74E1 PE=3 SV=1)

HSP 1 Score: 350.1 bits (897), Expect = 3.7e-95
Identity = 192/457 (42.01%), Postives = 271/457 (59.30%), Query Frame = 0

Query: 18  NHVIVFPFPRHGHISPMLQFSKRLISKGLLLTFLTTSSASQSLILNLPPSPSFHLKIISD 77
           +HVIV PFP  GHI+PM QF KRL SK L +T +  S          PP  + H   I+ 
Sbjct: 5   SHVIVLPFPAQGHITPMSQFCKRLASKSLKITLVLVSDKPS------PPYKTEH-DTITV 64

Query: 78  VSESNVL-------ASLAAYLQSFRAAVTKSLANFIDQALISSSDEEIPPTLIVYDSVMP 137
           V  SN           L  Y++   +++   L   I+   +S +    PP  +VYDS MP
Sbjct: 65  VPISNGFQEGQERSEDLDEYMERVESSIKNRLPKLIEDMKLSGN----PPRALVYDSTMP 124

Query: 138 WVQTVAAERGLDTAPFFTQSAAVNHVLLLVYGGSLSIPPPE---NVAVSLPAEIALQPGD 197
           W+  VA   GL  A FFTQ   V+ +   V+ GS S+P  +   +   S P+   L   D
Sbjct: 125 WLLDVAHSYGLSGAVFFTQPWLVSAIYYHVFKGSFSVPSTKYGHSTLASFPSLPILNAND 184

Query: 198 LPAFPDDSE---VVLKFMTSQFYNLENVKWIFINTFDRLESKVVNWMAKTLPIKTVGPTI 257
           LP+F  +S     +L+ +  Q  N++ V  +  NTFD+LE K++ W+    P+  +GPT+
Sbjct: 185 LPSFLCESSSYPYILRTVIDQLSNIDRVDIVLCNTFDKLEEKLLKWIKSVWPVLNIGPTV 244

Query: 258 PSAYLDGRLEDDKAYGLNVSKSNGGKSPIKWLDSKETASVVYISFGSLVILLEEQVKELT 317
           PS YLD RL +DK YG ++  +   +  ++WL+SK+ +SVVY+SFGSLV+L ++Q+ EL 
Sbjct: 245 PSMYLDKRLAEDKNYGFSLFGAKIAEC-MEWLNSKQPSSVVYVSFGSLVVLKKDQLIELA 304

Query: 318 NLLRDTDFSFLWVLRESELEKLPNNFLQDTSERGLIVNWCCQPQVLSHKAVSCFVTHCGW 377
             L+ +   FLWV+RE+E  KLP N++++  E+GL V+W  Q +VL+HK++ CFVTHCGW
Sbjct: 305 AGLKQSGHFFLWVVRETERRKLPENYIEEIGEKGLTVSWSPQLEVLTHKSIGCFVTHCGW 364

Query: 378 NSTLEALSLGVPMVAIPQWVDQTTNAKFIADVWGVGIRVKKNEKGIATKEELEASIRKIV 437
           NSTLE LSLGVPM+ +P W DQ TNAKF+ DVW VG+RVK +  G   +EE         
Sbjct: 365 NSTLEGLSLGVPMIGMPHWADQPTNAKFMEDVWKVGVRVKADSDGFVRREEXXXXXXXXX 424

Query: 438 QGERANEFKQNSIKWKNLAKEAVDEGGTSDKHIEEFV 462
                   ++N+ KWK LA+EAV EGG+SDK+I EFV
Sbjct: 425 XXXXXXXXRKNAEKWKVLAQEAVSEGGSSDKNINEFV 449

BLAST of Bhi04G001143 vs. Swiss-Prot
Match: sp|O22822|U74F2_ARATH (UDP-glycosyltransferase 74F2 OS=Arabidopsis thaliana OX=3702 GN=UGT74F2 PE=1 SV=1)

HSP 1 Score: 346.3 bits (887), Expect = 5.4e-94
Identity = 197/459 (42.92%), Postives = 281/459 (61.22%), Query Frame = 0

Query: 16  KQNHVIVFPFPRHGHISPMLQFSKRLISKGLLLTFLTTSSASQSLILNLPPSPSFHLKII 75
           K+ HV+  P+P  GHI+P  QF KRL  KGL  T   T+    S  +N   S    +  I
Sbjct: 4   KRGHVLAVPYPTQGHITPFRQFCKRLHFKGLKTTLALTTFVFNS--INPDLSGPISIATI 63

Query: 76  SDVSES---NVLASLAAYLQSFRAAVTKSLANFIDQALISSSDEEIPPTLIVYDSVMPWV 135
           SD  +        S+  YL+ F+ + +K++A+ I +   S +    P T IVYD+ +PW 
Sbjct: 64  SDGYDHGGFETADSIDDYLKDFKTSGSKTIADIIQKHQTSDN----PITCIVYDAFLPWA 123

Query: 136 QTVAAERGLDTAPFFTQSAAVNHVLLLVY--GGSLSIPPPENVAVSLPAEIALQPGDLPA 195
             VA E GL   PFFTQ  AVN+V  L Y   GSL +P  E     LP    L+  DLP+
Sbjct: 124 LDVAREFGLVATPFFTQPCAVNYVYYLSYINNGSLQLPIEE-----LP---FLELQDLPS 183

Query: 196 FPDDS---EVVLKFMTSQFYNLENVKWIFINTFDRLESKVVNWMAKTLPIKTVGPTIPSA 255
           F   S       + +  QF N E   ++ +N+F  LE       +K  P+ T+GPTIPS 
Sbjct: 184 FFSVSGSYPAYFEMVLQQFINFEKADFVLVNSFQELELHENELWSKACPVLTIGPTIPSI 243

Query: 256 YLDGRLEDDKAYGLNVSKSNGGKSPIKWLDSKETASVVYISFGSLVILLEEQVKELTNLL 315
           YLD R++ D  Y LN+ +S      I WLD++   SVVY++FGS+  L   Q++EL + +
Sbjct: 244 YLDQRIKSDTGYDLNLFESKDDSFCINWLDTRPQGSVVYVAFGSMAQLTNVQMEELASAV 303

Query: 316 RDTDFSFLWVLRESELEKLPNNFLQDTS-ERGLIVNWCCQPQVLSHKAVSCFVTHCGWNS 375
             ++FSFLWV+R SE EKLP+ FL+  + E+ L++ W  Q QVLS+KA+ CF+THCGWNS
Sbjct: 304 --SNFSFLWVVRSSEEEKLPSGFLETVNKEKSLVLKWSPQLQVLSNKAIGCFLTHCGWNS 363

Query: 376 TLEALSLGVPMVAIPQWVDQTTNAKFIADVWGVGIRVK-KNEKGIATKEELEASIRKIVQ 435
           T+EAL+ GVPMVA+PQW DQ  NAK+I DVW  G+RVK + E GIA +EE+E SI+++++
Sbjct: 364 TMEALTFGVPMVAMPQWTDQPMNAKYIQDVWKAGVRVKTEKESGIAKREEIEFSIKEVME 423

Query: 436 GERANEFKQNSIKWKNLAKEAVDEGGTSDKHIEEFVQAI 465
           GER+ E K+N  KW++LA ++++EGG++D +I+ FV  +
Sbjct: 424 GERSKEMKKNVKKWRDLAVKSLNEGGSTDTNIDTFVSRV 446

BLAST of Bhi04G001143 vs. TAIR10
Match: AT1G05680.1 (Uridine diphosphate glycosyltransferase 74E2)

HSP 1 Score: 372.5 bits (955), Expect = 3.9e-103
Identity = 199/456 (43.64%), Postives = 279/456 (61.18%), Query Frame = 0

Query: 18  NHVIVFPFPRHGHISPMLQFSKRLISKGLLLTFLTTSSASQSLILNLPPSPSFHLKI-IS 77
           +H+IV PFP  GHI+PM QF KRL SKGL LT +  S          PP  + H  I + 
Sbjct: 5   SHLIVLPFPGQGHITPMSQFCKRLASKGLKLTLVLVSDKPS------PPYKTEHDSITVF 64

Query: 78  DVSE-----SNVLASLAAYLQSFRAAVTKSLANFIDQALISSSDEEIPPTLIVYDSVMPW 137
            +S         L  L  Y++    ++  +L   ++   +S +    PP  IVYDS MPW
Sbjct: 65  PISNGFQEGEEPLQDLDDYMERVETSIKNTLPKLVEDMKLSGN----PPRAIVYDSTMPW 124

Query: 138 VQTVAAERGLDTAPFFTQSAAVNHVLLLVYGGSLSIPPPE---NVAVSLPAEIALQPGDL 197
           +  VA   GL  A FFTQ   V  +   V+ GS S+P  +   +   S P+   L   DL
Sbjct: 125 LLDVAHSYGLSGAVFFTQPWLVTAIYYHVFKGSFSVPSTKYGHSTLASFPSFPMLTANDL 184

Query: 198 PAFPDDSEV---VLKFMTSQFYNLENVKWIFINTFDRLESKVVNWMAKTLPIKTVGPTIP 257
           P+F  +S     +L+ +  Q  N++ V  +  NTFD+LE K++ W+    P+  +GPT+P
Sbjct: 185 PSFLCESSSYPNILRIVVDQLSNIDRVDIVLCNTFDKLEEKLLKWVQSLWPVLNIGPTVP 244

Query: 258 SAYLDGRLEDDKAYGLNVSKSNGGKSPIKWLDSKETASVVYISFGSLVILLEEQVKELTN 317
           S YLD RL +DK YG ++  +   +  ++WL+SKE  SVVY+SFGSLVIL E+Q+ EL  
Sbjct: 245 SMYLDKRLSEDKNYGFSLFNAKVAEC-MEWLNSKEPNSVVYLSFGSLVILKEDQMLELAA 304

Query: 318 LLRDTDFSFLWVLRESELEKLPNNFLQDTSERGLIVNWCCQPQVLSHKAVSCFVTHCGWN 377
            L+ +   FLWV+RE+E  KLP N++++  E+GLIV+W  Q  VL+HK++ CF+THCGWN
Sbjct: 305 GLKQSGRFFLWVVRETETHKLPRNYVEEIGEKGLIVSWSPQLDVLAHKSIGCFLTHCGWN 364

Query: 378 STLEALSLGVPMVAIPQWVDQTTNAKFIADVWGVGIRVKKNEKGIATKEELEASIRKIVQ 437
           STLE LSLGVPM+ +P W DQ TNAKF+ DVW VG+RVK    G   +EE+  S+ ++++
Sbjct: 365 STLEGLSLGVPMIGMPHWTDQPTNAKFMQDVWKVGVRVKAEGDGFVRREEIMRSVEEVME 424

Query: 438 GERANEFKQNSIKWKNLAKEAVDEGGTSDKHIEEFV 462
           GE+  E ++N+ KWK LA+EAV EGG+SDK I EFV
Sbjct: 425 GEKGKEIRKNAEKWKVLAQEAVSEGGSSDKSINEFV 449

BLAST of Bhi04G001143 vs. TAIR10
Match: AT2G31750.1 (UDP-glucosyl transferase 74D1)

HSP 1 Score: 350.9 bits (899), Expect = 1.2e-96
Identity = 195/459 (42.48%), Postives = 291/459 (63.40%), Query Frame = 0

Query: 19  HVIVFPFPRHGHISPMLQFSKRLISKGLLLTFLTTSSASQSLILNLPPSPSFHLKI---- 78
           +V+VF FP  GHI+P+LQFSKRL+SK + +TFLTTSS   S++       +  L +    
Sbjct: 8   NVLVFSFPIQGHINPLLQFSKRLLSKNVNVTFLTTSSTHNSILRRAITGGATALPLSFVP 67

Query: 79  ISDVSESNVLASLAA--YLQSFRAAVTKSLANFIDQALISSSDEEIPPTLIVYDSVMPWV 138
           I D  E +  ++  +  Y   F+  V++SL+      LISS D +  P  +VYDS +P+V
Sbjct: 68  IDDGFEEDHPSTDTSPDYFAKFQENVSRSLSE-----LISSMDPK--PNAVVYDSCLPYV 127

Query: 139 QTVAAER-GLDTAPFFTQSAAVNHVLLLVYGGSLSIPPPENVAVSLPAEIALQPGDLPAF 198
             V  +  G+  A FFTQS+ VN   +    G       +   V LPA   L+  DLP F
Sbjct: 128 LDVCRKHPGVAAASFFTQSSTVNATYIHFLRGEFKEFQND---VVLPAMPPLKGNDLPVF 187

Query: 199 PDDSEV---VLKFMTSQFYNLENVKWIFINTFDRLESKVVNWMAKTLPIKTVGPTIPSAY 258
             D+ +   + + ++SQF N++++ +  +N+FD LE +V+ WM    P+K +GP IPS Y
Sbjct: 188 LYDNNLCRPLFELISSQFVNVDDIDFFLVNSFDELEVEVLQWMKNQWPVKNIGPMIPSMY 247

Query: 259 LDGRLEDDKAYGLNVSKSNGGKSPIKWLDSKETASVVYISFGSLVILLEEQVKELTNLLR 318
           LD RL  DK YG+N+  +   +  + WLDSK   SV+Y+SFGSL +L ++Q+ E+   L+
Sbjct: 248 LDKRLAGDKDYGINLFNAQVNEC-LDWLDSKPPGSVIYVSFGSLAVLKDDQMIEVAAGLK 307

Query: 319 DTDFSFLWVLRESELEKLPNNFLQDTSERGLIVNWCCQPQVLSHKAVSCFVTHCGWNSTL 378
            T  +FLWV+RE+E +KLP+N+++D  ++GLIVNW  Q QVL+HK++ CF+THCGWNSTL
Sbjct: 308 QTGHNFLWVVRETETKKLPSNYIEDICDKGLIVNWSPQLQVLAHKSIGCFMTHCGWNSTL 367

Query: 379 EALSLGVPMVAIPQWVDQTTNAKFIADVWGVGIRVKKNEKGIATKEELEASIRKIVQ--G 438
           EALSLGV ++ +P + DQ TNAKFI DVW VG+RVK ++ G   KEE+   + ++++   
Sbjct: 368 EALSLGVALIGMPAYSDQPTNAKFIEDVWKVGVRVKADQNGFVPKEEIVRCVGEVMEDMS 427

Query: 439 ERANEFKQNSIKWKNLAKEAVDEGGTSDKHIEEFVQAIV 466
           E+  E ++N+ +    A+EA+ +GG SDK+I+EFV  IV
Sbjct: 428 EKGKEIRKNARRLMEFAREALSDGGNSDKNIDEFVAKIV 455

BLAST of Bhi04G001143 vs. TAIR10
Match: AT1G05675.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 350.1 bits (897), Expect = 2.1e-96
Identity = 192/457 (42.01%), Postives = 271/457 (59.30%), Query Frame = 0

Query: 18  NHVIVFPFPRHGHISPMLQFSKRLISKGLLLTFLTTSSASQSLILNLPPSPSFHLKIISD 77
           +HVIV PFP  GHI+PM QF KRL SK L +T +  S          PP  + H   I+ 
Sbjct: 5   SHVIVLPFPAQGHITPMSQFCKRLASKSLKITLVLVSDKPS------PPYKTEH-DTITV 64

Query: 78  VSESNVL-------ASLAAYLQSFRAAVTKSLANFIDQALISSSDEEIPPTLIVYDSVMP 137
           V  SN           L  Y++   +++   L   I+   +S +    PP  +VYDS MP
Sbjct: 65  VPISNGFQEGQERSEDLDEYMERVESSIKNRLPKLIEDMKLSGN----PPRALVYDSTMP 124

Query: 138 WVQTVAAERGLDTAPFFTQSAAVNHVLLLVYGGSLSIPPPE---NVAVSLPAEIALQPGD 197
           W+  VA   GL  A FFTQ   V+ +   V+ GS S+P  +   +   S P+   L   D
Sbjct: 125 WLLDVAHSYGLSGAVFFTQPWLVSAIYYHVFKGSFSVPSTKYGHSTLASFPSLPILNAND 184

Query: 198 LPAFPDDSE---VVLKFMTSQFYNLENVKWIFINTFDRLESKVVNWMAKTLPIKTVGPTI 257
           LP+F  +S     +L+ +  Q  N++ V  +  NTFD+LE K++ W+    P+  +GPT+
Sbjct: 185 LPSFLCESSSYPYILRTVIDQLSNIDRVDIVLCNTFDKLEEKLLKWIKSVWPVLNIGPTV 244

Query: 258 PSAYLDGRLEDDKAYGLNVSKSNGGKSPIKWLDSKETASVVYISFGSLVILLEEQVKELT 317
           PS YLD RL +DK YG ++  +   +  ++WL+SK+ +SVVY+SFGSLV+L ++Q+ EL 
Sbjct: 245 PSMYLDKRLAEDKNYGFSLFGAKIAEC-MEWLNSKQPSSVVYVSFGSLVVLKKDQLIELA 304

Query: 318 NLLRDTDFSFLWVLRESELEKLPNNFLQDTSERGLIVNWCCQPQVLSHKAVSCFVTHCGW 377
             L+ +   FLWV+RE+E  KLP N++++  E+GL V+W  Q +VL+HK++ CFVTHCGW
Sbjct: 305 AGLKQSGHFFLWVVRETERRKLPENYIEEIGEKGLTVSWSPQLEVLTHKSIGCFVTHCGW 364

Query: 378 NSTLEALSLGVPMVAIPQWVDQTTNAKFIADVWGVGIRVKKNEKGIATKEELEASIRKIV 437
           NSTLE LSLGVPM+ +P W DQ TNAKF+ DVW VG+RVK +  G   +EE         
Sbjct: 365 NSTLEGLSLGVPMIGMPHWADQPTNAKFMEDVWKVGVRVKADSDGFVRREEXXXXXXXXX 424

Query: 438 QGERANEFKQNSIKWKNLAKEAVDEGGTSDKHIEEFV 462
                   ++N+ KWK LA+EAV EGG+SDK+I EFV
Sbjct: 425 XXXXXXXXRKNAEKWKVLAQEAVSEGGSSDKNINEFV 449

BLAST of Bhi04G001143 vs. TAIR10
Match: AT2G43820.1 (UDP-glucosyltransferase 74F2)

HSP 1 Score: 346.3 bits (887), Expect = 3.0e-95
Identity = 197/459 (42.92%), Postives = 281/459 (61.22%), Query Frame = 0

Query: 16  KQNHVIVFPFPRHGHISPMLQFSKRLISKGLLLTFLTTSSASQSLILNLPPSPSFHLKII 75
           K+ HV+  P+P  GHI+P  QF KRL  KGL  T   T+    S  +N   S    +  I
Sbjct: 4   KRGHVLAVPYPTQGHITPFRQFCKRLHFKGLKTTLALTTFVFNS--INPDLSGPISIATI 63

Query: 76  SDVSES---NVLASLAAYLQSFRAAVTKSLANFIDQALISSSDEEIPPTLIVYDSVMPWV 135
           SD  +        S+  YL+ F+ + +K++A+ I +   S +    P T IVYD+ +PW 
Sbjct: 64  SDGYDHGGFETADSIDDYLKDFKTSGSKTIADIIQKHQTSDN----PITCIVYDAFLPWA 123

Query: 136 QTVAAERGLDTAPFFTQSAAVNHVLLLVY--GGSLSIPPPENVAVSLPAEIALQPGDLPA 195
             VA E GL   PFFTQ  AVN+V  L Y   GSL +P  E     LP    L+  DLP+
Sbjct: 124 LDVAREFGLVATPFFTQPCAVNYVYYLSYINNGSLQLPIEE-----LP---FLELQDLPS 183

Query: 196 FPDDS---EVVLKFMTSQFYNLENVKWIFINTFDRLESKVVNWMAKTLPIKTVGPTIPSA 255
           F   S       + +  QF N E   ++ +N+F  LE       +K  P+ T+GPTIPS 
Sbjct: 184 FFSVSGSYPAYFEMVLQQFINFEKADFVLVNSFQELELHENELWSKACPVLTIGPTIPSI 243

Query: 256 YLDGRLEDDKAYGLNVSKSNGGKSPIKWLDSKETASVVYISFGSLVILLEEQVKELTNLL 315
           YLD R++ D  Y LN+ +S      I WLD++   SVVY++FGS+  L   Q++EL + +
Sbjct: 244 YLDQRIKSDTGYDLNLFESKDDSFCINWLDTRPQGSVVYVAFGSMAQLTNVQMEELASAV 303

Query: 316 RDTDFSFLWVLRESELEKLPNNFLQDTS-ERGLIVNWCCQPQVLSHKAVSCFVTHCGWNS 375
             ++FSFLWV+R SE EKLP+ FL+  + E+ L++ W  Q QVLS+KA+ CF+THCGWNS
Sbjct: 304 --SNFSFLWVVRSSEEEKLPSGFLETVNKEKSLVLKWSPQLQVLSNKAIGCFLTHCGWNS 363

Query: 376 TLEALSLGVPMVAIPQWVDQTTNAKFIADVWGVGIRVK-KNEKGIATKEELEASIRKIVQ 435
           T+EAL+ GVPMVA+PQW DQ  NAK+I DVW  G+RVK + E GIA +EE+E SI+++++
Sbjct: 364 TMEALTFGVPMVAMPQWTDQPMNAKYIQDVWKAGVRVKTEKESGIAKREEIEFSIKEVME 423

Query: 436 GERANEFKQNSIKWKNLAKEAVDEGGTSDKHIEEFVQAI 465
           GER+ E K+N  KW++LA ++++EGG++D +I+ FV  +
Sbjct: 424 GERSKEMKKNVKKWRDLAVKSLNEGGSTDTNIDTFVSRV 446

BLAST of Bhi04G001143 vs. TAIR10
Match: AT2G31790.1 (UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 345.1 bits (884), Expect = 6.7e-95
Identity = 184/454 (40.53%), Postives = 271/454 (59.69%), Query Frame = 0

Query: 16  KQNHVIVFPFPRHGHISPMLQFSKRLISKGLLLTFLTTSSASQSLILNLPPSPSFHLKII 75
           K+ HV+ FP+P  GHI+PM+Q +KRL  KG+  T +  S   +    +   S + H    
Sbjct: 5   KKGHVLFFPYPLQGHINPMIQLAKRLSKKGITSTLIIASKDHREPYTSDDYSITVHTIHD 64

Query: 76  SDVSESNVLASLAAYLQSFRAAVTKSLANFIDQALISSSDEEIPPTLIVYDSVMPWVQTV 135
                 +  A     L  F  + ++SL +FI  A +S +    PP  ++YD  MP+   +
Sbjct: 65  GFFPHEHPHAKFVD-LDRFHNSTSRSLTDFISSAKLSDN----PPKALIYDPFMPFALDI 124

Query: 136 AAERGLDTAPFFTQSAAVNHVLLLVYGGSLSIPPPEN---VAVSLPAEIALQPGDLPAFP 195
           A +  L    +FTQ    + V   +  G+  +P   +      S P    L   DLP+F 
Sbjct: 125 AKDLDLYVVAYFTQPWLASLVYYHINEGTYDVPVDRHENPTLASFPGFPLLSQDDLPSFA 184

Query: 196 DDS---EVVLKFMTSQFYNLENVKWIFINTFDRLESKVVNWMAKTLPIKTVGPTIPSAYL 255
            +     ++ +F+  QF NL     I  NTFD+LE KVV WM    P+K +GP +PS +L
Sbjct: 185 CEKGSYPLLHEFVVRQFSNLLQADCILCNTFDQLEPKVVKWMNDQWPVKNIGPVVPSKFL 244

Query: 256 DGRLEDDKAYGLNVSKSNGGKSPIKWLDSKETASVVYISFGSLVILLEEQVKELTNLLRD 315
           D RL +DK Y L  SK+   +S +KWL ++   SVVY++FG+LV L E+Q+KE+   +  
Sbjct: 245 DNRLPEDKDYELENSKTEPDESVLKWLGNRPAKSVVYVAFGTLVALSEKQMKEIAMAISQ 304

Query: 316 TDFSFLWVLRESELEKLPNNFLQDTSER--GLIVNWCCQPQVLSHKAVSCFVTHCGWNST 375
           T + FLW +RESE  KLP+ F+++  E+  GL+  W  Q +VL+H+++ CFV+HCGWNST
Sbjct: 305 TGYHFLWSVRESERSKLPSGFIEEAEEKDSGLVAKWVPQLEVLAHESIGCFVSHCGWNST 364

Query: 376 LEALSLGVPMVAIPQWVDQTTNAKFIADVWGVGIRVKKNEKGIATKEELEASIRKIVQGE 435
           LEAL LGVPMV +PQW DQ TNAKFI DVW +G+RV+ + +G+++KEE+   I ++++GE
Sbjct: 365 LEALCLGVPMVGVPQWTDQPTNAKFIEDVWKIGVRVRTDGEGLSSKEEIARCIVEVMEGE 424

Query: 436 RANEFKQNSIKWKNLAKEAVDEGGTSDKHIEEFV 462
           R  E ++N  K K LA+EA+ EGG+SDK I+EFV
Sbjct: 425 RGKEIRKNVEKLKVLAREAISEGGSSDKKIDEFV 453

BLAST of Bhi04G001143 vs. TrEMBL
Match: tr|A0A1S3BCU2|A0A1S3BCU2_CUCME (Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103488485 PE=3 SV=1)

HSP 1 Score: 792.3 bits (2045), Expect = 5.7e-226
Identity = 398/469 (84.86%), Postives = 432/469 (92.11%), Query Frame = 0

Query: 1   MEETTGNGVGRRRIVKQNHVIVFPFPRHGHISPMLQFSKRLISKGLLLTFLTTSSASQSL 60
           ME T  NG G R  +KQ+HVIVFPFPRHGH+SPMLQFSKRLISKGLLLTFL TSSASQSL
Sbjct: 1   MEMTAANGGGER--IKQSHVIVFPFPRHGHMSPMLQFSKRLISKGLLLTFLITSSASQSL 60

Query: 61  ILNLPPSPSFHLKIISDVSESNVLASLAAYLQSFRAAVTKSLANFIDQALISSSDEEIPP 120
            +N+PPSPSFH KIISD+ ES+ +A+L AYL+SFRAAVTKSL+NFID+ L SSS+EE+PP
Sbjct: 61  TINIPPSPSFHFKIISDLPESDDVATLDAYLRSFRAAVTKSLSNFIDEVLTSSSNEEVPP 120

Query: 121 TLIVYDSVMPWVQTVAAERGLDTAPFFTQSAAVNHVLLLVYGGSLSIPPPENVAVSLPAE 180
           TLIVYDSVMPWVQ+VAAERGLD+APFFT+SAAVNH+L LVYGGSLSIPPP+NV VSLP+E
Sbjct: 121 TLIVYDSVMPWVQSVAAERGLDSAPFFTESAAVNHLLHLVYGGSLSIPPPDNVVVSLPSE 180

Query: 181 IALQPGDLPAFPDDSEVVLKFMTSQFYNLENVKWIFINTFDRLESKVVNWMAKTLPIKTV 240
           I LQP DLP+FPDD EVVL FMTSQF +LENVKWIFINTFDRLESKVVNWMAKTLPIKTV
Sbjct: 181 IVLQPEDLPSFPDDPEVVLDFMTSQFSHLENVKWIFINTFDRLESKVVNWMAKTLPIKTV 240

Query: 241 GPTIPSAYLDGRLEDDKAYGLNVSKSNGGKSPIKWLDSKETASVVYISFGSLVILLEEQV 300
           GPTIPSAYLDGRLE DKAYGLNVSKSN GK PIKWLDSKETASV+YISFGSLVIL EEQV
Sbjct: 241 GPTIPSAYLDGRLEKDKAYGLNVSKSNNGKCPIKWLDSKETASVIYISFGSLVILSEEQV 300

Query: 301 KELTNLLRDTDFSFLWVLRESELEKLPNNFLQDTSERGLIVNWCCQPQVLSHKAVSCFVT 360
           KELTNLLRDTDFSFLWVLRESE+ KLP NF+QDTS+RGLIVNWCCQ QVLSHKAVSCFVT
Sbjct: 301 KELTNLLRDTDFSFLWVLRESEMVKLPKNFVQDTSDRGLIVNWCCQLQVLSHKAVSCFVT 360

Query: 361 HCGWNSTLEALSLGVPMVAIPQWVDQTTNAKFIADVWGVGIRVKKNEKGIATKEELEASI 420
           HCGWNSTLEALSLGVPMVAIPQW+DQTTNAKF+ADVW VG+RVKKNEK +A KEELEASI
Sbjct: 361 HCGWNSTLEALSLGVPMVAIPQWIDQTTNAKFVADVWRVGVRVKKNEKSVAIKEELEASI 420

Query: 421 RKI-VQGERANEFKQNSIKWKNLAKEAVDEGGTSDKHIEEFVQAIVASN 469
           RKI VQG   NEFKQN+IKWKNLAKEAVDE G+SDK+IEEFVQA+VASN
Sbjct: 421 RKIVVQGNGTNEFKQNAIKWKNLAKEAVDERGSSDKNIEEFVQALVASN 467

BLAST of Bhi04G001143 vs. TrEMBL
Match: tr|A0A0A0KD63|A0A0A0KD63_CUCSA (Glycosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_6G366280 PE=3 SV=1)

HSP 1 Score: 791.2 bits (2042), Expect = 1.3e-225
Identity = 397/469 (84.65%), Postives = 433/469 (92.32%), Query Frame = 0

Query: 1   MEETTGNGVGRRRIVKQNHVIVFPFPRHGHISPMLQFSKRLISKGLLLTFLTTSSASQSL 60
           ME+   NG G R  +KQNHVIVFPFPRHGH+SPMLQFSKRLISKGLLLTFL TSSASQSL
Sbjct: 1   MEKAMANGGGGR--IKQNHVIVFPFPRHGHMSPMLQFSKRLISKGLLLTFLVTSSASQSL 60

Query: 61  ILNLPPSPSFHLKIISDVSESNVLASLAAYLQSFRAAVTKSLANFIDQALISSSDEEIPP 120
            +N+PPSPSFH+KIISD+ ES+ +A+  AY++SF+AAVTKSL+NFID+ALISSS EE+ P
Sbjct: 61  TINIPPSPSFHIKIISDLPESDDVATFDAYIRSFQAAVTKSLSNFIDEALISSSYEEVSP 120

Query: 121 TLIVYDSVMPWVQTVAAERGLDTAPFFTQSAAVNHVLLLVYGGSLSIPPPENVAVSLPAE 180
           TLIVYDS+MPWV +VAAERGLD+APFFT+SAAVNH+L LVYGGSLSIP PENV VSLP+E
Sbjct: 121 TLIVYDSIMPWVHSVAAERGLDSAPFFTESAAVNHLLHLVYGGSLSIPAPENVVVSLPSE 180

Query: 181 IALQPGDLPAFPDDSEVVLKFMTSQFYNLENVKWIFINTFDRLESKVVNWMAKTLPIKTV 240
           I LQPGDLP+FPDD EVVL FM +QF +LENVKWIFINTFDRLESKVVNWMAKTLPIKTV
Sbjct: 181 IVLQPGDLPSFPDDPEVVLDFMINQFSHLENVKWIFINTFDRLESKVVNWMAKTLPIKTV 240

Query: 241 GPTIPSAYLDGRLEDDKAYGLNVSKSNGGKSPIKWLDSKETASVVYISFGSLVILLEEQV 300
           GPTIPSAYLDGRLE+DKAYGLNVSKSN GKSPIKWLDSKETASV+YISFGSLV+L EEQV
Sbjct: 241 GPTIPSAYLDGRLENDKAYGLNVSKSNNGKSPIKWLDSKETASVIYISFGSLVMLSEEQV 300

Query: 301 KELTNLLRDTDFSFLWVLRESELEKLPNNFLQDTSERGLIVNWCCQPQVLSHKAVSCFVT 360
           KELTNLLRDTDFSFLWVLRESEL KLPNNF+QDTS+ GLIVNWCCQ QVLSHKAVSCFVT
Sbjct: 301 KELTNLLRDTDFSFLWVLRESELVKLPNNFVQDTSDHGLIVNWCCQLQVLSHKAVSCFVT 360

Query: 361 HCGWNSTLEALSLGVPMVAIPQWVDQTTNAKFIADVWGVGIRVKKNEKGIATKEELEASI 420
           HCGWNSTLEALSLGVPMVAIPQWVDQTTNAKF+ADVW VG+RVKKNEKG+A KEELEASI
Sbjct: 361 HCGWNSTLEALSLGVPMVAIPQWVDQTTNAKFVADVWRVGVRVKKNEKGVAIKEELEASI 420

Query: 421 RKI-VQGERANEFKQNSIKWKNLAKEAVDEGGTSDKHIEEFVQAIVASN 469
           RKI VQG R NEFKQNSIKWKNLAKEAVDE G+SDK+IEEFVQA+ ASN
Sbjct: 421 RKIVVQGNRPNEFKQNSIKWKNLAKEAVDERGSSDKNIEEFVQALAASN 467

BLAST of Bhi04G001143 vs. TrEMBL
Match: tr|A0A1S3BCV0|A0A1S3BCV0_CUCME (Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103488486 PE=3 SV=1)

HSP 1 Score: 529.3 bits (1362), Expect = 9.1e-147
Identity = 280/470 (59.57%), Postives = 346/470 (73.62%), Query Frame = 0

Query: 10  GRRRIVKQNHVIVFPFPRHGHISPMLQFSKRLISKGLLLTFLTTSSASQSLILNLPPSPS 69
           G R++     V+VF +P+HGH+SPMLQF+KRL SKGL +TFLTTSS +QSL +NL PS  
Sbjct: 8   GGRKLSSNVVVVVFAYPKHGHMSPMLQFAKRLASKGLRVTFLTTSSVNQSLQINLLPSYQ 67

Query: 70  FHLKIISDVSESNVLASLAAYLQSFRAAVTKSLANFIDQAL---ISSSDEEIPPT-LIVY 129
             L+ ISDV    +L SL    +SF A V++S  +F+D AL   I+S  +  PP   +V+
Sbjct: 68  IDLQFISDVRTEPIL-SLKDEHESFDAVVSRSFGDFLDGALRTNINSDYDSTPPRYFVVF 127

Query: 130 DSVMPWVQTVAAERGLDTAPFFTQSAAVNHVLLLVYGGSL---SIPPPENVAVSLPAEIA 189
           DS+MPW   VAAERG+D+APFFT+S AVNH+L  VY GSL   S+PP     VS+P+   
Sbjct: 128 DSIMPWAMDVAAERGMDSAPFFTESCAVNHILNQVYEGSLCLSSVPPA--AGVSIPSLPV 187

Query: 190 LQPGDLPAFPDDSEVVLKFMTSQFYNLENVKWIFINTFDRLESKVVNWMAKTLPIKTVGP 249
           L   DLP F  + EVV+ FM  QF + +  KWIF+NTFD+LE KVVNWMAK  PIKTVGP
Sbjct: 188 LAVEDLPFFSYEREVVVNFMVRQFSSFKKAKWIFVNTFDQLEMKVVNWMAKRWPIKTVGP 247

Query: 250 TIPSAYLDGRLEDDKAYGLNVSKSNGGKSPIKWLDSKETASVVYISFGSLVILLEEQVKE 309
           TIPSAYL+G LE+DK+YGL   K       ++WLD+KE  SV+YISFGSLV+L  EQV E
Sbjct: 248 TIPSAYLEGELENDKSYGLKHLKMEDNGKILEWLDTKENGSVIYISFGSLVVLPHEQVDE 307

Query: 310 LTNLLRD-----TDFSFLWVLRESELEKLPNNFLQDTSERGLIVNWCCQPQVLSHKAVSC 369
           L N L+      T+ SFLWVLRESE+EKLPNNF+Q TS +GL+VNWCCQ QVLSH A+ C
Sbjct: 308 LANCLKSITTTTTNLSFLWVLRESEIEKLPNNFIQSTSHKGLVVNWCCQLQVLSHNAIGC 367

Query: 370 FVTHCGWNSTLEALSLGVPMVAIPQWVDQTTNAKFIADVWGVGIRVK-KNEKGIATKEEL 429
           FVTHCGWNST+EALSLGVPMVA+PQW+DQTTNAKF+ADVW VG+RVK  ++KGIATKEEL
Sbjct: 368 FVTHCGWNSTIEALSLGVPMVAVPQWIDQTTNAKFVADVWEVGVRVKIGSDKGIATKEEL 427

Query: 430 EASIRKIVQGERA-NEFKQNSIKWKNLAKEAVDEGGTSDKHIEEFVQAIV 466
           EASI+++  G+   NE K NS     LAKEA+ EGG+S K+IEEFV +I+
Sbjct: 428 EASIQRVFGGDHGKNEIKINSTNLMKLAKEAMKEGGSSYKNIEEFVDSII 474

BLAST of Bhi04G001143 vs. TrEMBL
Match: tr|A0A1Q3BGR8|A0A1Q3BGR8_CEPFO (Glycosyltransferase OS=Cephalotus follicularis OX=3775 GN=CFOL_v3_10644 PE=3 SV=1)

HSP 1 Score: 483.0 bits (1242), Expect = 7.5e-133
Identity = 245/456 (53.73%), Postives = 328/456 (71.93%), Query Frame = 0

Query: 14  IVKQNHVIVFPFPRHGHISPMLQFSKRLISKGLLLTFLTTSSASQSLILNLPPSPSFHLK 73
           I  + H++ FP+P  GH++PM+Q SKRL SKGL +T +TT+S S+S+      + SFH +
Sbjct: 4   IASETHILAFPYPAQGHMNPMVQLSKRLASKGLKVTLITTTSTSKSI---QTQACSFHFE 63

Query: 74  IISDVSESNV-LASLAAYLQSFRAAVTKSLANFIDQALISSSDEEIPPTLIVYDSVMPWV 133
           II+DVS   V       Y+Q FR  V++SL+  I+    S +  + PP  ++YDS MPWV
Sbjct: 64  IITDVSYEGVKTEDTEEYVQRFRVVVSQSLSELIE----SLNKTKNPPKFLIYDSGMPWV 123

Query: 134 QTVAAERGLDTAPFFTQSAAVNHVLLLVYGGSLSIPPPENVAVSLPAEIALQPGDLPAFP 193
             +A   GLD APFFTQ  AV  +   VY G+LS+P   +  VS P+   L   DLP+F 
Sbjct: 124 LNIARRLGLDGAPFFTQPCAVGAIYYHVYQGALSVPLEASSMVSFPSMPPLAIDDLPSFL 183

Query: 194 DDSEV---VLKFMTSQFYNLENVKWIFINTFDRLESKVVNWMAKTLPIKTVGPTIPSAYL 253
            +S +    LK + +QF N+E   W+  NTFD+LES++VNWMA   PIKT+GPT+PS YL
Sbjct: 184 HNSRLYPAFLKSVVNQFSNIEEANWLLCNTFDKLESEIVNWMASRWPIKTIGPTVPSIYL 243

Query: 254 DGRLEDDKAYGLNVSKSNGGKSPIKWLDSKETASVVYISFGSLVILLEEQVKELTNLLRD 313
           D RL DDK YGL++ K N     +KWL+SKE  SVVY++FGSL  L EEQ++EL   L+ 
Sbjct: 244 DKRLRDDKDYGLSLFKPN-TDGCMKWLNSKEVGSVVYVAFGSLAGLGEEQMEELVWGLKR 303

Query: 314 TDFSFLWVLRESELEKLPNNFLQDTSERGLIVNWCCQPQVLSHKAVSCFVTHCGWNSTLE 373
           +++ +LWV+RESE +KLP NFL++TS++GL+V W  Q +VL+HKA  CF+THCGWNSTLE
Sbjct: 304 SNYYYLWVVRESEEKKLPRNFLEETSKKGLVVKWSPQLEVLAHKATGCFMTHCGWNSTLE 363

Query: 374 ALSLGVPMVAIPQWVDQTTNAKFIADVWGVGIRVKKNEKGIATKEELEASIRKIVQGERA 433
           +LSLGVPMVA+P W DQTTNAKF+ADVW VG+RVKKNEKGI T+EE+E  ++++++GERA
Sbjct: 364 SLSLGVPMVAMPHWTDQTTNAKFVADVWEVGVRVKKNEKGIVTREEIELCLKEVMEGERA 423

Query: 434 NEFKQNSIKWKNLAKEAVDEGGTSDKHIEEFVQAIV 466
           NE K+NS KWK+LAKEAVDEGG+SDK+IEEFV  ++
Sbjct: 424 NEIKRNSDKWKSLAKEAVDEGGSSDKNIEEFVAELL 451

BLAST of Bhi04G001143 vs. TrEMBL
Match: tr|A0A2N9JBC5|A0A2N9JBC5_FAGSY (Glycosyltransferase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS61651 PE=3 SV=1)

HSP 1 Score: 479.9 bits (1234), Expect = 6.3e-132
Identity = 254/457 (55.58%), Postives = 333/457 (72.87%), Query Frame = 0

Query: 16  KQNHVIVFPFPRHGHISPMLQFSKRLISKGLLLTFLTTSSASQSLILNLPPSPSFHLKII 75
           ++ H++V P+P  GHI+PMLQFSKRL SKG  +TF+TTSS S+S  +    S S +++II
Sbjct: 3   RETHILVIPYPVQGHINPMLQFSKRLASKGPRVTFITTSSISES--IQAHASHSINVEII 62

Query: 76  SDVS-ESNVLASLAAYLQSFRAAVTKSLANFIDQALISSSDEEIPPTLIVYDSVMPWVQT 135
           SD S E N L S+ AYL+ F+  V++SLA  I++     +  + PP ++VYDSVMPW   
Sbjct: 63  SDGSEEGNNLESIEAYLKRFQLNVSQSLAKIIEK----HNSSQYPPKILVYDSVMPWALN 122

Query: 136 VAAERGLDTAPFFTQSAAVNHVLLLVYGGSLSIPPPENVAVSLPAEIALQPGDLPAFPDD 195
           +A + GLD A FFTQS AV  +    + G++ I P E  +VSLP+  +L   DLP+F  D
Sbjct: 123 IARQLGLDGATFFTQSCAVKAIYYHAHHGAVPI-PFEGPSVSLPSMPSLGIDDLPSFLCD 182

Query: 196 S---EVVLKFMTSQFYNLENVKWIFINTFDRLESKVVNWMA-KTLPIKTVGPTIPSAYLD 255
                 +L  + +QF N+    WI  NTFD+LE ++VNWMA K  P KTVGP IPS YLD
Sbjct: 183 KGTYPALLNLVLNQFSNILEANWILCNTFDKLEYELVNWMASKQWPFKTVGPAIPSLYLD 242

Query: 256 GRLEDDKAYGLNVSKSNGGKSPIKWLDSKETASVVYISFGSLVILLEEQVKELTNLLRDT 315
            RLEDDK YGL++ K +   + +KWLD+KET SVVY SFGSL  L EEQ++ELT  L+++
Sbjct: 243 KRLEDDKEYGLHLFKPD-VDACMKWLDTKETGSVVYTSFGSLASLGEEQMEELTWGLKNS 302

Query: 316 DFSFLWVLRESELEKLPNNFLQDTSERGLIVNWCCQPQVLSHKAVSCFVTHCGWNSTLEA 375
           +  FLWV+RE+E +KLP+NFL++T E GL+VNWC Q +VL+HKAV CF+THCGWNSTLEA
Sbjct: 303 NCYFLWVVRETEQQKLPSNFLEETVENGLVVNWCPQLEVLTHKAVGCFMTHCGWNSTLEA 362

Query: 376 LSLGVPMVAIPQWVDQTTNAKFIADVWGVGIRVKKNEKGIATKEELEASIRKIVQGERAN 435
           LS+GVPMVA+PQW DQTTNAKFI DVW VG+R+K +E+GIATKEE++ SIR++++GE   
Sbjct: 363 LSIGVPMVAMPQWTDQTTNAKFIKDVWKVGVRIKLDERGIATKEEIKLSIREVMEGESGK 422

Query: 436 EFKQNSIKWKNLAKEAVDEGGTSDKHIEEFVQAIVAS 468
           E K+NSIKWK  AKEAVDEGG+SDK+IEEFV  +  S
Sbjct: 423 EMKKNSIKWKEFAKEAVDEGGSSDKNIEEFVAKLAHS 451

BLAST of Bhi04G001143 vs. NCBI nr
Match: XP_008445481.1 (PREDICTED: UDP-glycosyltransferase 74E2-like [Cucumis melo])

HSP 1 Score: 792.3 bits (2045), Expect = 8.7e-226
Identity = 398/469 (84.86%), Postives = 432/469 (92.11%), Query Frame = 0

Query: 1   MEETTGNGVGRRRIVKQNHVIVFPFPRHGHISPMLQFSKRLISKGLLLTFLTTSSASQSL 60
           ME T  NG G R  +KQ+HVIVFPFPRHGH+SPMLQFSKRLISKGLLLTFL TSSASQSL
Sbjct: 1   MEMTAANGGGER--IKQSHVIVFPFPRHGHMSPMLQFSKRLISKGLLLTFLITSSASQSL 60

Query: 61  ILNLPPSPSFHLKIISDVSESNVLASLAAYLQSFRAAVTKSLANFIDQALISSSDEEIPP 120
            +N+PPSPSFH KIISD+ ES+ +A+L AYL+SFRAAVTKSL+NFID+ L SSS+EE+PP
Sbjct: 61  TINIPPSPSFHFKIISDLPESDDVATLDAYLRSFRAAVTKSLSNFIDEVLTSSSNEEVPP 120

Query: 121 TLIVYDSVMPWVQTVAAERGLDTAPFFTQSAAVNHVLLLVYGGSLSIPPPENVAVSLPAE 180
           TLIVYDSVMPWVQ+VAAERGLD+APFFT+SAAVNH+L LVYGGSLSIPPP+NV VSLP+E
Sbjct: 121 TLIVYDSVMPWVQSVAAERGLDSAPFFTESAAVNHLLHLVYGGSLSIPPPDNVVVSLPSE 180

Query: 181 IALQPGDLPAFPDDSEVVLKFMTSQFYNLENVKWIFINTFDRLESKVVNWMAKTLPIKTV 240
           I LQP DLP+FPDD EVVL FMTSQF +LENVKWIFINTFDRLESKVVNWMAKTLPIKTV
Sbjct: 181 IVLQPEDLPSFPDDPEVVLDFMTSQFSHLENVKWIFINTFDRLESKVVNWMAKTLPIKTV 240

Query: 241 GPTIPSAYLDGRLEDDKAYGLNVSKSNGGKSPIKWLDSKETASVVYISFGSLVILLEEQV 300
           GPTIPSAYLDGRLE DKAYGLNVSKSN GK PIKWLDSKETASV+YISFGSLVIL EEQV
Sbjct: 241 GPTIPSAYLDGRLEKDKAYGLNVSKSNNGKCPIKWLDSKETASVIYISFGSLVILSEEQV 300

Query: 301 KELTNLLRDTDFSFLWVLRESELEKLPNNFLQDTSERGLIVNWCCQPQVLSHKAVSCFVT 360
           KELTNLLRDTDFSFLWVLRESE+ KLP NF+QDTS+RGLIVNWCCQ QVLSHKAVSCFVT
Sbjct: 301 KELTNLLRDTDFSFLWVLRESEMVKLPKNFVQDTSDRGLIVNWCCQLQVLSHKAVSCFVT 360

Query: 361 HCGWNSTLEALSLGVPMVAIPQWVDQTTNAKFIADVWGVGIRVKKNEKGIATKEELEASI 420
           HCGWNSTLEALSLGVPMVAIPQW+DQTTNAKF+ADVW VG+RVKKNEK +A KEELEASI
Sbjct: 361 HCGWNSTLEALSLGVPMVAIPQWIDQTTNAKFVADVWRVGVRVKKNEKSVAIKEELEASI 420

Query: 421 RKI-VQGERANEFKQNSIKWKNLAKEAVDEGGTSDKHIEEFVQAIVASN 469
           RKI VQG   NEFKQN+IKWKNLAKEAVDE G+SDK+IEEFVQA+VASN
Sbjct: 421 RKIVVQGNGTNEFKQNAIKWKNLAKEAVDERGSSDKNIEEFVQALVASN 467

BLAST of Bhi04G001143 vs. NCBI nr
Match: XP_004144190.1 (PREDICTED: UDP-glycosyltransferase 74E2-like [Cucumis sativus] >KGN47630.1 hypothetical protein Csa_6G366280 [Cucumis sativus])

HSP 1 Score: 791.2 bits (2042), Expect = 1.9e-225
Identity = 397/469 (84.65%), Postives = 433/469 (92.32%), Query Frame = 0

Query: 1   MEETTGNGVGRRRIVKQNHVIVFPFPRHGHISPMLQFSKRLISKGLLLTFLTTSSASQSL 60
           ME+   NG G R  +KQNHVIVFPFPRHGH+SPMLQFSKRLISKGLLLTFL TSSASQSL
Sbjct: 1   MEKAMANGGGGR--IKQNHVIVFPFPRHGHMSPMLQFSKRLISKGLLLTFLVTSSASQSL 60

Query: 61  ILNLPPSPSFHLKIISDVSESNVLASLAAYLQSFRAAVTKSLANFIDQALISSSDEEIPP 120
            +N+PPSPSFH+KIISD+ ES+ +A+  AY++SF+AAVTKSL+NFID+ALISSS EE+ P
Sbjct: 61  TINIPPSPSFHIKIISDLPESDDVATFDAYIRSFQAAVTKSLSNFIDEALISSSYEEVSP 120

Query: 121 TLIVYDSVMPWVQTVAAERGLDTAPFFTQSAAVNHVLLLVYGGSLSIPPPENVAVSLPAE 180
           TLIVYDS+MPWV +VAAERGLD+APFFT+SAAVNH+L LVYGGSLSIP PENV VSLP+E
Sbjct: 121 TLIVYDSIMPWVHSVAAERGLDSAPFFTESAAVNHLLHLVYGGSLSIPAPENVVVSLPSE 180

Query: 181 IALQPGDLPAFPDDSEVVLKFMTSQFYNLENVKWIFINTFDRLESKVVNWMAKTLPIKTV 240
           I LQPGDLP+FPDD EVVL FM +QF +LENVKWIFINTFDRLESKVVNWMAKTLPIKTV
Sbjct: 181 IVLQPGDLPSFPDDPEVVLDFMINQFSHLENVKWIFINTFDRLESKVVNWMAKTLPIKTV 240

Query: 241 GPTIPSAYLDGRLEDDKAYGLNVSKSNGGKSPIKWLDSKETASVVYISFGSLVILLEEQV 300
           GPTIPSAYLDGRLE+DKAYGLNVSKSN GKSPIKWLDSKETASV+YISFGSLV+L EEQV
Sbjct: 241 GPTIPSAYLDGRLENDKAYGLNVSKSNNGKSPIKWLDSKETASVIYISFGSLVMLSEEQV 300

Query: 301 KELTNLLRDTDFSFLWVLRESELEKLPNNFLQDTSERGLIVNWCCQPQVLSHKAVSCFVT 360
           KELTNLLRDTDFSFLWVLRESEL KLPNNF+QDTS+ GLIVNWCCQ QVLSHKAVSCFVT
Sbjct: 301 KELTNLLRDTDFSFLWVLRESELVKLPNNFVQDTSDHGLIVNWCCQLQVLSHKAVSCFVT 360

Query: 361 HCGWNSTLEALSLGVPMVAIPQWVDQTTNAKFIADVWGVGIRVKKNEKGIATKEELEASI 420
           HCGWNSTLEALSLGVPMVAIPQWVDQTTNAKF+ADVW VG+RVKKNEKG+A KEELEASI
Sbjct: 361 HCGWNSTLEALSLGVPMVAIPQWVDQTTNAKFVADVWRVGVRVKKNEKGVAIKEELEASI 420

Query: 421 RKI-VQGERANEFKQNSIKWKNLAKEAVDEGGTSDKHIEEFVQAIVASN 469
           RKI VQG R NEFKQNSIKWKNLAKEAVDE G+SDK+IEEFVQA+ ASN
Sbjct: 421 RKIVVQGNRPNEFKQNSIKWKNLAKEAVDERGSSDKNIEEFVQALAASN 467

BLAST of Bhi04G001143 vs. NCBI nr
Match: XP_022997132.1 (UDP-glycosyltransferase 74E2-like [Cucurbita maxima] >XP_022997133.1 UDP-glycosyltransferase 74E2-like [Cucurbita maxima])

HSP 1 Score: 739.6 bits (1908), Expect = 6.7e-210
Identity = 376/467 (80.51%), Postives = 412/467 (88.22%), Query Frame = 0

Query: 1   MEETTGNGVGRRRIVKQNHVIVFPFPRHGHISPMLQFSKRLISKGLLLTFLTTSSASQSL 60
           ME+TT NG G    +KQNHVIVFPFPRHGH++PMLQF+KRL+SKG LLTFLTTSSASQSL
Sbjct: 1   MEKTTVNGGGE---MKQNHVIVFPFPRHGHMNPMLQFAKRLVSKGFLLTFLTTSSASQSL 60

Query: 61  ILNLPPSPSFHLKIISDVSESNVLASLAAYLQSFRAAVTKSLANFIDQALISSSDEEIPP 120
           IL+LPPSP  H K+ISDV ESN + SL AYL+SFRAA +KSLANFID++LIS S+ E+ P
Sbjct: 61  ILDLPPSP-IHHKVISDVPESNNIDSLDAYLRSFRAAASKSLANFIDESLISDSN-EVLP 120

Query: 121 TLIVYDSVMPWVQTVAAERGLDTAPFFTQSAAVNHVLLLVYGGSLSIPPPENVAVSLPAE 180
           +LIVYDSVMPWVQ+VAAERGLD APFFTQSAAVNH+L LVY GSLSIPPPE+VAVSLP+E
Sbjct: 121 SLIVYDSVMPWVQSVAAERGLDAAPFFTQSAAVNHILDLVYKGSLSIPPPEDVAVSLPSE 180

Query: 181 IALQPGDLPAFPDDSEVVLKFMTSQFYNLENVKWIFINTFDRLESKVVNWMAKTLPIKTV 240
           I LQP DLPA PDD  VVL FMTSQF NLE VKWIF NTFDRLE KVVNWM KTLPIKTV
Sbjct: 181 IVLQPADLPALPDDGVVVLDFMTSQFINLEKVKWIFFNTFDRLECKVVNWMTKTLPIKTV 240

Query: 241 GPTIPSAYLDGRLEDDKAYGLNVSKSNGGKSPIKWLDSKETASVVYISFGSLVILLEEQV 300
           GPTIPSAYLDGRL DDKAYGLNV   N GK  I+WLDSKETAS++YISFGSLV L  EQV
Sbjct: 241 GPTIPSAYLDGRLVDDKAYGLNVLNPNDGKKAIQWLDSKETASIIYISFGSLVNLKIEQV 300

Query: 301 KELTNLLRDTDFSFLWVLRESELEKLPNNFLQDTSERGLIVNWCCQPQVLSHKAVSCFVT 360
            ELT  L DT+ SFLWVLRESEL KLPNNF+QDTSE GLIVNWCCQ QVLSHKAVSCFVT
Sbjct: 301 NELTCFLEDTNLSFLWVLRESELGKLPNNFVQDTSEHGLIVNWCCQLQVLSHKAVSCFVT 360

Query: 361 HCGWNSTLEALSLGVPMVAIPQWVDQTTNAKFIADVWGVGIRVKKNEKGIATKEELEASI 420
           HCGWNST+EALSLGVPMVAIPQWVDQTTNAKF+ADVW VG+RVKKN+KGIATKEELEASI
Sbjct: 361 HCGWNSTIEALSLGVPMVAIPQWVDQTTNAKFVADVWEVGVRVKKNDKGIATKEELEASI 420

Query: 421 RKIVQGERANEFKQNSIKWKNLAKEAVDEGGTSDKHIEEFVQAIVAS 468
           RK+VQGE+ NE KQNSIKWK LAKEA+DEGG+SDK+I+EFVQA+ AS
Sbjct: 421 RKVVQGEKPNEIKQNSIKWKKLAKEAMDEGGSSDKNIDEFVQAMAAS 462

BLAST of Bhi04G001143 vs. NCBI nr
Match: XP_022962392.1 (UDP-glycosyltransferase 74E2-like [Cucurbita moschata] >XP_022962393.1 UDP-glycosyltransferase 74E2-like [Cucurbita moschata])

HSP 1 Score: 734.6 bits (1895), Expect = 2.1e-208
Identity = 372/467 (79.66%), Postives = 416/467 (89.08%), Query Frame = 0

Query: 1   MEETTGNGVGRRRIVKQNHVIVFPFPRHGHISPMLQFSKRLISKGLLLTFLTTSSASQSL 60
           ME+TT +G G    +KQ+HVIVFPFPRHGH++PMLQF+KRL+SKGLLLTFLTTSSAS+SL
Sbjct: 1   MEKTTVDGGGE---MKQSHVIVFPFPRHGHMNPMLQFAKRLVSKGLLLTFLTTSSASESL 60

Query: 61  ILNLPPSPSFHLKIISDVSESNVLASLAAYLQSFRAAVTKSLANFIDQALISSSDEEIPP 120
           IL+LPPSP  H K+ISD  ESN + SL AYL+SFRAA +KSLANFID+ALIS S+ E+ P
Sbjct: 61  ILDLPPSPIRH-KVISDDPESNNIDSLDAYLRSFRAAASKSLANFIDEALISDSN-EVLP 120

Query: 121 TLIVYDSVMPWVQTVAAERGLDTAPFFTQSAAVNHVLLLVYGGSLSIPPPENVAVSLPAE 180
           +LIVYDSVMPWVQ+VAAERGLD APFFTQSAAVNH+L LVY GSLSIPPPE+VAVSLP+E
Sbjct: 121 SLIVYDSVMPWVQSVAAERGLDAAPFFTQSAAVNHILDLVYKGSLSIPPPEDVAVSLPSE 180

Query: 181 IALQPGDLPAFPDDSEVVLKFMTSQFYNLENVKWIFINTFDRLESKVVNWMAKTLPIKTV 240
           I LQP DLP  PDD +VVL+FMTSQF NLENVKWIF NTFDRLE KVVNWM KTLPIKTV
Sbjct: 181 IVLQPADLPTLPDDGDVVLEFMTSQFINLENVKWIFFNTFDRLECKVVNWMTKTLPIKTV 240

Query: 241 GPTIPSAYLDGRLEDDKAYGLNVSKSNGGKSPIKWLDSKETASVVYISFGSLVILLEEQV 300
           GPTIPSAYLDGRL DDKAYGLNV   N GK  I+WLDSKETASV+YISFGSLV L +EQV
Sbjct: 241 GPTIPSAYLDGRLVDDKAYGLNVLNPNDGKKAIEWLDSKETASVIYISFGSLVNLEKEQV 300

Query: 301 KELTNLLRDTDFSFLWVLRESELEKLPNNFLQDTSERGLIVNWCCQPQVLSHKAVSCFVT 360
            ELT  LR+T+ SFLWVLRESEL KLPNNF+QDTSE+GLIVNWCCQ +VLSHKAVSCFVT
Sbjct: 301 TELTCFLRNTNLSFLWVLRESELGKLPNNFVQDTSEQGLIVNWCCQLEVLSHKAVSCFVT 360

Query: 361 HCGWNSTLEALSLGVPMVAIPQWVDQTTNAKFIADVWGVGIRVKKNEKGIATKEELEASI 420
           HCGWNST+EALSLGVPM+AIPQWVDQTTNAKF+ADVW VG+RVKKN+KGI TKEELEASI
Sbjct: 361 HCGWNSTIEALSLGVPMIAIPQWVDQTTNAKFVADVWEVGVRVKKNDKGIVTKEELEASI 420

Query: 421 RKIVQGERANEFKQNSIKWKNLAKEAVDEGGTSDKHIEEFVQAIVAS 468
           RKIVQGE+ NE KQNSIKWK +AKEA+DEGG+SDK+I+EFVQA+ AS
Sbjct: 421 RKIVQGEKPNEIKQNSIKWKKVAKEAMDEGGSSDKNIDEFVQAMAAS 462

BLAST of Bhi04G001143 vs. NCBI nr
Match: XP_023546480.1 (UDP-glycosyltransferase 74E2-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 731.9 bits (1888), Expect = 1.4e-207
Identity = 369/467 (79.01%), Postives = 415/467 (88.87%), Query Frame = 0

Query: 1   MEETTGNGVGRRRIVKQNHVIVFPFPRHGHISPMLQFSKRLISKGLLLTFLTTSSASQSL 60
           ME+TT +G G    +KQ+HVIVFPFPRHGH++PMLQF+KRL+SKGLLLTFLTTSSAS+SL
Sbjct: 1   MEKTTVDGGGE---MKQSHVIVFPFPRHGHMNPMLQFAKRLVSKGLLLTFLTTSSASESL 60

Query: 61  ILNLPPSPSFHLKIISDVSESNVLASLAAYLQSFRAAVTKSLANFIDQALISSSDEEIPP 120
           IL+LPPSP  H K+ISDV ES+ + SL AYL+SFRAA +KSLANFID+ALIS S+ E+ P
Sbjct: 61  ILDLPPSP-IHHKVISDVPESSNIDSLDAYLRSFRAAASKSLANFIDEALISDSN-EVLP 120

Query: 121 TLIVYDSVMPWVQTVAAERGLDTAPFFTQSAAVNHVLLLVYGGSLSIPPPENVAVSLPAE 180
           +LIVYDSVMPWVQ+VAAERGLD APFFTQSAAVNH+L LVY GSLSIPPPE+VA+SLP+E
Sbjct: 121 SLIVYDSVMPWVQSVAAERGLDAAPFFTQSAAVNHILDLVYKGSLSIPPPEDVAISLPSE 180

Query: 181 IALQPGDLPAFPDDSEVVLKFMTSQFYNLENVKWIFINTFDRLESKVVNWMAKTLPIKTV 240
           I LQP DLP  PDD +VVL+FMTSQF NLENVKWIF NTFDRLE KVVNWM KTLPIKTV
Sbjct: 181 IVLQPADLPTLPDDGDVVLEFMTSQFINLENVKWIFFNTFDRLECKVVNWMTKTLPIKTV 240

Query: 241 GPTIPSAYLDGRLEDDKAYGLNVSKSNGGKSPIKWLDSKETASVVYISFGSLVILLEEQV 300
           GPTIPSAYLDGRL  DKAYGLNV   N GK  I+WLDSKETAS++YISFGSLV L +EQV
Sbjct: 241 GPTIPSAYLDGRLAYDKAYGLNVLNPNDGKKAIQWLDSKETASIIYISFGSLVNLEKEQV 300

Query: 301 KELTNLLRDTDFSFLWVLRESELEKLPNNFLQDTSERGLIVNWCCQPQVLSHKAVSCFVT 360
            ELT  L+DT+ SFLWVLRESEL KLPNNF+QDT E+GLIVNWCCQ QVLSHKAVSCFVT
Sbjct: 301 TELTCFLKDTNLSFLWVLRESELGKLPNNFVQDTLEQGLIVNWCCQLQVLSHKAVSCFVT 360

Query: 361 HCGWNSTLEALSLGVPMVAIPQWVDQTTNAKFIADVWGVGIRVKKNEKGIATKEELEASI 420
           HCGWNST+EALSLGVPMVAIPQWVDQTTNAKF+ADVW VG+RVKKN+KGI TKEELEASI
Sbjct: 361 HCGWNSTIEALSLGVPMVAIPQWVDQTTNAKFVADVWEVGVRVKKNDKGIVTKEELEASI 420

Query: 421 RKIVQGERANEFKQNSIKWKNLAKEAVDEGGTSDKHIEEFVQAIVAS 468
           RK+VQGE+ NE KQNSIKWK +AKEA+DEGG+SDK+I+EFVQA+ AS
Sbjct: 421 RKVVQGEKPNEIKQNSIKWKKVAKEAMDEGGSSDKNIDEFVQAMAAS 462

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
sp|Q9SYK9|U74E2_ARATH7.0e-10243.64UDP-glycosyltransferase 74E2 OS=Arabidopsis thaliana OX=3702 GN=UGT74E2 PE=1 SV=... [more]
sp|Q6VAA6|U74G1_STERE6.1e-9843.07UDP-glycosyltransferase 74G1 OS=Stevia rebaudiana OX=55670 GN=UGT74G1 PE=1 SV=1[more]
sp|Q9SKC5|U74D1_ARATH2.2e-9542.48UDP-glycosyltransferase 74D1 OS=Arabidopsis thaliana OX=3702 GN=UGT74D1 PE=1 SV=... [more]
sp|P0C7P7|U74E1_ARATH3.7e-9542.01UDP-glycosyltransferase 74E1 OS=Arabidopsis thaliana OX=3702 GN=UGT74E1 PE=3 SV=... [more]
sp|O22822|U74F2_ARATH5.4e-9442.92UDP-glycosyltransferase 74F2 OS=Arabidopsis thaliana OX=3702 GN=UGT74F2 PE=1 SV=... [more]
Match NameE-valueIdentityDescription
AT1G05680.13.9e-10343.64Uridine diphosphate glycosyltransferase 74E2[more]
AT2G31750.11.2e-9642.48UDP-glucosyl transferase 74D1[more]
AT1G05675.12.1e-9642.01UDP-Glycosyltransferase superfamily protein[more]
AT2G43820.13.0e-9542.92UDP-glucosyltransferase 74F2[more]
AT2G31790.16.7e-9540.53UDP-Glycosyltransferase superfamily protein[more]
Match NameE-valueIdentityDescription
tr|A0A1S3BCU2|A0A1S3BCU2_CUCME5.7e-22684.86Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103488485 PE=3 SV=1[more]
tr|A0A0A0KD63|A0A0A0KD63_CUCSA1.3e-22584.65Glycosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_6G366280 PE=3 SV=1[more]
tr|A0A1S3BCV0|A0A1S3BCV0_CUCME9.1e-14759.57Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103488486 PE=3 SV=1[more]
tr|A0A1Q3BGR8|A0A1Q3BGR8_CEPFO7.5e-13353.73Glycosyltransferase OS=Cephalotus follicularis OX=3775 GN=CFOL_v3_10644 PE=3 SV=... [more]
tr|A0A2N9JBC5|A0A2N9JBC5_FAGSY6.3e-13255.58Glycosyltransferase OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS61651 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
XP_008445481.18.7e-22684.86PREDICTED: UDP-glycosyltransferase 74E2-like [Cucumis melo][more]
XP_004144190.11.9e-22584.65PREDICTED: UDP-glycosyltransferase 74E2-like [Cucumis sativus] >KGN47630.1 hypot... [more]
XP_022997132.16.7e-21080.51UDP-glycosyltransferase 74E2-like [Cucurbita maxima] >XP_022997133.1 UDP-glycosy... [more]
XP_022962392.12.1e-20879.66UDP-glycosyltransferase 74E2-like [Cucurbita moschata] >XP_022962393.1 UDP-glyco... [more]
XP_023546480.11.4e-20779.01UDP-glycosyltransferase 74E2-like [Cucurbita pepo subsp. pepo][more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0016758transferase activity, transferring hexosyl groups
Vocabulary: Biological Process
TermDefinition
GO:0008152metabolic process
Vocabulary: INTERPRO
TermDefinition
IPR035595UDP_glycos_trans_CS
IPR002213UDP_glucos_trans
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0008152 metabolic process
cellular_component GO:0043231 intracellular membrane-bounded organelle
cellular_component GO:0005575 cellular_component
molecular_function GO:0080043 quercetin 3-O-glucosyltransferase activity
molecular_function GO:0080044 quercetin 7-O-glucosyltransferase activity
molecular_function GO:0016758 transferase activity, transferring hexosyl groups
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Bhi04M001143Bhi04M001143mRNA


Analysis Name: InterPro Annotations of wax gourd
Date Performed: 2019-11-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePFAMPF00201UDPGTcoord: 277..438
e-value: 7.7E-26
score: 90.8
NoneNo IPR availableGENE3DG3DSA:3.40.50.2000coord: 253..445
e-value: 3.6E-132
score: 443.7
NoneNo IPR availableGENE3DG3DSA:3.40.50.2000coord: 23..252
e-value: 3.6E-132
score: 443.7
coord: 446..457
e-value: 3.6E-132
score: 443.7
NoneNo IPR availablePANTHERPTHR11926:SF524UDP-GLYCOSYLTRANSFERASE 74C1-RELATEDcoord: 15..464
NoneNo IPR availablePANTHERPTHR11926GLUCOSYL/GLUCURONOSYL TRANSFERASEScoord: 15..464
NoneNo IPR availableCDDcd03784GT1_Gtf_likecoord: 19..424
e-value: 5.05891E-27
score: 109.76
NoneNo IPR availableSUPERFAMILYSSF53756UDP-Glycosyltransferase/glycogen phosphorylasecoord: 18..464
IPR035595UDP-glycosyltransferase family, conserved sitePROSITEPS00375UDPGTcoord: 343..386