Cp4.1LG17g09910 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG17g09910
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Sterol 3-beta-glucosyltransferase
Location: Cp4.1LG17 : 7470595 .. 7477380 (-)
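
The location field above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page); `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re


def parse_location(text):
    """Parse a location string such as 'Cp4.1LG17 : 7470595 .. 7477380 (-)'.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "seqid": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        "length": end - start + 1,  # inclusive span on the linkage group
    }


loc = parse_location("Cp4.1LG17 : 7470595 .. 7477380 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the gene spans 6786 bp on the minus strand of Cp4.1LG17.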
The following sequences are available for this feature:

Gene sequence (with intron)

TCTCCTCCGCCGGTTGAGTCGTCGTCGTCAATCATGGTTGGCGGATGAATATGGAGGGATTTGGGGTTCCAGTTCATCATCAAACCAGCCCTCCAGGTTGGTTCCATATTATTTCCCCCTTTTTTCTTGATTTCTCTTGCTGCATCATTCTTCATATACATACATAACTACTTATAAACATTCCATCATTGTTATGTGCCGTCTCATCAACTGTCACCAACCCGATGATGCTCCCACTGCAAATTGTTGATTTGAGTTCCTAATGTTTTGAGTGGATGAATATGAAATTCCTTCTTTTAGGGTATGTAAAAAAGATAATTAAAACTTAGAACTAAATAGTGCTGGTACTCGAATATTGGCTCGTAATATCGTTTTGAAATCTTAGAGACAATAAAAATGGAAGCTTTTTGAGGAGAAAATTTTATATTTGTGTTGCGGTTGTATATTAAATTGAAACCCAAGTTTGTAAAATTGGTTCCTTCTTGGTGAATTAATTACACTGCTTTAGGTGCTTCATGCCGAACTCTTTTGAAAGCCAATACCCTCCATGTGGAGATACCTTCACACAGTACGGAATCAGTCTCACGCGAGCACCGCCTTCAAAGATCAAAAACCGAGACGCATAGACGCGAAACCATATTTGCTGCAGATGCCCTTCAAATATACGATCATAAAATCCCCATTCATCGCAAGGTAAGCATTCCACAAAATTTCTTCAATCCTCGAATGTTGATTCCTTTTAATCATTACAAATTTTGCAGTTCAAATTGTTGCAAAGAGTAGCCACCGTGAAGGATGATGGAAGTGTTGAGTTTGAAGTTGAAGGAAACATCGAATCCGAATCGATTAATGTTGAATCTGATGAACCCCTGGATGAAGCTGAGTTTGAGTATATAAGGCCTATGCAAATTGTAATGCTCATTGTAGGTACCCGGGGAGATGTGCAGCCATTTATCCCAATCGCCAAGCGTTTGCAGGTAATCCTGTATATACATCTATATCTATATCTATACCTATATCTATATCTATATCTTTATCGAGAGTTATGTATGTACGTATGTAGGATTATGGTCATCGTGTTAGATTAGCTACTCATCCAAATTTCAAGGACTTTGTGTTGATGGCTGGCTTAGAGTTTTATCCTTTAGGAGGAGATCCTAAACTGCTTGCATCCTGTACGTACTTCACACATCAAGCTTCTACAAATCCATGAATGCAACCTTTATAGATTGTTAAATGAAAATCCTGTTTTTTTTTTGTTTGAAACAGATATGGTAAGAAATAAAGGGTTCTTGCCCTCTGGGCCTTCTGAGATACTCATTCAAAGAAACCATATGAAGGAAATTATTTATTCCCTTCTTCCAGCTTGCAAACAACCTGATGTGGATACTGGTATTCCCTTTCAAGCAGATGCAATCATTGCCAACCGTACAGCATATGGTTAGATTTTGTCGCCCGATTGAATTTTTAAAAATTATGTTGTACAACGTTGAAATATATTGATTTATTAGGATTTTTTTAAAATAACTATTTTTGAAAGAATAATAAACGTGAAAATGTTACCGTGGGGAACCCAATCCCTGCCTCAATCCCCACAAAAATAAATGAGAACTTTTGCAGGGATGGGGAAGGCTTGCTTGTCTCTGCCCTGCTCCGTGGATATCTTTAGTCCCCTATAGTAAGCTACGAGGCAAATGCACTATCGCAAGCATGAACATGAAAAAGTCGAGAAAATAAACAAAACTAATTACATGGCACTAACATAGGGTAGGTGAGGATGGGGCCGTTACATTTGGACAAGCATGACCAAGACACTCAACGTCGGTGTTGCTATTTTTGCAAATTCTTTACTTTGCTAACTAAACTTTTTGATACTAATTTCTCATAAACTCTGTTTTGTAGGGCATACACACGTTGCAGAGGGACTTAAGCTGCCACTTCATATATTCTTTACGATGCCATGGACGTAAGTCTTGTGTGGGTGATGAATCATTACCTGC
TTCACTTGTTGTAATGCTTATAGTATAGTTGTATGCTCTCCTCCAGGCCAACTACTGAGTTTCCACATCCATTATCCCGTGTGAAGCAACCGGCAGGGTATAGGGTGAGTTTGTTGGTTGCTTTTGAATATTCATTCACTTTGCATATTCACTCAAAGCTTTGATTCTCCTTCATTTCCAGATTTCATATCAAATCGTTGATTCCTTGATTTGGCTTGGCGTGCGAGATATCATTAATGATTTCAGGAAAAAGAAGCTAAAGATACGACCTGTAACATATTTAAGTAGTTCACGATTCTCCGAATCTGATGTGCCACATGCTTATTTGTGGAGTCCCTACCTTGTTCCTAAACCAAAAGGTGTTGATTTTTACACATTTTTATTTGTGGAGTCAAGAATGCAACTTTGGATGACTTTTTATTTTCCGTAATTCATAGATTGGGGGCCTAAGATTGATGTTGTTGGATATTGCTTCCTTGACCTTGCATCAAATTATGAACCCCCTGAGTCACTGGTGAAATGGCTTGAAGCTGGGGACAGCCCTATTTATTTTGGTTTTGGCAGCCTTGTAAGTTTCTTGCTTCTTCCTCAAACAGTTGAAAGTATCATACAAGCATGGTTTTAGCTACTTTTTTTCATTTGTTCTTCAGCCTGTACAAGAGCCTGATAAGATGACACAGATTATCATACAAGCATTGGAAAGAACTGGACAGCGGGGTATCATTAACGAAGGGTGGGGTGGCCTTGGGAGGTGTAAGTGTTTATTTTCAGTTCACTGCACATTGTTTATGGATTGCTTACATGACTGAATATTATATCTTCCCAGCTGCAGAACCGAAGGACTTCATATATTTGTTAGACAATTGCCCTCATGATTGGCTTTTTCTCAAATGCAAGGCTGTGGTGAGGCTTGATTGTTGCTTTGAATGTATTTTACCGCCTACATATTGTGTTCTTTAAGCATTGCAAGGGTCCTCAGGTGCATCATGGTGGTGCTGGAACAACAGCTGCTGGCCTGAAAGCTGCGGTACTTTGTTTCATCCGATACGTGTTTTGCGTTTTTAGTTTGATATGATTTGGTTATGATTTGGTTATGATTTTCATTGAATTCCCCAGTGCCCAACTACAGTTGTTCCATTCTTTGGTGACCAACCTTTCTGGGGGGATAGAGTACATGCCAGAGGAGTTGGCCCTCCACCTATCCCGGTTGACGAGTTCTCGCTTCCAAGATTGGTGAATGCTATAAACTATATGCTCGATCCCAAGGTAAAACCGAGATTCCCTTCTATTTAGTCTTTAACTTAATTCAAGAAAGCTTGGGATGATTTGGTTGATTCTAGTGAAGTATATCAAGCAAGTGACATGACAGGTAAAAAGGCAGGCTGTGGAGCTTGCAAAGGTTCTCAAGAACGAGGACGGGGTCGAAGGAGCTGTGAGAGCATTCTTCAAGCAACTTGGACGGAGGAAACGCGAGACAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCTGAACCTCAGAGGTCAAGTCTTTTGTTTATAAGAAAATGTTTTGGTTGTTCTTAAATCTTGGATAGTGGCGATTCTCAATTCACTTTGCAAGGACCATTTTTTGCTGGTCTTGTAAAGTTTATCATCGTTACATAATAACATTATTTGTATAAATTGTTGTAAAGGCTGAGGCTAAACTCTTGAATTTGTGTATTGTCAATCAGAAAAAAGAACAAATAAAGCTCACAAGAATATGTATAAATACCTGTTACTAGCATGAATGAACACTATCATCATGTATATTACGACTACTAAGTTCTTGTCAAAAATAAGAGAAAAAGTGAAAGAGAGTGGAATATAATGATATTAGTGAAGATCACCATGATTCTGAAGTGACATAAAAGGGATCAAAAAGGGGTACCAATTTCACTTCTTGTTTCTGTTGGACTGTACAGAGCAGATCTGTTTCTTGGTCTGTTCTCTTCCAGCTGCTTCACAACTTCATCCATGGAGG
GTCTTACAGAAGCCACTGGTGCACAGCAGCCCATTGCGAGCTTCAATGCCTGAACCACTCCATCTTCCATGGGGCTTCTAATTCCTTTCAAAACCTCCACATCAAAAACCTCCATTGTTGTCTCCTCCAGAACTGCCACTTTTACCATCGATGGCAGGTCCACAAACTCTCCACTTCTTCCACTCTTCCCTGGCTTCTTCCCAATCAGAATCTCTAACAGCAAGATCCCAAATGCATACACATCTGTTCTTGAATTGCATTTCTTCATCCTTTGCAGCTCTGGTGCCTTGTACCCGTCCGACTTTGCAAGCGATACAATCTCGTCAGCTACCGATGGAATCATTAGCTTGTCGAGCCCGAACTCCGTCAGTCTCACTGTAAAGAAGTCGTCCACAAGTACATTTTTTGATCTCACATTACCATGTGTAATGGGGACTTCAAGACCGGTGTGAAGATGAGCTAGTCCCTGGGCGACGCCTAACGCAATCTTGTGCCTCCTAGCCCAGTTCAACACTGGTTTTCCTGCTCTAGATTCTGCATTCAATCAAAAAGAAAAAGAATTCTCTGTTGATATATAGCAATACAGCCATGAAATGAAGCATTAGAGAATGTGCATACCATGAAGAAAATCATGTAAAGTTCTGATGGGCAGATAGTCATAAATGAGCAGCTTTTCACCTCTCTTTCCCTGATAGAACGCTCTCAAAGGAATCAAATTCTCATGGCGAATCTTCCCCAATTGTTTAATCACAGACGAGCAAGTGTTTCTGTCTTTGCAACTACCTTCCCTCAGGAGTCTCAGAGCAATGGTACCTCCATCAGCAAGCTTTGCTTTATATATCGTGCCATAGCTTGTTTTCTCCATAACCTGCCCTGTAGCATTCAAGACATCATCCAATGTCAAATGCTCACCACCCTGAAACAAAATGAGCTTTCCTTCACCACCTGTGCCGCCGCCGATACCGCCACCCCCATTTTCTTCATCTTCTCCTTCCTCAATCTCATCTTCACTCTCAGTTATCTTCTTCTTCTTGTTTTGCATATAACCAATAAGCAATGAAGCTAAAACAACCGTCCCAGTCATCAAGCCGATAACAAGACCAGCAATGGCGCCTGAACTCAAATGAGAAGCTCCTGCACAGCTTTTCAAAGGCTCCCCACAAAGCCCAGGACTGTTCCCTTCAAAAGCCTCCACACCAAACTTTGAGTTATTACCAAACACAGGCAATGTTCCACTGAAGTTGTTATTTGAAAGGTTCAACTTCTCCAGCTCTAACTGGCCTAAGCTCTGAGGGATTTGTCCAGACAGCAAATTGTTCCCAAGATCAAGCTCTTTCAAGCCCTGAAATCTAGTAATGAATTCTGGGAAAGTACCTGAAATCTGGTTGTTTCCAAAATCCAAAGCCTCCAAACTCTTGCAGGTGGAATCTGGTAAAGCCGGCTCGGGCAGAGACCCAGATAATGAATTCCCATCAAGCCTGACAGAAACAAGTTTATCACACAGATTCCAGATCGAGGGTGGAAGAGCTCCACTCAACAGATTGCTACTCAAATCAATGTCAGAAAGAGAGGAGCTATAACCAAGTTCGAGTGGAATGGGTCCATTCAAAGAGTTTATGTTGAGATAGAGACTTTGGAGCATAGTGAACTCACCAAGCTCCTTAGGCAAGGAACCGGAGAGGTTAGCAGAAGGAAGTTGCAGGGAGAGAAGATGAAGAGAAGGGTCTTTGAAGAGAGTGAGATTAGACCACTGGGGAGAAGAAGAATCACTGCAAACCAGAGATGTTCCATTAGAGAAGACCCATTTGAGTCCCCTCCATTGACAAAGTGGCAAAGAGTAATTCCATGAAGACAAAAGCAAGTTTTGAGTATCACCTTCAAGTGAAGCTCTGATCTTTTCCAGAAGAAGCTGAACATCAGAAGAAGGGAAATGCAAGGAATCACCACCTCGAACAGGGGCTCTGTTCAACAAGAAGAATGAAATACAGAGAGCCAAAAGC
TTCAAACCCACCATGACTATGGAGAAAGAGGAGAGAAATGGGGAAAGTTAGTAGAATCAATCACCAAGAGAGCTTCTTTTATCTTTTTTACTGGAAAAAGCTGACAGAGTGTTGACTATGAGTGAACTGATAGTCCAATTACTCATCCTCCCACAATCTCCACCGTCCGATCTGTACAAAGACAAACATCAGACAGATTAAAAACATCTTGGAATGGCAAACCAGATAGGTATTATTATAATCGTTTACTCTCTTTATTTCTGTATTTTTTTTTCTTTTTAATCCTTTGTTTCTGCACGCTTGCCCTTCGCATTCATTTTCCTTTTTCTATTTTTGTTGGGTTTTTCAAATTTGGTGTGGAGAAAGAGGAGAGAGAGAGATTTCCCCATTGAACAGCCCACCAAGCTTCGTCCTTTTTTTAACGGATAGTTGCAGAGGAGGTCTGGGTGGGACGCAAGTCTCATCACTCGACACGTGGCTTGTTATAAATGATGATGAGGATCAACAATGGTTGTGTGAGAGTGTGGAGTTCAATCCATGGAGCTCACATGCAGAGTTGCATTTCTTCTTTCGGCTCTGGGCCATCCATGCTTGCCCCAAACAAGCACACTGTTGGAAATTGTTCGGATAAGATGGGGATTTTAAGGCAAGTTTTAGTCTTGCAATTAAACGATTATTTTATTCAAACAAGTAACATTCATCAATAGTTTATGCTAAATTTTAGGATAAAAGAACTTAAAGATATAATTTACTAAATAATAAGATAGTAAAAATCCAACATTATGA

mRNA sequence

TCTCCTCCGCCGGTTGAGTCGTCGTCGTCAATCATGGTTGGCGGATGAATATGGAGGGATTTGGGGTTCCAGTTCATCATCAAACCAGCCCTCCAGGTGCTTCATGCCGAACTCTTTTGAAAGCCAATACCCTCCATGTGGAGATACCTTCACACAGTACGGAATCAGTCTCACGCGAGCACCGCCTTCAAAGATCAAAAACCGAGACGCATAGACGCGAAACCATATTTGCTGCAGATGCCCTTCAAATATACGATCATAAAATCCCCATTCATCGCAAGTTCAAATTGTTGCAAAGAGTAGCCACCGTGAAGGATGATGGAAGTGTTGAGTTTGAAGTTGAAGGAAACATCGAATCCGAATCGATTAATGTTGAATCTGATGAACCCCTGGATGAAGCTGAGTTTGAGTATATAAGGCCTATGCAAATTGTAATGCTCATTGTAGGTACCCGGGGAGATGTGCAGCCATTTATCCCAATCGCCAAGCGTTTGCAGGAAATTATTTATTCCCTTCTTCCAGCTTGCAAACAACCTGATGTGGATACTGGGCATACACACGTTGCAGAGGGACTTAAGCTGCCACTTCATATATTCTTTACGATGCCATGGACGCCAACTACTGAGTTTCCACATCCATTATCCCGTGTGAAGCAACCGGCAGGGTATAGGATTTCATATCAAATCGTTGATTCCTTGATTTGGCTTGGCGTGCGAGATATCATTAATGATTTCAGGAAAAAGAAGCTAAAGATACGACCTGTAACATATTTAAGTAGTTCACGATTCTCCGAATCTGATGTGCCACATGCTTATTTGTGGAGTCCCTACCTTGTTCCTAAACCAAAAGATTGGGGGCCTAAGATTGATGTTGTTGGATATTGCTTCCTTGACCTTGCATCAAATTATGAACCCCCTGAGTCACTGGTGAAATGGCTTGAAGCTGGGGACAGCCCTATTTATTTTGGTTTTGGCAGCCTTCCTGTACAAGAGCCTGATAAGATGACACAGATTATCATACAAGCATTGGAAAGAACTGGACAGCGGGGTATCATTAACGAAGGGTGGGGTGGCCTTGGGAGGTCTGCAGAACCGAAGGACTTCATATATTTGTTAGACAATTGCCCTCATGATTGGCTTTTTCTCAAATGCAAGGCTGTGGTGCATCATGGTGGTGCTGGAACAACAGCTGCTGGCCTGAAAGCTGCGTGCCCAACTACAGTTGTTCCATTCTTTGGTGACCAACCTTTCTGGGGGGATAGAGTACATGCCAGAGGAGTTGGCCCTCCACCTATCCCGGTTGACGAGTTCTCGCTTCCAAGATTGGTGAATGCTATAAACTATATGCTCGATCCCAAGGTAAAAAGGCAGGCTGTGGAGCTTGCAAAGGTTCTCAAGAACGAGGACGGGGTCGAAGGAGCTGTGAGAGCATTCTTCAAGCAACTTGGACGGAGGAAACGCGAGACAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCTGAACCTCAGAGGTCAAGTCTTTTGTTTATAAGAAAATGTTTTGGTTGTTCTTAAATCTTGGATAGTGGCGATTCTCAATTCACTTTGCAAGGACCATTTTTTGCTGGTCTTGTAAAGTTTATCATCGTTACATAATAACATTATTTGTATAAATTGTTGTAAAGGCTGAGGCTAAACTCTTGAATTTGTGTATTGTCAATCAGAAAAAAGAACAAATAAAGCTCACAAGAATATGTATAAATACCTGTTACTAGCATGAATGAACACTATCATCATGTATATTACGACTACTAAGTTCTTGTCAAAAATAAGAGAAAAAGTGAAAGAGAGTGGAATATAATGATATTAGTGAAGATCACCATGATTCTGAAGTGACATAAAAGGGATCAAAAAGGGGTACCAATTTCACTTCTTGTTTCTGTTGGACTGTACAGAGCAGATCTGTTTCTTGGTCTGTTCTCTTCCAGCTGCTTCACAACTTCATCCATGGAGGGTCTTACAGAA
GCCACTGGTGCACAGCAGCCCATTGCGAGCTTCAATGCCTGAACCACTCCATCTTCCATGGGGCTTCTAATTCCTTTCAAAACCTCCACATCAAAAACCTCCATTGTTGTCTCCTCCAGAACTGCCACTTTTACCATCGATGGCAGGTCCACAAACTCTCCACTTCTTCCACTCTTCCCTGGCTTCTTCCCAATCAGAATCTCTAACAGCAAGATCCCAAATGCATACACATCTGTTCTTGAATTGCATTTCTTCATCCTTTGCAGCTCTGGTGCCTTGTACCCGTCCGACTTTGCAAGCGATACAATCTCGTCAGCTACCGATGGAATCATTAGCTTGTCGAGCCCGAACTCCGTCAGTCTCACTGTAAAGAAGTCGTCCACAAGTACATTTTTTGATCTCACATTACCATGTGTAATGGGGACTTCAAGACCGGTGTGAAGATGAGCTAGTCCCTGGGCGACGCCTAACGCAATCTTGTGCCTCCTAGCCCAGTTCAACACTGGTTTTCCTGCTCTAGATTCTGCATTCAATCAAAAAGAAAAAGAATTCTCTGTTGATATATAGCAATACAGCCATGAAATGAAGCATTAGAGAATGTGCATACCATGAAGAAAATCATGTAAAGTTCTGATGGGCAGATAGTCATAAATGAGCAGCTTTTCACCTCTCTTTCCCTGATAGAACGCTCTCAAAGGAATCAAATTCTCATGGCGAATCTTCCCCAATTGTTTAATCACAGACGAGCAAGTGTTTCTGTCTTTGCAACTACCTTCCCTCAGGAGTCTCAGAGCAATGGTACCTCCATCAGCAAGCTTTGCTTTATATATCGTGCCATAGCTTGTTTTCTCCATAACCTGCCCTGTAGCATTCAAGACATCATCCAATGTCAAATGCTCACCACCCTGAAACAAAATGAGCTTTCCTTCACCACCTGTGCCGCCGCCGATACCGCCACCCCCATTTTCTTCATCTTCTCCTTCCTCAATCTCATCTTCACTCTCAGTTATCTTCTTCTTCTTGTTTTGCATATAACCAATAAGCAATGAAGCTAAAACAACCGTCCCAGTCATCAAGCCGATAACAAGACCAGCAATGGCGCCTGAACTCAAATGAGAAGCTCCTGCACAGCTTTTCAAAGGCTCCCCACAAAGCCCAGGACTGTTCCCTTCAAAAGCCTCCACACCAAACTTTGAGTTATTACCAAACACAGGCAATGTTCCACTGAAGTTGTTATTTGAAAGGTTCAACTTCTCCAGCTCTAACTGGCCTAAGCTCTGAGGGATTTGTCCAGACAGCAAATTGTTCCCAAGATCAAGCTCTTTCAAGCCCTGAAATCTAGTAATGAATTCTGGGAAAGTACCTGAAATCTGGTTGTTTCCAAAATCCAAAGCCTCCAAACTCTTGCAGGTGGAATCTGGTAAAGCCGGCTCGGGCAGAGACCCAGATAATGAATTCCCATCAAGCCTGACAGAAACAAGTTTATCACACAGATTCCAGATCGAGGGTGGAAGAGCTCCACTCAACAGATTGCTACTCAAATCAATGTCAGAAAGAGAGGAGCTATAACCAAGTTCGAGTGGAATGGGTCCATTCAAAGAGTTTATGTTGAGATAGAGACTTTGGAGCATAGTGAACTCACCAAGCTCCTTAGGCAAGGAACCGGAGAGGTTAGCAGAAGGAAGTTGCAGGGAGAGAAGATGAAGAGAAGGGTCTTTGAAGAGAGTGAGATTAGACCACTGGGGAGAAGAAGAATCACTGCAAACCAGAGATGTTCCATTAGAGAAGACCCATTTGAGTCCCCTCCATTGACAAAGTGGCAAAGAGTAATTCCATGAAGACAAAAGCAAGTTTTGAGTATCACCTTCAAGTGAAGCTCTGATCTTTTCCAGAAGAAGCTGAACATCAGAAGAAGGGAAATGCAAGGAATCACCACCTCGAACAGGGGCTCTGTTCAACAAGAAGAATGAAATACAGAGAGCCAAAAGCTTCAAACCCAC
CATGACTATGGAGAAAGAGGAGAGAAATGGGGAAAGTTAGTAGAATCAATCACCAAGAGAGCTTCTTTTATCTTTTTTACTGGAAAAAGCTGACAGAGTGTTGACTATGAGTGAACTGATAGTCCAATTACTCATCCTCCCACAATCTCCACCGTCCGATCTGTACAAAGACAAACATCAGACAGATTAAAAACATCTTGGAATGGCAAACCAGATAGAGGAGAGAGAGAGATTTCCCCATTGAACAGCCCACCAAGCTTCGTCCTTTTTTTAACGGATAGTTGCAGAGGAGGTCTGGGTGGGACGCAAGTCTCATCACTCGACACGTGGCTTGTTATAAATGATGATGAGGATCAACAATGGTTGTGTGAGAGTGTGGAGTTCAATCCATGGAGCTCACATGCAGAGTTGCATTTCTTCTTTCGGCTCTGGGCCATCCATGCTTGCCCCAAACAAGCACACTGTTGGAAATTGTTCGGATAAGATGGGGATTTTAAGGCAAGTTTTAGTCTTGCAATTAAACGATTATTTTATTCAAACAAGTAACATTCATCAATAGTTTATGCTAAATTTTAGGATAAAAGAACTTAAAGATATAATTTACTAAATAATAAGATAGTAAAAATCCAACATTATGA

Coding sequence (CDS)

ATGAATATGGAGGGATTTGGGGTTCCAGTTCATCATCAAACCAGCCCTCCAGGTGCTTCATGCCGAACTCTTTTGAAAGCCAATACCCTCCATGTGGAGATACCTTCACACAGTACGGAATCAGTCTCACGCGAGCACCGCCTTCAAAGATCAAAAACCGAGACGCATAGACGCGAAACCATATTTGCTGCAGATGCCCTTCAAATATACGATCATAAAATCCCCATTCATCGCAAGTTCAAATTGTTGCAAAGAGTAGCCACCGTGAAGGATGATGGAAGTGTTGAGTTTGAAGTTGAAGGAAACATCGAATCCGAATCGATTAATGTTGAATCTGATGAACCCCTGGATGAAGCTGAGTTTGAGTATATAAGGCCTATGCAAATTGTAATGCTCATTGTAGGTACCCGGGGAGATGTGCAGCCATTTATCCCAATCGCCAAGCGTTTGCAGGAAATTATTTATTCCCTTCTTCCAGCTTGCAAACAACCTGATGTGGATACTGGGCATACACACGTTGCAGAGGGACTTAAGCTGCCACTTCATATATTCTTTACGATGCCATGGACGCCAACTACTGAGTTTCCACATCCATTATCCCGTGTGAAGCAACCGGCAGGGTATAGGATTTCATATCAAATCGTTGATTCCTTGATTTGGCTTGGCGTGCGAGATATCATTAATGATTTCAGGAAAAAGAAGCTAAAGATACGACCTGTAACATATTTAAGTAGTTCACGATTCTCCGAATCTGATGTGCCACATGCTTATTTGTGGAGTCCCTACCTTGTTCCTAAACCAAAAGATTGGGGGCCTAAGATTGATGTTGTTGGATATTGCTTCCTTGACCTTGCATCAAATTATGAACCCCCTGAGTCACTGGTGAAATGGCTTGAAGCTGGGGACAGCCCTATTTATTTTGGTTTTGGCAGCCTTCCTGTACAAGAGCCTGATAAGATGACACAGATTATCATACAAGCATTGGAAAGAACTGGACAGCGGGGTATCATTAACGAAGGGTGGGGTGGCCTTGGGAGGTCTGCAGAACCGAAGGACTTCATATATTTGTTAGACAATTGCCCTCATGATTGGCTTTTTCTCAAATGCAAGGCTGTGGTGCATCATGGTGGTGCTGGAACAACAGCTGCTGGCCTGAAAGCTGCGTGCCCAACTACAGTTGTTCCATTCTTTGGTGACCAACCTTTCTGGGGGGATAGAGTACATGCCAGAGGAGTTGGCCCTCCACCTATCCCGGTTGACGAGTTCTCGCTTCCAAGATTGGTGAATGCTATAAACTATATGCTCGATCCCAAGGTAAAAAGGCAGGCTGTGGAGCTTGCAAAGGTTCTCAAGAACGAGGACGGGGTCGAAGGAGCTGTGAGAGCATTCTTCAAGCAACTTGGACGGAGGAAACGCGAGACAGAGCCAGAGCCAGAGCCAGAGCCAGAGCCTGAACCTCAGAGGTCAAGTCTTTTGTTTATAAGAAAATGTTTTGGTTGTTCTTAA

Protein sequence

MNMEGFGVPVHHQTSPPGASCRTLLKANTLHVEIPSHSTESVSREHRLQRSKTETHRRETIFAADALQIYDHKIPIHRKFKLLQRVATVKDDGSVEFEVEGNIESESINVESDEPLDEAEFEYIRPMQIVMLIVGTRGDVQPFIPIAKRLQEIIYSLLPACKQPDVDTGHTHVAEGLKLPLHIFFTMPWTPTTEFPHPLSRVKQPAGYRISYQIVDSLIWLGVRDIINDFRKKKLKIRPVTYLSSSRFSESDVPHAYLWSPYLVPKPKDWGPKIDVVGYCFLDLASNYEPPESLVKWLEAGDSPIYFGFGSLPVQEPDKMTQIIIQALERTGQRGIINEGWGGLGRSAEPKDFIYLLDNCPHDWLFLKCKAVVHHGGAGTTAAGLKAACPTTVVPFFGDQPFWGDRVHARGVGPPPIPVDEFSLPRLVNAINYMLDPKVKRQAVELAKVLKNEDGVEGAVRAFFKQLGRRKRETEPEPEPEPEPEPQRSSLLFIRKCFGCS
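
The protein sequence above is the conceptual translation of the CDS. The following is a minimal sketch of how that relationship can be checked, assuming the standard nuclear genetic code; `translate` is a hypothetical helper written for illustration, not the tool the database used.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, to_stop=True):
    """Translate a DNA coding sequence in frame 1.

    Codons containing ambiguous bases become 'X'; a trailing partial
    codon is ignored. With to_stop=True, translation ends at the first
    stop codon and the stop itself is not emitted.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*" and to_stop:
            break
        protein.append(residue)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGAATATGGAGGGATTTGGGGTTCCAGTT"))
```

The first ten codons of the CDS translate to MNMEGFGVPV, which matches the start of the protein sequence; passing the full CDS string to `translate` allows the same comparison over the whole record.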
BLAST of Cp4.1LG17g09910 vs. Swiss-Prot
Match: U80A2_ARATH (Sterol 3-beta-glucosyltransferase UGT80A2 OS=Arabidopsis thaliana GN=UGT80A2 PE=1 SV=1)

HSP 1 Score: 577.8 bits (1488), Expect = 1.2e-163
Identity = 266/375 (70.93%), Positives = 311/375 (82.93%), Query Frame = 1

Query: 144 IPIAK-RLQEIIYSLLPACKQPDVDTG----------------HTHVAEGLKLPLHIFFT 203
           IPI + ++++IIYSLLPACK+PD D+G                HTHVAE LK+P+H+FFT
Sbjct: 270 IPIQRNQMKDIIYSLLPACKEPDPDSGISFKADAIIANPPAYGHTHVAEALKIPIHVFFT 329

Query: 204 MPWTPTTEFPHPLSRVKQPAGYRISYQIVDSLIWLGVRDIINDFRKKKLKIRPVTYLSSS 263
           MPWTPT+EFPHPLSRVKQPAGYR+SYQIVDSLIWLG+RD++ND RKKKLK+RPVTYLS +
Sbjct: 330 MPWTPTSEFPHPLSRVKQPAGYRLSYQIVDSLIWLGIRDMVNDLRKKKLKLRPVTYLSGT 389

Query: 264 RFSESDVPHAYLWSPYLVPKPKDWGPKIDVVGYCFLDLASNYEPPESLVKWLEAGDSPIY 323
           + S S++PH Y+WSP+LVPKPKDWGP+IDVVG+C+LDLASNYEPP  LV+WLEAGD PIY
Sbjct: 390 QGSGSNIPHGYMWSPHLVPKPKDWGPQIDVVGFCYLDLASNYEPPAELVEWLEAGDKPIY 449

Query: 324 FGFGSLPVQEPDKMTQIIIQALERTGQRGIINEGWGGLGRSAEPKDFIYLLDNCPHDWLF 383
            GFGSLPVQEP+KMT+II++AL+RT QRGIIN+GWGGLG   EPKDF+YLLDN PHDWLF
Sbjct: 450 IGFGSLPVQEPEKMTEIIVEALQRTKQRGIINKGWGGLGNLKEPKDFVYLLDNVPHDWLF 509

Query: 384 LKCKAVVHHGGAGTTAAGLKAACPTTVVPFFGDQPFWGDRVHARGVGPPPIPVDEFSLPR 443
            +CKAVVHHGGAGTTAAGLKA+CPTT+VPFFGDQPFWG+RVHARGVGP PIPVDEFSL +
Sbjct: 510 PRCKAVVHHGGAGTTAAGLKASCPTTIVPFFGDQPFWGERVHARGVGPSPIPVDEFSLHK 569

Query: 444 LVNAINYMLDPKVKRQAVELAKVLKNEDGVEGAVRAFFKQLGRRKRETEPEPEPEPEPEP 502
           L +AIN+MLD KVK  A  LAK +K+EDGV GAV+AFFK L   K+        +P PEP
Sbjct: 570 LEDAINFMLDDKVKSSAETLAKAMKDEDGVAGAVKAFFKHLPSAKQNIS-----DPIPEP 629
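
The HSP summary lines in these reports give identity and positive counts over the alignment length, with the percentage in parentheses. The following is a minimal sketch of how such a line can be parsed and the percentages recomputed; `parse_hsp_stats` is a hypothetical helper, and the regular expression is an assumption based only on the line layout shown on this page.

```python
import re

# Matches e.g. "Identity = 266/375 (70.93%), Positives = 311/375 (82.93%)".
# "Posi?tives" also tolerates the label being spelled without the first "i".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from an HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, length = int(match.group(1)), int(match.group(2))
    positives = int(match.group(4))
    return {
        "identical": identical,
        "positives": positives,
        "length": length,
        "pct_identity": round(100 * identical / length, 2),
        "pct_positives": round(100 * positives / length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 266/375 (70.93%), Positives = 311/375 (82.93%), Query Frame = 1"
)
print(stats["pct_identity"], stats["pct_positives"])
```

For the UGT80A2 hit, 266 identical positions out of 375 aligned columns gives 70.93% identity and 311 positives gives 82.93%, in agreement with the reported values.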

BLAST of Cp4.1LG17g09910 vs. Swiss-Prot
Match: U80B1_ARATH (Sterol 3-beta-glucosyltransferase UGT80B1 OS=Arabidopsis thaliana GN=UGT80B1 PE=2 SV=1)

HSP 1 Score: 425.6 bits (1093), Expect = 7.4e-118
Identity = 194/362 (53.59%), Positives = 253/362 (69.89%), Query Frame = 1

Query: 148 KRLQEIIYSLLPACKQPDVDT----------------GHTHVAEGLKLPLHIFFTMPWTP 207
           K+L+ II SLLPAC +PD++T                GH HVAE L +P+HIFFTMPWTP
Sbjct: 238 KQLKAIIESLLPACIEPDLETATSFRAQAIIANPPAYGHVHVAEALGVPIHIFFTMPWTP 297

Query: 208 TTEFPHPLSRVKQPAGYRISYQIVDSLIWLGVRDIINDFRKKKLKIRPVTYLSSSRFSES 267
           T EFPHPL+RV Q A Y +SY +VD ++W  +R  INDFRK+KL + P+ Y S+   S S
Sbjct: 298 TNEFPHPLARVPQSAAYWLSYIVVDLMVWWSIRTYINDFRKRKLNLAPIAYFSTYHGSIS 357

Query: 268 DVPHAYLWSPYLVPKPKDWGPKIDVVGYCFLDLASNYEPPESLVKWLEAGDSPIYFGFGS 327
            +P  Y+WSP++VPKP DWGP +DVVGYCFL+L S Y+P E  + W+E G  P+Y GFGS
Sbjct: 358 HLPTGYMWSPHVVPKPSDWGPLVDVVGYCFLNLGSKYQPREEFLHWIERGSPPVYIGFGS 417

Query: 328 LPVQEPDKMTQIIIQALERTGQRGIINEGWGGLGRSA-EPKDFIYLLDNCPHDWLFLKCK 387
           +P+ +P +   II++ L+ T QRGI++ GWGGLG  A E  + ++L+++CPHDWLF +C 
Sbjct: 418 MPLDDPKQTMDIILETLKDTEQRGIVDRGWGGLGNLATEVPENVFLVEDCPHDWLFPQCS 477

Query: 388 AVVHHGGAGTTAAGLKAACPTTVVPFFGDQPFWGDRVHARGVGPPPIPVDEFSLPRLVNA 447
           AVVHHGGAGTTA GLKA CPTT+VPFFGDQ FWGDR++ +G+GP PIP+ + S+  L ++
Sbjct: 478 AVVHHGGAGTTATGLKAGCPTTIVPFFGDQFFWGDRIYEKGLGPAPIPIAQLSVENLSSS 537

Query: 448 INYMLDPKVKRQAVELAKVLKNEDGVEGAVRAFFKQLGRRKRETEPEPEPEPEPEPQRSS 493
           I +ML P+VK Q +ELAKVL+NEDGV  AV AF + L        PE   E + E  R  
Sbjct: 538 IRFMLQPEVKSQVMELAKVLENEDGVAAAVDAFHRHL--PPELPLPESSSEKKDEDDRPD 597

BLAST of Cp4.1LG17g09910 vs. Swiss-Prot
Match: ATG26_PICPG (Sterol 3-beta-glucosyltransferase OS=Komagataella pastoris (strain GS115 / ATCC 20864) GN=ATG26 PE=3 SV=1)

HSP 1 Score: 206.8 bits (525), Expect = 5.4e-52
Identity = 124/372 (33.33%), Positives = 197/372 (52.96%), Query Frame = 1

Query: 126  PMQIVMLIVGTRGDVQPFIPIAKR-----LQEIIYSLLPACKQPDV------DTGHTHVA 185
            P +++ L+V  +     F+  AK      + E++ S   AC+  DV           H+A
Sbjct: 817  PAELMSLMVTHKSLSVGFLKEAKEKFTGWIGELLQSSWDACQDADVLIESPSAMAGIHIA 876

Query: 186  EGLKLPLHIFFTMPWTPTTEFPHPLSRVKQPAGYRISYQ---IVDSLIWLGVRDIINDFR 245
            E L++P    FTMPWT T  +PH     +Q  G   +Y    I +++ W G+   +N +R
Sbjct: 877  EKLQIPYFRAFTMPWTRTRAYPHAFVVPEQKRGGSYNYLTHIIFENVFWKGISGEVNKWR 936

Query: 246  KKKLKIRPVTYLSSSRFSESDVPHAYLWSPYLVPKPKDWGPKIDVVGYCFLDL--ASNYE 305
            ++ L + P T L   R  ++ VP  Y  SP + P   D+   + VVGY FLD   A +Y+
Sbjct: 937  EQVLML-PKTNLE--RLEQNKVPFLYNVSPTVFPPSMDFPHWVKVVGYWFLDEGEADSYD 996

Query: 306  PPESLVKWLEA----GDSPIYFGFGSLPVQEPDKMTQIIIQALERTGQRGIINEGWGG-- 365
            PP+ L++++E     G   +Y GFGS+ V +P ++T+ +I A+     R I+N+GW    
Sbjct: 997  PPKPLLEFMEKAKTDGKKLVYIGFGSIVVSDPKQLTEAVIDAVLSADVRCILNKGWSDRL 1056

Query: 366  ---LGRSAEPKDFIYLLDNCPHDWLFLKCKAVVHHGGAGTTAAGLKAACPTTVVPFFGDQ 425
                G   E  + IY   N PHDWLF K  A VHHGG+GTT A L+A  PT + PFFGDQ
Sbjct: 1057 GKQTGVEVELPEEIYNSGNVPHDWLFGKIDASVHHGGSGTTGATLRAGIPTIIKPFFGDQ 1116

Query: 426  PFWGDRVHARGVGPPPIPVDEFSLPRLVNAINYMLDPKVKRQAVELAKVLKNEDGVEGAV 473
             F+ +RV   GVG     ++  SL + +  +    + ++  +A E+ K +++E+GV  A+
Sbjct: 1117 FFYANRVEDIGVGIGLRKLNSKSLSKAIKEVT--TNTRIIEKAKEIGKQIQSENGVSAAI 1176

BLAST of Cp4.1LG17g09910 vs. Swiss-Prot
Match: UGT52_DICDI (UDP-sugar-dependent glycosyltransferase 52 OS=Dictyostelium discoideum GN=ugt52 PE=2 SV=1)

HSP 1 Score: 203.0 bits (515), Expect = 7.8e-51
Identity = 118/326 (36.20%), Positives = 182/326 (55.83%), Query Frame = 1

Query: 172  HVAEGLKLPLHIFFTMPWTPTTEFPHPLSRVK--QPAGY--RISYQIVDSLIWLGVRDII 231
            H+ E L++P    FTMP+T T  +P+P +     Q  G     ++ +++ ++W  +   I
Sbjct: 1280 HIGEVLQIPFFNAFTMPFTRTRTYPNPFAPFASHQMGGVFNLATHVMMEKVLWQPISGQI 1339

Query: 232  NDFRKKKLKIRPVTYLSSSRFSES-DVPHAYLWSPYLVPKPKDWGPKIDVVGYCFLDLAS 291
            N +R + LKI P  + SS   +E+  +P+ Y +S YLVPKP DW  +I + GY  L   +
Sbjct: 1340 NQWRTETLKIPP--WNSSVSINETYRMPYLYCFSKYLVPKPPDWSGEIAITGYWTLKNQA 1399

Query: 292  NYE-PPESLVKWL------EAGDSPIYFGFGSLPVQEPDKMTQIIIQALERTGQRGIINE 351
            N + PP+ L+++L      E  D PIY GFGS+ +  P  ++ ++I+A++ +G+R II++
Sbjct: 1400 NSDSPPDDLIQFLNEESSTENDDIPIYIGFGSIVIDNPTALSLLLIEAIKLSGKRAIISQ 1459

Query: 352  GWGGLG-----------------------RSAEPKDFIYLLDN-CPHDWLFLKCKAVVHH 411
            GWGGL                        +S+   + IYLL     H WLF K   V+ H
Sbjct: 1460 GWGGLSIDEHNNNNNNNNNNNNGENSDSNKSSLQSNRIYLLKKPVDHSWLFEKVSLVISH 1519

Query: 412  GGAGTTAAGLKAACPTTVVPFFGDQPFWGDRVHARGVGPPPIPVDEFSLPRL-VNAINYM 461
            GGAGT AA L AA PT VVPFFGDQ FWG+R+   G+G   IP D  +   L  + I+ +
Sbjct: 1520 GGAGTVAASLLAAKPTIVVPFFGDQFFWGERIKQTGIG-TSIPFDILTAKSLSSHIISIL 1579

BLAST of Cp4.1LG17g09910 vs. Swiss-Prot
Match: ATG26_KLULA (Sterol 3-beta-glucosyltransferase OS=Kluyveromyces lactis (strain ATCC 8585 / CBS 2359 / DSM 70799 / NBRC 1267 / NRRL Y-1140 / WM37) GN=ATG26 PE=3 SV=1)

HSP 1 Score: 202.2 bits (513), Expect = 1.3e-50
Identity = 109/311 (35.05%), Positives = 169/311 (54.34%), Query Frame = 1

Query: 172  HVAEGLKLPLHIFFTMPWTPTTEFPHPLSRVKQPAGYRISY---QIVDSLIWLGVRDIIN 231
            H+AE L++P    FTMPWT T  +PH      Q  G   +Y    + +++ W G+   +N
Sbjct: 849  HIAEALRIPYFRAFTMPWTRTRAYPHAFIVPDQKRGGNYNYFTHVLFENIFWKGISGKVN 908

Query: 232  DFRKKKLKIRPVTYLSSSRFSESDVPHAYLWSPYLVPKPKDWGPKIDVVGYCFLDLASNY 291
            ++R+ KLK+ P T L S +  ++ VP  Y  SP + P   D+   I V GY FLD   +Y
Sbjct: 909  EWRETKLKL-PKTNLVSMQ--QNRVPFLYNVSPIVFPPSVDFNEWIKVTGYWFLDEKRSY 968

Query: 292  EPPESLVKWL----EAGDSPIYFGFGSLPVQEPDKMTQIIIQALERTGQRGIINEGWGG- 351
            +PP   +++L    E     +Y GFGS+ V +P+KMT  II+A+   G   ++N+GW   
Sbjct: 969  KPPAEFMEFLNKARELKKKVVYIGFGSIVVNDPEKMTDTIIEAVRDAGVYCVLNKGWSNR 1028

Query: 352  ----LGRSAEPK--DFIYLLDNCPHDWLFLKCKAVVHHGGAGTTAAGLKAACPTTVVPFF 411
                L +  + +   +IY   + PHDWLF K  A VHHGG+GTT A L+A  PT + PFF
Sbjct: 1029 FGDPLAKKIDKELPSYIYNSGDVPHDWLFTKIDATVHHGGSGTTGASLRAGLPTIIKPFF 1088

Query: 412  GDQPFWGDRVHARGVGPPPIPVDEFSLPRLVNAINYMLDPKVKRQAVELAKVLKNEDGVE 469
            GDQ F+  RV   G G     ++  SL + +  +    + ++ ++A ++ + +  E GV 
Sbjct: 1089 GDQFFYASRVEDIGAGVALKKLNRSSLAKALKEVT--TNTRIIQKARQIGESISKEHGVA 1148

BLAST of Cp4.1LG17g09910 vs. TrEMBL
Match: A0A0A0K4H9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G336540 PE=4 SV=1)

HSP 1 Score: 656.8 bits (1693), Expect = 2.2e-185
Identity = 306/368 (83.15%), Positives = 329/368 (89.40%), Query Frame = 1

Query: 149 RLQEIIYSLLPACKQPDVDTG----------------HTHVAEGLKLPLHIFFTMPWTPT 208
           +++EIIYSLLPACK PD+DTG                HTHVAEGLKLPLHIFFTMPWTPT
Sbjct: 223 QMKEIIYSLLPACKDPDMDTGIPFEADAIIANRTAYGHTHVAEGLKLPLHIFFTMPWTPT 282

Query: 209 TEFPHPLSRVKQPAGYRISYQIVDSLIWLGVRDIINDFRKKKLKIRPVTYLSSSRFSESD 268
           +EFPHPLSRVKQ AGYR+SYQIVDSLIWLG+RDIINDFRKKKL+IRPVTYLS S+FSESD
Sbjct: 283 SEFPHPLSRVKQQAGYRLSYQIVDSLIWLGLRDIINDFRKKKLQIRPVTYLSGSQFSESD 342

Query: 269 VPHAYLWSPYLVPKPKDWGPKIDVVGYCFLDLASNYEPPESLVKWLEAGDSPIYFGFGSL 328
           VPH YLWSPY+VPKPKDWGPKIDVVGYCFLDL+SNYEPPESLVKWLEAGD P+Y GFGSL
Sbjct: 343 VPHVYLWSPYIVPKPKDWGPKIDVVGYCFLDLSSNYEPPESLVKWLEAGDKPVYIGFGSL 402

Query: 329 PVQEPDKMTQIIIQALERTGQRGIINEGWGGLGRSAEPKDFIYLLDNCPHDWLFLKCKAV 388
           PVQ+P+KMTQIIIQALE T QRGIINEGWGGLG+SAEPKDF+YLLDNCPHDWLF KCKAV
Sbjct: 403 PVQDPEKMTQIIIQALETTKQRGIINEGWGGLGKSAEPKDFLYLLDNCPHDWLFPKCKAV 462

Query: 389 VHHGGAGTTAAGLKAACPTTVVPFFGDQPFWGDRVHARGVGPPPIPVDEFSLPRLVNAIN 448
           VHHGGAGTTAAGLKAACPTT+VPFFGDQPFWG+RVH RGVGPPPIPVDEFSL RLVNAIN
Sbjct: 463 VHHGGAGTTAAGLKAACPTTIVPFFGDQPFWGERVHDRGVGPPPIPVDEFSLQRLVNAIN 522

Query: 449 YMLDPKVKRQAVELAKVLKNEDGVEGAVRAFFKQLGRRKRETEPEPEPEPEPEPQRSSLL 501
           YMLDPKVK +AV LAKVL+NEDGVEGAVRAFF+QL RRK         EPEPEPQ+S+LL
Sbjct: 523 YMLDPKVKERAVLLAKVLENEDGVEGAVRAFFRQLSRRKL--------EPEPEPQKSNLL 582

BLAST of Cp4.1LG17g09910 vs. TrEMBL
Match: M5VNV0_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002976mg PE=4 SV=1)

HSP 1 Score: 617.8 bits (1592), Expect = 1.1e-173
Identity = 286/375 (76.27%), Positives = 323/375 (86.13%), Query Frame = 1

Query: 144 IPIAK-RLQEIIYSLLPACKQPDVDTG----------------HTHVAEGLKLPLHIFFT 203
           IPI + +++EIIYSLLPACK+PD+D+G                HTHVAE LK+PLHIFFT
Sbjct: 249 IPIQRNQIKEIIYSLLPACKEPDMDSGIPFKADAIIANPPAYGHTHVAEALKIPLHIFFT 308

Query: 204 MPWTPTTEFPHPLSRVKQPAGYRISYQIVDSLIWLGVRDIINDFRKKKLKIRPVTYLSSS 263
           MPWTPT+EFPHPLSRVKQ  GYR+SYQIVDSLIWLG+RD+IND RKKKLK+RPVTYLS S
Sbjct: 309 MPWTPTSEFPHPLSRVKQSTGYRLSYQIVDSLIWLGIRDMINDVRKKKLKLRPVTYLSGS 368

Query: 264 RFSESDVPHAYLWSPYLVPKPKDWGPKIDVVGYCFLDLASNYEPPESLVKWLEAGDSPIY 323
           + S+SDVPH Y+WSP+LVPKPKDWGPK+DVVG+CFLDLASNYEPPE LVKWLEAGD PIY
Sbjct: 369 QGSDSDVPHGYIWSPHLVPKPKDWGPKVDVVGFCFLDLASNYEPPELLVKWLEAGDRPIY 428

Query: 324 FGFGSLPVQEPDKMTQIIIQALERTGQRGIINEGWGGLGRSAEPKDFIYLLDNCPHDWLF 383
            GFGSLPVQEP+KMTQII++ALE+TGQRGIIN+GWGGLG  AEPKDFIYLLDNCPHDWLF
Sbjct: 429 IGFGSLPVQEPEKMTQIIVEALEKTGQRGIINKGWGGLGNLAEPKDFIYLLDNCPHDWLF 488

Query: 384 LKCKAVVHHGGAGTTAAGLKAACPTTVVPFFGDQPFWGDRVHARGVGPPPIPVDEFSLPR 443
           L+CKAVVHHGGAGTTAAGLKAACPTT+VPFFGDQPFWG+RVHARGVGP PI VDEFSLP+
Sbjct: 489 LQCKAVVHHGGAGTTAAGLKAACPTTIVPFFGDQPFWGERVHARGVGPAPIAVDEFSLPK 548

Query: 444 LVNAINYMLDPKVKRQAVELAKVLKNEDGVEGAVRAFFKQLGRRKRETEPEPEPEPEPEP 502
           LV+AI +MLDPKVK +AVELAK ++NEDGV GAV+AFFK L  RK        P+PEPEP
Sbjct: 549 LVDAIKFMLDPKVKERAVELAKDMENEDGVTGAVKAFFKHLPCRK--------PDPEPEP 608

BLAST of Cp4.1LG17g09910 vs. TrEMBL
Match: A0A067K2C0_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_17573 PE=4 SV=1)

HSP 1 Score: 615.9 bits (1587), Expect = 4.3e-173
Identity = 283/375 (75.47%), Positives = 321/375 (85.60%), Query Frame = 1

Query: 144 IPIAK-RLQEIIYSLLPACKQPDVDTG----------------HTHVAEGLKLPLHIFFT 203
           IP+ + +++EIIYSLLPACK PD+D+G                HTHVAE LK+PLHIFFT
Sbjct: 257 IPVQRNQMKEIIYSLLPACKDPDMDSGIPFKADAIIANPPAYGHTHVAEALKIPLHIFFT 316

Query: 204 MPWTPTTEFPHPLSRVKQPAGYRISYQIVDSLIWLGVRDIINDFRKKKLKIRPVTYLSSS 263
           MPWTPT+EFPHPLSRVKQ AGYR+SYQIVDSLIWLG+RD+IND RKKKLK+RPVTYLS S
Sbjct: 317 MPWTPTSEFPHPLSRVKQAAGYRLSYQIVDSLIWLGIRDMINDVRKKKLKLRPVTYLSGS 376

Query: 264 RFSESDVPHAYLWSPYLVPKPKDWGPKIDVVGYCFLDLASNYEPPESLVKWLEAGDSPIY 323
           + SESDVPH Y+WSP+LVPKPKDWGPK+DVVG+CFLDLASNYEPPESLVKWLEAG  PIY
Sbjct: 377 QGSESDVPHGYIWSPHLVPKPKDWGPKVDVVGFCFLDLASNYEPPESLVKWLEAGPKPIY 436

Query: 324 FGFGSLPVQEPDKMTQIIIQALERTGQRGIINEGWGGLGRSAEPKDFIYLLDNCPHDWLF 383
            GFGSLPVQEP+KMTQII+ ALE+TGQRGIIN+GWGGLG  AEPKD IYLLDNCPHDWLF
Sbjct: 437 IGFGSLPVQEPEKMTQIIVDALEQTGQRGIINKGWGGLGNLAEPKDSIYLLDNCPHDWLF 496

Query: 384 LKCKAVVHHGGAGTTAAGLKAACPTTVVPFFGDQPFWGDRVHARGVGPPPIPVDEFSLPR 443
           L+CKAVVHHGGAGTTAAGLKAACPTT+VPFFGDQPFWG+RV+ARGVGP PIPVDEFSLP+
Sbjct: 497 LRCKAVVHHGGAGTTAAGLKAACPTTIVPFFGDQPFWGERVYARGVGPQPIPVDEFSLPK 556

Query: 444 LVNAINYMLDPKVKRQAVELAKVLKNEDGVEGAVRAFFKQLGRRKRETEPEPEPEPEPEP 502
           L++AI  MLDPKVK +A+ELAK ++NEDGV GAV+AFFK L        P+ +PEPEP P
Sbjct: 557 LIDAIKIMLDPKVKERAIELAKAMENEDGVTGAVKAFFKHL--------PKMKPEPEPSP 616

BLAST of Cp4.1LG17g09910 vs. TrEMBL
Match: B9IB34_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0014s17640g PE=4 SV=2)

HSP 1 Score: 615.9 bits (1587), Expect = 4.3e-173
Identity = 280/368 (76.09%), Positives = 319/368 (86.68%), Query Frame = 1

Query: 149 RLQEIIYSLLPACKQPDVDT----------------GHTHVAEGLKLPLHIFFTMPWTPT 208
           +++EIIYSLLPACK PD+D+                GHTHVAE LK+PLHIFFTMPWTPT
Sbjct: 255 QIKEIIYSLLPACKDPDIDSKIPFRADAIIANPPAYGHTHVAEALKVPLHIFFTMPWTPT 314

Query: 209 TEFPHPLSRVKQPAGYRISYQIVDSLIWLGVRDIINDFRKKKLKIRPVTYLSSSRFSESD 268
           +EFPHPLSRVKQ AGYR+SYQIVDS+IWLG+RD+IND RKKKLK+RPVTYLS S+ S+SD
Sbjct: 315 SEFPHPLSRVKQSAGYRLSYQIVDSMIWLGIRDMINDLRKKKLKLRPVTYLSGSQGSDSD 374

Query: 269 VPHAYLWSPYLVPKPKDWGPKIDVVGYCFLDLASNYEPPESLVKWLEAGDSPIYFGFGSL 328
           VP+ Y+WSP+L PKPKDWGPKIDVVG+CFLDLASNYEPPE L+KWLEAG  PIY GFGSL
Sbjct: 375 VPYGYIWSPHLAPKPKDWGPKIDVVGFCFLDLASNYEPPEPLLKWLEAGQKPIYIGFGSL 434

Query: 329 PVQEPDKMTQIIIQALERTGQRGIINEGWGGLGRSAEPKDFIYLLDNCPHDWLFLKCKAV 388
           PVQEP+KMTQ I++ALE+TGQRGIIN+GWGGLG  AEPKDFIYLLDNCPHDWLFL+CKAV
Sbjct: 435 PVQEPEKMTQTIVEALEQTGQRGIINKGWGGLGNLAEPKDFIYLLDNCPHDWLFLQCKAV 494

Query: 389 VHHGGAGTTAAGLKAACPTTVVPFFGDQPFWGDRVHARGVGPPPIPVDEFSLPRLVNAIN 448
           VHHGGAGTTAAGLKAACPTT+VPFFGDQPFWG+R+HARGVGPPPIPVDEFSL +LV AI+
Sbjct: 495 VHHGGAGTTAAGLKAACPTTIVPFFGDQPFWGERLHARGVGPPPIPVDEFSLTKLVEAIH 554

Query: 449 YMLDPKVKRQAVELAKVLKNEDGVEGAVRAFFKQLGRRKRETEPEPEPEPEPEPQRSSLL 501
           +MLDPKVK +AVELAK ++NEDGV+GAV+AFFK L R+K    PEPEPE EP  + SS+ 
Sbjct: 555 FMLDPKVKERAVELAKDMENEDGVDGAVKAFFKHLPRKK----PEPEPESEPSTEPSSIF 614

BLAST of Cp4.1LG17g09910 vs. TrEMBL
Match: K7KDS5_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_03G083100 PE=4 SV=1)

HSP 1 Score: 614.4 bits (1583), Expect = 1.2e-172
Identity = 282/375 (75.20%), Positives = 321/375 (85.60%), Query Frame = 1

Query: 144 IPIAK-RLQEIIYSLLPACKQPDVDTG----------------HTHVAEGLKLPLHIFFT 203
           IPI + +++EII SLLPACK+PD+D+G                HTHVAE LK+P+HIFFT
Sbjct: 226 IPIQRNQMKEIINSLLPACKEPDIDSGVPFKADAIIANPPAYGHTHVAEALKIPIHIFFT 285

Query: 204 MPWTPTTEFPHPLSRVKQPAGYRISYQIVDSLIWLGVRDIINDFRKKKLKIRPVTYLSSS 263
           MPWTPTTEFPHPLSRVKQ AGYR+SYQIVDSLIWLG+RD+IND RKKKLK+RPVTYLS S
Sbjct: 286 MPWTPTTEFPHPLSRVKQQAGYRLSYQIVDSLIWLGIRDMINDLRKKKLKLRPVTYLSGS 345

Query: 264 RFSESDVPHAYLWSPYLVPKPKDWGPKIDVVGYCFLDLASNYEPPESLVKWLEAGDSPIY 323
           + SE+DVPHAY+WSP+LVPKPKDWGPKIDVVG+CFLDLASNYEPPESLVKWLE GD PIY
Sbjct: 346 QGSETDVPHAYIWSPHLVPKPKDWGPKIDVVGFCFLDLASNYEPPESLVKWLEEGDKPIY 405

Query: 324 FGFGSLPVQEPDKMTQIIIQALERTGQRGIINEGWGGLGRSAEPKDFIYLLDNCPHDWLF 383
            GFGSLPVQEP +MTQII+ ALE TGQRGIIN+GWGGLG  AEPKD IYLLDNCPHDWLF
Sbjct: 406 IGFGSLPVQEPKRMTQIIVDALEITGQRGIINKGWGGLGNLAEPKDSIYLLDNCPHDWLF 465

Query: 384 LKCKAVVHHGGAGTTAAGLKAACPTTVVPFFGDQPFWGDRVHARGVGPPPIPVDEFSLPR 443
           L+CKAVVHHGGAGTTAAGLKAACPTT+VPFFGDQPFWG+RVH RGVGPPPIPVDEFSLP+
Sbjct: 466 LRCKAVVHHGGAGTTAAGLKAACPTTIVPFFGDQPFWGERVHVRGVGPPPIPVDEFSLPK 525

Query: 444 LVNAINYMLDPKVKRQAVELAKVLKNEDGVEGAVRAFFKQLGRRKRETEPEPEPEPEPEP 502
           LV+A+  MLDPKVK +A+ELAK ++NEDGV GAV+AFFKQL        P+ +PEP+ +P
Sbjct: 526 LVDALKLMLDPKVKERAIELAKAMENEDGVTGAVKAFFKQL--------PQKKPEPDADP 585

BLAST of Cp4.1LG17g09910 vs. TAIR10
Match: AT3G07020.2 (AT3G07020.2 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 577.8 bits (1488), Expect = 6.5e-165
Identity = 266/375 (70.93%), Positives = 311/375 (82.93%), Query Frame = 1

Query: 144 IPIAK-RLQEIIYSLLPACKQPDVDTG----------------HTHVAEGLKLPLHIFFT 203
           IPI + ++++IIYSLLPACK+PD D+G                HTHVAE LK+P+H+FFT
Sbjct: 270 IPIQRNQMKDIIYSLLPACKEPDPDSGISFKADAIIANPPAYGHTHVAEALKIPIHVFFT 329

Query: 204 MPWTPTTEFPHPLSRVKQPAGYRISYQIVDSLIWLGVRDIINDFRKKKLKIRPVTYLSSS 263
           MPWTPT+EFPHPLSRVKQPAGYR+SYQIVDSLIWLG+RD++ND RKKKLK+RPVTYLS +
Sbjct: 330 MPWTPTSEFPHPLSRVKQPAGYRLSYQIVDSLIWLGIRDMVNDLRKKKLKLRPVTYLSGT 389

Query: 264 RFSESDVPHAYLWSPYLVPKPKDWGPKIDVVGYCFLDLASNYEPPESLVKWLEAGDSPIY 323
           + S S++PH Y+WSP+LVPKPKDWGP+IDVVG+C+LDLASNYEPP  LV+WLEAGD PIY
Sbjct: 390 QGSGSNIPHGYMWSPHLVPKPKDWGPQIDVVGFCYLDLASNYEPPAELVEWLEAGDKPIY 449

Query: 324 FGFGSLPVQEPDKMTQIIIQALERTGQRGIINEGWGGLGRSAEPKDFIYLLDNCPHDWLF 383
            GFGSLPVQEP+KMT+II++AL+RT QRGIIN+GWGGLG   EPKDF+YLLDN PHDWLF
Sbjct: 450 IGFGSLPVQEPEKMTEIIVEALQRTKQRGIINKGWGGLGNLKEPKDFVYLLDNVPHDWLF 509

Query: 384 LKCKAVVHHGGAGTTAAGLKAACPTTVVPFFGDQPFWGDRVHARGVGPPPIPVDEFSLPR 443
            +CKAVVHHGGAGTTAAGLKA+CPTT+VPFFGDQPFWG+RVHARGVGP PIPVDEFSL +
Sbjct: 510 PRCKAVVHHGGAGTTAAGLKASCPTTIVPFFGDQPFWGERVHARGVGPSPIPVDEFSLHK 569

Query: 444 LVNAINYMLDPKVKRQAVELAKVLKNEDGVEGAVRAFFKQLGRRKRETEPEPEPEPEPEP 502
           L +AIN+MLD KVK  A  LAK +K+EDGV GAV+AFFK L   K+        +P PEP
Sbjct: 570 LEDAINFMLDDKVKSSAETLAKAMKDEDGVAGAVKAFFKHLPSAKQNIS-----DPIPEP 629

BLAST of Cp4.1LG17g09910 vs. TAIR10
Match: AT1G43620.1 (AT1G43620.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 425.6 bits (1093), Expect = 4.1e-119
Identity = 194/362 (53.59%), Positives = 253/362 (69.89%), Query Frame = 1

Query: 148 KRLQEIIYSLLPACKQPDVDT----------------GHTHVAEGLKLPLHIFFTMPWTP 207
           K+L+ II SLLPAC +PD++T                GH HVAE L +P+HIFFTMPWTP
Sbjct: 238 KQLKAIIESLLPACIEPDLETATSFRAQAIIANPPAYGHVHVAEALGVPIHIFFTMPWTP 297

Query: 208 TTEFPHPLSRVKQPAGYRISYQIVDSLIWLGVRDIINDFRKKKLKIRPVTYLSSSRFSES 267
           T EFPHPL+RV Q A Y +SY +VD ++W  +R  INDFRK+KL + P+ Y S+   S S
Sbjct: 298 TNEFPHPLARVPQSAAYWLSYIVVDLMVWWSIRTYINDFRKRKLNLAPIAYFSTYHGSIS 357

Query: 268 DVPHAYLWSPYLVPKPKDWGPKIDVVGYCFLDLASNYEPPESLVKWLEAGDSPIYFGFGS 327
            +P  Y+WSP++VPKP DWGP +DVVGYCFL+L S Y+P E  + W+E G  P+Y GFGS
Sbjct: 358 HLPTGYMWSPHVVPKPSDWGPLVDVVGYCFLNLGSKYQPREEFLHWIERGSPPVYIGFGS 417

Query: 328 LPVQEPDKMTQIIIQALERTGQRGIINEGWGGLGRSA-EPKDFIYLLDNCPHDWLFLKCK 387
           +P+ +P +   II++ L+ T QRGI++ GWGGLG  A E  + ++L+++CPHDWLF +C 
Sbjct: 418 MPLDDPKQTMDIILETLKDTEQRGIVDRGWGGLGNLATEVPENVFLVEDCPHDWLFPQCS 477

Query: 388 AVVHHGGAGTTAAGLKAACPTTVVPFFGDQPFWGDRVHARGVGPPPIPVDEFSLPRLVNA 447
           AVVHHGGAGTTA GLKA CPTT+VPFFGDQ FWGDR++ +G+GP PIP+ + S+  L ++
Sbjct: 478 AVVHHGGAGTTATGLKAGCPTTIVPFFGDQFFWGDRIYEKGLGPAPIPIAQLSVENLSSS 537

Query: 448 INYMLDPKVKRQAVELAKVLKNEDGVEGAVRAFFKQLGRRKRETEPEPEPEPEPEPQRSS 493
           I +ML P+VK Q +ELAKVL+NEDGV  AV AF + L        PE   E + E  R  
Sbjct: 538 IRFMLQPEVKSQVMELAKVLENEDGVAAAVDAFHRHL--PPELPLPESSSEKKDEDDRPD 597
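
The Identity and Positives percentages in an HSP header are the matched-residue counts divided by the alignment length (for this match, 194/362 and 253/362). The snippet below is a minimal sketch, assuming the header layout shown on this page; `parse_hsp_stats` is a hypothetical helper and not part of the database or of BLAST itself.

```python
import re

# Hypothetical helper (an assumption for illustration, not a database API).
# The pattern accepts both "Positives" and the "Postives" spelling that
# some report pages emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Parse one HSP statistics line and recompute its percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "length": int(length),
        # Recomputed from the counts, rounded to two decimals as in the report.
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positives_pct": round(100 * int(pos) / int(pos_length), 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }

stats = parse_hsp_stats(
    "Identity = 194/362 (53.59%), Positives = 253/362 (69.89%), Query Frame = 1"
)
```

Comparing the recomputed and reported fields is a quick sanity check when copying HSP statistics into a spreadsheet or pipeline.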

BLAST of Cp4.1LG17g09910 vs. TAIR10
Match: AT5G24750.1 (AT5G24750.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 80.9 bits (198), Expect = 2.5e-15
Identity = 40/120 (33.33%), Positives = 64/120 (53.33%), Query Frame = 1

Query: 361 PHDWLFLKCKAVVHHGGAGTTAAGLKAACPTTVVPFFGDQPFWGDRVHARGVGPPPIPVD 420
           P++W+F  C A +HHGG+G+ AA L+A  P  + PF  DQ +W +++   GV P P+  +
Sbjct: 391 PYNWMFRTCAAAIHHGGSGSVAAALQAGIPQIICPFMLDQFYWAEKMSWLGVAPQPLKRN 450

Query: 421 EFSLPR-------------LVNAINYMLDPKVKRQAVELAKVLKNEDGVEGAVRAFFKQL 468
              L               +  AI   L  K + +A+E+A++L  EDGV  AVR   +++
Sbjct: 451 HLLLEDSNDEKNITEAAQVVAKAIYDALSAKTRARAMEIAEILSLEDGVTEAVRVLREEV 510

BLAST of Cp4.1LG17g09910 vs. NCBI nr
Match: gi|659116212|ref|XP_008457966.1| (PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2-like isoform X2 [Cucumis melo])

HSP 1 Score: 812.0 bits (2096), Expect = 5.8e-232
Identity = 403/549 (73.41%), Positives = 438/549 (79.78%), Query Frame = 1

Query: 5   GFGVPVHHQTSPPGASCRTLLKANTLHVEIPSHSTESVSREHRLQRSKTETHRRETIFAA 64
           GFG    HQT+PPG   R+  K+N+LHVE+ S + E  S +HRLQRSKTE  R ETIFA 
Sbjct: 10  GFGPQYDHQTTPPGVPGRSFSKSNSLHVELSSDTLEPTSFQHRLQRSKTERLRHETIFAE 69

Query: 65  DALQIYDHKIPIHRKFKLLQRVATVKDDGSVEFEVEGNIESESINVES--------DEPL 124
           DA QI D KIPI +K KLL RV TVKDDGSVEFEV  +IES SINVES        DEPL
Sbjct: 70  DAAQILDIKIPIDQKIKLLHRVTTVKDDGSVEFEVPEDIESLSINVESEEIFSNVDDEPL 129

Query: 125 DEAEFEYIRPMQIVMLIVGTRGDVQPFIPIAKRLQEIIY--------------------- 184
           D ++F+YIRPMQIV+LIVGTRGDVQPFIPI KRLQ+  +                     
Sbjct: 130 DTSDFQYIRPMQIVILIVGTRGDVQPFIPIGKRLQDYGHRVRLATHPNFKEFVLLAGLEF 189

Query: 185 --------SLLPACKQPDVDTG----------------HTHVAEGLKLPLHIFFTMPWTP 244
                    L  +CK PDVDTG                HTHVAEGLKLPLHIFFTMPWTP
Sbjct: 190 YPLGGDPKQLAASCKNPDVDTGIPFEADAIIANRTAYGHTHVAEGLKLPLHIFFTMPWTP 249

Query: 245 TTEFPHPLSRVKQPAGYRISYQIVDSLIWLGVRDIINDFRKKKLKIRPVTYLSSSRFSES 304
           T+EFPHPLSRVKQ AGYR+SYQIVDSLIWLG+RDIINDFRKKK++IRPVTYLS S+FSES
Sbjct: 250 TSEFPHPLSRVKQQAGYRLSYQIVDSLIWLGLRDIINDFRKKKMQIRPVTYLSGSQFSES 309

Query: 305 DVPHAYLWSPYLVPKPKDWGPKIDVVGYCFLDLASNYEPPESLVKWLEAGDSPIYFGFGS 364
           DVPH YLWSPYLVPKPKDWGPKIDVVGYCFLDLAS+YEPPE+LVKWLEAGD P+Y GFGS
Sbjct: 310 DVPHVYLWSPYLVPKPKDWGPKIDVVGYCFLDLASSYEPPETLVKWLEAGDKPVYIGFGS 369

Query: 365 LPVQEPDKMTQIIIQALERTGQRGIINEGWGGLGRSAEPKDFIYLLDNCPHDWLFLKCKA 424
           LPVQ+P+KMTQIIIQALE T QRGIINEGWGGLG+SAEPKDF+YLLDNCPHDWLF KCKA
Sbjct: 370 LPVQDPEKMTQIIIQALETTKQRGIINEGWGGLGKSAEPKDFLYLLDNCPHDWLFPKCKA 429

Query: 425 VVHHGGAGTTAAGLKAACPTTVVPFFGDQPFWGDRVHARGVGPPPIPVDEFSLPRLVNAI 484
           VVHHGGAGTTAAGLKAACPTTVVPFFGDQPFWG+RVHARGVGPPPIPVDEFSLPRLVNAI
Sbjct: 430 VVHHGGAGTTAAGLKAACPTTVVPFFGDQPFWGERVHARGVGPPPIPVDEFSLPRLVNAI 489

Query: 485 NYMLDPKVKRQAVELAKVLKNEDGVEGAVRAFFKQLGRRKRETEPEPEPEPEPEPQRSSL 501
           NYMLDPKVK +AV LAK L+NEDGVEGAVRAFF+QL RRK         EPEP+PQ+SSL
Sbjct: 490 NYMLDPKVKERAVSLAKALENEDGVEGAVRAFFRQLSRRKL--------EPEPQPQKSSL 549

BLAST of Cp4.1LG17g09910 vs. NCBI nr
Match: gi|571546115|ref|XP_006602441.1| (PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2-like isoform X3 [Glycine max])

HSP 1 Score: 698.4 bits (1801), Expect = 9.4e-198
Identity = 342/515 (66.41%), Positives = 396/515 (76.89%), Query Frame = 1

Query: 14  TSPPGASCRTLLKANTLHVEIPSHSTESVSREHRLQRSKTETHRRETIFAADALQIYDHK 73
           +S  G   + L K  TL  +I    +ES S + +++RSKTE  R   +   DA QI+D K
Sbjct: 32  SSASGILGKGLSKVTTLPADISQDKSESSSSKFKMERSKTERQRH--LSPEDAAQIFDDK 91

Query: 74  IPIHRKFKLLQRVATVKDDGSVEFEVEGNIESESINVES-------DEPLDEAEFEYIRP 133
           IPI  K KLL R+ATVKDDG+VEFEV  ++E E+I   S       D+ LD  +F YI P
Sbjct: 92  IPIQEKLKLLNRIATVKDDGTVEFEVPVDVEPEAIFARSKQVNHVVDDSLDATDFHYIPP 151

Query: 134 MQIVMLIVGTRGDVQPFIPIAKRLQEIIYSL-------------------LPACKQPDVD 193
           + IVMLIVGTRGDVQPFI I KR+Q+  + +                    P    P V 
Sbjct: 152 LNIVMLIVGTRGDVQPFIAIGKRMQDYGHRVRLATHSNFKEFVLTAGLEFYPLGGDPKVL 211

Query: 194 TG-HTHVAEGLKLPLHIFFTMPWTPTTEFPHPLSRVKQPAGYRISYQIVDSLIWLGVRDI 253
            G HTHVAE LK+P+HIFFTMPWTPTTEFPHPLSRVKQ AGYR+SYQIVDSLIWLG+RD+
Sbjct: 212 AGWHTHVAEALKIPIHIFFTMPWTPTTEFPHPLSRVKQQAGYRLSYQIVDSLIWLGIRDM 271

Query: 254 INDFRKKKLKIRPVTYLSSSRFSESDVPHAYLWSPYLVPKPKDWGPKIDVVGYCFLDLAS 313
           IND RKKKLK+RPVTYLS S+ SE+DVPHAY+WSP+LVPKPKDWGPKIDVVG+CFLDLA 
Sbjct: 272 INDLRKKKLKLRPVTYLSGSQGSETDVPHAYIWSPHLVPKPKDWGPKIDVVGFCFLDLAL 331

Query: 314 NYEPPESLVKWLEAGDSPIYFGFGSLPVQEPDKMTQIIIQALERTGQRGIINEGWGGLGR 373
           NYEPPESLVKWLE GD PIY GFGSLPVQEP KMTQII+ ALE TGQRGIIN+GWGGLG 
Sbjct: 332 NYEPPESLVKWLEEGDKPIYIGFGSLPVQEPKKMTQIIVDALEITGQRGIINKGWGGLGN 391

Query: 374 SAEPKDFIYLLDNCPHDWLFLKCKAVVHHGGAGTTAAGLKAACPTTVVPFFGDQPFWGDR 433
            AEPKD IYLLDNCPHDWLFL+CKAVVHHGGAGTTAAGLKAACPTT+VPFFGDQPFWG+R
Sbjct: 392 LAEPKDSIYLLDNCPHDWLFLRCKAVVHHGGAGTTAAGLKAACPTTIVPFFGDQPFWGER 451

Query: 434 VHARGVGPPPIPVDEFSLPRLVNAINYMLDPKVKRQAVELAKVLKNEDGVEGAVRAFFKQ 493
           VHARGVGPPPIPVDEFSLP+LV+AI  MLDPKVK +A+ELAK ++NEDGV GAV+AFFKQ
Sbjct: 452 VHARGVGPPPIPVDEFSLPKLVDAIKLMLDPKVKERAIELAKAMENEDGVTGAVKAFFKQ 511

Query: 494 LGRRKRETEPEPEPEPEPEPQRSSLLFIRKCFGCS 502
           L ++K E++ +P+P        +    +R+CFGCS
Sbjct: 512 LPQKKSESDADPQP--------TGFFSVRRCFGCS 536

BLAST of Cp4.1LG17g09910 vs. NCBI nr
Match: gi|659116210|ref|XP_008457965.1| (PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2-like isoform X1 [Cucumis melo])

HSP 1 Score: 662.5 bits (1708), Expect = 5.7e-187
Identity = 315/399 (78.95%), Positives = 340/399 (85.21%), Query Frame = 1

Query: 126 PMQIVMLIVGTRGDVQPFIPI--------AKRLQEIIYSLLPACKQPDVDTG-------- 185
           P Q+   +V  RG    F+P           +++EIIYSLLPACK PDVDTG        
Sbjct: 196 PKQLAAYMVRNRG----FLPSWPSEILIQRNQMKEIIYSLLPACKNPDVDTGIPFEADAI 255

Query: 186 --------HTHVAEGLKLPLHIFFTMPWTPTTEFPHPLSRVKQPAGYRISYQIVDSLIWL 245
                   HTHVAEGLKLPLHIFFTMPWTPT+EFPHPLSRVKQ AGYR+SYQIVDSLIWL
Sbjct: 256 IANRTAYGHTHVAEGLKLPLHIFFTMPWTPTSEFPHPLSRVKQQAGYRLSYQIVDSLIWL 315

Query: 246 GVRDIINDFRKKKLKIRPVTYLSSSRFSESDVPHAYLWSPYLVPKPKDWGPKIDVVGYCF 305
           G+RDIINDFRKKK++IRPVTYLS S+FSESDVPH YLWSPYLVPKPKDWGPKIDVVGYCF
Sbjct: 316 GLRDIINDFRKKKMQIRPVTYLSGSQFSESDVPHVYLWSPYLVPKPKDWGPKIDVVGYCF 375

Query: 306 LDLASNYEPPESLVKWLEAGDSPIYFGFGSLPVQEPDKMTQIIIQALERTGQRGIINEGW 365
           LDLAS+YEPPE+LVKWLEAGD P+Y GFGSLPVQ+P+KMTQIIIQALE T QRGIINEGW
Sbjct: 376 LDLASSYEPPETLVKWLEAGDKPVYIGFGSLPVQDPEKMTQIIIQALETTKQRGIINEGW 435

Query: 366 GGLGRSAEPKDFIYLLDNCPHDWLFLKCKAVVHHGGAGTTAAGLKAACPTTVVPFFGDQP 425
           GGLG+SAEPKDF+YLLDNCPHDWLF KCKAVVHHGGAGTTAAGLKAACPTTVVPFFGDQP
Sbjct: 436 GGLGKSAEPKDFLYLLDNCPHDWLFPKCKAVVHHGGAGTTAAGLKAACPTTVVPFFGDQP 495

Query: 426 FWGDRVHARGVGPPPIPVDEFSLPRLVNAINYMLDPKVKRQAVELAKVLKNEDGVEGAVR 485
           FWG+RVHARGVGPPPIPVDEFSLPRLVNAINYMLDPKVK +AV LAK L+NEDGVEGAVR
Sbjct: 496 FWGERVHARGVGPPPIPVDEFSLPRLVNAINYMLDPKVKERAVSLAKALENEDGVEGAVR 555

Query: 486 AFFKQLGRRKRETEPEPEPEPEPEPQRSSLLFIRKCFGC 501
           AFF+QL RRK         EPEP+PQ+SSLLFIRKCFGC
Sbjct: 556 AFFRQLSRRKL--------EPEPQPQKSSLLFIRKCFGC 582

BLAST of Cp4.1LG17g09910 vs. NCBI nr
Match: gi|449443905|ref|XP_004139716.1| (PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2-like [Cucumis sativus])

HSP 1 Score: 656.8 bits (1693), Expect = 3.1e-185
Identity = 306/368 (83.15%), Positives = 329/368 (89.40%), Query Frame = 1

Query: 149 RLQEIIYSLLPACKQPDVDTG----------------HTHVAEGLKLPLHIFFTMPWTPT 208
           +++EIIYSLLPACK PD+DTG                HTHVAEGLKLPLHIFFTMPWTPT
Sbjct: 223 QMKEIIYSLLPACKDPDMDTGIPFEADAIIANRTAYGHTHVAEGLKLPLHIFFTMPWTPT 282

Query: 209 TEFPHPLSRVKQPAGYRISYQIVDSLIWLGVRDIINDFRKKKLKIRPVTYLSSSRFSESD 268
           +EFPHPLSRVKQ AGYR+SYQIVDSLIWLG+RDIINDFRKKKL+IRPVTYLS S+FSESD
Sbjct: 283 SEFPHPLSRVKQQAGYRLSYQIVDSLIWLGLRDIINDFRKKKLQIRPVTYLSGSQFSESD 342

Query: 269 VPHAYLWSPYLVPKPKDWGPKIDVVGYCFLDLASNYEPPESLVKWLEAGDSPIYFGFGSL 328
           VPH YLWSPY+VPKPKDWGPKIDVVGYCFLDL+SNYEPPESLVKWLEAGD P+Y GFGSL
Sbjct: 343 VPHVYLWSPYIVPKPKDWGPKIDVVGYCFLDLSSNYEPPESLVKWLEAGDKPVYIGFGSL 402

Query: 329 PVQEPDKMTQIIIQALERTGQRGIINEGWGGLGRSAEPKDFIYLLDNCPHDWLFLKCKAV 388
           PVQ+P+KMTQIIIQALE T QRGIINEGWGGLG+SAEPKDF+YLLDNCPHDWLF KCKAV
Sbjct: 403 PVQDPEKMTQIIIQALETTKQRGIINEGWGGLGKSAEPKDFLYLLDNCPHDWLFPKCKAV 462

Query: 389 VHHGGAGTTAAGLKAACPTTVVPFFGDQPFWGDRVHARGVGPPPIPVDEFSLPRLVNAIN 448
           VHHGGAGTTAAGLKAACPTT+VPFFGDQPFWG+RVH RGVGPPPIPVDEFSL RLVNAIN
Sbjct: 463 VHHGGAGTTAAGLKAACPTTIVPFFGDQPFWGERVHDRGVGPPPIPVDEFSLQRLVNAIN 522

Query: 449 YMLDPKVKRQAVELAKVLKNEDGVEGAVRAFFKQLGRRKRETEPEPEPEPEPEPQRSSLL 501
           YMLDPKVK +AV LAKVL+NEDGVEGAVRAFF+QL RRK         EPEPEPQ+S+LL
Sbjct: 523 YMLDPKVKERAVLLAKVLENEDGVEGAVRAFFRQLSRRKL--------EPEPEPQKSNLL 582

BLAST of Cp4.1LG17g09910 vs. NCBI nr
Match: gi|659092990|ref|XP_008447326.1| (PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2 isoform X3 [Cucumis melo])

HSP 1 Score: 639.0 bits (1647), Expect = 6.8e-180
Identity = 312/506 (61.66%), Positives = 375/506 (74.11%), Query Frame = 1

Query: 17  PGASCRTLLKANTLHVEIPS-HSTESVSREHRLQRSKTETHRRETIFAADALQIYDHKIP 76
           PG S +TLLK NT+ ++  +    ES S +H+L+RSKTE H+       +A +I+D KIP
Sbjct: 61  PGISGKTLLKVNTMPIQTSNIDQLESDSSQHKLERSKTEVHKHNKFLPEEAAKIFDDKIP 120

Query: 77  IHRKFKLLQRVATVKDDGSVEFEVEGNIESESINVESDEPLDEAEFEYIRPMQIVMLIVG 136
           +HRK     RV         EF +   +E  ++  +              P  +   +V 
Sbjct: 121 VHRK-DYGHRVRLATHSNFKEFVLTAGLEFFALGGD--------------PKILAGYMVK 180

Query: 137 TRGDVQPF---IPIAK-RLQEIIYSLLPACKQPDVDTG----------------HTHVAE 196
            +G +      IP+ + +++EIIYSLLPACK PD ++G                HTHVAE
Sbjct: 181 NKGFLPSGPSEIPVQRNQMKEIIYSLLPACKDPDPESGIPFEAEAIIANPPAYGHTHVAE 240

Query: 197 GLKLPLHIFFTMPWTPTTEFPHPLSRVKQPAGYRISYQIVDSLIWLGVRDIINDFRKKKL 256
            LK+P+HIFFTMPWTPT+EFPHPLSRVKQ AGYR+SYQIVDSLIWLG+RD+IND RKK+L
Sbjct: 241 ALKIPIHIFFTMPWTPTSEFPHPLSRVKQQAGYRLSYQIVDSLIWLGIRDMINDLRKKRL 300

Query: 257 KIRPVTYLSSSRFSESDVPHAYLWSPYLVPKPKDWGPKIDVVGYCFLDLASNYEPPESLV 316
           K+RPVTYLS S  SES+VPH Y+WSP+LVPKPKDWGPK+DVVG+CFLDLASNYEPPESLV
Sbjct: 301 KLRPVTYLSGSHASESNVPHGYIWSPHLVPKPKDWGPKVDVVGFCFLDLASNYEPPESLV 360

Query: 317 KWLEAGDSPIYFGFGSLPVQEPDKMTQIIIQALERTGQRGIINEGWGGLGRSAEPKDFIY 376
            WL+AGD PIY GFGSLPVQEP KMTQII++ALE TGQRGIIN+GWGGLG   EPKDF+Y
Sbjct: 361 NWLKAGDKPIYIGFGSLPVQEPAKMTQIIVKALESTGQRGIINKGWGGLGNLEEPKDFVY 420

Query: 377 LLDNCPHDWLFLKCKAVVHHGGAGTTAAGLKAACPTTVVPFFGDQPFWGDRVHARGVGPP 436
           LLDNCPHDWLFL+CKAVVHHGGAGTTAAGLKAACPTT++PFFGDQPFWG+RVHARGVGP 
Sbjct: 421 LLDNCPHDWLFLQCKAVVHHGGAGTTAAGLKAACPTTIIPFFGDQPFWGERVHARGVGPA 480

Query: 437 PIPVDEFSLPRLVNAINYMLDPKVKRQAVELAKVLKNEDGVEGAVRAFFKQLGRRKRETE 496
           PIPV+EFS  +LV AIN+MLDPKVK+ A+ELAK ++NEDGVEGAV+AFFK    +K E E
Sbjct: 481 PIPVEEFSFNKLVEAINFMLDPKVKQSALELAKAMENEDGVEGAVKAFFKHYRPKKAEQE 540

Query: 497 PEPEPEPEPEPQRSSLLFIRKCFGCS 502
            EPE         S++  IR+CFGCS
Sbjct: 541 SEPED--------STVFSIRRCFGCS 543

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
U80A2_ARATH  1.2e-163  70.93     Sterol 3-beta-glucosyltransferase UGT80A2 OS=Arabidopsis thaliana GN=UGT80A2 PE=...
U80B1_ARATH  7.4e-118  53.59     Sterol 3-beta-glucosyltransferase UGT80B1 OS=Arabidopsis thaliana GN=UGT80B1 PE=...
ATG26_PICPG  5.4e-52   33.33     Sterol 3-beta-glucosyltransferase OS=Komagataella pastoris (strain GS115 / ATCC ...
UGT52_DICDI  7.8e-51   36.20     UDP-sugar-dependent glycosyltransferase 52 OS=Dictyostelium discoideum GN=ugt52 ...
ATG26_KLULA  1.3e-50   35.05     Sterol 3-beta-glucosyltransferase OS=Kluyveromyces lactis (strain ATCC 8585 / CB...
Match Name        E-value   Identity  Description
A0A0A0K4H9_CUCSA  2.2e-185  83.15     Uncharacterized protein OS=Cucumis sativus GN=Csa_7G336540 PE=4 SV=1
M5VNV0_PRUPE      1.1e-173  76.27     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002976mg PE=4 SV=1
A0A067K2C0_JATCU  4.3e-173  75.47     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_17573 PE=4 SV=1
B9IB34_POPTR      4.3e-173  76.09     Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0014s17640g PE=4 SV=2
K7KDS5_SOYBN      1.2e-172  75.20     Uncharacterized protein OS=Glycine max GN=GLYMA_03G083100 PE=4 SV=1
Match Name   E-value   Identity  Description
AT3G07020.2  6.5e-165  70.93     UDP-Glycosyltransferase superfamily protein
AT1G43620.1  4.1e-119  53.59     UDP-Glycosyltransferase superfamily protein
AT5G24750.1  2.5e-15   33.33     UDP-Glycosyltransferase superfamily protein
Match Name                        E-value   Identity  Description
gi|659116212|ref|XP_008457966.1|  5.8e-232  73.41     PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2-like isoform X2 [Cucumis me...
gi|571546115|ref|XP_006602441.1|  9.4e-198  66.41     PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2-like isoform X3 [Glycine ma...
gi|659116210|ref|XP_008457965.1|  5.7e-187  78.95     PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2-like isoform X1 [Cucumis me...
gi|449443905|ref|XP_004139716.1|  3.1e-185  83.15     PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2-like [Cucumis sativus]
gi|659092990|ref|XP_008447326.1|  6.8e-180  61.66     PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2 isoform X3 [Cucumis melo]
The following terms have been associated with this gene:
Vocabulary: Biological Process
Term        Definition
GO:0008152  metabolic process
Vocabulary: Molecular Function
Term        Definition
GO:0016758  transferase activity, transferring hexosyl groups
Vocabulary: INTERPRO
Term        Definition
IPR002213   UDP_glucos_trans
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005975 carbohydrate metabolic process
biological_process GO:0006629 lipid metabolic process
biological_process GO:0030259 lipid glycosylation
biological_process GO:0048316 seed development
biological_process GO:0016125 sterol metabolic process
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
cellular_component GO:0005886 plasma membrane
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0016758 transferase activity, transferring hexosyl groups
molecular_function GO:0051507 beta-sitosterol UDP-glucosyltransferase activity
molecular_function GO:0080043 quercetin 3-O-glucosyltransferase activity
molecular_function GO:0080044 quercetin 7-O-glucosyltransferase activity
molecular_function GO:0016906 sterol 3-beta-glucosyltransferase activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG17g09910.1  Cp4.1LG17g09910.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                           Source   Source Term         Source Description                             Alignment
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  PANTHER  PTHR11926           GLUCOSYL/GLUCURONOSYL TRANSFERASES             coord: 124..496, score: 4.5E
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  PFAM     PF00201             UDPGT                                          coord: 305..452, score: 3.
None       No IPR available                          GENE3D   G3DSA:3.40.50.2000                                                 coord: 127..152, score: 2.4E-7; coord: 286..448, score: 5.1
None       No IPR available                          PANTHER  PTHR11926:SF308     STEROL 3-BETA-GLUCOSYLTRANSFERASE UGT80A2      coord: 124..496, score: 4.5E
None       No IPR available                          unknown  SSF53756            UDP-Glycosyltransferase/glycogen phosphorylase  coord: 127..463, score: 2.2

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG17g09910  Cp4.1LG15g01320  Cucurbita pepo (Zucchini)  cpecpeB258
The following block(s) are covering this gene:
Gene             Organism                    Block
Cp4.1LG17g09910  Wax gourd                   cpewgoB0427
Cp4.1LG17g09910  Cucurbita pepo (Zucchini)   cpecpeB165
Cp4.1LG17g09910  Cucurbita maxima (Rimu)     cmacpeB385
Cp4.1LG17g09910  Cucurbita moschata (Rifu)   cmocpeB348
Cp4.1LG17g09910  Cucumber (Chinese Long) v2  cpecuB331
Cp4.1LG17g09910  Melon (DHL92) v3.5.1        cpemeB287
Cp4.1LG17g09910  Silver-seed gourd           carcpeB0598