Cla002739 (gene) Watermelon (97103) v1

NameCla002739
Typegene
OrganismCitrullus. lanatus (Watermelon (97103) v1)
DescriptionSterol 3-beta-glucosyltransferase (AHRD V1 ***- B6U4Q7_MAIZE); contains Interpro domain(s) IPR004276 Glycosyl transferase, family 28
LocationChr7 : 489982 .. 497941 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGATGGTATAAAGAAGGTGCAATCAATCATTTTTATTTAATATATATGAGGCAATATTTGTTCTCATGGGGCAATTACAATGTTTGCAGGATTATGGTCATCGTGTTAGACTTGCTACTCATTCAAATTTCAAGGAGTTTGTTTTAACTGCTGGGCTGGAGTTTTTTGCTCTAGGTGGTGATCCTAAGATTCTTGCTGGCTGTAAGTTCACCTCTGCATTATTTACCAGTCTATCTCATGCTTTAGTGTCTCCTTGGCTCCGCCGGGGTAAATTATAGGTCATAGAAATTAAATTCCAAACCATCACATATGAGATAATGATTGAAGAATTACCAGCTTGTTATCCACGTCCTCTACCAAAAAAGACATCCCCTTCAAACCAAATGCAAAAATAAGTATTGTTGACACAACAGCTTCTTAATTCCATATCTCTAAATAGAAACCAAGAGAGTGACTTAACTGTGAAAAAGAACATTCCTGAAGATGAAGCTGCTGCCTGCTTTAATGAAAAATCTGATGAAAGCGAACTGACTAGATCTTCCATTATGTATAACTGCTAGAATATATTCACTGTGGCTCTCTTTCCAATTCTTTGGTTTTGTATTATTTATTTTTTTAATTTCAATGTTTATCTTCTCAGTTCATATTTGATTCAATGCAGATTTATGATCTTAGGAGCCCCTGGATTGTCTATTAACAATTATTTATTTGTATATGGTTTTTTTAAAGCTAAGTGATGCCTTCTCTATTATGAGTTTCCTTTGCTTCTGAGTGTAGGTATAATGCACAGATTAAATATTTTGCAGACAATCATTCAAATGATTGAATATTTATGTGCTTTACATTTACTAACTTCTATAGTTCTATTAGAGAATTAAAACGACTGTTTCCATTTATAGTTCTGTGAGCATTATATTACCCTTATGCAACAGACATGGTGAAAAACAAAGGTTTCTTGCCTTCGGGGCCTTCTGAAATACCAGTTCAAAGGAATCAAATGAAGGAAATCATTTATTCGCTACTTCCAGCTTGCAAAGAACCTGATCCTGAATCTGGTATTGCCTTTGACGCAGAAGCTATCATTGCTAACCCACCAGCTTACGGTAAGGACCTTGCCTCTAACTATGTTTTTCCCCTGTAAATAAAGAGCGAGGTTATATTGTTATTCCCTGCGGTGATAAAACCTTCTACTATCCTTTTGCTTATATTTGTGGAAAGCATCAGTTCTCTTTACTTTTCTCTCATCCTGGGCAGAGGACTTTTGGAATGAGGTTGCAAGATGTTGGCACATTTTTCCTTTTTCCTTCTTTAATCTCTCGTGACAATCTAATTTGTTATTTTTGTGAAGAAACTATGTACAATAACCTCTTCAATGCTTCTGCAGGGCATACTCATGTGGCAGAGGCACTTAAAATTCCAATTCACATATTTTTCACAATGCCATGGACGTAAGTTTGCCCTAGATAGTGTACCTTTTTTTTCTTTTTTTAACAATCTGGAGATACTGTTATTCAGTTATATTGTGTGAATATTTACATATATTTGTGTTATATGCTTTGTGAATCATGCTCTATTTCCTCCTCTTGCCAGGCCAACAAGTGAATTTCCTCATCCTTTATCCCGCGTCAAGCAACAGGCTGGCTATCGGGTATTTCTATTGTGGCATTGCTTTCTGCCTTCACGCGCACAAACACCATAATTACTTAATTACTTTTATGTTCTTCTTTTTTTTATTTCTTTACTTAACTTATTTTTATACCTTCATCAAATTTTGATTAAATCTAATTTGATTTTTTACTCTCTCTAATTTCCAGCTTTCATATCAAATTGTGGACTCCTTAATTTGGCTTGGGATACGGGACATGATTAATGATCTTAGGAAGAAAAGACTGAAGCTACGACCTGTCACTTATTTAAGTGGTTCACATGCCTCTGAATCAAATGTGCCACATGGGTATATATGGAGCCCGCATCTTGTTCCTAAACCAAAAGGTTCGTTCTTGTAATGATGAGTTGTTCTTTCTACTGTAAAGCAATTTTAACAAGTTACTCTTTTAGAAATTTACCTGTTGTTTTCATTTTGTAAAGAATATAGAATCATAGGATCTTATGATTCTAGCTTTAAACTTGGGTTTCTTTCAGATTGGGGACCTAAGGTGGATGTGGTCGGATTTTGCTTCCTGGATCTTGCATCAACTTATGAACCTCCCGAGTCGCTTGTAAATTGGCTTAAGGCTGGTGACAGGCCTATTTATATCGGTTTTGGTAGCCTTGTGAGTGTTTCATATTATCCGCTCCCTTTAATTTGATATTTCTGCTTTTTAAAGTTTCTTCTTTCTCTGCATCAAAAGCTCCCTTTTTGTTGGGGCAGGTGGAAGGGATATAAAAGAGGTTGTAGATTTTTATTTTATTTCTTTTGGATGTGAAATAAAACTTCCATTAAGAATTAATTGTACAAAGTTTCAAATAGATATCCTTGCATGTGGTGGCCCGGCCAATTCGGTAAAATAGTGGTGATGATTGCAAATCATTTATTCATCTGGTACCCTCTTCTTGGAAGATCCTTTTATTCCTTGCTTGCCGAAAGTTCCATATTGCAATGTCATTTGTTCTACTCTTTCTTTCCTTTTCCTAACTTATTTTTGTGTAAACTCAAGAGAGGGAAGGTTCCAAGTGCTTTATAGTTGGTTTATCTTGTAGTTCTTGATCCTTTACATTTCAATGTATTTCAGTTCTTGGTTTCTGTTAGGTGTTACCTTAACAAGGGGGCTTTTATTTACTCCTCCTTTTGCATTCATCGTTTCATTTATATATATATATAAATAATCTAGTGAACTGAAGTTAGGAAGATGTGATCTCTTTAACTCACAAACACTTAAATTCTCATTCAATTTGATCAACCTTGAAGGAAGGTTAGGATTCGGACTGCATTAATAGTTAATAATCAATGACGAAAGGTAAGAAAACAAGTTCTTTCCCTGAGAATATAACACTTTGCAAGTAGGCTCGTGATTCTTACAGTTACTGAAATGTTTTGGACATGCAACAAGAATCACCAAAATTGGTAAGGCTCATGGCTCCAGAAAATAGTGCACGACACTATTTTTTATTTTTTATTATTTTTTTTGTGTGTGTGTGTGGATTGGAAGAATACTATTTTTCTATTCAGGCTAAATTATGAAAAATATCCTTGAATTTTGCCCCTTATTTCATAAGTGGCAAGCGCCTTTTGTGCCTTTCTGTTGCCTTGAAGATATGTTCAAGGCGACACAGGTGCTCGCCTTCTTCTTTTTCTATTTTATTTTATTTATTTAATTATTTTTGTAATTCTTCAAGTTTTCTATTTTTCTTCTTTAATCCTATCAATTAAGAAGAAGAATCAAGGTTTAAACGAGAAGAATTGAAGAAAAACCAAAGAAATGAAGAAGAAATGAAAAAAAAATAGCAAAAGTAGTAGAAAAAAGTGGAAGAAGCCATCCTTTTTGTTTAGCCATCCAACAGCTGCTTATCAAAATGGAAGAAGAGGAAGAAGAGGAGGGGTTGGATATCAGAAGGGAAGAAGAAATATTCTTCTAATTTTACATTTCCTTGAATTGGCTGCCAACATTAAAGTAATGTTGGCTGCATGCCAACCAAGCTGCCAAATGGCAAAATTTCTACCCCATTAGGCATTTTTCAAAGAACTATTTTTACCTATTTAAAAAGCATTTCTGTTTCTATTTCTTTTTTCAAATATCTTTTTTTTCTCAATATTGTTTTAGTTAACTTTTATTTAAATCTTAATAATTTTGTTAATATTCTAACGTTTTAAAAATTGAAGTATTTTTTTTTCTATCTATAATTGTCAAATCCGTTAGAATGTTAGACAAAAATTGAGATGTAGAATGGGTGTGACAATGACCTGAAATGGGGTACGCGATGTCTGAGTCCCAATCTCAGTTCTTGAGCACCGCAACACGGTTCTACATCCTAATTATTGTCCAACATTCTAATTGATTTATGCTCCAATGGTAGTTTTGAAACACTTATGAAAGGTCAAGGGTGGGTAATATTGCAATTTTTGAAAGAGTAGGGTATTGAAGAGGCCAGTTTCCATTTCCAAAAAAAAGAAAAAAAAAAAAAGAGTAGGGTATTTTTCATAATTTTGCCTTGGATTTTTTTTAAGATGGAAACAATACTTTTTCATTGATATAGTGAAACTGAAATTAATTATTCTTGAAATCCTTCAAGAAAACCCAACTGTTTTTCGATTTACTGAAAATTACTTCTCATCTGAATATTTGCTACTACTACCATGATTTTGGCCGGTTTTCCAAGACTGGGCTTATGTCTAAGGTTTTGCAGCAAGTGCCATGCCCTCCTTTCTTCAAACAGGAAAACTCACTCCATATTTCAAATTGACTTCCCTTTCCTGATTACAAATCCATTTCTATTCTCTGTTTGAGTGCATGGATGTTATCTTGATGTCCTGCTTATCTCTGACTTTTACTTTTCAGGTTCAATTGTCAGTAGCATCTAGAAACGTCTCATGATTATTCAAATTGTTGTTTATCTGGTTCCCAAGAAACAGTCTTCTAAGTTAATGGCCTCCTCACGTTCTTAACCTAGTTTTCATTGTATGTCAGCCTGTTCAAGAACCTGCAAAGATGACCCAAATAATTGTAAAGGCGCTGGAAAGTACTGGGCAGAGGGGCATCATCAACAAAGGCTGGGGCGGTCTTGGAAATTGTAAGTTTCTGAATTTCCATCTCAGTATGTTCTGGTGACGTCAATATGATTGTAACTAGTCCTACCTATTCACTGCCTTCCCTGAACTTCTGTTGCCTGGAATTATTGATTCTTGGAGATCTAATTTATTTAGCTTAAATTTTATTTCTTTCCAGTGGAAGAACCGAAGGACTTTGTATATTTATTGGACAATTGCCCTCATGATTGGCTTTTCTTACAGTGCAAGGCCGTGGTGAGTATGTATAGTTGGTTCATATAACGGGTTCTTTTTTGGTGTTGCTTTTATCTCAAACAAGTTTAAACAGAGGGCCTGAAGTCTTAGTTGTCTCTTATGTATATAAGAGAAAGGCTAAATACTATTTTGGTCCAATTACTTTGGTTGTTATTTTATTCTCTGTACTTTGTGAGTGTCAGCCCTGAACTTCCTAGAATCTCAAAATTAGCCTCCGCCCTAAATTTGGTGTTAGTTTGAACTTCGAGGTTAAAATTTAGCTGACTTGACCAAAAATGGTTCTCTCCTTTATATCTCTCTTCCCTCCTTCCTTTGCTTCCAGCCCTCCTCATTCTTTTGTCAATTGCATGCGAGAAGACCCACAAAATAGATAGCTCATCGATGTCGTCCCATTTGGCAGCATATGGGTGTTGCAGCCATCGCAATACTTCCACCTTCATTGTAAGAAATGAGGTTTTCTAACCTGGTGGTGATGTTGATCGGCAAAACTACAGGCCACATGACAGAAGACATCAGATTATTTGGATGAACAGGCCACACGGTGGAGTGGAGTGGTTCATTAAAATTATAAGACGTCTTTCTCCATTCACACTCCACAACCTCCTTGTAGGCGTAGTTTTTTTTTTCTTTTTTCCCCTTTATTTATTGAATATTCTCGGAGAGCACTATAGAAGTTTTGTAACTAAAGTCATAACCCCATGTGTGGACAACATTTCCATGTGAAATTCTTTATTCTGACGGGGTGGTTGGGGAGGGAGAGCTGAAGGAGAGAAAAGTTAAAGCTGAAGGAGAGAAGTTAACTCTTTTGCTAAGTAAGCTAAATTTTAACCCCAAACTAAACCAAAACTAATAGTAAGGCAGTGGAAATGTCTCAAAAGATTTATAGTAAGACAGTGGAAATCTGTTCAAGATTTATTGAAAAGTAAATTTGAATACTATCAAAGTAGGAAACCATCATGGGTTGGTCTATTGGTAAAAAAGGAGACATAGTCTCAATAAATGGCTAAGAGGTCATGGGTTCAATCCATGGTGGCCAACTACCGAAGATTCAATATCCTGCGAGTTTCCTTGACACCCGAGTATTGTAAGGTCAGGCTGGTTATCCTGTGAGATTTGTTGAGGTGCGTGTAAGCTAGCTCGGACATTCATGGATATAAAAAATAAAAAAATAAAAAAAAGATACTATCGAAGTAGGATTAAATAGAATAAGAACTCAAGTACAAGGACCAAATCAAAGTAGGATTAAATAGAATAACAACTCAAGTACAAGGACCAAAATAGTTTTCTAACAGAACAAAAGGCGTAGGTGTTGATGTGTTGTGCTTGAGCACTTTTTTGTGCTTGTGTTCTTTCTTGTCATCCAACAAAGAAATCACTTACAACAGTGAGCTTGGCCCACAATTTTGGAGATGGAAGACTACAGATATAGCAAGGTCTGATTAGATTTCTAGAGGGGTGAGATAGTAATTTGATATATGCGTGGCTTTTTTAAAGGTCTGAAAATGTATAATTTAACTAAAAACGTTTTGAAAAAAAAATGGAACTTTGTTGTATTAATTAATGGAGAAAATATTAATAGGACACGGTGATAGGGCATCACAAACACGTAATCTAAAATGGATTTTTTTCCCCTTTCTGATAGGAAACAAAATAAGATGAAATGTTACAAAGGTAGGGATAACCATTCCCAAGCTATAGAGATTTACGAGAGCCAAATAATTCCAAAAGATCATTTAAGGTGGATTCTAATAAACCATGGCTTGAAATATCTTCTACAGTTCTACTTAAATAGCAAGTAAAAGTGAATTTTGAACTTGATTCTCAAGTATCAAAAGGAGCACAGATACTTCCGCAGTTTTTCACTATTTTCCCTCTTGAATGGTTATTTAAGTAAATTAACCGACTTCCTCTGACTAAAGGCGGGATATCCTATGCTATGGCTATTGGCATTAACTATGTTTTGGCTGGTTCTTTTACCTGTTTCCTGCATCTGAAAGGGCTTCCCCCCACATTGTCTTATTTACAGGTGCATCATGGAGGTGCTGGAACAACGGCTGCTGGACTGAAAGCTGCTGTAATGGCTTCACATTGTAAACTTGTCTTTTTAGTATTACGTGACCGATTATTGCTAACTTTCATATCTCTGTCTCTCTGTCTCGGTTCATTCTAGTGCCCAACAACGATCATTCCTTTCTTTGGAGATCAACCATTTTGGGGTGAGCGGGTACATGCTAGAGGAGTTGGTCCTGCACCCATCCCCGTTGAAGAGTTCTCATTTAACAAGTTAGTGGATGCCATAAACTTCATGCTAGATCCAAAGGTAATCTTTCTGGAAGTTGAACAAATCTTGTACATATTATGCTTATCAAAATTATTGGAAAAAAAAAAAAAAAAGAGACCAAAATACGCACCATTTCATCAAGTAATTTGGTTCAAGCAAATTAATACCCCCTCCCCTCCAACCCAAAATTTTTGTGTCTGCTCTCGTAAACATTTACCTCATGACACTTACTACAGTAGAGGTTATGATGATTTTCTGTTTGTGATCATAATGGAGCCGGGAGGTCTAGGAAGTACATCCGACTAGGAAAGAAGTAAACTGCGAGATTGATAATTCCTTACTTGTTGCATTTGAAGGACCTACAATTGCCCTTACCAGATAATCATAGATTTAAGGGAATCCTATAGCGAAAAGAAAAAAGATATTATTACTATGTTGGTTGGTATGAGTGGAGAAACTCATCCATTGGAAGGGGAAAAAGTAATTCTACCTGTTATTGATATGCAAATACAGAAAGAACGGGATATTATAAACTTATGTTTCATCCCAGGTAAAGCAGTCGGCTCTCGAGCTCGCCAAGGCCATGGAGAACGAGGATGGAGTAGAAGGAGCTGTAAAAGCCTTCTTCAAGCATTATCGTCCAAAAAAGACCGAACCAGAGTCCGAGTCAGAGGACTCAACCGTTTTCTCTATACGTAGATGCTTTGGTTGTTCTTGA

mRNA sequence

ATGATGGATTATGGTCATCGTGTTAGACTTGCTACTCATTCAAATTTCAAGGAGTTTGTTTTAACTGCTGGGCTGGAGTTTTTTGCTCTAGGTGGTGATCCTAAGATTCTTGCTGGCTACATGGTGAAAAACAAAGGTTTCTTGCCTTCGGGGCCTTCTGAAATACCAGTTCAAAGGAATCAAATGAAGGAAATCATTTATTCGCTACTTCCAGCTTGCAAAGAACCTGATCCTGAATCTGGTATTGCCTTTGACGCAGAAGCTATCATTGCTAACCCACCAGCTTACGGGCATACTCATGTGGCAGAGGCACTTAAAATTCCAATTCACATATTTTTCACAATGCCATGGACGCCAACAAGTGAATTTCCTCATCCTTTATCCCGCGTCAAGCAACAGGCTGGCTATCGGCTTTCATATCAAATTGTGGACTCCTTAATTTGGCTTGGGATACGGGACATGATTAATGATCTTAGGAAGAAAAGACTGAAGCTACGACCTGTCACTTATTTAAGTGGTTCACATGCCTCTGAATCAAATGTGCCACATGGGTATATATGGAGCCCGCATCTTGTTCCTAAACCAAAAGATTGGGGACCTAAGGTGGATGTGGTCGGATTTTGCTTCCTGGATCTTGCATCAACTTATGAACCTCCCGAGTCGCTTGTAAATTGGCTTAAGGCTGGTGACAGGCCTATTTATATCGGTTTTGGTAGCCTTCCTGTTCAAGAACCTGCAAAGATGACCCAAATAATTGTAAAGGCGCTGGAAAGTACTGGGCAGAGGGGCATCATCAACAAAGGCTGGGGCGGTCTTGGAAATTTGGAAGAACCGAAGGACTTTGTATATTTATTGGACAATTGCCCTCATGATTGGCTTTTCTTACAGTGCAAGGCCGTGGTGCATCATGGAGGTGCTGGAACAACGGCTGCTGGACTGAAAGCTGCTTGCCCAACAACGATCATTCCTTTCTTTGGAGATCAACCATTTTGGGGTGAGCGGGTACATGCTAGAGGAGTTGGTCCTGCACCCATCCCCGTTGAAGAGTTCTCATTTAACAAGTTAGTGGATGCCATAAACTTCATGCTAGATCCAAAGGTAAAGCAGTCGGCTCTCGAGCTCGCCAAGGCCATGGAGAACGAGGATGGAGTAGAAGGAGCTGTAAAAGCCTTCTTCAAGCATTATCGTCCAAAAAAGACCGAACCAGAGTCCGAGTCAGAGGACTCAACCGTTTTCTCTATACGTAGATGCTTTGGTTGTTCTTGA

Coding sequence (CDS)

ATGATGGATTATGGTCATCGTGTTAGACTTGCTACTCATTCAAATTTCAAGGAGTTTGTTTTAACTGCTGGGCTGGAGTTTTTTGCTCTAGGTGGTGATCCTAAGATTCTTGCTGGCTACATGGTGAAAAACAAAGGTTTCTTGCCTTCGGGGCCTTCTGAAATACCAGTTCAAAGGAATCAAATGAAGGAAATCATTTATTCGCTACTTCCAGCTTGCAAAGAACCTGATCCTGAATCTGGTATTGCCTTTGACGCAGAAGCTATCATTGCTAACCCACCAGCTTACGGGCATACTCATGTGGCAGAGGCACTTAAAATTCCAATTCACATATTTTTCACAATGCCATGGACGCCAACAAGTGAATTTCCTCATCCTTTATCCCGCGTCAAGCAACAGGCTGGCTATCGGCTTTCATATCAAATTGTGGACTCCTTAATTTGGCTTGGGATACGGGACATGATTAATGATCTTAGGAAGAAAAGACTGAAGCTACGACCTGTCACTTATTTAAGTGGTTCACATGCCTCTGAATCAAATGTGCCACATGGGTATATATGGAGCCCGCATCTTGTTCCTAAACCAAAAGATTGGGGACCTAAGGTGGATGTGGTCGGATTTTGCTTCCTGGATCTTGCATCAACTTATGAACCTCCCGAGTCGCTTGTAAATTGGCTTAAGGCTGGTGACAGGCCTATTTATATCGGTTTTGGTAGCCTTCCTGTTCAAGAACCTGCAAAGATGACCCAAATAATTGTAAAGGCGCTGGAAAGTACTGGGCAGAGGGGCATCATCAACAAAGGCTGGGGCGGTCTTGGAAATTTGGAAGAACCGAAGGACTTTGTATATTTATTGGACAATTGCCCTCATGATTGGCTTTTCTTACAGTGCAAGGCCGTGGTGCATCATGGAGGTGCTGGAACAACGGCTGCTGGACTGAAAGCTGCTTGCCCAACAACGATCATTCCTTTCTTTGGAGATCAACCATTTTGGGGTGAGCGGGTACATGCTAGAGGAGTTGGTCCTGCACCCATCCCCGTTGAAGAGTTCTCATTTAACAAGTTAGTGGATGCCATAAACTTCATGCTAGATCCAAAGGTAAAGCAGTCGGCTCTCGAGCTCGCCAAGGCCATGGAGAACGAGGATGGAGTAGAAGGAGCTGTAAAAGCCTTCTTCAAGCATTATCGTCCAAAAAAGACCGAACCAGAGTCCGAGTCAGAGGACTCAACCGTTTTCTCTATACGTAGATGCTTTGGTTGTTCTTGA

Protein sequence

MMDYGHRVRLATHSNFKEFVLTAGLEFFALGGDPKILAGYMVKNKGFLPSGPSEIPVQRNQMKEIIYSLLPACKEPDPESGIAFDAEAIIANPPAYGHTHVAEALKIPIHIFFTMPWTPTSEFPHPLSRVKQQAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGSHASESNVPHGYIWSPHLVPKPKDWGPKVDVVGFCFLDLASTYEPPESLVNWLKAGDRPIYIGFGSLPVQEPAKMTQIIVKALESTGQRGIINKGWGGLGNLEEPKDFVYLLDNCPHDWLFLQCKAVVHHGGAGTTAAGLKAACPTTIIPFFGDQPFWGERVHARGVGPAPIPVEEFSFNKLVDAINFMLDPKVKQSALELAKAMENEDGVEGAVKAFFKHYRPKKTEPESESEDSTVFSIRRCFGCS
BLAST of Cla002739 vs. Swiss-Prot
Match: U80A2_ARATH (Sterol 3-beta-glucosyltransferase UGT80A2 OS=Arabidopsis thaliana GN=UGT80A2 PE=1 SV=1)

HSP 1 Score: 743.8 bits (1919), Expect = 1.0e-213
Identity = 349/422 (82.70%), Postives = 382/422 (90.52%), Query Frame = 1

Query: 1   MMDYGHRVRLATHSNFKEFVLTAGLEFFALGGDPKILAGYMVKNKGFLPSGPSEIPVQRN 60
           + DYGHRVRLATH+NFKEFVLTAGLEF+ LGGDPK+LAGYMVKNKGFLPSGPSEIP+QRN
Sbjct: 216 LQDYGHRVRLATHANFKEFVLTAGLEFYPLGGDPKVLAGYMVKNKGFLPSGPSEIPIQRN 275

Query: 61  QMKEIIYSLLPACKEPDPESGIAFDAEAIIANPPAYGHTHVAEALKIPIHIFFTMPWTPT 120
           QMK+IIYSLLPACKEPDP+SGI+F A+AIIANPPAYGHTHVAEALKIPIH+FFTMPWTPT
Sbjct: 276 QMKDIIYSLLPACKEPDPDSGISFKADAIIANPPAYGHTHVAEALKIPIHVFFTMPWTPT 335

Query: 121 SEFPHPLSRVKQQAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGSHASESN 180
           SEFPHPLSRVKQ AGYRLSYQIVDSLIWLGIRDM+NDLRKK+LKLRPVTYLSG+  S SN
Sbjct: 336 SEFPHPLSRVKQPAGYRLSYQIVDSLIWLGIRDMVNDLRKKKLKLRPVTYLSGTQGSGSN 395

Query: 181 VPHGYIWSPHLVPKPKDWGPKVDVVGFCFLDLASTYEPPESLVNWLKAGDRPIYIGFGSL 240
           +PHGY+WSPHLVPKPKDWGP++DVVGFC+LDLAS YEPP  LV WL+AGD+PIYIGFGSL
Sbjct: 396 IPHGYMWSPHLVPKPKDWGPQIDVVGFCYLDLASNYEPPAELVEWLEAGDKPIYIGFGSL 455

Query: 241 PVQEPAKMTQIIVKALESTGQRGIINKGWGGLGNLEEPKDFVYLLDNCPHDWLFLQCKAV 300
           PVQEP KMT+IIV+AL+ T QRGIINKGWGGLGNL+EPKDFVYLLDN PHDWLF +CKAV
Sbjct: 456 PVQEPEKMTEIIVEALQRTKQRGIINKGWGGLGNLKEPKDFVYLLDNVPHDWLFPRCKAV 515

Query: 301 VHHGGAGTTAAGLKAACPTTIIPFFGDQPFWGERVHARGVGPAPIPVEEFSFNKLVDAIN 360
           VHHGGAGTTAAGLKA+CPTTI+PFFGDQPFWGERVHARGVGP+PIPV+EFS +KL DAIN
Sbjct: 516 VHHGGAGTTAAGLKASCPTTIVPFFGDQPFWGERVHARGVGPSPIPVDEFSLHKLEDAIN 575

Query: 361 FMLDPKVKQSALELAKAMENEDGVEGAVKAFFKHY-RPKKTEPESESEDSTVFSIRRCFG 420
           FMLD KVK SA  LAKAM++EDGV GAVKAFFKH    K+   +   E S   S R+CFG
Sbjct: 576 FMLDDKVKSSAETLAKAMKDEDGVAGAVKAFFKHLPSAKQNISDPIPEPSGFLSFRKCFG 635

Query: 421 CS 422
           CS
Sbjct: 636 CS 637

BLAST of Cla002739 vs. Swiss-Prot
Match: U80B1_ARATH (Sterol 3-beta-glucosyltransferase UGT80B1 OS=Arabidopsis thaliana GN=UGT80B1 PE=2 SV=1)

HSP 1 Score: 552.7 bits (1423), Expect = 3.4e-156
Identity = 251/408 (61.52%), Postives = 313/408 (76.72%), Query Frame = 1

Query: 1   MMDYGHRVRLATHSNFKEFVLTAGLEFFALGGDPKILAGYMVKNKGFLPSGPSEIPVQRN 60
           + ++GHRVRLATH+NF+ FV  AG+EF+ LGGDP+ LA YM +NKG +PSGPSEI  QR 
Sbjct: 179 LQEFGHRVRLATHANFRSFVRAAGVEFYPLGGDPRELAAYMARNKGLIPSGPSEISKQRK 238

Query: 61  QMKEIIYSLLPACKEPDPESGIAFDAEAIIANPPAYGHTHVAEALKIPIHIFFTMPWTPT 120
           Q+K II SLLPAC EPD E+  +F A+AIIANPPAYGH HVAEAL +PIHIFFTMPWTPT
Sbjct: 239 QLKAIIESLLPACIEPDLETATSFRAQAIIANPPAYGHVHVAEALGVPIHIFFTMPWTPT 298

Query: 121 SEFPHPLSRVKQQAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGSHASESN 180
           +EFPHPL+RV Q A Y LSY +VD ++W  IR  IND RK++L L P+ Y S  H S S+
Sbjct: 299 NEFPHPLARVPQSAAYWLSYIVVDLMVWWSIRTYINDFRKRKLNLAPIAYFSTYHGSISH 358

Query: 181 VPHGYIWSPHLVPKPKDWGPKVDVVGFCFLDLASTYEPPESLVNWLKAGDRPIYIGFGSL 240
           +P GY+WSPH+VPKP DWGP VDVVG+CFL+L S Y+P E  ++W++ G  P+YIGFGS+
Sbjct: 359 LPTGYMWSPHVVPKPSDWGPLVDVVGYCFLNLGSKYQPREEFLHWIERGSPPVYIGFGSM 418

Query: 241 PVQEPAKMTQIIVKALESTGQRGIINKGWGGLGNL-EEPKDFVYLLDNCPHDWLFLQCKA 300
           P+ +P +   II++ L+ T QRGI+++GWGGLGNL  E  + V+L+++CPHDWLF QC A
Sbjct: 419 PLDDPKQTMDIILETLKDTEQRGIVDRGWGGLGNLATEVPENVFLVEDCPHDWLFPQCSA 478

Query: 301 VVHHGGAGTTAAGLKAACPTTIIPFFGDQPFWGERVHARGVGPAPIPVEEFSFNKLVDAI 360
           VVHHGGAGTTA GLKA CPTTI+PFFGDQ FWG+R++ +G+GPAPIP+ + S   L  +I
Sbjct: 479 VVHHGGAGTTATGLKAGCPTTIVPFFGDQFFWGDRIYEKGLGPAPIPIAQLSVENLSSSI 538

Query: 361 NFMLDPKVKQSALELAKAMENEDGVEGAVKAFFKHYRPKKTEPESESE 408
            FML P+VK   +ELAK +ENEDGV  AV AF +H  P+   PES SE
Sbjct: 539 RFMLQPEVKSQVMELAKVLENEDGVAAAVDAFHRHLPPELPLPESSSE 586

BLAST of Cla002739 vs. Swiss-Prot
Match: ATG26_YARLI (Sterol 3-beta-glucosyltransferase OS=Yarrowia lipolytica (strain CLIB 122 / E 150) GN=ATG26 PE=3 SV=3)

HSP 1 Score: 230.7 bits (587), Expect = 2.9e-59
Identity = 153/439 (34.85%), Postives = 231/439 (52.62%), Query Frame = 1

Query: 1    MMDYGHRVRLATHSNFKEFVLTAGLEFFALGGDP----KILAGYMVKNKGFLPSGPSEIP 60
            +++ GHRVR+ATHS FK+++   G+EF  + GDP    KI+  + V +  FL    S+  
Sbjct: 1018 LIEEGHRVRIATHSEFKDWIEGYGIEFKEVAGDPSELMKIMVDHGVFSVSFLRDAASKF- 1077

Query: 61   VQRNQMKEIIYSLLPACKEPDPESGIAFDAEAIIANPPAYGHTHVAEALKIPIHIFFTMP 120
              R  + E++ S   AC+  D           +I +P A    H+AEAL+IP    FTMP
Sbjct: 1078 --RGWINELLASSWEACQGSD----------VLIESPSAMAGIHIAEALQIPYFRAFTMP 1137

Query: 121  WTPTSEFPHPLSRVKQQAGYR---LSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSG 180
            W+ T  +PH      Q+ G     L+Y + D++ W GI   +N  RKK L L P T L  
Sbjct: 1138 WSRTRAYPHAFIVPDQKMGGSYNYLTYVMFDNVFWKGISGQVNRWRKKTLHL-PRTNLD- 1197

Query: 181  SHASESNVPHGYIWSPHLVPKPKDWGPKVDVVGFCFLDLAST-YEPPESLVNWLKA---- 240
             H  ++ VP  Y  SP ++P P D+   + + G+ FLD  S  Y P + L  +++     
Sbjct: 1198 -HMEQNKVPFLYNVSPAVLPPPVDFPDWIKITGYWFLDEGSKDYTPDDKLCRFMEKARND 1257

Query: 241  GDRPIYIGFGSLPVQEPAKMTQIIVKALESTGQRGIINKGWG---GLGNLEEPK----DF 300
            G + +YIGFGS+ V +P  +T+ +V+++     R I+NKGW    G  + +EP+    + 
Sbjct: 1258 GKKLVYIGFGSIVVSDPTALTKSVVESVLKADVRCILNKGWSDRLGKKDAKEPEIPLPEE 1317

Query: 301  VYLLDNCPHDWLFLQCKAVVHHGGAGTTAAGLKAACPTTIIPFFGDQPFWGERVHARGVG 360
            V  + NCPHDWLF Q  A VHHGG+GTT AGL+A  PT I PFFGDQ F+  RV   G G
Sbjct: 1318 VLQITNCPHDWLFPQIDACVHHGGSGTTGAGLRAGLPTIIKPFFGDQFFYANRVEDLGAG 1377

Query: 361  --PAPIPVEEFSFNKLVDAINFMLDPKVKQSALELAKAMENEDGVEGAVKAFFKH----- 411
                 + V +FS   L +A +   + ++   A  + + + +E+GV  A++A ++      
Sbjct: 1378 IHLRKLNVSQFS-KALWEATH---NERIIAKAAAVGRQIRSENGVISAIQAIYRDLDYAR 1436

BLAST of Cla002739 vs. Swiss-Prot
Match: ATG26_ASPNC (Sterol 3-beta-glucosyltransferase OS=Aspergillus niger (strain CBS 513.88 / FGSC A1513) GN=atg26 PE=3 SV=2)

HSP 1 Score: 222.2 bits (565), Expect = 1.0e-56
Identity = 143/427 (33.49%), Postives = 224/427 (52.46%), Query Frame = 1

Query: 5    GHRVRLATHSNFKEFVLTAGLEFFALGGDPKILAGYMVKNKGFLPSGPSEIPVQ-RNQMK 64
            GHR ++ATH+ F+ +V   G++F  + GDP  L    V+N  F  S   E   + R  + 
Sbjct: 924  GHRPKIATHAEFEPWVRKHGIDFALVDGDPAELMRICVENGMFTYSFFKEATAKFRGWID 983

Query: 65   EIIYSLLPACKEPDPESGIAFDAEAIIANPPAYGHTHVAEALKIPIHIFFTMPWTPTSEF 124
            +++ S   AC+          D + +I +P A    H+AEAL+IP    FTMPW+ T  +
Sbjct: 984  DLLSSAWKACQ----------DTDLLIESPSAMAGIHIAEALRIPYFRAFTMPWSRTRAY 1043

Query: 125  PHPLSRVKQQAGYR---LSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGSHASESN 184
            PH  +  + + G     ++Y + D++ W  +   +N  RKK L L+     +G    + N
Sbjct: 1044 PHAFAVPEHKMGGAYNYITYVMFDNVFWKSVAGQVNRWRKKELGLKA----TGLDKMQPN 1103

Query: 185  -VPHGYIWSPHLVPKPKDWGPKVDVVGFCFLDLASTYEPPESLVNWL----KAGDRPIYI 244
             VP  Y +SP +VP P D+   + + G+ FL   S + PP  LV+++    K   + +YI
Sbjct: 1104 KVPFLYNYSPTVVPPPLDYPDWIRITGYWFLSEGSDWTPPAELVDFIQRARKDEKKVVYI 1163

Query: 245  GFGSLPVQEPAKMTQIIVKALESTGQRGIINKGWGG-LGNLEEPKDFV------YLLDNC 304
            GFGS+ V +P+ +T+ ++++++    R I++KGW   LG+    K  V      Y + + 
Sbjct: 1164 GFGSIVVSDPSALTRTVIESVQKADVRCILSKGWSDRLGDPASAKSEVPLPPEIYQIQSA 1223

Query: 305  PHDWLFLQCKAVVHHGGAGTTAAGLKAACPTTIIPFFGDQPFWGERVHARGVGPAPIPVE 364
            PHDWLF    A  HHGGAGTT A L+A  PT + PFFGDQ F+G RV   GVG     + 
Sbjct: 1224 PHDWLFSHIDAAAHHGGAGTTGASLRAGVPTIVKPFFGDQFFFGTRVEDLGVGICLKKLN 1283

Query: 365  EFSFNK-LVDAINFMLDPKVKQSALELAKAMENEDGVEGAVKAFFKHYRPKKTEPESESE 414
               F++ L +A +   D ++   A +L   + +EDGV+ A++A ++     KT  ++ S 
Sbjct: 1284 VTMFSRALWEATH---DERMIVKAHKLGAQIRSEDGVDTAIQAIYRDLEYAKTLAQARSN 1333

BLAST of Cla002739 vs. Swiss-Prot
Match: ATG26_PICPG (Sterol 3-beta-glucosyltransferase OS=Komagataella pastoris (strain GS115 / ATCC 20864) GN=ATG26 PE=3 SV=1)

HSP 1 Score: 221.5 bits (563), Expect = 1.8e-56
Identity = 144/421 (34.20%), Postives = 222/421 (52.73%), Query Frame = 1

Query: 6    HRVRLATHSNFKEFVLTAGLEFFALGGDPKILAGYMVKNK----GFLPSGPSEIPVQRNQ 65
            H+V++ TH  FK +V + G+EF  + G+P  L   MV +K    GFL     +       
Sbjct: 789  HKVKIVTHEEFKPWVESYGIEFATIAGNPAELMSLMVTHKSLSVGFLKEAKEKFT---GW 848

Query: 66   MKEIIYSLLPACKEPDPESGIAFDAEAIIANPPAYGHTHVAEALKIPIHIFFTMPWTPTS 125
            + E++ S   AC+          DA+ +I +P A    H+AE L+IP    FTMPWT T 
Sbjct: 849  IGELLQSSWDACQ----------DADVLIESPSAMAGIHIAEKLQIPYFRAFTMPWTRTR 908

Query: 126  EFPHPLSRVKQQAGYRLSYQ---IVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGSHASE 185
             +PH     +Q+ G   +Y    I +++ W GI   +N  R++ L L P T L      +
Sbjct: 909  AYPHAFVVPEQKRGGSYNYLTHIIFENVFWKGISGEVNKWREQVLML-PKTNLE--RLEQ 968

Query: 186  SNVPHGYIWSPHLVPKPKDWGPKVDVVGFCFLDL--ASTYEPPESLVNWLKA----GDRP 245
            + VP  Y  SP + P   D+   V VVG+ FLD   A +Y+PP+ L+ +++     G + 
Sbjct: 969  NKVPFLYNVSPTVFPPSMDFPHWVKVVGYWFLDEGEADSYDPPKPLLEFMEKAKTDGKKL 1028

Query: 246  IYIGFGSLPVQEPAKMTQIIVKALESTGQRGIINKGWGG-LGN-----LEEPKDFVYLLD 305
            +YIGFGS+ V +P ++T+ ++ A+ S   R I+NKGW   LG      +E P++ +Y   
Sbjct: 1029 VYIGFGSIVVSDPKQLTEAVIDAVLSADVRCILNKGWSDRLGKQTGVEVELPEE-IYNSG 1088

Query: 306  NCPHDWLFLQCKAVVHHGGAGTTAAGLKAACPTTIIPFFGDQPFWGERVHARGVGPAPIP 365
            N PHDWLF +  A VHHGG+GTT A L+A  PT I PFFGDQ F+  RV   GVG     
Sbjct: 1089 NVPHDWLFGKIDASVHHGGSGTTGATLRAGIPTIIKPFFGDQFFYANRVEDIGVGIGLRK 1148

Query: 366  VEEFSFNKLVDAINFMLDPKVKQSALELAKAMENEDGVEGAVKAFFKHYRPKKTEPESES 408
            +   S +K +  +    + ++ + A E+ K +++E+GV  A++  ++     K   +S+ 
Sbjct: 1149 LNSKSLSKAIKEVT--TNTRIIEKAKEIGKQIQSENGVSAAIRCLYQEMEYAKKLSKSKQ 1190

BLAST of Cla002739 vs. TrEMBL
Match: B9IB34_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0014s17640g PE=4 SV=2)

HSP 1 Score: 761.5 bits (1965), Expect = 5.3e-217
Identity = 356/424 (83.96%), Postives = 388/424 (91.51%), Query Frame = 1

Query: 1   MMDYGHRVRLATHSNFKEFVLTAGLEFFALGGDPKILAGYMVKNKGFLPSGPSEIPVQRN 60
           + DYGHRVRLATHSNF+EFVLTAGLEFF LGGDPK+LAGYMVKNKGFLPSGPSE+ +QRN
Sbjct: 195 LQDYGHRVRLATHSNFREFVLTAGLEFFPLGGDPKVLAGYMVKNKGFLPSGPSEVSIQRN 254

Query: 61  QMKEIIYSLLPACKEPDPESGIAFDAEAIIANPPAYGHTHVAEALKIPIHIFFTMPWTPT 120
           Q+KEIIYSLLPACK+PD +S I F A+AIIANPPAYGHTHVAEALK+P+HIFFTMPWTPT
Sbjct: 255 QIKEIIYSLLPACKDPDIDSKIPFRADAIIANPPAYGHTHVAEALKVPLHIFFTMPWTPT 314

Query: 121 SEFPHPLSRVKQQAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGSHASESN 180
           SEFPHPLSRVKQ AGYRLSYQIVDS+IWLGIRDMINDLRKK+LKLRPVTYLSGS  S+S+
Sbjct: 315 SEFPHPLSRVKQSAGYRLSYQIVDSMIWLGIRDMINDLRKKKLKLRPVTYLSGSQGSDSD 374

Query: 181 VPHGYIWSPHLVPKPKDWGPKVDVVGFCFLDLASTYEPPESLVNWLKAGDRPIYIGFGSL 240
           VP+GYIWSPHL PKPKDWGPK+DVVGFCFLDLAS YEPPE L+ WL+AG +PIYIGFGSL
Sbjct: 375 VPYGYIWSPHLAPKPKDWGPKIDVVGFCFLDLASNYEPPEPLLKWLEAGQKPIYIGFGSL 434

Query: 241 PVQEPAKMTQIIVKALESTGQRGIINKGWGGLGNLEEPKDFVYLLDNCPHDWLFLQCKAV 300
           PVQEP KMTQ IV+ALE TGQRGIINKGWGGLGNL EPKDF+YLLDNCPHDWLFLQCKAV
Sbjct: 435 PVQEPEKMTQTIVEALEQTGQRGIINKGWGGLGNLAEPKDFIYLLDNCPHDWLFLQCKAV 494

Query: 301 VHHGGAGTTAAGLKAACPTTIIPFFGDQPFWGERVHARGVGPAPIPVEEFSFNKLVDAIN 360
           VHHGGAGTTAAGLKAACPTTI+PFFGDQPFWGER+HARGVGP PIPV+EFS  KLV+AI+
Sbjct: 495 VHHGGAGTTAAGLKAACPTTIVPFFGDQPFWGERLHARGVGPPPIPVDEFSLTKLVEAIH 554

Query: 361 FMLDPKVKQSALELAKAMENEDGVEGAVKAFFKHYRPKKTEPESESEDST----VFSIRR 420
           FMLDPKVK+ A+ELAK MENEDGV+GAVKAFFKH   KK EPE ESE ST    +FS  +
Sbjct: 555 FMLDPKVKERAVELAKDMENEDGVDGAVKAFFKHLPRKKPEPEPESEPSTEPSSIFSFSK 614

BLAST of Cla002739 vs. TrEMBL
Match: A0A061DVF1_THECC (UDP-Glycosyltransferase superfamily protein isoform 4 OS=Theobroma cacao GN=TCM_003027 PE=4 SV=1)

HSP 1 Score: 759.2 bits (1959), Expect = 2.6e-216
Identity = 354/420 (84.29%), Postives = 386/420 (91.90%), Query Frame = 1

Query: 1   MMDYGHRVRLATHSNFKEFVLTAGLEFFALGGDPKILAGYMVKNKGFLPSGPSEIPVQRN 60
           + +YGHRVRLATHSNFKEFVLTAGLEF+ LGGDPK+LAGYMVKNKGFLPSGPSEIP QR+
Sbjct: 20  LQEYGHRVRLATHSNFKEFVLTAGLEFYPLGGDPKVLAGYMVKNKGFLPSGPSEIPTQRH 79

Query: 61  QMKEIIYSLLPACKEPDPESGIAFDAEAIIANPPAYGHTHVAEALKIPIHIFFTMPWTPT 120
           Q+KEIIYSLLPACKEPDP+SG+ F A+AIIANPPAYGHTHVAE+L++P+HIFFTMPWTPT
Sbjct: 80  QIKEIIYSLLPACKEPDPDSGVPFKADAIIANPPAYGHTHVAESLQVPLHIFFTMPWTPT 139

Query: 121 SEFPHPLSRVKQQAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGSHASESN 180
           SEFPHPLSRVKQ AGYRLSYQIVDSLIWLGIRDMIND+RKK+LKLRPVTYLSGS AS+S+
Sbjct: 140 SEFPHPLSRVKQPAGYRLSYQIVDSLIWLGIRDMINDVRKKKLKLRPVTYLSGSQASDSD 199

Query: 181 VPHGYIWSPHLVPKPKDWGPKVDVVGFCFLDLASTYEPPESLVNWLKAGDRPIYIGFGSL 240
           VPHGYIWSPHLVPKPKDWGPK+DVVGFCFLDLA+ YEPPE+L+ WL+AG +PIYIGFGSL
Sbjct: 200 VPHGYIWSPHLVPKPKDWGPKIDVVGFCFLDLATNYEPPETLIKWLEAGTKPIYIGFGSL 259

Query: 241 PVQEPAKMTQIIVKALESTGQRGIINKGWGGLGNLEEPKDFVYLLDNCPHDWLFLQCKAV 300
           PVQEP KMTQIIV ALE TGQRGIINKGWGGLGNL E KD VYLLDN PHDWLFLQC AV
Sbjct: 260 PVQEPEKMTQIIVDALEKTGQRGIINKGWGGLGNLAENKDSVYLLDNVPHDWLFLQCMAV 319

Query: 301 VHHGGAGTTAAGLKAACPTTIIPFFGDQPFWGERVHARGVGPAPIPVEEFSFNKLVDAIN 360
           VHHGGAGTTAAGLKAACPTTI+PFFGDQPFWGERVHARGVGP PIPV+EFS  KLVDAIN
Sbjct: 320 VHHGGAGTTAAGLKAACPTTIVPFFGDQPFWGERVHARGVGPPPIPVDEFSLPKLVDAIN 379

Query: 361 FMLDPKVKQSALELAKAMENEDGVEGAVKAFFKHYRPKKTEPESESEDSTVFSIRRCFGC 420
           FMLDPKVK+ A+ELA+AM+NEDGV GAVKAF KH   KK  PE   E S++FS+ RCFGC
Sbjct: 380 FMLDPKVKEKAVELAQAMKNEDGVTGAVKAFLKHLPCKKPSPEPSPERSSIFSVSRCFGC 439

BLAST of Cla002739 vs. TrEMBL
Match: A0A061DNI4_THECC (UDP-Glycosyltransferase superfamily protein isoform 3 OS=Theobroma cacao GN=TCM_003027 PE=4 SV=1)

HSP 1 Score: 759.2 bits (1959), Expect = 2.6e-216
Identity = 354/420 (84.29%), Postives = 386/420 (91.90%), Query Frame = 1

Query: 1   MMDYGHRVRLATHSNFKEFVLTAGLEFFALGGDPKILAGYMVKNKGFLPSGPSEIPVQRN 60
           + +YGHRVRLATHSNFKEFVLTAGLEF+ LGGDPK+LAGYMVKNKGFLPSGPSEIP QR+
Sbjct: 194 LQEYGHRVRLATHSNFKEFVLTAGLEFYPLGGDPKVLAGYMVKNKGFLPSGPSEIPTQRH 253

Query: 61  QMKEIIYSLLPACKEPDPESGIAFDAEAIIANPPAYGHTHVAEALKIPIHIFFTMPWTPT 120
           Q+KEIIYSLLPACKEPDP+SG+ F A+AIIANPPAYGHTHVAE+L++P+HIFFTMPWTPT
Sbjct: 254 QIKEIIYSLLPACKEPDPDSGVPFKADAIIANPPAYGHTHVAESLQVPLHIFFTMPWTPT 313

Query: 121 SEFPHPLSRVKQQAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGSHASESN 180
           SEFPHPLSRVKQ AGYRLSYQIVDSLIWLGIRDMIND+RKK+LKLRPVTYLSGS AS+S+
Sbjct: 314 SEFPHPLSRVKQPAGYRLSYQIVDSLIWLGIRDMINDVRKKKLKLRPVTYLSGSQASDSD 373

Query: 181 VPHGYIWSPHLVPKPKDWGPKVDVVGFCFLDLASTYEPPESLVNWLKAGDRPIYIGFGSL 240
           VPHGYIWSPHLVPKPKDWGPK+DVVGFCFLDLA+ YEPPE+L+ WL+AG +PIYIGFGSL
Sbjct: 374 VPHGYIWSPHLVPKPKDWGPKIDVVGFCFLDLATNYEPPETLIKWLEAGTKPIYIGFGSL 433

Query: 241 PVQEPAKMTQIIVKALESTGQRGIINKGWGGLGNLEEPKDFVYLLDNCPHDWLFLQCKAV 300
           PVQEP KMTQIIV ALE TGQRGIINKGWGGLGNL E KD VYLLDN PHDWLFLQC AV
Sbjct: 434 PVQEPEKMTQIIVDALEKTGQRGIINKGWGGLGNLAENKDSVYLLDNVPHDWLFLQCMAV 493

Query: 301 VHHGGAGTTAAGLKAACPTTIIPFFGDQPFWGERVHARGVGPAPIPVEEFSFNKLVDAIN 360
           VHHGGAGTTAAGLKAACPTTI+PFFGDQPFWGERVHARGVGP PIPV+EFS  KLVDAIN
Sbjct: 494 VHHGGAGTTAAGLKAACPTTIVPFFGDQPFWGERVHARGVGPPPIPVDEFSLPKLVDAIN 553

Query: 361 FMLDPKVKQSALELAKAMENEDGVEGAVKAFFKHYRPKKTEPESESEDSTVFSIRRCFGC 420
           FMLDPKVK+ A+ELA+AM+NEDGV GAVKAF KH   KK  PE   E S++FS+ RCFGC
Sbjct: 554 FMLDPKVKEKAVELAQAMKNEDGVTGAVKAFLKHLPCKKPSPEPSPERSSIFSVSRCFGC 613

BLAST of Cla002739 vs. TrEMBL
Match: A0A0D2P7F2_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_004G058300 PE=4 SV=1)

HSP 1 Score: 753.4 bits (1944), Expect = 1.4e-214
Identity = 355/422 (84.12%), Postives = 387/422 (91.71%), Query Frame = 1

Query: 1   MMDYGHRVRLATHSNFKEFVLTAGLEFFALGGDPKILAGYMVKNKGFLPSGPSEIPVQRN 60
           + DYGHRVRLATHSNFK+FVLTAGLEF+ LGGDPK+LAGYMVKNKGFLPSGPSEIP+QRN
Sbjct: 210 LQDYGHRVRLATHSNFKDFVLTAGLEFYPLGGDPKVLAGYMVKNKGFLPSGPSEIPIQRN 269

Query: 61  QMKEIIYSLLPACKEPDPESGIAFDAEAIIANPPAYGHTHVAEALKIPIHIFFTMPWTPT 120
           Q+KEIIYSLLPACK+PDP+S I F A+AIIANPPAYGHTHVAEALK+PIHIFFTMPWTPT
Sbjct: 270 QIKEIIYSLLPACKDPDPDSAIPFKADAIIANPPAYGHTHVAEALKVPIHIFFTMPWTPT 329

Query: 121 SEFPHPLSRVKQQAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGSHASESN 180
           SEFPHPLSRVKQ AGYRLSYQIVDS+IWLGIRDMINDLRKK+L+LRPVTYLSGS  S+S+
Sbjct: 330 SEFPHPLSRVKQPAGYRLSYQIVDSMIWLGIRDMINDLRKKKLRLRPVTYLSGSQGSDSD 389

Query: 181 VPHGYIWSPHLVPKPKDWGPKVDVVGFCFLDLASTYEPPESLVNWLKAGDRPIYIGFGSL 240
           VP+GYIWSPHLVPKPKDWGPK+DVVGFCFLDLA+ YEPPESLV WL+AG +PIYIGFGSL
Sbjct: 390 VPYGYIWSPHLVPKPKDWGPKIDVVGFCFLDLATNYEPPESLVKWLEAGPKPIYIGFGSL 449

Query: 241 PVQEPAKMTQIIVKALESTGQRGIINKGWGGLGNLEEPKDFVYLLDNCPHDWLFLQCKAV 300
           PVQEP KMTQIIV ALE TGQRGIINKGWGGLG+L E KD VYLLDN PHDWLFLQC AV
Sbjct: 450 PVQEPEKMTQIIVDALEQTGQRGIINKGWGGLGSLAESKDSVYLLDNVPHDWLFLQCMAV 509

Query: 301 VHHGGAGTTAAGLKAACPTTIIPFFGDQPFWGERVHARGVGPAPIPVEEFSFNKLVDAIN 360
           VHHGGAGTTAAGLKAACPTTI+PFFGDQPFWG+RVHARGVGPAPIP++EFS  KL+DAI 
Sbjct: 510 VHHGGAGTTAAGLKAACPTTIVPFFGDQPFWGDRVHARGVGPAPIPIDEFSLPKLIDAIK 569

Query: 361 FMLDPKVKQSALELAKAMENEDGVEGAVKAFFKHYRPKKTEPESESE-DSTVFSIRRCFG 420
           FML+P+VK+ A+ELAKAMENEDGV GAVKAFFKH   KK E E   E  S++FSI RCFG
Sbjct: 570 FMLNPEVKEKAVELAKAMENEDGVSGAVKAFFKHLPRKKPELEPSVEPPSSLFSISRCFG 629

Query: 421 CS 422
           CS
Sbjct: 630 CS 631

BLAST of Cla002739 vs. TrEMBL
Match: A0A068U5T1_COFCA (Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00016342001 PE=4 SV=1)

HSP 1 Score: 750.4 bits (1936), Expect = 1.2e-213
Identity = 353/421 (83.85%), Postives = 382/421 (90.74%), Query Frame = 1

Query: 1   MMDYGHRVRLATHSNFKEFVLTAGLEFFALGGDPKILAGYMVKNKGFLPSGPSEIPVQRN 60
           + DYGHRVRLATHSNFKEFVLT+GLEF+ LGGDPK+LAGYMVKNKGFLPSGPSEIP+QRN
Sbjct: 197 LQDYGHRVRLATHSNFKEFVLTSGLEFYPLGGDPKVLAGYMVKNKGFLPSGPSEIPIQRN 256

Query: 61  QMKEIIYSLLPACKEPDPESGIAFDAEAIIANPPAYGHTHVAEALKIPIHIFFTMPWTPT 120
           Q+KEII SLLPACKEPD ++GI F A+AIIANPPAYGHTHVAEALK+P+HIFFTMPWTPT
Sbjct: 257 QIKEIINSLLPACKEPDVDTGIPFKADAIIANPPAYGHTHVAEALKVPLHIFFTMPWTPT 316

Query: 121 SEFPHPLSRVKQQAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGSHASESN 180
           SEFPHPLSRVKQ AGYRLSYQIVDSLIWLGIRDMIND RKK+LKLRPVTYLSGS  SES+
Sbjct: 317 SEFPHPLSRVKQPAGYRLSYQIVDSLIWLGIRDMINDARKKKLKLRPVTYLSGSQGSESD 376

Query: 181 VPHGYIWSPHLVPKPKDWGPKVDVVGFCFLDLASTYEPPESLVNWLKAGDRPIYIGFGSL 240
           +PHGYIWSPHLVPKPKDWG KVDVVGFCFLDLAS Y+PPE LVNWL AG +PIYIGFGSL
Sbjct: 377 IPHGYIWSPHLVPKPKDWGQKVDVVGFCFLDLASNYQPPEPLVNWLNAGPKPIYIGFGSL 436

Query: 241 PVQEPAKMTQIIVKALESTGQRGIINKGWGGLGNLEEPKDFVYLLDNCPHDWLFLQCKAV 300
           PVQEP KMTQIIV+ALE TGQRGIINKGWGGLGNL EPKDF+YLLDN PHDWLFLQC AV
Sbjct: 437 PVQEPEKMTQIIVEALEKTGQRGIINKGWGGLGNLAEPKDFIYLLDNVPHDWLFLQCAAV 496

Query: 301 VHHGGAGTTAAGLKAACPTTIIPFFGDQPFWGERVHARGVGPAPIPVEEFSFNKLVDAIN 360
           VHHGGAGTTAAGLKAACPTT++PFFGDQPFWGERVHARGVGP PIPV+EF+  KLV AIN
Sbjct: 497 VHHGGAGTTAAGLKAACPTTVVPFFGDQPFWGERVHARGVGPPPIPVDEFTLPKLVYAIN 556

Query: 361 FMLDPKVKQSALELAKAMENEDGVEGAVKAFFKHYRPKKTEPESESEDSTVFSIRRCFGC 420
           FMLD +VK+ A+ELAKAME+EDGV GAV+AF KH   KK EPE     S+  SIR+CFGC
Sbjct: 557 FMLDAEVKERAVELAKAMEDEDGVTGAVRAFLKHLPRKKMEPEPVPAASSFLSIRKCFGC 616

Query: 421 S 422
           S
Sbjct: 617 S 617

BLAST of Cla002739 vs. NCBI nr
Match: gi|449468616|ref|XP_004152017.1| (PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2 isoform X1 [Cucumis sativus])

HSP 1 Score: 857.8 bits (2215), Expect = 7.8e-246
Identity = 410/421 (97.39%), Postives = 415/421 (98.57%), Query Frame = 1

Query: 1   MMDYGHRVRLATHSNFKEFVLTAGLEFFALGGDPKILAGYMVKNKGFLPSGPSEIPVQRN 60
           + DYGHRVRLATHSNFKEFVLTAGLEFFALGGDPKILAGYMVKNKGFLPSGPSEIPVQRN
Sbjct: 204 LQDYGHRVRLATHSNFKEFVLTAGLEFFALGGDPKILAGYMVKNKGFLPSGPSEIPVQRN 263

Query: 61  QMKEIIYSLLPACKEPDPESGIAFDAEAIIANPPAYGHTHVAEALKIPIHIFFTMPWTPT 120
           QMKEIIYSLLPACK+PDPESGI F+AEAIIANPPAYGHTHVAEALKIPIHIFFTMPWTPT
Sbjct: 264 QMKEIIYSLLPACKDPDPESGIPFEAEAIIANPPAYGHTHVAEALKIPIHIFFTMPWTPT 323

Query: 121 SEFPHPLSRVKQQAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGSHASESN 180
           SEFPHPLSRVKQQAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGSHASESN
Sbjct: 324 SEFPHPLSRVKQQAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGSHASESN 383

Query: 181 VPHGYIWSPHLVPKPKDWGPKVDVVGFCFLDLASTYEPPESLVNWLKAGDRPIYIGFGSL 240
           VPHGYIWSPHLVPKPKDWGPKVDVVGFCFLDLAS YEPPESLVNWLKAGDRPIYIGFGSL
Sbjct: 384 VPHGYIWSPHLVPKPKDWGPKVDVVGFCFLDLASNYEPPESLVNWLKAGDRPIYIGFGSL 443

Query: 241 PVQEPAKMTQIIVKALESTGQRGIINKGWGGLGNLEEPKDFVYLLDNCPHDWLFLQCKAV 300
           PVQEPAKMTQIIVKALESTGQRGIINKGWGGLGNLEEPKDFVYLLDNCPHDWLFLQCKAV
Sbjct: 444 PVQEPAKMTQIIVKALESTGQRGIINKGWGGLGNLEEPKDFVYLLDNCPHDWLFLQCKAV 503

Query: 301 VHHGGAGTTAAGLKAACPTTIIPFFGDQPFWGERVHARGVGPAPIPVEEFSFNKLVDAIN 360
           VHHGGAGTTAAGLKAACPTTIIPFFGDQPFWGERVHARGVGP+PIPVEEFSFNKLV+AIN
Sbjct: 504 VHHGGAGTTAAGLKAACPTTIIPFFGDQPFWGERVHARGVGPSPIPVEEFSFNKLVEAIN 563

Query: 361 FMLDPKVKQSALELAKAMENEDGVEGAVKAFFKHYRPKKTEPESESEDSTVFSIRRCFGC 420
           FMLDPKVKQSALELAKAMENEDGVEGAVKAFFKHYRPKK E ESE EDSTVFSIRRCFGC
Sbjct: 564 FMLDPKVKQSALELAKAMENEDGVEGAVKAFFKHYRPKKVEQESEPEDSTVFSIRRCFGC 623

Query: 421 S 422
           S
Sbjct: 624 S 624

BLAST of Cla002739 vs. NCBI nr
Match: gi|778682074|ref|XP_011651641.1| (PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2 isoform X2 [Cucumis sativus])

HSP 1 Score: 857.8 bits (2215), Expect = 7.8e-246
Identity = 410/421 (97.39%), Postives = 415/421 (98.57%), Query Frame = 1

Query: 1   MMDYGHRVRLATHSNFKEFVLTAGLEFFALGGDPKILAGYMVKNKGFLPSGPSEIPVQRN 60
           + DYGHRVRLATHSNFKEFVLTAGLEFFALGGDPKILAGYMVKNKGFLPSGPSEIPVQRN
Sbjct: 130 LQDYGHRVRLATHSNFKEFVLTAGLEFFALGGDPKILAGYMVKNKGFLPSGPSEIPVQRN 189

Query: 61  QMKEIIYSLLPACKEPDPESGIAFDAEAIIANPPAYGHTHVAEALKIPIHIFFTMPWTPT 120
           QMKEIIYSLLPACK+PDPESGI F+AEAIIANPPAYGHTHVAEALKIPIHIFFTMPWTPT
Sbjct: 190 QMKEIIYSLLPACKDPDPESGIPFEAEAIIANPPAYGHTHVAEALKIPIHIFFTMPWTPT 249

Query: 121 SEFPHPLSRVKQQAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGSHASESN 180
           SEFPHPLSRVKQQAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGSHASESN
Sbjct: 250 SEFPHPLSRVKQQAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGSHASESN 309

Query: 181 VPHGYIWSPHLVPKPKDWGPKVDVVGFCFLDLASTYEPPESLVNWLKAGDRPIYIGFGSL 240
           VPHGYIWSPHLVPKPKDWGPKVDVVGFCFLDLAS YEPPESLVNWLKAGDRPIYIGFGSL
Sbjct: 310 VPHGYIWSPHLVPKPKDWGPKVDVVGFCFLDLASNYEPPESLVNWLKAGDRPIYIGFGSL 369

Query: 241 PVQEPAKMTQIIVKALESTGQRGIINKGWGGLGNLEEPKDFVYLLDNCPHDWLFLQCKAV 300
           PVQEPAKMTQIIVKALESTGQRGIINKGWGGLGNLEEPKDFVYLLDNCPHDWLFLQCKAV
Sbjct: 370 PVQEPAKMTQIIVKALESTGQRGIINKGWGGLGNLEEPKDFVYLLDNCPHDWLFLQCKAV 429

Query: 301 VHHGGAGTTAAGLKAACPTTIIPFFGDQPFWGERVHARGVGPAPIPVEEFSFNKLVDAIN 360
           VHHGGAGTTAAGLKAACPTTIIPFFGDQPFWGERVHARGVGP+PIPVEEFSFNKLV+AIN
Sbjct: 430 VHHGGAGTTAAGLKAACPTTIIPFFGDQPFWGERVHARGVGPSPIPVEEFSFNKLVEAIN 489

Query: 361 FMLDPKVKQSALELAKAMENEDGVEGAVKAFFKHYRPKKTEPESESEDSTVFSIRRCFGC 420
           FMLDPKVKQSALELAKAMENEDGVEGAVKAFFKHYRPKK E ESE EDSTVFSIRRCFGC
Sbjct: 490 FMLDPKVKQSALELAKAMENEDGVEGAVKAFFKHYRPKKVEQESEPEDSTVFSIRRCFGC 549

Query: 421 S 422
           S
Sbjct: 550 S 550

BLAST of Cla002739 vs. NCBI nr
Match: gi|659092988|ref|XP_008447325.1| (PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2 isoform X2 [Cucumis melo])

HSP 1 Score: 857.8 bits (2215), Expect = 7.8e-246
Identity = 410/421 (97.39%), Postives = 415/421 (98.57%), Query Frame = 1

Query: 1   MMDYGHRVRLATHSNFKEFVLTAGLEFFALGGDPKILAGYMVKNKGFLPSGPSEIPVQRN 60
           + DYGHRVRLATHSNFKEFVLTAGLEFFALGGDPKILAGYMVKNKGFLPSGPSEIPVQRN
Sbjct: 130 LQDYGHRVRLATHSNFKEFVLTAGLEFFALGGDPKILAGYMVKNKGFLPSGPSEIPVQRN 189

Query: 61  QMKEIIYSLLPACKEPDPESGIAFDAEAIIANPPAYGHTHVAEALKIPIHIFFTMPWTPT 120
           QMKEIIYSLLPACK+PDPESGI F+AEAIIANPPAYGHTHVAEALKIPIHIFFTMPWTPT
Sbjct: 190 QMKEIIYSLLPACKDPDPESGIPFEAEAIIANPPAYGHTHVAEALKIPIHIFFTMPWTPT 249

Query: 121 SEFPHPLSRVKQQAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGSHASESN 180
           SEFPHPLSRVKQQAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGSHASESN
Sbjct: 250 SEFPHPLSRVKQQAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGSHASESN 309

Query: 181 VPHGYIWSPHLVPKPKDWGPKVDVVGFCFLDLASTYEPPESLVNWLKAGDRPIYIGFGSL 240
           VPHGYIWSPHLVPKPKDWGPKVDVVGFCFLDLAS YEPPESLVNWLKAGD+PIYIGFGSL
Sbjct: 310 VPHGYIWSPHLVPKPKDWGPKVDVVGFCFLDLASNYEPPESLVNWLKAGDKPIYIGFGSL 369

Query: 241 PVQEPAKMTQIIVKALESTGQRGIINKGWGGLGNLEEPKDFVYLLDNCPHDWLFLQCKAV 300
           PVQEPAKMTQIIVKALESTGQRGIINKGWGGLGNLEEPKDFVYLLDNCPHDWLFLQCKAV
Sbjct: 370 PVQEPAKMTQIIVKALESTGQRGIINKGWGGLGNLEEPKDFVYLLDNCPHDWLFLQCKAV 429

Query: 301 VHHGGAGTTAAGLKAACPTTIIPFFGDQPFWGERVHARGVGPAPIPVEEFSFNKLVDAIN 360
           VHHGGAGTTAAGLKAACPTTIIPFFGDQPFWGERVHARGVGPAPIPVEEFSFNKLV+AIN
Sbjct: 430 VHHGGAGTTAAGLKAACPTTIIPFFGDQPFWGERVHARGVGPAPIPVEEFSFNKLVEAIN 489

Query: 361 FMLDPKVKQSALELAKAMENEDGVEGAVKAFFKHYRPKKTEPESESEDSTVFSIRRCFGC 420
           FMLDPKVKQSALELAKAMENEDGVEGAVKAFFKHYRPKK E ESE EDSTVFSIRRCFGC
Sbjct: 490 FMLDPKVKQSALELAKAMENEDGVEGAVKAFFKHYRPKKAEQESEPEDSTVFSIRRCFGC 549

Query: 421 S 422
           S
Sbjct: 550 S 550

BLAST of Cla002739 vs. NCBI nr
Match: gi|659092986|ref|XP_008447324.1| (PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2 isoform X1 [Cucumis melo])

HSP 1 Score: 857.8 bits (2215), Expect = 7.8e-246
Identity = 410/421 (97.39%), Postives = 415/421 (98.57%), Query Frame = 1

Query: 1   MMDYGHRVRLATHSNFKEFVLTAGLEFFALGGDPKILAGYMVKNKGFLPSGPSEIPVQRN 60
           + DYGHRVRLATHSNFKEFVLTAGLEFFALGGDPKILAGYMVKNKGFLPSGPSEIPVQRN
Sbjct: 203 LQDYGHRVRLATHSNFKEFVLTAGLEFFALGGDPKILAGYMVKNKGFLPSGPSEIPVQRN 262

Query: 61  QMKEIIYSLLPACKEPDPESGIAFDAEAIIANPPAYGHTHVAEALKIPIHIFFTMPWTPT 120
           QMKEIIYSLLPACK+PDPESGI F+AEAIIANPPAYGHTHVAEALKIPIHIFFTMPWTPT
Sbjct: 263 QMKEIIYSLLPACKDPDPESGIPFEAEAIIANPPAYGHTHVAEALKIPIHIFFTMPWTPT 322

Query: 121 SEFPHPLSRVKQQAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGSHASESN 180
           SEFPHPLSRVKQQAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGSHASESN
Sbjct: 323 SEFPHPLSRVKQQAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGSHASESN 382

Query: 181 VPHGYIWSPHLVPKPKDWGPKVDVVGFCFLDLASTYEPPESLVNWLKAGDRPIYIGFGSL 240
           VPHGYIWSPHLVPKPKDWGPKVDVVGFCFLDLAS YEPPESLVNWLKAGD+PIYIGFGSL
Sbjct: 383 VPHGYIWSPHLVPKPKDWGPKVDVVGFCFLDLASNYEPPESLVNWLKAGDKPIYIGFGSL 442

Query: 241 PVQEPAKMTQIIVKALESTGQRGIINKGWGGLGNLEEPKDFVYLLDNCPHDWLFLQCKAV 300
           PVQEPAKMTQIIVKALESTGQRGIINKGWGGLGNLEEPKDFVYLLDNCPHDWLFLQCKAV
Sbjct: 443 PVQEPAKMTQIIVKALESTGQRGIINKGWGGLGNLEEPKDFVYLLDNCPHDWLFLQCKAV 502

Query: 301 VHHGGAGTTAAGLKAACPTTIIPFFGDQPFWGERVHARGVGPAPIPVEEFSFNKLVDAIN 360
           VHHGGAGTTAAGLKAACPTTIIPFFGDQPFWGERVHARGVGPAPIPVEEFSFNKLV+AIN
Sbjct: 503 VHHGGAGTTAAGLKAACPTTIIPFFGDQPFWGERVHARGVGPAPIPVEEFSFNKLVEAIN 562

Query: 361 FMLDPKVKQSALELAKAMENEDGVEGAVKAFFKHYRPKKTEPESESEDSTVFSIRRCFGC 420
           FMLDPKVKQSALELAKAMENEDGVEGAVKAFFKHYRPKK E ESE EDSTVFSIRRCFGC
Sbjct: 563 FMLDPKVKQSALELAKAMENEDGVEGAVKAFFKHYRPKKAEQESEPEDSTVFSIRRCFGC 622

Query: 421 S 422
           S
Sbjct: 623 S 623

BLAST of Cla002739 vs. NCBI nr
Match: gi|659092990|ref|XP_008447326.1| (PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2 isoform X3 [Cucumis melo])

HSP 1 Score: 857.1 bits (2213), Expect = 1.3e-245
Identity = 410/419 (97.85%), Postives = 414/419 (98.81%), Query Frame = 1

Query: 3   DYGHRVRLATHSNFKEFVLTAGLEFFALGGDPKILAGYMVKNKGFLPSGPSEIPVQRNQM 62
           DYGHRVRLATHSNFKEFVLTAGLEFFALGGDPKILAGYMVKNKGFLPSGPSEIPVQRNQM
Sbjct: 125 DYGHRVRLATHSNFKEFVLTAGLEFFALGGDPKILAGYMVKNKGFLPSGPSEIPVQRNQM 184

Query: 63  KEIIYSLLPACKEPDPESGIAFDAEAIIANPPAYGHTHVAEALKIPIHIFFTMPWTPTSE 122
           KEIIYSLLPACK+PDPESGI F+AEAIIANPPAYGHTHVAEALKIPIHIFFTMPWTPTSE
Sbjct: 185 KEIIYSLLPACKDPDPESGIPFEAEAIIANPPAYGHTHVAEALKIPIHIFFTMPWTPTSE 244

Query: 123 FPHPLSRVKQQAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGSHASESNVP 182
           FPHPLSRVKQQAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGSHASESNVP
Sbjct: 245 FPHPLSRVKQQAGYRLSYQIVDSLIWLGIRDMINDLRKKRLKLRPVTYLSGSHASESNVP 304

Query: 183 HGYIWSPHLVPKPKDWGPKVDVVGFCFLDLASTYEPPESLVNWLKAGDRPIYIGFGSLPV 242
           HGYIWSPHLVPKPKDWGPKVDVVGFCFLDLAS YEPPESLVNWLKAGD+PIYIGFGSLPV
Sbjct: 305 HGYIWSPHLVPKPKDWGPKVDVVGFCFLDLASNYEPPESLVNWLKAGDKPIYIGFGSLPV 364

Query: 243 QEPAKMTQIIVKALESTGQRGIINKGWGGLGNLEEPKDFVYLLDNCPHDWLFLQCKAVVH 302
           QEPAKMTQIIVKALESTGQRGIINKGWGGLGNLEEPKDFVYLLDNCPHDWLFLQCKAVVH
Sbjct: 365 QEPAKMTQIIVKALESTGQRGIINKGWGGLGNLEEPKDFVYLLDNCPHDWLFLQCKAVVH 424

Query: 303 HGGAGTTAAGLKAACPTTIIPFFGDQPFWGERVHARGVGPAPIPVEEFSFNKLVDAINFM 362
           HGGAGTTAAGLKAACPTTIIPFFGDQPFWGERVHARGVGPAPIPVEEFSFNKLV+AINFM
Sbjct: 425 HGGAGTTAAGLKAACPTTIIPFFGDQPFWGERVHARGVGPAPIPVEEFSFNKLVEAINFM 484

Query: 363 LDPKVKQSALELAKAMENEDGVEGAVKAFFKHYRPKKTEPESESEDSTVFSIRRCFGCS 422
           LDPKVKQSALELAKAMENEDGVEGAVKAFFKHYRPKK E ESE EDSTVFSIRRCFGCS
Sbjct: 485 LDPKVKQSALELAKAMENEDGVEGAVKAFFKHYRPKKAEQESEPEDSTVFSIRRCFGCS 543

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
U80A2_ARATH1.0e-21382.70Sterol 3-beta-glucosyltransferase UGT80A2 OS=Arabidopsis thaliana GN=UGT80A2 PE=... [more]
U80B1_ARATH3.4e-15661.52Sterol 3-beta-glucosyltransferase UGT80B1 OS=Arabidopsis thaliana GN=UGT80B1 PE=... [more]
ATG26_YARLI2.9e-5934.85Sterol 3-beta-glucosyltransferase OS=Yarrowia lipolytica (strain CLIB 122 / E 15... [more]
ATG26_ASPNC1.0e-5633.49Sterol 3-beta-glucosyltransferase OS=Aspergillus niger (strain CBS 513.88 / FGSC... [more]
ATG26_PICPG1.8e-5634.20Sterol 3-beta-glucosyltransferase OS=Komagataella pastoris (strain GS115 / ATCC ... [more]
Match NameE-valueIdentityDescription
B9IB34_POPTR5.3e-21783.96Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0014s17640g PE=4 SV=2[more]
A0A061DVF1_THECC2.6e-21684.29UDP-Glycosyltransferase superfamily protein isoform 4 OS=Theobroma cacao GN=TCM_... [more]
A0A061DNI4_THECC2.6e-21684.29UDP-Glycosyltransferase superfamily protein isoform 3 OS=Theobroma cacao GN=TCM_... [more]
A0A0D2P7F2_GOSRA1.4e-21484.12Uncharacterized protein OS=Gossypium raimondii GN=B456_004G058300 PE=4 SV=1[more]
A0A068U5T1_COFCA1.2e-21383.85Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00016342001 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
gi|449468616|ref|XP_004152017.1|7.8e-24697.39PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2 isoform X1 [Cucumis sativus... [more]
gi|778682074|ref|XP_011651641.1|7.8e-24697.39PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2 isoform X2 [Cucumis sativus... [more]
gi|659092988|ref|XP_008447325.1|7.8e-24697.39PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2 isoform X2 [Cucumis melo][more]
gi|659092986|ref|XP_008447324.1|7.8e-24697.39PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2 isoform X1 [Cucumis melo][more]
gi|659092990|ref|XP_008447326.1|1.3e-24597.85PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2 isoform X3 [Cucumis melo][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002213UDP_glucos_trans
IPR004276GlycoTrans_28_N
Vocabulary: Molecular Function
TermDefinition
GO:0016758transferase activity, transferring hexosyl groups
Vocabulary: Biological Process
TermDefinition
GO:0008152metabolic process
GO:0005975carbohydrate metabolic process
GO:0030259lipid glycosylation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0030259 lipid glycosylation
biological_process GO:0048316 seed development
biological_process GO:0016125 sterol metabolic process
biological_process GO:0005975 carbohydrate metabolic process
biological_process GO:0008152 metabolic process
biological_process GO:0015074 DNA integration
biological_process GO:0009813 flavonoid biosynthetic process
biological_process GO:0052696 flavonoid glucuronidation
cellular_component GO:0005886 plasma membrane
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0051507 beta-sitosterol UDP-glucosyltransferase activity
molecular_function GO:0016758 transferase activity, transferring hexosyl groups
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0080043 quercetin 3-O-glucosyltransferase activity
molecular_function GO:0080044 quercetin 7-O-glucosyltransferase activity
molecular_function GO:0016906 sterol 3-beta-glucosyltransferase activity
This gene is associated with the following unigenes:
Unigene NameAnalysis NameSequence type in Unigene
WMU12360watermelon unigene v2 vs TrEMBLtranscribed_cluster
WMU12794watermelon unigene v2 vs TrEMBLtranscribed_cluster
WMU30736watermelon unigene v2 vs TrEMBLtranscribed_cluster
WMU53813watermelon EST collection version 2.0transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla002739Cla002739.1mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature NameUnique NameType
WMU12360WMU12360transcribed_cluster
WMU53813WMU53813transcribed_cluster
WMU12794WMU12794transcribed_cluster
WMU30736WMU30736transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePANTHERPTHR11926GLUCOSYL/GLUCURONOSYL TRANSFERASEScoord: 1..420
score: 9.8E
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePFAMPF00201UDPGTcoord: 234..376
score: 4.
IPR004276Glycosyltransferase family 28, N-terminal domainPFAMPF03033Glyco_transf_28coord: 2..124
score: 4.4
NoneNo IPR availableGENE3DG3DSA:3.40.50.2000coord: 239..371
score: 6.5E-5coord: 2..238
score: 4.9
NoneNo IPR availablePANTHERPTHR11926:SF308STEROL 3-BETA-GLUCOSYLTRANSFERASE UGT80A2coord: 1..420
score: 9.8E
NoneNo IPR availableunknownSSF53756UDP-Glycosyltransferase/glycogen phosphorylasecoord: 2..394
score: 3.68

The following gene(s) are paralogous to this gene:

None