CmoCh05G004550 (gene) Cucurbita moschata (Rifu)

NameCmoCh05G004550
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionUDP-Glycosyltransferase superfamily protein
LocationCmo_Chr05 : 2114866 .. 2120021 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TCTTCACTATATAAACTACTACGACATTCTCATCTCCGACGGAATCGCTCGCCGGAAATGGAGAAGACGACGGTGGACGGAGGAGGAGAGATGAAACAGAGTCATGTAATCGTATTCCCTTTCCCAAGGCACGGCCACATGAATCCGATGCTCCAATTCGCGAAGCGATTAGTCTCCAAAGGCCTTCTCCTAACATTCCTCACCACTTCCTCAGCAAGTGAATCCCTAATTCTCGATCTCCCTCCCTCTCCGATCCGTCACAAAGTCATCTCCGATGATCCTGAATCCAACAACATCGACAGTCTCGACGCTTATCTCCGAAGCTTCCGAGCCGCCGCCTCCAAATCCTTGGCCAATTTCATCGACGAAGCCCTAATTTCTGATTCCAATGAAGTTCTTCCGAGTCTTATCGTTTACGACTCTGTTATGCCCTGGGTGCAGAGCGTCGCCGCAGAGCGTGGTTTGGATGCGGCTCCGTTTTTCACACAATCCGCCGCCGTTAATCATATCCTAGATCTCGTCTATAAAGGATCTCTGAGCATTCCGCCACCGGAGGATGTAGCGGTTTCGCTTCCATCGGAGATTGTTCTTCAACCAGCAGATCTGCCTACCTTGCCTGACGATGGAGATGTGGTTTTGGAGTTCATGACCAGTCAGTTCATCAATTTGGAGAATGTAAAGTGGATTTTCTTCAACACGTTTGATCGGCTCGAATGCAAGGTAATTGATGCTTCAAACACGATTCTAATCTGAACTTTTTTTGATAGCTAAAATTCAAAATTCGAGTACAAGTGACGTACGATCAAAGAGTCTTAAGCCCAATAATTATTAGGAGCTTATATTTTTTTTAGAACAATAATTTAAATATTTTACAACTAAAAATGATTTATACATAGAAATTGAAAAACAATTTAGATATTTTATAAATACTTCTCCATTTTATCTATGTTTTCATCCCAAAAAATTTCCAGCAGTCTCAATTATAAAATGTAATAATATTGTACCCAATGGGTAAAGTTAGTGTGTGTGTATATATATATATATATACACATACGTATACACAACTTTACCCATATATATATATACACACACATATAGATATACATATATACACATATATATCTACATACACACACATATATACATATATACATATATACACACACATATATACATATATATACACACATATATACATATATATACACACATATATATAATATTAATTACAAAATTATACATCATTTTTATTTAGGTTTATATTTTATTTGCTTGCTTTAATAATAACAATAAAAACTAGTATTAATTTTTTTAAAAAAAATAGTAGTTACGTCTAATTAATCAATTTGGAAGTTAAAAAATATATTTGGAAGTTAATCATCATTGTTATTTTATATCTAGAATTTCAAGTTTACTTAATTACTTGAATAATTTGAAACAATATAATATCAGGTTGTTAATTGGATGACCAAGACATTGCCCATTAAGACAGTGGGGCCGACCATTCCATCGGCATATTTGGATGGTCGATTGGTGGACGACAAAGCCTATGGGTTGAATGTCTTGAATCCCAATGATGGGAAGAAGGCTATAGAGTGGTTAGACTCGAAAGAAACTGCTTCAGTTATTTATATTTCATTTGGAAGTTTGGTTAACTTAGAAAAAGAACAAGTAACGGAGCTGACATGTTTTCTTAGAAACACGAATCTTTCCTTCTTATGGGTCCTAAGAGAATCCGAACTGGGAAAGCTTCCTAACAACTTTGTTCAAGATACATCAGAACAAGGCCTAATCGTGAACTGGTGTTGTCAACTAGAAGTTCTATCACATAAGGCAGTAAGTTGTTTCGTGACTCATTGTGGTTGGAATTCAACCATAGAAGCATTGAGCTTGGGGGTGCCAATGATTGCAATCCCCCAATGGGTCGATCAAACGACGAACGCCAAGTTCGTCGCAGATGTTTGGGAAGTCGGAGTTCGAGTGAAGAAGAACGACAAAGGCATTGTGACAAAGGAAGAACTAGAGGCCTCGATCCGAAAGATTGTTCAAGGAGAGAAACCAAATGAGATTAAACAGAACTCAATCAAGTGGAAGAAAGTAGCTAAAGAAGCCATGGACGAAGGAGGCAGCTCTGATAAGAACATTGATGAATTTGTCCAAGCAATGGCTGCGTCAATCATCTAGGTAACAACATAATCTTTATCTTACATTCATTTGTCTTTCAATTCCACGTAGAAATAAACTATCTTCCATGATAATACTAATTTGGAACATGTCTTTCAAAAATGCTTAAATACATTCAAGCCAAACCATTCTAACAGCATCCAACATCTTCCCTCCAACATTGTCAAAGTAAATGTCTATTCCTTCGGAGAAACATCTGAACAAGCAAACAGATATGCAGCCAGCCCAAACATGATATCAATCCTCAATCCTCCCTTGCTGTAACGACCCAGATCCACCGCCAGCAGATATTGTCCTCTTTGGGATTTCCTTTTCGGGCTTCCCCTCAAGACTTTAAAACGTGTCTGTTAAGGGAAGGTTTCCACACTTTTATAAATGGTGATTTGTTCTCCTCCCCAACCAATGTGGGACATCACAATACCAAATGGTATCAGAGCCAGACACTGGACGATGTGCCAGTCTTCTCGTTGTTCTCCGAAGGAGGTAGACACGAGGCGATGTGCCAGTAAGGACACTGGGCCCCAAAGGGGAGTGGATTTGGGGGCGGCCCCACGTCGATTGGAGGAAGGAAAGAGTGCCAGCAAGGACGCTAGGGCCCGAAGGGAAGTGGATTGTGATGTCCCACGTTGGTTGGGGAGGAGAACAAATCACCATTTATAAGGGTGTGGAAACTTTCCCCCACGCGTTTTAAAGCGTTAAGGGAAAGCCCGAAAGGAAAAGCTCAAAAGGAAAAGTCCAAAGAAGACAATATCTGCCAACGGTGGATATGGGTCGTTACAGTTGCAATCTTTATACAATCTTTATACAGGAAGAAAAACATACCTTCTCAGAGTGGCTTCCAAGTCTGGCTCTTCTTTGTAGTTGAAAACCTCATGGAACCCAAGTTTGTTCTTCAGCAAGTCAATCTGATAATGAATGAGATATGAGAGTTCTTCTACCAAAATTGGGTTTAATTGAAAGATTATGCATAGGTTTTGGAGGTTATGAAATCGCCCGGTTTGTCCGCTTTGGCCCATTACGAATCGTCATTAACCTCACAGTTTCAAAACACGTCTGTTAGAGAGAGGTTTCCACACCCTTGTAAGGAATGTTTCGTTTTCTTCTCCAACCAACATAGGATCTCACAATCCATCCCCTTTAGGGGCCAACGTCCTTGCTAGCACACCGCTTGGTATCTGGTTCTGATGCCATTTGTAACAGCCCAAGCTGCGGGTAGATACTATCCTTTAAGAGGTTTCCACACTCTTACCTATAAGGAATGTTTTGTTCTACTCTCCAAACTAATATGGGATCTCACAATCCCCACGCTGACACACCGTTCAGTGTCTAACTCTGATACCATTTGTAACAGTCCAAATTCACTACTAACCGTTAGTATCCGCTTTGGTCTATTATGTATCACCGTCACCCTCACGGTTTTGAAACGCGACTATTAGGAAGAGGTTTCCACACCCTTATAAGGGATGAGATCTCACATCCACTCCTTGGTGGCCAGTGTCAGCATCCTCACTGGCACACCGCTCGGTGTCCACCTCTAATACCAATTGTAACAACTCAAACTCACCACCACTAGATATTATCCGCTTTGGCCCATTAGGTATCATCATCGACCTCACTATTTTCACTGTCTATTAAGGAGAGGTTTCCACACCCTTAAATGTTTCATTCTCTCTCCAACCAACATGTGATCTCACAATCCACCCCTTGGGGGTCAGCGTCCTCACCGACACACCTCTCACTCCTTGGGGGACAGCGTCCTCACTGACACAACTCTCGGTATCCAACTCTAGTACCCAAGCCCACCACTAGCAGACAGATATTGTCTGTTTTGACCCACTACATATCACCGTCAACCTCACAATTTTCCACACCCTTATAAAAAGGAATGTTTCGTTCTCCTGTTCTCCTCTCCAACTCCAACCGATTTGAAATCTCACAACCACAGACAACTTATATATATATATATATATATAAATATCAAACCTTCTCTTTGCTACCAGCACAGCCAACCACATGACACCCCATAAACTTGGCAACCTGGCCAACAACTTGGCCAACAGCCCCAGAAGCAGCAGAGACGAACACATATTCTCCTTTCTTAGGACAACAAATCTCAAAGAAACCAGCATAAGCAGTCATCCCAGTCACACCTACAATTAACTTAGTCAAAAAAGCATTGCAAGATCCAAACCAAAGAGATCATAACAACAACAACAATACCAAGAATCCCTGTGTAATAAGACAGAGGAACATCCGTATGATCAATTTTAAATAGATTATCAGATTCCAATCTTGAAAGCACAGAATACTCTTCCCACCCCGTGCTCCCCCACACGAAATCGCCTTTCCCAAACTTCGGATGACCAGAATCCACAACCTTCCCAACTCCAAATCCCCTAATAGGCTGCAAAACAAAGAAATAAATTCACTATCGGAACTACAAAACCGACGAAACGACGTCGAAGCGTCTCGTACCAAACCAGGACTGAACGAAACAAAATCAATCGACGGATTCTCAATCTTGTTCATGCAGGCATGCATATATGGATCGCAGGAGAGGTACAGATTTTTGAGGAGAACTCCATCGGAGCCTTCGGGGAGCTTCAAATTGATGGTTCTGGTGGAGATTACTTCCAGATCCGACTCTTTCGCGGATCCTCCGGTGACATAGTCCTTTAGAATCACCTGTTTGTTGATTACTTCGCCGCTGTCGCCGCCGCGGCTCTGCATCATGTTCTCTGGAGAAAGGGAAATAGCGTAAAGGAAGGGTACTGAGGAAGAAGGGTCTCGAAATTTTGAACAGAACCAACCTTTTCCTTTGATATAAATATCAGGAATAGTTTCGTTCAACTGCGCGGCAGAAAAAATTTAAAGATTTTTAAGAATATAATTAAATGATAATAATTAATACGTAGTATATTGCTATTATTTTTTCATGCAAAACTTCTACCCAATGGATATTTAACACGTGG

mRNA sequence

TCTTCACTATATAAACTACTACGACATTCTCATCTCCGACGGAATCGCTCGCCGGAAATGGAGAAGACGACGGTGGACGGAGGAGGAGAGATGAAACAGAGTCATGTAATCGTATTCCCTTTCCCAAGGCACGGCCACATGAATCCGATGCTCCAATTCGCGAAGCGATTAGTCTCCAAAGGCCTTCTCCTAACATTCCTCACCACTTCCTCAGCAAGTGAATCCCTAATTCTCGATCTCCCTCCCTCTCCGATCCGTCACAAAGTCATCTCCGATGATCCTGAATCCAACAACATCGACAGTCTCGACGCTTATCTCCGAAGCTTCCGAGCCGCCGCCTCCAAATCCTTGGCCAATTTCATCGACGAAGCCCTAATTTCTGATTCCAATGAAGTTCTTCCGAGTCTTATCGTTTACGACTCTGTTATGCCCTGGGTGCAGAGCGTCGCCGCAGAGCGTGGTTTGGATGCGGCTCCGTTTTTCACACAATCCGCCGCCGTTAATCATATCCTAGATCTCGTCTATAAAGGATCTCTGAGCATTCCGCCACCGGAGGATGTAGCGGTTTCGCTTCCATCGGAGATTGTTCTTCAACCAGCAGATCTGCCTACCTTGCCTGACGATGGAGATGTGGTTTTGGAGTTCATGACCAGTCAGTTCATCAATTTGGAGAATGTAAAGTGGATTTTCTTCAACACGTTTGATCGGCTCGAATGCAAGGTTGTTAATTGGATGACCAAGACATTGCCCATTAAGACAGTGGGGCCGACCATTCCATCGGCATATTTGGATGGTCGATTGGTGGACGACAAAGCCTATGGGTTGAATGTCTTGAATCCCAATGATGGGAAGAAGGCTATAGAGTGGTTAGACTCGAAAGAAACTGCTTCAGTTATTTATATTTCATTTGGAAGTTTGGTTAACTTAGAAAAAGAACAAGTAACGGAGCTGACATGTTTTCTTAGAAACACGAATCTTTCCTTCTTATGGGTCCTAAGAGAATCCGAACTGGGAAAGCTTCCTAACAACTTTGTTCAAGATACATCAGAACAAGGCCTAATCGTGAACTGGTGTTGTCAACTAGAAGTTCTATCACATAAGGCAGTAAGTTGTTTCGTGACTCATTGTGGTTGGAATTCAACCATAGAAGCATTGAGCTTGGGGGTGCCAATGATTGCAATCCCCCAATGGGTCGATCAAACGACGAACGCCAAGTTCGTCGCAGATGTTTGGGAAGTCGGAGTTCGAGTGAAGAAGAACGACAAAGGCATTGTGACAAAGGAAGAACTAGAGGCCTCGATCCGAAAGATTGTTCAAGGAGAGAAACCAAATGAGATTAAACAGAACTCAATCAAGTGGAAGAAAGTAGCTAAAGAAGCCATGGACGAAGGAGGCAGCTCTGATAAGAACATTGATGAATTTGTCCAAGCAATGGCTGCGTCAATCATCTAGGCATGCATATATGGATCGCAGGAGAGGTACAGATTTTTGAGGAGAACTCCATCGGAGCCTTCGGGGAGCTTCAAATTGATGGTTCTGGTGGAGATTACTTCCAGATCCGACTCTTTCGCGGATCCTCCGGTGACATAGTCCTTTAGAATCACCTGTTTGTTGATTACTTCGCCGCTGTCGCCGCCGCGGCTCTGCATCATGTTCTCTGGAGAAAGGGAAATAGCGTAAAGGAAGGGTACTGAGGAAGAAGGGTCTCGAAATTTTGAACAGAACCAACCTTTTCCTTTGATATAAATATCAGGAATAGTTTCGTTCAACTGCGCGGCAGAAAAAATTTAAAGATTTTTAAGAATATAATTAAATGATAATAATTAATACGTAGTATATTGCTATTATTTTTTCATGCAAAACTTCTACCCAATGGATATTTAACACGTGG

Coding sequence (CDS)

ATGGAGAAGACGACGGTGGACGGAGGAGGAGAGATGAAACAGAGTCATGTAATCGTATTCCCTTTCCCAAGGCACGGCCACATGAATCCGATGCTCCAATTCGCGAAGCGATTAGTCTCCAAAGGCCTTCTCCTAACATTCCTCACCACTTCCTCAGCAAGTGAATCCCTAATTCTCGATCTCCCTCCCTCTCCGATCCGTCACAAAGTCATCTCCGATGATCCTGAATCCAACAACATCGACAGTCTCGACGCTTATCTCCGAAGCTTCCGAGCCGCCGCCTCCAAATCCTTGGCCAATTTCATCGACGAAGCCCTAATTTCTGATTCCAATGAAGTTCTTCCGAGTCTTATCGTTTACGACTCTGTTATGCCCTGGGTGCAGAGCGTCGCCGCAGAGCGTGGTTTGGATGCGGCTCCGTTTTTCACACAATCCGCCGCCGTTAATCATATCCTAGATCTCGTCTATAAAGGATCTCTGAGCATTCCGCCACCGGAGGATGTAGCGGTTTCGCTTCCATCGGAGATTGTTCTTCAACCAGCAGATCTGCCTACCTTGCCTGACGATGGAGATGTGGTTTTGGAGTTCATGACCAGTCAGTTCATCAATTTGGAGAATGTAAAGTGGATTTTCTTCAACACGTTTGATCGGCTCGAATGCAAGGTTGTTAATTGGATGACCAAGACATTGCCCATTAAGACAGTGGGGCCGACCATTCCATCGGCATATTTGGATGGTCGATTGGTGGACGACAAAGCCTATGGGTTGAATGTCTTGAATCCCAATGATGGGAAGAAGGCTATAGAGTGGTTAGACTCGAAAGAAACTGCTTCAGTTATTTATATTTCATTTGGAAGTTTGGTTAACTTAGAAAAAGAACAAGTAACGGAGCTGACATGTTTTCTTAGAAACACGAATCTTTCCTTCTTATGGGTCCTAAGAGAATCCGAACTGGGAAAGCTTCCTAACAACTTTGTTCAAGATACATCAGAACAAGGCCTAATCGTGAACTGGTGTTGTCAACTAGAAGTTCTATCACATAAGGCAGTAAGTTGTTTCGTGACTCATTGTGGTTGGAATTCAACCATAGAAGCATTGAGCTTGGGGGTGCCAATGATTGCAATCCCCCAATGGGTCGATCAAACGACGAACGCCAAGTTCGTCGCAGATGTTTGGGAAGTCGGAGTTCGAGTGAAGAAGAACGACAAAGGCATTGTGACAAAGGAAGAACTAGAGGCCTCGATCCGAAAGATTGTTCAAGGAGAGAAACCAAATGAGATTAAACAGAACTCAATCAAGTGGAAGAAAGTAGCTAAAGAAGCCATGGACGAAGGAGGCAGCTCTGATAAGAACATTGATGAATTTGTCCAAGCAATGGCTGCGTCAATCATCTAG
BLAST of CmoCh05G004550 vs. Swiss-Prot
Match: U74E2_ARATH (UDP-glycosyltransferase 74E2 OS=Arabidopsis thaliana GN=UGT74E2 PE=1 SV=1)

HSP 1 Score: 386.7 bits (992), Expect = 3.5e-106
Identity = 205/455 (45.05%), Postives = 282/455 (61.98%), Query Frame = 1

Query: 15  SHVIVFPFPRHGHMNPMLQFAKRLVSKGLLLTFLTTSSASESLILDLPPSPIRHKVISDD 74
           SH+IV PFP  GH+ PM QF KRL SKGL LT +  S          PP    H  I+  
Sbjct: 5   SHLIVLPFPGQGHITPMSQFCKRLASKGLKLTLVLVSDKPS------PPYKTEHDSITVF 64

Query: 75  PESNN-------IDSLDAYLRSFRAAASKSLANFIDEALISDSNEVLPSLIVYDSVMPWV 134
           P SN        +  LD Y+     +   +L   +++  +S +    P  IVYDS MPW+
Sbjct: 65  PISNGFQEGEEPLQDLDDYMERVETSIKNTLPKLVEDMKLSGNP---PRAIVYDSTMPWL 124

Query: 135 QSVAAERGLDAAPFFTQSAAVNHILDLVYKGSLSIPPPE---DVAVSLPSEIVLQPADLP 194
             VA   GL  A FFTQ   V  I   V+KGS S+P  +       S PS  +L   DLP
Sbjct: 125 LDVAHSYGLSGAVFFTQPWLVTAIYYHVFKGSFSVPSTKYGHSTLASFPSFPMLTANDLP 184

Query: 195 TLPDDGDV---VLEFMTSQFINLENVKWIFFNTFDRLECKVVNWMTKTLPIKTVGPTIPS 254
           +   +      +L  +  Q  N++ V  +  NTFD+LE K++ W+    P+  +GPT+PS
Sbjct: 185 SFLCESSSYPNILRIVVDQLSNIDRVDIVLCNTFDKLEEKLLKWVQSLWPVLNIGPTVPS 244

Query: 255 AYLDGRLVDDKAYGLNVLNPNDGKKAIEWLDSKETASVIYISFGSLVNLEKEQVTELTCF 314
            YLD RL +DK YG ++ N    +  +EWL+SKE  SV+Y+SFGSLV L+++Q+ EL   
Sbjct: 245 MYLDKRLSEDKNYGFSLFNAKVAE-CMEWLNSKEPNSVVYLSFGSLVILKEDQMLELAAG 304

Query: 315 LRNTNLSFLWVLRESELGKLPNNFVQDTSEQGLIVNWCCQLEVLSHKAVSCFVTHCGWNS 374
           L+ +   FLWV+RE+E  KLP N+V++  E+GLIV+W  QL+VL+HK++ CF+THCGWNS
Sbjct: 305 LKQSGRFFLWVVRETETHKLPRNYVEEIGEKGLIVSWSPQLDVLAHKSIGCFLTHCGWNS 364

Query: 375 TIEALSLGVPMIAIPQWVDQTTNAKFVADVWEVGVRVKKNDKGIVTKEELEASIRKIVQG 434
           T+E LSLGVPMI +P W DQ TNAKF+ DVW+VGVRVK    G V +EE+  S+ ++++G
Sbjct: 365 TLEGLSLGVPMIGMPHWTDQPTNAKFMQDVWKVGVRVKAEGDGFVRREEIMRSVEEVMEG 424

Query: 435 EKPNEIKQNSIKWKKVAKEAMDEGGSSDKNIDEFV 457
           EK  EI++N+ KWK +A+EA+ EGGSSDK+I+EFV
Sbjct: 425 EKGKEIRKNAEKWKVLAQEAVSEGGSSDKSINEFV 449

BLAST of CmoCh05G004550 vs. Swiss-Prot
Match: U74E1_ARATH (UDP-glycosyltransferase 74E1 OS=Arabidopsis thaliana GN=UGT74E1 PE=3 SV=1)

HSP 1 Score: 382.5 bits (981), Expect = 6.6e-105
Identity = 201/455 (44.18%), Postives = 279/455 (61.32%), Query Frame = 1

Query: 15  SHVIVFPFPRHGHMNPMLQFAKRLVSKGLLLTFLTTSSASESLILDLPPSPIRHKVISDD 74
           SHVIV PFP  GH+ PM QF KRL SK L +T +  S          PP    H  I+  
Sbjct: 5   SHVIVLPFPAQGHITPMSQFCKRLASKSLKITLVLVSDKPS------PPYKTEHDTITVV 64

Query: 75  PESNNI-------DSLDAYLRSFRAAASKSLANFIDEALISDSNEVLPSLIVYDSVMPWV 134
           P SN         + LD Y+    ++    L   I++  +S +    P  +VYDS MPW+
Sbjct: 65  PISNGFQEGQERSEDLDEYMERVESSIKNRLPKLIEDMKLSGNP---PRALVYDSTMPWL 124

Query: 135 QSVAAERGLDAAPFFTQSAAVNHILDLVYKGSLSIPPPE---DVAVSLPSEIVLQPADLP 194
             VA   GL  A FFTQ   V+ I   V+KGS S+P  +       S PS  +L   DLP
Sbjct: 125 LDVAHSYGLSGAVFFTQPWLVSAIYYHVFKGSFSVPSTKYGHSTLASFPSLPILNANDLP 184

Query: 195 TLPDDGD---VVLEFMTSQFINLENVKWIFFNTFDRLECKVVNWMTKTLPIKTVGPTIPS 254
           +   +      +L  +  Q  N++ V  +  NTFD+LE K++ W+    P+  +GPT+PS
Sbjct: 185 SFLCESSSYPYILRTVIDQLSNIDRVDIVLCNTFDKLEEKLLKWIKSVWPVLNIGPTVPS 244

Query: 255 AYLDGRLVDDKAYGLNVLNPNDGKKAIEWLDSKETASVIYISFGSLVNLEKEQVTELTCF 314
            YLD RL +DK YG ++      +  +EWL+SK+ +SV+Y+SFGSLV L+K+Q+ EL   
Sbjct: 245 MYLDKRLAEDKNYGFSLFGAKIAE-CMEWLNSKQPSSVVYVSFGSLVVLKKDQLIELAAG 304

Query: 315 LRNTNLSFLWVLRESELGKLPNNFVQDTSEQGLIVNWCCQLEVLSHKAVSCFVTHCGWNS 374
           L+ +   FLWV+RE+E  KLP N++++  E+GL V+W  QLEVL+HK++ CFVTHCGWNS
Sbjct: 305 LKQSGHFFLWVVRETERRKLPENYIEEIGEKGLTVSWSPQLEVLTHKSIGCFVTHCGWNS 364

Query: 375 TIEALSLGVPMIAIPQWVDQTTNAKFVADVWEVGVRVKKNDKGIVTKEELEASIRKIVQG 434
           T+E LSLGVPMI +P W DQ TNAKF+ DVW+VGVRVK +  G V +EE    + ++++ 
Sbjct: 365 TLEGLSLGVPMIGMPHWADQPTNAKFMEDVWKVGVRVKADSDGFVRREEFVRRVEEVMEA 424

Query: 435 EKPNEIKQNSIKWKKVAKEAMDEGGSSDKNIDEFV 457
           E+  EI++N+ KWK +A+EA+ EGGSSDKNI+EFV
Sbjct: 425 EQGKEIRKNAEKWKVLAQEAVSEGGSSDKNINEFV 449

BLAST of CmoCh05G004550 vs. Swiss-Prot
Match: U74D1_ARATH (UDP-glycosyltransferase 74D1 OS=Arabidopsis thaliana GN=UGT74D1 PE=1 SV=1)

HSP 1 Score: 366.7 bits (940), Expect = 3.8e-100
Identity = 200/463 (43.20%), Postives = 293/463 (63.28%), Query Frame = 1

Query: 10  GEMKQSHVIVFPFPRHGHMNPMLQFAKRLVSKGLLLTFLTTSSASESLI----------L 69
           GE  +++V+VF FP  GH+NP+LQF+KRL+SK + +TFLTTSS   S++          L
Sbjct: 2   GEKAKANVLVFSFPIQGHINPLLQFSKRLLSKNVNVTFLTTSSTHNSILRRAITGGATAL 61

Query: 70  DLPPSPIRHKVISDDPESNNIDSLDAYLRSFRAAASKSLANFIDEALISDSNEVLPSLIV 129
            L   PI      D P +   D+   Y   F+   S+SL+  I       S +  P+ +V
Sbjct: 62  PLSFVPIDDGFEEDHPST---DTSPDYFAKFQENVSRSLSELIS------SMDPKPNAVV 121

Query: 130 YDSVMPWVQSVAAER-GLDAAPFFTQSAAVNHILDLVYKGSLSIPPPEDVAVSLPSEIVL 189
           YDS +P+V  V  +  G+ AA FFTQS+ VN       +G       + V  ++P    L
Sbjct: 122 YDSCLPYVLDVCRKHPGVAAASFFTQSSTVNATYIHFLRGEFKEFQNDVVLPAMPP---L 181

Query: 190 QPADLPTLPDDGDV---VLEFMTSQFINLENVKWIFFNTFDRLECKVVNWMTKTLPIKTV 249
           +  DLP    D ++   + E ++SQF+N++++ +   N+FD LE +V+ WM    P+K +
Sbjct: 182 KGNDLPVFLYDNNLCRPLFELISSQFVNVDDIDFFLVNSFDELEVEVLQWMKNQWPVKNI 241

Query: 250 GPTIPSAYLDGRLVDDKAYGLNVLNPNDGKKAIEWLDSKETASVIYISFGSLVNLEKEQV 309
           GP IPS YLD RL  DK YG+N+ N    +  ++WLDSK   SVIY+SFGSL  L+ +Q+
Sbjct: 242 GPMIPSMYLDKRLAGDKDYGINLFNAQVNE-CLDWLDSKPPGSVIYVSFGSLAVLKDDQM 301

Query: 310 TELTCFLRNTNLSFLWVLRESELGKLPNNFVQDTSEQGLIVNWCCQLEVLSHKAVSCFVT 369
            E+   L+ T  +FLWV+RE+E  KLP+N+++D  ++GLIVNW  QL+VL+HK++ CF+T
Sbjct: 302 IEVAAGLKQTGHNFLWVVRETETKKLPSNYIEDICDKGLIVNWSPQLQVLAHKSIGCFMT 361

Query: 370 HCGWNSTIEALSLGVPMIAIPQWVDQTTNAKFVADVWEVGVRVKKNDKGIVTKEELEASI 429
           HCGWNST+EALSLGV +I +P + DQ TNAKF+ DVW+VGVRVK +  G V KEE+   +
Sbjct: 362 HCGWNSTLEALSLGVALIGMPAYSDQPTNAKFIEDVWKVGVRVKADQNGFVPKEEIVRCV 421

Query: 430 RKIVQ--GEKPNEIKQNSIKWKKVAKEAMDEGGSSDKNIDEFV 457
            ++++   EK  EI++N+ +  + A+EA+ +GG+SDKNIDEFV
Sbjct: 422 GEVMEDMSEKGKEIRKNARRLMEFAREALSDGGNSDKNIDEFV 451

BLAST of CmoCh05G004550 vs. Swiss-Prot
Match: U74F2_ARATH (UDP-glycosyltransferase 74F2 OS=Arabidopsis thaliana GN=UGT74F2 PE=1 SV=1)

HSP 1 Score: 355.1 bits (910), Expect = 1.1e-96
Identity = 197/456 (43.20%), Postives = 278/456 (60.96%), Query Frame = 1

Query: 11  EMKQSHVIVFPFPRHGHMNPMLQFAKRLVSKGLLLTFLTTSSASESLILDLPPSPIRHKV 70
           E K+ HV+  P+P  GH+ P  QF KRL  KGL  T   T+    S+  DL   PI    
Sbjct: 2   EHKRGHVLAVPYPTQGHITPFRQFCKRLHFKGLKTTLALTTFVFNSINPDLS-GPISIAT 61

Query: 71  ISDDPESNNI---DSLDAYLRSFRAAASKSLANFIDEALISDSNEVLPSLIVYDSVMPWV 130
           ISD  +       DS+D YL+ F+ + SK++A+ I +   SD N +  + IVYD+ +PW 
Sbjct: 62  ISDGYDHGGFETADSIDDYLKDFKTSGSKTIADIIQKHQTSD-NPI--TCIVYDAFLPWA 121

Query: 131 QSVAAERGLDAAPFFTQSAAVNHILDLVY--KGSLSIPPPEDVAVSLPSEIVLQPADLPT 190
             VA E GL A PFFTQ  AVN++  L Y   GSL +P  E     LP    L+  DLP+
Sbjct: 122 LDVAREFGLVATPFFTQPCAVNYVYYLSYINNGSLQLPIEE-----LP---FLELQDLPS 181

Query: 191 ---LPDDGDVVLEFMTSQFINLENVKWIFFNTFDRLECKVVNWMTKTLPIKTVGPTIPSA 250
              +        E +  QFIN E   ++  N+F  LE       +K  P+ T+GPTIPS 
Sbjct: 182 FFSVSGSYPAYFEMVLQQFINFEKADFVLVNSFQELELHENELWSKACPVLTIGPTIPSI 241

Query: 251 YLDGRLVDDKAYGLNVLNPNDGKKAIEWLDSKETASVIYISFGSLVNLEKEQVTELTCFL 310
           YLD R+  D  Y LN+    D    I WLD++   SV+Y++FGS+  L   Q+ EL   +
Sbjct: 242 YLDQRIKSDTGYDLNLFESKDDSFCINWLDTRPQGSVVYVAFGSMAQLTNVQMEELASAV 301

Query: 311 RNTNLSFLWVLRESELGKLPNNFVQDTS-EQGLIVNWCCQLEVLSHKAVSCFVTHCGWNS 370
             +N SFLWV+R SE  KLP+ F++  + E+ L++ W  QL+VLS+KA+ CF+THCGWNS
Sbjct: 302 --SNFSFLWVVRSSEEEKLPSGFLETVNKEKSLVLKWSPQLQVLSNKAIGCFLTHCGWNS 361

Query: 371 TIEALSLGVPMIAIPQWVDQTTNAKFVADVWEVGVRVK-KNDKGIVTKEELEASIRKIVQ 430
           T+EAL+ GVPM+A+PQW DQ  NAK++ DVW+ GVRVK + + GI  +EE+E SI+++++
Sbjct: 362 TMEALTFGVPMVAMPQWTDQPMNAKYIQDVWKAGVRVKTEKESGIAKREEIEFSIKEVME 421

Query: 431 GEKPNEIKQNSIKWKKVAKEAMDEGGSSDKNIDEFV 457
           GE+  E+K+N  KW+ +A ++++EGGS+D NID FV
Sbjct: 422 GERSKEMKKNVKKWRDLAVKSLNEGGSTDTNIDTFV 443

BLAST of CmoCh05G004550 vs. Swiss-Prot
Match: U74G1_STERE (UDP-glycosyltransferase 74G1 OS=Stevia rebaudiana GN=UGT74G1 PE=1 SV=1)

HSP 1 Score: 354.4 bits (908), Expect = 1.9e-96
Identity = 195/455 (42.86%), Postives = 288/455 (63.30%), Query Frame = 1

Query: 13  KQSHVIVFPFPRHGHMNPMLQFAKRLVSKGLLLTFLTTSSASESLI--LDLPPSPIRHKV 72
           K  HV++ PFP  GH+NP +QF KRL+SKG+  T +TT     S +   +   + I  + 
Sbjct: 9   KSPHVLLIPFPLQGHINPFIQFGKRLISKGVKTTLVTTIHTLNSTLNHSNTTTTSIEIQA 68

Query: 73  ISDDPESNNIDSL-DAYLRSFRAAASKSLANFIDEALISDSNEVLPSLIVYDSVMPWVQS 132
           ISD  +     S  ++YL +F+   SKSLA+ I + L S+   +    I+YDS+  WV  
Sbjct: 69  ISDGCDEGGFMSAGESYLETFKQVGSKSLADLIKK-LQSEGTTI--DAIIYDSMTEWVLD 128

Query: 133 VAAERGLDAAPFFTQSAAVNHILDLVYKGSLSIPPPEDVAVSLPSEIVLQPADLPTLPDD 192
           VA E G+D   FFTQ+  VN +   V+KG +S+P  E   VS+P   VLQ  + P +  +
Sbjct: 129 VAIEFGIDGGSFFTQACVVNSLYYHVHKGLISLPLGE--TVSVPGFPVLQRWETPLILQN 188

Query: 193 GDVVL----EFMTSQFINLENVKWIFFNTFDRLECKVVNWMTKTLPIKTVGPTIPSAYLD 252
            + +     + +  QF N++  +W+F N+F +LE +V+ W  K   +K +GPT+PS YLD
Sbjct: 189 HEQIQSPWSQMLFGQFANIDQARWVFTNSFYKLEEEVIEWTRKIWNLKVIGPTLPSMYLD 248

Query: 253 GRLVDDKAYGLNVLNPNDGKKAIEWLDSKETASVIYISFGSLVNLEKEQVTELTCFLRNT 312
            RL DDK  G N+   N   + + WLD K   SV+Y++FGSLV    EQV E+T  L ++
Sbjct: 249 KRLDDDKDNGFNLYKANH-HECMNWLDDKPKESVVYVAFGSLVKHGPEQVEEITRALIDS 308

Query: 313 NLSFLWVLRESELGKLPNNFVQDTSE-QGLIVNWCCQLEVLSHKAVSCFVTHCGWNSTIE 372
           +++FLWV++  E GKLP N  +     +GLIV WC QL+VL+H++V CFVTHCG+NST+E
Sbjct: 309 DVNFLWVIKHKEEGKLPENLSEVIKTGKGLIVAWCKQLDVLAHESVGCFVTHCGFNSTLE 368

Query: 373 ALSLGVPMIAIPQWVDQTTNAKFVADVWEVGVRVKKNDKGIVTKEELEASIRKIVQGEKP 432
           A+SLGVP++A+PQ+ DQTTNAK + ++  VGVRVK ++ GIV +  L + I+ I++ E+ 
Sbjct: 369 AISLGVPVVAMPQFSDQTTNAKLLDEILGVGVRVKADENGIVRRGNLASCIKMIMEEERG 428

Query: 433 NEIKQNSIKWKKVAKEAMDEGGSSDKNIDEFVQAM 460
             I++N++KWK +AK A+ EGGSSD +I EFV  +
Sbjct: 429 VIIRKNAVKWKDLAKVAVHEGGSSDNDIVEFVSEL 457

BLAST of CmoCh05G004550 vs. TrEMBL
Match: A0A0A0KD63_CUCSA (Glycosyltransferase OS=Cucumis sativus GN=Csa_6G366280 PE=3 SV=1)

HSP 1 Score: 727.6 bits (1877), Expect = 9.3e-207
Identity = 366/466 (78.54%), Postives = 413/466 (88.63%), Query Frame = 1

Query: 1   MEKTTVDGGG-EMKQSHVIVFPFPRHGHMNPMLQFAKRLVSKGLLLTFLTTSSASESLIL 60
           MEK   +GGG  +KQ+HVIVFPFPRHGHM+PMLQF+KRL+SKGLLLTFL TSSAS+SL +
Sbjct: 1   MEKAMANGGGGRIKQNHVIVFPFPRHGHMSPMLQFSKRLISKGLLLTFLVTSSASQSLTI 60

Query: 61  DLPPSPIRH-KVISDDPESNNIDSLDAYLRSFRAAASKSLANFIDEALISDS-NEVLPSL 120
           ++PPSP  H K+ISD PES+++ + DAY+RSF+AA +KSL+NFIDEALIS S  EV P+L
Sbjct: 61  NIPPSPSFHIKIISDLPESDDVATFDAYIRSFQAAVTKSLSNFIDEALISSSYEEVSPTL 120

Query: 121 IVYDSVMPWVQSVAAERGLDAAPFFTQSAAVNHILDLVYKGSLSIPPPEDVAVSLPSEIV 180
           IVYDS+MPWV SVAAERGLD+APFFT+SAAVNH+L LVY GSLSIP PE+V VSLPSEIV
Sbjct: 121 IVYDSIMPWVHSVAAERGLDSAPFFTESAAVNHLLHLVYGGSLSIPAPENVVVSLPSEIV 180

Query: 181 LQPADLPTLPDDGDVVLEFMTSQFINLENVKWIFFNTFDRLECKVVNWMTKTLPIKTVGP 240
           LQP DLP+ PDD +VVL+FM +QF +LENVKWIF NTFDRLE KVVNWM KTLPIKTVGP
Sbjct: 181 LQPGDLPSFPDDPEVVLDFMINQFSHLENVKWIFINTFDRLESKVVNWMAKTLPIKTVGP 240

Query: 241 TIPSAYLDGRLVDDKAYGLNVLNPNDGKKAIEWLDSKETASVIYISFGSLVNLEKEQVTE 300
           TIPSAYLDGRL +DKAYGLNV   N+GK  I+WLDSKETASVIYISFGSLV L +EQV E
Sbjct: 241 TIPSAYLDGRLENDKAYGLNVSKSNNGKSPIKWLDSKETASVIYISFGSLVMLSEEQVKE 300

Query: 301 LTCFLRNTNLSFLWVLRESELGKLPNNFVQDTSEQGLIVNWCCQLEVLSHKAVSCFVTHC 360
           LT  LR+T+ SFLWVLRESEL KLPNNFVQDTS+ GLIVNWCCQL+VLSHKAVSCFVTHC
Sbjct: 301 LTNLLRDTDFSFLWVLRESELVKLPNNFVQDTSDHGLIVNWCCQLQVLSHKAVSCFVTHC 360

Query: 361 GWNSTIEALSLGVPMIAIPQWVDQTTNAKFVADVWEVGVRVKKNDKGIVTKEELEASIRK 420
           GWNST+EALSLGVPM+AIPQWVDQTTNAKFVADVW VGVRVKKN+KG+  KEELEASIRK
Sbjct: 361 GWNSTLEALSLGVPMVAIPQWVDQTTNAKFVADVWRVGVRVKKNEKGVAIKEELEASIRK 420

Query: 421 I-VQGEKPNEIKQNSIKWKKVAKEAMDEGGSSDKNIDEFVQAMAAS 463
           I VQG +PNE KQNSIKWK +AKEA+DE GSSDKNI+EFVQA+AAS
Sbjct: 421 IVVQGNRPNEFKQNSIKWKNLAKEAVDERGSSDKNIEEFVQALAAS 466

BLAST of CmoCh05G004550 vs. TrEMBL
Match: K7NBW3_SIRGR (Glycosyltransferase OS=Siraitia grosvenorii GN=UDPG1 PE=2 SV=1)

HSP 1 Score: 448.7 bits (1153), Expect = 8.4e-123
Identity = 224/452 (49.56%), Postives = 320/452 (70.80%), Query Frame = 1

Query: 11  EMKQSHVIVFPFPRHGHMNPMLQFAKRLVSKGLLLTFLTTSSASESLILDLPPS-PIRHK 70
           E   +H++VFPFP  GH+NP+LQ +KRL++KG+ ++ +TT   S  L L    S  ++ +
Sbjct: 2   EKGDTHILVFPFPSQGHINPLLQLSKRLIAKGIKVSLVTTLHVSNHLQLQGAYSNSVKIE 61

Query: 71  VISDDPESN-NIDSLDAYLRSFRAAASKSLANFIDEALISDSNEVLPSLIVYDSVMPWVQ 130
           VISD  E     D++   L  FR   +K+L +F+ +A++S +    P  I+YDS MPWV 
Sbjct: 62  VISDGSEDRLETDTMRQTLDRFRQKMTKNLEDFLQKAMVSSNP---PKFILYDSTMPWVL 121

Query: 131 SVAAERGLDAAPFFTQSAAVNHILDLVYKGSLSIPPPEDVAVSLPSEIVLQPADLPTL-- 190
            VA E GLD APF+TQS A+N I   V  G L +PP E   +SLPS  +L+P+DLP    
Sbjct: 122 EVAKEFGLDRAPFYTQSCALNSINYHVLHGQLKLPP-ETPTISLPSMPLLRPSDLPAYDF 181

Query: 191 -PDDGDVVLEFMTSQFINLENVKWIFFNTFDRLECKVVNWM-TKTLPIKTVGPTIPSAYL 250
            P   D +++ +TSQ+ N+++   +F NTFD+LE +++ WM T   P+KTVGPT+PSAYL
Sbjct: 182 DPASTDTIIDLLTSQYSNIQDANLLFCNTFDKLEGEIIQWMETLGRPVKTVGPTVPSAYL 241

Query: 251 DGRLVDDKAYGLNVLNPNDGKKAIEWLDSKETASVIYISFGSLVNLEKEQVTELTCFLRN 310
           D R+ +DK YGL++  PN+    ++WLDSK + SV+Y+S+GSLV + +EQ+ EL   ++ 
Sbjct: 242 DKRVENDKHYGLSLFKPNEDV-CLKWLDSKPSGSVLYVSYGSLVEMGEEQLKELALGIKE 301

Query: 311 TNLSFLWVLRESELGKLPNNFVQDTSEQGLIVNWCCQLEVLSHKAVSCFVTHCGWNSTIE 370
           T   FLWV+R++E  KLP NFV+  +E+GL+V+WC QLEVL+H +V CF THCGWNST+E
Sbjct: 302 TGKFFLWVVRDTEAEKLPPNFVESVAEKGLVVSWCSQLEVLAHPSVGCFFTHCGWNSTLE 361

Query: 371 ALSLGVPMIAIPQWVDQTTNAKFVADVWEVGVRVKKNDKGIVTKEELEASIRKIVQGEKP 430
           AL LGVP++A PQW DQ TNAKF+ DVW+VG RVK+N++ + +KEE+ + I ++++GE+ 
Sbjct: 362 ALCLGVPVVAFPQWADQVTNAKFLEDVWKVGKRVKRNEQRLASKEEVRSCIWEVMEGERA 421

Query: 431 NEIKQNSIKWKKVAKEAMDEGGSSDKNIDEFV 457
           +E K NS++WKK AKEA+DEGGSSDKNI+EFV
Sbjct: 422 SEFKSNSMEWKKWAKEAVDEGGSSDKNIEEFV 448

BLAST of CmoCh05G004550 vs. TrEMBL
Match: A0A0D2UTG6_GOSRA (Glycosyltransferase OS=Gossypium raimondii GN=B456_011G131200 PE=3 SV=1)

HSP 1 Score: 444.9 bits (1143), Expect = 1.2e-121
Identity = 233/452 (51.55%), Postives = 311/452 (68.81%), Query Frame = 1

Query: 13  KQSHVIVFPFPRHGHMNPMLQFAKRLVSKGLLLTFLTTSSASESLILDLPPSPIRHKVIS 72
           +++ V+VFPFP  GH+NPMLQF+KRL SKGL +T ++TS   +        S I    I 
Sbjct: 6   EETDVLVFPFPIQGHINPMLQFSKRLASKGLRVTLISTSKTMQP-----SASSINFHSI- 65

Query: 73  DDPESNNIDSLDAYLRSFRAAASKSLANFIDEALISDSNEVLPSLIVYDSVMPWVQSVAA 132
           D  E +   ++D YL  + +   K LA FI+E  I   +     ++VYDS MPW   VA 
Sbjct: 66  DFHEGDAAANVDEYLELYESVVPKRLAQFIEEYQICSQHGA--KVLVYDSGMPWALDVAK 125

Query: 133 ERGLDAAPFFTQSAAVNHILDLVYKGSLSIPPPED----VAVSLPSEIVLQPADLPTLPD 192
           + GL  A FFTQ  AVN I   + +GSL +P  ++    V VSLPS   L  +DLP+   
Sbjct: 126 QFGLQGASFFTQCWAVNAIFIHLKEGSLRVPLEDENKGNVVVSLPSMPELGMSDLPSFVS 185

Query: 193 DGD----VVLEFMTSQFINLENVKWIFFNTFDRLECKVVNWMTKTLPIKTVGPTIPSAYL 252
           D       + + + SQF N +   W+F NT+D+LE +++ WM    PIKTVGPTIPS YL
Sbjct: 186 DKSGSYPCLSKLVRSQFSNFQEADWVFCNTYDKLEHEIIKWMRSKWPIKTVGPTIPSMYL 245

Query: 253 DGRLVDDKAYGLNVLNPNDGKKAIEWLDSKETASVIYISFGSLVNLEKEQVTELTCFLRN 312
           D RL DD  YGL++  P D    + WL+SKE +SV+Y+SFGS+ +L +EQ+ EL   L+ 
Sbjct: 246 DKRLEDDNDYGLHLFKP-DTDLCLNWLNSKEASSVVYVSFGSIADLTEEQMVELAMGLKL 305

Query: 313 TNLSFLWVLRESELGKLPNNFVQDTSEQGLIVNWCCQLEVLSHKAVSCFVTHCGWNSTIE 372
           TN +FLWV+RE E  KLP+NF+++T+E+GL+V+W  QL+VL+H+AV CF+THCGWNST+E
Sbjct: 306 TNKNFLWVVREMEQNKLPSNFMEETAEKGLVVSWSPQLDVLAHRAVGCFMTHCGWNSTLE 365

Query: 373 ALSLGVPMIAIPQWVDQTTNAKFVADVWEVGVRVKKNDKGIVTKEELEASIRKIVQGEKP 432
           ALSLGVPMIA+PQW DQ TNAKFVADVWEVG+RVKK++KGI+ KEE+E   R+I++GEK 
Sbjct: 366 ALSLGVPMIAMPQWTDQPTNAKFVADVWEVGIRVKKDEKGIMRKEEIERCAREIMEGEKS 425

Query: 433 NEIKQNSIKWKKVAKEAMDEGGSSDKNIDEFV 457
            +IK+NS KWK +AK+A+DEGGSSDKNI EFV
Sbjct: 426 LDIKRNSEKWKNLAKDAVDEGGSSDKNIQEFV 448

BLAST of CmoCh05G004550 vs. TrEMBL
Match: A0A061EH11_THECC (Uridine diphosphate glycosyltransferase 74E2, putative isoform 1 OS=Theobroma cacao GN=TCM_019571 PE=4 SV=1)

HSP 1 Score: 440.7 bits (1132), Expect = 2.3e-120
Identity = 228/434 (52.53%), Postives = 303/434 (69.82%), Query Frame = 1

Query: 26  GHMNPMLQFAKRLVSKGLLLTFLTTSSASESLILDLPPSPIRHKVISDDPESNNIDSLDA 85
           GH+NPMLQFAKRL SKGL +TFLTT        +  P + I  ++I + PE    +  + 
Sbjct: 216 GHINPMLQFAKRLASKGLKVTFLTTKP------MQSPSTSISIQII-EFPEGEQANGTEE 275

Query: 86  YLRSFRAAASKSLANFIDEALISDSNEVLPSLIVYDSVMPWVQSVAAERGLDAAPFFTQS 145
           +   F+   S+ L N ID    S S+   P  +VYDS +PW   VA + GL  A FFTQS
Sbjct: 276 FAHLFKTLVSERLTNLIDRLNSSSSDP--PKALVYDSFLPWALDVAKQCGLHGASFFTQS 335

Query: 146 AAVNHILDLVYKGSLSIPPPEDVAVSLPSEIVLQPADLPTLPDDGDV---VLEFMTSQFI 205
            + + I   + +G+L +P  E+  VSLPS  VL   DLP+   D      +L+ +  +F 
Sbjct: 336 WSNSSIYYHLNQGTLKVPLEENAVVSLPSMPVLGINDLPSFVSDTGSYPSLLKMVVDRFS 395

Query: 206 NLENVKWIFFNTFDRLECKVVNWMTKTLPIKTVGPTIPSAYLDGRLVDDKAYGLNVLNPN 265
           N +   W+F NTF  LE +V+N M    PIKTVGPTIPS YLD R+ DD  YGL++  P 
Sbjct: 396 NFQEADWLFCNTFKELEHEVINCMASKWPIKTVGPTIPSMYLDKRIKDDNDYGLHLFKP- 455

Query: 266 DGKKAIEWLDSKETASVIYISFGSLVNLEKEQVTELTCFLRNTNLSFLWVLRESELGKLP 325
           D +  I+WLDSKET SV+Y+SFGSL  L +EQ+ EL+  L+ +N  FLWV+RE+E  K+P
Sbjct: 456 DSELCIKWLDSKETDSVVYVSFGSLAGLTEEQMLELSLGLKRSNRYFLWVVREAEQSKIP 515

Query: 326 NNFVQDTSEQGLIVNWCCQLEVLSHKAVSCFVTHCGWNSTIEALSLGVPMIAIPQWVDQT 385
           +NF+++TSE+GL+V+WC QL+VL+H+AV CF+THCGWNST+EALSLGVPMIA+PQW DQ 
Sbjct: 516 SNFIEETSEKGLVVSWCPQLDVLAHRAVGCFMTHCGWNSTLEALSLGVPMIAMPQWTDQP 575

Query: 386 TNAKFVADVWEVGVRVKKNDKGIVTKEELEASIRKIVQGEKPNEIKQNSIKWKKVAKEAM 445
           TNAKFVADVW+ G+RV K++KG+VTKEE+E  IR+I++GE+  EI++NS KWK +AKEA+
Sbjct: 576 TNAKFVADVWQGGIRVSKDEKGVVTKEEVEWCIREIMEGERSLEIRKNSEKWKNLAKEAV 635

Query: 446 DEGGSSDKNIDEFV 457
           DEGGSSDKNI+EFV
Sbjct: 636 DEGGSSDKNIEEFV 639

BLAST of CmoCh05G004550 vs. TrEMBL
Match: B9N960_POPTR (Glycosyltransferase OS=Populus trichocarpa GN=POPTR_0001s39980g PE=3 SV=1)

HSP 1 Score: 439.5 bits (1129), Expect = 5.1e-120
Identity = 225/452 (49.78%), Postives = 309/452 (68.36%), Query Frame = 1

Query: 15  SHVIVFPFPRHGHMNPMLQFAKRLVSKGLLLTFLTTSSASESLILDLPPSPIRHKVISDD 74
           SHV+V P P  GH+NPMLQF+KRL SKGL +T +T +S   S+  D   S     +    
Sbjct: 11  SHVLVLPLPIQGHINPMLQFSKRLASKGLRVTLITPTSMGTSMHQDNACSINMEPIFDGY 70

Query: 75  PESNNIDSLDAYLRSFRAAASKSLANFIDEALISDSNEVLPSLIVYDSVMPWVQSVAAER 134
            E     + + Y+  F+A   +SLA  ID+   + +++     I+YDS++PWV  VA   
Sbjct: 71  KEGERAATAEEYIERFKATIPQSLAELIDK---NSTSQYPAKFIIYDSILPWVLDVAKSW 130

Query: 135 GLDAAPFFTQSAAVNHILDLVYKGS-LSIPPPEDVAVSLPSEIVLQPADLPTL---PDDG 194
           G++  PFFTQS AV  +     +GS L IP  E   VSLPS   L+ +DLP+L   P   
Sbjct: 131 GIEGGPFFTQSCAVTVLYYHTLQGSALKIPMEEKSPVSLPSLPQLEFSDLPSLVHGPGSY 190

Query: 195 DVVLEFMTSQFINLENVKWIFFNTFDRLECKVVNWMTKTLPIKTVGPTIPSAYLDGRLVD 254
             + + + SQF N++   W+ +NTF+ LE ++V+WM    PIK +GPTIPS +LD RL D
Sbjct: 191 PGIYDLLFSQFSNIDEASWLLWNTFNELEDEIVDWMASKWPIKPIGPTIPSMFLDKRLED 250

Query: 255 DKAYGLNVLNPNDGKKAIEWLDSKETASVIYISFGSLVNLEKEQVTELTCFLRNTNLSFL 314
           DK YGL++  PN  +  ++WLDSKE  SV+Y+SFGSL  L ++Q+ EL   L+ +N  FL
Sbjct: 251 DKDYGLSLFKPN-SETCMKWLDSKEPGSVVYVSFGSLAVLTEDQMAELAWGLKRSNTHFL 310

Query: 315 WVLRESELGKLPNNFVQDTSEQGLIVNWCCQLEVLSHKAVSCFVTHCGWNSTIEALSLGV 374
           WV+RESE  K+P NFV++T+E GLI+ W  QL+VL+HK+V CF+THCGWNST+EALSLGV
Sbjct: 311 WVVRESEKQKVPGNFVEETTEMGLIITWSPQLKVLAHKSVGCFMTHCGWNSTLEALSLGV 370

Query: 375 PMIAIPQWVDQTTNAKFVADVWEVGVRVKKNDKGIVTKEELEASIRKI-VQGEKPNEIKQ 434
           PM+A+PQW DQ +NAKFVADVW+ GVRVK  + G+VT+EE+E  IR++ ++GE+ +EI+ 
Sbjct: 371 PMVAMPQWTDQPSNAKFVADVWQAGVRVKVGENGMVTQEEIERCIREVMMEGERRDEIRT 430

Query: 435 NSIKWKKVAKEAMDEGGSSDKNIDEFVQAMAA 462
           +S KWKK+A+ AMDEGGSSDKNIDEFV ++ A
Sbjct: 431 HSEKWKKLARMAMDEGGSSDKNIDEFVASLNA 458

BLAST of CmoCh05G004550 vs. TAIR10
Match: AT1G05680.1 (AT1G05680.1 Uridine diphosphate glycosyltransferase 74E2)

HSP 1 Score: 386.7 bits (992), Expect = 2.0e-107
Identity = 205/455 (45.05%), Postives = 282/455 (61.98%), Query Frame = 1

Query: 15  SHVIVFPFPRHGHMNPMLQFAKRLVSKGLLLTFLTTSSASESLILDLPPSPIRHKVISDD 74
           SH+IV PFP  GH+ PM QF KRL SKGL LT +  S          PP    H  I+  
Sbjct: 5   SHLIVLPFPGQGHITPMSQFCKRLASKGLKLTLVLVSDKPS------PPYKTEHDSITVF 64

Query: 75  PESNN-------IDSLDAYLRSFRAAASKSLANFIDEALISDSNEVLPSLIVYDSVMPWV 134
           P SN        +  LD Y+     +   +L   +++  +S +    P  IVYDS MPW+
Sbjct: 65  PISNGFQEGEEPLQDLDDYMERVETSIKNTLPKLVEDMKLSGNP---PRAIVYDSTMPWL 124

Query: 135 QSVAAERGLDAAPFFTQSAAVNHILDLVYKGSLSIPPPE---DVAVSLPSEIVLQPADLP 194
             VA   GL  A FFTQ   V  I   V+KGS S+P  +       S PS  +L   DLP
Sbjct: 125 LDVAHSYGLSGAVFFTQPWLVTAIYYHVFKGSFSVPSTKYGHSTLASFPSFPMLTANDLP 184

Query: 195 TLPDDGDV---VLEFMTSQFINLENVKWIFFNTFDRLECKVVNWMTKTLPIKTVGPTIPS 254
           +   +      +L  +  Q  N++ V  +  NTFD+LE K++ W+    P+  +GPT+PS
Sbjct: 185 SFLCESSSYPNILRIVVDQLSNIDRVDIVLCNTFDKLEEKLLKWVQSLWPVLNIGPTVPS 244

Query: 255 AYLDGRLVDDKAYGLNVLNPNDGKKAIEWLDSKETASVIYISFGSLVNLEKEQVTELTCF 314
            YLD RL +DK YG ++ N    +  +EWL+SKE  SV+Y+SFGSLV L+++Q+ EL   
Sbjct: 245 MYLDKRLSEDKNYGFSLFNAKVAE-CMEWLNSKEPNSVVYLSFGSLVILKEDQMLELAAG 304

Query: 315 LRNTNLSFLWVLRESELGKLPNNFVQDTSEQGLIVNWCCQLEVLSHKAVSCFVTHCGWNS 374
           L+ +   FLWV+RE+E  KLP N+V++  E+GLIV+W  QL+VL+HK++ CF+THCGWNS
Sbjct: 305 LKQSGRFFLWVVRETETHKLPRNYVEEIGEKGLIVSWSPQLDVLAHKSIGCFLTHCGWNS 364

Query: 375 TIEALSLGVPMIAIPQWVDQTTNAKFVADVWEVGVRVKKNDKGIVTKEELEASIRKIVQG 434
           T+E LSLGVPMI +P W DQ TNAKF+ DVW+VGVRVK    G V +EE+  S+ ++++G
Sbjct: 365 TLEGLSLGVPMIGMPHWTDQPTNAKFMQDVWKVGVRVKAEGDGFVRREEIMRSVEEVMEG 424

Query: 435 EKPNEIKQNSIKWKKVAKEAMDEGGSSDKNIDEFV 457
           EK  EI++N+ KWK +A+EA+ EGGSSDK+I+EFV
Sbjct: 425 EKGKEIRKNAEKWKVLAQEAVSEGGSSDKSINEFV 449

BLAST of CmoCh05G004550 vs. TAIR10
Match: AT1G05675.1 (AT1G05675.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 382.5 bits (981), Expect = 3.7e-106
Identity = 201/455 (44.18%), Postives = 279/455 (61.32%), Query Frame = 1

Query: 15  SHVIVFPFPRHGHMNPMLQFAKRLVSKGLLLTFLTTSSASESLILDLPPSPIRHKVISDD 74
           SHVIV PFP  GH+ PM QF KRL SK L +T +  S          PP    H  I+  
Sbjct: 5   SHVIVLPFPAQGHITPMSQFCKRLASKSLKITLVLVSDKPS------PPYKTEHDTITVV 64

Query: 75  PESNNI-------DSLDAYLRSFRAAASKSLANFIDEALISDSNEVLPSLIVYDSVMPWV 134
           P SN         + LD Y+    ++    L   I++  +S +    P  +VYDS MPW+
Sbjct: 65  PISNGFQEGQERSEDLDEYMERVESSIKNRLPKLIEDMKLSGNP---PRALVYDSTMPWL 124

Query: 135 QSVAAERGLDAAPFFTQSAAVNHILDLVYKGSLSIPPPE---DVAVSLPSEIVLQPADLP 194
             VA   GL  A FFTQ   V+ I   V+KGS S+P  +       S PS  +L   DLP
Sbjct: 125 LDVAHSYGLSGAVFFTQPWLVSAIYYHVFKGSFSVPSTKYGHSTLASFPSLPILNANDLP 184

Query: 195 TLPDDGD---VVLEFMTSQFINLENVKWIFFNTFDRLECKVVNWMTKTLPIKTVGPTIPS 254
           +   +      +L  +  Q  N++ V  +  NTFD+LE K++ W+    P+  +GPT+PS
Sbjct: 185 SFLCESSSYPYILRTVIDQLSNIDRVDIVLCNTFDKLEEKLLKWIKSVWPVLNIGPTVPS 244

Query: 255 AYLDGRLVDDKAYGLNVLNPNDGKKAIEWLDSKETASVIYISFGSLVNLEKEQVTELTCF 314
            YLD RL +DK YG ++      +  +EWL+SK+ +SV+Y+SFGSLV L+K+Q+ EL   
Sbjct: 245 MYLDKRLAEDKNYGFSLFGAKIAE-CMEWLNSKQPSSVVYVSFGSLVVLKKDQLIELAAG 304

Query: 315 LRNTNLSFLWVLRESELGKLPNNFVQDTSEQGLIVNWCCQLEVLSHKAVSCFVTHCGWNS 374
           L+ +   FLWV+RE+E  KLP N++++  E+GL V+W  QLEVL+HK++ CFVTHCGWNS
Sbjct: 305 LKQSGHFFLWVVRETERRKLPENYIEEIGEKGLTVSWSPQLEVLTHKSIGCFVTHCGWNS 364

Query: 375 TIEALSLGVPMIAIPQWVDQTTNAKFVADVWEVGVRVKKNDKGIVTKEELEASIRKIVQG 434
           T+E LSLGVPMI +P W DQ TNAKF+ DVW+VGVRVK +  G V +EE    + ++++ 
Sbjct: 365 TLEGLSLGVPMIGMPHWADQPTNAKFMEDVWKVGVRVKADSDGFVRREEFVRRVEEVMEA 424

Query: 435 EKPNEIKQNSIKWKKVAKEAMDEGGSSDKNIDEFV 457
           E+  EI++N+ KWK +A+EA+ EGGSSDKNI+EFV
Sbjct: 425 EQGKEIRKNAEKWKVLAQEAVSEGGSSDKNINEFV 449

BLAST of CmoCh05G004550 vs. TAIR10
Match: AT2G31750.1 (AT2G31750.1 UDP-glucosyl transferase 74D1)

HSP 1 Score: 366.7 bits (940), Expect = 2.1e-101
Identity = 200/463 (43.20%), Postives = 293/463 (63.28%), Query Frame = 1

Query: 10  GEMKQSHVIVFPFPRHGHMNPMLQFAKRLVSKGLLLTFLTTSSASESLI----------L 69
           GE  +++V+VF FP  GH+NP+LQF+KRL+SK + +TFLTTSS   S++          L
Sbjct: 2   GEKAKANVLVFSFPIQGHINPLLQFSKRLLSKNVNVTFLTTSSTHNSILRRAITGGATAL 61

Query: 70  DLPPSPIRHKVISDDPESNNIDSLDAYLRSFRAAASKSLANFIDEALISDSNEVLPSLIV 129
            L   PI      D P +   D+   Y   F+   S+SL+  I       S +  P+ +V
Sbjct: 62  PLSFVPIDDGFEEDHPST---DTSPDYFAKFQENVSRSLSELIS------SMDPKPNAVV 121

Query: 130 YDSVMPWVQSVAAER-GLDAAPFFTQSAAVNHILDLVYKGSLSIPPPEDVAVSLPSEIVL 189
           YDS +P+V  V  +  G+ AA FFTQS+ VN       +G       + V  ++P    L
Sbjct: 122 YDSCLPYVLDVCRKHPGVAAASFFTQSSTVNATYIHFLRGEFKEFQNDVVLPAMPP---L 181

Query: 190 QPADLPTLPDDGDV---VLEFMTSQFINLENVKWIFFNTFDRLECKVVNWMTKTLPIKTV 249
           +  DLP    D ++   + E ++SQF+N++++ +   N+FD LE +V+ WM    P+K +
Sbjct: 182 KGNDLPVFLYDNNLCRPLFELISSQFVNVDDIDFFLVNSFDELEVEVLQWMKNQWPVKNI 241

Query: 250 GPTIPSAYLDGRLVDDKAYGLNVLNPNDGKKAIEWLDSKETASVIYISFGSLVNLEKEQV 309
           GP IPS YLD RL  DK YG+N+ N    +  ++WLDSK   SVIY+SFGSL  L+ +Q+
Sbjct: 242 GPMIPSMYLDKRLAGDKDYGINLFNAQVNE-CLDWLDSKPPGSVIYVSFGSLAVLKDDQM 301

Query: 310 TELTCFLRNTNLSFLWVLRESELGKLPNNFVQDTSEQGLIVNWCCQLEVLSHKAVSCFVT 369
            E+   L+ T  +FLWV+RE+E  KLP+N+++D  ++GLIVNW  QL+VL+HK++ CF+T
Sbjct: 302 IEVAAGLKQTGHNFLWVVRETETKKLPSNYIEDICDKGLIVNWSPQLQVLAHKSIGCFMT 361

Query: 370 HCGWNSTIEALSLGVPMIAIPQWVDQTTNAKFVADVWEVGVRVKKNDKGIVTKEELEASI 429
           HCGWNST+EALSLGV +I +P + DQ TNAKF+ DVW+VGVRVK +  G V KEE+   +
Sbjct: 362 HCGWNSTLEALSLGVALIGMPAYSDQPTNAKFIEDVWKVGVRVKADQNGFVPKEEIVRCV 421

Query: 430 RKIVQ--GEKPNEIKQNSIKWKKVAKEAMDEGGSSDKNIDEFV 457
            ++++   EK  EI++N+ +  + A+EA+ +GG+SDKNIDEFV
Sbjct: 422 GEVMEDMSEKGKEIRKNARRLMEFAREALSDGGNSDKNIDEFV 451

BLAST of CmoCh05G004550 vs. TAIR10
Match: AT2G43820.1 (AT2G43820.1 UDP-glucosyltransferase 74F2)

HSP 1 Score: 355.1 bits (910), Expect = 6.4e-98
Identity = 197/456 (43.20%), Postives = 278/456 (60.96%), Query Frame = 1

Query: 11  EMKQSHVIVFPFPRHGHMNPMLQFAKRLVSKGLLLTFLTTSSASESLILDLPPSPIRHKV 70
           E K+ HV+  P+P  GH+ P  QF KRL  KGL  T   T+    S+  DL   PI    
Sbjct: 2   EHKRGHVLAVPYPTQGHITPFRQFCKRLHFKGLKTTLALTTFVFNSINPDLS-GPISIAT 61

Query: 71  ISDDPESNNI---DSLDAYLRSFRAAASKSLANFIDEALISDSNEVLPSLIVYDSVMPWV 130
           ISD  +       DS+D YL+ F+ + SK++A+ I +   SD N +  + IVYD+ +PW 
Sbjct: 62  ISDGYDHGGFETADSIDDYLKDFKTSGSKTIADIIQKHQTSD-NPI--TCIVYDAFLPWA 121

Query: 131 QSVAAERGLDAAPFFTQSAAVNHILDLVY--KGSLSIPPPEDVAVSLPSEIVLQPADLPT 190
             VA E GL A PFFTQ  AVN++  L Y   GSL +P  E     LP    L+  DLP+
Sbjct: 122 LDVAREFGLVATPFFTQPCAVNYVYYLSYINNGSLQLPIEE-----LP---FLELQDLPS 181

Query: 191 ---LPDDGDVVLEFMTSQFINLENVKWIFFNTFDRLECKVVNWMTKTLPIKTVGPTIPSA 250
              +        E +  QFIN E   ++  N+F  LE       +K  P+ T+GPTIPS 
Sbjct: 182 FFSVSGSYPAYFEMVLQQFINFEKADFVLVNSFQELELHENELWSKACPVLTIGPTIPSI 241

Query: 251 YLDGRLVDDKAYGLNVLNPNDGKKAIEWLDSKETASVIYISFGSLVNLEKEQVTELTCFL 310
           YLD R+  D  Y LN+    D    I WLD++   SV+Y++FGS+  L   Q+ EL   +
Sbjct: 242 YLDQRIKSDTGYDLNLFESKDDSFCINWLDTRPQGSVVYVAFGSMAQLTNVQMEELASAV 301

Query: 311 RNTNLSFLWVLRESELGKLPNNFVQDTS-EQGLIVNWCCQLEVLSHKAVSCFVTHCGWNS 370
             +N SFLWV+R SE  KLP+ F++  + E+ L++ W  QL+VLS+KA+ CF+THCGWNS
Sbjct: 302 --SNFSFLWVVRSSEEEKLPSGFLETVNKEKSLVLKWSPQLQVLSNKAIGCFLTHCGWNS 361

Query: 371 TIEALSLGVPMIAIPQWVDQTTNAKFVADVWEVGVRVK-KNDKGIVTKEELEASIRKIVQ 430
           T+EAL+ GVPM+A+PQW DQ  NAK++ DVW+ GVRVK + + GI  +EE+E SI+++++
Sbjct: 362 TMEALTFGVPMVAMPQWTDQPMNAKYIQDVWKAGVRVKTEKESGIAKREEIEFSIKEVME 421

Query: 431 GEKPNEIKQNSIKWKKVAKEAMDEGGSSDKNIDEFV 457
           GE+  E+K+N  KW+ +A ++++EGGS+D NID FV
Sbjct: 422 GERSKEMKKNVKKWRDLAVKSLNEGGSTDTNIDTFV 443

BLAST of CmoCh05G004550 vs. TAIR10
Match: AT2G43840.2 (AT2G43840.2 UDP-glycosyltransferase 74 F1)

HSP 1 Score: 352.4 bits (903), Expect = 4.1e-97
Identity = 194/456 (42.54%), Postives = 285/456 (62.50%), Query Frame = 1

Query: 11  EMKQSHVIVFPFPRHGHMNPMLQFAKRLVSKGLLLTFLTTSSASESLILDLPPSPIRHKV 70
           E  + HV+  PFP  GH+ P+ QF KRL SKG   T   T+    ++ LD P SPI    
Sbjct: 2   EKMRGHVLAVPFPSQGHITPIRQFCKRLHSKGFKTTHTLTTFIFNTIHLD-PSSPISIAT 61

Query: 71  ISDDPESNNIDSLDA---YLRSFRAAASKSLANFIDEALISDSNEVLPSLIVYDSVMPWV 130
           ISD  +     S  +   YL++F+   SK++A+ I +   +D N +  + IVYDS MPW 
Sbjct: 62  ISDGYDQGGFSSAGSVPEYLQNFKTFGSKTVADIIRKHQSTD-NPI--TCIVYDSFMPWA 121

Query: 131 QSVAAERGLDAAPFFTQSAAVNHI--LDLVYKGSLSIPPPEDVAVSLPSEIVLQPADLPT 190
             +A + GL AAPFFTQS AVN+I  L  +  GSL++P  +     LP   +L+  DLPT
Sbjct: 122 LDLAMDFGLAAAPFFTQSCAVNYINYLSYINNGSLTLPIKD-----LP---LLELQDLPT 181

Query: 191 L--PDDGDVV-LEFMTSQFINLENVKWIFFNTFDRLECKVVNWMTKTLPIKTVGPTIPSA 250
              P    +   E +  QF N +   ++  N+F  L+  V   ++K  P+ T+GPT+PS 
Sbjct: 182 FVTPTGSHLAYFEMVLQQFTNFDKADFVLVNSFHDLDLHVKELLSKVCPVLTIGPTVPSM 241

Query: 251 YLDGRLVDDKAYGLNVLNPNDGKKAIEWLDSKETASVIYISFGSLVNLEKEQVTELTCFL 310
           YLD ++  D  Y LN+ +  +     +WLD +   SV+YI+FGS+  L  EQ+ E+   +
Sbjct: 242 YLDQQIKSDNDYDLNLFDLKEAALCTDWLDKRPEGSVVYIAFGSMAKLSSEQMEEIASAI 301

Query: 311 RNTNLSFLWVLRESELGKLPNNFVQDTS-EQGLIVNWCCQLEVLSHKAVSCFVTHCGWNS 370
             +N S+LWV+R SE  KLP  F++    ++ L++ W  QL+VLS+KA+ CF+THCGWNS
Sbjct: 302 --SNFSYLWVVRASEESKLPPGFLETVDKDKSLVLKWSPQLQVLSNKAIGCFMTHCGWNS 361

Query: 371 TIEALSLGVPMIAIPQWVDQTTNAKFVADVWEVGVRVK-KNDKGIVTKEELEASIRKIVQ 430
           T+E LSLGVPM+A+PQW DQ  NAK++ DVW+VGVRVK + + GI  +EE+E SI+++++
Sbjct: 362 TMEGLSLGVPMVAMPQWTDQPMNAKYIQDVWKVGVRVKAEKESGICKREEIEFSIKEVME 421

Query: 431 GEKPNEIKQNSIKWKKVAKEAMDEGGSSDKNIDEFV 457
           GEK  E+K+N+ KW+ +A +++ EGGS+D NI+EFV
Sbjct: 422 GEKSKEMKENAGKWRDLAVKSLSEGGSTDININEFV 443

BLAST of CmoCh05G004550 vs. NCBI nr
Match: gi|449452887|ref|XP_004144190.1| (PREDICTED: UDP-glycosyltransferase 74E2-like [Cucumis sativus])

HSP 1 Score: 727.6 bits (1877), Expect = 1.3e-206
Identity = 366/466 (78.54%), Postives = 413/466 (88.63%), Query Frame = 1

Query: 1   MEKTTVDGGG-EMKQSHVIVFPFPRHGHMNPMLQFAKRLVSKGLLLTFLTTSSASESLIL 60
           MEK   +GGG  +KQ+HVIVFPFPRHGHM+PMLQF+KRL+SKGLLLTFL TSSAS+SL +
Sbjct: 1   MEKAMANGGGGRIKQNHVIVFPFPRHGHMSPMLQFSKRLISKGLLLTFLVTSSASQSLTI 60

Query: 61  DLPPSPIRH-KVISDDPESNNIDSLDAYLRSFRAAASKSLANFIDEALISDS-NEVLPSL 120
           ++PPSP  H K+ISD PES+++ + DAY+RSF+AA +KSL+NFIDEALIS S  EV P+L
Sbjct: 61  NIPPSPSFHIKIISDLPESDDVATFDAYIRSFQAAVTKSLSNFIDEALISSSYEEVSPTL 120

Query: 121 IVYDSVMPWVQSVAAERGLDAAPFFTQSAAVNHILDLVYKGSLSIPPPEDVAVSLPSEIV 180
           IVYDS+MPWV SVAAERGLD+APFFT+SAAVNH+L LVY GSLSIP PE+V VSLPSEIV
Sbjct: 121 IVYDSIMPWVHSVAAERGLDSAPFFTESAAVNHLLHLVYGGSLSIPAPENVVVSLPSEIV 180

Query: 181 LQPADLPTLPDDGDVVLEFMTSQFINLENVKWIFFNTFDRLECKVVNWMTKTLPIKTVGP 240
           LQP DLP+ PDD +VVL+FM +QF +LENVKWIF NTFDRLE KVVNWM KTLPIKTVGP
Sbjct: 181 LQPGDLPSFPDDPEVVLDFMINQFSHLENVKWIFINTFDRLESKVVNWMAKTLPIKTVGP 240

Query: 241 TIPSAYLDGRLVDDKAYGLNVLNPNDGKKAIEWLDSKETASVIYISFGSLVNLEKEQVTE 300
           TIPSAYLDGRL +DKAYGLNV   N+GK  I+WLDSKETASVIYISFGSLV L +EQV E
Sbjct: 241 TIPSAYLDGRLENDKAYGLNVSKSNNGKSPIKWLDSKETASVIYISFGSLVMLSEEQVKE 300

Query: 301 LTCFLRNTNLSFLWVLRESELGKLPNNFVQDTSEQGLIVNWCCQLEVLSHKAVSCFVTHC 360
           LT  LR+T+ SFLWVLRESEL KLPNNFVQDTS+ GLIVNWCCQL+VLSHKAVSCFVTHC
Sbjct: 301 LTNLLRDTDFSFLWVLRESELVKLPNNFVQDTSDHGLIVNWCCQLQVLSHKAVSCFVTHC 360

Query: 361 GWNSTIEALSLGVPMIAIPQWVDQTTNAKFVADVWEVGVRVKKNDKGIVTKEELEASIRK 420
           GWNST+EALSLGVPM+AIPQWVDQTTNAKFVADVW VGVRVKKN+KG+  KEELEASIRK
Sbjct: 361 GWNSTLEALSLGVPMVAIPQWVDQTTNAKFVADVWRVGVRVKKNEKGVAIKEELEASIRK 420

Query: 421 I-VQGEKPNEIKQNSIKWKKVAKEAMDEGGSSDKNIDEFVQAMAAS 463
           I VQG +PNE KQNSIKWK +AKEA+DE GSSDKNI+EFVQA+AAS
Sbjct: 421 IVVQGNRPNEFKQNSIKWKNLAKEAVDERGSSDKNIEEFVQALAAS 466

BLAST of CmoCh05G004550 vs. NCBI nr
Match: gi|659089390|ref|XP_008445481.1| (PREDICTED: UDP-glycosyltransferase 74E2-like [Cucumis melo])

HSP 1 Score: 724.9 bits (1870), Expect = 8.7e-206
Identity = 367/466 (78.76%), Postives = 412/466 (88.41%), Query Frame = 1

Query: 1   MEKTTVDGGGE-MKQSHVIVFPFPRHGHMNPMLQFAKRLVSKGLLLTFLTTSSASESLIL 60
           ME T  +GGGE +KQSHVIVFPFPRHGHM+PMLQF+KRL+SKGLLLTFL TSSAS+SL +
Sbjct: 1   MEMTAANGGGERIKQSHVIVFPFPRHGHMSPMLQFSKRLISKGLLLTFLITSSASQSLTI 60

Query: 61  DLPPSPIRH-KVISDDPESNNIDSLDAYLRSFRAAASKSLANFIDEALISDSNE-VLPSL 120
           ++PPSP  H K+ISD PES+++ +LDAYLRSFRAA +KSL+NFIDE L S SNE V P+L
Sbjct: 61  NIPPSPSFHFKIISDLPESDDVATLDAYLRSFRAAVTKSLSNFIDEVLTSSSNEEVPPTL 120

Query: 121 IVYDSVMPWVQSVAAERGLDAAPFFTQSAAVNHILDLVYKGSLSIPPPEDVAVSLPSEIV 180
           IVYDSVMPWVQSVAAERGLD+APFFT+SAAVNH+L LVY GSLSIPPP++V VSLPSEIV
Sbjct: 121 IVYDSVMPWVQSVAAERGLDSAPFFTESAAVNHLLHLVYGGSLSIPPPDNVVVSLPSEIV 180

Query: 181 LQPADLPTLPDDGDVVLEFMTSQFINLENVKWIFFNTFDRLECKVVNWMTKTLPIKTVGP 240
           LQP DLP+ PDD +VVL+FMTSQF +LENVKWIF NTFDRLE KVVNWM KTLPIKTVGP
Sbjct: 181 LQPEDLPSFPDDPEVVLDFMTSQFSHLENVKWIFINTFDRLESKVVNWMAKTLPIKTVGP 240

Query: 241 TIPSAYLDGRLVDDKAYGLNVLNPNDGKKAIEWLDSKETASVIYISFGSLVNLEKEQVTE 300
           TIPSAYLDGRL  DKAYGLNV   N+GK  I+WLDSKETASVIYISFGSLV L +EQV E
Sbjct: 241 TIPSAYLDGRLEKDKAYGLNVSKSNNGKCPIKWLDSKETASVIYISFGSLVILSEEQVKE 300

Query: 301 LTCFLRNTNLSFLWVLRESELGKLPNNFVQDTSEQGLIVNWCCQLEVLSHKAVSCFVTHC 360
           LT  LR+T+ SFLWVLRESE+ KLP NFVQDTS++GLIVNWCCQL+VLSHKAVSCFVTHC
Sbjct: 301 LTNLLRDTDFSFLWVLRESEMVKLPKNFVQDTSDRGLIVNWCCQLQVLSHKAVSCFVTHC 360

Query: 361 GWNSTIEALSLGVPMIAIPQWVDQTTNAKFVADVWEVGVRVKKNDKGIVTKEELEASIRK 420
           GWNST+EALSLGVPM+AIPQW+DQTTNAKFVADVW VGVRVKKN+K +  KEELEASIRK
Sbjct: 361 GWNSTLEALSLGVPMVAIPQWIDQTTNAKFVADVWRVGVRVKKNEKSVAIKEELEASIRK 420

Query: 421 I-VQGEKPNEIKQNSIKWKKVAKEAMDEGGSSDKNIDEFVQAMAAS 463
           I VQG   NE KQN+IKWK +AKEA+DE GSSDKNI+EFVQA+ AS
Sbjct: 421 IVVQGNGTNEFKQNAIKWKNLAKEAVDERGSSDKNIEEFVQALVAS 466

BLAST of CmoCh05G004550 vs. NCBI nr
Match: gi|778715379|ref|XP_004144267.2| (PREDICTED: UDP-glycosyltransferase 74F1-like [Cucumis sativus])

HSP 1 Score: 536.2 bits (1380), Expect = 5.7e-149
Identity = 287/477 (60.17%), Postives = 345/477 (72.33%), Query Frame = 1

Query: 1   MEKTTVDGGGEMKQSHVI--VFPFPRHGHMNPMLQFAKRLVSKGLLLTFLTTSSASESLI 60
           ME TT +GG ++  SHV+  VF +P+HGHM+PMLQFAKRL SKGL +TFLTTSS +++L 
Sbjct: 13  MENTTENGGRKLS-SHVVLVVFAYPKHGHMSPMLQFAKRLASKGLRVTFLTTSSVNQTLQ 72

Query: 61  LDLPPSPIRHKVISDDPESNNIDSLDAYLRSFRAAASKSLANFIDEALISDSNEVLPS-- 120
           ++L PS         D  +  I SL     SF A  SKS  +F+D  L +  N    S  
Sbjct: 73  INLIPSYQIDLQFISDVRTEAILSLKDKHESFEAVVSKSFGDFLDGVLRTADNSDYDSTP 132

Query: 121 ---LIVYDSVMPWVQSVAAERGLDAAPFFTQSAAVNHILDLVYKGSLSIP--PPEDVAVS 180
               +V+DSVMPW   VAAERG+D+APFFT+S AVN IL+ VY+GSL +   PP   AVS
Sbjct: 133 LRYFVVFDSVMPWAMDVAAERGVDSAPFFTESCAVNQILNQVYEGSLCLSSVPPSVGAVS 192

Query: 181 LPSEIVLQPADLPTLPDDGDVVLEFMTSQFINLENVKWIFFNTFDRLECKVVNWMTKTLP 240
           +PS  VL+  DLP  P + +VV+ FM  QF + +  KWIF NTFD+LE KVV WM K  P
Sbjct: 193 IPSLPVLEVEDLPFFPYEREVVMNFMVRQFSSFKKAKWIFVNTFDQLEMKVVRWMGKRWP 252

Query: 241 IKTVGPTIPSAYLDGRLVDDKAYGLNVLNPNDGKKAIEWLDSKETASVIYISFGSLVNLE 300
           IKTVGPTIPSAYL+G L DDK+YGL  L   +  K +EWLD+KE  SVIYISFGSLV L 
Sbjct: 253 IKTVGPTIPSAYLEGELEDDKSYGLKHLKMENNGKILEWLDTKENGSVIYISFGSLVILP 312

Query: 301 KEQVTELTCFLRN--------TNLSFLWVLRESELGKLPNNFVQDTSEQGLIVNWCCQLE 360
            +QV ELT FL+N        TNLSFLWVLRESE+ KLPNNF+Q TS +GL+VNWCCQL+
Sbjct: 313 HKQVDELTNFLKNITAAAATATNLSFLWVLRESEMEKLPNNFIQTTSHKGLVVNWCCQLQ 372

Query: 361 VLSHKAVSCFVTHCGWNSTIEALSLGVPMIAIPQWVDQTTNAKFVADVWEVGVRVK-KND 420
           VLSH AV CFVTHCGWNSTIEALSLGVPM+A+PQW+DQTTNAKFVADVWEVG RVK  +D
Sbjct: 373 VLSHSAVGCFVTHCGWNSTIEALSLGVPMVAVPQWIDQTTNAKFVADVWEVGARVKIGSD 432

Query: 421 KGIVTKEELEASIRKIVQGEKPNEIKQNSIKWKKVAKEAMDEGGSSDKNIDEFVQAM 460
           KGI TKEELEASI+ +  G+  N IK NS+K  K+AKEAM EGGSS+KNI +FV ++
Sbjct: 433 KGIATKEELEASIQSVFGGDGKNRIKINSMKLMKLAKEAMKEGGSSNKNIQQFVDSI 488

BLAST of CmoCh05G004550 vs. NCBI nr
Match: gi|659089392|ref|XP_008445482.1| (PREDICTED: UDP-glycosyltransferase 74E2-like [Cucumis melo])

HSP 1 Score: 525.0 bits (1351), Expect = 1.3e-145
Identity = 283/475 (59.58%), Postives = 343/475 (72.21%), Query Frame = 1

Query: 1   MEKTTVDGGGEMKQSHVIV-FPFPRHGHMNPMLQFAKRLVSKGLLLTFLTTSSASESLIL 60
           ME    +GG ++  + V+V F +P+HGHM+PMLQFAKRL SKGL +TFLTTSS ++SL +
Sbjct: 1   MENAKENGGRKLSSNVVVVVFAYPKHGHMSPMLQFAKRLASKGLRVTFLTTSSVNQSLQI 60

Query: 61  DLPPSPIRHKVISDDPESNNIDSLDAYLRSFRAAASKSLANFIDEALISDSNEVLPS--- 120
           +L PS         D  +  I SL     SF A  S+S  +F+D AL ++ N    S   
Sbjct: 61  NLLPSYQIDLQFISDVRTEPILSLKDEHESFDAVVSRSFGDFLDGALRTNINSDYDSTPP 120

Query: 121 --LIVYDSVMPWVQSVAAERGLDAAPFFTQSAAVNHILDLVYKGSL---SIPPPEDVAVS 180
              +V+DS+MPW   VAAERG+D+APFFT+S AVNHIL+ VY+GSL   S+PP     VS
Sbjct: 121 RYFVVFDSIMPWAMDVAAERGMDSAPFFTESCAVNHILNQVYEGSLCLSSVPPA--AGVS 180

Query: 181 LPSEIVLQPADLPTLPDDGDVVLEFMTSQFINLENVKWIFFNTFDRLECKVVNWMTKTLP 240
           +PS  VL   DLP    + +VV+ FM  QF + +  KWIF NTFD+LE KVVNWM K  P
Sbjct: 181 IPSLPVLAVEDLPFFSYEREVVVNFMVRQFSSFKKAKWIFVNTFDQLEMKVVNWMAKRWP 240

Query: 241 IKTVGPTIPSAYLDGRLVDDKAYGLNVLNPNDGKKAIEWLDSKETASVIYISFGSLVNLE 300
           IKTVGPTIPSAYL+G L +DK+YGL  L   D  K +EWLD+KE  SVIYISFGSLV L 
Sbjct: 241 IKTVGPTIPSAYLEGELENDKSYGLKHLKMEDNGKILEWLDTKENGSVIYISFGSLVVLP 300

Query: 301 KEQVTELTCFLRN-----TNLSFLWVLRESELGKLPNNFVQDTSEQGLIVNWCCQLEVLS 360
            EQV EL   L++     TNLSFLWVLRESE+ KLPNNF+Q TS +GL+VNWCCQL+VLS
Sbjct: 301 HEQVDELANCLKSITTTTTNLSFLWVLRESEIEKLPNNFIQSTSHKGLVVNWCCQLQVLS 360

Query: 361 HKAVSCFVTHCGWNSTIEALSLGVPMIAIPQWVDQTTNAKFVADVWEVGVRVK-KNDKGI 420
           H A+ CFVTHCGWNSTIEALSLGVPM+A+PQW+DQTTNAKFVADVWEVGVRVK  +DKGI
Sbjct: 361 HNAIGCFVTHCGWNSTIEALSLGVPMVAVPQWIDQTTNAKFVADVWEVGVRVKIGSDKGI 420

Query: 421 VTKEELEASIRKIVQGEK-PNEIKQNSIKWKKVAKEAMDEGGSSDKNIDEFVQAM 460
            TKEELEASI+++  G+   NEIK NS    K+AKEAM EGGSS KNI+EFV ++
Sbjct: 421 ATKEELEASIQRVFGGDHGKNEIKINSTNLMKLAKEAMKEGGSSYKNIEEFVDSI 473

BLAST of CmoCh05G004550 vs. NCBI nr
Match: gi|343466211|gb|AEM42999.1| (UDP-glucosyltransferase [Siraitia grosvenorii])

HSP 1 Score: 448.7 bits (1153), Expect = 1.2e-122
Identity = 224/452 (49.56%), Postives = 320/452 (70.80%), Query Frame = 1

Query: 11  EMKQSHVIVFPFPRHGHMNPMLQFAKRLVSKGLLLTFLTTSSASESLILDLPPS-PIRHK 70
           E   +H++VFPFP  GH+NP+LQ +KRL++KG+ ++ +TT   S  L L    S  ++ +
Sbjct: 2   EKGDTHILVFPFPSQGHINPLLQLSKRLIAKGIKVSLVTTLHVSNHLQLQGAYSNSVKIE 61

Query: 71  VISDDPESN-NIDSLDAYLRSFRAAASKSLANFIDEALISDSNEVLPSLIVYDSVMPWVQ 130
           VISD  E     D++   L  FR   +K+L +F+ +A++S +    P  I+YDS MPWV 
Sbjct: 62  VISDGSEDRLETDTMRQTLDRFRQKMTKNLEDFLQKAMVSSNP---PKFILYDSTMPWVL 121

Query: 131 SVAAERGLDAAPFFTQSAAVNHILDLVYKGSLSIPPPEDVAVSLPSEIVLQPADLPTL-- 190
            VA E GLD APF+TQS A+N I   V  G L +PP E   +SLPS  +L+P+DLP    
Sbjct: 122 EVAKEFGLDRAPFYTQSCALNSINYHVLHGQLKLPP-ETPTISLPSMPLLRPSDLPAYDF 181

Query: 191 -PDDGDVVLEFMTSQFINLENVKWIFFNTFDRLECKVVNWM-TKTLPIKTVGPTIPSAYL 250
            P   D +++ +TSQ+ N+++   +F NTFD+LE +++ WM T   P+KTVGPT+PSAYL
Sbjct: 182 DPASTDTIIDLLTSQYSNIQDANLLFCNTFDKLEGEIIQWMETLGRPVKTVGPTVPSAYL 241

Query: 251 DGRLVDDKAYGLNVLNPNDGKKAIEWLDSKETASVIYISFGSLVNLEKEQVTELTCFLRN 310
           D R+ +DK YGL++  PN+    ++WLDSK + SV+Y+S+GSLV + +EQ+ EL   ++ 
Sbjct: 242 DKRVENDKHYGLSLFKPNEDV-CLKWLDSKPSGSVLYVSYGSLVEMGEEQLKELALGIKE 301

Query: 311 TNLSFLWVLRESELGKLPNNFVQDTSEQGLIVNWCCQLEVLSHKAVSCFVTHCGWNSTIE 370
           T   FLWV+R++E  KLP NFV+  +E+GL+V+WC QLEVL+H +V CF THCGWNST+E
Sbjct: 302 TGKFFLWVVRDTEAEKLPPNFVESVAEKGLVVSWCSQLEVLAHPSVGCFFTHCGWNSTLE 361

Query: 371 ALSLGVPMIAIPQWVDQTTNAKFVADVWEVGVRVKKNDKGIVTKEELEASIRKIVQGEKP 430
           AL LGVP++A PQW DQ TNAKF+ DVW+VG RVK+N++ + +KEE+ + I ++++GE+ 
Sbjct: 362 ALCLGVPVVAFPQWADQVTNAKFLEDVWKVGKRVKRNEQRLASKEEVRSCIWEVMEGERA 421

Query: 431 NEIKQNSIKWKKVAKEAMDEGGSSDKNIDEFV 457
           +E K NS++WKK AKEA+DEGGSSDKNI+EFV
Sbjct: 422 SEFKSNSMEWKKWAKEAVDEGGSSDKNIEEFV 448

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
U74E2_ARATH3.5e-10645.05UDP-glycosyltransferase 74E2 OS=Arabidopsis thaliana GN=UGT74E2 PE=1 SV=1[more]
U74E1_ARATH6.6e-10544.18UDP-glycosyltransferase 74E1 OS=Arabidopsis thaliana GN=UGT74E1 PE=3 SV=1[more]
U74D1_ARATH3.8e-10043.20UDP-glycosyltransferase 74D1 OS=Arabidopsis thaliana GN=UGT74D1 PE=1 SV=1[more]
U74F2_ARATH1.1e-9643.20UDP-glycosyltransferase 74F2 OS=Arabidopsis thaliana GN=UGT74F2 PE=1 SV=1[more]
U74G1_STERE1.9e-9642.86UDP-glycosyltransferase 74G1 OS=Stevia rebaudiana GN=UGT74G1 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KD63_CUCSA9.3e-20778.54Glycosyltransferase OS=Cucumis sativus GN=Csa_6G366280 PE=3 SV=1[more]
K7NBW3_SIRGR8.4e-12349.56Glycosyltransferase OS=Siraitia grosvenorii GN=UDPG1 PE=2 SV=1[more]
A0A0D2UTG6_GOSRA1.2e-12151.55Glycosyltransferase OS=Gossypium raimondii GN=B456_011G131200 PE=3 SV=1[more]
A0A061EH11_THECC2.3e-12052.53Uridine diphosphate glycosyltransferase 74E2, putative isoform 1 OS=Theobroma ca... [more]
B9N960_POPTR5.1e-12049.78Glycosyltransferase OS=Populus trichocarpa GN=POPTR_0001s39980g PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT1G05680.12.0e-10745.05 Uridine diphosphate glycosyltransferase 74E2[more]
AT1G05675.13.7e-10644.18 UDP-Glycosyltransferase superfamily protein[more]
AT2G31750.12.1e-10143.20 UDP-glucosyl transferase 74D1[more]
AT2G43820.16.4e-9843.20 UDP-glucosyltransferase 74F2[more]
AT2G43840.24.1e-9742.54 UDP-glycosyltransferase 74 F1[more]
Match NameE-valueIdentityDescription
gi|449452887|ref|XP_004144190.1|1.3e-20678.54PREDICTED: UDP-glycosyltransferase 74E2-like [Cucumis sativus][more]
gi|659089390|ref|XP_008445481.1|8.7e-20678.76PREDICTED: UDP-glycosyltransferase 74E2-like [Cucumis melo][more]
gi|778715379|ref|XP_004144267.2|5.7e-14960.17PREDICTED: UDP-glycosyltransferase 74F1-like [Cucumis sativus][more]
gi|659089392|ref|XP_008445482.1|1.3e-14559.58PREDICTED: UDP-glycosyltransferase 74E2-like [Cucumis melo][more]
gi|343466211|gb|AEM42999.1|1.2e-12249.56UDP-glucosyltransferase [Siraitia grosvenorii][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002213UDP_glucos_trans
Vocabulary: Molecular Function
TermDefinition
GO:0016758transferase activity, transferring hexosyl groups
Vocabulary: Biological Process
TermDefinition
GO:0008152metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016758 transferase activity, transferring hexosyl groups
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh05G004550.1CmoCh05G004550.1mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePANTHERPTHR11926GLUCOSYL/GLUCURONOSYL TRANSFERASEScoord: 12..460
score: 4.7E
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePFAMPF00201UDPGTcoord: 264..432
score: 6.8
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePROSITEPS00375UDPGTcoord: 338..381
scor
NoneNo IPR availableGENE3DG3DSA:3.40.50.2000coord: 267..422
score: 8.2
NoneNo IPR availablePANTHERPTHR11926:SF271UDP-GLYCOSYLTRANSFERASE 74D1coord: 12..460
score: 4.7E
NoneNo IPR availableunknownSSF53756UDP-Glycosyltransferase/glycogen phosphorylasecoord: 15..459
score: 6.28E

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
CmoCh05G004550Cp4.1LG07g00640Cucurbita pepo (Zucchini)cmocpeB744
CmoCh05G004550Carg26692Silver-seed gourdcarcmoB0017
The following gene(s) are paralogous to this gene:

None