CSPI06G30750 (gene) Wild cucumber (PI 183967)

Name: CSPI06G30750
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Glycosyltransferase
Location: Chr6: 26368163..26369605 (+)
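
As a quick sanity check on the coordinates above, the span length can be computed directly. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (the usual GFF-style convention; the record itself does not state this). The function name is illustrative only.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


length = feature_length(26368163, 26369605)
print(length)                     # 1443
print(length // 3, length % 3)    # 481 0
```

Under that assumption the feature spans 1443 bp, which is 481 codons: 480 residues plus a stop codon. That agrees with the 480-residue query seen in the BLAST alignments for this gene.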
   



The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCTGGCCAAGAATCCAAAACCCACGTGGCTTTGCTAGTCAGCCCCGGAATGGGTCATCTCATCCCCTTCCTCGAACTCGCCAACCGTCTCGTCCTCCACCACAACCTCCAAGCCACCCTCTTCGTCGTCGGTACCGGCTCCTCCTCCGCAGAATCCACCCTCCTCCAAAAACCCTCCCTCGTCAACATCGTCTCTCTTCCTCACTCCTTATCCTCCCTGGACCCAAACGCCCCCATCTGCGACATAATCATCTCCATGATGACCGCCTCTTTCCCCTTCCTTCGCTCCTCCATCGCTGCCGTCAACCCTCGTCCGGCGGCCCTCATCGTAGACCTTTTCGGAACCCCAGCTCTGTCAATAGCCCATGAACTCGGCATGCTGGGCTTGGTTTTCATGACCACTAATGCTTGGTACCTCTCTGTCTCGTATCTTTACCCTTCCTTTGAGAAACCAATGGTTGACGCCCACGTGTACAACCACGATCCTCTCGTGATCCCGGGTTGCACCCCCGTCCGGTTCGAAGATACCATCGAAGTGTTCGAATTGAACCAGGAGGAAGTTTACGTGGGATTCGGTCGTTACGCAAGGGAACTTGGAACGGCCGATGGGATCTTGTCAAACACGTGGCAGGATCTTGAGCCCACAACTCTAAAAGCACTCTCCGAAGCTGGGACCCTCGGTAATGGGAAAGTCAACGAAGTCCCGATTTATCCAATTGGGCCGTTGACTAGGAATTGCGAGCCCACTTTAGAGAGTGAGGTGTTGAAATGGCTTGATCGCCAACCGGATGAGTCAGTGATATACGTGTCCTTTGGGAGTGGAGGGACATTATGTGAAGAACAAATCACGGAATTGGCGTGGGGGTTGGAGCTGAGCCAGCAGCGCTTTGTTTGGGTGATACGCCCGCCGGAAGGGACGGAATCCACGGGAGCGTTTTTCACGGCGGGAAGAGGATCATCAAGGGACTATTGGGCGTCAAAATACTTGCCGGAAGGGTTCATAAAAAGAACGAAAGAGGTGGGTTTAGTGATTCCCATGTGGGGCCCACAGGCGGAGATTTTGAGTCACAGATCGGTGAGGGGATTTGTGACACACTGCGGGTGGAACTCGTCATTAGAGAGCATAGTGAACGGAGTGGCGATGGTGACGTGGCCGTTGTATGCAGAGCAGAAGATGAACGCAGCGTTGCTGACTGAGGAAATGGGTGTGGCGGTTAGGTTGAGGGCGGAGGGTCAGGGAGTGGTGGAGAGGAAGGAGATCGAGAAGAAGGTGAGGATGATAATGGAAGGCAAAGAAGGTGAGGGAATTAGAGAGAGGGTTAAAGAGCTTAAAATTAGTGGGGGAAAAGCCGTCACCAAGGGTGGGTCTTCGTACAATTCCTTGGCTCGTGTGGCTTCGGAATGCGATATTTTCCGGCGCCGTAGAGACGGAGGGTATTAG

mRNA sequence

ATGCCTGGCCAAGAATCCAAAACCCACGTGGCTTTGCTAGTCAGCCCCGGAATGGGTCATCTCATCCCCTTCCTCGAACTCGCCAACCGTCTCGTCCTCCACCACAACCTCCAAGCCACCCTCTTCGTCGTCGGTACCGGCTCCTCCTCCGCAGAATCCACCCTCCTCCAAAAACCCTCCCTCGTCAACATCGTCTCTCTTCCTCACTCCTTATCCTCCCTGGACCCAAACGCCCCCATCTGCGACATAATCATCTCCATGATGACCGCCTCTTTCCCCTTCCTTCGCTCCTCCATCGCTGCCGTCAACCCTCGTCCGGCGGCCCTCATCGTAGACCTTTTCGGAACCCCAGCTCTGTCAATAGCCCATGAACTCGGCATGCTGGGCTTGGTTTTCATGACCACTAATGCTTGGTACCTCTCTGTCTCGTATCTTTACCCTTCCTTTGAGAAACCAATGGTTGACGCCCACGTGTACAACCACGATCCTCTCGTGATCCCGGGTTGCACCCCCGTCCGGTTCGAAGATACCATCGAAGTGTTCGAATTGAACCAGGAGGAAGTTTACGTGGGATTCGGTCGTTACGCAAGGGAACTTGGAACGGCCGATGGGATCTTGTCAAACACGTGGCAGGATCTTGAGCCCACAACTCTAAAAGCACTCTCCGAAGCTGGGACCCTCGGTAATGGGAAAGTCAACGAAGTCCCGATTTATCCAATTGGGCCGTTGACTAGGAATTGCGAGCCCACTTTAGAGAGTGAGGTGTTGAAATGGCTTGATCGCCAACCGGATGAGTCAGTGATATACGTGTCCTTTGGGAGTGGAGGGACATTATGTGAAGAACAAATCACGGAATTGGCGTGGGGGTTGGAGCTGAGCCAGCAGCGCTTTGTTTGGGTGATACGCCCGCCGGAAGGGACGGAATCCACGGGAGCGTTTTTCACGGCGGGAAGAGGATCATCAAGGGACTATTGGGCGTCAAAATACTTGCCGGAAGGGTTCATAAAAAGAACGAAAGAGGTGGGTTTAGTGATTCCCATGTGGGGCCCACAGGCGGAGATTTTGAGTCACAGATCGGTGAGGGGATTTGTGACACACTGCGGGTGGAACTCGTCATTAGAGAGCATAGTGAACGGAGTGGCGATGGTGACGTGGCCGTTGTATGCAGAGCAGAAGATGAACGCAGCGTTGCTGACTGAGGAAATGGGTGTGGCGGTTAGGTTGAGGGCGGAGGGTCAGGGAGTGGTGGAGAGGAAGGAGATCGAGAAGAAGGTGAGGATGATAATGGAAGGCAAAGAAGGTGAGGGAATTAGAGAGAGGGTTAAAGAGCTTAAAATTAGTGGGGGAAAAGCCGTCACCAAGGGTGGGTCTTCGTACAATTCCTTGGCTCGTGTGGCTTCGGAATGCGATATTTTCCGGCGCCGTAGAGACGGAGGGTATTAG

Coding sequence (CDS)

ATGCCTGGCCAAGAATCCAAAACCCACGTGGCTTTGCTAGTCAGCCCCGGAATGGGTCATCTCATCCCCTTCCTCGAACTCGCCAACCGTCTCGTCCTCCACCACAACCTCCAAGCCACCCTCTTCGTCGTCGGTACCGGCTCCTCCTCCGCAGAATCCACCCTCCTCCAAAAACCCTCCCTCGTCAACATCGTCTCTCTTCCTCACTCCTTATCCTCCCTGGACCCAAACGCCCCCATCTGCGACATAATCATCTCCATGATGACCGCCTCTTTCCCCTTCCTTCGCTCCTCCATCGCTGCCGTCAACCCTCGTCCGGCGGCCCTCATCGTAGACCTTTTCGGAACCCCAGCTCTGTCAATAGCCCATGAACTCGGCATGCTGGGCTTGGTTTTCATGACCACTAATGCTTGGTACCTCTCTGTCTCGTATCTTTACCCTTCCTTTGAGAAACCAATGGTTGACGCCCACGTGTACAACCACGATCCTCTCGTGATCCCGGGTTGCACCCCCGTCCGGTTCGAAGATACCATCGAAGTGTTCGAATTGAACCAGGAGGAAGTTTACGTGGGATTCGGTCGTTACGCAAGGGAACTTGGAACGGCCGATGGGATCTTGTCAAACACGTGGCAGGATCTTGAGCCCACAACTCTAAAAGCACTCTCCGAAGCTGGGACCCTCGGTAATGGGAAAGTCAACGAAGTCCCGATTTATCCAATTGGGCCGTTGACTAGGAATTGCGAGCCCACTTTAGAGAGTGAGGTGTTGAAATGGCTTGATCGCCAACCGGATGAGTCAGTGATATACGTGTCCTTTGGGAGTGGAGGGACATTATGTGAAGAACAAATCACGGAATTGGCGTGGGGGTTGGAGCTGAGCCAGCAGCGCTTTGTTTGGGTGATACGCCCGCCGGAAGGGACGGAATCCACGGGAGCGTTTTTCACGGCGGGAAGAGGATCATCAAGGGACTATTGGGCGTCAAAATACTTGCCGGAAGGGTTCATAAAAAGAACGAAAGAGGTGGGTTTAGTGATTCCCATGTGGGGCCCACAGGCGGAGATTTTGAGTCACAGATCGGTGAGGGGATTTGTGACACACTGCGGGTGGAACTCGTCATTAGAGAGCATAGTGAACGGAGTGGCGATGGTGACGTGGCCGTTGTATGCAGAGCAGAAGATGAACGCAGCGTTGCTGACTGAGGAAATGGGTGTGGCGGTTAGGTTGAGGGCGGAGGGTCAGGGAGTGGTGGAGAGGAAGGAGATCGAGAAGAAGGTGAGGATGATAATGGAAGGCAAAGAAGGTGAGGGAATTAGAGAGAGGGTTAAAGAGCTTAAAATTAGTGGGGGAAAAGCCGTCACCAAGGGTGGGTCTTCGTACAATTCCTTGGCTCGTGTGGCTTCGGAATGCGATATTTTCCGGCGCCGTAGAGACGGAGGGTATTAG
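
To relate the coding sequence to the protein used as the BLAST query, the CDS can be translated with the standard genetic code. The sketch below is hedged: it assumes the standard nuclear code applies, builds the codon table by hand rather than using any library, and translates only the first 30 and last 12 nucleotides copied from the CDS above. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... (T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; trailing partial codons are ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


cds_start = "ATGCCTGGCCAAGAATCCAAAACCCACGTG"  # first 30 nt of the CDS
cds_end = "GGAGGGTATTAG"                      # last 12 nt of the CDS

print(translate(cds_start))  # MPGQESKTHV
print(translate(cds_end))    # GGY*
```

The two fragments match the start (`MPGQESKTHV`) and end (`GGY`) of the query protein shown in the alignments, with `*` marking the stop codon.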
BLAST of CSPI06G30750 vs. Swiss-Prot
Match: U72E2_ARATH (UDP-glycosyltransferase 72E2 OS=Arabidopsis thaliana GN=UGT72E2 PE=1 SV=1)

HSP 1 Score: 436.8 bits (1122), Expect = 3.1e-121
Identity = 234/478 (48.95%), Positives = 312/478 (65.27%), Query Frame = 1

Query: 6   SKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSSAESTLLQKPSLVNIV 65
           +K H A+  SPGMGH+IP +EL  RL  ++    T+FV+ T ++SA+S  L     V+IV
Sbjct: 4   TKPHAAMFSSPGMGHVIPVIELGKRLSANNGFHVTVFVLETDAASAQSKFLNSTG-VDIV 63

Query: 66  SLPHS--LSSLDPNAPICDIIISMMTASFPFLRSSIAAVNPRPAALIVDLFGTPALSIAH 125
            LP       +DP+  +   I  +M A+ P LRS IAA++ +P ALIVDLFGT AL +A 
Sbjct: 64  KLPSPDIYGLVDPDDHVVTKIGVIMRAAVPALRSKIAAMHQKPTALIVDLFGTDALCLAK 123

Query: 126 ELGMLGLVFMTTNAWYLSVSYLYPSFEKPMVDAHVYNHDPLVIPGCTPVRFEDTIEVFEL 185
           E  ML  VF+ TNA +L VS  YP+ +K + + H    +PL IPGC PVRFEDT++ + +
Sbjct: 124 EFNMLSYVFIPTNARFLGVSIYYPNLDKDIKEEHTVQRNPLAIPGCEPVRFEDTLDAYLV 183

Query: 186 NQEEVYVGFGRYARELGTADGILSNTWQDLEPTTLKALSEAGTLGNGKVNEVPIYPIGPL 245
             E VY  F R+      ADGIL NTW+++EP +LK+L     L  G+V  VP+YPIGPL
Sbjct: 184 PDEPVYRDFVRHGLAYPKADGILVNTWEEMEPKSLKSLLNPKLL--GRVARVPVYPIGPL 243

Query: 246 TRNCEPTLESE----VLKWLDRQPDESVIYVSFGSGGTLCEEQITELAWGLELSQQRFVW 305
              C P   SE    VL WL+ QP+ESV+Y+SFGSGG L  +Q+TELAWGLE SQQRFVW
Sbjct: 244 ---CRPIQSSETDHPVLDWLNEQPNESVLYISFGSGGCLSAKQLTELAWGLEQSQQRFVW 303

Query: 306 VIRPPEGTESTGAFFTAGRGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQAEILSHRS 365
           V+RPP        + +A  G + D    +YLPEGF+ RT + G V+P W PQAEILSHR+
Sbjct: 304 VVRPPVDGSCCSEYVSANGGGTEDN-TPEYLPEGFVSRTSDRGFVVPSWAPQAEILSHRA 363

Query: 366 VRGFVTHCGWNSSLESIVNGVAMVTWPLYAEQKMNAALLTEEMGVAVRLRAEGQGVVERK 425
           V GF+THCGW+S+LES+V GV M+ WPL+AEQ MNAALL++E+G+AVRL  + +  + R 
Sbjct: 364 VGGFLTHCGWSSTLESVVGGVPMIAWPLFAEQNMNAALLSDELGIAVRL-DDPKEDISRW 423

Query: 426 EIEKKVRMIMEGKEGEGIRERVKELKISG--GKAVTKGGSSYNSLARVASECDIFRRR 476
           +IE  VR +M  KEGE +R +VK+L+ S     ++  GG ++ SL RV  EC  F  R
Sbjct: 424 KIEALVRKVMTEKEGEAMRRKVKKLRDSAEMSLSIDGGGLAHESLCRVTKECQRFLER 473
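
The percentages in each HSP header are ratios of matching columns to the alignment length (which, in standard BLAST reporting, includes gap columns). The following sketch recomputes them from a header line; the function name and the regular expression are assumptions made for illustration, not part of any BLAST tool.

```python
import re


def hsp_percentages(line: str) -> dict:
    """Recompute 'name = hits/total' ratios from an HSP header line as percentages."""
    result = {}
    for name, hits, total in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        result[name] = round(100 * int(hits) / int(total), 2)
    return result


header = "Identity = 234/478 (48.95%), Positives = 312/478 (65.27%), Query Frame = 1"
stats = hsp_percentages(header)
print(stats)  # {'Identity': 48.95, 'Positives': 65.27}
```

The `Query Frame = 1` field is skipped because it contains no `hits/total` fraction.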

BLAST of CSPI06G30750 vs. Swiss-Prot
Match: U72E1_ARATH (UDP-glycosyltransferase 72E1 OS=Arabidopsis thaliana GN=UGT72E1 PE=1 SV=1)

HSP 1 Score: 436.0 bits (1120), Expect = 5.2e-121
Identity = 226/474 (47.68%), Positives = 317/474 (66.88%), Query Frame = 1

Query: 6   SKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSSAESTLLQKP----SL 65
           +K HVA+  SPGMGH+IP +EL  RL   H    T+FV+ T ++SA+S  L  P    +L
Sbjct: 4   TKPHVAMFASPGMGHIIPVIELGKRLAGSHGFDVTIFVLETDAASAQSQFLNSPGCDAAL 63

Query: 66  VNIVSLPH-SLSSL-DPNAPICDIIISMMTASFPFLRSSIAAVNPRPAALIVDLFGTPAL 125
           V+IV LP   +S L DP+A     ++ MM  + P +RS I  +  +P ALIVDLFG  A+
Sbjct: 64  VDIVGLPTPDISGLVDPSAFFGIKLLVMMRETIPTIRSKIEEMQHKPTALIVDLFGLDAI 123

Query: 126 SIAHELGMLGLVFMTTNAWYLSVSYLYPSFEKPMVDAHVYNHDPLVIPGCTPVRFEDTIE 185
            +  E  ML  +F+ +NA +L+V+  +P+ +K M + H+    P+V+PGC PVRFEDT+E
Sbjct: 124 PLGGEFNMLTYIFIASNARFLAVALFFPTLDKDMEEEHIIKKQPMVMPGCEPVRFEDTLE 183

Query: 186 VFELNQEEVYVGFGRYARELGTADGILSNTWQDLEPTTLKALSEAGTLGNGKVNEVPIYP 245
            F     ++Y  F  +     T DGI+ NTW D+EP TLK+L +   L  G++  VP+YP
Sbjct: 184 TFLDPNSQLYREFVPFGSVFPTCDGIIVNTWDDMEPKTLKSLQDPKLL--GRIAGVPVYP 243

Query: 246 IGPLTRNCEPTLESE-VLKWLDRQPDESVIYVSFGSGGTLCEEQITELAWGLELSQQRFV 305
           IGPL+R  +P+  +  VL WL++QPDESV+Y+SFGSGG+L  +Q+TELAWGLE+SQQRFV
Sbjct: 244 IGPLSRPVDPSKTNHPVLDWLNKQPDESVLYISFGSGGSLSAKQLTELAWGLEMSQQRFV 303

Query: 306 WVIRPPEGTESTGAFFTAGRGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQAEILSHR 365
           WV+RPP    +  A+ +A  G  RD     YLPEGF+ RT E G ++  W PQAEIL+H+
Sbjct: 304 WVVRPPVDGSACSAYLSANSGKIRD-GTPDYLPEGFVSRTHERGFMVSSWAPQAEILAHQ 363

Query: 366 SVRGFVTHCGWNSSLESIVNGVAMVTWPLYAEQKMNAALLTEEMGVAVR-LRAEGQGVVE 425
           +V GF+THCGWNS LES+V GV M+ WPL+AEQ MNA LL EE+GVAVR  +   +GV+ 
Sbjct: 364 AVGGFLTHCGWNSILESVVGGVPMIAWPLFAEQMMNATLLNEELGVAVRSKKLPSEGVIT 423

Query: 426 RKEIEKKVRMIMEGKEGEGIRERVKELKISGGKAVT-KGGSSYNSLARVASECD 471
           R EIE  VR IM  +EG  +R+++K+LK +  ++++  GG ++ SL+R+A E +
Sbjct: 424 RAEIEALVRKIMVEEEGAEMRKKIKKLKETAAESLSCDGGVAHESLSRIADESE 474

BLAST of CSPI06G30750 vs. Swiss-Prot
Match: UFOG5_MANES (Anthocyanidin 3-O-glucosyltransferase 5 OS=Manihot esculenta GN=GT5 PE=2 SV=1)

HSP 1 Score: 432.6 bits (1111), Expect = 5.8e-120
Identity = 229/470 (48.72%), Positives = 310/470 (65.96%), Query Frame = 1

Query: 6   SKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSSAESTLLQK---PSLV 65
           SK H+ LL SPG+GHLIP LEL  R+V   N   T+F+VG+ +S+AE  +L+    P L 
Sbjct: 8   SKPHIVLLSSPGLGHLIPVLELGKRIVTLCNFDVTIFMVGSDTSAAEPQVLRSAMTPKLC 67

Query: 66  NIVSLPHSLSS--LDPNAPICDIIISMMTASFPFLRSSIAAVNPRPAALIVDLFGTPALS 125
            I+ LP    S  +DP A +C  +  +M    P  R++++A+  RPAA+IVDLFGT +L 
Sbjct: 68  EIIQLPPPNISCLIDPEATVCTRLFVLMREIRPAFRAAVSALKFRPAAIIVDLFGTESLE 127

Query: 126 IAHELGMLGLVFMTTNAWYLSVSYLYPSFEKPMVDAHVYNHDPLVIPGCTPVRFEDTIEV 185
           +A ELG+   V++ +NAW+L+++   P  +K +    V   +P+ IPGC PVR E+ ++ 
Sbjct: 128 VAKELGIAKYVYIASNAWFLALTIYVPILDKEVEGEFVLQKEPMKIPGCRPVRTEEVVDP 187

Query: 186 FELNQEEVYVGFGRYARELGTADGILSNTWQDLEPTTLKALSEAGTLGNGKVNEVPIYPI 245
                 + Y  + R   E+ TADGIL NTW+ LEPTT  AL +   L  G+V +VP++PI
Sbjct: 188 MLDRTNQQYSEYFRLGIEIPTADGILMNTWEALEPTTFGALRDVKFL--GRVAKVPVFPI 247

Query: 246 GPLTRNCEPT-LESEVLKWLDRQPDESVIYVSFGSGGTLCEEQITELAWGLELSQQRFVW 305
           GPL R   P     E+L WLD+QP ESV+YVSFGSGGTL  EQ+ ELAWGLE SQQRF+W
Sbjct: 248 GPLRRQAGPCGSNCELLDWLDQQPKESVVYVSFGSGGTLSLEQMIELAWGLERSQQRFIW 307

Query: 306 VIRPPEGTESTGAFFTAGRGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQAEILSHRS 365
           V+R P       AFFT G G+      S Y PEGF+ R + VGLV+P W PQ  I+SH S
Sbjct: 308 VVRQPTVKTGDAAFFTQGDGADD---MSGYFPEGFLTRIQNVGLVVPQWSPQIHIMSHPS 367

Query: 366 VRGFVTHCGWNSSLESIVNGVAMVTWPLYAEQKMNAALLTEEMGVAVRLR-AEGQGVVER 425
           V  F++HCGWNS LESI  GV ++ WP+YAEQ+MNA LLTEE+GVAVR +    + VV+R
Sbjct: 368 VGVFLSHCGWNSVLESITAGVPIIAWPIYAEQRMNATLLTEELGVAVRPKNLPAKEVVKR 427

Query: 426 KEIEKKVRMIMEGKEGEGIRERVKELKISGGKAVTKGGSSYNSLARVASE 469
           +EIE+ +R IM  +EG  IR+RV+ELK SG KA+ +GGSS+N ++ + +E
Sbjct: 428 EEIERMIRRIMVDEEGSEIRKRVRELKDSGEKALNEGGSSFNYMSALGNE 472

BLAST of CSPI06G30750 vs. Swiss-Prot
Match: U72E3_ARATH (UDP-glycosyltransferase 72E3 OS=Arabidopsis thaliana GN=UGT72E3 PE=1 SV=1)

HSP 1 Score: 428.7 bits (1101), Expect = 8.3e-119
Identity = 224/472 (47.46%), Positives = 316/472 (66.95%), Query Frame = 1

Query: 6   SKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSSAESTLLQKPSLVNIV 65
           +K H A+  SPGMGH++P +ELA RL  +H    T+FV+ T ++S +S LL     V+IV
Sbjct: 4   TKPHAAMFSSPGMGHVLPVIELAKRLSANHGFHVTVFVLETDAASVQSKLLNSTG-VDIV 63

Query: 66  SLPH-SLSSL-DPNAPICDIIISMMTASFPFLRSSIAAVNPRPAALIVDLFGTPALSIAH 125
           +LP   +S L DPNA +   I  +M  + P LRS I A++  P ALI+DLFGT AL +A 
Sbjct: 64  NLPSPDISGLVDPNAHVVTKIGVIMREAVPTLRSKIVAMHQNPTALIIDLFGTDALCLAA 123

Query: 126 ELGMLGLVFMTTNAWYLSVSYLYPSFEKPMVDAHVYNHDPLVIPGCTPVRFEDTIEVFEL 185
           EL ML  VF+ +NA YL VS  YP+ ++ + + H     PL IPGC PVRFED ++ + +
Sbjct: 124 ELNMLTYVFIASNARYLGVSIYYPTLDEVIKEEHTVQRKPLTIPGCEPVRFEDIMDAYLV 183

Query: 186 NQEEVYVGFGRYARELGTADGILSNTWQDLEPTTLKALSEAGTLGNGKVNEVPIYPIGPL 245
             E VY    R+      ADGIL NTW+++EP +LK+L +   L  G+V  VP+YP+GPL
Sbjct: 184 PDEPVYHDLVRHCLAYPKADGILVNTWEEMEPKSLKSLQDPKLL--GRVARVPVYPVGPL 243

Query: 246 TRNCE-PTLESEVLKWLDRQPDESVIYVSFGSGGTLCEEQITELAWGLELSQQRFVWVIR 305
            R  +  T +  V  WL++QP+ESV+Y+SFGSGG+L  +Q+TELAWGLE SQQRF+WV+R
Sbjct: 244 CRPIQSSTTDHPVFDWLNKQPNESVLYISFGSGGSLTAQQLTELAWGLEESQQRFIWVVR 303

Query: 306 PPEGTESTGAFFTAGRGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQAEILSHRSVRG 365
           PP    S   +F+A  G ++D    +YLPEGF+ RT + G +IP W PQAEIL+H++V G
Sbjct: 304 PPVDGSSCSDYFSAKGGVTKDN-TPEYLPEGFVTRTCDRGFMIPSWAPQAEILAHQAVGG 363

Query: 366 FVTHCGWNSSLESIVNGVAMVTWPLYAEQKMNAALLTEEMGVAVRLRAEGQGVVERKEIE 425
           F+THCGW+S+LES++ GV M+ WPL+AEQ MNAALL++E+G++VR+  + +  + R +IE
Sbjct: 364 FLTHCGWSSTLESVLCGVPMIAWPLFAEQNMNAALLSDELGISVRV-DDPKEAISRSKIE 423

Query: 426 KKVRMIMEGKEGEGIRERVKELKISG--GKAVTKGGSSYNSLARVASECDIF 473
             VR +M   EGE +R +VK+L+ +     ++  GGS++ SL RV  EC  F
Sbjct: 424 AMVRKVMAEDEGEEMRRKVKKLRDTAEMSLSIHGGGSAHESLCRVTKECQRF 470

BLAST of CSPI06G30750 vs. Swiss-Prot
Match: U72B1_ARATH (UDP-glycosyltransferase 72B1 OS=Arabidopsis thaliana GN=UGT72B1 PE=1 SV=1)

HSP 1 Score: 350.9 bits (899), Expect = 2.2e-95
Identity = 204/474 (43.04%), Positives = 288/474 (60.76%), Query Frame = 1

Query: 4   QESKT-HVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGS-SSAESTLLQK-PS 63
           +ESKT HVA++ SPGMGHLIP +E A RLV  H L  T  + G G  S A+ T+L   PS
Sbjct: 2   EESKTPHVAIIPSPGMGHLIPLVEFAKRLVHLHGLTVTFVIAGEGPPSKAQRTVLDSLPS 61

Query: 64  LVNIVSLPH-SLSSLDPNAPICDIIISMMTASFPFLRS---SIAAVNPRPAALIVDLFGT 123
            ++ V LP   L+ L  +  I   I   +T S P LR    S       P AL+VDLFGT
Sbjct: 62  SISSVFLPPVDLTDLSSSTRIESRISLTVTRSNPELRKVFDSFVEGGRLPTALVVDLFGT 121

Query: 124 PALSIAHELGMLGLVFMTTNAWYLSVSYLYPSFEKPMVDAHVYNHDPLVIPGCTPVRFED 183
            A  +A E  +   +F  T A  LS     P  ++ +        +PL++PGC PV  +D
Sbjct: 122 DAFDVAVEFHVPPYIFYPTTANVLSFFLHLPKLDETVSCEFRELTEPLMLPGCVPVAGKD 181

Query: 184 TIEVFELNQEEVYVGFGRYARELGTADGILSNTWQDLEPTTLKALSEAGTLGNGKVNEVP 243
            ++  +  +++ Y       +    A+GIL NT+ +LEP  +KAL E G      +++ P
Sbjct: 182 FLDPAQDRKDDAYKWLLHNTKRYKEAEGILVNTFFELEPNAIKALQEPG------LDKPP 241

Query: 244 IYPIGPLT----RNCEPTLESEVLKWLDRQPDESVIYVSFGSGGTLCEEQITELAWGLEL 303
           +YP+GPL     +  + T ESE LKWLD QP  SV+YVSFGSGGTL  EQ+ ELA GL  
Sbjct: 242 VYPVGPLVNIGKQEAKQTEESECLKWLDNQPLGSVLYVSFGSGGTLTCEQLNELALGLAD 301

Query: 304 SQQRFVWVIRPPEGTESTGAFFTAGRGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQA 363
           S+QRF+WVIR P G  ++  F +  +     +     LP GF++RTK+ G VIP W PQA
Sbjct: 302 SEQRFLWVIRSPSGIANSSYFDSHSQTDPLTF-----LPPGFLERTKKRGFVIPFWAPQA 361

Query: 364 EILSHRSVRGFVTHCGWNSSLESIVNGVAMVTWPLYAEQKMNAALLTEEMGVAVRLRAEG 423
           ++L+H S  GF+THCGWNS+LES+V+G+ ++ WPLYAEQKMNA LL+E++  A+R RA  
Sbjct: 362 QVLAHPSTGGFLTHCGWNSTLESVVSGIPLIAWPLYAEQKMNAVLLSEDIRAALRPRAGD 421

Query: 424 QGVVERKEIEKKVRMIMEGKEGEGIRERVKELKISGGKAVTKGGSSYNSLARVA 467
            G+V R+E+ + V+ +MEG+EG+G+R ++KELK +  + +   G+S  +L+ VA
Sbjct: 422 DGLVRREEVARVVKGLMEGEEGKGVRNKMKELKEAACRVLKDDGTSTKALSLVA 464

BLAST of CSPI06G30750 vs. TrEMBL
Match: A0A0A0KGE4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G501940 PE=4 SV=1)

HSP 1 Score: 953.4 bits (2463), Expect = 1.1e-274
Identity = 477/480 (99.38%), Positives = 477/480 (99.38%), Query Frame = 1

Query: 1   MPGQESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSSAESTLLQKPS 60
           MPGQESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSSAESTLLQKPS
Sbjct: 1   MPGQESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSSAESTLLQKPS 60

Query: 61  LVNIVSLPHSLSSLDPNAPICDIIISMMTASFPFLRSSIAAVNPRPAALIVDLFGTPALS 120
           LVNIVSLPHSLSSLDPNAPICDIIISMMTASFPFLRSSIAAVNPRPAALIVDLFGTPALS
Sbjct: 61  LVNIVSLPHSLSSLDPNAPICDIIISMMTASFPFLRSSIAAVNPRPAALIVDLFGTPALS 120

Query: 121 IAHELGMLGLVFMTTNAWYLSVSYLYPSFEKPMVDAHVYNHDPLVIPGCTPVRFEDTIEV 180
           IAHELGMLGLVFMTTNAWYLSVSYLYPSFEK MVDAHVYNHDPLVIPGCTPVRFEDTIEV
Sbjct: 121 IAHELGMLGLVFMTTNAWYLSVSYLYPSFEKSMVDAHVYNHDPLVIPGCTPVRFEDTIEV 180

Query: 181 FELNQEEVYVGFGRYARELGTADGILSNTWQDLEPTTLKALSEAGTLGNGKVNEVPIYPI 240
           FELNQEEVYVGFGRYARELGTADGILSNTWQDLEPTTLKALSEAGTLG GKVNEVPIYPI
Sbjct: 181 FELNQEEVYVGFGRYARELGTADGILSNTWQDLEPTTLKALSEAGTLGYGKVNEVPIYPI 240

Query: 241 GPLTRNCEPTLESEVLKWLDRQPDESVIYVSFGSGGTLCEEQITELAWGLELSQQRFVWV 300
           GPLTRN EPTLESEVLKWLDRQPDESVIYVSFGSGGTLCEEQITELAWGLELSQQRFVWV
Sbjct: 241 GPLTRNGEPTLESEVLKWLDRQPDESVIYVSFGSGGTLCEEQITELAWGLELSQQRFVWV 300

Query: 301 IRPPEGTESTGAFFTAGRGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQAEILSHRSV 360
           IRPPEGTESTGAFFTAGRGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQAEILSHRSV
Sbjct: 301 IRPPEGTESTGAFFTAGRGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQAEILSHRSV 360

Query: 361 RGFVTHCGWNSSLESIVNGVAMVTWPLYAEQKMNAALLTEEMGVAVRLRAEGQGVVERKE 420
           RGFVTHCGWNSSLESIVNGVAMVTWPLYAEQKMNAALLTEEMGVAVRLRAEGQGVVERKE
Sbjct: 361 RGFVTHCGWNSSLESIVNGVAMVTWPLYAEQKMNAALLTEEMGVAVRLRAEGQGVVERKE 420

Query: 421 IEKKVRMIMEGKEGEGIRERVKELKISGGKAVTKGGSSYNSLARVASECDIFRRRRDGGY 480
           IEKKVRMIMEGKEGEGIRERVKELKISGGKAVTKGGSSYNSLARVASECDIFRRRRDGGY
Sbjct: 421 IEKKVRMIMEGKEGEGIRERVKELKISGGKAVTKGGSSYNSLARVASECDIFRRRRDGGY 480

BLAST of CSPI06G30750 vs. TrEMBL
Match: B9HEN9_POPTR (Glycosyltransferase OS=Populus trichocarpa GN=POPTR_0007s12400g PE=3 SV=1)

HSP 1 Score: 494.2 bits (1271), Expect = 1.8e-136
Identity = 257/473 (54.33%), Positives = 332/473 (70.19%), Query Frame = 1

Query: 4   QESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSSAESTLLQKPSLVN 63
           Q +K H ALL SPGMGHLIP LEL  RLV +H    TLFVV T +S+ +S L +    +N
Sbjct: 2   QNTKPHAALLASPGMGHLIPVLELGKRLVTYHGFHVTLFVVATDASTTQSRLKEPYPNIN 61

Query: 64  IVSLPH-SLSSL-DPNAPICDIIISMMTASFPFLRSSIAAVNPRPAALIVDLFGTPALSI 123
           I++LP   +S L DP A +   +  MM  + P LRS+I A+   P ALIVDLFGT A ++
Sbjct: 62  IITLPLVDISGLIDPAATVVTKLAVMMRETLPSLRSAILALKSPPTALIVDLFGTEAFAV 121

Query: 124 AHELGMLGLVFMTTNAWYLSVSYLYPSFEKPMVDAHVYNHDPLVIPGCTPVRFEDTIEVF 183
           A E  ML  VF T+NAW+ +++  +P+ ++ + D HV    PL IPGC  VRFEDT+  +
Sbjct: 122 AEEFNMLKYVFDTSNAWFFAITIYFPTIDRNLEDKHVIQKQPLRIPGCKSVRFEDTLGAY 181

Query: 184 ELNQEEVYVGFGRYARELGTADGILSNTWQDLEPTTLKALSEAGTLGNGKVNEVPIYPIG 243
               +++Y+ + R   E+  ADGIL NTW+DLEPTTL AL +   L  G+V + P+YPIG
Sbjct: 182 LDRNDQMYIEYKRIGIEMPMADGILMNTWEDLEPTTLGALRDFQML--GRVAKAPVYPIG 241

Query: 244 PLTRNCEPTL-ESEVLKWLDRQPDESVIYVSFGSGGTLCEEQITELAWGLELSQQRFVWV 303
           PL R   P++  ++VL WLD QP+ESVIYVSFGSGGTL  EQ+ ELAWGLELS+QRFVWV
Sbjct: 242 PLARPVGPSVPRNQVLNWLDNQPNESVIYVSFGSGGTLSTEQMAELAWGLELSKQRFVWV 301

Query: 304 IRPPEGTESTGAFFTAGRGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQAEILSHRSV 363
           +RPP   ++ GAFF    GS        +LPEGF+ RT+EVGLV+P+W PQ EIL+H SV
Sbjct: 302 VRPPIDNDAAGAFFNLDDGSE---GIPSFLPEGFLARTREVGLVVPLWAPQVEILAHPSV 361

Query: 364 RGFVTHCGWNSSLESIVNGVAMVTWPLYAEQKMNAALLTEEMGVAVRLRA-EGQGVVERK 423
            GF++HCGWNS+LESI NGV M+ WPLYAEQKMNA +LTEE+GVAV+ +    + VV R 
Sbjct: 362 GGFLSHCGWNSTLESITNGVPMIAWPLYAEQKMNATILTEELGVAVQPKTLASERVVVRA 421

Query: 424 EIEKKVRMIMEGKEGEGIRERVKELKISGGKAV-TKGGSSYNSLARVASECDI 472
           EIE  VR IME +EG GIR+RV ELK SG KA+ +KGGSSYNSL+++A +C++
Sbjct: 422 EIEMMVRKIMEDEEGFGIRKRVNELKHSGEKALSSKGGSSYNSLSQIAKQCEL 469

BLAST of CSPI06G30750 vs. TrEMBL
Match: K7LND6_SOYBN (Glycosyltransferase OS=Glycine max GN=GLYMA_11G064400 PE=3 SV=1)

HSP 1 Score: 491.9 bits (1265), Expect = 8.9e-136
Identity = 255/479 (53.24%), Positives = 333/479 (69.52%), Query Frame = 1

Query: 6   SKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSSAESTLLQKPSLVNIV 65
           SK H AL+ SPGMGHLIP LEL  RL+ HH+   T+F+V T S++  S +LQ+ S +NIV
Sbjct: 4   SKAHAALVASPGMGHLIPMLELGKRLLTHHSFHVTIFIVTTDSATTTSHILQQTSNLNIV 63

Query: 66  SLPHSLSS--LDPNAPICDIIISMMTASFPFLRSSIAAVN-PRPAALIVDLFGTPALSIA 125
            +P    S  L PN P+   I+  M  S PFLRSSI + N P P+ALIVD+FG  A  IA
Sbjct: 64  LVPPIDVSHKLPPNPPLAARIMLTMIDSIPFLRSSILSTNLPPPSALIVDMFGLAAFPIA 123

Query: 126 HELGMLGLVFMTTNAWYLSVSYLYPSFEKPMVDAHVYNHDPLVIPGCTPVRFEDTIEVFE 185
            +LGML  V+  T+AW+ +VS   P+ +K M++ H  +H+PLVIPGC  VRFEDT+E F 
Sbjct: 124 RDLGMLTYVYFATSAWFSAVSVYVPAMDKKMIERHAEHHEPLVIPGCEAVRFEDTLEPFL 183

Query: 186 LNQEEVYVGFGRYARELGTADGILSNTWQDLEPTTLKALSEAGTLGNGKVNEVPIYPIGP 245
               E+Y G+   A+E+ TADGIL NTWQDLEP   KA+ E G L  G+  +  +YP+GP
Sbjct: 184 SPIGEMYEGYLAAAKEIVTADGILMNTWQDLEPAATKAVREDGIL--GRFTKGAVYPVGP 243

Query: 246 LTRNCEPTLESEVLKWLDRQPDESVIYVSFGSGGTLCEEQITELAWGLELSQQRFVWVIR 305
           L R  E   E  VL W+D QP E+V+YVSFGSGGT+ E Q+ E+A GLELSQQRFVWV+R
Sbjct: 244 LVRTVEKKAEDAVLSWMDVQPAETVVYVSFGSGGTMSEVQMREVALGLELSQQRFVWVVR 303

Query: 306 PPEGTESTGAFFTAGRGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQAEILSHRSVRG 365
           PP   +++G+FF   +  S D     YLP+GF+KRT+ VG+V+PMW PQAEIL H +   
Sbjct: 304 PPCEGDTSGSFFEVSKNGSGDV-VLDYLPKGFVKRTEGVGVVVPMWAPQAEILGHPATGC 363

Query: 366 FVTHCGWNSSLESIVNGVAMVTWPLYAEQKMNAALLTEEMGVAVRLRAE-GQGVVERKEI 425
           FVTHCGWNS LES++NGV MV WPLYAEQKMNA +L+EE+GVAVR+  E G GVV R+EI
Sbjct: 364 FVTHCGWNSVLESVLNGVPMVAWPLYAEQKMNAFMLSEELGVAVRVAGEGGGGVVGREEI 423

Query: 426 EKKVRMIMEGKEGEGIRERVKELKISGGKAVTKGGSSYNSLARVASECDIFRRRRDGGY 481
            + VR +M  KEG G+R++VKELK+SG KA++K GSS++ L ++  +C +  +  +  Y
Sbjct: 424 AELVRRVMVDKEGVGMRKKVKELKVSGEKALSKFGSSHHWLCQMNKDCQVHAQASEADY 479

BLAST of CSPI06G30750 vs. TrEMBL
Match: A0A061DGH3_THECC (Glycosyltransferase OS=Theobroma cacao GN=TCM_000444 PE=3 SV=1)

HSP 1 Score: 489.6 bits (1259), Expect = 4.4e-135
Identity = 252/474 (53.16%), Positives = 339/474 (71.52%), Query Frame = 1

Query: 4   QESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSSAESTLLQKPSL-- 63
           Q +K HVALL SPG+GHLIP LEL  RLV HHN + T+FV+ + +S+A++ LL+  ++  
Sbjct: 2   QTTKPHVALLASPGLGHLIPVLELGKRLVTHHNFRITIFVLASEASTAQNQLLESSNMDV 61

Query: 64  VNIVSLPHSLSS--LDPNAPICDIIISMMTASFPFLRSSIAAVNPRPAALIVDLFGTPAL 123
           +NIVSLP +  S  +DP A I   I+ +M  S P LRS+IAA+  RP+ALIVDLFGT AL
Sbjct: 62  LNIVSLPSAEISTKVDPGAHIVTKIVVIMRESLPGLRSAIAAMKSRPSALIVDLFGTEAL 121

Query: 124 SIAHELGMLGLVFMTTNAWYLSVSYLYPSFEKPMVDAHVYNHDPLVIPGCTPVRFEDTIE 183
            +A E  ML  VF+ +NAW+L ++   P+ EK + + HV    PL IPGC  VRFEDT+E
Sbjct: 122 PVADEFKMLKYVFIASNAWFLGITVYAPTVEKIVDEEHVKQQKPLKIPGCKSVRFEDTLE 181

Query: 184 VFELNQEEVYVGFGRYARELGTADGILSNTWQDLEPTTLKALSEAGTLGNGKVNEVPIYP 243
            +    +++Y  + R   E+  ADGIL NT++DLEP TL++L++A  L  G+V +VP+YP
Sbjct: 182 AYLNRNDQLYGEYARVGLEIPEADGILVNTFEDLEPATLRSLTDAELL--GRVAKVPVYP 241

Query: 244 IGPLTRNCEP-TLESEVLKWLDRQPDESVIYVSFGSGGTLCEEQITELAWGLELSQQRFV 303
           IGP+ R   P  L   VL WLD+QP +SVIYVSFGSGGTL  +Q+TE+AWGLE SQQRF+
Sbjct: 242 IGPVVRTLGPLVLADPVLDWLDKQPSQSVIYVSFGSGGTLSAKQMTEIAWGLEQSQQRFI 301

Query: 304 WVIRPPEGTESTGAFFTAGRGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQAEILSHR 363
           WV+RPP   +++G FFT G  S        YLP+GF+ RT++ GLV+PMW PQ +IL+H 
Sbjct: 302 WVVRPPVENDASGTFFTVGNDSD---GTPDYLPDGFLTRTRDRGLVLPMWAPQTDILAHP 361

Query: 364 SVRGFVTHCGWNSSLESIVNGVAMVTWPLYAEQKMNAALLTEEMGVAVRLR-AEGQGVVE 423
           SV GFV+HCGWNS++ES++NGV ++ WPLYAEQKMNA +LTEE+G+AVR + +    +VE
Sbjct: 362 SVGGFVSHCGWNSTMESLLNGVPLIAWPLYAEQKMNATMLTEELGLAVRPKMSTSSRIVE 421

Query: 424 RKEIEKKVRMIMEGKEGEGIRERVKELKISGGKAVTKGGSSYNSLARVASECDI 472
           RKE+E  VR IM  K+G+ IR+R KELK    KA++KGGSS  SL++VA E ++
Sbjct: 422 RKELEMVVRKIMVDKDGQEIRDRAKELKHIAQKALSKGGSSCTSLSQVAKEIEM 470

BLAST of CSPI06G30750 vs. TrEMBL
Match: E9M5E5_PUEML (Glycosyltransferase OS=Pueraria montana var. lobata PE=2 SV=1)

HSP 1 Score: 488.0 bits (1255), Expect = 1.3e-134
Identity = 250/476 (52.52%), Positives = 333/476 (69.96%), Query Frame = 1

Query: 6   SKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSSAESTLLQKPS---LV 65
           SK H ALL SPGMGHLIP +EL  RL+ HH L  T+FVV T S++  S +LQ+ S    +
Sbjct: 4   SKPHAALLASPGMGHLIPMVELGKRLLTHHGLHVTIFVVTTDSAATTSQILQQTSNLTSL 63

Query: 66  NIVSLP--HSLSSLDPNAPICDIIISMMTASFPFLRSSIAAVN--PRPAALIVDLFGTPA 125
           NI+ +P       L PN P+   I+  M  S PF+RSSI +    P P+ALIVD+FG  A
Sbjct: 64  NIIHVPPIDVSDKLPPNPPLAIRILLTMLESLPFVRSSILSTTNLPPPSALIVDMFGLAA 123

Query: 126 LSIAHELGMLGLVFMTTNAWYLSVSYLYPSFEKPMVDAHVYNHDPLVIPGCTPVRFEDTI 185
             +A +LGML  V+  T+AW+ +V+  +P+ +K ++++H  NH+PL++PGC  V FEDT+
Sbjct: 124 FPMARDLGMLIYVYFATSAWFSAVTLYFPAMDKKLIESHAENHEPLMVPGCEAVLFEDTL 183

Query: 186 EVFELNQEEVYVGFGRYARELGTADGILSNTWQDLEPTTLKALSEAGTLGNGKVNEVPIY 245
           E F     E+Y G+   A+E+ TADGIL NTWQDLEP   KA+ E G LG  +  + P++
Sbjct: 184 EPFLSPGGEMYEGYLTAAKEIVTADGILMNTWQDLEPAATKAVREDGILG--RFTKGPVH 243

Query: 246 PIGPLTRNCEPTLES---EVLKWLDRQPDESVIYVSFGSGGTLCEEQITELAWGLELSQQ 305
            +GPL R  E   E     VL+WLD QP +SVIYVSFGSGGT+ E+Q+ E+A GLELSQQ
Sbjct: 244 AVGPLVRTVETKPEDGKDAVLRWLDGQPADSVIYVSFGSGGTMSEDQMREVALGLELSQQ 303

Query: 306 RFVWVIRPPEGTESTGAFFTAGRGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQAEIL 365
           RFVWV+RPP   +++G+FF    G   D  A  YLPEGF+KRT+ VG+V+PMW PQAEIL
Sbjct: 304 RFVWVVRPPCEGDASGSFFDVANGGG-DVAALNYLPEGFVKRTEGVGVVVPMWAPQAEIL 363

Query: 366 SHRSVRGFVTHCGWNSSLESIVNGVAMVTWPLYAEQKMNAALLTEEMGVAVRLRAEGQGV 425
            H +  GFVTHCGWNS LES++NGV MV WPLYAEQKMNA +L+EE+GVAVR+  EG GV
Sbjct: 364 GHPATGGFVTHCGWNSVLESVLNGVPMVAWPLYAEQKMNAFMLSEELGVAVRVAEEGGGV 423

Query: 426 VERKEIEKKVRMIMEGKEGEGIRERVKELKISGGKAVTKGGSSYNSLARVASECDI 472
           V  +++ + VR +M  KEG G+R++VKELK+SG KA+TK GSS++SL  ++ +C++
Sbjct: 424 VRGEQVAELVRRVMVDKEGVGMRKKVKELKLSGEKALTKFGSSHHSLCEMSKDCEV 476

BLAST of CSPI06G30750 vs. TAIR10
Match: AT5G66690.1 (AT5G66690.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 436.8 bits (1122), Expect = 1.7e-122
Identity = 234/478 (48.95%), Positives = 312/478 (65.27%), Query Frame = 1

Query: 6   SKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSSAESTLLQKPSLVNIV 65
           +K H A+  SPGMGH+IP +EL  RL  ++    T+FV+ T ++SA+S  L     V+IV
Sbjct: 4   TKPHAAMFSSPGMGHVIPVIELGKRLSANNGFHVTVFVLETDAASAQSKFLNSTG-VDIV 63

Query: 66  SLPHS--LSSLDPNAPICDIIISMMTASFPFLRSSIAAVNPRPAALIVDLFGTPALSIAH 125
            LP       +DP+  +   I  +M A+ P LRS IAA++ +P ALIVDLFGT AL +A 
Sbjct: 64  KLPSPDIYGLVDPDDHVVTKIGVIMRAAVPALRSKIAAMHQKPTALIVDLFGTDALCLAK 123

Query: 126 ELGMLGLVFMTTNAWYLSVSYLYPSFEKPMVDAHVYNHDPLVIPGCTPVRFEDTIEVFEL 185
           E  ML  VF+ TNA +L VS  YP+ +K + + H    +PL IPGC PVRFEDT++ + +
Sbjct: 124 EFNMLSYVFIPTNARFLGVSIYYPNLDKDIKEEHTVQRNPLAIPGCEPVRFEDTLDAYLV 183

Query: 186 NQEEVYVGFGRYARELGTADGILSNTWQDLEPTTLKALSEAGTLGNGKVNEVPIYPIGPL 245
             E VY  F R+      ADGIL NTW+++EP +LK+L     L  G+V  VP+YPIGPL
Sbjct: 184 PDEPVYRDFVRHGLAYPKADGILVNTWEEMEPKSLKSLLNPKLL--GRVARVPVYPIGPL 243

Query: 246 TRNCEPTLESE----VLKWLDRQPDESVIYVSFGSGGTLCEEQITELAWGLELSQQRFVW 305
              C P   SE    VL WL+ QP+ESV+Y+SFGSGG L  +Q+TELAWGLE SQQRFVW
Sbjct: 244 ---CRPIQSSETDHPVLDWLNEQPNESVLYISFGSGGCLSAKQLTELAWGLEQSQQRFVW 303

Query: 306 VIRPPEGTESTGAFFTAGRGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQAEILSHRS 365
           V+RPP        + +A  G + D    +YLPEGF+ RT + G V+P W PQAEILSHR+
Sbjct: 304 VVRPPVDGSCCSEYVSANGGGTEDN-TPEYLPEGFVSRTSDRGFVVPSWAPQAEILSHRA 363

Query: 366 VRGFVTHCGWNSSLESIVNGVAMVTWPLYAEQKMNAALLTEEMGVAVRLRAEGQGVVERK 425
           V GF+THCGW+S+LES+V GV M+ WPL+AEQ MNAALL++E+G+AVRL  + +  + R 
Sbjct: 364 VGGFLTHCGWSSTLESVVGGVPMIAWPLFAEQNMNAALLSDELGIAVRL-DDPKEDISRW 423

Query: 426 EIEKKVRMIMEGKEGEGIRERVKELKISG--GKAVTKGGSSYNSLARVASECDIFRRR 476
           +IE  VR +M  KEGE +R +VK+L+ S     ++  GG ++ SL RV  EC  F  R
Sbjct: 424 KIEALVRKVMTEKEGEAMRRKVKKLRDSAEMSLSIDGGGLAHESLCRVTKECQRFLER 473

BLAST of CSPI06G30750 vs. TAIR10
Match: AT3G50740.1 (AT3G50740.1 UDP-glucosyl transferase 72E1)

HSP 1 Score: 436.0 bits (1120), Expect = 2.9e-122
Identity = 226/474 (47.68%), Positives = 317/474 (66.88%), Query Frame = 1

Query: 6   SKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSSAESTLLQKP----SL 65
           +K HVA+  SPGMGH+IP +EL  RL   H    T+FV+ T ++SA+S  L  P    +L
Sbjct: 4   TKPHVAMFASPGMGHIIPVIELGKRLAGSHGFDVTIFVLETDAASAQSQFLNSPGCDAAL 63

Query: 66  VNIVSLPH-SLSSL-DPNAPICDIIISMMTASFPFLRSSIAAVNPRPAALIVDLFGTPAL 125
           V+IV LP   +S L DP+A     ++ MM  + P +RS I  +  +P ALIVDLFG  A+
Sbjct: 64  VDIVGLPTPDISGLVDPSAFFGIKLLVMMRETIPTIRSKIEEMQHKPTALIVDLFGLDAI 123

Query: 126 SIAHELGMLGLVFMTTNAWYLSVSYLYPSFEKPMVDAHVYNHDPLVIPGCTPVRFEDTIE 185
            +  E  ML  +F+ +NA +L+V+  +P+ +K M + H+    P+V+PGC PVRFEDT+E
Sbjct: 124 PLGGEFNMLTYIFIASNARFLAVALFFPTLDKDMEEEHIIKKQPMVMPGCEPVRFEDTLE 183

Query: 186 VFELNQEEVYVGFGRYARELGTADGILSNTWQDLEPTTLKALSEAGTLGNGKVNEVPIYP 245
            F     ++Y  F  +     T DGI+ NTW D+EP TLK+L +   L  G++  VP+YP
Sbjct: 184 TFLDPNSQLYREFVPFGSVFPTCDGIIVNTWDDMEPKTLKSLQDPKLL--GRIAGVPVYP 243

Query: 246 IGPLTRNCEPTLESE-VLKWLDRQPDESVIYVSFGSGGTLCEEQITELAWGLELSQQRFV 305
           IGPL+R  +P+  +  VL WL++QPDESV+Y+SFGSGG+L  +Q+TELAWGLE+SQQRFV
Sbjct: 244 IGPLSRPVDPSKTNHPVLDWLNKQPDESVLYISFGSGGSLSAKQLTELAWGLEMSQQRFV 303

Query: 306 WVIRPPEGTESTGAFFTAGRGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQAEILSHR 365
           WV+RPP    +  A+ +A  G  RD     YLPEGF+ RT E G ++  W PQAEIL+H+
Sbjct: 304 WVVRPPVDGSACSAYLSANSGKIRD-GTPDYLPEGFVSRTHERGFMVSSWAPQAEILAHQ 363

Query: 366 SVRGFVTHCGWNSSLESIVNGVAMVTWPLYAEQKMNAALLTEEMGVAVR-LRAEGQGVVE 425
           +V GF+THCGWNS LES+V GV M+ WPL+AEQ MNA LL EE+GVAVR  +   +GV+ 
Sbjct: 364 AVGGFLTHCGWNSILESVVGGVPMIAWPLFAEQMMNATLLNEELGVAVRSKKLPSEGVIT 423

Query: 426 RKEIEKKVRMIMEGKEGEGIRERVKELKISGGKAVT-KGGSSYNSLARVASECD 471
           R EIE  VR IM  +EG  +R+++K+LK +  ++++  GG ++ SL+R+A E +
Sbjct: 424 RAEIEALVRKIMVEEEGAEMRKKIKKLKETAAESLSCDGGVAHESLSRIADESE 474

BLAST of CSPI06G30750 vs. TAIR10
Match: AT5G26310.1 (AT5G26310.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 428.7 bits (1101), Expect = 4.7e-120
Identity = 224/472 (47.46%), Positives = 316/472 (66.95%), Query Frame = 1

Query: 6   SKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSSAESTLLQKPSLVNIV 65
           +K H A+  SPGMGH++P +ELA RL  +H    T+FV+ T ++S +S LL     V+IV
Sbjct: 4   TKPHAAMFSSPGMGHVLPVIELAKRLSANHGFHVTVFVLETDAASVQSKLLNSTG-VDIV 63

Query: 66  SLPH-SLSSL-DPNAPICDIIISMMTASFPFLRSSIAAVNPRPAALIVDLFGTPALSIAH 125
           +LP   +S L DPNA +   I  +M  + P LRS I A++  P ALI+DLFGT AL +A 
Sbjct: 64  NLPSPDISGLVDPNAHVVTKIGVIMREAVPTLRSKIVAMHQNPTALIIDLFGTDALCLAA 123

Query: 126 ELGMLGLVFMTTNAWYLSVSYLYPSFEKPMVDAHVYNHDPLVIPGCTPVRFEDTIEVFEL 185
           EL ML  VF+ +NA YL VS  YP+ ++ + + H     PL IPGC PVRFED ++ + +
Sbjct: 124 ELNMLTYVFIASNARYLGVSIYYPTLDEVIKEEHTVQRKPLTIPGCEPVRFEDIMDAYLV 183

Query: 186 NQEEVYVGFGRYARELGTADGILSNTWQDLEPTTLKALSEAGTLGNGKVNEVPIYPIGPL 245
             E VY    R+      ADGIL NTW+++EP +LK+L +   L  G+V  VP+YP+GPL
Sbjct: 184 PDEPVYHDLVRHCLAYPKADGILVNTWEEMEPKSLKSLQDPKLL--GRVARVPVYPVGPL 243

Query: 246 TRNCE-PTLESEVLKWLDRQPDESVIYVSFGSGGTLCEEQITELAWGLELSQQRFVWVIR 305
            R  +  T +  V  WL++QP+ESV+Y+SFGSGG+L  +Q+TELAWGLE SQQRF+WV+R
Sbjct: 244 CRPIQSSTTDHPVFDWLNKQPNESVLYISFGSGGSLTAQQLTELAWGLEESQQRFIWVVR 303

Query: 306 PPEGTESTGAFFTAGRGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQAEILSHRSVRG 365
           PP    S   +F+A  G ++D    +YLPEGF+ RT + G +IP W PQAEIL+H++V G
Sbjct: 304 PPVDGSSCSDYFSAKGGVTKDN-TPEYLPEGFVTRTCDRGFMIPSWAPQAEILAHQAVGG 363

Query: 366 FVTHCGWNSSLESIVNGVAMVTWPLYAEQKMNAALLTEEMGVAVRLRAEGQGVVERKEIE 425
           F+THCGW+S+LES++ GV M+ WPL+AEQ MNAALL++E+G++VR+  + +  + R +IE
Sbjct: 364 FLTHCGWSSTLESVLCGVPMIAWPLFAEQNMNAALLSDELGISVRV-DDPKEAISRSKIE 423

Query: 426 KKVRMIMEGKEGEGIRERVKELKISG--GKAVTKGGSSYNSLARVASECDIF 473
             VR +M   EGE +R +VK+L+ +     ++  GGS++ SL RV  EC  F
Sbjct: 424 AMVRKVMAEDEGEEMRRKVKKLRDTAEMSLSIHGGGSAHESLCRVTKECQRF 470

BLAST of CSPI06G30750 vs. TAIR10
Match: AT4G01070.1 (AT4G01070.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 350.9 bits (899), Expect = 1.2e-96
Identity = 204/474 (43.04%), Positives = 288/474 (60.76%), Query Frame = 1

Query: 4   QESKT-HVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGS-SSAESTLLQK-PS 63
           +ESKT HVA++ SPGMGHLIP +E A RLV  H L  T  + G G  S A+ T+L   PS
Sbjct: 2   EESKTPHVAIIPSPGMGHLIPLVEFAKRLVHLHGLTVTFVIAGEGPPSKAQRTVLDSLPS 61

Query: 64  LVNIVSLPH-SLSSLDPNAPICDIIISMMTASFPFLRS---SIAAVNPRPAALIVDLFGT 123
            ++ V LP   L+ L  +  I   I   +T S P LR    S       P AL+VDLFGT
Sbjct: 62  SISSVFLPPVDLTDLSSSTRIESRISLTVTRSNPELRKVFDSFVEGGRLPTALVVDLFGT 121

Query: 124 PALSIAHELGMLGLVFMTTNAWYLSVSYLYPSFEKPMVDAHVYNHDPLVIPGCTPVRFED 183
            A  +A E  +   +F  T A  LS     P  ++ +        +PL++PGC PV  +D
Sbjct: 122 DAFDVAVEFHVPPYIFYPTTANVLSFFLHLPKLDETVSCEFRELTEPLMLPGCVPVAGKD 181

Query: 184 TIEVFELNQEEVYVGFGRYARELGTADGILSNTWQDLEPTTLKALSEAGTLGNGKVNEVP 243
            ++  +  +++ Y       +    A+GIL NT+ +LEP  +KAL E G      +++ P
Sbjct: 182 FLDPAQDRKDDAYKWLLHNTKRYKEAEGILVNTFFELEPNAIKALQEPG------LDKPP 241

Query: 244 IYPIGPLT----RNCEPTLESEVLKWLDRQPDESVIYVSFGSGGTLCEEQITELAWGLEL 303
           +YP+GPL     +  + T ESE LKWLD QP  SV+YVSFGSGGTL  EQ+ ELA GL  
Sbjct: 242 VYPVGPLVNIGKQEAKQTEESECLKWLDNQPLGSVLYVSFGSGGTLTCEQLNELALGLAD 301

Query: 304 SQQRFVWVIRPPEGTESTGAFFTAGRGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQA 363
           S+QRF+WVIR P G  ++  F +  +     +     LP GF++RTK+ G VIP W PQA
Sbjct: 302 SEQRFLWVIRSPSGIANSSYFDSHSQTDPLTF-----LPPGFLERTKKRGFVIPFWAPQA 361

Query: 364 EILSHRSVRGFVTHCGWNSSLESIVNGVAMVTWPLYAEQKMNAALLTEEMGVAVRLRAEG 423
           ++L+H S  GF+THCGWNS+LES+V+G+ ++ WPLYAEQKMNA LL+E++  A+R RA  
Sbjct: 362 QVLAHPSTGGFLTHCGWNSTLESVVSGIPLIAWPLYAEQKMNAVLLSEDIRAALRPRAGD 421

Query: 424 QGVVERKEIEKKVRMIMEGKEGEGIRERVKELKISGGKAVTKGGSSYNSLARVA 467
            G+V R+E+ + V+ +MEG+EG+G+R ++KELK +  + +   G+S  +L+ VA
Sbjct: 422 DGLVRREEVARVVKGLMEGEEGKGVRNKMKELKEAACRVLKDDGTSTKALSLVA 464

BLAST of CSPI06G30750 vs. TAIR10
Match: AT4G36770.1 (AT4G36770.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 350.9 bits (899), Expect = 1.2e-96
Identity = 191/454 (42.07%), Positives = 281/454 (61.89%), Query Frame = 1

Query: 9   HVALLVSPGMGHLIPFLELANRLVLHHNL-QATLFVVGTGSSSAES----TLLQKPSLVN 68
           H AL+ SPGMGH +P LEL   L+ HH   + T+F+V    S ++S    TL+++     
Sbjct: 4   HGALVASPGMGHAVPILELGKHLLNHHGFDRVTVFLVTDDVSRSKSLIGKTLMEEDPKFV 63

Query: 69  IVSLPHSLSSLDPNAPICDIIISMMTASFPFLRSSIAAVNPRPAALIVDLFGTPALSIAH 128
           I  +P  +S  D +  +   +  MM  + P ++SS+  + PRP   +VDL GT AL +A 
Sbjct: 64  IRFIPLDVSGQDLSGSLLTKLAEMMRKALPEIKSSVMELEPRPRVFVVDLLGTEALEVAK 123

Query: 129 ELGMLGL-VFMTTNAWYLSVSYLYPSFEKPMVDAHVYNHDPLVIPGCTPVRFEDTIE--- 188
           ELG++   V +TT+AW+L+ +    S +K  +   + +   L+IPGC+PV+FE   +   
Sbjct: 124 ELGIMRKHVLVTTSAWFLAFTVYMASLDKQELYKQLSSIGALLIPGCSPVKFERAQDPRK 183

Query: 189 -VFELNQEEVYVGFGRYARELGTADGILSNTWQDLEPTTLKALSEAGTLGNGKVNEVPIY 248
            + EL + +      R   E+ TADG+  NTW  LE  T+ +  +   LG   +  VP+Y
Sbjct: 184 YIRELAESQ------RIGDEVITADGVFVNTWHSLEQVTIGSFLDPENLGR-VMRGVPVY 243

Query: 249 PIGPLTRNCEPTLESEVLKWLDRQPDESVIYVSFGSGGTLCEEQITELAWGLELSQQRFV 308
           P+GPL R  EP L+  VL WLD QP ESV+YVSFGSGG L  EQ  ELA+GLEL+  RFV
Sbjct: 244 PVGPLVRPAEPGLKHGVLDWLDLQPKESVVYVSFGSGGALTFEQTNELAYGLELTGHRFV 303

Query: 309 WVIRPPEGTESTGAFFTAGRGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQAEILSHR 368
           WV+RPP   + + + F   +  +       +LP GF+ RTK++GLV+  W PQ EIL+H+
Sbjct: 304 WVVRPPAEDDPSASMFDKTKNETEPL---DFLPNGFLDRTKDIGLVVRTWAPQEEILAHK 363

Query: 369 SVRGFVTHCGWNSSLESIVNGVAMVTWPLYAEQKMNAALLTEEMGVAVRLRAEGQGVVER 428
           S  GFVTHCGWNS LESIVNGV MV WPLY+EQKMNA +++ E+ +A+++     G+V++
Sbjct: 364 STGGFVTHCGWNSVLESIVNGVPMVAWPLYSEQKMNARMVSGELKIALQINV-ADGIVKK 423

Query: 429 KEIEKKVRMIMEGKEGEGIRERVKELKISGGKAV 453
           + I + V+ +M+ +EG+ +R+ VKELK +  +A+
Sbjct: 424 EVIAEMVKRVMDEEEGKEMRKNVKELKKTAEEAL 446

BLAST of CSPI06G30750 vs. NCBI nr
Match: gi|700193601|gb|KGN48805.1| (hypothetical protein Csa_6G501940 [Cucumis sativus])

HSP 1 Score: 953.4 bits (2463), Expect = 1.6e-274
Identity = 477/480 (99.38%), Positives = 477/480 (99.38%), Query Frame = 1

Query: 1   MPGQESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSSAESTLLQKPS 60
           MPGQESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSSAESTLLQKPS
Sbjct: 1   MPGQESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSSAESTLLQKPS 60

Query: 61  LVNIVSLPHSLSSLDPNAPICDIIISMMTASFPFLRSSIAAVNPRPAALIVDLFGTPALS 120
           LVNIVSLPHSLSSLDPNAPICDIIISMMTASFPFLRSSIAAVNPRPAALIVDLFGTPALS
Sbjct: 61  LVNIVSLPHSLSSLDPNAPICDIIISMMTASFPFLRSSIAAVNPRPAALIVDLFGTPALS 120

Query: 121 IAHELGMLGLVFMTTNAWYLSVSYLYPSFEKPMVDAHVYNHDPLVIPGCTPVRFEDTIEV 180
           IAHELGMLGLVFMTTNAWYLSVSYLYPSFEK MVDAHVYNHDPLVIPGCTPVRFEDTIEV
Sbjct: 121 IAHELGMLGLVFMTTNAWYLSVSYLYPSFEKSMVDAHVYNHDPLVIPGCTPVRFEDTIEV 180

Query: 181 FELNQEEVYVGFGRYARELGTADGILSNTWQDLEPTTLKALSEAGTLGNGKVNEVPIYPI 240
           FELNQEEVYVGFGRYARELGTADGILSNTWQDLEPTTLKALSEAGTLG GKVNEVPIYPI
Sbjct: 181 FELNQEEVYVGFGRYARELGTADGILSNTWQDLEPTTLKALSEAGTLGYGKVNEVPIYPI 240

Query: 241 GPLTRNCEPTLESEVLKWLDRQPDESVIYVSFGSGGTLCEEQITELAWGLELSQQRFVWV 300
           GPLTRN EPTLESEVLKWLDRQPDESVIYVSFGSGGTLCEEQITELAWGLELSQQRFVWV
Sbjct: 241 GPLTRNGEPTLESEVLKWLDRQPDESVIYVSFGSGGTLCEEQITELAWGLELSQQRFVWV 300

Query: 301 IRPPEGTESTGAFFTAGRGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQAEILSHRSV 360
           IRPPEGTESTGAFFTAGRGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQAEILSHRSV
Sbjct: 301 IRPPEGTESTGAFFTAGRGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQAEILSHRSV 360

Query: 361 RGFVTHCGWNSSLESIVNGVAMVTWPLYAEQKMNAALLTEEMGVAVRLRAEGQGVVERKE 420
           RGFVTHCGWNSSLESIVNGVAMVTWPLYAEQKMNAALLTEEMGVAVRLRAEGQGVVERKE
Sbjct: 361 RGFVTHCGWNSSLESIVNGVAMVTWPLYAEQKMNAALLTEEMGVAVRLRAEGQGVVERKE 420

Query: 421 IEKKVRMIMEGKEGEGIRERVKELKISGGKAVTKGGSSYNSLARVASECDIFRRRRDGGY 480
           IEKKVRMIMEGKEGEGIRERVKELKISGGKAVTKGGSSYNSLARVASECDIFRRRRDGGY
Sbjct: 421 IEKKVRMIMEGKEGEGIRERVKELKISGGKAVTKGGSSYNSLARVASECDIFRRRRDGGY 480

BLAST of CSPI06G30750 vs. NCBI nr
Match: gi|778722457|ref|XP_004143577.2| (PREDICTED: UDP-glycosyltransferase 72E1, partial [Cucumis sativus])

HSP 1 Score: 919.1 bits (2374), Expect = 3.3e-264
Identity = 461/464 (99.35%), Positives = 461/464 (99.35%), Query Frame = 1

Query: 1   MPGQESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSSAESTLLQKPS 60
           MPGQESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSSAESTLLQKPS
Sbjct: 1   MPGQESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSSAESTLLQKPS 60

Query: 61  LVNIVSLPHSLSSLDPNAPICDIIISMMTASFPFLRSSIAAVNPRPAALIVDLFGTPALS 120
           LVNIVSLPHSLSSLDPNAPICDIIISMMTASFPFLRSSIAAVNPRPAALIVDLFGTPALS
Sbjct: 61  LVNIVSLPHSLSSLDPNAPICDIIISMMTASFPFLRSSIAAVNPRPAALIVDLFGTPALS 120

Query: 121 IAHELGMLGLVFMTTNAWYLSVSYLYPSFEKPMVDAHVYNHDPLVIPGCTPVRFEDTIEV 180
           IAHELGMLGLVFMTTNAWYLSVSYLYPSFEK MVDAHVYNHDPLVIPGCTPVRFEDTIEV
Sbjct: 121 IAHELGMLGLVFMTTNAWYLSVSYLYPSFEKSMVDAHVYNHDPLVIPGCTPVRFEDTIEV 180

Query: 181 FELNQEEVYVGFGRYARELGTADGILSNTWQDLEPTTLKALSEAGTLGNGKVNEVPIYPI 240
           FELNQEEVYVGFGRYARELGTADGILSNTWQDLEPTTLKALSEAGTLG GKVNEVPIYPI
Sbjct: 181 FELNQEEVYVGFGRYARELGTADGILSNTWQDLEPTTLKALSEAGTLGYGKVNEVPIYPI 240

Query: 241 GPLTRNCEPTLESEVLKWLDRQPDESVIYVSFGSGGTLCEEQITELAWGLELSQQRFVWV 300
           GPLTRN EPTLESEVLKWLDRQPDESVIYVSFGSGGTLCEEQITELAWGLELSQQRFVWV
Sbjct: 241 GPLTRNGEPTLESEVLKWLDRQPDESVIYVSFGSGGTLCEEQITELAWGLELSQQRFVWV 300

Query: 301 IRPPEGTESTGAFFTAGRGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQAEILSHRSV 360
           IRPPEGTESTGAFFTAGRGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQAEILSHRSV
Sbjct: 301 IRPPEGTESTGAFFTAGRGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQAEILSHRSV 360

Query: 361 RGFVTHCGWNSSLESIVNGVAMVTWPLYAEQKMNAALLTEEMGVAVRLRAEGQGVVERKE 420
           RGFVTHCGWNSSLESIVNGVAMVTWPLYAEQKMNAALLTEEMGVAVRLRAEGQGVVERKE
Sbjct: 361 RGFVTHCGWNSSLESIVNGVAMVTWPLYAEQKMNAALLTEEMGVAVRLRAEGQGVVERKE 420

Query: 421 IEKKVRMIMEGKEGEGIRERVKELKISGGKAVTKGGSSYNSLAR 465
           IEKKVRMIMEGKEGEGIRERVKELKISGGKAVTKGGSSYNSLAR
Sbjct: 421 IEKKVRMIMEGKEGEGIRERVKELKISGGKAVTKGGSSYNSLAR 464

BLAST of CSPI06G30750 vs. NCBI nr
Match: gi|659081042|ref|XP_008441118.1| (PREDICTED: UDP-glycosyltransferase 72E1 [Cucumis melo])

HSP 1 Score: 818.1 bits (2112), Expect = 7.8e-234
Identity = 411/462 (88.96%), Positives = 431/462 (93.29%), Query Frame = 1

Query: 18  MGHLIPFLELANRLVLHHNLQATLFVVGTGSSSAESTLLQKPSLVNIVSLPHSLSSLDPN 77
           MGHLIPFLELANRLVLHHNLQATLFVVGTGSSSAE  LLQKPS VNI+ LPH+ SSLD N
Sbjct: 1   MGHLIPFLELANRLVLHHNLQATLFVVGTGSSSAEFALLQKPSPVNIIPLPHASSSLDSN 60

Query: 78  APICDIIISMMTASFPFLRSSIAAVNPRPAALIVDLFGTPALSIAHELGMLGLVFMTTNA 137
           API +II SMMTASFPFLRSSIAA NPRPAALIVDLFGTPALSIAHELGMLG VFMTT+A
Sbjct: 61  APIFNIISSMMTASFPFLRSSIAAANPRPAALIVDLFGTPALSIAHELGMLGFVFMTTSA 120

Query: 138 WYLSVSYLYPSFEKPMVDAHVYNHDPLVIPGCTPVRFEDTIEVFELNQEEVYVGFGRYAR 197
           W+LS+   YPSF+K MVDAHV NHDPLVIPGCTPVRFE+TIEVFELNQ+EVYVGFG +A 
Sbjct: 121 WFLSLFVFYPSFDKSMVDAHVDNHDPLVIPGCTPVRFENTIEVFELNQKEVYVGFGSFAS 180

Query: 198 ELGTADGILSNTWQDLEPTTLKALSEAGTLGNGKVNEVPIYPIGPLTRNCEPTLESEVLK 257
           ELGTADGILSNTWQDLEPTTLKALSEAGTLG GKVNEVPI+PIGPLT N +PTLESEVLK
Sbjct: 181 ELGTADGILSNTWQDLEPTTLKALSEAGTLGKGKVNEVPIFPIGPLTSNGDPTLESEVLK 240

Query: 258 WLDRQPDESVIYVSFGSGGTLCEEQITELAWGLELSQQRFVWVIRPPEGTESTGAFFTAG 317
           WLDRQPDESVIYVSFGSGGTL EEQITELAWGLE+SQQRFVWVIRPP GTES G FFTAG
Sbjct: 241 WLDRQPDESVIYVSFGSGGTLREEQITELAWGLEMSQQRFVWVIRPPAGTESMGTFFTAG 300

Query: 318 RGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQAEILSHRSVRGFVTHCGWNSSLESIV 377
           RGSS D WAS++LPEGFIKRTKEVGLVIPMWGPQAEILSHRSVRGFVTHCGWNSSLESIV
Sbjct: 301 RGSSGDDWASEFLPEGFIKRTKEVGLVIPMWGPQAEILSHRSVRGFVTHCGWNSSLESIV 360

Query: 378 NGVAMVTWPLYAEQKMNAALLTEEMGVAVRLRAEGQGVVERKEIEKKVRMIMEGKEGEGI 437
           NGVAMVTWPLYAEQKMNAA+LTEE+GVAVR+RAEG G+V+RKEIE KVRMIMEGKEG GI
Sbjct: 361 NGVAMVTWPLYAEQKMNAAVLTEEVGVAVRVRAEGDGLVKRKEIENKVRMIMEGKEGGGI 420

Query: 438 RERVKELKISGGKAVTKGGSSYNSLARVASECDIFRRRRDGG 480
           RERVK LKISG KAVTKGGSSYNSLARVASECDIFRRRRDGG
Sbjct: 421 RERVKGLKISGEKAVTKGGSSYNSLARVASECDIFRRRRDGG 462

BLAST of CSPI06G30750 vs. NCBI nr
Match: gi|224094711|ref|XP_002310203.1| (UDP-glucoronosyl/UDP-glucosyl transferase family protein [Populus trichocarpa])

HSP 1 Score: 494.2 bits (1271), Expect = 2.6e-136
Identity = 257/473 (54.33%), Positives = 332/473 (70.19%), Query Frame = 1

Query: 4   QESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSSAESTLLQKPSLVN 63
           Q +K H ALL SPGMGHLIP LEL  RLV +H    TLFVV T +S+ +S L +    +N
Sbjct: 2   QNTKPHAALLASPGMGHLIPVLELGKRLVTYHGFHVTLFVVATDASTTQSRLKEPYPNIN 61

Query: 64  IVSLPH-SLSSL-DPNAPICDIIISMMTASFPFLRSSIAAVNPRPAALIVDLFGTPALSI 123
           I++LP   +S L DP A +   +  MM  + P LRS+I A+   P ALIVDLFGT A ++
Sbjct: 62  IITLPLVDISGLIDPAATVVTKLAVMMRETLPSLRSAILALKSPPTALIVDLFGTEAFAV 121

Query: 124 AHELGMLGLVFMTTNAWYLSVSYLYPSFEKPMVDAHVYNHDPLVIPGCTPVRFEDTIEVF 183
           A E  ML  VF T+NAW+ +++  +P+ ++ + D HV    PL IPGC  VRFEDT+  +
Sbjct: 122 AEEFNMLKYVFDTSNAWFFAITIYFPTIDRNLEDKHVIQKQPLRIPGCKSVRFEDTLGAY 181

Query: 184 ELNQEEVYVGFGRYARELGTADGILSNTWQDLEPTTLKALSEAGTLGNGKVNEVPIYPIG 243
               +++Y+ + R   E+  ADGIL NTW+DLEPTTL AL +   L  G+V + P+YPIG
Sbjct: 182 LDRNDQMYIEYKRIGIEMPMADGILMNTWEDLEPTTLGALRDFQML--GRVAKAPVYPIG 241

Query: 244 PLTRNCEPTL-ESEVLKWLDRQPDESVIYVSFGSGGTLCEEQITELAWGLELSQQRFVWV 303
           PL R   P++  ++VL WLD QP+ESVIYVSFGSGGTL  EQ+ ELAWGLELS+QRFVWV
Sbjct: 242 PLARPVGPSVPRNQVLNWLDNQPNESVIYVSFGSGGTLSTEQMAELAWGLELSKQRFVWV 301

Query: 304 IRPPEGTESTGAFFTAGRGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQAEILSHRSV 363
           +RPP   ++ GAFF    GS        +LPEGF+ RT+EVGLV+P+W PQ EIL+H SV
Sbjct: 302 VRPPIDNDAAGAFFNLDDGSE---GIPSFLPEGFLARTREVGLVVPLWAPQVEILAHPSV 361

Query: 364 RGFVTHCGWNSSLESIVNGVAMVTWPLYAEQKMNAALLTEEMGVAVRLRA-EGQGVVERK 423
            GF++HCGWNS+LESI NGV M+ WPLYAEQKMNA +LTEE+GVAV+ +    + VV R 
Sbjct: 362 GGFLSHCGWNSTLESITNGVPMIAWPLYAEQKMNATILTEELGVAVQPKTLASERVVVRA 421

Query: 424 EIEKKVRMIMEGKEGEGIRERVKELKISGGKAV-TKGGSSYNSLARVASECDI 472
           EIE  VR IME +EG GIR+RV ELK SG KA+ +KGGSSYNSL+++A +C++
Sbjct: 422 EIEMMVRKIMEDEEGFGIRKRVNELKHSGEKALSSKGGSSYNSLSQIAKQCEL 469

BLAST of CSPI06G30750 vs. NCBI nr
Match: gi|356540737|ref|XP_003538841.1| (PREDICTED: UDP-glycosyltransferase 72E1-like [Glycine max])

HSP 1 Score: 491.9 bits (1265), Expect = 1.3e-135
Identity = 255/479 (53.24%), Positives = 333/479 (69.52%), Query Frame = 1

Query: 6   SKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSSAESTLLQKPSLVNIV 65
           SK H AL+ SPGMGHLIP LEL  RL+ HH+   T+F+V T S++  S +LQ+ S +NIV
Sbjct: 4   SKAHAALVASPGMGHLIPMLELGKRLLTHHSFHVTIFIVTTDSATTTSHILQQTSNLNIV 63

Query: 66  SLPHSLSS--LDPNAPICDIIISMMTASFPFLRSSIAAVN-PRPAALIVDLFGTPALSIA 125
            +P    S  L PN P+   I+  M  S PFLRSSI + N P P+ALIVD+FG  A  IA
Sbjct: 64  LVPPIDVSHKLPPNPPLAARIMLTMIDSIPFLRSSILSTNLPPPSALIVDMFGLAAFPIA 123

Query: 126 HELGMLGLVFMTTNAWYLSVSYLYPSFEKPMVDAHVYNHDPLVIPGCTPVRFEDTIEVFE 185
            +LGML  V+  T+AW+ +VS   P+ +K M++ H  +H+PLVIPGC  VRFEDT+E F 
Sbjct: 124 RDLGMLTYVYFATSAWFSAVSVYVPAMDKKMIERHAEHHEPLVIPGCEAVRFEDTLEPFL 183

Query: 186 LNQEEVYVGFGRYARELGTADGILSNTWQDLEPTTLKALSEAGTLGNGKVNEVPIYPIGP 245
               E+Y G+   A+E+ TADGIL NTWQDLEP   KA+ E G L  G+  +  +YP+GP
Sbjct: 184 SPIGEMYEGYLAAAKEIVTADGILMNTWQDLEPAATKAVREDGIL--GRFTKGAVYPVGP 243

Query: 246 LTRNCEPTLESEVLKWLDRQPDESVIYVSFGSGGTLCEEQITELAWGLELSQQRFVWVIR 305
           L R  E   E  VL W+D QP E+V+YVSFGSGGT+ E Q+ E+A GLELSQQRFVWV+R
Sbjct: 244 LVRTVEKKAEDAVLSWMDVQPAETVVYVSFGSGGTMSEVQMREVALGLELSQQRFVWVVR 303

Query: 306 PPEGTESTGAFFTAGRGSSRDYWASKYLPEGFIKRTKEVGLVIPMWGPQAEILSHRSVRG 365
           PP   +++G+FF   +  S D     YLP+GF+KRT+ VG+V+PMW PQAEIL H +   
Sbjct: 304 PPCEGDTSGSFFEVSKNGSGDV-VLDYLPKGFVKRTEGVGVVVPMWAPQAEILGHPATGC 363

Query: 366 FVTHCGWNSSLESIVNGVAMVTWPLYAEQKMNAALLTEEMGVAVRLRAE-GQGVVERKEI 425
           FVTHCGWNS LES++NGV MV WPLYAEQKMNA +L+EE+GVAVR+  E G GVV R+EI
Sbjct: 364 FVTHCGWNSVLESVLNGVPMVAWPLYAEQKMNAFMLSEELGVAVRVAGEGGGGVVGREEI 423

Query: 426 EKKVRMIMEGKEGEGIRERVKELKISGGKAVTKGGSSYNSLARVASECDIFRRRRDGGY 481
            + VR +M  KEG G+R++VKELK+SG KA++K GSS++ L ++  +C +  +  +  Y
Sbjct: 424 AELVRRVMVDKEGVGMRKKVKELKVSGEKALSKFGSSHHWLCQMNKDCQVHAQASEADY 479

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
U72E2_ARATH  3.1e-121  48.95         UDP-glycosyltransferase 72E2 OS=Arabidopsis thaliana GN=UGT72E2 PE=1 SV=1
U72E1_ARATH  5.2e-121  47.68         UDP-glycosyltransferase 72E1 OS=Arabidopsis thaliana GN=UGT72E1 PE=1 SV=1
UFOG5_MANES  5.8e-120  48.72         Anthocyanidin 3-O-glucosyltransferase 5 OS=Manihot esculenta GN=GT5 PE=2 SV=1
U72E3_ARATH  8.3e-119  47.46         UDP-glycosyltransferase 72E3 OS=Arabidopsis thaliana GN=UGT72E3 PE=1 SV=1
U72B1_ARATH  2.2e-95   43.04         UDP-glycosyltransferase 72B1 OS=Arabidopsis thaliana GN=UGT72B1 PE=1 SV=1

Match Name        E-value   Identity (%)  Description
A0A0A0KGE4_CUCSA  1.1e-274  99.38         Uncharacterized protein OS=Cucumis sativus GN=Csa_6G501940 PE=4 SV=1
B9HEN9_POPTR      1.8e-136  54.33         Glycosyltransferase OS=Populus trichocarpa GN=POPTR_0007s12400g PE=3 SV=1
K7LND6_SOYBN      8.9e-136  53.24         Glycosyltransferase OS=Glycine max GN=GLYMA_11G064400 PE=3 SV=1
A0A061DGH3_THECC  4.4e-135  53.16         Glycosyltransferase OS=Theobroma cacao GN=TCM_000444 PE=3 SV=1
E9M5E5_PUEML      1.3e-134  52.52         Glycosyltransferase OS=Pueraria montana var. lobata PE=2 SV=1

Match Name   E-value   Identity (%)  Description
AT5G66690.1  1.7e-122  48.95         UDP-Glycosyltransferase superfamily protein
AT3G50740.1  2.9e-122  47.68         UDP-glucosyl transferase 72E1
AT5G26310.1  4.7e-120  47.46         UDP-Glycosyltransferase superfamily protein
AT4G01070.1  1.2e-96   43.04         UDP-Glycosyltransferase superfamily protein
AT4G36770.1  1.2e-96   42.07         UDP-Glycosyltransferase superfamily protein

Match Name                        E-value   Identity (%)  Description
gi|700193601|gb|KGN48805.1|       1.6e-274  99.38         hypothetical protein Csa_6G501940 [Cucumis sativus]
gi|778722457|ref|XP_004143577.2|  3.3e-264  99.35         PREDICTED: UDP-glycosyltransferase 72E1, partial [Cucumis sativus]
gi|659081042|ref|XP_008441118.1|  7.8e-234  88.96         PREDICTED: UDP-glycosyltransferase 72E1 [Cucumis melo]
gi|224094711|ref|XP_002310203.1|  2.6e-136  54.33         UDP-glucoronosyl/UDP-glucosyl transferase family protein [Populus trichocarpa]
gi|356540737|ref|XP_003538841.1|  1.3e-135  53.24         PREDICTED: UDP-glycosyltransferase 72E1-like [Glycine max]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR002213   UDP_glucos_trans
Vocabulary: Molecular Function
Term        Definition
GO:0016758  transferase activity, transferring hexosyl groups
Vocabulary: Biological Process
Term        Definition
GO:0008152  metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008152      metabolic process
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0043231      intracellular membrane-bounded organelle
molecular_function  GO:0016758      transferase activity, transferring hexosyl groups
molecular_function  GO:0080043      quercetin 3-O-glucosyltransferase activity
molecular_function  GO:0080044      quercetin 7-O-glucosyltransferase activity

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI06G30750.1  CSPI06G30750.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                           Source   Source Term         Source Description                              Alignment
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  PANTHER  PTHR11926           GLUCOSYL/GLUCURONOSYL TRANSFERASES              coord: 5..477; score: 5.6E
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  PFAM     PF00201             UDPGT                                           coord: 252..409; score: 1.0
None       No IPR available                          GENE3D   G3DSA:3.40.50.2000  -                                               coord: 250..431; score: 1.
None       No IPR available                          PANTHER  PTHR11926:SF223     UDP-GLYCOSYLTRANSFERASE 72E1-RELATED            coord: 5..477; score: 5.6E
None       No IPR available                          unknown  SSF53756            UDP-Glycosyltransferase/glycogen phosphorylase  coord: 8..465; score: 3.32E
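
The "coord" fields in the InterPro annotations above give the span of each hit along the protein. The sketch below shows one way to turn such a field into a span length and a fraction of the protein covered. It assumes the coordinates are 1-based and inclusive, and it uses a protein length of 480 residues, taken from the full-length alignment earlier on this page. The function name `domain_span` is hypothetical.

```python
import re

# Matches the "coord: start..end" field used in the InterPro annotation rows.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def domain_span(alignment, protein_length=480):
    """Return (start, end, length, fraction_of_protein) for a coord field.

    Assumptions: coordinates are 1-based and inclusive; protein_length=480
    comes from the full-length alignment shown earlier on this page.
    """
    match = COORD_RE.search(alignment)
    if match is None:
        raise ValueError("no 'coord: start..end' field found")
    start, end = int(match.group(1)), int(match.group(2))
    length = end - start + 1
    return start, end, length, length / protein_length


# Example: the PFAM UDPGT hit (PF00201) and the PANTHER family hit.
for field in ("coord: 252..409", "coord: 5..477"):
    start, end, length, fraction = domain_span(field)
    print(f"{field}: {length} residues, {fraction:.1%} of the protein")
```

Under these assumptions the PFAM UDPGT hit covers 158 residues in the C-terminal half of the protein, while the PANTHER family hit spans nearly its full length.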

The following gene(s) are paralogous to this gene:

None