CSPI06G19440 (gene) Cucumber (PI 183967) v1

Overview
NameCSPI06G19440
Typegene
OrganismCucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionGlycosyltransferase
LocationChr6: 17551663 .. 17554215 (+)
RNA-Seq ExpressionCSPI06G19440
SyntenyCSPI06G19440
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AATTCACTCTTTGGCGCCATTACTACTTTCGAAGCAGAGCGTCTCCATAGAAGAAGAAGAAGAGGAAGAAGAAGAAATAATGGAGATAGGCTCAGATCCACATATAATAGCATTTCCATTTCCATCACAAGGCCACATAAATCCTCAGCTTCAATTCGCAAAACGCTTAATCTCAAATGGAATCAAGCTAACATTGCTCACAACCTTACATGTTAGCCAACACTTGAAATTACAGGGCGATTATTCAAATTCCTTTAAGATTGAAGTCATTTCCGATGGTTCTGAGAATCGTCAAGAAACCGACACCATGAAACAAACTCTGGATCGATTCCAGCACAAGATGACCACAAACTTGCAAAATTACTTACACAAGGCCATGGATTCTTCCAATCCACCTCGATTTATTCTTTACGATTCAACAATGCCTTGGGTTCTGGATGTCGCTAAGGAGTTCGGAATCGCTAAGGCTCCGGTTTATACTCAATCTTGTGCCCTAAATAGTATAAATTATCATGTTCTTCATGGTCAATTGAAGCTTCCTCCTGAATCCTCAATTATTTCTCTGCCTTCTATGCCTCCGCTTTCCGCTAATGATCTTCCTGCCTACGATTACGATCCTGCTTCCGCTGATACCATCATCGAGTTTCTTACTAGTCAATATTCTAATATTGAAGATGCGGATCTGCTCTTTTGCAACACTTTTGACAAATTGGAAGGGGAGGTATGATTCTAGATCTTTCATTTTTCTATCTAATATTTCTCTATTATTATTATTATTTTAATATTTTTCATTATGAATTATTCCTCTGCTTTATGTATATATATATGAACGTGTTGTTTGGCTTAATCTATATGAAATCATGTATGTGATCACTGACTGAATCTGTTATTGAGTATTATGTGGTGTAAACAAGGATCAACTATCTCCACTTTTCACTTCATGGATAGAAATTGGTGGAATCTCATATCTGTGTTAAACAAACTGGAAAAATTCCATTTTTTTAAAAAATTTTGGTGAACATCTTGATTGTGTTGAATCTCTCTTATCAATATTAAATAAAGAAGGTTATCTTCTTTTTTGTCCTTTTTCAATATTTTATGAGCAATAATTAGGGATAACGTAAAAAAAATTAAATATTATTTATTTTGAAACGTATTTCTTATTAGCTTTTTTTTTTTTATATATATATAAATATTTGCATAGTATTATCCTTTACCATAATTTTTTTGTTGTTATTATTATTTAGAAAAAAAAAGATAAATTATAACACAATAATTTTTAATTGGGTGCAACATTTTTTTGTATCTAAAACTTTATTTGACTTTAGTATGCAGTAATTAAATACTAAATCAAAGAAAAATCTATCCAACCTAAAAAAAATGTAACAAAATTACATAATTAATAATTAGAAATGCTAATTTGTTTAAAAATCTATTTACTAATTTAAATTTATTTTTTGAATTTTTTAGATTTTTAGTATTTTTCTTAAATTATATCTGCATCAACATTTCTAAAATTAAGCACTCAATGTTTTGATTTGTATCCACGTTTTCTCAAATTGAAAACATGGAAATTAATATACTGCCTCAGCTTTCATTGATTTATATATATATATTTTTGAATGGTTAATAACCAAATGCAAGGAGGAAAAATGTTGTTTTTGTTATGATTTTTGATGTAGAAATATTAGAAGGAACAATAATAATGTTTGTTGATTTAGTGATTTACTTTAATGATATCAGATTATCAAATGGATGGAGAGCTGGGGACGGCCAGTGAAAGCCATAGGACCAACAATTCCATCAGCCTACTTAGACAAAAGAATAGAGAACGACAAATACTATGGACTTAGCCTATTCGATCCCAACCAAGATGACCATCTCATCAAATGGCTACAAACCAAGCCCCCATCTTCAGTCCTCTATGTTTCTTATGGAAGCATTGTCGAAATAAGCGAAGAACAGCTCAAAAACTTAGCGTTTGGAATCAAACAATCTGACAAATTCTTCCTGTGGGTCGTTAGAGAAACCGAAGCACGAAAACTTCCCCCAAACTTTATAGAAAGTGTTGGGGAGAAAGGGATTGTGGTCAGCTGGTGCTCGCAGCTCGACGTCTTGGCTCACCCGGCGATCGGTTGCTTCTTTACGCATTGCGGTTGGAACTCGACATTGGAGGCGCTGTGCTTGGGAGTCCCGGTTGTGGCATTCCCGCAGTGGGCGGATCAGGTGACTAATGCGAAGTTTATGGAAGACGTTTGGAAAGTTGGGAAGAGGGTTAAGGTGGATGAGAAGAGGATGGCTAGTGAAGAAGAGATAAGGAATTGTATTTGTGAAGTGATGGAAGAAGAGAGAGGTAGTGAGTTTAAGAAGAATTCATTGGAGTGGAAGCAATGGGCTAAAGAAGCTATGGAGGAAGGTGGGAGCTCTTATAATAATATTATGGAGTTTGTGTCTATGATTAAACAATCTTGATCTTTGTTATGATCATTTTTGGTGAGTAAGGTGGAGTAGGAAAGAAAAACACTTACAAGCTTACTTACTTTGATGTTATCG

mRNA sequence

AATTCACTCTTTGGCGCCATTACTACTTTCGAAGCAGAGCGTCTCCATAGAAGAAGAAGAAGAGGAAGAAGAAGAAATAATGGAGATAGGCTCAGATCCACATATAATAGCATTTCCATTTCCATCACAAGGCCACATAAATCCTCAGCTTCAATTCGCAAAACGCTTAATCTCAAATGGAATCAAGCTAACATTGCTCACAACCTTACATGTTAGCCAACACTTGAAATTACAGGGCGATTATTCAAATTCCTTTAAGATTGAAGTCATTTCCGATGGTTCTGAGAATCGTCAAGAAACCGACACCATGAAACAAACTCTGGATCGATTCCAGCACAAGATGACCACAAACTTGCAAAATTACTTACACAAGGCCATGGATTCTTCCAATCCACCTCGATTTATTCTTTACGATTCAACAATGCCTTGGGTTCTGGATGTCGCTAAGGAGTTCGGAATCGCTAAGGCTCCGGTTTATACTCAATCTTGTGCCCTAAATAGTATAAATTATCATGTTCTTCATGGTCAATTGAAGCTTCCTCCTGAATCCTCAATTATTTCTCTGCCTTCTATGCCTCCGCTTTCCGCTAATGATCTTCCTGCCTACGATTACGATCCTGCTTCCGCTGATACCATCATCGAGTTTCTTACTAGTCAATATTCTAATATTGAAGATGCGGATCTGCTCTTTTGCAACACTTTTGACAAATTGGAAGGGGAGATTATCAAATGGATGGAGAGCTGGGGACGGCCAGTGAAAGCCATAGGACCAACAATTCCATCAGCCTACTTAGACAAAAGAATAGAGAACGACAAATACTATGGACTTAGCCTATTCGATCCCAACCAAGATGACCATCTCATCAAATGGCTACAAACCAAGCCCCCATCTTCAGTCCTCTATGTTTCTTATGGAAGCATTGTCGAAATAAGCGAAGAACAGCTCAAAAACTTAGCGTTTGGAATCAAACAATCTGACAAATTCTTCCTGTGGGTCGTTAGAGAAACCGAAGCACGAAAACTTCCCCCAAACTTTATAGAAAGTGTTGGGGAGAAAGGGATTGTGGTCAGCTGGTGCTCGCAGCTCGACGTCTTGGCTCACCCGGCGATCGGTTGCTTCTTTACGCATTGCGGTTGGAACTCGACATTGGAGGCGCTGTGCTTGGGAGTCCCGGTTGTGGCATTCCCGCAGTGGGCGGATCAGGTGACTAATGCGAAGTTTATGGAAGACGTTTGGAAAGTTGGGAAGAGGGTTAAGGTGGATGAGAAGAGGATGGCTAGTGAAGAAGAGATAAGGAATTGTATTTGTGAAGTGATGGAAGAAGAGAGAGGTAGTGAGTTTAAGAAGAATTCATTGGAGTGGAAGCAATGGGCTAAAGAAGCTATGGAGGAAGGTGGGAGCTCTTATAATAATATTATGGAGTTTGTGTCTATGATTAAACAATCTTGATCTTTGTTATGATCATTTTTGGTGAGTAAGGTGGAGTAGGAAAGAAAAACACTTACAAGCTTACTTACTTTGATGTTATCG

Coding sequence (CDS)

ATGGAGATAGGCTCAGATCCACATATAATAGCATTTCCATTTCCATCACAAGGCCACATAAATCCTCAGCTTCAATTCGCAAAACGCTTAATCTCAAATGGAATCAAGCTAACATTGCTCACAACCTTACATGTTAGCCAACACTTGAAATTACAGGGCGATTATTCAAATTCCTTTAAGATTGAAGTCATTTCCGATGGTTCTGAGAATCGTCAAGAAACCGACACCATGAAACAAACTCTGGATCGATTCCAGCACAAGATGACCACAAACTTGCAAAATTACTTACACAAGGCCATGGATTCTTCCAATCCACCTCGATTTATTCTTTACGATTCAACAATGCCTTGGGTTCTGGATGTCGCTAAGGAGTTCGGAATCGCTAAGGCTCCGGTTTATACTCAATCTTGTGCCCTAAATAGTATAAATTATCATGTTCTTCATGGTCAATTGAAGCTTCCTCCTGAATCCTCAATTATTTCTCTGCCTTCTATGCCTCCGCTTTCCGCTAATGATCTTCCTGCCTACGATTACGATCCTGCTTCCGCTGATACCATCATCGAGTTTCTTACTAGTCAATATTCTAATATTGAAGATGCGGATCTGCTCTTTTGCAACACTTTTGACAAATTGGAAGGGGAGATTATCAAATGGATGGAGAGCTGGGGACGGCCAGTGAAAGCCATAGGACCAACAATTCCATCAGCCTACTTAGACAAAAGAATAGAGAACGACAAATACTATGGACTTAGCCTATTCGATCCCAACCAAGATGACCATCTCATCAAATGGCTACAAACCAAGCCCCCATCTTCAGTCCTCTATGTTTCTTATGGAAGCATTGTCGAAATAAGCGAAGAACAGCTCAAAAACTTAGCGTTTGGAATCAAACAATCTGACAAATTCTTCCTGTGGGTCGTTAGAGAAACCGAAGCACGAAAACTTCCCCCAAACTTTATAGAAAGTGTTGGGGAGAAAGGGATTGTGGTCAGCTGGTGCTCGCAGCTCGACGTCTTGGCTCACCCGGCGATCGGTTGCTTCTTTACGCATTGCGGTTGGAACTCGACATTGGAGGCGCTGTGCTTGGGAGTCCCGGTTGTGGCATTCCCGCAGTGGGCGGATCAGGTGACTAATGCGAAGTTTATGGAAGACGTTTGGAAAGTTGGGAAGAGGGTTAAGGTGGATGAGAAGAGGATGGCTAGTGAAGAAGAGATAAGGAATTGTATTTGTGAAGTGATGGAAGAAGAGAGAGGTAGTGAGTTTAAGAAGAATTCATTGGAGTGGAAGCAATGGGCTAAAGAAGCTATGGAGGAAGGTGGGAGCTCTTATAATAATATTATGGAGTTTGTGTCTATGATTAAACAATCTTGA

Protein sequence

MEIGSDPHIIAFPFPSQGHINPQLQFAKRLISNGIKLTLLTTLHVSQHLKLQGDYSNSFKIEVISDGSENRQETDTMKQTLDRFQHKMTTNLQNYLHKAMDSSNPPRFILYDSTMPWVLDVAKEFGIAKAPVYTQSCALNSINYHVLHGQLKLPPESSIISLPSMPPLSANDLPAYDYDPASADTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMESWGRPVKAIGPTIPSAYLDKRIENDKYYGLSLFDPNQDDHLIKWLQTKPPSSVLYVSYGSIVEISEEQLKNLAFGIKQSDKFFLWVVRETEARKLPPNFIESVGEKGIVVSWCSQLDVLAHPAIGCFFTHCGWNSTLEALCLGVPVVAFPQWADQVTNAKFMEDVWKVGKRVKVDEKRMASEEEIRNCICEVMEEERGSEFKKNSLEWKQWAKEAMEEGGSSYNNIMEFVSMIKQS*
Homology
BLAST of CSPI06G19440 vs. ExPASy Swiss-Prot
Match: K7NBW3 (Mogroside IE synthase OS=Siraitia grosvenorii OX=190515 GN=UGT74AC1 PE=1 SV=1)

HSP 1 Score: 743.4 bits (1918), Expect = 1.5e-213
Identity = 353/456 (77.41%), Postives = 409/456 (89.69%), Query Frame = 0

Query: 1   MEIGSDPHIIAFPFPSQGHINPQLQFAKRLISNGIKLTLLTTLHVSQHLKLQGDYSNSFK 60
           ME G D HI+ FPFPSQGHINP LQ +KRLI+ GIK++L+TTLHVS HL+LQG YSNS K
Sbjct: 1   MEKG-DTHILVFPFPSQGHINPLLQLSKRLIAKGIKVSLVTTLHVSNHLQLQGAYSNSVK 60

Query: 61  IEVISDGSENRQETDTMKQTLDRFQHKMTTNLQNYLHKAMDSSNPPRFILYDSTMPWVLD 120
           IEVISDGSE+R ETDTM+QTLDRF+ KMT NL+++L KAM SSNPP+FILYDSTMPWVL+
Sbjct: 61  IEVISDGSEDRLETDTMRQTLDRFRQKMTKNLEDFLQKAMVSSNPPKFILYDSTMPWVLE 120

Query: 121 VAKEFGIAKAPVYTQSCALNSINYHVLHGQLKLPPESSIISLPSMPPLSANDLPAYDYDP 180
           VAKEFG+ +AP YTQSCALNSINYHVLHGQLKLPPE+  ISLPSMP L  +DLPAYD+DP
Sbjct: 121 VAKEFGLDRAPFYTQSCALNSINYHVLHGQLKLPPETPTISLPSMPLLRPSDLPAYDFDP 180

Query: 181 ASADTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMESWGRPVKAIGPTIPSAYLDK 240
           AS DTII+ LTSQYSNI+DA+LLFCNTFDKLEGEII+WME+ GRPVK +GPT+PSAYLDK
Sbjct: 181 ASTDTIIDLLTSQYSNIQDANLLFCNTFDKLEGEIIQWMETLGRPVKTVGPTVPSAYLDK 240

Query: 241 RIENDKYYGLSLFDPNQDDHLIKWLQTKPPSSVLYVSYGSIVEISEEQLKNLAFGIKQSD 300
           R+ENDK+YGLSLF PN +D  +KWL +KP  SVLYVSYGS+VE+ EEQLK LA GIK++ 
Sbjct: 241 RVENDKHYGLSLFKPN-EDVCLKWLDSKPSGSVLYVSYGSLVEMGEEQLKELALGIKETG 300

Query: 301 KFFLWVVRETEARKLPPNFIESVGEKGIVVSWCSQLDVLAHPAIGCFFTHCGWNSTLEAL 360
           KFFLWVVR+TEA KLPPNF+ESV EKG+VVSWCSQL+VLAHP++GCFFTHCGWNSTLEAL
Sbjct: 301 KFFLWVVRDTEAEKLPPNFVESVAEKGLVVSWCSQLEVLAHPSVGCFFTHCGWNSTLEAL 360

Query: 361 CLGVPVVAFPQWADQVTNAKFMEDVWKVGKRVKVDEKRMASEEEIRNCICEVMEEERGSE 420
           CLGVPVVAFPQWADQVTNAKF+EDVWKVGKRVK +E+R+AS+EE+R+CI EVME ER SE
Sbjct: 361 CLGVPVVAFPQWADQVTNAKFLEDVWKVGKRVKRNEQRLASKEEVRSCIWEVMEGERASE 420

Query: 421 FKKNSLEWKQWAKEAMEEGGSSYNNIMEFVSMIKQS 457
           FK NS+EWK+WAKEA++EGGSS  NI EFV+M+KQ+
Sbjct: 421 FKSNSMEWKKWAKEAVDEGGSSDKNIEEFVAMLKQT 454

BLAST of CSPI06G19440 vs. ExPASy Swiss-Prot
Match: Q9SYK9 (UDP-glycosyltransferase 74E2 OS=Arabidopsis thaliana OX=3702 GN=UGT74E2 PE=1 SV=1)

HSP 1 Score: 444.9 bits (1143), Expect = 1.1e-123
Identity = 228/455 (50.11%), Postives = 302/455 (66.37%), Query Frame = 0

Query: 8   HIIAFPFPSQGHINPQLQFAKRLISNGIKLTL-LTTLHVSQHLKLQGDYSNSFKIEVISD 67
           H+I  PFP QGHI P  QF KRL S G+KLTL L +   S   K + D    F I     
Sbjct: 6   HLIVLPFPGQGHITPMSQFCKRLASKGLKLTLVLVSDKPSPPYKTEHDSITVFPI----- 65

Query: 68  GSENRQETDTMKQTLDRFQHKMTTNLQNYLHKAMD----SSNPPRFILYDSTMPWVLDVA 127
            S   QE +   Q LD +  ++ T+++N L K ++    S NPPR I+YDSTMPW+LDVA
Sbjct: 66  -SNGFQEGEEPLQDLDDYMERVETSIKNTLPKLVEDMKLSGNPPRAIVYDSTMPWLLDVA 125

Query: 128 KEFGIAKAPVYTQSCALNSINYHVLHGQLKLPP----ESSIISLPSMPPLSANDLPAYDY 187
             +G++ A  +TQ   + +I YHV  G   +P      S++ S PS P L+ANDLP++  
Sbjct: 126 HSYGLSGAVFFTQPWLVTAIYYHVFKGSFSVPSTKYGHSTLASFPSFPMLTANDLPSFLC 185

Query: 188 DPASADTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMES-WGRPVKAIGPTIPSAY 247
           + +S   I+  +  Q SNI+  D++ CNTFDKLE +++KW++S W  PV  IGPT+PS Y
Sbjct: 186 ESSSYPNILRIVVDQLSNIDRVDIVLCNTFDKLEEKLLKWVQSLW--PVLNIGPTVPSMY 245

Query: 248 LDKRIENDKYYGLSLFDPNQDDHLIKWLQTKPPSSVLYVSYGSIVEISEEQLKNLAFGIK 307
           LDKR+  DK YG SLF+    +  ++WL +K P+SV+Y+S+GS+V + E+Q+  LA G+K
Sbjct: 246 LDKRLSEDKNYGFSLFNAKVAE-CMEWLNSKEPNSVVYLSFGSLVILKEDQMLELAAGLK 305

Query: 308 QSDKFFLWVVRETEARKLPPNFIESVGEKGIVVSWCSQLDVLAHPAIGCFFTHCGWNSTL 367
           QS +FFLWVVRETE  KLP N++E +GEKG++VSW  QLDVLAH +IGCF THCGWNSTL
Sbjct: 306 QSGRFFLWVVRETETHKLPRNYVEEIGEKGLIVSWSPQLDVLAHKSIGCFLTHCGWNSTL 365

Query: 368 EALCLGVPVVAFPQWADQVTNAKFMEDVWKVGKRVKVDEKRMASEEEIRNCICEVMEEER 427
           E L LGVP++  P W DQ TNAKFM+DVWKVG RVK +       EEI   + EVME E+
Sbjct: 366 EGLSLGVPMIGMPHWTDQPTNAKFMQDVWKVGVRVKAEGDGFVRREEIMRSVEEVMEGEK 425

Query: 428 GSEFKKNSLEWKQWAKEAMEEGGSSYNNIMEFVSM 453
           G E +KN+ +WK  A+EA+ EGGSS  +I EFVSM
Sbjct: 426 GKEIRKNAEKWKVLAQEAVSEGGSSDKSINEFVSM 451

BLAST of CSPI06G19440 vs. ExPASy Swiss-Prot
Match: P0C7P7 (UDP-glycosyltransferase 74E1 OS=Arabidopsis thaliana OX=3702 GN=UGT74E1 PE=3 SV=1)

HSP 1 Score: 439.9 bits (1130), Expect = 3.6e-122
Identity = 228/455 (50.11%), Postives = 304/455 (66.81%), Query Frame = 0

Query: 8   HIIAFPFPSQGHINPQLQFAKRLISNGIKLTL-LTTLHVSQHLKLQGDYSNSFKIEVISD 67
           H+I  PFP+QGHI P  QF KRL S  +K+TL L +   S   K + D   +  +  IS+
Sbjct: 6   HVIVLPFPAQGHITPMSQFCKRLASKSLKITLVLVSDKPSPPYKTEHD---TITVVPISN 65

Query: 68  GSENRQETDTMKQTLDRFQHKMTTNLQNYLHKAMD----SSNPPRFILYDSTMPWVLDVA 127
           G +  QE     + LD +  ++ ++++N L K ++    S NPPR ++YDSTMPW+LDVA
Sbjct: 66  GFQEGQE---RSEDLDEYMERVESSIKNRLPKLIEDMKLSGNPPRALVYDSTMPWLLDVA 125

Query: 128 KEFGIAKAPVYTQSCALNSINYHVLHGQLKLPP----ESSIISLPSMPPLSANDLPAYDY 187
             +G++ A  +TQ   +++I YHV  G   +P      S++ S PS+P L+ANDLP++  
Sbjct: 126 HSYGLSGAVFFTQPWLVSAIYYHVFKGSFSVPSTKYGHSTLASFPSLPILNANDLPSFLC 185

Query: 188 DPASADTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMES-WGRPVKAIGPTIPSAY 247
           + +S   I+  +  Q SNI+  D++ CNTFDKLE +++KW++S W  PV  IGPT+PS Y
Sbjct: 186 ESSSYPYILRTVIDQLSNIDRVDIVLCNTFDKLEEKLLKWIKSVW--PVLNIGPTVPSMY 245

Query: 248 LDKRIENDKYYGLSLFDPNQDDHLIKWLQTKPPSSVLYVSYGSIVEISEEQLKNLAFGIK 307
           LDKR+  DK YG SLF     +  ++WL +K PSSV+YVS+GS+V + ++QL  LA G+K
Sbjct: 246 LDKRLAEDKNYGFSLFGAKIAE-CMEWLNSKQPSSVVYVSFGSLVVLKKDQLIELAAGLK 305

Query: 308 QSDKFFLWVVRETEARKLPPNFIESVGEKGIVVSWCSQLDVLAHPAIGCFFTHCGWNSTL 367
           QS  FFLWVVRETE RKLP N+IE +GEKG+ VSW  QL+VL H +IGCF THCGWNSTL
Sbjct: 306 QSGHFFLWVVRETERRKLPENYIEEIGEKGLTVSWSPQLEVLTHKSIGCFVTHCGWNSTL 365

Query: 368 EALCLGVPVVAFPQWADQVTNAKFMEDVWKVGKRVKVDEKRMASEEEIRNCICEVMEEER 427
           E L LGVP++  P WADQ TNAKFMEDVWKVG RVK D       EE    + EVME E+
Sbjct: 366 EGLSLGVPMIGMPHWADQPTNAKFMEDVWKVGVRVKADSDGFVRREEFVRRVEEVMEAEQ 425

Query: 428 GSEFKKNSLEWKQWAKEAMEEGGSSYNNIMEFVSM 453
           G E +KN+ +WK  A+EA+ EGGSS  NI EFVSM
Sbjct: 426 GKEIRKNAEKWKVLAQEAVSEGGSSDKNINEFVSM 451

BLAST of CSPI06G19440 vs. ExPASy Swiss-Prot
Match: W8JMV4 (UDP glycosyltransferase 9 OS=Catharanthus roseus OX=4058 GN=UGT9 PE=2 SV=1)

HSP 1 Score: 401.7 bits (1031), Expect = 1.1e-110
Identity = 207/463 (44.71%), Postives = 299/463 (64.58%), Query Frame = 0

Query: 8   HIIAFPFPSQGHINPQLQFAKRLISNGIKLTLLTTLHVSQHLKLQGDYSNSFKIEVISDG 67
           HI+AFPFP++GHINP L    RL S G K+TL+TT+   + +K     +N   IE I DG
Sbjct: 14  HILAFPFPAKGHINPLLHLCNRLASKGFKITLITTVSTLKSVKT--SKANGIDIESIPDG 73

Query: 68  ---SENRQETDTMKQTLD----RFQHKMTTNLQNYLHKAMDSSNPPRFILYDSTMPWVLD 127
               +N Q    M+  ++    +F+     N    + K    + PP+ ++YDS+MPW+L+
Sbjct: 74  IPQEQNHQIITVMEMNMELYFKQFKASAIENTTKLIQKLKTKNPPPKVLIYDSSMPWILE 133

Query: 128 VAKEFGIAKAPVYTQSCALNSINYHVLHGQLKLPPESS---IISLPSMPPLSANDLPAYD 187
           VA E G+  A  +TQ C++++I YH+L G +KLP E+S   ++SLP +P L   DLP   
Sbjct: 134 VAHEQGLLGASFFTQPCSVSAIYYHMLQGTIKLPLENSENGMVSLPYLPLLEKKDLPGVQ 193

Query: 188 YDPASADTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMES-WGRPVKAIGPTIPSA 247
               +++ + E L  Q+SNI+D D +  NTFD LE E++ WM S W  P+  +GPT P++
Sbjct: 194 QFEDNSEALAELLADQFSNIDDVDYVLFNTFDALEIEVVNWMGSKW--PILTVGPTAPTS 253

Query: 248 --YLDKRIEN--DKYYGLSLFDPNQDDHLIKWLQTKPPSSVLYVSYGSIVEISEEQLKNL 307
              LDK+ +N  D      LF+ N  +  +KWL  +   +V+YVS+GS+  ++EEQ++ +
Sbjct: 254 MFLLDKKQKNYEDGRSINYLFETN-TEVCMKWLDQREIDTVIYVSFGSLASLTEEQMEQV 313

Query: 308 AFGIKQSDKFFLWVVRETEARKLPPNFIESVG-EKGIVVSWCSQLDVLAHPAIGCFFTHC 367
           +  + +S+ +FLWVVRE E  KLP +F E+   +KG+V++WC QLDVLAH ++ CF THC
Sbjct: 314 SQALIRSNCYFLWVVREEEENKLPKDFKETTSKKKGLVINWCPQLDVLAHKSVACFMTHC 373

Query: 368 GWNSTLEALCLGVPVVAFPQWADQVTNAKFMEDVWKVGKRV-KVDEKRMASEEEIRNCIC 427
           GWNSTLEALC GVP++  PQWADQ TNAK +E VWK+G  V K DE  +   E+I +CI 
Sbjct: 374 GWNSTLEALCSGVPMICMPQWADQTTNAKLIEHVWKIGVGVNKSDENGIVKREDIEDCIR 433

Query: 428 EVMEEERGSEFKKNSLEWKQWAKEAMEEGGSSYNNIMEFVSMI 454
           +V+E ERG E K+N+++WK+ AKEA+ EGGSSYNNI EF S +
Sbjct: 434 QVIESERGKELKRNAIKWKELAKEAVSEGGSSYNNIQEFSSSL 471

BLAST of CSPI06G19440 vs. ExPASy Swiss-Prot
Match: Q9SKC1 (UDP-glycosyltransferase 74C1 OS=Arabidopsis thaliana OX=3702 GN=UGT74C1 PE=2 SV=1)

HSP 1 Score: 401.4 bits (1030), Expect = 1.4e-110
Identity = 202/453 (44.59%), Postives = 282/453 (62.25%), Query Frame = 0

Query: 8   HIIAFPFPSQGHINPQLQFAKRLISNGIKLTLLTTLHVSQHLKLQGDYSNSFKIEVISDG 67
           H++ FP+P QGHINP +Q AKRL   GI  TL+      +      DY  S  +  I DG
Sbjct: 8   HVLFFPYPLQGHINPMIQLAKRLSKKGITSTLIIASKDHREPYTSDDY--SITVHTIHDG 67

Query: 68  SENRQETDTMKQTLDRFQHKMTTNLQNYLHKAMDSSNPPRFILYDSTMPWVLDVAKEFGI 127
               +        LDRF +  + +L +++  A  S NPP+ ++YD  MP+ LD+AK+  +
Sbjct: 68  FFPHEHPHAKFVDLDRFHNSTSRSLTDFISSAKLSDNPPKALIYDPFMPFALDIAKDLDL 127

Query: 128 AKAPVYTQSCALNSINYHVLHGQLKLPPE----SSIISLPSMPPLSANDLPAYDYDPASA 187
                +TQ    + + YH+  G   +P +     ++ S P  P LS +DLP++  +  S 
Sbjct: 128 YVVAYFTQPWLASLVYYHINEGTYDVPVDRHENPTLASFPGFPLLSQDDLPSFACEKGSY 187

Query: 188 DTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWM-ESWGRPVKAIGPTIPSAYLDKRI 247
             + EF+  Q+SN+  AD + CNTFD+LE +++KWM + W  PVK IGP +PS +LD R+
Sbjct: 188 PLLHEFVVRQFSNLLQADCILCNTFDQLEPKVVKWMNDQW--PVKNIGPVVPSKFLDNRL 247

Query: 248 ENDKYYGLSLFDPNQDDHLIKWLQTKPPSSVLYVSYGSIVEISEEQLKNLAFGIKQSDKF 307
             DK Y L       D+ ++KWL  +P  SV+YV++G++V +SE+Q+K +A  I Q+   
Sbjct: 248 PEDKDYELENSKTEPDESVLKWLGNRPAKSVVYVAFGTLVALSEKQMKEIAMAISQTGYH 307

Query: 308 FLWVVRETEARKLPPNFIESVGEK--GIVVSWCSQLDVLAHPAIGCFFTHCGWNSTLEAL 367
           FLW VRE+E  KLP  FIE   EK  G+V  W  QL+VLAH +IGCF +HCGWNSTLEAL
Sbjct: 308 FLWSVRESERSKLPSGFIEEAEEKDSGLVAKWVPQLEVLAHESIGCFVSHCGWNSTLEAL 367

Query: 368 CLGVPVVAFPQWADQVTNAKFMEDVWKVGKRVKVDEKRMASEEEIRNCICEVMEEERGSE 427
           CLGVP+V  PQW DQ TNAKF+EDVWK+G RV+ D + ++S+EEI  CI EVME ERG E
Sbjct: 368 CLGVPMVGVPQWTDQPTNAKFIEDVWKIGVRVRTDGEGLSSKEEIARCIVEVMEGERGKE 427

Query: 428 FKKNSLEWKQWAKEAMEEGGSSYNNIMEFVSMI 454
            +KN  + K  A+EA+ EGGSS   I EFV+++
Sbjct: 428 IRKNVEKLKVLAREAISEGGSSDKKIDEFVALL 456

BLAST of CSPI06G19440 vs. ExPASy TrEMBL
Match: A0A0A0KGW2 (Glycosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_6G366250 PE=3 SV=1)

HSP 1 Score: 936.8 bits (2420), Expect = 3.4e-269
Identity = 455/456 (99.78%), Postives = 456/456 (100.00%), Query Frame = 0

Query: 1   MEIGSDPHIIAFPFPSQGHINPQLQFAKRLISNGIKLTLLTTLHVSQHLKLQGDYSNSFK 60
           MEIGSDPHIIAFPFPSQGHINPQLQFAKRLIS+GIKLTLLTTLHVSQHLKLQGDYSNSFK
Sbjct: 7   MEIGSDPHIIAFPFPSQGHINPQLQFAKRLISHGIKLTLLTTLHVSQHLKLQGDYSNSFK 66

Query: 61  IEVISDGSENRQETDTMKQTLDRFQHKMTTNLQNYLHKAMDSSNPPRFILYDSTMPWVLD 120
           IEVISDGSENRQETDTMKQTLDRFQHKMTTNLQNYLHKAMDSSNPPRFILYDSTMPWVLD
Sbjct: 67  IEVISDGSENRQETDTMKQTLDRFQHKMTTNLQNYLHKAMDSSNPPRFILYDSTMPWVLD 126

Query: 121 VAKEFGIAKAPVYTQSCALNSINYHVLHGQLKLPPESSIISLPSMPPLSANDLPAYDYDP 180
           VAKEFGIAKAPVYTQSCALNSINYHVLHGQLKLPPESSIISLPSMPPLSANDLPAYDYDP
Sbjct: 127 VAKEFGIAKAPVYTQSCALNSINYHVLHGQLKLPPESSIISLPSMPPLSANDLPAYDYDP 186

Query: 181 ASADTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMESWGRPVKAIGPTIPSAYLDK 240
           ASADTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMESWGRPVKAIGPTIPSAYLDK
Sbjct: 187 ASADTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMESWGRPVKAIGPTIPSAYLDK 246

Query: 241 RIENDKYYGLSLFDPNQDDHLIKWLQTKPPSSVLYVSYGSIVEISEEQLKNLAFGIKQSD 300
           RIENDKYYGLSLFDPNQDDHLIKWLQTKPPSSVLYVSYGSIVEISEEQLKNLAFGIKQSD
Sbjct: 247 RIENDKYYGLSLFDPNQDDHLIKWLQTKPPSSVLYVSYGSIVEISEEQLKNLAFGIKQSD 306

Query: 301 KFFLWVVRETEARKLPPNFIESVGEKGIVVSWCSQLDVLAHPAIGCFFTHCGWNSTLEAL 360
           KFFLWVVRETEARKLPPNFIESVGEKGIVVSWCSQLDVLAHPAIGCFFTHCGWNSTLEAL
Sbjct: 307 KFFLWVVRETEARKLPPNFIESVGEKGIVVSWCSQLDVLAHPAIGCFFTHCGWNSTLEAL 366

Query: 361 CLGVPVVAFPQWADQVTNAKFMEDVWKVGKRVKVDEKRMASEEEIRNCICEVMEEERGSE 420
           CLGVPVVAFPQWADQVTNAKFMEDVWKVGKRVKVDEKRMASEEEIRNCICEVMEEERGSE
Sbjct: 367 CLGVPVVAFPQWADQVTNAKFMEDVWKVGKRVKVDEKRMASEEEIRNCICEVMEEERGSE 426

Query: 421 FKKNSLEWKQWAKEAMEEGGSSYNNIMEFVSMIKQS 457
           FKKNSLEWKQWAKEAMEEGGSSYNNIMEFVSMIKQS
Sbjct: 427 FKKNSLEWKQWAKEAMEEGGSSYNNIMEFVSMIKQS 462

BLAST of CSPI06G19440 vs. ExPASy TrEMBL
Match: A0A1S3BCD1 (Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103488489 PE=3 SV=1)

HSP 1 Score: 900.6 bits (2326), Expect = 2.7e-258
Identity = 436/456 (95.61%), Postives = 448/456 (98.25%), Query Frame = 0

Query: 1   MEIGSDPHIIAFPFPSQGHINPQLQFAKRLISNGIKLTLLTTLHVSQHLKLQGDYSNSFK 60
           MEIGSDPHI+AFPFPSQGHINPQLQ AKRLISNGIK+TLLTTLHVSQHLKLQGDYSNSFK
Sbjct: 1   MEIGSDPHILAFPFPSQGHINPQLQLAKRLISNGIKVTLLTTLHVSQHLKLQGDYSNSFK 60

Query: 61  IEVISDGSENRQETDTMKQTLDRFQHKMTTNLQNYLHKAMDSSNPPRFILYDSTMPWVLD 120
           IEVISDGSENRQETDTMKQTLDRFQHKMT NLQ+YL KAMDSSNPPRFILYDSTMPWVLD
Sbjct: 61  IEVISDGSENRQETDTMKQTLDRFQHKMTANLQHYLQKAMDSSNPPRFILYDSTMPWVLD 120

Query: 121 VAKEFGIAKAPVYTQSCALNSINYHVLHGQLKLPPESSIISLPSMPPLSANDLPAYDYDP 180
           VAKEFGIA+APVYTQSCALNSINYHVLHG+LKLPPESS ISLPSMP LSANDLPAYDYDP
Sbjct: 121 VAKEFGIARAPVYTQSCALNSINYHVLHGELKLPPESSTISLPSMPLLSANDLPAYDYDP 180

Query: 181 ASADTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMESWGRPVKAIGPTIPSAYLDK 240
           ASADTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMESWGRPVKAIGPTIPSAYLDK
Sbjct: 181 ASADTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMESWGRPVKAIGPTIPSAYLDK 240

Query: 241 RIENDKYYGLSLFDPNQDDHLIKWLQTKPPSSVLYVSYGSIVEISEEQLKNLAFGIKQSD 300
           RIENDKYYGLSLFDPNQDD+LIKWLQTKPPSSVLYVSYGSIVEISEEQ+KNLA GIKQSD
Sbjct: 241 RIENDKYYGLSLFDPNQDDNLIKWLQTKPPSSVLYVSYGSIVEISEEQIKNLALGIKQSD 300

Query: 301 KFFLWVVRETEARKLPPNFIESVGEKGIVVSWCSQLDVLAHPAIGCFFTHCGWNSTLEAL 360
           KFFLWVVRETEA+KLPPNFIESVGEKG+VVSWCSQLDVLAHPAIGCFFTHCGWNSTLEAL
Sbjct: 301 KFFLWVVRETEAKKLPPNFIESVGEKGLVVSWCSQLDVLAHPAIGCFFTHCGWNSTLEAL 360

Query: 361 CLGVPVVAFPQWADQVTNAKFMEDVWKVGKRVKVDEKRMASEEEIRNCICEVMEEERGSE 420
           CLGVPVVAFPQWADQVTNAKF+EDVWKVGKRVKVDEKRMASEEEIRNCICEVMEEERGSE
Sbjct: 361 CLGVPVVAFPQWADQVTNAKFLEDVWKVGKRVKVDEKRMASEEEIRNCICEVMEEERGSE 420

Query: 421 FKKNSLEWKQWAKEAMEEGGSSYNNIMEFVSMIKQS 457
           FKKNSLE K+WAKEAMEEGGSSY NIMEFV+MIKQS
Sbjct: 421 FKKNSLELKKWAKEAMEEGGSSYKNIMEFVAMIKQS 456

BLAST of CSPI06G19440 vs. ExPASy TrEMBL
Match: A0A5A7V9C9 (Glycosyltransferase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G00470 PE=3 SV=1)

HSP 1 Score: 900.6 bits (2326), Expect = 2.7e-258
Identity = 436/456 (95.61%), Postives = 448/456 (98.25%), Query Frame = 0

Query: 1   MEIGSDPHIIAFPFPSQGHINPQLQFAKRLISNGIKLTLLTTLHVSQHLKLQGDYSNSFK 60
           MEIGSDPHI+AFPFPSQGHINPQLQ AKRLISNGIK+TLLTTLHVSQHLKLQGDYSNSFK
Sbjct: 1   MEIGSDPHILAFPFPSQGHINPQLQLAKRLISNGIKVTLLTTLHVSQHLKLQGDYSNSFK 60

Query: 61  IEVISDGSENRQETDTMKQTLDRFQHKMTTNLQNYLHKAMDSSNPPRFILYDSTMPWVLD 120
           IEVISDGSENRQETDTMKQTLDRFQHKMT NLQ+YL KAMDSSNPPRFILYDSTMPWVLD
Sbjct: 61  IEVISDGSENRQETDTMKQTLDRFQHKMTANLQHYLQKAMDSSNPPRFILYDSTMPWVLD 120

Query: 121 VAKEFGIAKAPVYTQSCALNSINYHVLHGQLKLPPESSIISLPSMPPLSANDLPAYDYDP 180
           VAKEFGIA+APVYTQSCALNSINYHVLHG+LKLPPESS ISLPSMP LSANDLPAYDYDP
Sbjct: 121 VAKEFGIARAPVYTQSCALNSINYHVLHGELKLPPESSTISLPSMPLLSANDLPAYDYDP 180

Query: 181 ASADTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMESWGRPVKAIGPTIPSAYLDK 240
           ASADTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMESWGRPVKAIGPTIPSAYLDK
Sbjct: 181 ASADTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMESWGRPVKAIGPTIPSAYLDK 240

Query: 241 RIENDKYYGLSLFDPNQDDHLIKWLQTKPPSSVLYVSYGSIVEISEEQLKNLAFGIKQSD 300
           RIENDKYYGLSLFDPNQDD+LIKWLQTKPPSSVLYVSYGSIVEISEEQ+KNLA GIKQSD
Sbjct: 241 RIENDKYYGLSLFDPNQDDNLIKWLQTKPPSSVLYVSYGSIVEISEEQIKNLALGIKQSD 300

Query: 301 KFFLWVVRETEARKLPPNFIESVGEKGIVVSWCSQLDVLAHPAIGCFFTHCGWNSTLEAL 360
           KFFLWVVRETEA+KLPPNFIESVGEKG+VVSWCSQLDVLAHPAIGCFFTHCGWNSTLEAL
Sbjct: 301 KFFLWVVRETEAKKLPPNFIESVGEKGLVVSWCSQLDVLAHPAIGCFFTHCGWNSTLEAL 360

Query: 361 CLGVPVVAFPQWADQVTNAKFMEDVWKVGKRVKVDEKRMASEEEIRNCICEVMEEERGSE 420
           CLGVPVVAFPQWADQVTNAKF+EDVWKVGKRVKVDEKRMASEEEIRNCICEVMEEERGSE
Sbjct: 361 CLGVPVVAFPQWADQVTNAKFLEDVWKVGKRVKVDEKRMASEEEIRNCICEVMEEERGSE 420

Query: 421 FKKNSLEWKQWAKEAMEEGGSSYNNIMEFVSMIKQS 457
           FKKNSLE K+WAKEAMEEGGSSY NIMEFV+MIKQS
Sbjct: 421 FKKNSLELKKWAKEAMEEGGSSYKNIMEFVAMIKQS 456

BLAST of CSPI06G19440 vs. ExPASy TrEMBL
Match: A0A6J1KAL0 (Glycosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111492134 PE=3 SV=1)

HSP 1 Score: 780.0 bits (2013), Expect = 5.4e-222
Identity = 368/456 (80.70%), Postives = 417/456 (91.45%), Query Frame = 0

Query: 1   MEIGSDPHIIAFPFPSQGHINPQLQFAKRLISNGIKLTLLTTLHVSQHLKLQGDYSNSFK 60
           ME G D HIIAFPFPSQGHINPQLQF+KRLI+NGIK+TLLTTLHVS++LK QG Y++S +
Sbjct: 1   MEKGEDRHIIAFPFPSQGHINPQLQFSKRLIANGIKVTLLTTLHVSRNLKFQGAYADSVR 60

Query: 61  IEVISDGSENRQETDTMKQTLDRFQHKMTTNLQNYLHKAMDSSNPPRFILYDSTMPWVLD 120
           I VISDGSE+RQ+TDTM+QTLDRF+ KM+ NL+NYL + MDSSNPPRFILYDSTMPWVL+
Sbjct: 61  IRVISDGSEDRQDTDTMRQTLDRFREKMSKNLENYLREVMDSSNPPRFILYDSTMPWVLE 120

Query: 121 VAKEFGIAKAPVYTQSCALNSINYHVLHGQLKLPPESSIISLPSMPPLSANDLPAYDYDP 180
           VAKEFG+ +APVYTQSCALNSINYHVLHG LKLPP+S  ISLPSMP L  NDLPAYDYDP
Sbjct: 121 VAKEFGLPRAPVYTQSCALNSINYHVLHGHLKLPPDSPTISLPSMPLLCTNDLPAYDYDP 180

Query: 181 ASADTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMESWGRPVKAIGPTIPSAYLDK 240
           AS +TIIEFLTSQYSNI+DADLLFCNTF KLEGEIIKWMESWGRPVKAIGPT+PSAYLDK
Sbjct: 181 ASTETIIEFLTSQYSNIQDADLLFCNTFHKLEGEIIKWMESWGRPVKAIGPTLPSAYLDK 240

Query: 241 RIENDKYYGLSLFDPNQDDHLIKWLQTKPPSSVLYVSYGSIVEISEEQLKNLAFGIKQSD 300
           R+E+DKYYGLSLFDPN+D+  +KWL  KPP SVLYVSYGS+V + EEQLKN+A G K+S 
Sbjct: 241 RLEDDKYYGLSLFDPNKDE-CLKWLDNKPPGSVLYVSYGSLVVLGEEQLKNMALGFKESG 300

Query: 301 KFFLWVVRETEARKLPPNFIESVGEKGIVVSWCSQLDVLAHPAIGCFFTHCGWNSTLEAL 360
           KFFLWVVRETE++KLPPNF+ESVGEKG++VSWCSQL VLAHPA+GCF THCGWNSTLEAL
Sbjct: 301 KFFLWVVRETESQKLPPNFMESVGEKGLMVSWCSQLQVLAHPAVGCFLTHCGWNSTLEAL 360

Query: 361 CLGVPVVAFPQWADQVTNAKFMEDVWKVGKRVKVDEKRMASEEEIRNCICEVMEEERGSE 420
            LGVPVVAFPQWADQVTNAKF+EDVWKVGKRVKV+E+R+ASEEEIR+CICEVME ER +E
Sbjct: 361 SLGVPVVAFPQWADQVTNAKFLEDVWKVGKRVKVNEERLASEEEIRSCICEVMEGERANE 420

Query: 421 FKKNSLEWKQWAKEAMEEGGSSYNNIMEFVSMIKQS 457
           FK NS+EW +WAKEAM+EGGSS  +IMEFV+MIKQ+
Sbjct: 421 FKSNSMEWMKWAKEAMDEGGSSDKDIMEFVAMIKQA 455

BLAST of CSPI06G19440 vs. ExPASy TrEMBL
Match: A0A6J1HD85 (Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111462448 PE=3 SV=1)

HSP 1 Score: 779.6 bits (2012), Expect = 7.0e-222
Identity = 368/456 (80.70%), Postives = 418/456 (91.67%), Query Frame = 0

Query: 1   MEIGSDPHIIAFPFPSQGHINPQLQFAKRLISNGIKLTLLTTLHVSQHLKLQGDYSNSFK 60
           ME G D HIIAFPFPSQGHINPQLQF+KRLI+NGIK+TLLTTLHVS++LK QG YS+S K
Sbjct: 1   MEKGEDRHIIAFPFPSQGHINPQLQFSKRLIANGIKVTLLTTLHVSRNLKFQGAYSDSVK 60

Query: 61  IEVISDGSENRQETDTMKQTLDRFQHKMTTNLQNYLHKAMDSSNPPRFILYDSTMPWVLD 120
           I VISDGSE+RQ+TDTM+QTLDRF+ KMT NL+NYL + MDSSNPPRFILYDSTMPWVL+
Sbjct: 61  IRVISDGSEDRQDTDTMRQTLDRFREKMTKNLENYLREVMDSSNPPRFILYDSTMPWVLE 120

Query: 121 VAKEFGIAKAPVYTQSCALNSINYHVLHGQLKLPPESSIISLPSMPPLSANDLPAYDYDP 180
           VAKEFG+ +APVYTQSCALNSINYHVLHG LKLPP+S  ISLPSMP L  NDLPAYDYDP
Sbjct: 121 VAKEFGLPRAPVYTQSCALNSINYHVLHGYLKLPPDSPTISLPSMPLLCPNDLPAYDYDP 180

Query: 181 ASADTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMESWGRPVKAIGPTIPSAYLDK 240
           AS +TIIEFLTSQYSNI+DADLLFCNTF KLEGEIIKWMESWGRPVKAIGPT+PSAYLDK
Sbjct: 181 ASTETIIEFLTSQYSNIQDADLLFCNTFHKLEGEIIKWMESWGRPVKAIGPTLPSAYLDK 240

Query: 241 RIENDKYYGLSLFDPNQDDHLIKWLQTKPPSSVLYVSYGSIVEISEEQLKNLAFGIKQSD 300
           R+E+DKYYGLSLFDPN+D+  +KWL +KPP SVLYVS+GS+V + EEQLKN+A G+K+S 
Sbjct: 241 RLEDDKYYGLSLFDPNKDE-CLKWLDSKPPGSVLYVSFGSLVVLGEEQLKNIALGVKESG 300

Query: 301 KFFLWVVRETEARKLPPNFIESVGEKGIVVSWCSQLDVLAHPAIGCFFTHCGWNSTLEAL 360
           KFFLWVVRETE++KLPPNF+ESVGEKG++VSWCSQL VLAHPA+GCF THCGWNSTLEAL
Sbjct: 301 KFFLWVVRETESQKLPPNFMESVGEKGLMVSWCSQLQVLAHPAVGCFLTHCGWNSTLEAL 360

Query: 361 CLGVPVVAFPQWADQVTNAKFMEDVWKVGKRVKVDEKRMASEEEIRNCICEVMEEERGSE 420
            LGVPVVAFPQWADQVTNAKF+EDVWKVGKRVKV+E+R+ASEEEIR+CICEVME ER +E
Sbjct: 361 SLGVPVVAFPQWADQVTNAKFLEDVWKVGKRVKVNEERLASEEEIRSCICEVMEGERANE 420

Query: 421 FKKNSLEWKQWAKEAMEEGGSSYNNIMEFVSMIKQS 457
           FK NS+EW +WAKEAM+EGGSS  +IMEFV++I Q+
Sbjct: 421 FKSNSMEWMKWAKEAMDEGGSSDKDIMEFVAIINQA 455

BLAST of CSPI06G19440 vs. NCBI nr
Match: XP_031742553.1 (UDP-glycosyltransferase 74E2 [Cucumis sativus] >KGN47627.1 hypothetical protein Csa_018954 [Cucumis sativus])

HSP 1 Score: 936.8 bits (2420), Expect = 7.1e-269
Identity = 455/456 (99.78%), Postives = 456/456 (100.00%), Query Frame = 0

Query: 1   MEIGSDPHIIAFPFPSQGHINPQLQFAKRLISNGIKLTLLTTLHVSQHLKLQGDYSNSFK 60
           MEIGSDPHIIAFPFPSQGHINPQLQFAKRLIS+GIKLTLLTTLHVSQHLKLQGDYSNSFK
Sbjct: 7   MEIGSDPHIIAFPFPSQGHINPQLQFAKRLISHGIKLTLLTTLHVSQHLKLQGDYSNSFK 66

Query: 61  IEVISDGSENRQETDTMKQTLDRFQHKMTTNLQNYLHKAMDSSNPPRFILYDSTMPWVLD 120
           IEVISDGSENRQETDTMKQTLDRFQHKMTTNLQNYLHKAMDSSNPPRFILYDSTMPWVLD
Sbjct: 67  IEVISDGSENRQETDTMKQTLDRFQHKMTTNLQNYLHKAMDSSNPPRFILYDSTMPWVLD 126

Query: 121 VAKEFGIAKAPVYTQSCALNSINYHVLHGQLKLPPESSIISLPSMPPLSANDLPAYDYDP 180
           VAKEFGIAKAPVYTQSCALNSINYHVLHGQLKLPPESSIISLPSMPPLSANDLPAYDYDP
Sbjct: 127 VAKEFGIAKAPVYTQSCALNSINYHVLHGQLKLPPESSIISLPSMPPLSANDLPAYDYDP 186

Query: 181 ASADTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMESWGRPVKAIGPTIPSAYLDK 240
           ASADTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMESWGRPVKAIGPTIPSAYLDK
Sbjct: 187 ASADTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMESWGRPVKAIGPTIPSAYLDK 246

Query: 241 RIENDKYYGLSLFDPNQDDHLIKWLQTKPPSSVLYVSYGSIVEISEEQLKNLAFGIKQSD 300
           RIENDKYYGLSLFDPNQDDHLIKWLQTKPPSSVLYVSYGSIVEISEEQLKNLAFGIKQSD
Sbjct: 247 RIENDKYYGLSLFDPNQDDHLIKWLQTKPPSSVLYVSYGSIVEISEEQLKNLAFGIKQSD 306

Query: 301 KFFLWVVRETEARKLPPNFIESVGEKGIVVSWCSQLDVLAHPAIGCFFTHCGWNSTLEAL 360
           KFFLWVVRETEARKLPPNFIESVGEKGIVVSWCSQLDVLAHPAIGCFFTHCGWNSTLEAL
Sbjct: 307 KFFLWVVRETEARKLPPNFIESVGEKGIVVSWCSQLDVLAHPAIGCFFTHCGWNSTLEAL 366

Query: 361 CLGVPVVAFPQWADQVTNAKFMEDVWKVGKRVKVDEKRMASEEEIRNCICEVMEEERGSE 420
           CLGVPVVAFPQWADQVTNAKFMEDVWKVGKRVKVDEKRMASEEEIRNCICEVMEEERGSE
Sbjct: 367 CLGVPVVAFPQWADQVTNAKFMEDVWKVGKRVKVDEKRMASEEEIRNCICEVMEEERGSE 426

Query: 421 FKKNSLEWKQWAKEAMEEGGSSYNNIMEFVSMIKQS 457
           FKKNSLEWKQWAKEAMEEGGSSYNNIMEFVSMIKQS
Sbjct: 427 FKKNSLEWKQWAKEAMEEGGSSYNNIMEFVSMIKQS 462

BLAST of CSPI06G19440 vs. NCBI nr
Match: XP_008445485.1 (PREDICTED: UDP-glycosyltransferase 74E2-like [Cucumis melo] >XP_008445487.1 PREDICTED: UDP-glycosyltransferase 74E2-like [Cucumis melo] >KAA0064743.1 UDP-glycosyltransferase 74E2-like [Cucumis melo var. makuwa])

HSP 1 Score: 900.6 bits (2326), Expect = 5.6e-258
Identity = 436/456 (95.61%), Postives = 448/456 (98.25%), Query Frame = 0

Query: 1   MEIGSDPHIIAFPFPSQGHINPQLQFAKRLISNGIKLTLLTTLHVSQHLKLQGDYSNSFK 60
           MEIGSDPHI+AFPFPSQGHINPQLQ AKRLISNGIK+TLLTTLHVSQHLKLQGDYSNSFK
Sbjct: 1   MEIGSDPHILAFPFPSQGHINPQLQLAKRLISNGIKVTLLTTLHVSQHLKLQGDYSNSFK 60

Query: 61  IEVISDGSENRQETDTMKQTLDRFQHKMTTNLQNYLHKAMDSSNPPRFILYDSTMPWVLD 120
           IEVISDGSENRQETDTMKQTLDRFQHKMT NLQ+YL KAMDSSNPPRFILYDSTMPWVLD
Sbjct: 61  IEVISDGSENRQETDTMKQTLDRFQHKMTANLQHYLQKAMDSSNPPRFILYDSTMPWVLD 120

Query: 121 VAKEFGIAKAPVYTQSCALNSINYHVLHGQLKLPPESSIISLPSMPPLSANDLPAYDYDP 180
           VAKEFGIA+APVYTQSCALNSINYHVLHG+LKLPPESS ISLPSMP LSANDLPAYDYDP
Sbjct: 121 VAKEFGIARAPVYTQSCALNSINYHVLHGELKLPPESSTISLPSMPLLSANDLPAYDYDP 180

Query: 181 ASADTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMESWGRPVKAIGPTIPSAYLDK 240
           ASADTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMESWGRPVKAIGPTIPSAYLDK
Sbjct: 181 ASADTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMESWGRPVKAIGPTIPSAYLDK 240

Query: 241 RIENDKYYGLSLFDPNQDDHLIKWLQTKPPSSVLYVSYGSIVEISEEQLKNLAFGIKQSD 300
           RIENDKYYGLSLFDPNQDD+LIKWLQTKPPSSVLYVSYGSIVEISEEQ+KNLA GIKQSD
Sbjct: 241 RIENDKYYGLSLFDPNQDDNLIKWLQTKPPSSVLYVSYGSIVEISEEQIKNLALGIKQSD 300

Query: 301 KFFLWVVRETEARKLPPNFIESVGEKGIVVSWCSQLDVLAHPAIGCFFTHCGWNSTLEAL 360
           KFFLWVVRETEA+KLPPNFIESVGEKG+VVSWCSQLDVLAHPAIGCFFTHCGWNSTLEAL
Sbjct: 301 KFFLWVVRETEAKKLPPNFIESVGEKGLVVSWCSQLDVLAHPAIGCFFTHCGWNSTLEAL 360

Query: 361 CLGVPVVAFPQWADQVTNAKFMEDVWKVGKRVKVDEKRMASEEEIRNCICEVMEEERGSE 420
           CLGVPVVAFPQWADQVTNAKF+EDVWKVGKRVKVDEKRMASEEEIRNCICEVMEEERGSE
Sbjct: 361 CLGVPVVAFPQWADQVTNAKFLEDVWKVGKRVKVDEKRMASEEEIRNCICEVMEEERGSE 420

Query: 421 FKKNSLEWKQWAKEAMEEGGSSYNNIMEFVSMIKQS 457
           FKKNSLE K+WAKEAMEEGGSSY NIMEFV+MIKQS
Sbjct: 421 FKKNSLELKKWAKEAMEEGGSSYKNIMEFVAMIKQS 456

BLAST of CSPI06G19440 vs. NCBI nr
Match: XP_038886750.1 (mogroside IE synthase isoform X1 [Benincasa hispida])

HSP 1 Score: 828.6 bits (2139), Expect = 2.7e-236
Identity = 393/456 (86.18%), Postives = 431/456 (94.52%), Query Frame = 0

Query: 1   MEIGSDPHIIAFPFPSQGHINPQLQFAKRLISNGIKLTLLTTLHVSQHLKLQGDYSNSFK 60
           MEIG DPHII FPFPSQGHINPQLQFAKRLISNGIK+TLLTTLHVSQHLK+QGDYSN  K
Sbjct: 1   MEIGDDPHIIVFPFPSQGHINPQLQFAKRLISNGIKVTLLTTLHVSQHLKMQGDYSNFVK 60

Query: 61  IEVISDGSENRQETDTMKQTLDRFQHKMTTNLQNYLHKAMDSSNPPRFILYDSTMPWVLD 120
           IEVISDGSENRQETDTM+QTLDRF+HKMT NL NYL KAM+SSNPPRFILYDSTMPWVL+
Sbjct: 61  IEVISDGSENRQETDTMRQTLDRFRHKMTKNLHNYLQKAMNSSNPPRFILYDSTMPWVLE 120

Query: 121 VAKEFGIAKAPVYTQSCALNSINYHVLHGQLKLPPESSIISLPSMPPLSANDLPAYDYDP 180
           VAKEFG+A+AP+YTQSCALNSIN+HVLHGQLKLPPESS ISLPSMP LS NDLPAYDYDP
Sbjct: 121 VAKEFGLARAPLYTQSCALNSINHHVLHGQLKLPPESSTISLPSMPLLSPNDLPAYDYDP 180

Query: 181 ASADTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMESWGRPVKAIGPTIPSAYLDK 240
           AS DTII+FLTSQYSNI+DADLLFCNTFDKLEGEIIKWMES GRPVK IGPT+PSAYLDK
Sbjct: 181 ASVDTIIDFLTSQYSNIQDADLLFCNTFDKLEGEIIKWMESLGRPVKTIGPTVPSAYLDK 240

Query: 241 RIENDKYYGLSLFDPNQDDHLIKWLQTKPPSSVLYVSYGSIVEISEEQLKNLAFGIKQSD 300
           R++NDK+YGLSLF+PNQDD+L KWL TKPP+SVLY+SYGSIVE+ EEQLKNLA GIK+S 
Sbjct: 241 RVDNDKHYGLSLFEPNQDDYL-KWLNTKPPASVLYISYGSIVEVGEEQLKNLAHGIKESG 300

Query: 301 KFFLWVVRETEARKLPPNFIESVGEKGIVVSWCSQLDVLAHPAIGCFFTHCGWNSTLEAL 360
           KFFLWVVR+TEA+KLPPNF+ESVGEKG+VV WCSQL+VLAHPA+GCFFTHCGWNSTLEAL
Sbjct: 301 KFFLWVVRDTEAQKLPPNFVESVGEKGLVVGWCSQLEVLAHPAVGCFFTHCGWNSTLEAL 360

Query: 361 CLGVPVVAFPQWADQVTNAKFMEDVWKVGKRVKVDEKRMASEEEIRNCICEVMEEERGSE 420
           CLGVPVVAFPQWADQVTNAKF+EDVWKVGKRVK++EKR+AS+EEIR+CI EVME ER +E
Sbjct: 361 CLGVPVVAFPQWADQVTNAKFLEDVWKVGKRVKLNEKRLASQEEIRSCIFEVMEGERANE 420

Query: 421 FKKNSLEWKQWAKEAMEEGGSSYNNIMEFVSMIKQS 457
           FK+NSLEWK+WAKEAM+EGGSS  NIMEFV MIKQS
Sbjct: 421 FKRNSLEWKKWAKEAMDEGGSSDKNIMEFVQMIKQS 455

BLAST of CSPI06G19440 vs. NCBI nr
Match: XP_022997134.1 (UDP-glycosyltransferase 74E2-like [Cucurbita maxima])

HSP 1 Score: 780.0 bits (2013), Expect = 1.1e-221
Identity = 368/456 (80.70%), Postives = 417/456 (91.45%), Query Frame = 0

Query: 1   MEIGSDPHIIAFPFPSQGHINPQLQFAKRLISNGIKLTLLTTLHVSQHLKLQGDYSNSFK 60
           ME G D HIIAFPFPSQGHINPQLQF+KRLI+NGIK+TLLTTLHVS++LK QG Y++S +
Sbjct: 1   MEKGEDRHIIAFPFPSQGHINPQLQFSKRLIANGIKVTLLTTLHVSRNLKFQGAYADSVR 60

Query: 61  IEVISDGSENRQETDTMKQTLDRFQHKMTTNLQNYLHKAMDSSNPPRFILYDSTMPWVLD 120
           I VISDGSE+RQ+TDTM+QTLDRF+ KM+ NL+NYL + MDSSNPPRFILYDSTMPWVL+
Sbjct: 61  IRVISDGSEDRQDTDTMRQTLDRFREKMSKNLENYLREVMDSSNPPRFILYDSTMPWVLE 120

Query: 121 VAKEFGIAKAPVYTQSCALNSINYHVLHGQLKLPPESSIISLPSMPPLSANDLPAYDYDP 180
           VAKEFG+ +APVYTQSCALNSINYHVLHG LKLPP+S  ISLPSMP L  NDLPAYDYDP
Sbjct: 121 VAKEFGLPRAPVYTQSCALNSINYHVLHGHLKLPPDSPTISLPSMPLLCTNDLPAYDYDP 180

Query: 181 ASADTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMESWGRPVKAIGPTIPSAYLDK 240
           AS +TIIEFLTSQYSNI+DADLLFCNTF KLEGEIIKWMESWGRPVKAIGPT+PSAYLDK
Sbjct: 181 ASTETIIEFLTSQYSNIQDADLLFCNTFHKLEGEIIKWMESWGRPVKAIGPTLPSAYLDK 240

Query: 241 RIENDKYYGLSLFDPNQDDHLIKWLQTKPPSSVLYVSYGSIVEISEEQLKNLAFGIKQSD 300
           R+E+DKYYGLSLFDPN+D+  +KWL  KPP SVLYVSYGS+V + EEQLKN+A G K+S 
Sbjct: 241 RLEDDKYYGLSLFDPNKDE-CLKWLDNKPPGSVLYVSYGSLVVLGEEQLKNMALGFKESG 300

Query: 301 KFFLWVVRETEARKLPPNFIESVGEKGIVVSWCSQLDVLAHPAIGCFFTHCGWNSTLEAL 360
           KFFLWVVRETE++KLPPNF+ESVGEKG++VSWCSQL VLAHPA+GCF THCGWNSTLEAL
Sbjct: 301 KFFLWVVRETESQKLPPNFMESVGEKGLMVSWCSQLQVLAHPAVGCFLTHCGWNSTLEAL 360

Query: 361 CLGVPVVAFPQWADQVTNAKFMEDVWKVGKRVKVDEKRMASEEEIRNCICEVMEEERGSE 420
            LGVPVVAFPQWADQVTNAKF+EDVWKVGKRVKV+E+R+ASEEEIR+CICEVME ER +E
Sbjct: 361 SLGVPVVAFPQWADQVTNAKFLEDVWKVGKRVKVNEERLASEEEIRSCICEVMEGERANE 420

Query: 421 FKKNSLEWKQWAKEAMEEGGSSYNNIMEFVSMIKQS 457
           FK NS+EW +WAKEAM+EGGSS  +IMEFV+MIKQ+
Sbjct: 421 FKSNSMEWMKWAKEAMDEGGSSDKDIMEFVAMIKQA 455

BLAST of CSPI06G19440 vs. NCBI nr
Match: XP_022961784.1 (UDP-glycosyltransferase 74E2-like [Cucurbita moschata] >XP_022961785.1 UDP-glycosyltransferase 74E2-like [Cucurbita moschata])

HSP 1 Score: 779.6 bits (2012), Expect = 1.4e-221
Identity = 368/456 (80.70%), Postives = 418/456 (91.67%), Query Frame = 0

Query: 1   MEIGSDPHIIAFPFPSQGHINPQLQFAKRLISNGIKLTLLTTLHVSQHLKLQGDYSNSFK 60
           ME G D HIIAFPFPSQGHINPQLQF+KRLI+NGIK+TLLTTLHVS++LK QG YS+S K
Sbjct: 1   MEKGEDRHIIAFPFPSQGHINPQLQFSKRLIANGIKVTLLTTLHVSRNLKFQGAYSDSVK 60

Query: 61  IEVISDGSENRQETDTMKQTLDRFQHKMTTNLQNYLHKAMDSSNPPRFILYDSTMPWVLD 120
           I VISDGSE+RQ+TDTM+QTLDRF+ KMT NL+NYL + MDSSNPPRFILYDSTMPWVL+
Sbjct: 61  IRVISDGSEDRQDTDTMRQTLDRFREKMTKNLENYLREVMDSSNPPRFILYDSTMPWVLE 120

Query: 121 VAKEFGIAKAPVYTQSCALNSINYHVLHGQLKLPPESSIISLPSMPPLSANDLPAYDYDP 180
           VAKEFG+ +APVYTQSCALNSINYHVLHG LKLPP+S  ISLPSMP L  NDLPAYDYDP
Sbjct: 121 VAKEFGLPRAPVYTQSCALNSINYHVLHGYLKLPPDSPTISLPSMPLLCPNDLPAYDYDP 180

Query: 181 ASADTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMESWGRPVKAIGPTIPSAYLDK 240
           AS +TIIEFLTSQYSNI+DADLLFCNTF KLEGEIIKWMESWGRPVKAIGPT+PSAYLDK
Sbjct: 181 ASTETIIEFLTSQYSNIQDADLLFCNTFHKLEGEIIKWMESWGRPVKAIGPTLPSAYLDK 240

Query: 241 RIENDKYYGLSLFDPNQDDHLIKWLQTKPPSSVLYVSYGSIVEISEEQLKNLAFGIKQSD 300
           R+E+DKYYGLSLFDPN+D+  +KWL +KPP SVLYVS+GS+V + EEQLKN+A G+K+S 
Sbjct: 241 RLEDDKYYGLSLFDPNKDE-CLKWLDSKPPGSVLYVSFGSLVVLGEEQLKNIALGVKESG 300

Query: 301 KFFLWVVRETEARKLPPNFIESVGEKGIVVSWCSQLDVLAHPAIGCFFTHCGWNSTLEAL 360
           KFFLWVVRETE++KLPPNF+ESVGEKG++VSWCSQL VLAHPA+GCF THCGWNSTLEAL
Sbjct: 301 KFFLWVVRETESQKLPPNFMESVGEKGLMVSWCSQLQVLAHPAVGCFLTHCGWNSTLEAL 360

Query: 361 CLGVPVVAFPQWADQVTNAKFMEDVWKVGKRVKVDEKRMASEEEIRNCICEVMEEERGSE 420
            LGVPVVAFPQWADQVTNAKF+EDVWKVGKRVKV+E+R+ASEEEIR+CICEVME ER +E
Sbjct: 361 SLGVPVVAFPQWADQVTNAKFLEDVWKVGKRVKVNEERLASEEEIRSCICEVMEGERANE 420

Query: 421 FKKNSLEWKQWAKEAMEEGGSSYNNIMEFVSMIKQS 457
           FK NS+EW +WAKEAM+EGGSS  +IMEFV++I Q+
Sbjct: 421 FKSNSMEWMKWAKEAMDEGGSSDKDIMEFVAIINQA 455

BLAST of CSPI06G19440 vs. TAIR 10
Match: AT1G05680.1 (Uridine diphosphate glycosyltransferase 74E2 )

HSP 1 Score: 444.9 bits (1143), Expect = 7.9e-125
Identity = 228/455 (50.11%), Postives = 302/455 (66.37%), Query Frame = 0

Query: 8   HIIAFPFPSQGHINPQLQFAKRLISNGIKLTL-LTTLHVSQHLKLQGDYSNSFKIEVISD 67
           H+I  PFP QGHI P  QF KRL S G+KLTL L +   S   K + D    F I     
Sbjct: 6   HLIVLPFPGQGHITPMSQFCKRLASKGLKLTLVLVSDKPSPPYKTEHDSITVFPI----- 65

Query: 68  GSENRQETDTMKQTLDRFQHKMTTNLQNYLHKAMD----SSNPPRFILYDSTMPWVLDVA 127
            S   QE +   Q LD +  ++ T+++N L K ++    S NPPR I+YDSTMPW+LDVA
Sbjct: 66  -SNGFQEGEEPLQDLDDYMERVETSIKNTLPKLVEDMKLSGNPPRAIVYDSTMPWLLDVA 125

Query: 128 KEFGIAKAPVYTQSCALNSINYHVLHGQLKLPP----ESSIISLPSMPPLSANDLPAYDY 187
             +G++ A  +TQ   + +I YHV  G   +P      S++ S PS P L+ANDLP++  
Sbjct: 126 HSYGLSGAVFFTQPWLVTAIYYHVFKGSFSVPSTKYGHSTLASFPSFPMLTANDLPSFLC 185

Query: 188 DPASADTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMES-WGRPVKAIGPTIPSAY 247
           + +S   I+  +  Q SNI+  D++ CNTFDKLE +++KW++S W  PV  IGPT+PS Y
Sbjct: 186 ESSSYPNILRIVVDQLSNIDRVDIVLCNTFDKLEEKLLKWVQSLW--PVLNIGPTVPSMY 245

Query: 248 LDKRIENDKYYGLSLFDPNQDDHLIKWLQTKPPSSVLYVSYGSIVEISEEQLKNLAFGIK 307
           LDKR+  DK YG SLF+    +  ++WL +K P+SV+Y+S+GS+V + E+Q+  LA G+K
Sbjct: 246 LDKRLSEDKNYGFSLFNAKVAE-CMEWLNSKEPNSVVYLSFGSLVILKEDQMLELAAGLK 305

Query: 308 QSDKFFLWVVRETEARKLPPNFIESVGEKGIVVSWCSQLDVLAHPAIGCFFTHCGWNSTL 367
           QS +FFLWVVRETE  KLP N++E +GEKG++VSW  QLDVLAH +IGCF THCGWNSTL
Sbjct: 306 QSGRFFLWVVRETETHKLPRNYVEEIGEKGLIVSWSPQLDVLAHKSIGCFLTHCGWNSTL 365

Query: 368 EALCLGVPVVAFPQWADQVTNAKFMEDVWKVGKRVKVDEKRMASEEEIRNCICEVMEEER 427
           E L LGVP++  P W DQ TNAKFM+DVWKVG RVK +       EEI   + EVME E+
Sbjct: 366 EGLSLGVPMIGMPHWTDQPTNAKFMQDVWKVGVRVKAEGDGFVRREEIMRSVEEVMEGEK 425

Query: 428 GSEFKKNSLEWKQWAKEAMEEGGSSYNNIMEFVSM 453
           G E +KN+ +WK  A+EA+ EGGSS  +I EFVSM
Sbjct: 426 GKEIRKNAEKWKVLAQEAVSEGGSSDKSINEFVSM 451

BLAST of CSPI06G19440 vs. TAIR 10
Match: AT1G05675.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 439.9 bits (1130), Expect = 2.5e-123
Identity = 228/455 (50.11%), Postives = 304/455 (66.81%), Query Frame = 0

Query: 8   HIIAFPFPSQGHINPQLQFAKRLISNGIKLTL-LTTLHVSQHLKLQGDYSNSFKIEVISD 67
           H+I  PFP+QGHI P  QF KRL S  +K+TL L +   S   K + D   +  +  IS+
Sbjct: 6   HVIVLPFPAQGHITPMSQFCKRLASKSLKITLVLVSDKPSPPYKTEHD---TITVVPISN 65

Query: 68  GSENRQETDTMKQTLDRFQHKMTTNLQNYLHKAMD----SSNPPRFILYDSTMPWVLDVA 127
           G +  QE     + LD +  ++ ++++N L K ++    S NPPR ++YDSTMPW+LDVA
Sbjct: 66  GFQEGQE---RSEDLDEYMERVESSIKNRLPKLIEDMKLSGNPPRALVYDSTMPWLLDVA 125

Query: 128 KEFGIAKAPVYTQSCALNSINYHVLHGQLKLPP----ESSIISLPSMPPLSANDLPAYDY 187
             +G++ A  +TQ   +++I YHV  G   +P      S++ S PS+P L+ANDLP++  
Sbjct: 126 HSYGLSGAVFFTQPWLVSAIYYHVFKGSFSVPSTKYGHSTLASFPSLPILNANDLPSFLC 185

Query: 188 DPASADTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMES-WGRPVKAIGPTIPSAY 247
           + +S   I+  +  Q SNI+  D++ CNTFDKLE +++KW++S W  PV  IGPT+PS Y
Sbjct: 186 ESSSYPYILRTVIDQLSNIDRVDIVLCNTFDKLEEKLLKWIKSVW--PVLNIGPTVPSMY 245

Query: 248 LDKRIENDKYYGLSLFDPNQDDHLIKWLQTKPPSSVLYVSYGSIVEISEEQLKNLAFGIK 307
           LDKR+  DK YG SLF     +  ++WL +K PSSV+YVS+GS+V + ++QL  LA G+K
Sbjct: 246 LDKRLAEDKNYGFSLFGAKIAE-CMEWLNSKQPSSVVYVSFGSLVVLKKDQLIELAAGLK 305

Query: 308 QSDKFFLWVVRETEARKLPPNFIESVGEKGIVVSWCSQLDVLAHPAIGCFFTHCGWNSTL 367
           QS  FFLWVVRETE RKLP N+IE +GEKG+ VSW  QL+VL H +IGCF THCGWNSTL
Sbjct: 306 QSGHFFLWVVRETERRKLPENYIEEIGEKGLTVSWSPQLEVLTHKSIGCFVTHCGWNSTL 365

Query: 368 EALCLGVPVVAFPQWADQVTNAKFMEDVWKVGKRVKVDEKRMASEEEIRNCICEVMEEER 427
           E L LGVP++  P WADQ TNAKFMEDVWKVG RVK D       EE    + EVME E+
Sbjct: 366 EGLSLGVPMIGMPHWADQPTNAKFMEDVWKVGVRVKADSDGFVRREEFVRRVEEVMEAEQ 425

Query: 428 GSEFKKNSLEWKQWAKEAMEEGGSSYNNIMEFVSM 453
           G E +KN+ +WK  A+EA+ EGGSS  NI EFVSM
Sbjct: 426 GKEIRKNAEKWKVLAQEAVSEGGSSDKNINEFVSM 451

BLAST of CSPI06G19440 vs. TAIR 10
Match: AT2G31790.1 (UDP-Glycosyltransferase superfamily protein )

HSP 1 Score: 401.4 bits (1030), Expect = 1.0e-111
Identity = 202/453 (44.59%), Postives = 282/453 (62.25%), Query Frame = 0

Query: 8   HIIAFPFPSQGHINPQLQFAKRLISNGIKLTLLTTLHVSQHLKLQGDYSNSFKIEVISDG 67
           H++ FP+P QGHINP +Q AKRL   GI  TL+      +      DY  S  +  I DG
Sbjct: 8   HVLFFPYPLQGHINPMIQLAKRLSKKGITSTLIIASKDHREPYTSDDY--SITVHTIHDG 67

Query: 68  SENRQETDTMKQTLDRFQHKMTTNLQNYLHKAMDSSNPPRFILYDSTMPWVLDVAKEFGI 127
               +        LDRF +  + +L +++  A  S NPP+ ++YD  MP+ LD+AK+  +
Sbjct: 68  FFPHEHPHAKFVDLDRFHNSTSRSLTDFISSAKLSDNPPKALIYDPFMPFALDIAKDLDL 127

Query: 128 AKAPVYTQSCALNSINYHVLHGQLKLPPE----SSIISLPSMPPLSANDLPAYDYDPASA 187
                +TQ    + + YH+  G   +P +     ++ S P  P LS +DLP++  +  S 
Sbjct: 128 YVVAYFTQPWLASLVYYHINEGTYDVPVDRHENPTLASFPGFPLLSQDDLPSFACEKGSY 187

Query: 188 DTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWM-ESWGRPVKAIGPTIPSAYLDKRI 247
             + EF+  Q+SN+  AD + CNTFD+LE +++KWM + W  PVK IGP +PS +LD R+
Sbjct: 188 PLLHEFVVRQFSNLLQADCILCNTFDQLEPKVVKWMNDQW--PVKNIGPVVPSKFLDNRL 247

Query: 248 ENDKYYGLSLFDPNQDDHLIKWLQTKPPSSVLYVSYGSIVEISEEQLKNLAFGIKQSDKF 307
             DK Y L       D+ ++KWL  +P  SV+YV++G++V +SE+Q+K +A  I Q+   
Sbjct: 248 PEDKDYELENSKTEPDESVLKWLGNRPAKSVVYVAFGTLVALSEKQMKEIAMAISQTGYH 307

Query: 308 FLWVVRETEARKLPPNFIESVGEK--GIVVSWCSQLDVLAHPAIGCFFTHCGWNSTLEAL 367
           FLW VRE+E  KLP  FIE   EK  G+V  W  QL+VLAH +IGCF +HCGWNSTLEAL
Sbjct: 308 FLWSVRESERSKLPSGFIEEAEEKDSGLVAKWVPQLEVLAHESIGCFVSHCGWNSTLEAL 367

Query: 368 CLGVPVVAFPQWADQVTNAKFMEDVWKVGKRVKVDEKRMASEEEIRNCICEVMEEERGSE 427
           CLGVP+V  PQW DQ TNAKF+EDVWK+G RV+ D + ++S+EEI  CI EVME ERG E
Sbjct: 368 CLGVPMVGVPQWTDQPTNAKFIEDVWKIGVRVRTDGEGLSSKEEIARCIVEVMEGERGKE 427

Query: 428 FKKNSLEWKQWAKEAMEEGGSSYNNIMEFVSMI 454
            +KN  + K  A+EA+ EGGSS   I EFV+++
Sbjct: 428 IRKNVEKLKVLAREAISEGGSSDKKIDEFVALL 456

BLAST of CSPI06G19440 vs. TAIR 10
Match: AT2G31750.1 (UDP-glucosyl transferase 74D1 )

HSP 1 Score: 398.7 bits (1023), Expect = 6.5e-111
Identity = 207/456 (45.39%), Postives = 295/456 (64.69%), Query Frame = 0

Query: 8   HIIAFPFPSQGHINPQLQFAKRLISNGIKLTLLTTLHVSQHLKLQ----GDYSNSFKIEV 67
           +++ F FP QGHINP LQF+KRL+S  + +T LTT      +  +    G  +       
Sbjct: 8   NVLVFSFPIQGHINPLLQFSKRLLSKNVNVTFLTTSSTHNSILRRAITGGATALPLSFVP 67

Query: 68  ISDG-SENRQETDTMKQTLDRFQHKMTTNLQNYLHKAMDSSNP-PRFILYDSTMPWVLDV 127
           I DG  E+   TDT      +FQ     N+   L + + S +P P  ++YDS +P+VLDV
Sbjct: 68  IDDGFEEDHPSTDTSPDYFAKFQE----NVSRSLSELISSMDPKPNAVVYDSCLPYVLDV 127

Query: 128 AKEF-GIAKAPVYTQSCALNSINYHVLHGQLKLPPESSIISLPSMPPLSANDLPAYDYDP 187
            ++  G+A A  +TQS  +N+   H L G+ K     + + LP+MPPL  NDLP + YD 
Sbjct: 128 CRKHPGVAAASFFTQSSTVNATYIHFLRGEFK--EFQNDVVLPAMPPLKGNDLPVFLYDN 187

Query: 188 ASADTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMES-WGRPVKAIGPTIPSAYLD 247
                + E ++SQ+ N++D D    N+FD+LE E+++WM++ W  PVK IGP IPS YLD
Sbjct: 188 NLCRPLFELISSQFVNVDDIDFFLVNSFDELEVEVLQWMKNQW--PVKNIGPMIPSMYLD 247

Query: 248 KRIENDKYYGLSLFDPNQDDHLIKWLQTKPPSSVLYVSYGSIVEISEEQLKNLAFGIKQS 307
           KR+  DK YG++LF+  Q +  + WL +KPP SV+YVS+GS+  + ++Q+  +A G+KQ+
Sbjct: 248 KRLAGDKDYGINLFNA-QVNECLDWLDSKPPGSVIYVSFGSLAVLKDDQMIEVAAGLKQT 307

Query: 308 DKFFLWVVRETEARKLPPNFIESVGEKGIVVSWCSQLDVLAHPAIGCFFTHCGWNSTLEA 367
              FLWVVRETE +KLP N+IE + +KG++V+W  QL VLAH +IGCF THCGWNSTLEA
Sbjct: 308 GHNFLWVVRETETKKLPSNYIEDICDKGLIVNWSPQLQVLAHKSIGCFMTHCGWNSTLEA 367

Query: 368 LCLGVPVVAFPQWADQVTNAKFMEDVWKVGKRVKVDEKRMASEEEIRNCICEVMEE--ER 427
           L LGV ++  P ++DQ TNAKF+EDVWKVG RVK D+     +EEI  C+ EVME+  E+
Sbjct: 368 LSLGVALIGMPAYSDQPTNAKFIEDVWKVGVRVKADQNGFVPKEEIVRCVGEVMEDMSEK 427

Query: 428 GSEFKKNSLEWKQWAKEAMEEGGSSYNNIMEFVSMI 454
           G E +KN+    ++A+EA+ +GG+S  NI EFV+ I
Sbjct: 428 GKEIRKNARRLMEFAREALSDGGNSDKNIDEFVAKI 454

BLAST of CSPI06G19440 vs. TAIR 10
Match: AT2G43820.1 (UDP-glucosyltransferase 74F2 )

HSP 1 Score: 360.9 bits (925), Expect = 1.5e-99
Identity = 196/455 (43.08%), Postives = 273/455 (60.00%), Query Frame = 0

Query: 8   HIIAFPFPSQGHINPQLQFAKRLISNGIKLTLLTTLHVSQHLKLQGDYSNSFKIEVISDG 67
           H++A P+P+QGHI P  QF KRL   G+K TL  T  V     +  D S    I  ISDG
Sbjct: 7   HVLAVPYPTQGHITPFRQFCKRLHFKGLKTTLALTTFVFN--SINPDLSGPISIATISDG 66

Query: 68  SENR--QETDTMKQTLDRFQHKMTTNLQNYLHKAMDSSNPPRFILYDSTMPWVLDVAKEF 127
            ++   +  D++   L  F+   +  + + + K   S NP   I+YD+ +PW LDVA+EF
Sbjct: 67  YDHGGFETADSIDDYLKDFKTSGSKTIADIIQKHQTSDNPITCIVYDAFLPWALDVAREF 126

Query: 128 GIAKAPVYTQSCALNSINY--HVLHGQLKLPPESSIISLPSMPPLSANDLPAYDYDPASA 187
           G+   P +TQ CA+N + Y  ++ +G L+LP E        +P L   DLP++     S 
Sbjct: 127 GLVATPFFTQPCAVNYVYYLSYINNGSLQLPIE-------ELPFLELQDLPSFFSVSGSY 186

Query: 188 DTIIEFLTSQYSNIEDADLLFCNTFDKLEGEIIKWMESWGR--PVKAIGPTIPSAYLDKR 247
               E +  Q+ N E AD +  N+F +LE   +   E W +  PV  IGPTIPS YLD+R
Sbjct: 187 PAYFEMVLQQFINFEKADFVLVNSFQELE---LHENELWSKACPVLTIGPTIPSIYLDQR 246

Query: 248 IENDKYYGLSLFDPNQDDHLIKWLQTKPPSSVLYVSYGSIVEISEEQLKNLAFGIKQSDK 307
           I++D  Y L+LF+   D   I WL T+P  SV+YV++GS+ +++  Q++ LA  +  S+ 
Sbjct: 247 IKSDTGYDLNLFESKDDSFCINWLDTRPQGSVVYVAFGSMAQLTNVQMEELASAV--SNF 306

Query: 308 FFLWVVRETEARKLPPNFIESVG-EKGIVVSWCSQLDVLAHPAIGCFFTHCGWNSTLEAL 367
            FLWVVR +E  KLP  F+E+V  EK +V+ W  QL VL++ AIGCF THCGWNST+EAL
Sbjct: 307 SFLWVVRSSEEEKLPSGFLETVNKEKSLVLKWSPQLQVLSNKAIGCFLTHCGWNSTMEAL 366

Query: 368 CLGVPVVAFPQWADQVTNAKFMEDVWKVGKRVKVD-EKRMASEEEIRNCICEVMEEERGS 427
             GVP+VA PQW DQ  NAK+++DVWK G RVK + E  +A  EEI   I EVME ER  
Sbjct: 367 TFGVPMVAMPQWTDQPMNAKYIQDVWKAGVRVKTEKESGIAKREEIEFSIKEVMEGERSK 426

Query: 428 EFKKNSLEWKQWAKEAMEEGGSSYNNIMEFVSMIK 455
           E KKN  +W+  A +++ EGGS+  NI  FVS ++
Sbjct: 427 EMKKNVKKWRDLAVKSLNEGGSTDTNIDTFVSRVQ 447

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
K7NBW31.5e-21377.41Mogroside IE synthase OS=Siraitia grosvenorii OX=190515 GN=UGT74AC1 PE=1 SV=1[more]
Q9SYK91.1e-12350.11UDP-glycosyltransferase 74E2 OS=Arabidopsis thaliana OX=3702 GN=UGT74E2 PE=1 SV=... [more]
P0C7P73.6e-12250.11UDP-glycosyltransferase 74E1 OS=Arabidopsis thaliana OX=3702 GN=UGT74E1 PE=3 SV=... [more]
W8JMV41.1e-11044.71UDP glycosyltransferase 9 OS=Catharanthus roseus OX=4058 GN=UGT9 PE=2 SV=1[more]
Q9SKC11.4e-11044.59UDP-glycosyltransferase 74C1 OS=Arabidopsis thaliana OX=3702 GN=UGT74C1 PE=2 SV=... [more]
Match NameE-valueIdentityDescription
A0A0A0KGW23.4e-26999.78Glycosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_6G366250 PE=3 SV=1[more]
A0A1S3BCD12.7e-25895.61Glycosyltransferase OS=Cucumis melo OX=3656 GN=LOC103488489 PE=3 SV=1[more]
A0A5A7V9C92.7e-25895.61Glycosyltransferase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G0... [more]
A0A6J1KAL05.4e-22280.70Glycosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111492134 PE=3 SV=1[more]
A0A6J1HD857.0e-22280.70Glycosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111462448 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
XP_031742553.17.1e-26999.78UDP-glycosyltransferase 74E2 [Cucumis sativus] >KGN47627.1 hypothetical protein ... [more]
XP_008445485.15.6e-25895.61PREDICTED: UDP-glycosyltransferase 74E2-like [Cucumis melo] >XP_008445487.1 PRED... [more]
XP_038886750.12.7e-23686.18mogroside IE synthase isoform X1 [Benincasa hispida][more]
XP_022997134.11.1e-22180.70UDP-glycosyltransferase 74E2-like [Cucurbita maxima][more]
XP_022961784.11.4e-22180.70UDP-glycosyltransferase 74E2-like [Cucurbita moschata] >XP_022961785.1 UDP-glyco... [more]
Match NameE-valueIdentityDescription
AT1G05680.17.9e-12550.11Uridine diphosphate glycosyltransferase 74E2 [more]
AT1G05675.12.5e-12350.11UDP-Glycosyltransferase superfamily protein [more]
AT2G31790.11.0e-11144.59UDP-Glycosyltransferase superfamily protein [more]
AT2G31750.16.5e-11145.39UDP-glucosyl transferase 74D1 [more]
AT2G43820.11.5e-9943.08UDP-glucosyltransferase 74F2 [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3D3.40.50.2000Glycogen Phosphorylase B;coord: 12..446
e-value: 3.6E-145
score: 486.5
NoneNo IPR availableGENE3D3.40.50.2000Glycogen Phosphorylase B;coord: 247..434
e-value: 3.6E-145
score: 486.5
NoneNo IPR availablePANTHERPTHR11926:SF1147UDP-GLYCOSYLTRANSFERASE 74E1-RELATEDcoord: 8..453
NoneNo IPR availablePANTHERPTHR11926GLUCOSYL/GLUCURONOSYL TRANSFERASEScoord: 8..453
NoneNo IPR availableSUPERFAMILY53756UDP-Glycosyltransferase/glycogen phosphorylasecoord: 7..454
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePFAMPF00201UDPGTcoord: 267..424
e-value: 1.9E-29
score: 102.8
IPR002213UDP-glucuronosyl/UDP-glucosyltransferaseCDDcd03784GT1_Gtf-likecoord: 7..442
e-value: 3.29734E-85
score: 265.184
IPR035595UDP-glycosyltransferase family, conserved sitePROSITEPS00375UDPGTcoord: 332..375

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI06G19440.1CSPI06G19440.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0080043 quercetin 3-O-glucosyltransferase activity
molecular_function GO:0080044 quercetin 7-O-glucosyltransferase activity
molecular_function GO:0008194 UDP-glycosyltransferase activity