CSPI04G22070 (gene) Wild cucumber (PI 183967)

NameCSPI04G22070
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionGlycosyltransferase
LocationChr4 : 20514611 .. 20516569 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATTTCCTTTACAAAGATTCATTTAGTCTTCTTCTCCAAGCAAGAAAAATTGTTATATATGTGGTTAACCAACTCCAATCCAAAAGATAAGACAGGCTTGAAGATCCAAAAGATAAGACGCTAATTATCTTTTGAATCAAATGAACAAGTTTGAGTTAGTTTTCATACCTGGGCCGGGGATCGGCCATCTTGCATCCACGGTCGAACTGGCAAATGTTCTTGTTAGCCGAGATGACCGTCTCTCTGTGACTGTGCTCGCCATCAAGCTTCCCGATGACATCAAAACGACGACGGAACGTATTCAGTCACTTTCAACGTCTTTCGAGGGTAAATCTATACGCTTTATTGTTCTTCCTGAACTTCCCTTCCCAAACCAAAGTAGTGAACCTCCCCCCCTTATGCTGCAAGCATTCCTTGAAAGCCACAAGCCCCATGTGAGGGAAATTGTGACCAACTTAATTCATGACTCGAACCGTCTTGTCGGATTTGTCATTGATATGTTTTGCACCAGTATGATAAATGTGGCTAATGAATTTAAGGTTCCTTGTTATTTGTTTTACACATCCAATGCTGGCTTTCTTGATTTTAGCTTTCATCTCCAAGAGCTTTACAATCAAAACAATAGCACAGCAGAACAGTTGCAAAATTCAAATGTTGAGTTAGCTCTGCCAAGTTTTATCAATCCAATTCCTAATAAAGCCATCCCCCCCTTCTTATTTGATAAAGACATGGCTGCTTGGTTTCATGATAATACTAAAAGATTTAGATCAGAAGTCAAAGGTATTTTGATCAATACATTTGTAGAGATGGAACCACAAATAGTGAAATGGATGTCAAATGGCTCTTCGAAAATCCCAAAGGTGTATACTGTTGGACCCATTTTGCAGTTGAAGAGTATTGGTGTTACACAAAGCAACAATGCTCTAAGTGGTGCAGATATACTAAAGTGGCTAGATGATCAACCTCCAGCATCAGTGGTTTTCCTATGCTTTGGTAGCAAAGGAAGCTTTGACGAGGATCAAGTGCTAGAGATTGCTCGAGCACTGGAGCGAAGTGAGGTTCGCTTCTTATGGTCCCTTCGACAGCCCCCACCAAAGGGTAAGTTTGAAGAGCCAAGCAACTATGCTAACATCAACGACGTCCTACCTGAGGGATTTCTCAACCGAACAGCCGATATTGGAAGGGTCATCGGGTGGGCTCCACAAATAGAAATATTGTCCCATCCTGCCACTGGAGGATTCATATCACATTGTGGTTGGAATTCAACGTTGGAGAGTGTATGGCATGGTGTACCAATGGCAACATGGCCATTGTATGCCGAACAACAGTTTAACGCATTTGAAATGGTGGTGGAATTAGGATTGGCAGTGGAGCTCACATTAGACTACGTGAAGGATTTTCATATAGGAAGGTCGAGAATAGTGAGTGCAGAAGAGATAGAAAGCGGGATCAGAAAATTGATGGGCGATTCTGGTAATGAGATCAGGAAGAAAGTCAAAGTAAAAGGTGAAGAAAGTCGAAAAAGTATGATGGAAGGTGGATCCTCCTTCAATTCATTACGTCATTTCATTGATGATGCTTTGACTAACTTACAAGAGGGCAACTACTAATGCCATTTTGGTTTCATATAATTCATGAAAAAATGTCTGTAACAGTGAAGGAGAGATTTTTGAAATAAACAGAATATTTAGTGGTAAAGCTTTTCTTTTTGTTGGAATAAACCCCAAGTTCTGTATTTCTCAAATTCTCAGTCTTATCTCTCTCTGCTTACTTCTCTACCTATTTTCTCATTTCTCTGTTTCTTTCTTTATAACCCAATCCAATTTGAGTAAATTTGCGTTGAGTCCAAGCCTCCAAACCTCCAAACCTCCACTATATTCCTCTTTTATATCCTACCATATTTGTATAACTTTTTTAACTAACCACAAAAAAAATTAATTTAAGTTGTATTAATTTAAAT

mRNA sequence

ATGAACAAGTTTGAGTTAGTTTTCATACCTGGGCCGGGGATCGGCCATCTTGCATCCACGGTCGAACTGGCAAATGTTCTTGTTAGCCGAGATGACCGTCTCTCTGTGACTGTGCTCGCCATCAAGCTTCCCGATGACATCAAAACGACGACGGAACGTATTCAGTCACTTTCAACGTCTTTCGAGGGTAAATCTATACGCTTTATTGTTCTTCCTGAACTTCCCTTCCCAAACCAAAGTAGTGAACCTCCCCCCCTTATGCTGCAAGCATTCCTTGAAAGCCACAAGCCCCATGTGAGGGAAATTGTGACCAACTTAATTCATGACTCGAACCGTCTTGTCGGATTTGTCATTGATATGTTTTGCACCAGTATGATAAATGTGGCTAATGAATTTAAGGTTCCTTGTTATTTGTTTTACACATCCAATGCTGGCTTTCTTGATTTTAGCTTTCATCTCCAAGAGCTTTACAATCAAAACAATAGCACAGCAGAACAGTTGCAAAATTCAAATGTTGAGTTAGCTCTGCCAAGTTTTATCAATCCAATTCCTAATAAAGCCATCCCCCCCTTCTTATTTGATAAAGACATGGCTGCTTGGTTTCATGATAATACTAAAAGATTTAGATCAGAAGTCAAAGGTATTTTGATCAATACATTTGTAGAGATGGAACCACAAATAGTGAAATGGATGTCAAATGGCTCTTCGAAAATCCCAAAGGTGTATACTGTTGGACCCATTTTGCAGTTGAAGAGTATTGGTGTTACACAAAGCAACAATGCTCTAAGTGGTGCAGATATACTAAAGTGGCTAGATGATCAACCTCCAGCATCAGTGGTTTTCCTATGCTTTGGTAGCAAAGGAAGCTTTGACGAGGATCAAGTGCTAGAGATTGCTCGAGCACTGGAGCGAAGTGAGGTTCGCTTCTTATGGTCCCTTCGACAGCCCCCACCAAAGGGTAAGTTTGAAGAGCCAAGCAACTATGCTAACATCAACGACGTCCTACCTGAGGGATTTCTCAACCGAACAGCCGATATTGGAAGGGTCATCGGGTGGGCTCCACAAATAGAAATATTGTCCCATCCTGCCACTGGAGGATTCATATCACATTGTGGTTGGAATTCAACGTTGGAGAGTGTATGGCATGGTGTACCAATGGCAACATGGCCATTGTATGCCGAACAACAGTTTAACGCATTTGAAATGGTGGTGGAATTAGGATTGGCAGTGGAGCTCACATTAGACTACGTGAAGGATTTTCATATAGGAAGGTCGAGAATAGTGAGTGCAGAAGAGATAGAAAGCGGGATCAGAAAATTGATGGGCGATTCTGGTAATGAGATCAGGAAGAAAGTCAAAGTAAAAGGTGAAGAAAGTCGAAAAAGTATGATGGAAGGTGGATCCTCCTTCAATTCATTACGTCATTTCATTGATGATGCTTTGACTAACTTACAAGAGGGCAACTACTAA

Coding sequence (CDS)

ATGAACAAGTTTGAGTTAGTTTTCATACCTGGGCCGGGGATCGGCCATCTTGCATCCACGGTCGAACTGGCAAATGTTCTTGTTAGCCGAGATGACCGTCTCTCTGTGACTGTGCTCGCCATCAAGCTTCCCGATGACATCAAAACGACGACGGAACGTATTCAGTCACTTTCAACGTCTTTCGAGGGTAAATCTATACGCTTTATTGTTCTTCCTGAACTTCCCTTCCCAAACCAAAGTAGTGAACCTCCCCCCCTTATGCTGCAAGCATTCCTTGAAAGCCACAAGCCCCATGTGAGGGAAATTGTGACCAACTTAATTCATGACTCGAACCGTCTTGTCGGATTTGTCATTGATATGTTTTGCACCAGTATGATAAATGTGGCTAATGAATTTAAGGTTCCTTGTTATTTGTTTTACACATCCAATGCTGGCTTTCTTGATTTTAGCTTTCATCTCCAAGAGCTTTACAATCAAAACAATAGCACAGCAGAACAGTTGCAAAATTCAAATGTTGAGTTAGCTCTGCCAAGTTTTATCAATCCAATTCCTAATAAAGCCATCCCCCCCTTCTTATTTGATAAAGACATGGCTGCTTGGTTTCATGATAATACTAAAAGATTTAGATCAGAAGTCAAAGGTATTTTGATCAATACATTTGTAGAGATGGAACCACAAATAGTGAAATGGATGTCAAATGGCTCTTCGAAAATCCCAAAGGTGTATACTGTTGGACCCATTTTGCAGTTGAAGAGTATTGGTGTTACACAAAGCAACAATGCTCTAAGTGGTGCAGATATACTAAAGTGGCTAGATGATCAACCTCCAGCATCAGTGGTTTTCCTATGCTTTGGTAGCAAAGGAAGCTTTGACGAGGATCAAGTGCTAGAGATTGCTCGAGCACTGGAGCGAAGTGAGGTTCGCTTCTTATGGTCCCTTCGACAGCCCCCACCAAAGGGTAAGTTTGAAGAGCCAAGCAACTATGCTAACATCAACGACGTCCTACCTGAGGGATTTCTCAACCGAACAGCCGATATTGGAAGGGTCATCGGGTGGGCTCCACAAATAGAAATATTGTCCCATCCTGCCACTGGAGGATTCATATCACATTGTGGTTGGAATTCAACGTTGGAGAGTGTATGGCATGGTGTACCAATGGCAACATGGCCATTGTATGCCGAACAACAGTTTAACGCATTTGAAATGGTGGTGGAATTAGGATTGGCAGTGGAGCTCACATTAGACTACGTGAAGGATTTTCATATAGGAAGGTCGAGAATAGTGAGTGCAGAAGAGATAGAAAGCGGGATCAGAAAATTGATGGGCGATTCTGGTAATGAGATCAGGAAGAAAGTCAAAGTAAAAGGTGAAGAAAGTCGAAAAAGTATGATGGAAGGTGGATCCTCCTTCAATTCATTACGTCATTTCATTGATGATGCTTTGACTAACTTACAAGAGGGCAACTACTAA
BLAST of CSPI04G22070 vs. Swiss-Prot
Match: U7A16_PYRCO (UDP-glycosyltransferase 71A16 OS=Pyrus communis GN=UGT71A16 PE=1 SV=1)

HSP 1 Score: 477.6 bits (1228), Expect = 1.6e-133
Identity = 252/483 (52.17%), Postives = 333/483 (68.94%), Query Frame = 1

Query: 5   ELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPDDIKTTTERIQSLSTSFEGK 64
           +LVF+P PGIGH+ STVE+A  LV+RDD+L +TVL +KLP D   T       + S    
Sbjct: 6   QLVFVPAPGIGHIVSTVEMAKQLVARDDQLFITVLVMKLPYDQPFTN------TDSSISH 65

Query: 65  SIRFIVLPELPFPNQSSEPPP-LMLQAFLESHKPHVREIVTNLIHDSN--------RLVG 124
            I F+ LPE     Q + P P    + F+E+HK HVR+ V NL+ +S+        RL G
Sbjct: 66  RINFVNLPEAQLDKQDTVPNPGSFFRMFVENHKTHVRDAVINLLPESDQSESTSKPRLAG 125

Query: 125 FVIDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELA 184
           FV+DMF  S+I+VANEF+VP Y+F+TSN+  L    H Q L ++      +L +S  ELA
Sbjct: 126 FVLDMFSASLIDVANEFEVPSYVFFTSNSSTLALLSHFQSLRDEGGIDITELTSSTAELA 185

Query: 185 LPSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNGS 244
           +PSFINP P   +P    DK+      +N  R++ + KGIL+NTF+E+E   + ++ +G 
Sbjct: 186 VPSFINPYPVAVLPGSFLDKESTKSTLNNVGRYK-QTKGILVNTFLELESHALHYLDSGV 245

Query: 245 SKIPKVYTVGPILQLKSIGVTQSNNALSGADILKWLDDQPPASVVFLCFGSKGSFDEDQV 304
            KIP VY VGP+L LKS      ++   G+DIL+WLDDQPP SVVFLCFGS GSF + QV
Sbjct: 246 -KIPPVYPVGPLLNLKS------SHEDKGSDILRWLDDQPPLSVVFLCFGSMGSFGDAQV 305

Query: 305 LEIARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAPQ 364
            EIA  LE S  RFLWSLRQPP KGK   PS+YA++  VLPEGFL+RTA +GRVIGWAPQ
Sbjct: 306 KEIACTLEHSGHRFLWSLRQPPSKGKRALPSDYADLKTVLPEGFLDRTATVGRVIGWAPQ 365

Query: 365 IEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLD 424
             IL HPA GGF+SHCGWNSTLES+W+GVP+A WP+YAEQ  NAF++VVELGLAVE+ +D
Sbjct: 366 AAILGHPAIGGFVSHCGWNSTLESIWNGVPIAAWPMYAEQNMNAFQLVVELGLAVEIKMD 425

Query: 425 YVKDFHIGRSRIVSAEEIESGIRKLMGDSGNEIRKKVKVKGEESRKSMMEGGSSFNSLRH 479
           Y KD  +    +VSAE+IE GIR++M +  +++RK+VK   E+S+K++++GGSS++SL  
Sbjct: 426 YRKDSDV----VVSAEDIERGIRQVM-ELDSDVRKRVKEMSEKSKKALVDGGSSYSSLGR 469

BLAST of CSPI04G22070 vs. Swiss-Prot
Match: UFOG3_FRAAN (Putative UDP-glucose flavonoid 3-O-glucosyltransferase 3 OS=Fragaria ananassa GN=GT3 PE=2 SV=1)

HSP 1 Score: 476.5 bits (1225), Expect = 3.6e-133
Identity = 254/478 (53.14%), Postives = 326/478 (68.20%), Query Frame = 1

Query: 5   ELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPDDIKTTTERIQSLSTSFE-- 64
           ELV IP PGIGHL ST+E+A +LVSRDD+L +TVL +  P   K T   +QSL+ S    
Sbjct: 6   ELVLIPSPGIGHLVSTLEIAKLLVSRDDKLFITVLIMHFPAVSKGTDAYVQSLADSSSPI 65

Query: 65  GKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIHD-SNRLVGFVIDMF 124
            + I FI LP     +        ++  F+ES +PHV++ V NL    + RL GFV+DMF
Sbjct: 66  SQRINFINLPHTNMDHTEGSVRNSLV-GFVESQQPHVKDAVANLRDSKTTRLAGFVVDMF 125

Query: 125 CTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELALPSFIN 184
           CT+MINVAN+  VP Y+F+TS A  L   FHLQEL +Q N    + ++S+ EL +PSF N
Sbjct: 126 CTTMINVANQLGVPSYVFFTSGAATLGLLFHLQELRDQYNKDCTEFKDSDAELIIPSFFN 185

Query: 185 PIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNGSSKIPKV 244
           P+P K +P  +  KD A  F +  KRFR E KGIL+NTF ++E   +  +S+  ++IP V
Sbjct: 186 PLPAKVLPGRMLVKDSAEPFLNVIKRFR-ETKGILVNTFTDLESHALHALSS-DAEIPPV 245

Query: 245 YTVGPILQLKSI-GVTQSNNALSGADILKWLDDQPPASVVFLCFGSKGSFDEDQVLEIAR 304
           Y VGP+L L S      S+      DILKWLDDQPP SVVFLCFGS GSFDE QV EIA 
Sbjct: 246 YPVGPLLNLNSNESRVDSDEVKKKNDILKWLDDQPPLSVVFLCFGSMGSFDESQVREIAN 305

Query: 305 ALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAPQIEILS 364
           ALE +  RFLWSLR+ PP GK   PS+Y +   VLPEGFL+RT  IG+VIGWAPQ+ +L+
Sbjct: 306 ALEHAGHRFLWSLRRSPPTGKVAFPSDYDDHTGVLPEGFLDRTGGIGKVIGWAPQVAVLA 365

Query: 365 HPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLDYVKDF 424
           HP+ GGF+SHCGWNSTLES+WHGVP+ATWPLYAEQQ NAF+ V EL LAVE+ + Y    
Sbjct: 366 HPSVGGFVSHCGWNSTLESLWHGVPVATWPLYAEQQLNAFQPVKELELAVEIDMSYRSKS 425

Query: 425 HIGRSRIVSAEEIESGIRKLMGDSGNEIRKKVKVKGEESRKSMMEGGSSFNSLRHFID 479
            +    +VSA+EIE GIR++M    ++IRK+VK   E+ +K++M+GGSS+ SL HFID
Sbjct: 426 PV----LVSAKEIERGIREVMELDSSDIRKRVKEMSEKGKKALMDGGSSYTSLGHFID 476

BLAST of CSPI04G22070 vs. Swiss-Prot
Match: U7A15_MALDO (UDP-glycosyltransferase 71A15 OS=Malus domestica GN=UGT71A15 PE=1 SV=1)

HSP 1 Score: 470.7 bits (1210), Expect = 2.0e-131
Identity = 250/483 (51.76%), Postives = 333/483 (68.94%), Query Frame = 1

Query: 5   ELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPDDIKTTTERIQSLSTSFEGK 64
           +LVF+P PGIGH+ STVE+A  L +RDD+L +TVL +KLP   +  T    S+S      
Sbjct: 6   QLVFVPAPGIGHIVSTVEMAKQLAARDDQLFITVLVMKLPY-AQPFTNTDSSIS-----H 65

Query: 65  SIRFIVLPELPFPNQSSEPPP-LMLQAFLESHKPHVREIVTNLIHDSN--------RLVG 124
            I F+ LPE     Q   P P    + F+E+HK HVR+ V N++ +S+        RL G
Sbjct: 66  RINFVNLPEAQPDKQDIVPNPGSFFRMFVENHKSHVRDAVINVLPESDQSESTSKPRLAG 125

Query: 125 FVIDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELA 184
           FV+DMF  S+I+VANEFKVP YLF+TSNA  L    H Q L ++      +L +S  ELA
Sbjct: 126 FVLDMFSASLIDVANEFKVPSYLFFTSNASALALMSHFQSLRDEGGIDITELTSSTAELA 185

Query: 185 LPSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNGS 244
           +PSFINP P   +P  L D +      ++  +++ + KGIL+NTF+E+E   + ++ +G 
Sbjct: 186 VPSFINPYPAAVLPGSLLDMESTKSTLNHVSKYK-QTKGILVNTFMELESHALHYLDSGD 245

Query: 245 SKIPKVYTVGPILQLKSIGVTQSNNALSGADILKWLDDQPPASVVFLCFGSKGSFDEDQV 304
            KIP VY VGP+L LK      S++    +DIL+WLDDQPP SVVFLCFGS GSF E QV
Sbjct: 246 -KIPPVYPVGPLLNLK------SSDEDKASDILRWLDDQPPFSVVFLCFGSMGSFGEAQV 305

Query: 305 LEIARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAPQ 364
            EIA ALE S  RFLWSLR+PPP+GK   PS+Y ++  VLPEGFL+RTA +G+VIGWAPQ
Sbjct: 306 KEIACALEHSGHRFLWSLRRPPPQGKRAMPSDYEDLKTVLPEGFLDRTATVGKVIGWAPQ 365

Query: 365 IEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLD 424
             IL HPATGGF+SHCGWNSTLES+W+GVP+A WPLYAEQ  NAF++VVELGLAVE+ +D
Sbjct: 366 AAILGHPATGGFVSHCGWNSTLESLWNGVPIAAWPLYAEQNLNAFQLVVELGLAVEIKMD 425

Query: 425 YVKDFHIGRSRIVSAEEIESGIRKLMGDSGNEIRKKVKVKGEESRKSMMEGGSSFNSLRH 479
           Y +D  +    +VSAE+IE GIR++M +  +++RK+VK   E+S+K++++GGSS++SL  
Sbjct: 426 YRRDSDV----VVSAEDIERGIRRVM-ELDSDVRKRVKEMSEKSKKALVDGGSSYSSLGR 469

BLAST of CSPI04G22070 vs. Swiss-Prot
Match: UFOG6_FRAAN (UDP-glucose flavonoid 3-O-glucosyltransferase 6 OS=Fragaria ananassa GN=GT6 PE=1 SV=1)

HSP 1 Score: 468.0 bits (1203), Expect = 1.3e-130
Identity = 247/486 (50.82%), Postives = 336/486 (69.14%), Query Frame = 1

Query: 5   ELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPDDIKTTTERIQSLST--SFE 64
           EL+FIP PGIGH+ STVE+A +L+ RDD L +T+L +K P     +   I+SL+   S +
Sbjct: 6   ELIFIPIPGIGHIVSTVEIAKLLLCRDDNLFITILIMKFPFTADGSDVYIKSLAVDPSLK 65

Query: 65  GKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIH---DSNRLVGFVID 124
            + IRF+ LP+  F    +         F++SHK HV++ VT L+    ++ R+ GFVID
Sbjct: 66  TQRIRFVNLPQEHFQGTGATG----FFTFIDSHKSHVKDAVTRLMETKSETTRIAGFVID 125

Query: 125 MFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELALPSF 184
           MFCT MI++ANEF +P Y+FYTS A  L   FHLQ L ++ N    + ++S+ EL + SF
Sbjct: 126 MFCTGMIDLANEFGLPSYVFYTSGAADLGLMFHLQALRDEENKDCTEFKDSDAELVVSSF 185

Query: 185 INPIPN-KAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNGSSKI 244
           +NP+P  + +P  +F+K+   +F +  KR+R E KGIL+NTF+E+EP  ++ +S+    +
Sbjct: 186 VNPLPAARVLPSVVFEKEGGNFFLNFAKRYR-ETKGILVNTFLELEPHAIQSLSSDGKIL 245

Query: 245 PKVYTVGPILQLKSIG-VTQSNNALSGADILKWLDDQPPASVVFLCFGSKGSFDEDQVLE 304
           P VY VGPIL +KS G    S  +   +DIL+WLDDQPP+SVVFLCFGS G F EDQV E
Sbjct: 246 P-VYPVGPILNVKSEGNQVSSEKSKQKSDILEWLDDQPPSSVVFLCFGSMGCFGEDQVKE 305

Query: 305 IARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAPQIE 364
           IA ALE+  +RFLWSLRQP  K K   PS+Y +   VLPEGFL+RT D+G+VIGWAPQ+ 
Sbjct: 306 IAHALEQGGIRFLWSLRQPS-KEKIGFPSDYTDYKAVLPEGFLDRTTDLGKVIGWAPQLA 365

Query: 365 ILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLDYV 424
           IL+HPA GGF+SHCGWNSTLES+W+GVP+ATWP YAEQQ NAFE+V EL LAVE+ + Y 
Sbjct: 366 ILAHPAVGGFVSHCGWNSTLESIWYGVPIATWPFYAEQQVNAFELVKELKLAVEIDMGYR 425

Query: 425 KDFHIGRSRIVSAEEIESGIRKLMGDSGNEIRKKVKVKGEESRKSMMEGGSSFNSLRHFI 484
           KD  +    IVS E IE GI+++M +  +E+RK+VK   + SRK++ E GSS++SL  F+
Sbjct: 426 KDSGV----IVSRENIEKGIKEVM-EQESELRKRVKEMSQMSRKALEEDGSSYSSLGRFL 479

BLAST of CSPI04G22070 vs. Swiss-Prot
Match: U71E1_STERE (UDP-glycosyltransferase 71E1 OS=Stevia rebaudiana GN=UGT71E1 PE=2 SV=1)

HSP 1 Score: 444.9 bits (1143), Expect = 1.1e-123
Identity = 244/482 (50.62%), Postives = 324/482 (67.22%), Query Frame = 1

Query: 1   MNKFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPDDIKTTTERIQSLSTS 60
           M+  ELVFIP PG GHL  TVELA +L+ RD RLSVT++ + L    K  TE    +   
Sbjct: 1   MSTSELVFIPSPGAGHLPPTVELAKLLLHRDQRLSVTIIVMNLWLGPKHNTEARPCVP-- 60

Query: 61  FEGKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIH-DSNRLVGFVID 120
               S+RF+ +P       +   P   + AF+E HKP VR+IV  +I  DS RL GFV+D
Sbjct: 61  ----SLRFVDIP-CDESTMALISPNTFISAFVEHHKPRVRDIVRGIIESDSVRLAGFVLD 120

Query: 121 MFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELALPSF 180
           MFC  M +VANEF VP Y ++TS A  L   FHLQ   +     A +L+NS+ EL++PS+
Sbjct: 121 MFCMPMSDVANEFGVPSYNYFTSGAATLGLMFHLQWKRDHEGYDATELKNSDTELSVPSY 180

Query: 181 INPIPNKAIPPFLFDKDMAA-WFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNGSSKI 240
           +NP+P K +P  + DK+  +  F D  +R R E KGI++N+   +E   ++++S+ ++ I
Sbjct: 181 VNPVPAKVLPEVVLDKEGGSKMFLDLAERIR-ESKGIIVNSCQAIERHALEYLSSNNNGI 240

Query: 241 PKVYTVGPILQLKSIGVTQSNNALSGADILKWLDDQPPASVVFLCFGSKGSFDEDQVLEI 300
           P V+ VGPIL L++    + ++A +  +I++WL++QP +SVVFLCFGS GSF+E QV EI
Sbjct: 241 PPVFPVGPILNLEN----KKDDAKTD-EIMRWLNEQPESSVVFLCFGSMGSFNEKQVKEI 300

Query: 301 ARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAPQIEI 360
           A A+ERS  RFLWSLR+P PK K E P  Y N+ +VLPEGFL RT+ IG+VIGWAPQ+ +
Sbjct: 301 AVAIERSGHRFLWSLRRPTPKEKIEFPKEYENLEEVLPEGFLKRTSSIGKVIGWAPQMAV 360

Query: 361 LSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLDYVK 420
           LSHP+ GGF+SHCGWNSTLES+W GVPMA WPLYAEQ  NAF +VVELGLA E+ +DY  
Sbjct: 361 LSHPSVGGFVSHCGWNSTLESMWCGVPMAAWPLYAEQTLNAFLLVVELGLAAEIRMDYRT 420

Query: 421 DFHIG--RSRIVSAEEIESGIRKLMGDSGNEIRKKVKVKGEESRKSMMEGGSSFNSLRHF 479
           D   G      V+ EEIE GIRKLM D   EIR KVK   E+SR +++EGGSS+ S+  F
Sbjct: 421 DTKAGYDGGMEVTVEEIEDGIRKLMSD--GEIRNKVKDVKEKSRAAVVEGGSSYASIGKF 467

BLAST of CSPI04G22070 vs. TrEMBL
Match: A0A0A0L321_CUCSA (Glycosyltransferase OS=Cucumis sativus GN=Csa_4G618520 PE=3 SV=1)

HSP 1 Score: 982.2 bits (2538), Expect = 2.2e-283
Identity = 485/489 (99.18%), Postives = 488/489 (99.80%), Query Frame = 1

Query: 1   MNKFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPDDIKTTTERIQSLSTS 60
           MNKFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLP+DIKTTTERIQSLS S
Sbjct: 1   MNKFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKTTTERIQSLSAS 60

Query: 61  FEGKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIHDSNRLVGFVIDM 120
           FEGKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIHDSNRLVGFVIDM
Sbjct: 61  FEGKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIHDSNRLVGFVIDM 120

Query: 121 FCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELALPSFI 180
           FCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELALPSFI
Sbjct: 121 FCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELALPSFI 180

Query: 181 NPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNGSSKIPK 240
           NPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNGSSKIPK
Sbjct: 181 NPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNGSSKIPK 240

Query: 241 VYTVGPILQLKSIGVTQSNNALSGADILKWLDDQPPASVVFLCFGSKGSFDEDQVLEIAR 300
           VYTVGPILQLKSIGVTQSNNAL+GADILKWLDDQPPASVVFLCFGSKGSFDEDQVLEIAR
Sbjct: 241 VYTVGPILQLKSIGVTQSNNALNGADILKWLDDQPPASVVFLCFGSKGSFDEDQVLEIAR 300

Query: 301 ALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAPQIEILS 360
           ALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAPQIEILS
Sbjct: 301 ALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAPQIEILS 360

Query: 361 HPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLDYVKDF 420
           HPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLDYVKDF
Sbjct: 361 HPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLDYVKDF 420

Query: 421 HIGRSRIVSAEEIESGIRKLMGDSGNEIRKKVKVKGEESRKSMMEGGSSFNSLRHFIDDA 480
           HIGRSRIVSAEEIESGIRKLMGDSGNEIRKK+KVKGEESRKSMMEGGSSFNSLRHFIDDA
Sbjct: 421 HIGRSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKSMMEGGSSFNSLRHFIDDA 480

Query: 481 LTNLQEGNY 490
           LTNLQEGNY
Sbjct: 481 LTNLQEGNY 489

BLAST of CSPI04G22070 vs. TrEMBL
Match: K7NBW4_SIRGR (Glycosyltransferase OS=Siraitia grosvenorii GN=UDPG7 PE=2 SV=1)

HSP 1 Score: 642.1 bits (1655), Expect = 5.4e-181
Identity = 332/495 (67.07%), Postives = 386/495 (77.98%), Query Frame = 1

Query: 1   MNKFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPDDIKTTTERIQSLSTS 60
           M KFELVFIP P +GHLA+ VE+AN+LV+RD RL+VT+L IKLP   KT  E IQSLS S
Sbjct: 1   MKKFELVFIPLPVMGHLAAMVEMANILVTRDQRLTVTILVIKLPLYGKTA-EYIQSLSAS 60

Query: 61  FEGKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIH-----DSNRLVG 120
           F  +S+RFI+LPE+  P +S +    ML+AFLES+KP +RE + +L       DS RL G
Sbjct: 61  FASESMRFIILPEVLLPEESEKE--FMLKAFLESYKPIIREAIIDLTDSQMGPDSPRLAG 120

Query: 121 FVIDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNS--TAEQLQNSNVE 180
           FV+DMFCT+MI+VANEF VP Y+F TSNAGFL  SFHLQELY++NNS    +QLQNSN E
Sbjct: 121 FVLDMFCTTMIDVANEFGVPSYVFCTSNAGFLALSFHLQELYDENNSKEVVKQLQNSNAE 180

Query: 181 LALPSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMS- 240
           +ALPSF+NPIP K IP    + D A+WFHD  +R+RS VKGILINTF ++E  ++  MS 
Sbjct: 181 IALPSFVNPIPGKMIPDIFSNDDTASWFHDQVERYRSGVKGILINTFAKLESHVMNSMSR 240

Query: 241 NGSSKIPKVYTVGPILQLKSIGVTQSNNALSGADILKWLDDQPPASVVFLCFGSKGSFDE 300
           + SS+ P +Y++GPIL LK+         L   DILKWLD+QPP SVVFLCFGS GSFDE
Sbjct: 241 SSSSRAPPLYSIGPILHLKNNNTVGPGGTLHCTDILKWLDNQPPVSVVFLCFGSMGSFDE 300

Query: 301 DQVLEIARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGW 360
           DQV EIA ALERS VRFLWSLRQPPPK KFE PS Y +I  VLPEGFL RTA IGRVIGW
Sbjct: 301 DQVKEIAHALERSGVRFLWSLRQPPPKDKFEAPSEYTDIKYVLPEGFLERTAGIGRVIGW 360

Query: 361 APQIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVEL 420
           APQ+EIL+HPATGGF+SHCGWNSTLES+WHGVPMATWPLYAEQQF AFEMVVELGLAV++
Sbjct: 361 APQVEILAHPATGGFVSHCGWNSTLESMWHGVPMATWPLYAEQQFTAFEMVVELGLAVDI 420

Query: 421 TLDYVKDFHIGRSRIVSAEEIESGIRKLMGDSGNEIRKKVKVKGEESRKSMMEGGSSFNS 480
           TLDY K  H  RSR+VSAEEI+SGIRKLM + G E+RKKVK K EESRKS+MEGGSSF S
Sbjct: 421 TLDYQKHPHGERSRVVSAEEIQSGIRKLM-EEGGEMRKKVKAKSEESRKSLMEGGSSFIS 480

Query: 481 LRHFIDDALTNLQEG 488
           L  FIDD L N  EG
Sbjct: 481 LGRFIDDVLGNGPEG 491

BLAST of CSPI04G22070 vs. TrEMBL
Match: A0A0A0L1T2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G618540 PE=4 SV=1)

HSP 1 Score: 568.5 bits (1464), Expect = 7.7e-159
Identity = 293/454 (64.54%), Postives = 349/454 (76.87%), Query Frame = 1

Query: 4   FELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPDDIKTTTERIQSLSTSFEG 63
           FEL+FIP PGIGHLASTVE+ANVLV+RD RLSVT+LA+KLP D+K   E I+SLSTSF G
Sbjct: 2   FELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVA-ECIESLSTSFAG 61

Query: 64  KSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIH------DSNRLVGFV 123
           K+I+F VLPE P P +S +         +ES+KP+VRE+V+NL        DS RLVG V
Sbjct: 62  KNIQFNVLPEPPLPEESKKD----FIVLVESYKPYVREVVSNLTASAATSIDSPRLVGLV 121

Query: 124 IDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQN--NSTAEQLQNS-NVEL 183
           IDMFCT+MI+V NEF VPCY+FYT +A FL FS +LQELY +N  N   EQL NS NVEL
Sbjct: 122 IDMFCTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVEL 181

Query: 184 ALPSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNG 243
            LP+F+NPIP+K IP    +KD A WFH++ KRFR E+KGILINTF EME  + K   + 
Sbjct: 182 TLPNFVNPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAK---SY 241

Query: 244 SSKIPKVYTVGPILQLKSIGVTQSNNALSGADIL-KWLDDQPPASVVFLCFGSKGSFDED 303
           S  +P +Y VGP+L LK+ GV  S+ A + ADI+ KWLDDQPP+SVV +CFG+  SFDE 
Sbjct: 242 SQVLPPLYFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEA 301

Query: 304 QVLEIARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWA 363
           QV EIA ALE S VRF+WSLRQPPPKGKFE P NY +I + LPEGFL+RT  IGRVIGW 
Sbjct: 302 QVAEIANALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWT 361

Query: 364 PQIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELT 423
            Q+EIL+HPA GGFISHCGWNS LESVWHGV +ATWP++AEQQFNAFEMVVELGLAVE+T
Sbjct: 362 SQVEILAHPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVT 421

Query: 424 LDYVKDFHIGRSRIVSAEEIESGIRKLMGDSGNE 448
           LDY   F   + R+VSAEEI+SGI+KLMG+  NE
Sbjct: 422 LDYRITFGEDKPRLVSAEEIKSGIKKLMGEESNE 447

BLAST of CSPI04G22070 vs. TrEMBL
Match: A0A0A0KZV7_CUCSA (Glycosyltransferase OS=Cucumis sativus GN=Csa_4G618530 PE=3 SV=1)

HSP 1 Score: 554.3 bits (1427), Expect = 1.5e-154
Identity = 275/445 (61.80%), Postives = 342/445 (76.85%), Query Frame = 1

Query: 52  ERIQSLSTSFEGKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIH--- 111
           + IQ LS SF GKSI  I+LPELP P +     P   Q  +E +KPHVRE + N ++   
Sbjct: 2   DHIQQLSASFVGKSIHLILLPELPLPQECQNGMP---QLLIEIYKPHVREAMANQVNSQT 61

Query: 112 --DSNRLVGFVIDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNS--TA 171
             D  +LVGFV+DMFC +M++VA EFKVPCYLFYTS+A FL  +FHLQELY+QNNS    
Sbjct: 62  SPDFPQLVGFVLDMFCMTMVDVAKEFKVPCYLFYTSSAAFLALNFHLQELYDQNNSNRVV 121

Query: 172 EQLQNSNVE-LALPSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEM 231
           EQL+NS  E L +PSF+NPIP K IP      DMA W ++NT++FRSE+KGILINT  E+
Sbjct: 122 EQLKNSESESLTIPSFVNPIPGKVIPSIFVYNDMAVWLYENTRKFRSEIKGILINTCAEI 181

Query: 232 EPQIVKWMSNG-SSKIPKVYTVGPILQLKSIGVTQSNNALSGADILKWLDDQPPASVVFL 291
           E  +V  MS+G SS++P +Y VGPIL L+        N ++  +ILKWLDDQP ASV+FL
Sbjct: 182 ESHVVNMMSSGPSSQVPSLYCVGPILNLE--------NTVNRVNILKWLDDQPQASVIFL 241

Query: 292 CFGSKGSFDEDQVLEIARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNR 351
           CFGS GSFDE+QV EIA+ LERS V FLWSLRQPPPKGK+  PS+YA+I DVLPE FL+ 
Sbjct: 242 CFGSMGSFDEEQVKEIAQGLERSGVHFLWSLRQPPPKGKWVAPSDYADIKDVLPERFLDP 301

Query: 352 TADIGRVIGWAPQIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEM 411
           TA++G++IGWAPQ+EIL+HP+ GGF+SHCGWNSTLES+W+GVPM  WP+YAEQQ NAF+M
Sbjct: 302 TANVGKIIGWAPQVEILAHPSIGGFVSHCGWNSTLESLWYGVPMVAWPMYAEQQLNAFQM 361

Query: 412 VVELGLAVELTLDYVKDFHIGRSRIVSAEEIESGIRKLMGDSGNEIRKKVKVKGEESRKS 471
           VVELGLAVE+TLDY KD+ + RS++V+AEEIESGIRK+M D G+EIRK+VK + EE RK+
Sbjct: 362 VVELGLAVEITLDYQKDYRLERSKLVTAEEIESGIRKVM-DDGDEIRKQVKAESEEVRKA 421

Query: 472 MMEGGSSFNSLRHFIDDALTNLQEG 488
           +MEGGSS+ SL HFI+D L N   G
Sbjct: 422 VMEGGSSYISLVHFINDVLVNSSNG 434

BLAST of CSPI04G22070 vs. TrEMBL
Match: A0A0A0L341_CUCSA (Glycosyltransferase OS=Cucumis sativus GN=Csa_4G620550 PE=3 SV=1)

HSP 1 Score: 553.9 bits (1426), Expect = 2.0e-154
Identity = 284/495 (57.37%), Postives = 364/495 (73.54%), Query Frame = 1

Query: 1   MNKFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPDDIKTTTERIQSLSTS 60
           M KFELVFIP PG GHLAS VE+AN L++RD RL+VT++A KLP D K   E IQSLS  
Sbjct: 1   MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKAN-EYIQSLSAQ 60

Query: 61  F--EGKSIRFIVLPELP-FPNQSSEPPPLMLQAFLESHKPHVRE-IVTNLIHDSNRLVGF 120
                 SI+FIVLPELP  PN  +      L+  LES+KPHV++ +++ L   +N L GF
Sbjct: 61  SLTNNNSIQFIVLPELPDIPNNGNR---FFLEVVLESYKPHVKQALISFLTTSTNHLAGF 120

Query: 121 VIDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAE---QLQNSNVE 180
           V+D FC++M++VANEFKVP Y++YTS A +L FS HL++LY Q+NS+ E   QL++S+V 
Sbjct: 121 VLDSFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVN 180

Query: 181 LALPSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSN 240
           L++PS +N +P+K IP   F  + A WFH+  KR R +VKG+LINTF E+E   +  +S 
Sbjct: 181 LSVPSLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLST 240

Query: 241 GSS-KIPKVYTVGPILQLKSIGVTQSNNALSGADILKWLDDQPPASVVFLCFGSKGSFDE 300
            SS ++P +Y+VGP+L L      ++   +   D+LKWLDDQP +SVVFLCFGS+G+F +
Sbjct: 241 DSSLQLPPLYSVGPVLHLN-----KNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKK 300

Query: 301 DQVLEIARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGW 360
           DQV EIARALERS VRF+WSLR+  P   F+   +Y N  D+LP+GFL+RT +IGRVI W
Sbjct: 301 DQVEEIARALERSRVRFIWSLRR--PGNVFQSSIDYTNFEDILPKGFLDRTQNIGRVISW 360

Query: 361 APQIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVEL 420
           APQ+EIL HPATGGF+SHCGWNSTLES+WHGVPMATWP+YAEQQFNAF++VVELGLAVE+
Sbjct: 361 APQVEILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEI 420

Query: 421 TLDYVKDFHIGRSRIVSAEEIESGIRKLMGDSGNEIRKKVKVKGEESRKSMMEGGSSFNS 480
            + Y  +     + I+ AEEIE GIRKLM ++ NEIRKKVK K EE RKS++EGGSSF S
Sbjct: 421 KISYCIELKEQANPIIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRKSVIEGGSSFIS 480

Query: 481 LRHFIDDALTNLQEG 488
           L  FIDD L+N   G
Sbjct: 481 LGKFIDDVLSNSTTG 484

BLAST of CSPI04G22070 vs. TAIR10
Match: AT3G21760.1 (AT3G21760.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 440.7 bits (1132), Expect = 1.2e-123
Identity = 239/491 (48.68%), Postives = 313/491 (63.75%), Query Frame = 1

Query: 3   KFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPDDIKTTTER--IQSLSTS 62
           K ELVFIP PG GHL   VE+A + V RDD LS+T++ I       ++     I SLS+ 
Sbjct: 2   KLELVFIPSPGDGHLRPLVEVAKLHVDRDDHLSITIIIIPQMHGFSSSNSSSYIASLSSD 61

Query: 63  FEGK-SIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIHDS-----NRLV 122
            E + S   + +P+ P    S +  P     ++++ KP V+  V  L         +RL 
Sbjct: 62  SEERLSYNVLSVPDKP---DSDDTKPHFFD-YIDNFKPQVKATVEKLTDPGPPDSPSRLA 121

Query: 123 GFVIDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNV-E 182
           GFV+DMFC  MI+VANEF VP Y+FYTSNA FL    H++ LY+  N     L++S+  E
Sbjct: 122 GFVVDMFCMMMIDVANEFGVPSYMFYTSNATFLGLQVHVEYLYDVKNYDVSDLKDSDTTE 181

Query: 183 LALPSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSN 242
           L +P    P+P K  P  L  K+        T+RFR E KGIL+NTF E+EPQ +K+ S 
Sbjct: 182 LEVPCLTRPLPVKCFPSVLLTKEWLPVMFRQTRRFR-ETKGILVNTFAELEPQAMKFFSG 241

Query: 243 GSSKIPKVYTVGPILQLKSIGVTQSNNALSGADILKWLDDQPPASVVFLCFGSKGSFDED 302
             S +P VYTVGP++ LK  G   S++  S  +IL+WLD+QP  SVVFLCFGS G F E 
Sbjct: 242 VDSPLPTVYTVGPVMNLKINGPNSSDDKQS--EILRWLDEQPRKSVVFLCFGSMGGFREG 301

Query: 303 QVLEIARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWA 362
           Q  EIA ALERS  RF+WSLR+  PKG    P  + N+ ++LPEGFL RTA+IG+++GWA
Sbjct: 302 QAKEIAIALERSGHRFVWSLRRAQPKGSIGPPEEFTNLEEILPEGFLERTAEIGKIVGWA 361

Query: 363 PQIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELT 422
           PQ  IL++PA GGF+SHCGWNSTLES+W GVPMATWPLYAEQQ NAFEMV ELGLAVE+ 
Sbjct: 362 PQSAILANPAIGGFVSHCGWNSTLESLWFGVPMATWPLYAEQQVNAFEMVEELGLAVEVR 421

Query: 423 LDYVKDFHIGRSRIVSAEEIESGIRKLMGDSGNEIRKKVKVKGEESRKSMMEGGSSFNSL 482
             +  DF      +++AEEIE GIR LM +  +++R +VK   E+S  ++M+GGSS  +L
Sbjct: 422 NSFRGDFMAADDELMTAEEIERGIRCLM-EQDSDVRSRVKEMSEKSHVALMDGGSSHVAL 481

Query: 483 RHFIDDALTNL 485
             FI D   N+
Sbjct: 482 LKFIQDVTKNI 484

BLAST of CSPI04G22070 vs. TAIR10
Match: AT3G21780.1 (AT3G21780.1 UDP-glucosyl transferase 71B6)

HSP 1 Score: 436.0 bits (1120), Expect = 3.0e-122
Identity = 238/490 (48.57%), Postives = 322/490 (65.71%), Query Frame = 1

Query: 3   KFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPDDIKTTTERIQSLSTSFE 62
           K ELVFIP P I HL +TVE+A  LV ++D LS+TV+ I         T  I SL+++  
Sbjct: 2   KIELVFIPSPAISHLMATVEMAEQLVDKNDNLSITVIIISFSSK---NTSMITSLTSN-- 61

Query: 63  GKSIRFIVLPELPFPNQSSEPPPLMLQA---FLESHKPHVREIVTNLIH----DSNRLVG 122
              +R+ ++          +  P  L+A    ++S KP VR+ V  L+     D+ RL G
Sbjct: 62  -NRLRYEII-------SGGDQQPTELKATDSHIQSLKPLVRDAVAKLVDSTLPDAPRLAG 121

Query: 123 FVIDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYN-QNNSTAEQLQNSNVEL 182
           FV+DM+CTSMI+VANEF VP YLFYTSNAGFL    H+Q +Y+ ++     +L++S+VEL
Sbjct: 122 FVVDMYCTSMIDVANEFGVPSYLFYTSNAGFLGLLLHIQFMYDAEDIYDMSELEDSDVEL 181

Query: 183 ALPSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNG 242
            +PS  +P P K +P     K+   +F    +RFR E KGIL+NT  ++EPQ + ++SNG
Sbjct: 182 VVPSLTSPYPLKCLPYIFKSKEWLTFFVTQARRFR-ETKGILVNTVPDLEPQALTFLSNG 241

Query: 243 SSKIPKVYTVGPILQLKSIGVTQSNNALSGADILKWLDDQPPASVVFLCFGSKGSFDEDQ 302
           +  IP+ Y VGP+L LK++     +   S  +IL+WLD+QPP SVVFLCFGS G F E+Q
Sbjct: 242 N--IPRAYPVGPLLHLKNVNCDYVDKKQS--EILRWLDEQPPRSVVFLCFGSMGGFSEEQ 301

Query: 303 VLEIARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAP 362
           V E A AL+RS  RFLWSLR+  P    E P  + N+ ++LPEGF +RTA+ G+VIGWA 
Sbjct: 302 VRETALALDRSGHRFLWSLRRASPNILREPPGEFTNLEEILPEGFFDRTANRGKVIGWAE 361

Query: 363 QIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTL 422
           Q+ IL+ PA GGF+SH GWNSTLES+W GVPMA WPLYAEQ+FNAFEMV ELGLAVE+  
Sbjct: 362 QVAILAKPAIGGFVSHGGWNSTLESLWFGVPMAIWPLYAEQKFNAFEMVEELGLAVEIKK 421

Query: 423 DYVKDFHIGRSRIVSAEEIESGIRKLMGDSGNEIRKKVKVKGEESRKSMMEGGSSFNSLR 482
            +  D  +GRS IV+AEEIE GI  LM +  +++RK+V    E+   ++M+GGSS  +L+
Sbjct: 422 HWRGDLLLGRSEIVTAEEIEKGIICLM-EQDSDVRKRVNEISEKCHVALMDGGSSETALK 472

Query: 483 HFIDDALTNL 485
            FI D   N+
Sbjct: 482 RFIQDVTENI 472

BLAST of CSPI04G22070 vs. TAIR10
Match: AT4G15280.1 (AT4G15280.1 UDP-glucosyl transferase 71B5)

HSP 1 Score: 426.8 bits (1096), Expect = 1.8e-119
Identity = 225/483 (46.58%), Postives = 304/483 (62.94%), Query Frame = 1

Query: 3   KFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPDDIKTTTERIQSLSTSFE 62
           K ELVFIP PGIGHL  TV+LA  L+  ++RLS+T++ I    D    +  I SL+T  +
Sbjct: 2   KIELVFIPLPGIGHLRPTVKLAKQLIGSENRLSITIIIIPSRFDAGDASACIASLTTLSQ 61

Query: 63  GKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIHDSNR-LVGFVIDMF 122
              + +  +     P  +S+P P+  Q ++E  K  VR+ V   I D  R L GFV+DMF
Sbjct: 62  DDRLHYESISVAKQP-PTSDPDPVPAQVYIEKQKTKVRDAVAARIVDPTRKLAGFVVDMF 121

Query: 123 CTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELALPSFIN 182
           C+SMI+VANEF VPCY+ YTSNA FL    H+Q++Y+Q      +L+NS  EL  PS   
Sbjct: 122 CSSMIDVANEFGVPCYMVYTSNATFLGTMLHVQQMYDQKKYDVSELENSVTELEFPSLTR 181

Query: 183 PIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNGSSKIPKV 242
           P P K +P  L  K+         + FR ++KGIL+NT  E+EP  +K  +     +P+V
Sbjct: 182 PYPVKCLPHILTSKEWLPLSLAQARCFR-KMKGILVNTVAELEPHALKMFNINGDDLPQV 241

Query: 243 YTVGPILQLKSIGVTQSNNALSGADILKWLDDQPPASVVFLCFGSKGSFDEDQVLEIARA 302
           Y VGP+L L++     +++    ++IL+WLD+QP  SVVFLCFGS G F E+Q  E A A
Sbjct: 242 YPVGPVLHLEN----GNDDDEKQSEILRWLDEQPSKSVVFLCFGSLGGFTEEQTRETAVA 301

Query: 303 LERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAPQIEILSH 362
           L+RS  RFLW LR   P  K + P +Y N+ +VLPEGFL RT D G+VIGWAPQ+ +L  
Sbjct: 302 LDRSGQRFLWCLRHASPNIKTDRPRDYTNLEEVLPEGFLERTLDRGKVIGWAPQVAVLEK 361

Query: 363 PATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLDYVKDFH 422
           PA GGF++HCGWNS LES+W GVPM TWPLYAEQ+ NAFEMV ELGLAVE+      D  
Sbjct: 362 PAIGGFVTHCGWNSILESLWFGVPMVTWPLYAEQKVNAFEMVEELGLAVEIRKYLKGDLF 421

Query: 423 IGRSRIVSAEEIESGIRKLMGDSGNEIRKKVKVKGEESRKSMMEGGSSFNSLRHFIDDAL 482
            G    V+AE+IE  IR++M +  +++R  VK   E+   ++M+GGSS  +L  FI D +
Sbjct: 422 AGEMETVTAEDIERAIRRVM-EQDSDVRNNVKEMAEKCHFALMDGGSSKAALEKFIQDVI 477

Query: 483 TNL 485
            N+
Sbjct: 482 ENM 477

BLAST of CSPI04G22070 vs. TAIR10
Match: AT3G21750.1 (AT3G21750.1 UDP-glucosyl transferase 71B1)

HSP 1 Score: 413.3 bits (1061), Expect = 2.1e-115
Identity = 214/493 (43.41%), Postives = 321/493 (65.11%), Query Frame = 1

Query: 3   KFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAI--KLPDDIKTTTERIQSLSTS 62
           K ELVFIP PG+GH+ +T  LA +LV+ D+RLSVT++ I  ++ DD  +      S+ T+
Sbjct: 2   KVELVFIPSPGVGHIRATTALAKLLVASDNRLSVTLIVIPSRVSDDASS------SVYTN 61

Query: 63  FEGKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIHD-----SNRLVG 122
            E + +R+I+LP     +Q+++     L ++++S KP VR +V+ +  D      +RL G
Sbjct: 62  SEDR-LRYILLPAR---DQTTD-----LVSYIDSQKPQVRAVVSKVAGDVSTRSDSRLAG 121

Query: 123 FVIDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELA 182
            V+DMFCTSMI++A+EF +  Y+FYTSNA +L   FH+Q LY++      + +++ ++  
Sbjct: 122 IVVDMFCTSMIDIADEFNLSAYIFYTSNASYLGLQFHVQSLYDEKELDVSEFKDTEMKFD 181

Query: 183 LPSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMS--N 242
           +P+   P P K +P  + +K    +     + FR+  KGIL+N+  +MEPQ + + S  N
Sbjct: 182 VPTLTQPFPAKCLPSVMLNKKWFPYVLGRARSFRA-TKGILVNSVADMEPQALSFFSGGN 241

Query: 243 GSSKIPKVYTVGPILQLKSIGVTQSNNALSGADILKWLDDQPPASVVFLCFGSKGSFDED 302
           G++ IP VY VGPI+ L+S G  +        +IL WL +QP  SVVFLCFGS G F E+
Sbjct: 242 GNTNIPPVYAVGPIMDLESSGDEEKRK-----EILHWLKEQPTKSVVFLCFGSMGGFSEE 301

Query: 303 QVLEIARALERSEVRFLWSLRQPPPKGKFEEP--SNYANINDVLPEGFLNRTADIGRVIG 362
           Q  EIA ALERS  RFLWSLR+  P G    P    + N+ ++LP+GFL+RT +IG++I 
Sbjct: 302 QAREIAVALERSGHRFLWSLRRASPVGNKSNPPPGEFTNLEEILPKGFLDRTVEIGKIIS 361

Query: 363 WAPQIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVE 422
           WAPQ+++L+ PA G F++HCGWNS LES+W GVPMA WP+YAEQQFNAF MV ELGLA E
Sbjct: 362 WAPQVDVLNSPAIGAFVTHCGWNSILESLWFGVPMAAWPIYAEQQFNAFHMVDELGLAAE 421

Query: 423 LTLDYVKDFHIGRSRIVSAEEIESGIRKLMGDSGNEIRKKVKVKGEESRKSMMEGGSSFN 482
           +  +Y +DF +    IV+A+EIE GI+  M +  +++RK+V    ++   ++++GGSS  
Sbjct: 422 VKKEYRRDFLVEEPEIVTADEIERGIKCAM-EQDSKMRKRVMEMKDKLHVALVDGGSSNC 472

Query: 483 SLRHFIDDALTNL 485
           +L+ F+ D + N+
Sbjct: 482 ALKKFVQDVVDNV 472

BLAST of CSPI04G22070 vs. TAIR10
Match: AT3G21790.1 (AT3G21790.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 407.9 bits (1047), Expect = 8.7e-114
Identity = 226/493 (45.84%), Postives = 318/493 (64.50%), Query Frame = 1

Query: 3   KFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPDDIKT-TTERIQSLSTSF 62
           KFELVFIP PGIGHL STVE+A +LV R+ RLS++V+ +    + +   ++ I +LS S 
Sbjct: 2   KFELVFIPYPGIGHLRSTVEMAKLLVDRETRLSISVIILPFISEGEVGASDYIAALSASS 61

Query: 63  EGKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIHD------SNRLVG 122
             + +R+ V+  +  P          ++  +++ +P VR  V  L+ D      S ++ G
Sbjct: 62  NNR-LRYEVISAVDQPTIEMTT----IEIHMKNQEPKVRSTVAKLLEDYSSKPDSPKIAG 121

Query: 123 FVIDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELA 182
           FV+DMFCTSM++VANEF  P Y+FYTS+AG L  ++H+Q L ++N     +   ++ E  
Sbjct: 122 FVLDMFCTSMVDVANEFGFPSYMFYTSSAGILSVTYHVQMLCDENKYDVSENDYADSEAV 181

Query: 183 L--PSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSN 242
           L  PS   P P K +P  L        F +  ++FR E+KGIL+NT  E+EP ++K++S 
Sbjct: 182 LNFPSLSRPYPVKCLPHALAANMWLPVFVNQARKFR-EMKGILVNTVAELEPYVLKFLS- 241

Query: 243 GSSKIPKVYTVGPILQLKSIGVTQSNNALSGADILKWLDDQPPASVVFLCFGSKGSFDED 302
            SS  P VY VGP+L L++      +      +I++WLD QPP+SVVFLCFGS G F E+
Sbjct: 242 -SSDTPPVYPVGPLLHLEN--QRDDSKDEKRLEIIRWLDQQPPSSVVFLCFGSMGGFGEE 301

Query: 303 QVLEIARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWA 362
           QV EIA ALERS  RFLWSLR+  P    E P  + N+ +VLPEGF +RT DIG+VIGWA
Sbjct: 302 QVREIAIALERSGHRFLWSLRRASPNIFKELPGEFTNLEEVLPEGFFDRTKDIGKVIGWA 361

Query: 363 PQIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELT 422
           PQ+ +L++PA GGF++HCGWNSTLES+W GVP A WPLYAEQ+FNAF MV ELGLAVE+ 
Sbjct: 362 PQVAVLANPAIGGFVTHCGWNSTLESLWFGVPTAAWPLYAEQKFNAFLMVEELGLAVEIR 421

Query: 423 LDYVKDFHIG--RSRIVSAEEIESGIRKLMGDSGNEIRKKVKVKGEESRKSMMEGGSSFN 482
             Y +  H+    +  V+AEEIE  I  LM +  +++RK+VK   E+   ++M+GGSS  
Sbjct: 422 -KYWRGEHLAGLPTATVTAEEIEKAIMCLM-EQDSDVRKRVKDMSEKCHVALMDGGSSRT 481

Query: 483 SLRHFIDDALTNL 485
           +L+ FI++   N+
Sbjct: 482 ALQKFIEEVAKNI 482

BLAST of CSPI04G22070 vs. NCBI nr
Match: gi|449456653|ref|XP_004146063.1| (PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis sativus])

HSP 1 Score: 982.2 bits (2538), Expect = 3.2e-283
Identity = 485/489 (99.18%), Postives = 488/489 (99.80%), Query Frame = 1

Query: 1   MNKFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPDDIKTTTERIQSLSTS 60
           MNKFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLP+DIKTTTERIQSLS S
Sbjct: 1   MNKFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKTTTERIQSLSAS 60

Query: 61  FEGKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIHDSNRLVGFVIDM 120
           FEGKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIHDSNRLVGFVIDM
Sbjct: 61  FEGKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIHDSNRLVGFVIDM 120

Query: 121 FCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELALPSFI 180
           FCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELALPSFI
Sbjct: 121 FCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELALPSFI 180

Query: 181 NPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNGSSKIPK 240
           NPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNGSSKIPK
Sbjct: 181 NPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNGSSKIPK 240

Query: 241 VYTVGPILQLKSIGVTQSNNALSGADILKWLDDQPPASVVFLCFGSKGSFDEDQVLEIAR 300
           VYTVGPILQLKSIGVTQSNNAL+GADILKWLDDQPPASVVFLCFGSKGSFDEDQVLEIAR
Sbjct: 241 VYTVGPILQLKSIGVTQSNNALNGADILKWLDDQPPASVVFLCFGSKGSFDEDQVLEIAR 300

Query: 301 ALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAPQIEILS 360
           ALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAPQIEILS
Sbjct: 301 ALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAPQIEILS 360

Query: 361 HPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLDYVKDF 420
           HPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLDYVKDF
Sbjct: 361 HPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLDYVKDF 420

Query: 421 HIGRSRIVSAEEIESGIRKLMGDSGNEIRKKVKVKGEESRKSMMEGGSSFNSLRHFIDDA 480
           HIGRSRIVSAEEIESGIRKLMGDSGNEIRKK+KVKGEESRKSMMEGGSSFNSLRHFIDDA
Sbjct: 421 HIGRSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKSMMEGGSSFNSLRHFIDDA 480

Query: 481 LTNLQEGNY 490
           LTNLQEGNY
Sbjct: 481 LTNLQEGNY 489

BLAST of CSPI04G22070 vs. NCBI nr
Match: gi|659129348|ref|XP_008464641.1| (PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis melo])

HSP 1 Score: 939.9 bits (2428), Expect = 1.8e-270
Identity = 465/489 (95.09%), Postives = 475/489 (97.14%), Query Frame = 1

Query: 1   MNKFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPDDIKTTTERIQSLSTS 60
           MNKFELVFIPGPGIGHLASTVELANVL SRDDRLSVTVLAIKLP+DIKTT ERIQSLS S
Sbjct: 1   MNKFELVFIPGPGIGHLASTVELANVLASRDDRLSVTVLAIKLPNDIKTT-ERIQSLSAS 60

Query: 61  FEGKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIHDSNRLVGFVIDM 120
           FEGKSIRFIVLPELPFPNQSS PPPLMLQAFLESHKPHVREIVTNL +DSNRLVGFVIDM
Sbjct: 61  FEGKSIRFIVLPELPFPNQSSTPPPLMLQAFLESHKPHVREIVTNLTYDSNRLVGFVIDM 120

Query: 121 FCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELALPSFI 180
           FCTSMINVANEFKVPCYLFYTSNAGFL FSFHLQELYNQNNST EQLQNSNVELALPSFI
Sbjct: 121 FCTSMINVANEFKVPCYLFYTSNAGFLAFSFHLQELYNQNNSTGEQLQNSNVELALPSFI 180

Query: 181 NPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNGSSKIPK 240
           NPIP+KAIPPFLFDKDMA WFHDNTKRFRS VKGILINTFVEMEPQ++KWMSNGSSKIPK
Sbjct: 181 NPIPSKAIPPFLFDKDMAVWFHDNTKRFRSGVKGILINTFVEMEPQMIKWMSNGSSKIPK 240

Query: 241 VYTVGPILQLKSIGVTQSNNALSGADILKWLDDQPPASVVFLCFGSKGSFDEDQVLEIAR 300
           VYTVGPILQLKSIGVTQ NNAL+GADILKWLDDQPPASVVFLCFGSKGSFDEDQVLEIAR
Sbjct: 241 VYTVGPILQLKSIGVTQCNNALNGADILKWLDDQPPASVVFLCFGSKGSFDEDQVLEIAR 300

Query: 301 ALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAPQIEILS 360
           ALERSEVRF+WSLRQPPPKGKFEEPSNYA+INDVLPEGFLNRTADIGRVIGWAPQIEILS
Sbjct: 301 ALERSEVRFIWSLRQPPPKGKFEEPSNYADINDVLPEGFLNRTADIGRVIGWAPQIEILS 360

Query: 361 HPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLDYVKDF 420
           HPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLDYVKDF
Sbjct: 361 HPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLDYVKDF 420

Query: 421 HIGRSRIVSAEEIESGIRKLMGDSGNEIRKKVKVKGEESRKSMMEGGSSFNSLRHFIDDA 480
           HIGRSR+VSAEEIESGIRKLMGD GNEIRKKVKVKGEESRKSMM GGSSFNSL HFIDDA
Sbjct: 421 HIGRSRVVSAEEIESGIRKLMGDYGNEIRKKVKVKGEESRKSMMVGGSSFNSLDHFIDDA 480

Query: 481 LTNLQEGNY 490
           L NL+EGNY
Sbjct: 481 LANLEEGNY 488

BLAST of CSPI04G22070 vs. NCBI nr
Match: gi|343466221|gb|AEM43004.1| (UDP-glucosyltransferase [Siraitia grosvenorii])

HSP 1 Score: 642.1 bits (1655), Expect = 7.8e-181
Identity = 332/495 (67.07%), Postives = 386/495 (77.98%), Query Frame = 1

Query: 1   MNKFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPDDIKTTTERIQSLSTS 60
           M KFELVFIP P +GHLA+ VE+AN+LV+RD RL+VT+L IKLP   KT  E IQSLS S
Sbjct: 1   MKKFELVFIPLPVMGHLAAMVEMANILVTRDQRLTVTILVIKLPLYGKTA-EYIQSLSAS 60

Query: 61  FEGKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIH-----DSNRLVG 120
           F  +S+RFI+LPE+  P +S +    ML+AFLES+KP +RE + +L       DS RL G
Sbjct: 61  FASESMRFIILPEVLLPEESEKE--FMLKAFLESYKPIIREAIIDLTDSQMGPDSPRLAG 120

Query: 121 FVIDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNS--TAEQLQNSNVE 180
           FV+DMFCT+MI+VANEF VP Y+F TSNAGFL  SFHLQELY++NNS    +QLQNSN E
Sbjct: 121 FVLDMFCTTMIDVANEFGVPSYVFCTSNAGFLALSFHLQELYDENNSKEVVKQLQNSNAE 180

Query: 181 LALPSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMS- 240
           +ALPSF+NPIP K IP    + D A+WFHD  +R+RS VKGILINTF ++E  ++  MS 
Sbjct: 181 IALPSFVNPIPGKMIPDIFSNDDTASWFHDQVERYRSGVKGILINTFAKLESHVMNSMSR 240

Query: 241 NGSSKIPKVYTVGPILQLKSIGVTQSNNALSGADILKWLDDQPPASVVFLCFGSKGSFDE 300
           + SS+ P +Y++GPIL LK+         L   DILKWLD+QPP SVVFLCFGS GSFDE
Sbjct: 241 SSSSRAPPLYSIGPILHLKNNNTVGPGGTLHCTDILKWLDNQPPVSVVFLCFGSMGSFDE 300

Query: 301 DQVLEIARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGW 360
           DQV EIA ALERS VRFLWSLRQPPPK KFE PS Y +I  VLPEGFL RTA IGRVIGW
Sbjct: 301 DQVKEIAHALERSGVRFLWSLRQPPPKDKFEAPSEYTDIKYVLPEGFLERTAGIGRVIGW 360

Query: 361 APQIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVEL 420
           APQ+EIL+HPATGGF+SHCGWNSTLES+WHGVPMATWPLYAEQQF AFEMVVELGLAV++
Sbjct: 361 APQVEILAHPATGGFVSHCGWNSTLESMWHGVPMATWPLYAEQQFTAFEMVVELGLAVDI 420

Query: 421 TLDYVKDFHIGRSRIVSAEEIESGIRKLMGDSGNEIRKKVKVKGEESRKSMMEGGSSFNS 480
           TLDY K  H  RSR+VSAEEI+SGIRKLM + G E+RKKVK K EESRKS+MEGGSSF S
Sbjct: 421 TLDYQKHPHGERSRVVSAEEIQSGIRKLM-EEGGEMRKKVKAKSEESRKSLMEGGSSFIS 480

Query: 481 LRHFIDDALTNLQEG 488
           L  FIDD L N  EG
Sbjct: 481 LGRFIDDVLGNGPEG 491

BLAST of CSPI04G22070 vs. NCBI nr
Match: gi|449456657|ref|XP_004146065.1| (PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis sativus])

HSP 1 Score: 619.0 bits (1595), Expect = 7.1e-174
Identity = 321/494 (64.98%), Postives = 379/494 (76.72%), Query Frame = 1

Query: 4   FELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPDDIKTTTERIQSLSTSFEG 63
           FEL+FIP PGIGHLASTVE+ANVLV+RD RLSVT+LA+KLP D+K   E I+SLSTSF G
Sbjct: 2   FELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVA-ECIESLSTSFAG 61

Query: 64  KSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIH------DSNRLVGFV 123
           K+I+F VLPE P P +S +         +ES+KP+VRE+V+NL        DS RLVG V
Sbjct: 62  KNIQFNVLPEPPLPEESKKD----FIVLVESYKPYVREVVSNLTASAATSIDSPRLVGLV 121

Query: 124 IDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQN--NSTAEQLQNS-NVEL 183
           IDMFCT+MI+V NEF VPCY+FYT +A FL FS +LQELY +N  N   EQL NS NVEL
Sbjct: 122 IDMFCTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVEL 181

Query: 184 ALPSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNG 243
            LP+F+NPIP+K IP    +KD A WFH++ KRFR E+KGILINTF EME  + K   + 
Sbjct: 182 TLPNFVNPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAK---SY 241

Query: 244 SSKIPKVYTVGPILQLKSIGVTQSNNALSGADIL-KWLDDQPPASVVFLCFGSKGSFDED 303
           S  +P +Y VGP+L LK+ GV  S+ A + ADI+ KWLDDQPP+SVV +CFG+  SFDE 
Sbjct: 242 SQVLPPLYFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEA 301

Query: 304 QVLEIARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWA 363
           QV EIA ALE S VRF+WSLRQPPPKGKFE P NY +I + LPEGFL+RT  IGRVIGW 
Sbjct: 302 QVAEIANALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWT 361

Query: 364 PQIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELT 423
            Q+EIL+HPA GGFISHCGWNS LESVWHGV +ATWP++AEQQFNAFEMVVELGLAVE+T
Sbjct: 362 SQVEILAHPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVT 421

Query: 424 LDYVKDFHIGRSRIVSAEEIESGIRKLMGDSGNEIRKKVKVKGEESRKSMMEGGSSFNSL 483
           LDY   F   + R+VSAEEI+SGI+KLMG+  NE+RKKVK K EESRKS+MEGGSSF SL
Sbjct: 422 LDYRITFGEDKPRLVSAEEIKSGIKKLMGEESNEVRKKVKAKSEESRKSVMEGGSSFVSL 481

Query: 484 RHFIDDALTNLQEG 488
             FIDD L N   G
Sbjct: 482 GKFIDDVLANSAGG 487

BLAST of CSPI04G22070 vs. NCBI nr
Match: gi|659129338|ref|XP_008464636.1| (PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis melo])

HSP 1 Score: 610.9 bits (1574), Expect = 1.9e-171
Identity = 316/495 (63.84%), Postives = 376/495 (75.96%), Query Frame = 1

Query: 4   FELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPDDIKTTTERIQSLSTSFEG 63
           FEL+FIP PGIGHLASTVE+ANVLV+RD RLSVT+LA+KLP D+K   E I+SLSTSF G
Sbjct: 2   FELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVA-ECIESLSTSFAG 61

Query: 64  KSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIH------DSNRLVGFV 123
           K+I+F VLPE P P +S +         +ES+KP+VRE V+N         DS RLVG V
Sbjct: 62  KNIQFNVLPEPPLPEESKKD----FIVLVESYKPYVREAVSNFTASAATSLDSPRLVGLV 121

Query: 124 IDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQN--NSTAEQLQNS-NVEL 183
           IDMFCT+MI+V NEF VPCY+FYT +A FL FS +LQELY +N  N   EQL NS NVEL
Sbjct: 122 IDMFCTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVEL 181

Query: 184 ALPSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNG 243
            LP+F NPIP+K IP    +KD A WFH++ KRFR E+KGILINTF EME    K   + 
Sbjct: 182 TLPNFANPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHAAK---SY 241

Query: 244 SSKIPKVYTVGPILQLKSIGVTQSNNALSGADIL-KWLDDQPPASVVFLCFGSKGSFDED 303
           S  +P +Y VGP+L LK+ GV  S+ A   ADI+ KWLDDQPP+SVV +CFG+  SFDE 
Sbjct: 242 SQVLPPLYFVGPVLHLKNAGVAGSSEAQDNADIIMKWLDDQPPSSVVLVCFGTMVSFDEA 301

Query: 304 QVLEIARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWA 363
           QV EIA ALE S VRF+WSLRQPPPKGKFE P NY ++ + LPEGFL+RT  IGRVIGW 
Sbjct: 302 QVAEIANALEESGVRFIWSLRQPPPKGKFEAPRNYNDVKNFLPEGFLDRTMSIGRVIGWT 361

Query: 364 PQIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELT 423
            Q+EIL+HPA GGF+SHCGWNS LESVWHGVP+ATWP++AEQQFNAFEMVVELGLAVE+T
Sbjct: 362 SQVEILAHPAIGGFVSHCGWNSILESVWHGVPIATWPMHAEQQFNAFEMVVELGLAVEVT 421

Query: 424 LDYVKDFHIGRSRIVSAEEIESGIRKLMGDSGNEIRKKVKVKGEESRKSMMEGGSSFNSL 483
           LDY   F   + R+VSAEE++SGI+KLMG+  +E+RKKVK K EES+KS+MEGGSSF SL
Sbjct: 422 LDYRITFGEDKPRLVSAEEVKSGIKKLMGEESDEVRKKVKAKSEESQKSVMEGGSSFISL 481

Query: 484 RHFIDDALTNLQEGN 489
             FIDD L N   G+
Sbjct: 482 GKFIDDVLANSTGGS 488

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
U7A16_PYRCO1.6e-13352.17UDP-glycosyltransferase 71A16 OS=Pyrus communis GN=UGT71A16 PE=1 SV=1[more]
UFOG3_FRAAN3.6e-13353.14Putative UDP-glucose flavonoid 3-O-glucosyltransferase 3 OS=Fragaria ananassa GN... [more]
U7A15_MALDO2.0e-13151.76UDP-glycosyltransferase 71A15 OS=Malus domestica GN=UGT71A15 PE=1 SV=1[more]
UFOG6_FRAAN1.3e-13050.82UDP-glucose flavonoid 3-O-glucosyltransferase 6 OS=Fragaria ananassa GN=GT6 PE=1... [more]
U71E1_STERE1.1e-12350.62UDP-glycosyltransferase 71E1 OS=Stevia rebaudiana GN=UGT71E1 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0L321_CUCSA2.2e-28399.18Glycosyltransferase OS=Cucumis sativus GN=Csa_4G618520 PE=3 SV=1[more]
K7NBW4_SIRGR5.4e-18167.07Glycosyltransferase OS=Siraitia grosvenorii GN=UDPG7 PE=2 SV=1[more]
A0A0A0L1T2_CUCSA7.7e-15964.54Uncharacterized protein OS=Cucumis sativus GN=Csa_4G618540 PE=4 SV=1[more]
A0A0A0KZV7_CUCSA1.5e-15461.80Glycosyltransferase OS=Cucumis sativus GN=Csa_4G618530 PE=3 SV=1[more]
A0A0A0L341_CUCSA2.0e-15457.37Glycosyltransferase OS=Cucumis sativus GN=Csa_4G620550 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT3G21760.11.2e-12348.68 UDP-Glycosyltransferase superfamily protein[more]
AT3G21780.13.0e-12248.57 UDP-glucosyl transferase 71B6[more]
AT4G15280.11.8e-11946.58 UDP-glucosyl transferase 71B5[more]
AT3G21750.12.1e-11543.41 UDP-glucosyl transferase 71B1[more]
AT3G21790.18.7e-11445.84 UDP-Glycosyltransferase superfamily protein[more]
Match NameE-valueIdentityDescription
gi|449456653|ref|XP_004146063.1|3.2e-28399.18PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis sativus][more]
gi|659129348|ref|XP_008464641.1|1.8e-27095.09PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis melo][more]
gi|343466221|gb|AEM43004.1|7.8e-18167.07UDP-glucosyltransferase [Siraitia grosvenorii][more]
gi|449456657|ref|XP_004146065.1|7.1e-17464.98PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis sativus][more]
gi|659129338|ref|XP_008464636.1|1.9e-17163.84PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis melo][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002213UDP_glucos_trans
Vocabulary: Molecular Function
TermDefinition
GO:0016758transferase activity, transferring hexosyl groups
Vocabulary: Biological Process
TermDefinition
GO:0008152metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016758 transferase activity, transferring hexosyl groups

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI04G22070.1CSPI04G22070.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePANTHERPTHR11926GLUCOSYL/GLUCURONOSYL TRANSFERASEScoord: 1..485
score: 4.3E
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePFAMPF00201UDPGTcoord: 277..412
score: 8.0
IPR002213UDP-glucuronosyl/UDP-glucosyltransferasePROSITEPS00375UDPGTcoord: 352..395
scor
NoneNo IPR availableunknownCoilCoilcoord: 153..173
scor
NoneNo IPR availableGENE3DG3DSA:3.40.50.2000coord: 265..413
score: 1.3
NoneNo IPR availablePANTHERPTHR11926:SF242UDP-GLYCOSYLTRANSFERASE 71B2-RELATEDcoord: 1..485
score: 4.3E
NoneNo IPR availableunknownSSF53756UDP-Glycosyltransferase/glycogen phosphorylasecoord: 2..480
score: 1.33E