CSPI04G22120 (gene) Wild cucumber (PI 183967)

Name: CSPI04G22120
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Glycosyltransferase
Location: Chr4: 20533115..20534724 (-)
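The length of the feature can be derived from the location coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for pages of this kind, but not stated here); the function name `feature_span` is hypothetical.

```python
def feature_span(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1

# Coordinates copied from the Location field of this record.
span = feature_span(20533115, 20534724)
print(span)
```

Under that assumption the value is the length of the gene sequence including the intron. The strand sign (-) does not change the length.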
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGAAATTTGAGCTTGTTTTCATACCAATACCGGGGTCTGGTCACCTTGCTTCCATGGTTGAGATGGCAAATACTCTCCTCGCTCGAGATCATCGTCTTGCTGTCACAATGATTGCCTTTAAGCTACCCCTTGATCCCAAAGCTAATGAATATAATCAATCCCTTTCTGCACAGTCTCTTACCAACAACAACTCCATACAATTCATTGTCCTTCCTGAATTACCTGATATCCCAAACAATGGGAACCGTTTCTTCCTGGAAGTAGTTCTTGAAAGCTACAAACCCCATGTCAAACAAGCTCTTATCTCCTTTCTTACTACCTCCACCAACCATCTTGCTGGATTCGTGTTGGACTCGTTCTGCTCAACCATGGTTGATGTAGCTAATGAATTTAAGGTCCCTTCTTATGTGTACTACACTTCTTGTGCTGCCTATCTTGCTTTTAGCTTACATCTTGAACAACTCTACACACAAGATAATAGTAGTAATGAGGTAATTCAACAATTGAAGGATTCAGATGTTAATTTGAGTGTACCAAGTTTAGTGAATCAAGTTCCAAGTAAAACCATTCCAAGTGTCTTCTTTATTAACAATTTTGCTGTTTGGTTTCATGAACAAGCTAAAAGAATTAGATTTGATGTAAAAGGTGTTCTTATCAATACATTTGAGGAGCTGGAATCACATGCGTTATCTTCTTTGTCAACTGACTCCTCTTTGCAACTCCCACCTTTGTATTCTGTCGGACCTGTTTTGCACTTGAACAAGAACACTGAGACTATGGATGATGGAGATGTGTTGAAGTGGCTTGATGATCAACCACTTTCATCGGTGGTGTTTTTGTGCTTTGGAAGTAGAGGAGCTTTCAAAAAGGATCAGGTGGAGGAGATTGCACGAGCGCTTGAGAGAAGTAGAGTTCGTTTCATTTGGTCTCTTCGACGACCAGGGAATGTGTTTCAATCATCAATCGACTATACAAATTTTGAAGACATCTTACCTAAGGGATTTCTTGATCGAACAGAGAACATTGGGAGAGTCATCAGCTGGGCACCGCAAGTGGAGATATTAGGCCATCCAGCCACAGGTAAAGATCCTACAACTTGGATTTAGAAGTGAAAATATCTTAAGTGTATATATATGTATATATAAGAAAACATCCTTCAAATACAATTTTTTATTGATGTCTTTGTAAAATTGAAACACTAATACAGATATTTTAATTCTTGGTCATAGGTGGGTTCGTATCACATTGTGGTTGGAACTCGACGTTGGAAAGTTTGTGGCATGGCGTGCCGATGGCAACATGGCCAATGTATGCAGAGCAACAATTCAACGCATTTGATCTGGTGGTAGAATTGGGATTGGCTGTGGAGATCAAGATAAGTTATTGTATTGAACTTAAAGAACAAGCCAACCCAATAATAATGGCAGAAGAGATAGAAAGAGGAATTAGAAAGTTGATGGACAACAACAATAATGAGATAAGGAAGAAAGTGAAAACAAAAAGTGAAGAATGCAGAAAAAGTGTAATAGAAGGTGGATCCTCTTTCATCTCATTAGGAAAATTTATTGATGATGTTTTGAGCAACTCTACAACAGGAGGAAACTAA

mRNA sequence

ATGAAGAAATTTGAGCTTGTTTTCATACCAATACCGGGGTCTGGTCACCTTGCTTCCATGGTTGAGATGGCAAATACTCTCCTCGCTCGAGATCATCGTCTTGCTGTCACAATGATTGCCTTTAAGCTACCCCTTGATCCCAAAGCTAATGAATATAATCAATCCCTTTCTGCACAGTCTCTTACCAACAACAACTCCATACAATTCATTGTCCTTCCTGAATTACCTGATATCCCAAACAATGGGAACCGTTTCTTCCTGGAAGTAGTTCTTGAAAGCTACAAACCCCATGTCAAACAAGCTCTTATCTCCTTTCTTACTACCTCCACCAACCATCTTGCTGGATTCGTGTTGGACTCGTTCTGCTCAACCATGGTTGATGTAGCTAATGAATTTAAGGTCCCTTCTTATGTGTACTACACTTCTTGTGCTGCCTATCTTGCTTTTAGCTTACATCTTGAACAACTCTACACACAAGATAATAGTAGTAATGAGGTAATTCAACAATTGAAGGATTCAGATGTTAATTTGAGTGTACCAAGTTTAGTGAATCAAGTTCCAAGTAAAACCATTCCAAGTGTCTTCTTTATTAACAATTTTGCTGTTTGGTTTCATGAACAAGCTAAAAGAATTAGATTTGATGTAAAAGGTGTTCTTATCAATACATTTGAGGAGCTGGAATCACATGCGTTATCTTCTTTGTCAACTGACTCCTCTTTGCAACTCCCACCTTTGTATTCTGTCGGACCTGTTTTGCACTTGAACAAGAACACTGAGACTATGGATGATGGAGATGTGTTGAAGTGGCTTGATGATCAACCACTTTCATCGGTGGTGTTTTTGTGCTTTGGAAGTAGAGGAGCTTTCAAAAAGGATCAGGTGGAGGAGATTGCACGAGCGCTTGAGAGAAGTAGAGTTCGTTTCATTTGGTCTCTTCGACGACCAGGGAATGTGTTTCAATCATCAATCGACTATACAAATTTTGAAGACATCTTACCTAAGGGATTTCTTGATCGAACAGAGAACATTGGGAGAGTCATCAGCTGGGCACCGCAAGTGGAGATATTAGGCCATCCAGCCACAGGTGGGTTCGTATCACATTGTGGTTGGAACTCGACGTTGGAAAGTTTGTGGCATGGCGTGCCGATGGCAACATGGCCAATGTATGCAGAGCAACAATTCAACGCATTTGATCTGGTGGTAGAATTGGGATTGGCTGTGGAGATCAAGATAAGTTATTGTATTGAACTTAAAGAACAAGCCAACCCAATAATAATGGCAGAAGAGATAGAAAGAGGAATTAGAAAGTTGATGGACAACAACAATAATGAGATAAGGAAGAAAGTGAAAACAAAAAGTGAAGAATGCAGAAAAAGTGTAATAGAAGGTGGATCCTCTTTCATCTCATTAGGAAAATTTATTGATGATGTTTTGAGCAACTCTACAACAGGAGGAAACTAA

Coding sequence (CDS)

ATGAAGAAATTTGAGCTTGTTTTCATACCAATACCGGGGTCTGGTCACCTTGCTTCCATGGTTGAGATGGCAAATACTCTCCTCGCTCGAGATCATCGTCTTGCTGTCACAATGATTGCCTTTAAGCTACCCCTTGATCCCAAAGCTAATGAATATAATCAATCCCTTTCTGCACAGTCTCTTACCAACAACAACTCCATACAATTCATTGTCCTTCCTGAATTACCTGATATCCCAAACAATGGGAACCGTTTCTTCCTGGAAGTAGTTCTTGAAAGCTACAAACCCCATGTCAAACAAGCTCTTATCTCCTTTCTTACTACCTCCACCAACCATCTTGCTGGATTCGTGTTGGACTCGTTCTGCTCAACCATGGTTGATGTAGCTAATGAATTTAAGGTCCCTTCTTATGTGTACTACACTTCTTGTGCTGCCTATCTTGCTTTTAGCTTACATCTTGAACAACTCTACACACAAGATAATAGTAGTAATGAGGTAATTCAACAATTGAAGGATTCAGATGTTAATTTGAGTGTACCAAGTTTAGTGAATCAAGTTCCAAGTAAAACCATTCCAAGTGTCTTCTTTATTAACAATTTTGCTGTTTGGTTTCATGAACAAGCTAAAAGAATTAGATTTGATGTAAAAGGTGTTCTTATCAATACATTTGAGGAGCTGGAATCACATGCGTTATCTTCTTTGTCAACTGACTCCTCTTTGCAACTCCCACCTTTGTATTCTGTCGGACCTGTTTTGCACTTGAACAAGAACACTGAGACTATGGATGATGGAGATGTGTTGAAGTGGCTTGATGATCAACCACTTTCATCGGTGGTGTTTTTGTGCTTTGGAAGTAGAGGAGCTTTCAAAAAGGATCAGGTGGAGGAGATTGCACGAGCGCTTGAGAGAAGTAGAGTTCGTTTCATTTGGTCTCTTCGACGACCAGGGAATGTGTTTCAATCATCAATCGACTATACAAATTTTGAAGACATCTTACCTAAGGGATTTCTTGATCGAACAGAGAACATTGGGAGAGTCATCAGCTGGGCACCGCAAGTGGAGATATTAGGCCATCCAGCCACAGGTGGGTTCGTATCACATTGTGGTTGGAACTCGACGTTGGAAAGTTTGTGGCATGGCGTGCCGATGGCAACATGGCCAATGTATGCAGAGCAACAATTCAACGCATTTGATCTGGTGGTAGAATTGGGATTGGCTGTGGAGATCAAGATAAGTTATTGTATTGAACTTAAAGAACAAGCCAACCCAATAATAATGGCAGAAGAGATAGAAAGAGGAATTAGAAAGTTGATGGACAACAACAATAATGAGATAAGGAAGAAAGTGAAAACAAAAAGTGAAGAATGCAGAAAAAGTGTAATAGAAGGTGGATCCTCTTTCATCTCATTAGGAAAATTTATTGATGATGTTTTGAGCAACTCTACAACAGGAGGAAACTAA
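The coding sequence can be checked against the protein used as the BLAST query by translating it. The following is a rough sketch, assuming the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and not part of this record.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First ten and last four codons of the CDS shown above.
print(translate("ATGAAGAAATTTGAGCTTGTTTTCATACCA"))
print(translate("GGAGGAAACTAA"))
```

The first fragment should match the start of the query line in the alignments (MKKFELVFIP), and the second should give the C-terminal residues followed by the stop codon.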
BLAST of CSPI04G22120 vs. Swiss-Prot
Match: UFOG3_FRAAN (Putative UDP-glucose flavonoid 3-O-glucosyltransferase 3 OS=Fragaria ananassa GN=GT3 PE=2 SV=1)

HSP 1 Score: 443.7 bits (1140), Expect = 2.5e-123
Identity = 240/486 (49.38%), Positives = 322/486 (66.26%), Query Frame = 1
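The percentages in each HSP summary are the match counts divided by the alignment length, which includes gap columns. A small sketch that reproduces the figures for this HSP; the helper name `percent` is hypothetical.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Share of aligned columns that match, as a percentage rounded to two decimals."""
    return round(100 * matches / alignment_length, 2)

# Counts copied from the HSP summary line above.
identity = percent(240, 486)
positives = percent(322, 486)
print(identity, positives)
```

The same calculation applies to every HSP summary in this report.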

Query: 2   KKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANE-YNQSLSAQS 61
           K  ELV IP PG GHL S +E+A  L++RD +L +T++    P   K  + Y QSL+  S
Sbjct: 3   KPAELVLIPSPGIGHLVSTLEIAKLLVSRDDKLFITVLIMHFPAVSKGTDAYVQSLADSS 62

Query: 62  LTNNNSIQFIVLPELPDIPNNGN-RFFLEVVLESYKPHVKQALISFLTTSTNHLAGFVLD 121
              +  I FI LP        G+ R  L   +ES +PHVK A+ +   + T  LAGFV+D
Sbjct: 63  SPISQRINFINLPHTNMDHTEGSVRNSLVGFVESQQPHVKDAVANLRDSKTTRLAGFVVD 122

Query: 122 SFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNLSV 181
            FC+TM++VAN+  VPSYV++TS AA L    HL++L  Q N       + KDSD  L +
Sbjct: 123 MFCTTMINVANQLGVPSYVFFTSGAATLGLLFHLQELRDQYNKD---CTEFKDSDAELII 182

Query: 182 PSLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTDSS 241
           PS  N +P+K +P    + + A  F    KR R + KG+L+NTF +LESHAL +LS+D+ 
Sbjct: 183 PSFFNPLPAKVLPGRMLVKDSAEPFLNVIKRFR-ETKGILVNTFTDLESHALHALSSDA- 242

Query: 242 LQLPPLYSVGPVLHLNKNTETMDD------GDVLKWLDDQPLSSVVFLCFGSRGAFKKDQ 301
            ++PP+Y VGP+L+LN N   +D        D+LKWLDDQP  SVVFLCFGS G+F + Q
Sbjct: 243 -EIPPVYPVGPLLNLNSNESRVDSDEVKKKNDILKWLDDQPPLSVVFLCFGSMGSFDESQ 302

Query: 302 VEEIARALERSRVRFIWSLRR--PGNVFQSSIDYTNFEDILPKGFLDRTENIGRVISWAP 361
           V EIA ALE +  RF+WSLRR  P        DY +   +LP+GFLDRT  IG+VI WAP
Sbjct: 303 VREIANALEHAGHRFLWSLRRSPPTGKVAFPSDYDDHTGVLPEGFLDRTGGIGKVIGWAP 362

Query: 362 QVEILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKI 421
           QV +L HP+ GGFVSHCGWNSTLESLWHGVP+ATWP+YAEQQ NAF  V EL LAVEI +
Sbjct: 363 QVAVLAHPSVGGFVSHCGWNSTLESLWHGVPVATWPLYAEQQLNAFQPVKELELAVEIDM 422

Query: 422 SYCIELKEQANPIIMAEEIERGIRKLMDNNNNEIRKKVKTKSEECRKSVIEGGSSFISLG 478
           SY    + ++  ++ A+EIERGIR++M+ ++++IRK+VK  SE+ +K++++GGSS+ SLG
Sbjct: 423 SY----RSKSPVLVSAKEIERGIREVMELDSSDIRKRVKEMSEKGKKALMDGGSSYTSLG 478

BLAST of CSPI04G22120 vs. Swiss-Prot
Match: UFOG6_FRAAN (UDP-glucose flavonoid 3-O-glucosyltransferase 6 OS=Fragaria ananassa GN=GT6 PE=1 SV=1)

HSP 1 Score: 428.7 bits (1101), Expect = 8.4e-119
Identity = 233/489 (47.65%), Positives = 321/489 (65.64%), Query Frame = 1

Query: 2   KKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANE-YNQSLSAQS 61
           K  EL+FIPIPG GH+ S VE+A  LL RD  L +T++  K P     ++ Y +SL+   
Sbjct: 3   KASELIFIPIPGIGHIVSTVEIAKLLLCRDDNLFITILIMKFPFTADGSDVYIKSLAVDP 62

Query: 62  LTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTS--TNHLAGFVL 121
                 I+F+ LP+          FF    ++S+K HVK A+   + T   T  +AGFV+
Sbjct: 63  SLKTQRIRFVNLPQEHFQGTGATGFF--TFIDSHKSHVKDAVTRLMETKSETTRIAGFVI 122

Query: 122 DSFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNLS 181
           D FC+ M+D+ANEF +PSYV+YTS AA L    HL+ L  ++N       + KDSD  L 
Sbjct: 123 DMFCTGMIDLANEFGLPSYVFYTSGAADLGLMFHLQALRDEENKD---CTEFKDSDAELV 182

Query: 182 VPSLVNQVPS-KTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTD 241
           V S VN +P+ + +PSV F      +F   AKR R + KG+L+NTF ELE HA+ SLS+D
Sbjct: 183 VSSFVNPLPAARVLPSVVFEKEGGNFFLNFAKRYR-ETKGILVNTFLELEPHAIQSLSSD 242

Query: 242 SSLQLPPLYSVGPVLHLNK------NTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKK 301
             +   P+Y VGP+L++        + ++    D+L+WLDDQP SSVVFLCFGS G F +
Sbjct: 243 GKIL--PVYPVGPILNVKSEGNQVSSEKSKQKSDILEWLDDQPPSSVVFLCFGSMGCFGE 302

Query: 302 DQVEEIARALERSRVRFIWSLRRPGNV---FQSSIDYTNFEDILPKGFLDRTENIGRVIS 361
           DQV+EIA ALE+  +RF+WSLR+P      F S  DYT+++ +LP+GFLDRT ++G+VI 
Sbjct: 303 DQVKEIAHALEQGGIRFLWSLRQPSKEKIGFPS--DYTDYKAVLPEGFLDRTTDLGKVIG 362

Query: 362 WAPQVEILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVE 421
           WAPQ+ IL HPA GGFVSHCGWNSTLES+W+GVP+ATWP YAEQQ NAF+LV EL LAVE
Sbjct: 363 WAPQLAILAHPAVGGFVSHCGWNSTLESIWYGVPIATWPFYAEQQVNAFELVKELKLAVE 422

Query: 422 IKISYCIELKEQANPIIMAEEIERGIRKLMDNNNNEIRKKVKTKSEECRKSVIEGGSSFI 478
           I + Y    ++ +  I+  E IE+GI+++M+   +E+RK+VK  S+  RK++ E GSS+ 
Sbjct: 423 IDMGY----RKDSGVIVSRENIEKGIKEVME-QESELRKRVKEMSQMSRKALEEDGSSYS 476

BLAST of CSPI04G22120 vs. Swiss-Prot
Match: U7A16_PYRCO (UDP-glycosyltransferase 71A16 OS=Pyrus communis GN=UGT71A16 PE=1 SV=1)

HSP 1 Score: 427.2 bits (1097), Expect = 2.5e-118
Identity = 234/488 (47.95%), Positives = 324/488 (66.39%), Query Frame = 1

Query: 5   ELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYNQSLSAQSLTNN 64
           +LVF+P PG GH+ S VEMA  L+ARD +L +T++  KLP D      + S+S       
Sbjct: 6   QLVFVPAPGIGHIVSTVEMAKQLVARDDQLFITVLVMKLPYDQPFTNTDSSIS------- 65

Query: 65  NSIQFIVLPELP-----DIPNNGNRFFLEVVLESYKPHVKQALISFL-------TTSTNH 124
           + I F+ LPE        +PN G+  F  + +E++K HV+ A+I+ L       +TS   
Sbjct: 66  HRINFVNLPEAQLDKQDTVPNPGS--FFRMFVENHKTHVRDAVINLLPESDQSESTSKPR 125

Query: 125 LAGFVLDSFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKD 184
           LAGFVLD F ++++DVANEF+VPSYV++TS ++ LA   H + L  +       I +L  
Sbjct: 126 LAGFVLDMFSASLIDVANEFEVPSYVFFTSNSSTLALLSHFQSLRDEGGID---ITELTS 185

Query: 185 SDVNLSVPSLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALS 244
           S   L+VPS +N  P   +P  F              R +   KG+L+NTF ELESHAL 
Sbjct: 186 STAELAVPSFINPYPVAVLPGSFLDKESTKSTLNNVGRYK-QTKGILVNTFLELESHALH 245

Query: 245 SLSTDSSLQLPPLYSVGPVLHLNKNTETMDDG-DVLKWLDDQPLSSVVFLCFGSRGAFKK 304
            L  DS +++PP+Y VGP+L+L  + E  D G D+L+WLDDQP  SVVFLCFGS G+F  
Sbjct: 246 YL--DSGVKIPPVYPVGPLLNLKSSHE--DKGSDILRWLDDQPPLSVVFLCFGSMGSFGD 305

Query: 305 DQVEEIARALERSRVRFIWSLRRPGNVFQSSI--DYTNFEDILPKGFLDRTENIGRVISW 364
            QV+EIA  LE S  RF+WSLR+P +  + ++  DY + + +LP+GFLDRT  +GRVI W
Sbjct: 306 AQVKEIACTLEHSGHRFLWSLRQPPSKGKRALPSDYADLKTVLPEGFLDRTATVGRVIGW 365

Query: 365 APQVEILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEI 424
           APQ  ILGHPA GGFVSHCGWNSTLES+W+GVP+A WPMYAEQ  NAF LVVELGLAVEI
Sbjct: 366 APQAAILGHPAIGGFVSHCGWNSTLESIWNGVPIAAWPMYAEQNMNAFQLVVELGLAVEI 425

Query: 425 KISYCIELKEQANPIIMAEEIERGIRKLMDNNNNEIRKKVKTKSEECRKSVIEGGSSFIS 478
           K+ Y    ++ ++ ++ AE+IERGIR++M+  ++++RK+VK  SE+ +K++++GGSS+ S
Sbjct: 426 KMDY----RKDSDVVVSAEDIERGIRQVME-LDSDVRKRVKEMSEKSKKALVDGGSSYSS 471

BLAST of CSPI04G22120 vs. Swiss-Prot
Match: U7A15_MALDO (UDP-glycosyltransferase 71A15 OS=Malus domestica GN=UGT71A15 PE=1 SV=1)

HSP 1 Score: 425.6 bits (1093), Expect = 7.2e-118
Identity = 236/488 (48.36%), Positives = 331/488 (67.83%), Query Frame = 1

Query: 5   ELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYNQSLSAQSLTNN 64
           +LVF+P PG GH+ S VEMA  L ARD +L +T++  KLP       Y Q  +    + +
Sbjct: 6   QLVFVPAPGIGHIVSTVEMAKQLAARDDQLFITVLVMKLP-------YAQPFTNTDSSIS 65

Query: 65  NSIQFIVLPEL-PD----IPNNGNRFFLEVVLESYKPHVKQALISFL-------TTSTNH 124
           + I F+ LPE  PD    +PN G+  F  + +E++K HV+ A+I+ L       +TS   
Sbjct: 66  HRINFVNLPEAQPDKQDIVPNPGS--FFRMFVENHKSHVRDAVINVLPESDQSESTSKPR 125

Query: 125 LAGFVLDSFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKD 184
           LAGFVLD F ++++DVANEFKVPSY+++TS A+ LA   H + L  +       I +L  
Sbjct: 126 LAGFVLDMFSASLIDVANEFKVPSYLFFTSNASALALMSHFQSLRDEGGID---ITELTS 185

Query: 185 SDVNLSVPSLVNQVPSKTIP-SVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHAL 244
           S   L+VPS +N  P+  +P S+  + +     +  +K  +   KG+L+NTF ELESHAL
Sbjct: 186 STAELAVPSFINPYPAAVLPGSLLDMESTKSTLNHVSKYKQ--TKGILVNTFMELESHAL 245

Query: 245 SSLSTDSSLQLPPLYSVGPVLHLNKNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKK 304
             L  DS  ++PP+Y VGP+L+L K+++     D+L+WLDDQP  SVVFLCFGS G+F +
Sbjct: 246 HYL--DSGDKIPPVYPVGPLLNL-KSSDEDKASDILRWLDDQPPFSVVFLCFGSMGSFGE 305

Query: 305 DQVEEIARALERSRVRFIWSLRRPGNVFQSSI--DYTNFEDILPKGFLDRTENIGRVISW 364
            QV+EIA ALE S  RF+WSLRRP    + ++  DY + + +LP+GFLDRT  +G+VI W
Sbjct: 306 AQVKEIACALEHSGHRFLWSLRRPPPQGKRAMPSDYEDLKTVLPEGFLDRTATVGKVIGW 365

Query: 365 APQVEILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEI 424
           APQ  ILGHPATGGFVSHCGWNSTLESLW+GVP+A WP+YAEQ  NAF LVVELGLAVEI
Sbjct: 366 APQAAILGHPATGGFVSHCGWNSTLESLWNGVPIAAWPLYAEQNLNAFQLVVELGLAVEI 425

Query: 425 KISYCIELKEQANPIIMAEEIERGIRKLMDNNNNEIRKKVKTKSEECRKSVIEGGSSFIS 478
           K+ Y    +  ++ ++ AE+IERGIR++M+  ++++RK+VK  SE+ +K++++GGSS+ S
Sbjct: 426 KMDY----RRDSDVVVSAEDIERGIRRVME-LDSDVRKRVKEMSEKSKKALVDGGSSYSS 471

BLAST of CSPI04G22120 vs. Swiss-Prot
Match: U71E1_STERE (UDP-glycosyltransferase 71E1 OS=Stevia rebaudiana GN=UGT71E1 PE=2 SV=1)

HSP 1 Score: 411.8 bits (1057), Expect = 1.1e-113
Identity = 233/491 (47.45%), Positives = 315/491 (64.15%), Query Frame = 1

Query: 1   MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYNQ----SL 60
           M   ELVFIP PG+GHL   VE+A  LL RD RL+VT+I   L L PK N   +    SL
Sbjct: 1   MSTSELVFIPSPGAGHLPPTVELAKLLLHRDQRLSVTIIVMNLWLGPKHNTEARPCVPSL 60

Query: 61  SAQSLTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTSTNHLAGF 120
               +  + S   ++ P            F+   +E +KP V+  +   + + +  LAGF
Sbjct: 61  RFVDIPCDESTMALISPNT----------FISAFVEHHKPRVRDIVRGIIESDSVRLAGF 120

Query: 121 VLDSFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVN 180
           VLD FC  M DVANEF VPSY Y+TS AA L    HL+  + +D+   +  + LK+SD  
Sbjct: 121 VLDMFCMPMSDVANEFGVPSYNYFTSGAATLGLMFHLQ--WKRDHEGYDATE-LKNSDTE 180

Query: 181 LSVPSLVNQVPSKTIPSVFFINNF-AVWFHEQAKRIRFDVKGVLINTFEELESHALSSLS 240
           LSVPS VN VP+K +P V       +  F + A+RIR + KG+++N+ + +E HAL  LS
Sbjct: 181 LSVPSYVNPVPAKVLPEVVLDKEGGSKMFLDLAERIR-ESKGIIVNSCQAIERHALEYLS 240

Query: 241 TDSSLQLPPLYSVGPVLHLNKNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKKDQVE 300
           ++++  +PP++ VGP+L+L    +     ++++WL++QP SSVVFLCFGS G+F + QV+
Sbjct: 241 SNNN-GIPPVFPVGPILNLENKKDDAKTDEIMRWLNEQPESSVVFLCFGSMGSFNEKQVK 300

Query: 301 EIARALERSRVRFIWSLRRPG--NVFQSSIDYTNFEDILPKGFLDRTENIGRVISWAPQV 360
           EIA A+ERS  RF+WSLRRP      +   +Y N E++LP+GFL RT +IG+VI WAPQ+
Sbjct: 301 EIAVAIERSGHRFLWSLRRPTPKEKIEFPKEYENLEEVLPEGFLKRTSSIGKVIGWAPQM 360

Query: 361 EILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKISY 420
            +L HP+ GGFVSHCGWNSTLES+W GVPMA WP+YAEQ  NAF LVVELGLA EI++ Y
Sbjct: 361 AVLSHPSVGGFVSHCGWNSTLESMWCGVPMAAWPLYAEQTLNAFLLVVELGLAAEIRMDY 420

Query: 421 CIELKE--QANPIIMAEEIERGIRKLMDNNNNEIRKKVKTKSEECRKSVIEGGSSFISLG 480
             + K        +  EEIE GIRKLM  ++ EIR KVK   E+ R +V+EGGSS+ S+G
Sbjct: 421 RTDTKAGYDGGMEVTVEEIEDGIRKLM--SDGEIRNKVKDVKEKSRAAVVEGGSSYASIG 473

Query: 481 KFIDDVLSNST 483
           KFI+ V SN T
Sbjct: 481 KFIEHV-SNVT 473

BLAST of CSPI04G22120 vs. TrEMBL
Match: A0A0A0L341_CUCSA (Glycosyltransferase OS=Cucumis sativus GN=Csa_4G620550 PE=3 SV=1)

HSP 1 Score: 963.0 bits (2488), Expect = 1.4e-277
Identity = 483/486 (99.38%), Positives = 484/486 (99.59%), Query Frame = 1

Query: 1   MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYNQSLSAQS 60
           MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEY QSLSAQS
Sbjct: 1   MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYIQSLSAQS 60

Query: 61  LTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTSTNHLAGFVLDS 120
           LTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTSTNHLAGFVLDS
Sbjct: 61  LTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTSTNHLAGFVLDS 120

Query: 121 FCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNLSVP 180
           FCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNLSVP
Sbjct: 121 FCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNLSVP 180

Query: 181 SLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTDSSL 240
           SLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTDSSL
Sbjct: 181 SLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTDSSL 240

Query: 241 QLPPLYSVGPVLHLNKNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKKDQVEEIARA 300
           QLPPLYSVGPVLHLNKNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKKDQVEEIARA
Sbjct: 241 QLPPLYSVGPVLHLNKNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKKDQVEEIARA 300

Query: 301 LERSRVRFIWSLRRPGNVFQSSIDYTNFEDILPKGFLDRTENIGRVISWAPQVEILGHPA 360
           LERSRVRFIWSLRRPGNVFQSSIDYTNFEDILPKGFLDRT+NIGRVISWAPQVEILGHPA
Sbjct: 301 LERSRVRFIWSLRRPGNVFQSSIDYTNFEDILPKGFLDRTQNIGRVISWAPQVEILGHPA 360

Query: 361 TGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKISYCIELKEQ 420
           TGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKISYCIELKEQ
Sbjct: 361 TGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKISYCIELKEQ 420

Query: 421 ANPIIMAEEIERGIRKLMDNNNNEIRKKVKTKSEECRKSVIEGGSSFISLGKFIDDVLSN 480
           ANPIIMAEEIERGIRKLMDNN NEIRKKVKTKSEECRKSVIEGGSSFISLGKFIDDVLSN
Sbjct: 421 ANPIIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRKSVIEGGSSFISLGKFIDDVLSN 480

Query: 481 STTGGN 487
           STTGGN
Sbjct: 481 STTGGN 486

BLAST of CSPI04G22120 vs. TrEMBL
Match: K7NBW4_SIRGR (Glycosyltransferase OS=Siraitia grosvenorii GN=UDPG7 PE=2 SV=1)

HSP 1 Score: 566.2 bits (1458), Expect = 3.8e-158
Identity = 292/497 (58.75%), Positives = 373/497 (75.05%), Query Frame = 1

Query: 1   MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYNQSLSAQS 60
           MKKFELVFIP+P  GHLA+MVEMAN L+ RD RL VT++  KLPL  K  EY QSLSA  
Sbjct: 1   MKKFELVFIPLPVMGHLAAMVEMANILVTRDQRLTVTILVIKLPLYGKTAEYIQSLSASF 60

Query: 61  LTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALI----SFLTTSTNHLAGF 120
            +   S++FI+LPE+     +   F L+  LESYKP +++A+I    S +   +  LAGF
Sbjct: 61  ASE--SMRFIILPEVLLPEESEKEFMLKAFLESYKPIIREAIIDLTDSQMGPDSPRLAGF 120

Query: 121 VLDSFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVN 180
           VLD FC+TM+DVANEF VPSYV+ TS A +LA S HL++LY  +N+S EV++QL++S+  
Sbjct: 121 VLDMFCTTMIDVANEFGVPSYVFCTSNAGFLALSFHLQELY-DENNSKEVVKQLQNSNAE 180

Query: 181 LSVPSLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLST 240
           +++PS VN +P K IP +F  ++ A WFH+Q +R R  VKG+LINTF +LESH ++S+S 
Sbjct: 181 IALPSFVNPIPGKMIPDIFSNDDTASWFHDQVERYRSGVKGILINTFAKLESHVMNSMSR 240

Query: 241 DSSLQLPPLYSVGPVLHLNKNTETMDDG------DVLKWLDDQPLSSVVFLCFGSRGAFK 300
            SS + PPLYS+GP+LHL KN  T+  G      D+LKWLD+QP  SVVFLCFGS G+F 
Sbjct: 241 SSSSRAPPLYSIGPILHL-KNNNTVGPGGTLHCTDILKWLDNQPPVSVVFLCFGSMGSFD 300

Query: 301 KDQVEEIARALERSRVRFIWSLRRPG--NVFQSSIDYTNFEDILPKGFLDRTENIGRVIS 360
           +DQV+EIA ALERS VRF+WSLR+P   + F++  +YT+ + +LP+GFL+RT  IGRVI 
Sbjct: 301 EDQVKEIAHALERSGVRFLWSLRQPPPKDKFEAPSEYTDIKYVLPEGFLERTAGIGRVIG 360

Query: 361 WAPQVEILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVE 420
           WAPQVEIL HPATGGFVSHCGWNSTLES+WHGVPMATWP+YAEQQF AF++VVELGLAV+
Sbjct: 361 WAPQVEILAHPATGGFVSHCGWNSTLESMWHGVPMATWPLYAEQQFTAFEMVVELGLAVD 420

Query: 421 IKISYCIELKEQANPIIMAEEIERGIRKLMDNNNNEIRKKVKTKSEECRKSVIEGGSSFI 480
           I + Y      + + ++ AEEI+ GIRKLM+    E+RKKVK KSEE RKS++EGGSSFI
Sbjct: 421 ITLDYQKHPHGERSRVVSAEEIQSGIRKLME-EGGEMRKKVKAKSEESRKSLMEGGSSFI 480

Query: 481 SLGKFIDDVLSNSTTGG 486
           SLG+FIDDVL N   GG
Sbjct: 481 SLGRFIDDVLGNGPEGG 492

BLAST of CSPI04G22120 vs. TrEMBL
Match: A0A0A0L321_CUCSA (Glycosyltransferase OS=Cucumis sativus GN=Csa_4G618520 PE=3 SV=1)

HSP 1 Score: 548.5 bits (1412), Expect = 8.1e-153
Identity = 283/495 (57.17%), Positives = 365/495 (73.74%), Query Frame = 1

Query: 1   MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKAN-EYNQSLSAQ 60
           M KFELVFIP PG GHLAS VE+AN L++RD RL+VT++A KLP D K   E  QSLSA 
Sbjct: 1   MNKFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKTTTERIQSLSAS 60

Query: 61  SLTNNNSIQFIVLPELPDIPNNGNR---FFLEVVLESYKPHVKQALISFLTTSTNHLAGF 120
                 SI+FIVLPELP  PN  +      L+  LES+KPHV++ +++ L   +N L GF
Sbjct: 61  F--EGKSIRFIVLPELP-FPNQSSEPPPLMLQAFLESHKPHVRE-IVTNLIHDSNRLVGF 120

Query: 121 VLDSFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVN 180
           V+D FC++M++VANEFKVP Y++YTS A +L FS HL++LY Q+NS+ E   QL++S+V 
Sbjct: 121 VIDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAE---QLQNSNVE 180

Query: 181 LSVPSLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLST 240
           L++PS +N +P+K IP   F  + A WFH+  KR R +VKG+LINTF E+E   +  +S 
Sbjct: 181 LALPSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSN 240

Query: 241 DSSLQLPPLYSVGPVLHLN-----KNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKK 300
            SS ++P +Y+VGP+L L      ++   ++  D+LKWLDDQP +SVVFLCFGS+G+F +
Sbjct: 241 GSS-KIPKVYTVGPILQLKSIGVTQSNNALNGADILKWLDDQPPASVVFLCFGSKGSFDE 300

Query: 301 DQVEEIARALERSRVRFIWSLRRPG--NVFQSSIDYTNFEDILPKGFLDRTENIGRVISW 360
           DQV EIARALERS VRF+WSLR+P     F+   +Y N  D+LP+GFL+RT +IGRVI W
Sbjct: 301 DQVLEIARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGW 360

Query: 361 APQVEILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEI 420
           APQ+EIL HPATGGF+SHCGWNSTLES+WHGVPMATWP+YAEQQFNAF++VVELGLAVE+
Sbjct: 361 APQIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVEL 420

Query: 421 KISYCIELKEQANPIIMAEEIERGIRKLMDNNNNEIRKKVKTKSEECRKSVIEGGSSFIS 480
            + Y  +     + I+ AEEIE GIRKLM ++ NEIRKK+K K EE RKS++EGGSSF S
Sbjct: 421 TLDYVKDFHIGRSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKSMMEGGSSFNS 480

Query: 481 LGKFIDDVLSNSTTG 485
           L  FIDD L+N   G
Sbjct: 481 LRHFIDDALTNLQEG 487

BLAST of CSPI04G22120 vs. TrEMBL
Match: A0A067FE19_CITSI (Glycosyltransferase OS=Citrus sinensis GN=CISIN_1g045029mg PE=3 SV=1)

HSP 1 Score: 491.9 bits (1265), Expect = 9.0e-136
Identity = 262/491 (53.36%), Positives = 351/491 (71.49%), Query Frame = 1

Query: 1   MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYNQSLSAQS 60
           MKK +LVFIP PG+GHL S VE+A  L+ RD RL+VT++  KLP D     Y QSL+A +
Sbjct: 1   MKKAQLVFIPSPGAGHLVSTVEVARLLVDRDDRLSVTVLIMKLPHDNTVATYTQSLAASN 60

Query: 61  LTNNNSIQFIVLPE-LPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTSTN--HLAGFV 120
           L++   I+FI LP+  PD  +   + F    +ES KPHVK+ + +    S +   LAGFV
Sbjct: 61  LSSR--IKFINLPDDQPDKESTPPKRFFGHFVESKKPHVKEVVANLTDESPDSPRLAGFV 120

Query: 121 LDSFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNL 180
           LD FC+ M++VA+EFKVPSY+++TS AA+L F L ++ L+ ++N++   I +LKDSD  L
Sbjct: 121 LDMFCTCMIEVADEFKVPSYLFFTSGAAFLGFMLRVQALHDEENTT---ITELKDSDAVL 180

Query: 181 SVPSLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTD 240
            VP LVN VP+K  PSV F   +A   ++QA+  R   KG+++NTFEELESHA+ S S D
Sbjct: 181 EVPGLVNSVPAKVWPSVVFNKEWAEVLNQQARTFR-GTKGIMVNTFEELESHAVRSFS-D 240

Query: 241 SSLQLPPLYSVGPVLHLNKNTETMDDG------DVLKWLDDQPLSSVVFLCFGSRGAFKK 300
              + PPLY +GP+L++      + +G      D++ WLDDQP SSVVFLCFGS G+F +
Sbjct: 241 GKSKTPPLYPMGPILNIKGENYDLGEGGADKKADIMAWLDDQPESSVVFLCFGSWGSFGE 300

Query: 301 DQVEEIARALERSRVRFIWSLRRPGN--VFQSSIDYTNFEDILPKGFLDRTENIGRVISW 360
           DQV+EIA ALE+S  RF+WSLRRP +   F+   DY +  ++LP+GF+DRT NIG+VI W
Sbjct: 301 DQVKEIACALEQSGHRFLWSLRRPPSKDTFEKPSDYEDPTEVLPEGFMDRTANIGKVIGW 360

Query: 361 APQVEILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEI 420
           APQ+ +L HPA GGFVSHCGWNSTLES+W GVP+ATWPMYAEQQFNAF+LVVELGLAVEI
Sbjct: 361 APQIAVLAHPAIGGFVSHCGWNSTLESIWFGVPIATWPMYAEQQFNAFELVVELGLAVEI 420

Query: 421 KISYCIELKEQANPIIMAEEIERGIRKLMDNNNNEIRKKVKTKSEECRKSVIEGGSSFIS 480
           K+ Y  ++  +   ++ AE IERGIR LM+ +N+E+RK+VK  SE+ RK++ +GGSSF S
Sbjct: 421 KMDYRNDIMIENPTVVNAEVIERGIRCLME-HNSEMRKRVKEMSEKARKALSDGGSSFSS 480

BLAST of CSPI04G22120 vs. TrEMBL
Match: V4SQP2_9ROSI (Glycosyltransferase OS=Citrus clementina GN=CICLE_v10011544mg PE=3 SV=1)

HSP 1 Score: 485.7 bits (1249), Expect = 6.5e-134
Identity = 260/491 (52.95%), Positives = 348/491 (70.88%), Query Frame = 1

Query: 1   MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYNQSLSAQS 60
           MKK +LVFIP PG+GHL S VE+A  L+ RD  L+VT++  KLP D     Y QSL+A +
Sbjct: 15  MKKAQLVFIPSPGAGHLVSTVEVARLLVERDDGLSVTVLIMKLPHDNTVATYTQSLAASN 74

Query: 61  LTNNNSIQFIVLPE-LPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTSTN--HLAGFV 120
           L++   I+FI LP+  PD  +   + F    +ES KPHVK+ + +    S +   LAGFV
Sbjct: 75  LSSR--IKFINLPDDQPDKESTPPKRFFADFVESKKPHVKEVVANLTDESPDSPRLAGFV 134

Query: 121 LDSFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNL 180
           LD FC+ M++VA+EFKVPSY+++TS AA+L F L ++ L+ ++N++   I +LKDSD  L
Sbjct: 135 LDMFCTCMIEVADEFKVPSYLFFTSGAAFLGFMLRVQALHDEENTA---ITELKDSDAVL 194

Query: 181 SVPSLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTD 240
            VP LVN VP+K  PSV F   +A   ++ A+R R   KG+++NTFEELESHA+ S S  
Sbjct: 195 EVPGLVNSVPAKVWPSVVFNKEWAEALYQHARRFR-GTKGIMVNTFEELESHAVRSFSNG 254

Query: 241 SSLQLPPLYSVGPVLHLNKNTETMDDG------DVLKWLDDQPLSSVVFLCFGSRGAFKK 300
            S + PPLY VGP+L++      + +G      D++ WLDDQP SSVVFLCFGS G+F +
Sbjct: 255 KS-KTPPLYPVGPILNIKGENYDLGEGGADKKADIMAWLDDQPESSVVFLCFGSWGSFGE 314

Query: 301 DQVEEIARALERSRVRFIWSLRRPGN--VFQSSIDYTNFEDILPKGFLDRTENIGRVISW 360
           DQV+EIA ALE+S  RF+WSLRR  +   F+   DY +  ++LP+GF+DRT NIG+VI W
Sbjct: 315 DQVKEIACALEQSGHRFLWSLRRAPSKDTFEKPSDYEDPTEVLPEGFMDRTANIGKVIGW 374

Query: 361 APQVEILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEI 420
           APQV +L HP+ GGFVSHCGWNSTLES+W GVP+ATWPMYAEQQFNAF+LVVELGLAVEI
Sbjct: 375 APQVAVLAHPSIGGFVSHCGWNSTLESIWFGVPIATWPMYAEQQFNAFELVVELGLAVEI 434

Query: 421 KISYCIELKEQANPIIMAEEIERGIRKLMDNNNNEIRKKVKTKSEECRKSVIEGGSSFIS 480
           K+ Y  ++  +   ++ AE IERGIR LM+ +N+E+R +VK  SE+ RK++ +GGSSF S
Sbjct: 435 KMDYRNDIMIENPTVVNAEVIERGIRCLME-HNSEMRMRVKEMSEKARKALSDGGSSFSS 494

BLAST of CSPI04G22120 vs. TAIR10
Match: AT3G21760.1 (AT3G21760.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 405.2 bits (1040), Expect = 5.6e-113
Identity = 231/493 (46.86%), Positives = 312/493 (63.29%), Query Frame = 1

Query: 3   KFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYNQSLSAQSLT 62
           K ELVFIP PG GHL  +VE+A   + RD  L++T+I   +P     +  N S    SL+
Sbjct: 2   KLELVFIPSPGDGHLRPLVEVAKLHVDRDDHLSITIII--IPQMHGFSSSNSSSYIASLS 61

Query: 63  NNN----SIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTT-----STNHL 122
           +++    S   + +P+ PD  +    FF    ++++KP VK A +  LT      S + L
Sbjct: 62  SDSEERLSYNVLSVPDKPDSDDTKPHFF--DYIDNFKPQVK-ATVEKLTDPGPPDSPSRL 121

Query: 123 AGFVLDSFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDS 182
           AGFV+D FC  M+DVANEF VPSY++YTS A +L   +H+E LY   N     +  LKDS
Sbjct: 122 AGFVVDMFCMMMIDVANEFGVPSYMFYTSNATFLGLQVHVEYLYDVKNYD---VSDLKDS 181

Query: 183 DVN-LSVPSLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALS 242
           D   L VP L   +P K  PSV     +      Q +R R + KG+L+NTF ELE  A+ 
Sbjct: 182 DTTELEVPCLTRPLPVKCFPSVLLTKEWLPVMFRQTRRFR-ETKGILVNTFAELEPQAMK 241

Query: 243 SLSTDSSLQLPPLYSVGPVLHLNKNTETMDD---GDVLKWLDDQPLSSVVFLCFGSRGAF 302
             S   S  LP +Y+VGPV++L  N     D    ++L+WLD+QP  SVVFLCFGS G F
Sbjct: 242 FFSGVDS-PLPTVYTVGPVMNLKINGPNSSDDKQSEILRWLDEQPRKSVVFLCFGSMGGF 301

Query: 303 KKDQVEEIARALERSRVRFIWSLRR--PGNVFQSSIDYTNFEDILPKGFLDRTENIGRVI 362
           ++ Q +EIA ALERS  RF+WSLRR  P        ++TN E+ILP+GFL+RT  IG+++
Sbjct: 302 REGQAKEIAIALERSGHRFVWSLRRAQPKGSIGPPEEFTNLEEILPEGFLERTAEIGKIV 361

Query: 363 SWAPQVEILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAV 422
            WAPQ  IL +PA GGFVSHCGWNSTLESLW GVPMATWP+YAEQQ NAF++V ELGLAV
Sbjct: 362 GWAPQSAILANPAIGGFVSHCGWNSTLESLWFGVPMATWPLYAEQQVNAFEMVEELGLAV 421

Query: 423 EIKISYCIELKEQANPIIMAEEIERGIRKLMDNNNNEIRKKVKTKSEECRKSVIEGGSSF 481
           E++ S+  +     + ++ AEEIERGIR LM+  ++++R +VK  SE+   ++++GGSS 
Sbjct: 422 EVRNSFRGDFMAADDELMTAEEIERGIRCLME-QDSDVRSRVKEMSEKSHVALMDGGSSH 481

BLAST of CSPI04G22120 vs. TAIR10
Match: AT3G21790.1 (AT3G21790.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 390.2 bits (1001), Expect = 1.9e-108
Identity = 225/491 (45.82%), Positives = 322/491 (65.58%), Query Frame = 1

Query: 3   KFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPK--ANEYNQSLSAQS 62
           KFELVFIP PG GHL S VEMA  L+ R+ RL++++I      + +  A++Y  +LSA S
Sbjct: 2   KFELVFIPYPGIGHLRSTVEMAKLLVDRETRLSISVIILPFISEGEVGASDYIAALSASS 61

Query: 63  LTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTSTNH-----LAG 122
              NN +++ V+  + D P       +E+ +++ +P V+  +   L   ++      +AG
Sbjct: 62  ---NNRLRYEVISAV-DQPTI-EMTTIEIHMKNQEPKVRSTVAKLLEDYSSKPDSPKIAG 121

Query: 123 FVLDSFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDV 182
           FVLD FC++MVDVANEF  PSY++YTS A  L+ + H++ L   +N  +       DS+ 
Sbjct: 122 FVLDMFCTSMVDVANEFGFPSYMFYTSSAGILSVTYHVQML-CDENKYDVSENDYADSEA 181

Query: 183 NLSVPSLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLS 242
            L+ PSL    P K +P     N +   F  QA++ R ++KG+L+NT  ELE + L  LS
Sbjct: 182 VLNFPSLSRPYPVKCLPHALAANMWLPVFVNQARKFR-EMKGILVNTVAELEPYVLKFLS 241

Query: 243 TDSSLQLPPLYSVGPVLHL-NKNTETMDDG--DVLKWLDDQPLSSVVFLCFGSRGAFKKD 302
           +  +   PP+Y VGP+LHL N+  ++ D+   ++++WLD QP SSVVFLCFGS G F ++
Sbjct: 242 SSDT---PPVYPVGPLLHLENQRDDSKDEKRLEIIRWLDQQPPSSVVFLCFGSMGGFGEE 301

Query: 303 QVEEIARALERSRVRFIWSLRRPG-NVFQSSI-DYTNFEDILPKGFLDRTENIGRVISWA 362
           QV EIA ALERS  RF+WSLRR   N+F+    ++TN E++LP+GF DRT++IG+VI WA
Sbjct: 302 QVREIAIALERSGHRFLWSLRRASPNIFKELPGEFTNLEEVLPEGFFDRTKDIGKVIGWA 361

Query: 363 PQVEILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIK 422
           PQV +L +PA GGFV+HCGWNSTLESLW GVP A WP+YAEQ+FNAF +V ELGLAVEI+
Sbjct: 362 PQVAVLANPAIGGFVTHCGWNSTLESLWFGVPTAAWPLYAEQKFNAFLMVEELGLAVEIR 421

Query: 423 ISYCIE-LKEQANPIIMAEEIERGIRKLMDNNNNEIRKKVKTKSEECRKSVIEGGSSFIS 481
             +  E L       + AEEIE+ I  LM+  ++++RK+VK  SE+C  ++++GGSS  +
Sbjct: 422 KYWRGEHLAGLPTATVTAEEIEKAIMCLME-QDSDVRKRVKDMSEKCHVALMDGGSSRTA 481

BLAST of CSPI04G22120 vs. TAIR10
Match: AT3G21780.1 (AT3G21780.1 UDP-glucosyl transferase 71B6)

HSP 1 Score: 389.8 bits (1000), Expect = 2.5e-108
Identity = 222/486 (45.68%), Positives = 315/486 (64.81%), Query Frame = 1

Query: 3   KFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYNQSLSAQSLT 62
           K ELVFIP P   HL + VEMA  L+ ++  L++T+I          +  N S+   SLT
Sbjct: 2   KIELVFIPSPAISHLMATVEMAEQLVDKNDNLSITVIIISF------SSKNTSMIT-SLT 61

Query: 63  NNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTS---TNHLAGFVLD 122
           +NN +++ ++      P        +  ++S KP V+ A+   + ++      LAGFV+D
Sbjct: 62  SNNRLRYEIISGGDQQPTELKA--TDSHIQSLKPLVRDAVAKLVDSTLPDAPRLAGFVVD 121

Query: 123 SFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNLSV 182
            +C++M+DVANEF VPSY++YTS A +L   LH++ +Y  ++  +  + +L+DSDV L V
Sbjct: 122 MYCTSMIDVANEFGVPSYLFYTSNAGFLGLLLHIQFMYDAEDIYD--MSELEDSDVELVV 181

Query: 183 PSLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTDSS 242
           PSL +  P K +P +F    +  +F  QA+R R + KG+L+NT  +LE  AL+ LS  + 
Sbjct: 182 PSLTSPYPLKCLPYIFKSKEWLTFFVTQARRFR-ETKGILVNTVPDLEPQALTFLSNGN- 241

Query: 243 LQLPPLYSVGPVLHL-NKNTETMD--DGDVLKWLDDQPLSSVVFLCFGSRGAFKKDQVEE 302
             +P  Y VGP+LHL N N + +D    ++L+WLD+QP  SVVFLCFGS G F ++QV E
Sbjct: 242 --IPRAYPVGPLLHLKNVNCDYVDKKQSEILRWLDEQPPRSVVFLCFGSMGGFSEEQVRE 301

Query: 303 IARALERSRVRFIWSLRR--PGNVFQSSIDYTNFEDILPKGFLDRTENIGRVISWAPQVE 362
            A AL+RS  RF+WSLRR  P  + +   ++TN E+ILP+GF DRT N G+VI WA QV 
Sbjct: 302 TALALDRSGHRFLWSLRRASPNILREPPGEFTNLEEILPEGFFDRTANRGKVIGWAEQVA 361

Query: 363 ILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKISYC 422
           IL  PA GGFVSH GWNSTLESLW GVPMA WP+YAEQ+FNAF++V ELGLAVEIK  + 
Sbjct: 362 ILAKPAIGGFVSHGGWNSTLESLWFGVPMAIWPLYAEQKFNAFEMVEELGLAVEIKKHWR 421

Query: 423 IELKEQANPIIMAEEIERGIRKLMDNNNNEIRKKVKTKSEECRKSVIEGGSSFISLGKFI 481
            +L    + I+ AEEIE+GI  LM+  ++++RK+V   SE+C  ++++GGSS  +L +FI
Sbjct: 422 GDLLLGRSEIVTAEEIEKGIICLME-QDSDVRKRVNEISEKCHVALMDGGSSETALKRFI 471

BLAST of CSPI04G22120 vs. TAIR10
Match: AT4G15280.1 (AT4G15280.1 UDP-glucosyl transferase 71B5)

HSP 1 Score: 379.4 bits (973), Expect = 3.3e-105
Identity = 212/482 (43.98%), Positives = 298/482 (61.83%), Query Frame = 1

Query: 3   KFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDP-KANEYNQSLSAQSL 62
           K ELVFIP+PG GHL   V++A  L+  ++RL++T+I      D   A+    SL+  S 
Sbjct: 2   KIELVFIPLPGIGHLRPTVKLAKQLIGSENRLSITIIIIPSRFDAGDASACIASLTTLSQ 61

Query: 63  TNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTSTNHLAGFVLDSF 122
            +    + I + + P   ++ +    +V +E  K  V+ A+ + +   T  LAGFV+D F
Sbjct: 62  DDRLHYESISVAKQPPT-SDPDPVPAQVYIEKQKTKVRDAVAARIVDPTRKLAGFVVDMF 121

Query: 123 CSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNLSVPS 182
           CS+M+DVANEF VP Y+ YTS A +L   LH++Q+Y Q       + +L++S   L  PS
Sbjct: 122 CSSMIDVANEFGVPCYMVYTSNATFLGTMLHVQQMYDQKKYD---VSELENSVTELEFPS 181

Query: 183 LVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTDSSLQ 242
           L    P K +P +     +      QA+  R  +KG+L+NT  ELE HAL   + +    
Sbjct: 182 LTRPYPVKCLPHILTSKEWLPLSLAQARCFR-KMKGILVNTVAELEPHALKMFNINGD-D 241

Query: 243 LPPLYSVGPVLHL-NKNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKKDQVEEIARA 302
           LP +Y VGPVLHL N N +     ++L+WLD+QP  SVVFLCFGS G F ++Q  E A A
Sbjct: 242 LPQVYPVGPVLHLENGNDDDEKQSEILRWLDEQPSKSVVFLCFGSLGGFTEEQTRETAVA 301

Query: 303 LERSRVRFIWSLRR--PGNVFQSSIDYTNFEDILPKGFLDRTENIGRVISWAPQVEILGH 362
           L+RS  RF+W LR   P        DYTN E++LP+GFL+RT + G+VI WAPQV +L  
Sbjct: 302 LDRSGQRFLWCLRHASPNIKTDRPRDYTNLEEVLPEGFLERTLDRGKVIGWAPQVAVLEK 361

Query: 363 PATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKISYCIELK 422
           PA GGFV+HCGWNS LESLW GVPM TWP+YAEQ+ NAF++V ELGLAVEI+     +L 
Sbjct: 362 PAIGGFVTHCGWNSILESLWFGVPMVTWPLYAEQKVNAFEMVEELGLAVEIRKYLKGDLF 421

Query: 423 EQANPIIMAEEIERGIRKLMDNNNNEIRKKVKTKSEECRKSVIEGGSSFISLGKFIDDVL 481
                 + AE+IER IR++M+  ++++R  VK  +E+C  ++++GGSS  +L KFI DV+
Sbjct: 422 AGEMETVTAEDIERAIRRVME-QDSDVRNNVKEMAEKCHFALMDGGSSKAALEKFIQDVI 476

BLAST of CSPI04G22120 vs. TAIR10
Match: AT3G21750.1 (AT3G21750.1 UDP-glucosyl transferase 71B1)

HSP 1 Score: 377.1 bits (967), Expect = 1.6e-104
Identity = 204/488 (41.80%), Positives = 305/488 (62.50%), Query Frame = 1

Query: 3   KFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYNQSLSAQSLT 62
           K ELVFIP PG GH+ +   +A  L+A D+RL+VT+I     +   A+      S+    
Sbjct: 2   KVELVFIPSPGVGHIRATTALAKLLVASDNRLSVTLIVIPSRVSDDAS------SSVYTN 61

Query: 63  NNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFL-----TTSTNHLAGFV 122
           + + +++I+LP      +      L   ++S KP V+ A++S +     T S + LAG V
Sbjct: 62  SEDRLRYILLPARDQTTD------LVSYIDSQKPQVR-AVVSKVAGDVSTRSDSRLAGIV 121

Query: 123 LDSFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNL 182
           +D FC++M+D+A+EF + +Y++YTS A+YL    H++ LY +       + + KD+++  
Sbjct: 122 VDMFCTSMIDIADEFNLSAYIFYTSNASYLGLQFHVQSLYDEKELD---VSEFKDTEMKF 181

Query: 183 SVPSLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLS-T 242
            VP+L    P+K +PSV     +  +   +A+  R   KG+L+N+  ++E  ALS  S  
Sbjct: 182 DVPTLTQPFPAKCLPSVMLNKKWFPYVLGRARSFR-ATKGILVNSVADMEPQALSFFSGG 241

Query: 243 DSSLQLPPLYSVGPVLHLNKNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKKDQVEE 302
           + +  +PP+Y+VGP++ L  + +     ++L WL +QP  SVVFLCFGS G F ++Q  E
Sbjct: 242 NGNTNIPPVYAVGPIMDLESSGDEEKRKEILHWLKEQPTKSVVFLCFGSMGGFSEEQARE 301

Query: 303 IARALERSRVRFIWSLRRPGNVFQSSI----DYTNFEDILPKGFLDRTENIGRVISWAPQ 362
           IA ALERS  RF+WSLRR   V   S     ++TN E+ILPKGFLDRT  IG++ISWAPQ
Sbjct: 302 IAVALERSGHRFLWSLRRASPVGNKSNPPPGEFTNLEEILPKGFLDRTVEIGKIISWAPQ 361

Query: 363 VEILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKIS 422
           V++L  PA G FV+HCGWNS LESLW GVPMA WP+YAEQQFNAF +V ELGLA E+K  
Sbjct: 362 VDVLNSPAIGAFVTHCGWNSILESLWFGVPMAAWPIYAEQQFNAFHMVDELGLAAEVKKE 421

Query: 423 YCIELKEQANPIIMAEEIERGIRKLMDNNNNEIRKKVKTKSEECRKSVIEGGSSFISLGK 481
           Y  +   +   I+ A+EIERGI+  M+  ++++RK+V    ++   ++++GGSS  +L K
Sbjct: 422 YRRDFLVEEPEIVTADEIERGIKCAME-QDSKMRKRVMEMKDKLHVALVDGGSSNCALKK 471

BLAST of CSPI04G22120 vs. NCBI nr
Match: gi|449456659|ref|XP_004146066.1| (PREDICTED: anthocyanidin 3-O-glucosyltransferase 2 [Cucumis sativus])

HSP 1 Score: 963.0 bits (2488), Expect = 2.0e-277
Identity = 483/486 (99.38%), Positives = 484/486 (99.59%), Query Frame = 1

Query: 1   MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYNQSLSAQS 60
           MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEY QSLSAQS
Sbjct: 1   MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYIQSLSAQS 60

Query: 61  LTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTSTNHLAGFVLDS 120
           LTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTSTNHLAGFVLDS
Sbjct: 61  LTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTSTNHLAGFVLDS 120

Query: 121 FCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNLSVP 180
           FCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNLSVP
Sbjct: 121 FCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNLSVP 180

Query: 181 SLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTDSSL 240
           SLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTDSSL
Sbjct: 181 SLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTDSSL 240

Query: 241 QLPPLYSVGPVLHLNKNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKKDQVEEIARA 300
           QLPPLYSVGPVLHLNKNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKKDQVEEIARA
Sbjct: 241 QLPPLYSVGPVLHLNKNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKKDQVEEIARA 300

Query: 301 LERSRVRFIWSLRRPGNVFQSSIDYTNFEDILPKGFLDRTENIGRVISWAPQVEILGHPA 360
           LERSRVRFIWSLRRPGNVFQSSIDYTNFEDILPKGFLDRT+NIGRVISWAPQVEILGHPA
Sbjct: 301 LERSRVRFIWSLRRPGNVFQSSIDYTNFEDILPKGFLDRTQNIGRVISWAPQVEILGHPA 360

Query: 361 TGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKISYCIELKEQ 420
           TGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKISYCIELKEQ
Sbjct: 361 TGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKISYCIELKEQ 420

Query: 421 ANPIIMAEEIERGIRKLMDNNNNEIRKKVKTKSEECRKSVIEGGSSFISLGKFIDDVLSN 480
           ANPIIMAEEIERGIRKLMDNN NEIRKKVKTKSEECRKSVIEGGSSFISLGKFIDDVLSN
Sbjct: 421 ANPIIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRKSVIEGGSSFISLGKFIDDVLSN 480

Query: 481 STTGGN 487
           STTGGN
Sbjct: 481 STTGGN 486

BLAST of CSPI04G22120 vs. NCBI nr
Match: gi|659129340|ref|XP_008464637.1| (PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis melo])

HSP 1 Score: 837.0 bits (2161), Expect = 1.6e-239
Identity = 427/488 (87.50%), Positives = 447/488 (91.60%), Query Frame = 1

Query: 1   MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYNQSLSAQS 60
           MKKFELVFIPIPGSGHLASM EMAN+LLARDHRLAVTMIA KLPLD K NEY QSL AQS
Sbjct: 1   MKKFELVFIPIPGSGHLASMFEMANSLLARDHRLAVTMIAIKLPLDAKVNEYIQSLYAQS 60

Query: 61  LTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALISFLTTSTNHLAGFVLDS 120
           LTNN SI+FI+LPELP  PN+ N+ F EVVLESYKPHVKQALISFLTTSTNHL GFVLDS
Sbjct: 61  LTNN-SIKFIILPELPPPPNDENKIFFEVVLESYKPHVKQALISFLTTSTNHLVGFVLDS 120

Query: 121 FCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNLSVP 180
           FC TMVDVANEFKVPSYVYYTS AAYLAFSLHLEQLYTQDNSSNEVIQQ KDS+VN SV 
Sbjct: 121 FCLTMVDVANEFKVPSYVYYTSSAAYLAFSLHLEQLYTQDNSSNEVIQQSKDSNVNFSVS 180

Query: 181 SLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTDSSL 240
           SLVNQVPSK IPSVFFINNFAVWFHEQAKRIRFDVKGVLINTF+ELESH +SSLSTDSSL
Sbjct: 181 SLVNQVPSKVIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFDELESHVISSLSTDSSL 240

Query: 241 QLPPLYSVGPVLHLNKNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKKDQVEEIARA 300
           QLPPLY VGP+LHLNKNTETMDD  VLKWLDDQPL SVVFLCFGSRGAF+KDQVEEIARA
Sbjct: 241 QLPPLYPVGPILHLNKNTETMDDRVVLKWLDDQPLQSVVFLCFGSRGAFQKDQVEEIARA 300

Query: 301 LERSRVRFIWSLRRP-GNVFQSSIDYTNFEDILPKGFLDRTENIGRVISWAPQVEILGHP 360
           LERSRVRFIWSLRRP G+VFQSSIDYTNFEDILP+GFLDRT+NIGRVI WAPQVEILGHP
Sbjct: 301 LERSRVRFIWSLRRPSGDVFQSSIDYTNFEDILPEGFLDRTKNIGRVIKWAPQVEILGHP 360

Query: 361 ATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIKISYCIELKE 420
             GGFVSHCGWNSTLESLW+G+PMATWPMYAEQQFNAF+LVVELGLAVEI I Y  +LKE
Sbjct: 361 TIGGFVSHCGWNSTLESLWYGIPMATWPMYAEQQFNAFELVVELGLAVEITIDYQNDLKE 420

Query: 421 QANP-IIMAEEIERGIRKLMDNNNNEIRKKVKTKSEECRKSVIEGGSSFISLGKFIDDVL 480
              P I+ AEEIE+GIRKLMD+NNNEIRKKVKTKSEECRKSVIEGGSSFISLGKFIDDVL
Sbjct: 421 LDKPRILSAEEIEKGIRKLMDDNNNEIRKKVKTKSEECRKSVIEGGSSFISLGKFIDDVL 480

Query: 481 SNSTTGGN 487
            NS  G N
Sbjct: 481 INSPRGAN 487

BLAST of CSPI04G22120 vs. NCBI nr
Match: gi|343466221|gb|AEM43004.1| (UDP-glucosyltransferase [Siraitia grosvenorii])

HSP 1 Score: 566.2 bits (1458), Expect = 5.4e-158
Identity = 292/497 (58.75%), Positives = 373/497 (75.05%), Query Frame = 1

Query: 1   MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYNQSLSAQS 60
           MKKFELVFIP+P  GHLA+MVEMAN L+ RD RL VT++  KLPL  K  EY QSLSA  
Sbjct: 1   MKKFELVFIPLPVMGHLAAMVEMANILVTRDQRLTVTILVIKLPLYGKTAEYIQSLSASF 60

Query: 61  LTNNNSIQFIVLPELPDIPNNGNRFFLEVVLESYKPHVKQALI----SFLTTSTNHLAGF 120
            +   S++FI+LPE+     +   F L+  LESYKP +++A+I    S +   +  LAGF
Sbjct: 61  ASE--SMRFIILPEVLLPEESEKEFMLKAFLESYKPIIREAIIDLTDSQMGPDSPRLAGF 120

Query: 121 VLDSFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVN 180
           VLD FC+TM+DVANEF VPSYV+ TS A +LA S HL++LY  +N+S EV++QL++S+  
Sbjct: 121 VLDMFCTTMIDVANEFGVPSYVFCTSNAGFLALSFHLQELY-DENNSKEVVKQLQNSNAE 180

Query: 181 LSVPSLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLST 240
           +++PS VN +P K IP +F  ++ A WFH+Q +R R  VKG+LINTF +LESH ++S+S 
Sbjct: 181 IALPSFVNPIPGKMIPDIFSNDDTASWFHDQVERYRSGVKGILINTFAKLESHVMNSMSR 240

Query: 241 DSSLQLPPLYSVGPVLHLNKNTETMDDG------DVLKWLDDQPLSSVVFLCFGSRGAFK 300
            SS + PPLYS+GP+LHL KN  T+  G      D+LKWLD+QP  SVVFLCFGS G+F 
Sbjct: 241 SSSSRAPPLYSIGPILHL-KNNNTVGPGGTLHCTDILKWLDNQPPVSVVFLCFGSMGSFD 300

Query: 301 KDQVEEIARALERSRVRFIWSLRRPG--NVFQSSIDYTNFEDILPKGFLDRTENIGRVIS 360
           +DQV+EIA ALERS VRF+WSLR+P   + F++  +YT+ + +LP+GFL+RT  IGRVI 
Sbjct: 301 EDQVKEIAHALERSGVRFLWSLRQPPPKDKFEAPSEYTDIKYVLPEGFLERTAGIGRVIG 360

Query: 361 WAPQVEILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVE 420
           WAPQVEIL HPATGGFVSHCGWNSTLES+WHGVPMATWP+YAEQQF AF++VVELGLAV+
Sbjct: 361 WAPQVEILAHPATGGFVSHCGWNSTLESMWHGVPMATWPLYAEQQFTAFEMVVELGLAVD 420

Query: 421 IKISYCIELKEQANPIIMAEEIERGIRKLMDNNNNEIRKKVKTKSEECRKSVIEGGSSFI 480
           I + Y      + + ++ AEEI+ GIRKLM+    E+RKKVK KSEE RKS++EGGSSFI
Sbjct: 421 ITLDYQKHPHGERSRVVSAEEIQSGIRKLME-EGGEMRKKVKAKSEESRKSLMEGGSSFI 480

Query: 481 SLGKFIDDVLSNSTTGG 486
           SLG+FIDDVL N   GG
Sbjct: 481 SLGRFIDDVLGNGPEGG 492

BLAST of CSPI04G22120 vs. NCBI nr
Match: gi|659129348|ref|XP_008464641.1| (PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis melo])

HSP 1 Score: 553.1 bits (1424), Expect = 4.7e-154
Identity = 287/494 (58.10%), Positives = 363/494 (73.48%), Query Frame = 1

Query: 1   MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKANEYNQSLSAQS 60
           M KFELVFIP PG GHLAS VE+AN L +RD RL+VT++A KLP D K  E  QSLSA  
Sbjct: 1   MNKFELVFIPGPGIGHLASTVELANVLASRDDRLSVTVLAIKLPNDIKTTERIQSLSASF 60

Query: 61  LTNNNSIQFIVLPELPDIPNNGNR---FFLEVVLESYKPHVKQALISFLTTSTNHLAGFV 120
                SI+FIVLPELP  PN  +      L+  LES+KPHV++ +++ LT  +N L GFV
Sbjct: 61  --EGKSIRFIVLPELP-FPNQSSTPPPLMLQAFLESHKPHVRE-IVTNLTYDSNRLVGFV 120

Query: 121 LDSFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVNL 180
           +D FC++M++VANEFKVP Y++YTS A +LAFS HL++LY Q+NS+ E   QL++S+V L
Sbjct: 121 IDMFCTSMINVANEFKVPCYLFYTSNAGFLAFSFHLQELYNQNNSTGE---QLQNSNVEL 180

Query: 181 SVPSLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLSTD 240
           ++PS +N +PSK IP   F  + AVWFH+  KR R  VKG+LINTF E+E   +  +S  
Sbjct: 181 ALPSFINPIPSKAIPPFLFDKDMAVWFHDNTKRFRSGVKGILINTFVEMEPQMIKWMSNG 240

Query: 241 SSLQLPPLYSVGPVLHLNKNTET-----MDDGDVLKWLDDQPLSSVVFLCFGSRGAFKKD 300
           SS ++P +Y+VGP+L L     T     ++  D+LKWLDDQP +SVVFLCFGS+G+F +D
Sbjct: 241 SS-KIPKVYTVGPILQLKSIGVTQCNNALNGADILKWLDDQPPASVVFLCFGSKGSFDED 300

Query: 301 QVEEIARALERSRVRFIWSLRRPG--NVFQSSIDYTNFEDILPKGFLDRTENIGRVISWA 360
           QV EIARALERS VRFIWSLR+P     F+   +Y +  D+LP+GFL+RT +IGRVI WA
Sbjct: 301 QVLEIARALERSEVRFIWSLRQPPPKGKFEEPSNYADINDVLPEGFLNRTADIGRVIGWA 360

Query: 361 PQVEILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEIK 420
           PQ+EIL HPATGGF+SHCGWNSTLES+WHGVPMATWP+YAEQQFNAF++VVELGLAVE+ 
Sbjct: 361 PQIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELT 420

Query: 421 ISYCIELKEQANPIIMAEEIERGIRKLMDNNNNEIRKKVKTKSEECRKSVIEGGSSFISL 480
           + Y  +     + ++ AEEIE GIRKLM +  NEIRKKVK K EE RKS++ GGSSF SL
Sbjct: 421 LDYVKDFHIGRSRVVSAEEIESGIRKLMGDYGNEIRKKVKVKGEESRKSMMVGGSSFNSL 480

Query: 481 GKFIDDVLSNSTTG 485
             FIDD L+N   G
Sbjct: 481 DHFIDDALANLEEG 486

BLAST of CSPI04G22120 vs. NCBI nr
Match: gi|449456653|ref|XP_004146063.1| (PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis sativus])

HSP 1 Score: 548.5 bits (1412), Expect = 1.2e-152
Identity = 283/495 (57.17%), Positives = 365/495 (73.74%), Query Frame = 1

Query: 1   MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKAN-EYNQSLSAQ 60
           M KFELVFIP PG GHLAS VE+AN L++RD RL+VT++A KLP D K   E  QSLSA 
Sbjct: 1   MNKFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKTTTERIQSLSAS 60

Query: 61  SLTNNNSIQFIVLPELPDIPNNGNR---FFLEVVLESYKPHVKQALISFLTTSTNHLAGF 120
                 SI+FIVLPELP  PN  +      L+  LES+KPHV++ +++ L   +N L GF
Sbjct: 61  F--EGKSIRFIVLPELP-FPNQSSEPPPLMLQAFLESHKPHVRE-IVTNLIHDSNRLVGF 120

Query: 121 VLDSFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVN 180
           V+D FC++M++VANEFKVP Y++YTS A +L FS HL++LY Q+NS+ E   QL++S+V 
Sbjct: 121 VIDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAE---QLQNSNVE 180

Query: 181 LSVPSLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLST 240
           L++PS +N +P+K IP   F  + A WFH+  KR R +VKG+LINTF E+E   +  +S 
Sbjct: 181 LALPSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSN 240

Query: 241 DSSLQLPPLYSVGPVLHLN-----KNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKK 300
            SS ++P +Y+VGP+L L      ++   ++  D+LKWLDDQP +SVVFLCFGS+G+F +
Sbjct: 241 GSS-KIPKVYTVGPILQLKSIGVTQSNNALNGADILKWLDDQPPASVVFLCFGSKGSFDE 300

Query: 301 DQVEEIARALERSRVRFIWSLRRPG--NVFQSSIDYTNFEDILPKGFLDRTENIGRVISW 360
           DQV EIARALERS VRF+WSLR+P     F+   +Y N  D+LP+GFL+RT +IGRVI W
Sbjct: 301 DQVLEIARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGW 360

Query: 361 APQVEILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEI 420
           APQ+EIL HPATGGF+SHCGWNSTLES+WHGVPMATWP+YAEQQFNAF++VVELGLAVE+
Sbjct: 361 APQIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVEL 420

Query: 421 KISYCIELKEQANPIIMAEEIERGIRKLMDNNNNEIRKKVKTKSEECRKSVIEGGSSFIS 480
            + Y  +     + I+ AEEIE GIRKLM ++ NEIRKK+K K EE RKS++EGGSSF S
Sbjct: 421 TLDYVKDFHIGRSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKSMMEGGSSFNS 480

Query: 481 LGKFIDDVLSNSTTG 485
           L  FIDD L+N   G
Sbjct: 481 LRHFIDDALTNLQEG 487
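
The percentages on each HSP summary line above are the identity and positive counts divided by the alignment length. The following is a minimal sketch, not part of the original record, of how such a line could be parsed and its percentages recomputed. The function name parse_hsp_stats and the returned dictionary keys are hypothetical and chosen only for illustration.

```python
import re

# Pattern for the HSP summary lines shown in the alignments above.
# "Posi?tives" also accepts the "Postives" spelling that some exports
# of this page contain.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP summary line and recompute its percentages.

    Hypothetical helper: the name and return layout are assumptions.
    """
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    ident, length = int(ident), int(length)
    pos, pos_length = int(pos), int(pos_length)
    return {
        "identities": ident,
        "positives": pos,
        "length": length,
        # Recomputed from the raw counts, rounded to two decimals.
        "identity_pct": round(100 * ident / length, 2),
        "positive_pct": round(100 * pos / pos_length, 2),
        # Values as printed in the record, for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 483/486 (99.38%), Positives = 484/486 (99.59%), Query Frame = 1"
    )
    print(stats["identity_pct"], stats["positive_pct"])
```

Comparing the recomputed and reported values is a quick consistency check when these summary lines are copied between tools.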

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
UFOG3_FRAAN | 2.5e-123 | 49.38 | Putative UDP-glucose flavonoid 3-O-glucosyltransferase 3 OS=Fragaria ananassa GN...
UFOG6_FRAAN | 8.4e-119 | 47.65 | UDP-glucose flavonoid 3-O-glucosyltransferase 6 OS=Fragaria ananassa GN=GT6 PE=1...
U7A16_PYRCO | 2.5e-118 | 47.95 | UDP-glycosyltransferase 71A16 OS=Pyrus communis GN=UGT71A16 PE=1 SV=1
U7A15_MALDO | 7.2e-118 | 48.36 | UDP-glycosyltransferase 71A15 OS=Malus domestica GN=UGT71A15 PE=1 SV=1
U71E1_STERE | 1.1e-113 | 47.45 | UDP-glycosyltransferase 71E1 OS=Stevia rebaudiana GN=UGT71E1 PE=2 SV=1

Match Name | E-value | Identity | Description
A0A0A0L341_CUCSA | 1.4e-277 | 99.38 | Glycosyltransferase OS=Cucumis sativus GN=Csa_4G620550 PE=3 SV=1
K7NBW4_SIRGR | 3.8e-158 | 58.75 | Glycosyltransferase OS=Siraitia grosvenorii GN=UDPG7 PE=2 SV=1
A0A0A0L321_CUCSA | 8.1e-153 | 57.17 | Glycosyltransferase OS=Cucumis sativus GN=Csa_4G618520 PE=3 SV=1
A0A067FE19_CITSI | 9.0e-136 | 53.36 | Glycosyltransferase OS=Citrus sinensis GN=CISIN_1g045029mg PE=3 SV=1
V4SQP2_9ROSI | 6.5e-134 | 52.95 | Glycosyltransferase OS=Citrus clementina GN=CICLE_v10011544mg PE=3 SV=1

Match Name | E-value | Identity | Description
AT3G21760.1 | 5.6e-113 | 46.86 | UDP-Glycosyltransferase superfamily protein
AT3G21790.1 | 1.9e-108 | 45.82 | UDP-Glycosyltransferase superfamily protein
AT3G21780.1 | 2.5e-108 | 45.68 | UDP-glucosyl transferase 71B6
AT4G15280.1 | 3.3e-105 | 43.98 | UDP-glucosyl transferase 71B5
AT3G21750.1 | 1.6e-104 | 41.80 | UDP-glucosyl transferase 71B1

Match Name | E-value | Identity | Description
gi|449456659|ref|XP_004146066.1| | 2.0e-277 | 99.38 | PREDICTED: anthocyanidin 3-O-glucosyltransferase 2 [Cucumis sativus]
gi|659129340|ref|XP_008464637.1| | 1.6e-239 | 87.50 | PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis melo]
gi|343466221|gb|AEM43004.1| | 5.4e-158 | 58.75 | UDP-glucosyltransferase [Siraitia grosvenorii]
gi|659129348|ref|XP_008464641.1| | 4.7e-154 | 58.10 | PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis melo]
gi|449456653|ref|XP_004146063.1| | 1.2e-152 | 57.17 | PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis sativus]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term | Definition
IPR002213 | UDP_glucos_trans
Vocabulary: Molecular Function
Term | Definition
GO:0016758 | transferase activity, transferring hexosyl groups
Vocabulary: Biological Process
Term | Definition
GO:0008152 | metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016758 transferase activity, transferring hexosyl groups

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CSPI04G22120.1 | CSPI04G22120.1 | mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002213 | UDP-glucuronosyl/UDP-glucosyltransferase | PANTHER | PTHR11926 | GLUCOSYL/GLUCURONOSYL TRANSFERASES | coord: 1..480 score: 3.6E
IPR002213 | UDP-glucuronosyl/UDP-glucosyltransferase | PFAM | PF00201 | UDPGT | coord: 10..408 score: 1.2
IPR002213 | UDP-glucuronosyl/UDP-glucosyltransferase | PROSITE | PS00375 | UDPGT | coord: 349..392 scor
None | No IPR available | unknown | Coil | Coil | coord: 427..447 scor
None | No IPR available | GENE3D | G3DSA:3.40.50.2000 | | coord: 263..411 score: 1.0
None | No IPR available | PANTHER | PTHR11926:SF242 | UDP-GLYCOSYLTRANSFERASE 71B2-RELATED | coord: 1..480 score: 3.6E
None | No IPR available | unknown | SSF53756 | UDP-Glycosyltransferase/glycogen phosphorylase | coord: 2..478 score: 2.51E

The following gene(s) are orthologous to this gene:
Gene | Orthologue | Organism | Block
CSPI04G22120 | MELO3C026366.2 | Melon (DHL92) v3.6.1 | cpimedB308
The following gene(s) are paralogous to this gene:

None