Csa4G618520 (gene) Cucumber (Chinese Long) v2

Name: Csa4G618520
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: UDP-glycosyltransferase 1; contains IPR002213 (UDP-glucuronosyl/UDP-glucosyltransferase)
Location: Chr4 : 19796415 .. 19798047 (-)
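
The location field can be turned into a feature length with a little arithmetic. The sketch below is only an illustration: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which is a common genome-browser convention but is not stated in this record.

```python
import re

def parse_location(text):
    """Split a string such as 'Chr4 : 19796415 .. 19798047 (-)' into
    (chromosome, start, end, strand)."""
    m = re.search(r"(\w+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError("unrecognized location string: %r" % text)
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("Chr4 : 19796415 .. 19798047 (-)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the feature spans 1633 bases on the minus strand of Chr4.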
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
CAGGCTTGAAGATCCAAAAGATAAGACGCTAATTATCTTTTGAATCAAATGAACAAGTTTGAGTTAGTTTTCATACCTGGGCCGGGGATCGGCCATCTTGCATCCACGGTCGAGCTGGCAAATGTTCTTGTTAGCCGAGATGACCGTCTCTCTGTGACTGTGCTCGCCATCAAGCTTCCCAATGACATCAAAACGACGACGGAACGTATTCAGTCACTTTCAGCGTCTTTCGAGGGTAAATCTATACGCTTTATTGTTCTTCCTGAACTTCCCTTCCCAAACCAAAGTAGTGAACCTCCCCCCCTTATGCTGCAAGCATTCCTTGAAAGCCACAAGCCCCATGTGAGGGAAATTGTGACCAACTTAATTCATGACTCGAACCGTCTTGTCGGATTTGTCATTGATATGTTTTGCACCAGTATGATAAATGTGGCTAATGAATTTAAGGTTCCTTGTTATTTGTTTTACACATCCAATGCTGGCTTTCTTGATTTTAGCTTTCATCTCCAAGAGCTTTACAATCAAAACAATAGCACAGCAGAACAGTTGCAAAATTCAAATGTTGAGTTAGCTCTGCCAAGTTTTATCAATCCAATTCCTAATAAAGCCATCCCCCCCTTCTTATTTGATAAAGACATGGCTGCTTGGTTTCATGATAATACTAAAAGATTTAGATCAGAAGTCAAAGGTATTTTGATCAATACATTTGTAGAGATGGAACCACAAATAGTCAAATGGATGTCAAATGGCTCTTCGAAAATCCCAAAGGTGTATACTGTTGGACCCATTTTGCAGTTGAAGAGTATTGGTGTTACACAAAGCAACAATGCTCTAAATGGTGCAGATATACTAAAGTGGCTAGATGATCAACCTCCAGCATCAGTGGTTTTCCTATGCTTTGGTAGCAAAGGAAGCTTTGACGAGGATCAAGTGCTAGAGATTGCTCGAGCACTGGAGCGAAGTGAGGTTCGCTTCTTATGGTCCCTTCGACAGCCCCCACCAAAGGGTAAGTTTGAAGAGCCAAGCAACTATGCTAACATCAACGACGTCCTACCTGAGGGATTTCTCAACCGAACAGCCGATATTGGAAGGGTCATCGGGTGGGCACCACAAATAGAAATATTGTCCCATCCTGCCACTGGAGGATTCATATCACATTGTGGTTGGAATTCAACGTTGGAGAGTGTATGGCATGGTGTACCAATGGCAACATGGCCATTGTATGCCGAACAACAATTTAACGCATTTGAAATGGTGGTGGAATTAGGATTGGCAGTGGAGCTCACATTAGACTACGTGAAGGATTTTCATATAGGAAGGTCGAGAATAGTGAGTGCAGAAGAGATAGAAAGCGGGATCAGAAAATTGATGGGCGATTCTGGTAATGAGATCAGGAAGAAAATCAAAGTAAAAGGTGAAGAAAGTCGAAAAAGTATGATGGAAGGTGGATCCTCCTTCAATTCATTACGTCATTTCATTGATGATGCTTTGACTAACTTACAAGAGGGCAACTACTAATGCCATTTTGGTTTCATATAATTCATGAAAAAATGTCTGTAACAGTGAAGGAGAGATTTTTGAAATAAACAGAATATTTAGTGGTAAAGCTTTTCTTTTTGTTGGAATAAACCCC

mRNA sequence

ATGAACAAGTTTGAGTTAGTTTTCATACCTGGGCCGGGGATCGGCCATCTTGCATCCACGGTCGAGCTGGCAAATGTTCTTGTTAGCCGAGATGACCGTCTCTCTGTGACTGTGCTCGCCATCAAGCTTCCCAATGACATCAAAACGACGACGGAACGTATTCAGTCACTTTCAGCGTCTTTCGAGGGTAAATCTATACGCTTTATTGTTCTTCCTGAACTTCCCTTCCCAAACCAAAGTAGTGAACCTCCCCCCCTTATGCTGCAAGCATTCCTTGAAAGCCACAAGCCCCATGTGAGGGAAATTGTGACCAACTTAATTCATGACTCGAACCGTCTTGTCGGATTTGTCATTGATATGTTTTGCACCAGTATGATAAATGTGGCTAATGAATTTAAGGTTCCTTGTTATTTGTTTTACACATCCAATGCTGGCTTTCTTGATTTTAGCTTTCATCTCCAAGAGCTTTACAATCAAAACAATAGCACAGCAGAACAGTTGCAAAATTCAAATGTTGAGTTAGCTCTGCCAAGTTTTATCAATCCAATTCCTAATAAAGCCATCCCCCCCTTCTTATTTGATAAAGACATGGCTGCTTGGTTTCATGATAATACTAAAAGATTTAGATCAGAAGTCAAAGGTATTTTGATCAATACATTTGTAGAGATGGAACCACAAATAGTCAAATGGATGTCAAATGGCTCTTCGAAAATCCCAAAGGTGTATACTGTTGGACCCATTTTGCAGTTGAAGAGTATTGGTGTTACACAAAGCAACAATGCTCTAAATGGTGCAGATATACTAAAGTGGCTAGATGATCAACCTCCAGCATCAGTGGTTTTCCTATGCTTTGGTAGCAAAGGAAGCTTTGACGAGGATCAAGTGCTAGAGATTGCTCGAGCACTGGAGCGAAGTGAGGTTCGCTTCTTATGGTCCCTTCGACAGCCCCCACCAAAGGGTAAGTTTGAAGAGCCAAGCAACTATGCTAACATCAACGACGTCCTACCTGAGGGATTTCTCAACCGAACAGCCGATATTGGAAGGGTCATCGGGTGGGCACCACAAATAGAAATATTGTCCCATCCTGCCACTGGAGGATTCATATCACATTGTGGTTGGAATTCAACGTTGGAGAGTGTATGGCATGGTGTACCAATGGCAACATGGCCATTGTATGCCGAACAACAATTTAACGCATTTGAAATGGTGGTGGAATTAGGATTGGCAGTGGAGCTCACATTAGACTACGTGAAGGATTTTCATATAGGAAGGTCGAGAATAGTGAGTGCAGAAGAGATAGAAAGCGGGATCAGAAAATTGATGGGCGATTCTGGTAATGAGATCAGGAAGAAAATCAAAGTAAAAGGTGAAGAAAGTCGAAAAAGTATGATGGAAGGTGGATCCTCCTTCAATTCATTACGTCATTTCATTGATGATGCTTTGACTAACTTACAAGAGGGCAACTACTAA

Coding sequence (CDS)

ATGAACAAGTTTGAGTTAGTTTTCATACCTGGGCCGGGGATCGGCCATCTTGCATCCACGGTCGAGCTGGCAAATGTTCTTGTTAGCCGAGATGACCGTCTCTCTGTGACTGTGCTCGCCATCAAGCTTCCCAATGACATCAAAACGACGACGGAACGTATTCAGTCACTTTCAGCGTCTTTCGAGGGTAAATCTATACGCTTTATTGTTCTTCCTGAACTTCCCTTCCCAAACCAAAGTAGTGAACCTCCCCCCCTTATGCTGCAAGCATTCCTTGAAAGCCACAAGCCCCATGTGAGGGAAATTGTGACCAACTTAATTCATGACTCGAACCGTCTTGTCGGATTTGTCATTGATATGTTTTGCACCAGTATGATAAATGTGGCTAATGAATTTAAGGTTCCTTGTTATTTGTTTTACACATCCAATGCTGGCTTTCTTGATTTTAGCTTTCATCTCCAAGAGCTTTACAATCAAAACAATAGCACAGCAGAACAGTTGCAAAATTCAAATGTTGAGTTAGCTCTGCCAAGTTTTATCAATCCAATTCCTAATAAAGCCATCCCCCCCTTCTTATTTGATAAAGACATGGCTGCTTGGTTTCATGATAATACTAAAAGATTTAGATCAGAAGTCAAAGGTATTTTGATCAATACATTTGTAGAGATGGAACCACAAATAGTCAAATGGATGTCAAATGGCTCTTCGAAAATCCCAAAGGTGTATACTGTTGGACCCATTTTGCAGTTGAAGAGTATTGGTGTTACACAAAGCAACAATGCTCTAAATGGTGCAGATATACTAAAGTGGCTAGATGATCAACCTCCAGCATCAGTGGTTTTCCTATGCTTTGGTAGCAAAGGAAGCTTTGACGAGGATCAAGTGCTAGAGATTGCTCGAGCACTGGAGCGAAGTGAGGTTCGCTTCTTATGGTCCCTTCGACAGCCCCCACCAAAGGGTAAGTTTGAAGAGCCAAGCAACTATGCTAACATCAACGACGTCCTACCTGAGGGATTTCTCAACCGAACAGCCGATATTGGAAGGGTCATCGGGTGGGCACCACAAATAGAAATATTGTCCCATCCTGCCACTGGAGGATTCATATCACATTGTGGTTGGAATTCAACGTTGGAGAGTGTATGGCATGGTGTACCAATGGCAACATGGCCATTGTATGCCGAACAACAATTTAACGCATTTGAAATGGTGGTGGAATTAGGATTGGCAGTGGAGCTCACATTAGACTACGTGAAGGATTTTCATATAGGAAGGTCGAGAATAGTGAGTGCAGAAGAGATAGAAAGCGGGATCAGAAAATTGATGGGCGATTCTGGTAATGAGATCAGGAAGAAAATCAAAGTAAAAGGTGAAGAAAGTCGAAAAAGTATGATGGAAGGTGGATCCTCCTTCAATTCATTACGTCATTTCATTGATGATGCTTTGACTAACTTACAAGAGGGCAACTACTAA

Protein sequence

MNKFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKTTTERIQSLSASFEGKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIHDSNRLVGFVIDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELALPSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNGSSKIPKVYTVGPILQLKSIGVTQSNNALNGADILKWLDDQPPASVVFLCFGSKGSFDEDQVLEIARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAPQIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLDYVKDFHIGRSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKSMMEGGSSFNSLRHFIDDALTNLQEGNY*
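
The relationship between the coding sequence and the protein sequence above can be checked by translating codons. This is a minimal sketch, assuming the standard genetic code; the `translate` helper and the `CODON_TABLE` name are hypothetical and not part of the database, and only the first and last few codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop).
    Trailing bases that do not fill a codon are ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First seven codons and last four codons of the CDS listed above.
print(translate("ATGAACAAGTTTGAGTTAGTT"))  # start of the protein
print(translate("GGCAACTACTAA"))           # end of the protein, including the stop
```

The two fragments translate to `MNKFELV` and `GNY*`, which agree with the start and end of the listed protein sequence.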
BLAST of Csa4G618520 vs. Swiss-Prot
Match: U7A16_PYRCO (UDP-glycosyltransferase 71A16 OS=Pyrus communis GN=UGT71A16 PE=1 SV=1)

HSP 1 Score: 477.2 bits (1227), Expect = 2.1e-133
Identity = 252/483 (52.17%), Positives = 333/483 (68.94%), Query Frame = 1

Query: 5   ELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKTTTERIQSLSASFEGK 64
           +LVF+P PGIGH+ STVE+A  LV+RDD+L +TVL +KLP D +  T    S+S      
Sbjct: 6   QLVFVPAPGIGHIVSTVEMAKQLVARDDQLFITVLVMKLPYD-QPFTNTDSSIS-----H 65

Query: 65  SIRFIVLPELPFPNQSSEPPP-LMLQAFLESHKPHVREIVTNLIHDSN--------RLVG 124
            I F+ LPE     Q + P P    + F+E+HK HVR+ V NL+ +S+        RL G
Sbjct: 66  RINFVNLPEAQLDKQDTVPNPGSFFRMFVENHKTHVRDAVINLLPESDQSESTSKPRLAG 125

Query: 125 FVIDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELA 184
           FV+DMF  S+I+VANEF+VP Y+F+TSN+  L    H Q L ++      +L +S  ELA
Sbjct: 126 FVLDMFSASLIDVANEFEVPSYVFFTSNSSTLALLSHFQSLRDEGGIDITELTSSTAELA 185

Query: 185 LPSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNGS 244
           +PSFINP P   +P    DK+      +N  R++ + KGIL+NTF+E+E   + ++ +G 
Sbjct: 186 VPSFINPYPVAVLPGSFLDKESTKSTLNNVGRYK-QTKGILVNTFLELESHALHYLDSGV 245

Query: 245 SKIPKVYTVGPILQLKSIGVTQSNNALNGADILKWLDDQPPASVVFLCFGSKGSFDEDQV 304
            KIP VY VGP+L LKS      ++   G+DIL+WLDDQPP SVVFLCFGS GSF + QV
Sbjct: 246 -KIPPVYPVGPLLNLKS------SHEDKGSDILRWLDDQPPLSVVFLCFGSMGSFGDAQV 305

Query: 305 LEIARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAPQ 364
            EIA  LE S  RFLWSLRQPP KGK   PS+YA++  VLPEGFL+RTA +GRVIGWAPQ
Sbjct: 306 KEIACTLEHSGHRFLWSLRQPPSKGKRALPSDYADLKTVLPEGFLDRTATVGRVIGWAPQ 365

Query: 365 IEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLD 424
             IL HPA GGF+SHCGWNSTLES+W+GVP+A WP+YAEQ  NAF++VVELGLAVE+ +D
Sbjct: 366 AAILGHPAIGGFVSHCGWNSTLESIWNGVPIAAWPMYAEQNMNAFQLVVELGLAVEIKMD 425

Query: 425 YVKDFHIGRSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKSMMEGGSSFNSLRH 479
           Y KD  +    +VSAE+IE GIR++M +  +++RK++K   E+S+K++++GGSS++SL  
Sbjct: 426 YRKDSDV----VVSAEDIERGIRQVM-ELDSDVRKRVKEMSEKSKKALVDGGSSYSSLGR 469
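
The percentages in an HSP summary line follow directly from the match counts and the alignment length. The sketch below recomputes them for the HSP above; the `hsp_percentages` function is a hypothetical helper, and the pattern is written loosely so that it accepts any spelling of the "Positives" label.

```python
import re

# Matches e.g. "Identity = 252/483 (52.17%), Positives = 333/483 (68.94%)".
HSP_STATS = re.compile(r"Identity\s*=\s*(\d+)/(\d+).*?Pos\w*\s*=\s*(\d+)/(\d+)")

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

line = "Identity = 252/483 (52.17%), Positives = 333/483 (68.94%), Query Frame = 1"
print(hsp_percentages(line))
```

Here 252 identical positions out of 483 aligned columns give 52.17%, and 333 conservative matches out of 483 give 68.94%, matching the reported values.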

BLAST of Csa4G618520 vs. Swiss-Prot
Match: UFOG3_FRAAN (Putative UDP-glucose flavonoid 3-O-glucosyltransferase 3 OS=Fragaria ananassa GN=GT3 PE=2 SV=1)

HSP 1 Score: 475.7 bits (1223), Expect = 6.1e-133
Identity = 254/478 (53.14%), Positives = 325/478 (67.99%), Query Frame = 1

Query: 5   ELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKTTTERIQSL--SASFE 64
           ELV IP PGIGHL ST+E+A +LVSRDD+L +TVL +  P   K T   +QSL  S+S  
Sbjct: 6   ELVLIPSPGIGHLVSTLEIAKLLVSRDDKLFITVLIMHFPAVSKGTDAYVQSLADSSSPI 65

Query: 65  GKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIHD-SNRLVGFVIDMF 124
            + I FI LP     +        ++  F+ES +PHV++ V NL    + RL GFV+DMF
Sbjct: 66  SQRINFINLPHTNMDHTEGSVRNSLV-GFVESQQPHVKDAVANLRDSKTTRLAGFVVDMF 125

Query: 125 CTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELALPSFIN 184
           CT+MINVAN+  VP Y+F+TS A  L   FHLQEL +Q N    + ++S+ EL +PSF N
Sbjct: 126 CTTMINVANQLGVPSYVFFTSGAATLGLLFHLQELRDQYNKDCTEFKDSDAELIIPSFFN 185

Query: 185 PIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNGSSKIPKV 244
           P+P K +P  +  KD A  F +  KRFR E KGIL+NTF ++E   +  +S+  ++IP V
Sbjct: 186 PLPAKVLPGRMLVKDSAEPFLNVIKRFR-ETKGILVNTFTDLESHALHALSS-DAEIPPV 245

Query: 245 YTVGPILQLKSI-GVTQSNNALNGADILKWLDDQPPASVVFLCFGSKGSFDEDQVLEIAR 304
           Y VGP+L L S      S+      DILKWLDDQPP SVVFLCFGS GSFDE QV EIA 
Sbjct: 246 YPVGPLLNLNSNESRVDSDEVKKKNDILKWLDDQPPLSVVFLCFGSMGSFDESQVREIAN 305

Query: 305 ALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAPQIEILS 364
           ALE +  RFLWSLR+ PP GK   PS+Y +   VLPEGFL+RT  IG+VIGWAPQ+ +L+
Sbjct: 306 ALEHAGHRFLWSLRRSPPTGKVAFPSDYDDHTGVLPEGFLDRTGGIGKVIGWAPQVAVLA 365

Query: 365 HPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLDYVKDF 424
           HP+ GGF+SHCGWNSTLES+WHGVP+ATWPLYAEQQ NAF+ V EL LAVE+ + Y    
Sbjct: 366 HPSVGGFVSHCGWNSTLESLWHGVPVATWPLYAEQQLNAFQPVKELELAVEIDMSYRSKS 425

Query: 425 HIGRSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKSMMEGGSSFNSLRHFID 479
            +    +VSA+EIE GIR++M    ++IRK++K   E+ +K++M+GGSS+ SL HFID
Sbjct: 426 PV----LVSAKEIERGIREVMELDSSDIRKRVKEMSEKGKKALMDGGSSYTSLGHFID 476

BLAST of Csa4G618520 vs. Swiss-Prot
Match: U7A15_MALDO (UDP-glycosyltransferase 71A15 OS=Malus domestica GN=UGT71A15 PE=1 SV=1)

HSP 1 Score: 470.3 bits (1209), Expect = 2.5e-131
Identity = 249/483 (51.55%), Positives = 331/483 (68.53%), Query Frame = 1

Query: 5   ELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKTTTERIQSLSASFEGK 64
           +LVF+P PGIGH+ STVE+A  L +RDD+L +TVL +KLP   +  T    S+S      
Sbjct: 6   QLVFVPAPGIGHIVSTVEMAKQLAARDDQLFITVLVMKLPY-AQPFTNTDSSIS-----H 65

Query: 65  SIRFIVLPELPFPNQSSEPPP-LMLQAFLESHKPHVREIVTNLIHDSN--------RLVG 124
            I F+ LPE     Q   P P    + F+E+HK HVR+ V N++ +S+        RL G
Sbjct: 66  RINFVNLPEAQPDKQDIVPNPGSFFRMFVENHKSHVRDAVINVLPESDQSESTSKPRLAG 125

Query: 125 FVIDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELA 184
           FV+DMF  S+I+VANEFKVP YLF+TSNA  L    H Q L ++      +L +S  ELA
Sbjct: 126 FVLDMFSASLIDVANEFKVPSYLFFTSNASALALMSHFQSLRDEGGIDITELTSSTAELA 185

Query: 185 LPSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNGS 244
           +PSFINP P   +P  L D +      ++  +++ + KGIL+NTF+E+E   + ++ +G 
Sbjct: 186 VPSFINPYPAAVLPGSLLDMESTKSTLNHVSKYK-QTKGILVNTFMELESHALHYLDSGD 245

Query: 245 SKIPKVYTVGPILQLKSIGVTQSNNALNGADILKWLDDQPPASVVFLCFGSKGSFDEDQV 304
            KIP VY VGP+L LKS      ++    +DIL+WLDDQPP SVVFLCFGS GSF E QV
Sbjct: 246 -KIPPVYPVGPLLNLKS------SDEDKASDILRWLDDQPPFSVVFLCFGSMGSFGEAQV 305

Query: 305 LEIARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAPQ 364
            EIA ALE S  RFLWSLR+PPP+GK   PS+Y ++  VLPEGFL+RTA +G+VIGWAPQ
Sbjct: 306 KEIACALEHSGHRFLWSLRRPPPQGKRAMPSDYEDLKTVLPEGFLDRTATVGKVIGWAPQ 365

Query: 365 IEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLD 424
             IL HPATGGF+SHCGWNSTLES+W+GVP+A WPLYAEQ  NAF++VVELGLAVE+ +D
Sbjct: 366 AAILGHPATGGFVSHCGWNSTLESLWNGVPIAAWPLYAEQNLNAFQLVVELGLAVEIKMD 425

Query: 425 YVKDFHIGRSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKSMMEGGSSFNSLRH 479
           Y +D  +    +VSAE+IE GIR++M +  +++RK++K   E+S+K++++GGSS++SL  
Sbjct: 426 YRRDSDV----VVSAEDIERGIRRVM-ELDSDVRKRVKEMSEKSKKALVDGGSSYSSLGR 469

BLAST of Csa4G618520 vs. Swiss-Prot
Match: UFOG6_FRAAN (UDP-glucose flavonoid 3-O-glucosyltransferase 6 OS=Fragaria ananassa GN=GT6 PE=1 SV=1)

HSP 1 Score: 465.7 bits (1197), Expect = 6.3e-130
Identity = 246/486 (50.62%), Positives = 334/486 (68.72%), Query Frame = 1

Query: 5   ELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKTTTERIQSLSA--SFE 64
           EL+FIP PGIGH+ STVE+A +L+ RDD L +T+L +K P     +   I+SL+   S +
Sbjct: 6   ELIFIPIPGIGHIVSTVEIAKLLLCRDDNLFITILIMKFPFTADGSDVYIKSLAVDPSLK 65

Query: 65  GKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIH---DSNRLVGFVID 124
            + IRF+ LP+  F    +         F++SHK HV++ VT L+    ++ R+ GFVID
Sbjct: 66  TQRIRFVNLPQEHFQGTGATG----FFTFIDSHKSHVKDAVTRLMETKSETTRIAGFVID 125

Query: 125 MFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELALPSF 184
           MFCT MI++ANEF +P Y+FYTS A  L   FHLQ L ++ N    + ++S+ EL + SF
Sbjct: 126 MFCTGMIDLANEFGLPSYVFYTSGAADLGLMFHLQALRDEENKDCTEFKDSDAELVVSSF 185

Query: 185 INPIPN-KAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNGSSKI 244
           +NP+P  + +P  +F+K+   +F +  KR+R E KGIL+NTF+E+EP  ++ +S+    +
Sbjct: 186 VNPLPAARVLPSVVFEKEGGNFFLNFAKRYR-ETKGILVNTFLELEPHAIQSLSSDGKIL 245

Query: 245 PKVYTVGPILQLKSIG-VTQSNNALNGADILKWLDDQPPASVVFLCFGSKGSFDEDQVLE 304
           P VY VGPIL +KS G    S  +   +DIL+WLDDQPP+SVVFLCFGS G F EDQV E
Sbjct: 246 P-VYPVGPILNVKSEGNQVSSEKSKQKSDILEWLDDQPPSSVVFLCFGSMGCFGEDQVKE 305

Query: 305 IARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAPQIE 364
           IA ALE+  +RFLWSLRQP  K K   PS+Y +   VLPEGFL+RT D+G+VIGWAPQ+ 
Sbjct: 306 IAHALEQGGIRFLWSLRQPS-KEKIGFPSDYTDYKAVLPEGFLDRTTDLGKVIGWAPQLA 365

Query: 365 ILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLDYV 424
           IL+HPA GGF+SHCGWNSTLES+W+GVP+ATWP YAEQQ NAFE+V EL LAVE+ + Y 
Sbjct: 366 ILAHPAVGGFVSHCGWNSTLESIWYGVPIATWPFYAEQQVNAFELVKELKLAVEIDMGYR 425

Query: 425 KDFHIGRSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKSMMEGGSSFNSLRHFI 484
           KD  +    IVS E IE GI+++M +  +E+RK++K   + SRK++ E GSS++SL  F+
Sbjct: 426 KDSGV----IVSRENIEKGIKEVM-EQESELRKRVKEMSQMSRKALEEDGSSYSSLGRFL 479

BLAST of Csa4G618520 vs. Swiss-Prot
Match: U71E1_STERE (UDP-glycosyltransferase 71E1 OS=Stevia rebaudiana GN=UGT71E1 PE=2 SV=1)

HSP 1 Score: 443.7 bits (1140), Expect = 2.6e-123
Identity = 242/482 (50.21%), Positives = 318/482 (65.98%), Query Frame = 1

Query: 1   MNKFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKTTTERIQSLSAS 60
           M+  ELVFIP PG GHL  TVELA +L+ RD RLSVT++ + L    K  TE    +   
Sbjct: 1   MSTSELVFIPSPGAGHLPPTVELAKLLLHRDQRLSVTIIVMNLWLGPKHNTEARPCVP-- 60

Query: 61  FEGKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIH-DSNRLVGFVID 120
               S+RF+ +P       +   P   + AF+E HKP VR+IV  +I  DS RL GFV+D
Sbjct: 61  ----SLRFVDIP-CDESTMALISPNTFISAFVEHHKPRVRDIVRGIIESDSVRLAGFVLD 120

Query: 121 MFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELALPSF 180
           MFC  M +VANEF VP Y ++TS A  L   FHLQ   +     A +L+NS+ EL++PS+
Sbjct: 121 MFCMPMSDVANEFGVPSYNYFTSGAATLGLMFHLQWKRDHEGYDATELKNSDTELSVPSY 180

Query: 181 INPIPNKAIPPFLFDKDMAA-WFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNGSSKI 240
           +NP+P K +P  + DK+  +  F D  +R R E KGI++N+   +E   ++++S+ ++ I
Sbjct: 181 VNPVPAKVLPEVVLDKEGGSKMFLDLAERIR-ESKGIIVNSCQAIERHALEYLSSNNNGI 240

Query: 241 PKVYTVGPILQLKSIGVTQSNNALNGADILKWLDDQPPASVVFLCFGSKGSFDEDQVLEI 300
           P V+ VGPIL L++       +     +I++WL++QP +SVVFLCFGS GSF+E QV EI
Sbjct: 241 PPVFPVGPILNLEN-----KKDDAKTDEIMRWLNEQPESSVVFLCFGSMGSFNEKQVKEI 300

Query: 301 ARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAPQIEI 360
           A A+ERS  RFLWSLR+P PK K E P  Y N+ +VLPEGFL RT+ IG+VIGWAPQ+ +
Sbjct: 301 AVAIERSGHRFLWSLRRPTPKEKIEFPKEYENLEEVLPEGFLKRTSSIGKVIGWAPQMAV 360

Query: 361 LSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLDYVK 420
           LSHP+ GGF+SHCGWNSTLES+W GVPMA WPLYAEQ  NAF +VVELGLA E+ +DY  
Sbjct: 361 LSHPSVGGFVSHCGWNSTLESMWCGVPMAAWPLYAEQTLNAFLLVVELGLAAEIRMDYRT 420

Query: 421 DFHIG--RSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKSMMEGGSSFNSLRHF 479
           D   G      V+ EEIE GIRKLM D   EIR K+K   E+SR +++EGGSS+ S+  F
Sbjct: 421 DTKAGYDGGMEVTVEEIEDGIRKLMSD--GEIRNKVKDVKEKSRAAVVEGGSSYASIGKF 467

BLAST of Csa4G618520 vs. TrEMBL
Match: A0A0A0L321_CUCSA (Glycosyltransferase OS=Cucumis sativus GN=Csa_4G618520 PE=3 SV=1)

HSP 1 Score: 985.7 bits (2547), Expect = 2.0e-284
Identity = 489/489 (100.00%), Positives = 489/489 (100.00%), Query Frame = 1

Query: 1   MNKFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKTTTERIQSLSAS 60
           MNKFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKTTTERIQSLSAS
Sbjct: 1   MNKFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKTTTERIQSLSAS 60

Query: 61  FEGKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIHDSNRLVGFVIDM 120
           FEGKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIHDSNRLVGFVIDM
Sbjct: 61  FEGKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIHDSNRLVGFVIDM 120

Query: 121 FCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELALPSFI 180
           FCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELALPSFI
Sbjct: 121 FCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELALPSFI 180

Query: 181 NPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNGSSKIPK 240
           NPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNGSSKIPK
Sbjct: 181 NPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNGSSKIPK 240

Query: 241 VYTVGPILQLKSIGVTQSNNALNGADILKWLDDQPPASVVFLCFGSKGSFDEDQVLEIAR 300
           VYTVGPILQLKSIGVTQSNNALNGADILKWLDDQPPASVVFLCFGSKGSFDEDQVLEIAR
Sbjct: 241 VYTVGPILQLKSIGVTQSNNALNGADILKWLDDQPPASVVFLCFGSKGSFDEDQVLEIAR 300

Query: 301 ALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAPQIEILS 360
           ALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAPQIEILS
Sbjct: 301 ALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAPQIEILS 360

Query: 361 HPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLDYVKDF 420
           HPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLDYVKDF
Sbjct: 361 HPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLDYVKDF 420

Query: 421 HIGRSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKSMMEGGSSFNSLRHFIDDA 480
           HIGRSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKSMMEGGSSFNSLRHFIDDA
Sbjct: 421 HIGRSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKSMMEGGSSFNSLRHFIDDA 480

Query: 481 LTNLQEGNY 490
           LTNLQEGNY
Sbjct: 481 LTNLQEGNY 489

BLAST of Csa4G618520 vs. TrEMBL
Match: K7NBW4_SIRGR (Glycosyltransferase OS=Siraitia grosvenorii GN=UDPG7 PE=2 SV=1)

HSP 1 Score: 641.3 bits (1653), Expect = 9.3e-181
Identity = 332/495 (67.07%), Positives = 388/495 (78.38%), Query Frame = 1

Query: 1   MNKFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKTTTERIQSLSAS 60
           M KFELVFIP P +GHLA+ VE+AN+LV+RD RL+VT+L IKLP   KT  E IQSLSAS
Sbjct: 1   MKKFELVFIPLPVMGHLAAMVEMANILVTRDQRLTVTILVIKLPLYGKTA-EYIQSLSAS 60

Query: 61  FEGKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIH-----DSNRLVG 120
           F  +S+RFI+LPE+  P +S +    ML+AFLES+KP +RE + +L       DS RL G
Sbjct: 61  FASESMRFIILPEVLLPEESEKE--FMLKAFLESYKPIIREAIIDLTDSQMGPDSPRLAG 120

Query: 121 FVIDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNS--TAEQLQNSNVE 180
           FV+DMFCT+MI+VANEF VP Y+F TSNAGFL  SFHLQELY++NNS    +QLQNSN E
Sbjct: 121 FVLDMFCTTMIDVANEFGVPSYVFCTSNAGFLALSFHLQELYDENNSKEVVKQLQNSNAE 180

Query: 181 LALPSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMS- 240
           +ALPSF+NPIP K IP    + D A+WFHD  +R+RS VKGILINTF ++E  ++  MS 
Sbjct: 181 IALPSFVNPIPGKMIPDIFSNDDTASWFHDQVERYRSGVKGILINTFAKLESHVMNSMSR 240

Query: 241 NGSSKIPKVYTVGPILQLKSIGVTQSNNALNGADILKWLDDQPPASVVFLCFGSKGSFDE 300
           + SS+ P +Y++GPIL LK+         L+  DILKWLD+QPP SVVFLCFGS GSFDE
Sbjct: 241 SSSSRAPPLYSIGPILHLKNNNTVGPGGTLHCTDILKWLDNQPPVSVVFLCFGSMGSFDE 300

Query: 301 DQVLEIARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGW 360
           DQV EIA ALERS VRFLWSLRQPPPK KFE PS Y +I  VLPEGFL RTA IGRVIGW
Sbjct: 301 DQVKEIAHALERSGVRFLWSLRQPPPKDKFEAPSEYTDIKYVLPEGFLERTAGIGRVIGW 360

Query: 361 APQIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVEL 420
           APQ+EIL+HPATGGF+SHCGWNSTLES+WHGVPMATWPLYAEQQF AFEMVVELGLAV++
Sbjct: 361 APQVEILAHPATGGFVSHCGWNSTLESMWHGVPMATWPLYAEQQFTAFEMVVELGLAVDI 420

Query: 421 TLDYVKDFHIGRSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKSMMEGGSSFNS 480
           TLDY K  H  RSR+VSAEEI+SGIRKLM + G E+RKK+K K EESRKS+MEGGSSF S
Sbjct: 421 TLDYQKHPHGERSRVVSAEEIQSGIRKLM-EEGGEMRKKVKAKSEESRKSLMEGGSSFIS 480

Query: 481 LRHFIDDALTNLQEG 488
           L  FIDD L N  EG
Sbjct: 481 LGRFIDDVLGNGPEG 491

BLAST of Csa4G618520 vs. TrEMBL
Match: A0A0A0L1T2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G618540 PE=4 SV=1)

HSP 1 Score: 565.8 bits (1457), Expect = 5.0e-158
Identity = 293/454 (64.54%), Positives = 348/454 (76.65%), Query Frame = 1

Query: 4   FELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKTTTERIQSLSASFEG 63
           FEL+FIP PGIGHLASTVE+ANVLV+RD RLSVT+LA+KLP D+K   E I+SLS SF G
Sbjct: 2   FELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVA-ECIESLSTSFAG 61

Query: 64  KSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIH------DSNRLVGFV 123
           K+I+F VLPE P P +S +         +ES+KP+VRE+V+NL        DS RLVG V
Sbjct: 62  KNIQFNVLPEPPLPEESKKD----FIVLVESYKPYVREVVSNLTASAATSIDSPRLVGLV 121

Query: 124 IDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQN--NSTAEQLQNS-NVEL 183
           IDMFCT+MI+V NEF VPCY+FYT +A FL FS +LQELY +N  N   EQL NS NVEL
Sbjct: 122 IDMFCTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVEL 181

Query: 184 ALPSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNG 243
            LP+F+NPIP+K IP    +KD A WFH++ KRFR E+KGILINTF EME  + K   + 
Sbjct: 182 TLPNFVNPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAK---SY 241

Query: 244 SSKIPKVYTVGPILQLKSIGVTQSNNALNGADIL-KWLDDQPPASVVFLCFGSKGSFDED 303
           S  +P +Y VGP+L LK+ GV  S+ A N ADI+ KWLDDQPP+SVV +CFG+  SFDE 
Sbjct: 242 SQVLPPLYFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEA 301

Query: 304 QVLEIARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWA 363
           QV EIA ALE S VRF+WSLRQPPPKGKFE P NY +I + LPEGFL+RT  IGRVIGW 
Sbjct: 302 QVAEIANALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWT 361

Query: 364 PQIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELT 423
            Q+EIL+HPA GGFISHCGWNS LESVWHGV +ATWP++AEQQFNAFEMVVELGLAVE+T
Sbjct: 362 SQVEILAHPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVT 421

Query: 424 LDYVKDFHIGRSRIVSAEEIESGIRKLMGDSGNE 448
           LDY   F   + R+VSAEEI+SGI+KLMG+  NE
Sbjct: 422 LDYRITFGEDKPRLVSAEEIKSGIKKLMGEESNE 447

BLAST of Csa4G618520 vs. TrEMBL
Match: A0A0A0KZV7_CUCSA (Glycosyltransferase OS=Cucumis sativus GN=Csa_4G618530 PE=3 SV=1)

HSP 1 Score: 553.9 bits (1426), Expect = 2.0e-154
Identity = 276/445 (62.02%), Positives = 344/445 (77.30%), Query Frame = 1

Query: 52  ERIQSLSASFEGKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIH--- 111
           + IQ LSASF GKSI  I+LPELP P +     P +L   +E +KPHVRE + N ++   
Sbjct: 2   DHIQQLSASFVGKSIHLILLPELPLPQECQNGMPQLL---IEIYKPHVREAMANQVNSQT 61

Query: 112 --DSNRLVGFVIDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNST--A 171
             D  +LVGFV+DMFC +M++VA EFKVPCYLFYTS+A FL  +FHLQELY+QNNS    
Sbjct: 62  SPDFPQLVGFVLDMFCMTMVDVAKEFKVPCYLFYTSSAAFLALNFHLQELYDQNNSNRVV 121

Query: 172 EQLQNSNVE-LALPSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEM 231
           EQL+NS  E L +PSF+NPIP K IP      DMA W ++NT++FRSE+KGILINT  E+
Sbjct: 122 EQLKNSESESLTIPSFVNPIPGKVIPSIFVYNDMAVWLYENTRKFRSEIKGILINTCAEI 181

Query: 232 EPQIVKWMSNG-SSKIPKVYTVGPILQLKSIGVTQSNNALNGADILKWLDDQPPASVVFL 291
           E  +V  MS+G SS++P +Y VGPIL L+        N +N  +ILKWLDDQP ASV+FL
Sbjct: 182 ESHVVNMMSSGPSSQVPSLYCVGPILNLE--------NTVNRVNILKWLDDQPQASVIFL 241

Query: 292 CFGSKGSFDEDQVLEIARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNR 351
           CFGS GSFDE+QV EIA+ LERS V FLWSLRQPPPKGK+  PS+YA+I DVLPE FL+ 
Sbjct: 242 CFGSMGSFDEEQVKEIAQGLERSGVHFLWSLRQPPPKGKWVAPSDYADIKDVLPERFLDP 301

Query: 352 TADIGRVIGWAPQIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEM 411
           TA++G++IGWAPQ+EIL+HP+ GGF+SHCGWNSTLES+W+GVPM  WP+YAEQQ NAF+M
Sbjct: 302 TANVGKIIGWAPQVEILAHPSIGGFVSHCGWNSTLESLWYGVPMVAWPMYAEQQLNAFQM 361

Query: 412 VVELGLAVELTLDYVKDFHIGRSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKS 471
           VVELGLAVE+TLDY KD+ + RS++V+AEEIESGIRK+M D G+EIRK++K + EE RK+
Sbjct: 362 VVELGLAVEITLDYQKDYRLERSKLVTAEEIESGIRKVM-DDGDEIRKQVKAESEEVRKA 421

Query: 472 MMEGGSSFNSLRHFIDDALTNLQEG 488
           +MEGGSS+ SL HFI+D L N   G
Sbjct: 422 VMEGGSSYISLVHFINDVLVNSSNG 434

BLAST of Csa4G618520 vs. TrEMBL
Match: A0A0A0L341_CUCSA (Glycosyltransferase OS=Cucumis sativus GN=Csa_4G620550 PE=3 SV=1)

HSP 1 Score: 552.0 bits (1421), Expect = 7.4e-154
Identity = 284/495 (57.37%), Positives = 366/495 (73.94%), Query Frame = 1

Query: 1   MNKFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKTTTERIQSLSAS 60
           M KFELVFIP PG GHLAS VE+AN L++RD RL+VT++A KLP D K   E IQSLSA 
Sbjct: 1   MKKFELVFIPIPGSGHLASMVEMANTLLARDHRLAVTMIAFKLPLDPKAN-EYIQSLSAQ 60

Query: 61  F--EGKSIRFIVLPELP-FPNQSSEPPPLMLQAFLESHKPHVRE-IVTNLIHDSNRLVGF 120
                 SI+FIVLPELP  PN  +      L+  LES+KPHV++ +++ L   +N L GF
Sbjct: 61  SLTNNNSIQFIVLPELPDIPNNGNR---FFLEVVLESYKPHVKQALISFLTTSTNHLAGF 120

Query: 121 VIDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAE---QLQNSNVE 180
           V+D FC++M++VANEFKVP Y++YTS A +L FS HL++LY Q+NS+ E   QL++S+V 
Sbjct: 121 VLDSFCSTMVDVANEFKVPSYVYYTSCAAYLAFSLHLEQLYTQDNSSNEVIQQLKDSDVN 180

Query: 181 LALPSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSN 240
           L++PS +N +P+K IP   F  + A WFH+  KR R +VKG+LINTF E+E   +  +S 
Sbjct: 181 LSVPSLVNQVPSKTIPSVFFINNFAVWFHEQAKRIRFDVKGVLINTFEELESHALSSLST 240

Query: 241 GSS-KIPKVYTVGPILQLKSIGVTQSNNALNGADILKWLDDQPPASVVFLCFGSKGSFDE 300
            SS ++P +Y+VGP+L L      ++   ++  D+LKWLDDQP +SVVFLCFGS+G+F +
Sbjct: 241 DSSLQLPPLYSVGPVLHLN-----KNTETMDDGDVLKWLDDQPLSSVVFLCFGSRGAFKK 300

Query: 301 DQVLEIARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGW 360
           DQV EIARALERS VRF+WSLR+P     F+   +Y N  D+LP+GFL+RT +IGRVI W
Sbjct: 301 DQVEEIARALERSRVRFIWSLRRPG--NVFQSSIDYTNFEDILPKGFLDRTQNIGRVISW 360

Query: 361 APQIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVEL 420
           APQ+EIL HPATGGF+SHCGWNSTLES+WHGVPMATWP+YAEQQFNAF++VVELGLAVE+
Sbjct: 361 APQVEILGHPATGGFVSHCGWNSTLESLWHGVPMATWPMYAEQQFNAFDLVVELGLAVEI 420

Query: 421 TLDYVKDFHIGRSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKSMMEGGSSFNS 480
            + Y  +     + I+ AEEIE GIRKLM ++ NEIRKK+K K EE RKS++EGGSSF S
Sbjct: 421 KISYCIELKEQANPIIMAEEIERGIRKLMDNNKNEIRKKVKTKSEECRKSVIEGGSSFIS 480

Query: 481 LRHFIDDALTNLQEG 488
           L  FIDD L+N   G
Sbjct: 481 LGKFIDDVLSNSTTG 484

BLAST of Csa4G618520 vs. TAIR10
Match: AT3G21760.1 (AT3G21760.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 438.7 bits (1127), Expect = 4.6e-123
Identity = 237/491 (48.27%), Positives = 312/491 (63.54%), Query Frame = 1

Query: 3   KFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKTTTER--IQSLSAS 62
           K ELVFIP PG GHL   VE+A + V RDD LS+T++ I   +   ++     I SLS+ 
Sbjct: 2   KLELVFIPSPGDGHLRPLVEVAKLHVDRDDHLSITIIIIPQMHGFSSSNSSSYIASLSSD 61

Query: 63  FEGK-SIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIHDS-----NRLV 122
            E + S   + +P+ P    S +  P     ++++ KP V+  V  L         +RL 
Sbjct: 62  SEERLSYNVLSVPDKP---DSDDTKPHFFD-YIDNFKPQVKATVEKLTDPGPPDSPSRLA 121

Query: 123 GFVIDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNV-E 182
           GFV+DMFC  MI+VANEF VP Y+FYTSNA FL    H++ LY+  N     L++S+  E
Sbjct: 122 GFVVDMFCMMMIDVANEFGVPSYMFYTSNATFLGLQVHVEYLYDVKNYDVSDLKDSDTTE 181

Query: 183 LALPSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSN 242
           L +P    P+P K  P  L  K+        T+RFR E KGIL+NTF E+EPQ +K+ S 
Sbjct: 182 LEVPCLTRPLPVKCFPSVLLTKEWLPVMFRQTRRFR-ETKGILVNTFAELEPQAMKFFSG 241

Query: 243 GSSKIPKVYTVGPILQLKSIGVTQSNNALNGADILKWLDDQPPASVVFLCFGSKGSFDED 302
             S +P VYTVGP++ LK  G   S++    ++IL+WLD+QP  SVVFLCFGS G F E 
Sbjct: 242 VDSPLPTVYTVGPVMNLKINGPNSSDD--KQSEILRWLDEQPRKSVVFLCFGSMGGFREG 301

Query: 303 QVLEIARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWA 362
           Q  EIA ALERS  RF+WSLR+  PKG    P  + N+ ++LPEGFL RTA+IG+++GWA
Sbjct: 302 QAKEIAIALERSGHRFVWSLRRAQPKGSIGPPEEFTNLEEILPEGFLERTAEIGKIVGWA 361

Query: 363 PQIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELT 422
           PQ  IL++PA GGF+SHCGWNSTLES+W GVPMATWPLYAEQQ NAFEMV ELGLAVE+ 
Sbjct: 362 PQSAILANPAIGGFVSHCGWNSTLESLWFGVPMATWPLYAEQQVNAFEMVEELGLAVEVR 421

Query: 423 LDYVKDFHIGRSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKSMMEGGSSFNSL 482
             +  DF      +++AEEIE GIR LM +  +++R ++K   E+S  ++M+GGSS  +L
Sbjct: 422 NSFRGDFMAADDELMTAEEIERGIRCLM-EQDSDVRSRVKEMSEKSHVALMDGGSSHVAL 481

Query: 483 RHFIDDALTNL 485
             FI D   N+
Sbjct: 482 LKFIQDVTKNI 484

BLAST of Csa4G618520 vs. TAIR10
Match: AT3G21780.1 (AT3G21780.1 UDP-glucosyl transferase 71B6)

HSP 1 Score: 433.0 bits (1112), Expect = 2.5e-121
Identity = 236/490 (48.16%), Positives = 320/490 (65.31%), Query Frame = 1

Query: 3   KFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKTTTERIQSLSASFE 62
           K ELVFIP P I HL +TVE+A  LV ++D LS+TV+ I   +     T  I SL+++  
Sbjct: 2   KIELVFIPSPAISHLMATVEMAEQLVDKNDNLSITVIIISFSSK---NTSMITSLTSN-- 61

Query: 63  GKSIRFIVLPELPFPNQSSEPPPLMLQA---FLESHKPHVREIVTNLIH----DSNRLVG 122
              +R+ ++          +  P  L+A    ++S KP VR+ V  L+     D+ RL G
Sbjct: 62  -NRLRYEII-------SGGDQQPTELKATDSHIQSLKPLVRDAVAKLVDSTLPDAPRLAG 121

Query: 123 FVIDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNS-TAEQLQNSNVEL 182
           FV+DM+CTSMI+VANEF VP YLFYTSNAGFL    H+Q +Y+  +     +L++S+VEL
Sbjct: 122 FVVDMYCTSMIDVANEFGVPSYLFYTSNAGFLGLLLHIQFMYDAEDIYDMSELEDSDVEL 181

Query: 183 ALPSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNG 242
            +PS  +P P K +P     K+   +F    +RFR E KGIL+NT  ++EPQ + ++SNG
Sbjct: 182 VVPSLTSPYPLKCLPYIFKSKEWLTFFVTQARRFR-ETKGILVNTVPDLEPQALTFLSNG 241

Query: 243 SSKIPKVYTVGPILQLKSIGVTQSNNALNGADILKWLDDQPPASVVFLCFGSKGSFDEDQ 302
           +  IP+ Y VGP+L LK++     +     ++IL+WLD+QPP SVVFLCFGS G F E+Q
Sbjct: 242 N--IPRAYPVGPLLHLKNVNCDYVDK--KQSEILRWLDEQPPRSVVFLCFGSMGGFSEEQ 301

Query: 303 VLEIARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAP 362
           V E A AL+RS  RFLWSLR+  P    E P  + N+ ++LPEGF +RTA+ G+VIGWA 
Sbjct: 302 VRETALALDRSGHRFLWSLRRASPNILREPPGEFTNLEEILPEGFFDRTANRGKVIGWAE 361

Query: 363 QIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTL 422
           Q+ IL+ PA GGF+SH GWNSTLES+W GVPMA WPLYAEQ+FNAFEMV ELGLAVE+  
Sbjct: 362 QVAILAKPAIGGFVSHGGWNSTLESLWFGVPMAIWPLYAEQKFNAFEMVEELGLAVEIKK 421

Query: 423 DYVKDFHIGRSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKSMMEGGSSFNSLR 482
            +  D  +GRS IV+AEEIE GI  LM +  +++RK++    E+   ++M+GGSS  +L+
Sbjct: 422 HWRGDLLLGRSEIVTAEEIEKGIICLM-EQDSDVRKRVNEISEKCHVALMDGGSSETALK 472

Query: 483 HFIDDALTNL 485
            FI D   N+
Sbjct: 482 RFIQDVTENI 472

BLAST of Csa4G618520 vs. TAIR10
Match: AT4G15280.1 (AT4G15280.1 UDP-glucosyl transferase 71B5)

HSP 1 Score: 422.9 bits (1086), Expect = 2.6e-118
Identity = 223/483 (46.17%), Positives = 301/483 (62.32%), Query Frame = 1

Query: 3   KFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKTTTERIQSLSASFE 62
           K ELVFIP PGIGHL  TV+LA  L+  ++RLS+T++ I    D    +  I SL+   +
Sbjct: 2   KIELVFIPLPGIGHLRPTVKLAKQLIGSENRLSITIIIIPSRFDAGDASACIASLTTLSQ 61

Query: 63  GKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIHDSNR-LVGFVIDMF 122
              + +  +     P  +S+P P+  Q ++E  K  VR+ V   I D  R L GFV+DMF
Sbjct: 62  DDRLHYESISVAKQP-PTSDPDPVPAQVYIEKQKTKVRDAVAARIVDPTRKLAGFVVDMF 121

Query: 123 CTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELALPSFIN 182
           C+SMI+VANEF VPCY+ YTSNA FL    H+Q++Y+Q      +L+NS  EL  PS   
Sbjct: 122 CSSMIDVANEFGVPCYMVYTSNATFLGTMLHVQQMYDQKKYDVSELENSVTELEFPSLTR 181

Query: 183 PIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNGSSKIPKV 242
           P P K +P  L  K+         + FR ++KGIL+NT  E+EP  +K  +     +P+V
Sbjct: 182 PYPVKCLPHILTSKEWLPLSLAQARCFR-KMKGILVNTVAELEPHALKMFNINGDDLPQV 241

Query: 243 YTVGPILQLKSIGVTQSNNALNGADILKWLDDQPPASVVFLCFGSKGSFDEDQVLEIARA 302
           Y VGP+L L++     +++    ++IL+WLD+QP  SVVFLCFGS G F E+Q  E A A
Sbjct: 242 YPVGPVLHLEN----GNDDDEKQSEILRWLDEQPSKSVVFLCFGSLGGFTEEQTRETAVA 301

Query: 303 LERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAPQIEILSH 362
           L+RS  RFLW LR   P  K + P +Y N+ +VLPEGFL RT D G+VIGWAPQ+ +L  
Sbjct: 302 LDRSGQRFLWCLRHASPNIKTDRPRDYTNLEEVLPEGFLERTLDRGKVIGWAPQVAVLEK 361

Query: 363 PATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLDYVKDFH 422
           PA GGF++HCGWNS LES+W GVPM TWPLYAEQ+ NAFEMV ELGLAVE+      D  
Sbjct: 362 PAIGGFVTHCGWNSILESLWFGVPMVTWPLYAEQKVNAFEMVEELGLAVEIRKYLKGDLF 421

Query: 423 IGRSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKSMMEGGSSFNSLRHFIDDAL 482
            G    V+AE+IE  IR++M +  +++R  +K   E+   ++M+GGSS  +L  FI D +
Sbjct: 422 AGEMETVTAEDIERAIRRVM-EQDSDVRNNVKEMAEKCHFALMDGGSSKAALEKFIQDVI 477

Query: 483 TNL 485
            N+
Sbjct: 482 ENM 477

BLAST of Csa4G618520 vs. TAIR10
Match: AT3G21790.1 (AT3G21790.1 UDP-Glycosyltransferase superfamily protein)

HSP 1 Score: 407.1 bits (1045), Expect = 1.5e-113
Identity = 226/493 (45.84%), Positives = 318/493 (64.50%), Query Frame = 1

Query: 3   KFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKT-TTERIQSLSASF 62
           KFELVFIP PGIGHL STVE+A +LV R+ RLS++V+ +   ++ +   ++ I +LSAS 
Sbjct: 2   KFELVFIPYPGIGHLRSTVEMAKLLVDRETRLSISVIILPFISEGEVGASDYIAALSASS 61

Query: 63  EGKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIHD------SNRLVG 122
             + +R+ V+  +  P          ++  +++ +P VR  V  L+ D      S ++ G
Sbjct: 62  NNR-LRYEVISAVDQPTIEMTT----IEIHMKNQEPKVRSTVAKLLEDYSSKPDSPKIAG 121

Query: 123 FVIDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELA 182
           FV+DMFCTSM++VANEF  P Y+FYTS+AG L  ++H+Q L ++N     +   ++ E  
Sbjct: 122 FVLDMFCTSMVDVANEFGFPSYMFYTSSAGILSVTYHVQMLCDENKYDVSENDYADSEAV 181

Query: 183 L--PSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSN 242
           L  PS   P P K +P  L        F +  ++FR E+KGIL+NT  E+EP ++K++S 
Sbjct: 182 LNFPSLSRPYPVKCLPHALAANMWLPVFVNQARKFR-EMKGILVNTVAELEPYVLKFLS- 241

Query: 243 GSSKIPKVYTVGPILQLKSIGVTQSNNALNGADILKWLDDQPPASVVFLCFGSKGSFDED 302
            SS  P VY VGP+L L++      +      +I++WLD QPP+SVVFLCFGS G F E+
Sbjct: 242 -SSDTPPVYPVGPLLHLEN--QRDDSKDEKRLEIIRWLDQQPPSSVVFLCFGSMGGFGEE 301

Query: 303 QVLEIARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWA 362
           QV EIA ALERS  RFLWSLR+  P    E P  + N+ +VLPEGF +RT DIG+VIGWA
Sbjct: 302 QVREIAIALERSGHRFLWSLRRASPNIFKELPGEFTNLEEVLPEGFFDRTKDIGKVIGWA 361

Query: 363 PQIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELT 422
           PQ+ +L++PA GGF++HCGWNSTLES+W GVP A WPLYAEQ+FNAF MV ELGLAVE+ 
Sbjct: 362 PQVAVLANPAIGGFVTHCGWNSTLESLWFGVPTAAWPLYAEQKFNAFLMVEELGLAVEIR 421

Query: 423 LDYVKDFHIG--RSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKSMMEGGSSFN 482
             Y +  H+    +  V+AEEIE  I  LM +  +++RK++K   E+   ++M+GGSS  
Sbjct: 422 -KYWRGEHLAGLPTATVTAEEIEKAIMCLM-EQDSDVRKRVKDMSEKCHVALMDGGSSRT 481

Query: 483 SLRHFIDDALTNL 485
           +L+ FI++   N+
Sbjct: 482 ALQKFIEEVAKNI 482

BLAST of Csa4G618520 vs. TAIR10
Match: AT3G21750.1 (AT3G21750.1 UDP-glucosyl transferase 71B1)

HSP 1 Score: 407.1 bits (1045), Expect = 1.5e-113
Identity = 210/491 (42.77%), Positives = 316/491 (64.36%), Query Frame = 1

Query: 3   KFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKTTTERIQSLSASFE 62
           K ELVFIP PG+GH+ +T  LA +LV+ D+RLSVT++ I      + + +   S+  + E
Sbjct: 2   KVELVFIPSPGVGHIRATTALAKLLVASDNRLSVTLIVIPS----RVSDDASSSVYTNSE 61

Query: 63  GKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIHD-----SNRLVGFV 122
            + +R+I+LP     +Q+++     L ++++S KP VR +V+ +  D      +RL G V
Sbjct: 62  DR-LRYILLPAR---DQTTD-----LVSYIDSQKPQVRAVVSKVAGDVSTRSDSRLAGIV 121

Query: 123 IDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELALP 182
           +DMFCTSMI++A+EF +  Y+FYTSNA +L   FH+Q LY++      + +++ ++  +P
Sbjct: 122 VDMFCTSMIDIADEFNLSAYIFYTSNASYLGLQFHVQSLYDEKELDVSEFKDTEMKFDVP 181

Query: 183 SFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMS--NGS 242
           +   P P K +P  + +K    +     + FR+  KGIL+N+  +MEPQ + + S  NG+
Sbjct: 182 TLTQPFPAKCLPSVMLNKKWFPYVLGRARSFRA-TKGILVNSVADMEPQALSFFSGGNGN 241

Query: 243 SKIPKVYTVGPILQLKSIGVTQSNNALNGADILKWLDDQPPASVVFLCFGSKGSFDEDQV 302
           + IP VY VGPI+ L+S G  +        +IL WL +QP  SVVFLCFGS G F E+Q 
Sbjct: 242 TNIPPVYAVGPIMDLESSGDEEKRK-----EILHWLKEQPTKSVVFLCFGSMGGFSEEQA 301

Query: 303 LEIARALERSEVRFLWSLRQPPPKGKFEEP--SNYANINDVLPEGFLNRTADIGRVIGWA 362
            EIA ALERS  RFLWSLR+  P G    P    + N+ ++LP+GFL+RT +IG++I WA
Sbjct: 302 REIAVALERSGHRFLWSLRRASPVGNKSNPPPGEFTNLEEILPKGFLDRTVEIGKIISWA 361

Query: 363 PQIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELT 422
           PQ+++L+ PA G F++HCGWNS LES+W GVPMA WP+YAEQQFNAF MV ELGLA E+ 
Sbjct: 362 PQVDVLNSPAIGAFVTHCGWNSILESLWFGVPMAAWPIYAEQQFNAFHMVDELGLAAEVK 421

Query: 423 LDYVKDFHIGRSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKSMMEGGSSFNSL 482
            +Y +DF +    IV+A+EIE GI+  M +  +++RK++    ++   ++++GGSS  +L
Sbjct: 422 KEYRRDFLVEEPEIVTADEIERGIKCAM-EQDSKMRKRVMEMKDKLHVALVDGGSSNCAL 472

Query: 483 RHFIDDALTNL 485
           + F+ D + N+
Sbjct: 482 KKFVQDVVDNV 472

BLAST of Csa4G618520 vs. NCBI nr
Match: gi|449456653|ref|XP_004146063.1| (PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis sativus])

HSP 1 Score: 985.7 bits (2547), Expect = 2.9e-284
Identity = 489/489 (100.00%), Positives = 489/489 (100.00%), Query Frame = 1

Query: 1   MNKFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKTTTERIQSLSAS 60
           MNKFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKTTTERIQSLSAS
Sbjct: 1   MNKFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKTTTERIQSLSAS 60

Query: 61  FEGKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIHDSNRLVGFVIDM 120
           FEGKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIHDSNRLVGFVIDM
Sbjct: 61  FEGKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIHDSNRLVGFVIDM 120

Query: 121 FCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELALPSFI 180
           FCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELALPSFI
Sbjct: 121 FCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELALPSFI 180

Query: 181 NPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNGSSKIPK 240
           NPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNGSSKIPK
Sbjct: 181 NPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNGSSKIPK 240

Query: 241 VYTVGPILQLKSIGVTQSNNALNGADILKWLDDQPPASVVFLCFGSKGSFDEDQVLEIAR 300
           VYTVGPILQLKSIGVTQSNNALNGADILKWLDDQPPASVVFLCFGSKGSFDEDQVLEIAR
Sbjct: 241 VYTVGPILQLKSIGVTQSNNALNGADILKWLDDQPPASVVFLCFGSKGSFDEDQVLEIAR 300

Query: 301 ALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAPQIEILS 360
           ALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAPQIEILS
Sbjct: 301 ALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAPQIEILS 360

Query: 361 HPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLDYVKDF 420
           HPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLDYVKDF
Sbjct: 361 HPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLDYVKDF 420

Query: 421 HIGRSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKSMMEGGSSFNSLRHFIDDA 480
           HIGRSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKSMMEGGSSFNSLRHFIDDA
Sbjct: 421 HIGRSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKSMMEGGSSFNSLRHFIDDA 480

Query: 481 LTNLQEGNY 490
           LTNLQEGNY
Sbjct: 481 LTNLQEGNY 489

BLAST of Csa4G618520 vs. NCBI nr
Match: gi|659129348|ref|XP_008464641.1| (PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis melo])

HSP 1 Score: 942.6 bits (2435), Expect = 2.8e-271
Identity = 467/489 (95.50%), Positives = 476/489 (97.34%), Query Frame = 1

Query: 1   MNKFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKTTTERIQSLSAS 60
           MNKFELVFIPGPGIGHLASTVELANVL SRDDRLSVTVLAIKLPNDIKTT ERIQSLSAS
Sbjct: 1   MNKFELVFIPGPGIGHLASTVELANVLASRDDRLSVTVLAIKLPNDIKTT-ERIQSLSAS 60

Query: 61  FEGKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIHDSNRLVGFVIDM 120
           FEGKSIRFIVLPELPFPNQSS PPPLMLQAFLESHKPHVREIVTNL +DSNRLVGFVIDM
Sbjct: 61  FEGKSIRFIVLPELPFPNQSSTPPPLMLQAFLESHKPHVREIVTNLTYDSNRLVGFVIDM 120

Query: 121 FCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNSTAEQLQNSNVELALPSFI 180
           FCTSMINVANEFKVPCYLFYTSNAGFL FSFHLQELYNQNNST EQLQNSNVELALPSFI
Sbjct: 121 FCTSMINVANEFKVPCYLFYTSNAGFLAFSFHLQELYNQNNSTGEQLQNSNVELALPSFI 180

Query: 181 NPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNGSSKIPK 240
           NPIP+KAIPPFLFDKDMA WFHDNTKRFRS VKGILINTFVEMEPQ++KWMSNGSSKIPK
Sbjct: 181 NPIPSKAIPPFLFDKDMAVWFHDNTKRFRSGVKGILINTFVEMEPQMIKWMSNGSSKIPK 240

Query: 241 VYTVGPILQLKSIGVTQSNNALNGADILKWLDDQPPASVVFLCFGSKGSFDEDQVLEIAR 300
           VYTVGPILQLKSIGVTQ NNALNGADILKWLDDQPPASVVFLCFGSKGSFDEDQVLEIAR
Sbjct: 241 VYTVGPILQLKSIGVTQCNNALNGADILKWLDDQPPASVVFLCFGSKGSFDEDQVLEIAR 300

Query: 301 ALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWAPQIEILS 360
           ALERSEVRF+WSLRQPPPKGKFEEPSNYA+INDVLPEGFLNRTADIGRVIGWAPQIEILS
Sbjct: 301 ALERSEVRFIWSLRQPPPKGKFEEPSNYADINDVLPEGFLNRTADIGRVIGWAPQIEILS 360

Query: 361 HPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLDYVKDF 420
           HPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLDYVKDF
Sbjct: 361 HPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELTLDYVKDF 420

Query: 421 HIGRSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKSMMEGGSSFNSLRHFIDDA 480
           HIGRSR+VSAEEIESGIRKLMGD GNEIRKK+KVKGEESRKSMM GGSSFNSL HFIDDA
Sbjct: 421 HIGRSRVVSAEEIESGIRKLMGDYGNEIRKKVKVKGEESRKSMMVGGSSFNSLDHFIDDA 480

Query: 481 LTNLQEGNY 490
           L NL+EGNY
Sbjct: 481 LANLEEGNY 488

BLAST of Csa4G618520 vs. NCBI nr
Match: gi|343466221|gb|AEM43004.1| (UDP-glucosyltransferase [Siraitia grosvenorii])

HSP 1 Score: 641.3 bits (1653), Expect = 1.3e-180
Identity = 332/495 (67.07%), Positives = 388/495 (78.38%), Query Frame = 1

Query: 1   MNKFELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKTTTERIQSLSAS 60
           M KFELVFIP P +GHLA+ VE+AN+LV+RD RL+VT+L IKLP   KT  E IQSLSAS
Sbjct: 1   MKKFELVFIPLPVMGHLAAMVEMANILVTRDQRLTVTILVIKLPLYGKTA-EYIQSLSAS 60

Query: 61  FEGKSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIH-----DSNRLVG 120
           F  +S+RFI+LPE+  P +S +    ML+AFLES+KP +RE + +L       DS RL G
Sbjct: 61  FASESMRFIILPEVLLPEESEKE--FMLKAFLESYKPIIREAIIDLTDSQMGPDSPRLAG 120

Query: 121 FVIDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQNNS--TAEQLQNSNVE 180
           FV+DMFCT+MI+VANEF VP Y+F TSNAGFL  SFHLQELY++NNS    +QLQNSN E
Sbjct: 121 FVLDMFCTTMIDVANEFGVPSYVFCTSNAGFLALSFHLQELYDENNSKEVVKQLQNSNAE 180

Query: 181 LALPSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMS- 240
           +ALPSF+NPIP K IP    + D A+WFHD  +R+RS VKGILINTF ++E  ++  MS 
Sbjct: 181 IALPSFVNPIPGKMIPDIFSNDDTASWFHDQVERYRSGVKGILINTFAKLESHVMNSMSR 240

Query: 241 NGSSKIPKVYTVGPILQLKSIGVTQSNNALNGADILKWLDDQPPASVVFLCFGSKGSFDE 300
           + SS+ P +Y++GPIL LK+         L+  DILKWLD+QPP SVVFLCFGS GSFDE
Sbjct: 241 SSSSRAPPLYSIGPILHLKNNNTVGPGGTLHCTDILKWLDNQPPVSVVFLCFGSMGSFDE 300

Query: 301 DQVLEIARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGW 360
           DQV EIA ALERS VRFLWSLRQPPPK KFE PS Y +I  VLPEGFL RTA IGRVIGW
Sbjct: 301 DQVKEIAHALERSGVRFLWSLRQPPPKDKFEAPSEYTDIKYVLPEGFLERTAGIGRVIGW 360

Query: 361 APQIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVEL 420
           APQ+EIL+HPATGGF+SHCGWNSTLES+WHGVPMATWPLYAEQQF AFEMVVELGLAV++
Sbjct: 361 APQVEILAHPATGGFVSHCGWNSTLESMWHGVPMATWPLYAEQQFTAFEMVVELGLAVDI 420

Query: 421 TLDYVKDFHIGRSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKSMMEGGSSFNS 480
           TLDY K  H  RSR+VSAEEI+SGIRKLM + G E+RKK+K K EESRKS+MEGGSSF S
Sbjct: 421 TLDYQKHPHGERSRVVSAEEIQSGIRKLM-EEGGEMRKKVKAKSEESRKSLMEGGSSFIS 480

Query: 481 LRHFIDDALTNLQEG 488
           L  FIDD L N  EG
Sbjct: 481 LGRFIDDVLGNGPEG 491

BLAST of Csa4G618520 vs. NCBI nr
Match: gi|449456657|ref|XP_004146065.1| (PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis sativus])

HSP 1 Score: 616.3 bits (1588), Expect = 4.6e-173
Identity = 320/494 (64.78%), Positives = 378/494 (76.52%), Query Frame = 1

Query: 4   FELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKTTTERIQSLSASFEG 63
           FEL+FIP PGIGHLASTVE+ANVLV+RD RLSVT+LA+KLP D+K   E I+SLS SF G
Sbjct: 2   FELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVA-ECIESLSTSFAG 61

Query: 64  KSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIH------DSNRLVGFV 123
           K+I+F VLPE P P +S +         +ES+KP+VRE+V+NL        DS RLVG V
Sbjct: 62  KNIQFNVLPEPPLPEESKKD----FIVLVESYKPYVREVVSNLTASAATSIDSPRLVGLV 121

Query: 124 IDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQN--NSTAEQLQNS-NVEL 183
           IDMFCT+MI+V NEF VPCY+FYT +A FL FS +LQELY +N  N   EQL NS NVEL
Sbjct: 122 IDMFCTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVEL 181

Query: 184 ALPSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNG 243
            LP+F+NPIP+K IP    +KD A WFH++ KRFR E+KGILINTF EME  + K   + 
Sbjct: 182 TLPNFVNPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHVAK---SY 241

Query: 244 SSKIPKVYTVGPILQLKSIGVTQSNNALNGADIL-KWLDDQPPASVVFLCFGSKGSFDED 303
           S  +P +Y VGP+L LK+ GV  S+ A N ADI+ KWLDDQPP+SVV +CFG+  SFDE 
Sbjct: 242 SQVLPPLYFVGPVLHLKNAGVAGSSEAQNNADIIMKWLDDQPPSSVVLVCFGTMVSFDEA 301

Query: 304 QVLEIARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWA 363
           QV EIA ALE S VRF+WSLRQPPPKGKFE P NY +I + LPEGFL+RT  IGRVIGW 
Sbjct: 302 QVAEIANALEESGVRFIWSLRQPPPKGKFEAPKNYNDIRNFLPEGFLDRTMSIGRVIGWT 361

Query: 364 PQIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELT 423
            Q+EIL+HPA GGFISHCGWNS LESVWHGV +ATWP++AEQQFNAFEMVVELGLAVE+T
Sbjct: 362 SQVEILAHPAIGGFISHCGWNSVLESVWHGVLIATWPMHAEQQFNAFEMVVELGLAVEVT 421

Query: 424 LDYVKDFHIGRSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKSMMEGGSSFNSL 483
           LDY   F   + R+VSAEEI+SGI+KLMG+  NE+RKK+K K EESRKS+MEGGSSF SL
Sbjct: 422 LDYRITFGEDKPRLVSAEEIKSGIKKLMGEESNEVRKKVKAKSEESRKSVMEGGSSFVSL 481

Query: 484 RHFIDDALTNLQEG 488
             FIDD L N   G
Sbjct: 482 GKFIDDVLANSAGG 487

BLAST of Csa4G618520 vs. NCBI nr
Match: gi|659129338|ref|XP_008464636.1| (PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis melo])

HSP 1 Score: 607.1 bits (1564), Expect = 2.8e-170
Identity = 314/495 (63.43%), Positives = 376/495 (75.96%), Query Frame = 1

Query: 4   FELVFIPGPGIGHLASTVELANVLVSRDDRLSVTVLAIKLPNDIKTTTERIQSLSASFEG 63
           FEL+FIP PGIGHLASTVE+ANVLV+RD RLSVT+LA+KLP D+K   E I+SLS SF G
Sbjct: 2   FELIFIPAPGIGHLASTVEMANVLVTRDHRLSVTLLAMKLPYDVKVA-ECIESLSTSFAG 61

Query: 64  KSIRFIVLPELPFPNQSSEPPPLMLQAFLESHKPHVREIVTNLIH------DSNRLVGFV 123
           K+I+F VLPE P P +S +         +ES+KP+VRE V+N         DS RLVG V
Sbjct: 62  KNIQFNVLPEPPLPEESKKD----FIVLVESYKPYVREAVSNFTASAATSLDSPRLVGLV 121

Query: 124 IDMFCTSMINVANEFKVPCYLFYTSNAGFLDFSFHLQELYNQN--NSTAEQLQNS-NVEL 183
           IDMFCT+MI+V NEF VPCY+FYT +A FL FS +LQELY +N  N   EQL NS NVEL
Sbjct: 122 IDMFCTTMIDVGNEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNVEL 181

Query: 184 ALPSFINPIPNKAIPPFLFDKDMAAWFHDNTKRFRSEVKGILINTFVEMEPQIVKWMSNG 243
            LP+F NPIP+K IP    +KD A WFH++ KRFR E+KGILINTF EME    K   + 
Sbjct: 182 TLPNFANPIPSKLIPTLFSNKDKAVWFHNHIKRFRLEIKGILINTFEEMESHAAK---SY 241

Query: 244 SSKIPKVYTVGPILQLKSIGVTQSNNALNGADIL-KWLDDQPPASVVFLCFGSKGSFDED 303
           S  +P +Y VGP+L LK+ GV  S+ A + ADI+ KWLDDQPP+SVV +CFG+  SFDE 
Sbjct: 242 SQVLPPLYFVGPVLHLKNAGVAGSSEAQDNADIIMKWLDDQPPSSVVLVCFGTMVSFDEA 301

Query: 304 QVLEIARALERSEVRFLWSLRQPPPKGKFEEPSNYANINDVLPEGFLNRTADIGRVIGWA 363
           QV EIA ALE S VRF+WSLRQPPPKGKFE P NY ++ + LPEGFL+RT  IGRVIGW 
Sbjct: 302 QVAEIANALEESGVRFIWSLRQPPPKGKFEAPRNYNDVKNFLPEGFLDRTMSIGRVIGWT 361

Query: 364 PQIEILSHPATGGFISHCGWNSTLESVWHGVPMATWPLYAEQQFNAFEMVVELGLAVELT 423
            Q+EIL+HPA GGF+SHCGWNS LESVWHGVP+ATWP++AEQQFNAFEMVVELGLAVE+T
Sbjct: 362 SQVEILAHPAIGGFVSHCGWNSILESVWHGVPIATWPMHAEQQFNAFEMVVELGLAVEVT 421

Query: 424 LDYVKDFHIGRSRIVSAEEIESGIRKLMGDSGNEIRKKIKVKGEESRKSMMEGGSSFNSL 483
           LDY   F   + R+VSAEE++SGI+KLMG+  +E+RKK+K K EES+KS+MEGGSSF SL
Sbjct: 422 LDYRITFGEDKPRLVSAEEVKSGIKKLMGEESDEVRKKVKAKSEESQKSVMEGGSSFISL 481

Query: 484 RHFIDDALTNLQEGN 489
             FIDD L N   G+
Sbjct: 482 GKFIDDVLANSTGGS 488
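
Each HSP header above reports identity and positives as a count over the aligned length, followed by a percentage. As a minimal sketch, assuming the header line has exactly the layout shown on this page, the percentages can be recomputed from the counts. The helper name `parse_hsp_stats` is hypothetical and is not part of BLAST or of this database; the pattern accepts the positives label spelled either "Positives" or "Postives".

```python
import re

# Hypothetical pattern for the HSP header layout shown on this page.
# "Posi?tives" matches both "Positives" and the variant "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts from one HSP header line and recompute percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP identity line: %r" % line)
    identical, ident_len, ident_pct, positives, pos_len, pos_pct = match.groups()
    return {
        "identical": int(identical),
        "aligned_length": int(ident_len),
        # Recomputed as 100 * matches / aligned length, rounded to 2 places.
        "identity_pct": round(100 * int(identical) / int(ident_len), 2),
        "reported_identity_pct": float(ident_pct),
        "positives": int(positives),
        "positives_pct": round(100 * int(positives) / int(pos_len), 2),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    header = "Identity = 223/483 (46.17%), Postives = 301/483 (62.32%), Query Frame = 1"
    print(parse_hsp_stats(header))
```

Comparing the recomputed and reported percentages is a quick consistency check when copying values from the alignments into the summary tables.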

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
U7A16_PYRCO  2.1e-133  52.17     UDP-glycosyltransferase 71A16 OS=Pyrus communis GN=UGT71A16 PE=1 SV=1
UFOG3_FRAAN  6.1e-133  53.14     Putative UDP-glucose flavonoid 3-O-glucosyltransferase 3 OS=Fragaria ananassa GN...
U7A15_MALDO  2.5e-131  51.55     UDP-glycosyltransferase 71A15 OS=Malus domestica GN=UGT71A15 PE=1 SV=1
UFOG6_FRAAN  6.3e-130  50.62     UDP-glucose flavonoid 3-O-glucosyltransferase 6 OS=Fragaria ananassa GN=GT6 PE=1...
U71E1_STERE  2.6e-123  50.21     UDP-glycosyltransferase 71E1 OS=Stevia rebaudiana GN=UGT71E1 PE=2 SV=1

Match Name        E-value   Identity  Description
A0A0A0L321_CUCSA  2.0e-284  100.00    Glycosyltransferase OS=Cucumis sativus GN=Csa_4G618520 PE=3 SV=1
K7NBW4_SIRGR      9.3e-181  67.07     Glycosyltransferase OS=Siraitia grosvenorii GN=UDPG7 PE=2 SV=1
A0A0A0L1T2_CUCSA  5.0e-158  64.54     Uncharacterized protein OS=Cucumis sativus GN=Csa_4G618540 PE=4 SV=1
A0A0A0KZV7_CUCSA  2.0e-154  62.02     Glycosyltransferase OS=Cucumis sativus GN=Csa_4G618530 PE=3 SV=1
A0A0A0L341_CUCSA  7.4e-154  57.37     Glycosyltransferase OS=Cucumis sativus GN=Csa_4G620550 PE=3 SV=1

Match Name   E-value   Identity  Description
AT3G21760.1  4.6e-123  48.27     UDP-Glycosyltransferase superfamily protein
AT3G21780.1  2.5e-121  48.16     UDP-glucosyl transferase 71B6
AT4G15280.1  2.6e-118  46.17     UDP-glucosyl transferase 71B5
AT3G21790.1  1.5e-113  45.84     UDP-Glycosyltransferase superfamily protein
AT3G21750.1  1.5e-113  42.77     UDP-glucosyl transferase 71B1

Match Name                        E-value   Identity  Description
gi|449456653|ref|XP_004146063.1|  2.9e-284  100.00    PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis sativus]
gi|659129348|ref|XP_008464641.1|  2.8e-271  95.50     PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis melo]
gi|343466221|gb|AEM43004.1|       1.3e-180  67.07     UDP-glucosyltransferase [Siraitia grosvenorii]
gi|449456657|ref|XP_004146065.1|  4.6e-173  64.78     PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis sativus]
gi|659129338|ref|XP_008464636.1|  2.8e-170  63.43     PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis melo]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR002213   UDP_glucos_trans
Vocabulary: Molecular Function
Term        Definition
GO:0016758  transferase activity, transferring hexosyl groups
Vocabulary: Biological Process
Term        Definition
GO:0008152  metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016758 transferase activity, transferring hexosyl groups
This gene is associated with the following unigenes:
Unigene Name  Analysis Name                        Sequence type in Unigene
CU141647      cucumber EST collection version 3.0  transcribed_cluster
CU171581      cucumber EST collection version 3.0  transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Csa4G618520.1  Csa4G618520.1  mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name  Unique Name  Type
CU141647      CU141647     transcribed_cluster
CU171581      CU171581     transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term   IPR Description                           Source   Source Term         Source Description                              Alignment
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  PANTHER  PTHR11926           GLUCOSYL/GLUCURONOSYL TRANSFERASES              coord: 1..485  score: 1.2E
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  PFAM     PF00201             UDPGT                                           coord: 278..412  score: 1.0
IPR002213  UDP-glucuronosyl/UDP-glucosyltransferase  PROSITE  PS00375             UDPGT                                           coord: 352..395  scor
None       No IPR available                          unknown  Coil                Coil                                            coord: 153..173  scor
None       No IPR available                          GENE3D   G3DSA:3.40.50.2000                                                  coord: 265..413  score: 1.4
None       No IPR available                          PANTHER  PTHR11926:SF242     UDP-GLYCOSYLTRANSFERASE 71B2-RELATED            coord: 1..485  score: 1.2E
None       No IPR available                          unknown  SSF53756            UDP-Glycosyltransferase/glycogen phosphorylase  coord: 2..480  score: 1.45E

The following gene(s) are orthologous to this gene:
Gene         Orthologue    Organism            Block
Csa4G618520  Cucsa.058370  Cucumber (Gy14) v1  cgycuB042
The following gene(s) are paralogous to this gene:

None