CsaV3_3G017050 (gene) Cucumber (Chinese Long) v3

Overview
Name: CsaV3_3G017050
Type: gene
Organism: Cucumis sativus L. var. sativus cv. Chinese Long (Cucumber (Chinese Long) v3)
Description: Arabidopsis thaliana protein of unknown function (DUF821)
Location: chr3: 12792039 .. 12795743 (-)
RNA-Seq Expression: CsaV3_3G017050
Synteny: CsaV3_3G017050
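The Location field gives a start and an end coordinate on chr3. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome annotations, but an assumption here), the span of the locus can be computed as below. The function name `locus_span` is hypothetical and used only for illustration.

```python
def locus_span(start: int, end: int) -> int:
    """Length in bases of a closed, 1-based interval [start, end]."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field (chr3, minus strand).
span = locus_span(12792039, 12795743)
print(span)
```

The strand sign does not change the span; it only indicates that the gene is read from the higher coordinate towards the lower one.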
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AGAAGAGTTTGGGTACAATTCTTGGGTTTTTATCATTTAAAATAAATGGGTAGCATCGTTCATAGAAACCCAATACCTAAACCCTATTTCCCCTATTTTTTCTTCTACGTTTTGCTCTTTGTTGTTGGCTATTTCATAATCTCTTCACAAATATCCGTAAGTTTTTTTTCTCTCTCTCTCTCTTTTTTACTTTTCAAAATCTGCATTTTATGGATTAGAAAATTTATAATCTCTTTTTAATATATATATATTTGTTTTAAAATTACGACGATTATTGAAACGATGTGTCGGATGATATAAATATTTTTGTTGTCACTTTTGGCGAAGCCGATGGGGGCGAGGAGAGAAAGGGAATTACAAAATTACCCTCAGAAGGAAGTCGAATTTTCTCCCATTAATTGTACGGCATATTCACGGAGCGAGAAATGGGATAGTGGGATCGGTCCCACCACAATAGAGGAAGAGGAAGAAGATGGAGACGGGAAAAATGAAAACACGTGTCCGGAATACTTCCGTTGGATCCACGAGGATCTAAAGCCGTGGGCTGAGACAGGGATCACGAGGGAGATGGTGGAGAGAGGGCGGGAGAATGCTACCTTCAGGCTGGTGATCGTCGGCGGTAGGGCTTACGTGGAGAAGTATTCAGAAGTGTTTCAAAGGAGGGATGTTTTTACGCTGTGGGGGATCCTACAATTGTTACGGTGGTACCCAGATCAAATTCCTGATTTGGACCTCATGTTTGCTTGTGAAGACCAGCCCACTGTTTTTATTGGTAATTATAGTGGGCCTGGGCCCAATTCAACGGCCCCACCTCCTTTGTTCCGGTACTGTGGAGACGATGACACCTTTGACATCGTTTTTCCTGATTGGTCCTTCTGGGGATGGTAAATATTTCTTCCCTTCTCCAACCCATTCAAATATTTGCTATTTCTTTTTAATTATTTTAAATGACAAAAATATACAAATAATAAACTATTATCTTTTTTATAGTAATAAATAATAATCTATTTTTATTTATTTTGAAAACAATATTTGAATAAATCTATACTTATGTTTAAATGTTTAAATGTTAAGAGGTCTATTACTATAGATTTTTAAGGTATAGAAGTTTATTGTTTTAACTGATGGATGGATTTTAGACAGGCACCACATCAATATATTTCTCTTCTTTTATTTTTATTTGTAAATATATGTGATGATTATTTATTGAATTATTGAATTATTGAATTATACAAGGCCAGAGATTAATTTAAAGCCATGGGAAACAGAGATGAAAGAACTAAAGGAAGCAAACCAAAGGAAAAAATGGATAGACAGAGAAAACTATGCTTTTTGGAAGGGAAATACTTTTATTTCTATGCCCAGATATCAACTTTTAAAATGCAGTCGCTCTACTCAATCCAAACTTCGTGTCTACATGCAGGTACATGTTATTATATTTCTTAATTATTTACTTATCATTCACCTCTAATACTATATTTTTCTTTTTTAGTCGAACCATTACTTCAATTTAATCCAACACCCTATAATAACACTACTACTACTCCCATTATTCTTAGGAGGGTTGTCAAATATAGTAAACTTTCAAATTCTATCAAAAATCTATCAAGGTTTATCAGTATCGATTAGAAATTTTGTTTTATTTATAAATATTTAGGCATTATTATTATTATACCTAGAATTTCATCGATAATAATGGTTATGCAGGATTGGCAAGAAGAAGGTAAACAAGGATTCAAAAACTCAAATCTTGCTGATCAATGTTTTTCTAGGTATTTCTTTTTTACACTTATTAACTTTTTTGTTATGTTTTTTCTTTGGGTTAAAGTTTGCCCGAGTTTTAATATATCTTTAAAATTTTTAAATATATATAGTTATTAATGTCACGTGCTATATAGGACTTTAAACTTTGGATTGTGTCTAATATGTCACACACATATTTAACATTTATTTAAAAACAATAAATTCTTTAATATAAAATTGAATAGTTTGTTTAATAG
AAAATTATGAAGGTTTTTAAAATGATATAGAATAATTTAAGATACAAAATTGAAGATTATAAGAATGTATTAGACATTTGCTTGTAATGTTTAAAATCCTATTAAACATTTTAGTTTAATCACTTATTAGACGTTGATTTGATTCTCTTTTTTATAACTTTTGTGGAGCTATTTGATGCAAAATATATAATTGAAGTATTTCTTTTGTTGTTTGAGTGAATATATCAATGATATGAAAACTATTTTGTAGGTATAAAGTTTACATCGAGGGGATTGGTTGGTCAGTGAGTCTCAAATATATTCTTGCTTGTGATTCTATGACATTAATGGTAAAACCTCATTTCTATGATTTCTTCACAAGAAGTCTAGTGCCAATGCATCACTATTGGCCAATCAAAGATGATGATGATATGTGCAAATCTATCAAATTTGCTGTTGAGTGGGGGACTACCCACAAACAAAAGGTGCTTAATTTCATTATATCTAACAATTTAGTTTTAAATAATAATATCTATCGTCTCCTTGAAAAGATATATAGACACAACAATGTGATTGATTAATCACATTAACACTATTAGATTGTGTGAGAAAAACTATGAAAACCTTGATTGGCTAAAATTTATGTTTCTTAAATGTCCTTGTTAGGCACAAGCAATTGGGAAGGCAGCAAGTAAGTTCATGGAAGAGCAACTAAACATGGACAAGGTGTATGATTACATGTTTCATACTCTAAATGAATACTCCAAGCTCTTAACTTTCAAACCAACCATCCCACCAAATGCTACTGAAATTTCTTTGAACGATTTGGCTTGCCCTACCGAAGGCTTAGCTGCCAAGTCCATGATGGATACCCTCATAAAACGACCTTCCTTCTCGAGCCCTTGCTTCTTGCTTCCTCCTTTTAGCCCGTTTGCTCTCGACTACATTCGAACCAGAAAAGATATTCCAATCAAACAAATTGATATGTGGGAGAAAAATATGCCCTTTTAGAAGACCAATTCAAGGCTTGTCATTCGGTTTCTTTTTCAAATCTAAAATCATAGCTTAATAACCATTTTGCTTTCATAAATAATACTACTATCAAAAGATGCAGGTAGATTTTAATTTGTTTCTACCAAATTAGTCTTAGAAAAATGATAAGTATAGTTTGTAGAGAAGATAATCTTCTTTCAATTCAAATGTAACTTTAAAATCATTCGTTGAATAAAAAACTTAAACAAATGGGTGAAATAAATTTAATATTATATCGTAAAACATCTTCTGTTACTTGTCGGAACCTAACAGAATTTTACTCTTGCGCTCGAGTTAGTTTGAATACATAAAATTTTAAGTTGATGGATGATGTACATCAAAATTAATATATGATATCATGTAATAAACATATTTCTTGCCAATTTAATTTTGGGAAAGAGCAAAGGATTAATTTACCACAATTGATCACAAGTTGCAAAGCCATCAGAAATGTGTCCCTCAAAATCAAGGTCCCAATTGAGCATCTCAAAACCATAGCCCTCTCTACTTGTTCTTCTTCTTCCCTCACCATCAAAACTTCCTTCCATGGAGCTCCCGCCATTAACATCAGACACAGAAAGCCATTCTGCAAACAAGACCCTCGGTTGGCTTTCAATCCGCTCATTTCCCGAACTTCCCATCTCCTCCAACGGCGGATCTCGATTCGAAAAAACTCCACTTTTGAAATCTTGGA

mRNA sequence

ATGGGTAGCATCGTTCATAGAAACCCAATACCTAAACCCTATTTCCCCTATTTTTTCTTCTACGTTTTGCTCTTTGTTGTTGGCTATTTCATAATCTCTTCACAAATATCCCCGATGGGGGCGAGGAGAGAAAGGGAATTACAAAATTACCCTCAGAAGGAAGTCGAATTTTCTCCCATTAATTGTACGGCATATTCACGGAGCGAGAAATGGGATAGTGGGATCGGTCCCACCACAATAGAGGAAGAGGAAGAAGATGGAGACGGGAAAAATGAAAACACGTGTCCGGAATACTTCCGTTGGATCCACGAGGATCTAAAGCCGTGGGCTGAGACAGGGATCACGAGGGAGATGGTGGAGAGAGGGCGGGAGAATGCTACCTTCAGGCTGGTGATCGTCGGCGGTAGGGCTTACGTGGAGAAGTATTCAGAAGTGTTTCAAAGGAGGGATGTTTTTACGCTGTGGGGGATCCTACAATTGTTACGGTGGTACCCAGATCAAATTCCTGATTTGGACCTCATGTTTGCTTGTGAAGACCAGCCCACTGTTTTTATTGGTAATTATAGTGGGCCTGGGCCCAATTCAACGGCCCCACCTCCTTTGTTCCGGTACTGTGGAGACGATGACACCTTTGACATCGTTTTTCCTGATTGGTCCTTCTGGGGATGGCCAGAGATTAATTTAAAGCCATGGGAAACAGAGATGAAAGAACTAAAGGAAGCAAACCAAAGGAAAAAATGGATAGACAGAGAAAACTATGCTTTTTGGAAGGGAAATACTTTTATTTCTATGCCCAGATATCAACTTTTAAAATGCAGTCGCTCTACTCAATCCAAACTTCGTGTCTACATGCAGGATTGGCAAGAAGAAGGTAAACAAGGATTCAAAAACTCAAATCTTGCTGATCAATGTTTTTCTAGGTATAAAGTTTACATCGAGGGGATTGGTTGGTCAGTGAGTCTCAAATATATTCTTGCTTGTGATTCTATGACATTAATGGTAAAACCTCATTTCTATGATTTCTTCACAAGAAGTCTAGTGCCAATGCATCACTATTGGCCAATCAAAGATGATGATGATATGTGCAAATCTATCAAATTTGCTGTTGAGTGGGGGACTACCCACAAACAAAAGGTGCTTAATTTCATTATATCTAACAATTTAGTTTTAAATAATAATATCTATCGTCTCCTTGAAAAGATATATAGACACAACAATGTGATTGATTAA

Coding sequence (CDS)

ATGGGTAGCATCGTTCATAGAAACCCAATACCTAAACCCTATTTCCCCTATTTTTTCTTCTACGTTTTGCTCTTTGTTGTTGGCTATTTCATAATCTCTTCACAAATATCCCCGATGGGGGCGAGGAGAGAAAGGGAATTACAAAATTACCCTCAGAAGGAAGTCGAATTTTCTCCCATTAATTGTACGGCATATTCACGGAGCGAGAAATGGGATAGTGGGATCGGTCCCACCACAATAGAGGAAGAGGAAGAAGATGGAGACGGGAAAAATGAAAACACGTGTCCGGAATACTTCCGTTGGATCCACGAGGATCTAAAGCCGTGGGCTGAGACAGGGATCACGAGGGAGATGGTGGAGAGAGGGCGGGAGAATGCTACCTTCAGGCTGGTGATCGTCGGCGGTAGGGCTTACGTGGAGAAGTATTCAGAAGTGTTTCAAAGGAGGGATGTTTTTACGCTGTGGGGGATCCTACAATTGTTACGGTGGTACCCAGATCAAATTCCTGATTTGGACCTCATGTTTGCTTGTGAAGACCAGCCCACTGTTTTTATTGGTAATTATAGTGGGCCTGGGCCCAATTCAACGGCCCCACCTCCTTTGTTCCGGTACTGTGGAGACGATGACACCTTTGACATCGTTTTTCCTGATTGGTCCTTCTGGGGATGGCCAGAGATTAATTTAAAGCCATGGGAAACAGAGATGAAAGAACTAAAGGAAGCAAACCAAAGGAAAAAATGGATAGACAGAGAAAACTATGCTTTTTGGAAGGGAAATACTTTTATTTCTATGCCCAGATATCAACTTTTAAAATGCAGTCGCTCTACTCAATCCAAACTTCGTGTCTACATGCAGGATTGGCAAGAAGAAGGTAAACAAGGATTCAAAAACTCAAATCTTGCTGATCAATGTTTTTCTAGGTATAAAGTTTACATCGAGGGGATTGGTTGGTCAGTGAGTCTCAAATATATTCTTGCTTGTGATTCTATGACATTAATGGTAAAACCTCATTTCTATGATTTCTTCACAAGAAGTCTAGTGCCAATGCATCACTATTGGCCAATCAAAGATGATGATGATATGTGCAAATCTATCAAATTTGCTGTTGAGTGGGGGACTACCCACAAACAAAAGGTGCTTAATTTCATTATATCTAACAATTTAGTTTTAAATAATAATATCTATCGTCTCCTTGAAAAGATATATAGACACAACAATGTGATTGATTAA

Protein sequence

MGSIVHRNPIPKPYFPYFFFYVLLFVVGYFIISSQISPMGARRERELQNYPQKEVEFSPINCTAYSRSEKWDSGIGPTTIEEEEEDGDGKNENTCPEYFRWIHEDLKPWAETGITREMVERGRENATFRLVIVGGRAYVEKYSEVFQRRDVFTLWGILQLLRWYPDQIPDLDLMFACEDQPTVFIGNYSGPGPNSTAPPPLFRYCGDDDTFDIVFPDWSFWGWPEINLKPWETEMKELKEANQRKKWIDRENYAFWKGNTFISMPRYQLLKCSRSTQSKLRVYMQDWQEEGKQGFKNSNLADQCFSRYKVYIEGIGWSVSLKYILACDSMTLMVKPHFYDFFTRSLVPMHHYWPIKDDDDMCKSIKFAVEWGTTHKQKVLNFIISNNLVLNNNIYRLLEKIYRHNNVID*
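The protein sequence above should correspond to a translation of the CDS. The following is a small sketch, assuming the standard nuclear genetic code, that translates the first and last few codons of the CDS shown on this page and compares them with the ends of the protein sequence. The helper name `translate` is hypothetical; only short excerpts of the CDS are used, not the full sequence.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 11 codons and last 5 codons of the CDS listed above.
cds_head = "ATGGGTAGCATCGTTCATAGAAACCCAATACCT"
cds_tail = "AATGTGATTGATTAA"

print(translate(cds_head))
print(translate(cds_tail))
```

The same function can be applied to the full CDS to check it against the complete protein sequence, including the terminal `*`.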
Homology
BLAST of CsaV3_3G017050 vs. NCBI nr
Match: KGN57403.2 (hypothetical protein Csa_011064 [Cucumis sativus])

HSP 1 Score: 874.4 bits (2258), Expect = 3.9e-250
Identity = 409/409 (100.00%), Positives = 409/409 (100.00%), Query Frame = 0

Query: 1   MGSIVHRNPIPKPYFPYFFFYVLLFVVGYFIISSQISPMGARRERELQNYPQKEVEFSPI 60
           MGSIVHRNPIPKPYFPYFFFYVLLFVVGYFIISSQISPMGARRERELQNYPQKEVEFSPI
Sbjct: 1   MGSIVHRNPIPKPYFPYFFFYVLLFVVGYFIISSQISPMGARRERELQNYPQKEVEFSPI 60

Query: 61  NCTAYSRSEKWDSGIGPTTIEEEEEDGDGKNENTCPEYFRWIHEDLKPWAETGITREMVE 120
           NCTAYSRSEKWDSGIGPTTIEEEEEDGDGKNENTCPEYFRWIHEDLKPWAETGITREMVE
Sbjct: 61  NCTAYSRSEKWDSGIGPTTIEEEEEDGDGKNENTCPEYFRWIHEDLKPWAETGITREMVE 120

Query: 121 RGRENATFRLVIVGGRAYVEKYSEVFQRRDVFTLWGILQLLRWYPDQIPDLDLMFACEDQ 180
           RGRENATFRLVIVGGRAYVEKYSEVFQRRDVFTLWGILQLLRWYPDQIPDLDLMFACEDQ
Sbjct: 121 RGRENATFRLVIVGGRAYVEKYSEVFQRRDVFTLWGILQLLRWYPDQIPDLDLMFACEDQ 180

Query: 181 PTVFIGNYSGPGPNSTAPPPLFRYCGDDDTFDIVFPDWSFWGWPEINLKPWETEMKELKE 240
           PTVFIGNYSGPGPNSTAPPPLFRYCGDDDTFDIVFPDWSFWGWPEINLKPWETEMKELKE
Sbjct: 181 PTVFIGNYSGPGPNSTAPPPLFRYCGDDDTFDIVFPDWSFWGWPEINLKPWETEMKELKE 240

Query: 241 ANQRKKWIDRENYAFWKGNTFISMPRYQLLKCSRSTQSKLRVYMQDWQEEGKQGFKNSNL 300
           ANQRKKWIDRENYAFWKGNTFISMPRYQLLKCSRSTQSKLRVYMQDWQEEGKQGFKNSNL
Sbjct: 241 ANQRKKWIDRENYAFWKGNTFISMPRYQLLKCSRSTQSKLRVYMQDWQEEGKQGFKNSNL 300

Query: 301 ADQCFSRYKVYIEGIGWSVSLKYILACDSMTLMVKPHFYDFFTRSLVPMHHYWPIKDDDD 360
           ADQCFSRYKVYIEGIGWSVSLKYILACDSMTLMVKPHFYDFFTRSLVPMHHYWPIKDDDD
Sbjct: 301 ADQCFSRYKVYIEGIGWSVSLKYILACDSMTLMVKPHFYDFFTRSLVPMHHYWPIKDDDD 360

Query: 361 MCKSIKFAVEWGTTHKQKVLNFIISNNLVLNNNIYRLLEKIYRHNNVID 410
           MCKSIKFAVEWGTTHKQKVLNFIISNNLVLNNNIYRLLEKIYRHNNVID
Sbjct: 361 MCKSIKFAVEWGTTHKQKVLNFIISNNLVLNNNIYRLLEKIYRHNNVID 409
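Each hit summary reports identity and positives as a count over the aligned length, followed by a percentage. As a rough illustration of how those percentages follow from the counts, the sketch below recomputes them for two hits on this page. The function name `percent` is hypothetical, and rounding to two decimals is an assumption that happens to reproduce the reported values.

```python
def percent(matches: int, aligned: int) -> str:
    """Format matches/aligned as a percentage with two decimals."""
    if aligned <= 0:
        raise ValueError("aligned length must be positive")
    return f"{100 * matches / aligned:.2f}"


# Counts copied from the hit summaries on this page.
print(percent(409, 409))  # KGN57403.2, identity
print(percent(295, 345))  # KAA0033638.1, identity
print(percent(315, 345))  # KAA0033638.1, positives
```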

BLAST of CsaV3_3G017050 vs. NCBI nr
Match: XP_031737709.1 (O-glucosyltransferase rumi homolog [Cucumis sativus])

HSP 1 Score: 815.8 bits (2106), Expect = 1.6e-232
Identity = 378/378 (100.00%), Positives = 378/378 (100.00%), Query Frame = 0

Query: 1   MGSIVHRNPIPKPYFPYFFFYVLLFVVGYFIISSQISPMGARRERELQNYPQKEVEFSPI 60
           MGSIVHRNPIPKPYFPYFFFYVLLFVVGYFIISSQISPMGARRERELQNYPQKEVEFSPI
Sbjct: 1   MGSIVHRNPIPKPYFPYFFFYVLLFVVGYFIISSQISPMGARRERELQNYPQKEVEFSPI 60

Query: 61  NCTAYSRSEKWDSGIGPTTIEEEEEDGDGKNENTCPEYFRWIHEDLKPWAETGITREMVE 120
           NCTAYSRSEKWDSGIGPTTIEEEEEDGDGKNENTCPEYFRWIHEDLKPWAETGITREMVE
Sbjct: 61  NCTAYSRSEKWDSGIGPTTIEEEEEDGDGKNENTCPEYFRWIHEDLKPWAETGITREMVE 120

Query: 121 RGRENATFRLVIVGGRAYVEKYSEVFQRRDVFTLWGILQLLRWYPDQIPDLDLMFACEDQ 180
           RGRENATFRLVIVGGRAYVEKYSEVFQRRDVFTLWGILQLLRWYPDQIPDLDLMFACEDQ
Sbjct: 121 RGRENATFRLVIVGGRAYVEKYSEVFQRRDVFTLWGILQLLRWYPDQIPDLDLMFACEDQ 180

Query: 181 PTVFIGNYSGPGPNSTAPPPLFRYCGDDDTFDIVFPDWSFWGWPEINLKPWETEMKELKE 240
           PTVFIGNYSGPGPNSTAPPPLFRYCGDDDTFDIVFPDWSFWGWPEINLKPWETEMKELKE
Sbjct: 181 PTVFIGNYSGPGPNSTAPPPLFRYCGDDDTFDIVFPDWSFWGWPEINLKPWETEMKELKE 240

Query: 241 ANQRKKWIDRENYAFWKGNTFISMPRYQLLKCSRSTQSKLRVYMQDWQEEGKQGFKNSNL 300
           ANQRKKWIDRENYAFWKGNTFISMPRYQLLKCSRSTQSKLRVYMQDWQEEGKQGFKNSNL
Sbjct: 241 ANQRKKWIDRENYAFWKGNTFISMPRYQLLKCSRSTQSKLRVYMQDWQEEGKQGFKNSNL 300

Query: 301 ADQCFSRYKVYIEGIGWSVSLKYILACDSMTLMVKPHFYDFFTRSLVPMHHYWPIKDDDD 360
           ADQCFSRYKVYIEGIGWSVSLKYILACDSMTLMVKPHFYDFFTRSLVPMHHYWPIKDDDD
Sbjct: 301 ADQCFSRYKVYIEGIGWSVSLKYILACDSMTLMVKPHFYDFFTRSLVPMHHYWPIKDDDD 360

Query: 361 MCKSIKFAVEWGTTHKQK 379
           MCKSIKFAVEWGTTHKQK
Sbjct: 361 MCKSIKFAVEWGTTHKQK 378
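The hit summaries on this page follow a fixed two-line pattern (score and E-value, then identity and positives). The sketch below shows one way such lines could be parsed into numbers for filtering or tabulation. It is an illustration only: the function name `parse_hsp` and the regular expressions are assumptions based on the text shown here, not part of any BLAST tool, and the pattern accepts both the spelling "Positives" and the variant "Postives".

```python
import re

SCORE_RE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
IDENT_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def parse_hsp(score_line: str, identity_line: str) -> dict:
    """Extract numeric fields from one HSP summary (two text lines)."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("lines do not match the expected summary layout")
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        "identical": int(i.group(1)),
        "positives": int(i.group(3)),
        "aligned": int(i.group(2)),
    }


# Summary lines copied from the XP_031737709.1 hit above.
hsp = parse_hsp(
    "HSP 1 Score: 815.8 bits (2106), Expect = 1.6e-232",
    "Identity = 378/378 (100.00%), Positives = 378/378 (100.00%), Query Frame = 0",
)
print(hsp)
```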

BLAST of CsaV3_3G017050 vs. NCBI nr
Match: KAA0033638.1 (O-glucosyltransferase rumi-like protein [Cucumis melo var. makuwa] >TYK22937.1 O-glucosyltransferase rumi-like protein [Cucumis melo var. makuwa])

HSP 1 Score: 638.3 bits (1645), Expect = 4.7e-179
Identity = 295/345 (85.51%), Positives = 315/345 (91.30%), Query Frame = 0

Query: 38  PMGARRERELQNYPQKEVEFSPINCTAYSRSEKWDSGIGPTTIEEEEED--GDGKNENTC 97
           P+GAR E ELQN P+KEVEFSP+NCTAYSR EKW S  GP TIE+EEED  G+ +NENTC
Sbjct: 8   PIGARIEVELQNDPRKEVEFSPVNCTAYSRREKWHSRRGP-TIEKEEEDAIGERQNENTC 67

Query: 98  PEYFRWIHEDLKPWAETGITREMVERGRENATFRLVIVGGRAYVEKYSEVFQRRDVFTLW 157
           PEYF+WIHEDLKPWA TGITREMVERGR  ATFRLVIVGGR YVEKYSEV+QRRD+FTLW
Sbjct: 68  PEYFQWIHEDLKPWAGTGITREMVERGRGKATFRLVIVGGRVYVEKYSEVYQRRDIFTLW 127

Query: 158 GILQLLRWYPDQIPDLDLMFACEDQPTVFIGNYSGPGPNSTAPPPLFRYCGDDDTFDIVF 217
           GILQLLRWYPD+IPDLDLMF+CEDQP +FIGNYSGPGPNS APPPLFRYCGDDDT DIVF
Sbjct: 128 GILQLLRWYPDKIPDLDLMFSCEDQPNIFIGNYSGPGPNSMAPPPLFRYCGDDDTLDIVF 187

Query: 218 PDWSFWGWPEINLKPWETEMKELKEANQRKKWIDRENYAFWKGNTFISMPRYQLLKCSRS 277
           PDWSFWGWPEIN+KPWET MKELKE N RKKWI+RENYA+WKGN FISMPRY+LLKCSRS
Sbjct: 188 PDWSFWGWPEINIKPWETLMKELKEGNGRKKWINRENYAYWKGNAFISMPRYKLLKCSRS 247

Query: 278 TQS--KLRVYMQDWQEEGKQGFKNSNLADQCFSRYKVYIEGIGWSVSLKYILACDSMTLM 337
           TQ   K RVYMQDW +E KQGFKNSNLADQCFSRYK+YIEGIGWSVSLKYILACDSMTLM
Sbjct: 248 TQHDWKARVYMQDWHKEVKQGFKNSNLADQCFSRYKIYIEGIGWSVSLKYILACDSMTLM 307

Query: 338 VKPHFYDFFTRSLVPMHHYWPIKDDDDMCKSIKFAVEWGTTHKQK 379
           VKPHFYDFFTRSLVPMHHYWPIKDDDDMCKSIKFAVEWG  HK++
Sbjct: 308 VKPHFYDFFTRSLVPMHHYWPIKDDDDMCKSIKFAVEWGNAHKKE 351

BLAST of CsaV3_3G017050 vs. NCBI nr
Match: XP_038886324.1 (protein O-glucosyltransferase 1-like [Benincasa hispida])

HSP 1 Score: 560.1 bits (1442), Expect = 1.6e-155
Identity = 255/338 (75.44%), Positives = 287/338 (84.91%), Query Frame = 0

Query: 43  RERELQNYPQKEVEFSPINCTAYSRSEKWDSGIGPTTIEEEEEDGDGKNENTCPEYFRWI 102
           R+ EL  YP+ EV+FSP+NCTAYSRSEKW     PT + +EEED DG+N +TCPEYFRWI
Sbjct: 4   RDVELHIYPKMEVKFSPVNCTAYSRSEKWHMS-SPTRV-KEEEDRDGQNGDTCPEYFRWI 63

Query: 103 HEDLKPWAETGITREMVERGRENATFRLVIVGGRAYVEKYSEVFQRRDVFTLWGILQLLR 162
           HEDL+PWA+TGITREMVERGR  A FRLVIV GR YVEKY+E FQ RD FTLWGILQLLR
Sbjct: 64  HEDLRPWAQTGITREMVERGRPAADFRLVIVDGRVYVEKYAEAFQSRDSFTLWGILQLLR 123

Query: 163 WYPDQIPDLDLMFACEDQPTVFIGNYSGPGPNSTAPPPLFRYCGDDDTFDIVFPDWSFWG 222
           WYP +IPDLDLMF C DQP +FIGNYSGP PN+TAPPPLFRYCG+DDT DI+FPDWSFWG
Sbjct: 124 WYPGKIPDLDLMFHCGDQPNIFIGNYSGPRPNTTAPPPLFRYCGNDDTLDILFPDWSFWG 183

Query: 223 WPEINLKPWETEMKELKEANQRKKWIDRENYAFWKGNTFISMPRYQLLKCSRSTQS--KL 282
           WPEI +KPW + MKELK+ NQRKKWIDRE YA+WKGN  +S  RY+L KC+ STQ   K+
Sbjct: 184 WPEIKIKPWTSLMKELKQGNQRKKWIDREAYAYWKGNALVSWSRYRLRKCNLSTQYDWKV 243

Query: 283 RVYMQDWQEEGKQGFKNSNLADQCFSRYKVYIEGIGWSVSLKYILACDSMTLMVKPHFYD 342
           RVYMQDW +E KQGFKNSNLADQC  RYK+YIEGI WS SLKYILACDS+TLMV PH+YD
Sbjct: 244 RVYMQDWLKEVKQGFKNSNLADQCVYRYKIYIEGISWSASLKYILACDSVTLMVNPHYYD 303

Query: 343 FFTRSLVPMHHYWPIKDDDDMCKSIKFAVEWGTTHKQK 379
           FF+RSLVPMHHYWPIKDD++MC SIKFAV+WG  HKQK
Sbjct: 304 FFSRSLVPMHHYWPIKDDNEMCNSIKFAVDWGNAHKQK 339

BLAST of CsaV3_3G017050 vs. NCBI nr
Match: KAG6576728.1 (Protein O-glucosyltransferase 1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 520.0 bits (1338), Expect = 1.9e-143
Identity = 249/380 (65.53%), Positives = 293/380 (77.11%), Query Frame = 0

Query: 7   RNPIPKPYFPYFFFYVLLFVVGYFIISSQI---SPMGARRERELQNYPQK-EVEFSPINC 66
           R P+ +  F  F   V L +    IISS++    P    R+ EL  YP + EV+   +NC
Sbjct: 6   RKPVAQLRFVIFSVSVSLSIAACLIISSRLLRHVPTTNGRDAELHIYPHRSEVQLPSVNC 65

Query: 67  TAYSRSEKWDSGIGPTTIEEEEEDGDGKN-ENTCPEYFRWIHEDLKPWAETGITREMVER 126
           TA+S S K  +    + I  EE D D +N + TCPEYFRWIHEDL+PWA TGITREMVE 
Sbjct: 66  TAFSWSGKCRT--RSSVIVNEEGDRDRQNFDTTCPEYFRWIHEDLRPWAGTGITREMVES 125

Query: 127 GRENATFRLVIVGGRAYVEKYSEVFQRRDVFTLWGILQLLRWYPDQIPDLDLMFACEDQP 186
           GR  A FRLVI+ GRAYVEK+ + +Q RD FTLWGILQLLR YP +IPDLDLMF CED+P
Sbjct: 126 GRPKAGFRLVIIDGRAYVEKFMDAYQSRDKFTLWGILQLLRLYPGKIPDLDLMFNCEDRP 185

Query: 187 TVFIGNYSGPGPNSTAPPPLFRYCGDDDTFDIVFPDWSFWGWPEINLKPWETEMKELKEA 246
            +FIG+YSGPGPNSTAPPPLFRYCGDDDT DIVFPDWSFWGWPEIN+KPW   M++LK+ 
Sbjct: 186 NIFIGDYSGPGPNSTAPPPLFRYCGDDDTLDIVFPDWSFWGWPEINIKPWVPLMEDLKQG 245

Query: 247 NQRKKWIDRENYAFWKGNTFISMPRYQLLKC--SRSTQSKLRVYMQDWQEEGKQGFKNSN 306
           N+R KW +RE YA+WKGN  +SM RY+LL+C  SR    K RV+MQDW +E K+ FKNSN
Sbjct: 246 NKRTKWSNREAYAYWKGNIKVSMVRYKLLECNLSREHDWKARVFMQDWDKEQKERFKNSN 305

Query: 307 LADQCFSRYKVYIEGIGWSVSLKYILACDSMTLMVKPHFYDFFTRSLVPMHHYWPIKDDD 366
           LADQC  RYK+Y+EG+GWSVSLKYILACDS+TLMV P++YDFFTRSLVPMHHYWPIKDDD
Sbjct: 306 LADQCVHRYKIYVEGVGWSVSLKYILACDSVTLMVNPYYYDFFTRSLVPMHHYWPIKDDD 365

Query: 367 DMCKSIKFAVEWGTTHKQKV 380
           DMC SIKFAV+WG TH+QKV
Sbjct: 366 DMCNSIKFAVDWGNTHQQKV 383

BLAST of CsaV3_3G017050 vs. ExPASy Swiss-Prot
Match: Q29AU6 (O-glucosyltransferase rumi OS=Drosophila pseudoobscura pseudoobscura OX=46245 GN=rumi PE=3 SV=1)

HSP 1 Score: 85.5 bits (210), Expect = 1.5e-15
Identity = 74/285 (25.96%), Positives = 119/285 (41.75%), Query Frame = 0

Query: 91  NENTCPEYFRWIHEDLKPWAETGITREMVERGRENATFRLVIVGGRAYVEKYSEVFQRRD 150
           N+  C  +   I  DL P+  TG++R+M+E      T R  I   R Y E+ + +F  R 
Sbjct: 67  NDANCSCHAAVIKSDLAPYKATGVSRQMIESSARYGT-RYKIYEKRLYREE-NCMFPAR- 126

Query: 151 VFTLWGILQLLRWYPDQIPDLDLMFACEDQPTVFIGNYSGPGPNSTAPPPLFRYCGDDDT 210
                GI   L      +PD+DL+    D P + +   +G      A  P+  +    D 
Sbjct: 127 ---CQGIEHFLLPLVATLPDMDLVINTRDYPQINMAWGNG------AQGPILSFSKTKDH 186

Query: 211 FDIVFPDWSFW-GWPEINLKP-----WETEMKELKEANQRKKWIDRENYAFWKGNTFISM 270
            DI++P W+FW G P   L P     W+   ++L++      W  +    F++G+   S 
Sbjct: 187 RDIMYPAWTFWAGGPATKLHPRGIGRWDLMREKLEKRAAAIPWSQKRELGFFRGSR-TSD 246

Query: 271 PRYQLLKCSRSTQSKLRVYMQDWQEEGKQGFKNS------------NLADQCFSRYKVYI 330
            R  L+  SR         + + Q    QG+K+             +  D C  +Y    
Sbjct: 247 ERDSLILLSRRNPE-----LVEAQYTKNQGWKSPKDTLDAPPAGEVSFEDHCKYKYLFNF 306

Query: 331 EGIGWSVSLKYILACDSMTLMVKPHFYDFFTRSLVPMHHYWPIKD 358
            G+  S  LK++  C S+   V   + +FF   L P  HY P+K+
Sbjct: 307 RGVAASFRLKHLFLCQSLVFHVGDEWQEFFYDQLKPWVHYVPLKN 333

BLAST of CsaV3_3G017050 vs. ExPASy Swiss-Prot
Match: Q8T045 (O-glucosyltransferase rumi OS=Drosophila melanogaster OX=7227 GN=rumi PE=1 SV=1)

HSP 1 Score: 83.6 bits (205), Expect = 5.8e-15
Identity = 70/283 (24.73%), Positives = 116/283 (40.99%), Query Frame = 0

Query: 92  ENTCPEYFRWIHEDLKPWAETGITREMVERGRENATFRLVIVGGRAYVEKYSEVFQRRDV 151
           ++ C  +   +  DL P+  TG+TR+M+E      T +  I G R Y +       R + 
Sbjct: 70  DSDCSCHANVLKRDLAPYKSTGVTRQMIESSARYGT-KYKIYGHRLYRDANCMFPARCE- 129

Query: 152 FTLWGILQLLRWYPDQIPDLDLMFACEDQPTVFIGNYSGPGPNSTAPPPLFRYCGDDDTF 211
               GI   L      +PD+DL+    D P +           + A  P+F +    +  
Sbjct: 130 ----GIEHFLLPLVATLPDMDLIINTRDYPQL------NAAWGNAAGGPVFSFSKTKEYR 189

Query: 212 DIVFPDWSFW-GWPEINLKP-----WETEMKELKEANQRKKWIDRENYAFWKGNTFISMP 271
           DI++P W+FW G P   L P     W+   ++L++      W  + +  F++G+   S  
Sbjct: 190 DIMYPAWTFWAGGPATKLHPRGIGRWDQMREKLEKRAAAIPWSQKRSLGFFRGSR-TSDE 249

Query: 272 RYQLLKCSRSTQSKLRVYMQDWQEEGKQGFKNS------------NLADQCFSRYKVYIE 331
           R  L+  SR         + + Q    QG+K+             +  D C  +Y     
Sbjct: 250 RDSLILLSRRNPE-----LVEAQYTKNQGWKSPKDTLDAPAADEVSFEDHCKYKYLFNFR 309

Query: 332 GIGWSVSLKYILACDSMTLMVKPHFYDFFTRSLVPMHHYWPIK 357
           G+  S  LK++  C S+   V   + +FF   L P  HY P+K
Sbjct: 310 GVAASFRLKHLFLCKSLVFHVGDEWQEFFYDQLKPWVHYVPLK 334

BLAST of CsaV3_3G017050 vs. ExPASy Swiss-Prot
Match: A0NDG6 (O-glucosyltransferase rumi homolog OS=Anopheles gambiae OX=7165 GN=AGAP004267 PE=3 SV=1)

HSP 1 Score: 82.8 bits (203), Expect = 9.9e-15
Identity = 71/278 (25.54%), Positives = 120/278 (43.17%), Query Frame = 0

Query: 91  NENTCPEYFRWIHEDLKPWAETGITREMVERGRENATFRLVIVGGRAYVEKYSEVFQRRD 150
           N   C  +   +  DLKP+   GIT+EM+ R ++  T    ++G + Y ++   +F  R 
Sbjct: 67  NSTNCNCHADVLKADLKPFKAHGITKEMINRAKQYGT-HYQVIGHKLYRQREC-MFPAR- 126

Query: 151 VFTLWGILQLLRWYPDQIPDLDLMFACEDQPTVFIGNYSGPGPNSTAPPPLFRYCGDDDT 210
                G+   +R     +PD+DL+  C D P +           S    P+  +    + 
Sbjct: 127 ---CSGVEHFVRPLLPLLPDMDLIVNCRDWPQIH-------RHWSKEKIPVLSFSKTAEY 186

Query: 211 FDIVFPDWSFW-GWPEINLKP-----WETEMKELKEANQRKKWIDRENYAFWKGNTFISM 270
            DI++P W+FW G P I L P     W+   + + +A+    W  +E  AF++G+   S 
Sbjct: 187 LDIMYPAWAFWEGGPAIALYPTGLGRWDLHRQTITKAS--ADWEAKEPKAFFRGSR-TSD 246

Query: 271 PRYQLLKCSRSTQSKLRVYM---QDWQEE----GKQGFKNSNLADQCFSRYKVYIEGIGW 330
            R  L+  SR+  S +       Q W+        +  +   L + C  R+     G+  
Sbjct: 247 ERDALVLLSRAQPSLVDAQYTKNQAWKSPQDTLNAEPAREVTLEEHCRYRFLFNFRGVAA 306

Query: 331 SVSLKYILACDSMTLMVKPHFYDFFTRSLVPMHHYWPI 356
           S   K++  C S+   V   + +FF  SL P  HY P+
Sbjct: 307 SFRFKHLFLCRSLVFHVGDEWQEFFYPSLKPWVHYVPV 328

BLAST of CsaV3_3G017050 vs. ExPASy Swiss-Prot
Match: Q7ZVE6 (Protein O-glucosyltransferase 2 OS=Danio rerio OX=7955 GN=poglut2 PE=2 SV=1)

HSP 1 Score: 68.9 bits (167), Expect = 1.5e-10
Identity = 72/297 (24.24%), Positives = 127/297 (42.76%), Query Frame = 0

Query: 84  EEDGDGKNENT-CPEYFRWIHEDLKPWAETGITR---EMVER-GRENATFRLVIVGGRAY 143
           E DG    +N  CP  F  I  DL  +      R   E+++R G+ ++     I   + Y
Sbjct: 140 EPDGALWEKNMHCPASFSQIESDLSIFQSVDPDRNAHEIIQRFGKSHSLCHYTIKNNQVY 199

Query: 144 VEKYSEVFQRRDVFTLWGILQLLRWYPDQIPDLDLMFACEDQPTVFIGNYSGPGPNSTAP 203
           ++ + E    R +F    +L L R    ++PD++      D P             S  P
Sbjct: 200 IKTHGEHVGFR-IFMDAFLLSLTR--KVKLPDIEFFVNLGDWPL-------EKRRASQNP 259

Query: 204 PPLFRYCGDDDTFDIVFPDWSFWGWPEINLKPWETEMKELKEANQRKKWIDRENYAFWKG 263
            P+F +CG +DT DIV P +         +     +M  + + +    W  + N  FW+G
Sbjct: 260 SPVFSWCGSNDTRDIVMPTYDLTESVLETMGRVSLDMMSV-QGHTGPVWEKKINKGFWRG 319

Query: 264 NTFISMPRYQLLKCSRSTQSKLRVYMQDW----QEEGKQG--FKNSNLADQCFSRYKVYI 323
                  R +L+K +R+  + L   + ++     +E   G   K+ +  D    +Y++ +
Sbjct: 320 RD-SRKERLELVKLARANTAMLDAALTNFFFFKHDESLYGPLVKHVSFFDFFKYKYQINV 379

Query: 324 EGIGWSVSLKYILACDSMTLMVKPHFYDFFTRSLVPMHHYWPIKDD-DDMCKSIKFA 369
           +G   +  L Y+LA DS+       +Y+ F   L P  HY P + D  D+ + I++A
Sbjct: 380 DGTVAAYRLPYLLAGDSVVFKHDSIYYEHFYNELQPWVHYIPFRSDLSDLLEKIQWA 424

BLAST of CsaV3_3G017050 vs. ExPASy TrEMBL
Match: A0A0A0L8N9 (CAP10 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G182050 PE=4 SV=1)

HSP 1 Score: 738.4 bits (1905), Expect = 1.6e-209
Identity = 342/348 (98.28%), Positives = 344/348 (98.85%), Query Frame = 0

Query: 39  MGARRERELQNYPQKEVEFSPINCTAYSRSEKWDSGIGPTTIEEEEEDGDGKNENTCPEY 98
           MGARRERELQNYPQKEVEFSPINCTAYSRSEKWDSGIGPTTIEEEEEDGDGKNENTCPEY
Sbjct: 1   MGARRERELQNYPQKEVEFSPINCTAYSRSEKWDSGIGPTTIEEEEEDGDGKNENTCPEY 60

Query: 99  FRWIHEDLKPWAETGITREMVERGRENATFRLVIVGGRAYVEKYSEVFQRRDVFTLWGIL 158
           FRWIHEDLKPWAETGITREMVERGRENATFRLVIVGGRAYVEKYSEVFQRRDVFTLWGIL
Sbjct: 61  FRWIHEDLKPWAETGITREMVERGRENATFRLVIVGGRAYVEKYSEVFQRRDVFTLWGIL 120

Query: 159 QLLRWYPDQIPDLDLMFACEDQPTVFIGNYSGPGPNSTAPPPLFRYCGDDDTFDIVFPDW 218
           QLLRWYPDQIPDLDLMFACEDQPTVFIGNYSGPGPNSTAPPPLFRYCGDDDTFDIVFPDW
Sbjct: 121 QLLRWYPDQIPDLDLMFACEDQPTVFIGNYSGPGPNSTAPPPLFRYCGDDDTFDIVFPDW 180

Query: 219 SFWGWPEINLKPWETEMKELKEANQRKKWIDRENYAFWKGNTFISMPRYQLLKCSRSTQS 278
           SFWGWPEINLKPWETEMKELKEANQRKKWIDRENYAFWKGNTFISMPRYQLLKCSRSTQS
Sbjct: 181 SFWGWPEINLKPWETEMKELKEANQRKKWIDRENYAFWKGNTFISMPRYQLLKCSRSTQS 240

Query: 279 KLRVYMQDWQEEGKQGFKNSNLADQCFSRYKVYIEGIGWSVSLKYILACDSMTLMVKPHF 338
           KLRVYMQDWQEEGKQGFKNSNLADQCFSRYKVYIEGIGWSVSLKYILACDSMTLMVKPHF
Sbjct: 241 KLRVYMQDWQEEGKQGFKNSNLADQCFSRYKVYIEGIGWSVSLKYILACDSMTLMVKPHF 300

Query: 339 YDFFTRSLVPMHHYWPIKDDDDMCKSIKFAVEWGTTHKQKVLNFIISN 387
           YDFFTRSLVPMHHYWPIKDDDDMCKSIKFAVEWGTTHKQK ++   SN
Sbjct: 301 YDFFTRSLVPMHHYWPIKDDDDMCKSIKFAVEWGTTHKQKQVSSWKSN 348

BLAST of CsaV3_3G017050 vs. ExPASy TrEMBL
Match: A0A5D3DGW6 (O-glucosyltransferase rumi-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold386G00150 PE=4 SV=1)

HSP 1 Score: 638.3 bits (1645), Expect = 2.3e-179
Identity = 295/345 (85.51%), Positives = 315/345 (91.30%), Query Frame = 0

Query: 38  PMGARRERELQNYPQKEVEFSPINCTAYSRSEKWDSGIGPTTIEEEEED--GDGKNENTC 97
           P+GAR E ELQN P+KEVEFSP+NCTAYSR EKW S  GP TIE+EEED  G+ +NENTC
Sbjct: 8   PIGARIEVELQNDPRKEVEFSPVNCTAYSRREKWHSRRGP-TIEKEEEDAIGERQNENTC 67

Query: 98  PEYFRWIHEDLKPWAETGITREMVERGRENATFRLVIVGGRAYVEKYSEVFQRRDVFTLW 157
           PEYF+WIHEDLKPWA TGITREMVERGR  ATFRLVIVGGR YVEKYSEV+QRRD+FTLW
Sbjct: 68  PEYFQWIHEDLKPWAGTGITREMVERGRGKATFRLVIVGGRVYVEKYSEVYQRRDIFTLW 127

Query: 158 GILQLLRWYPDQIPDLDLMFACEDQPTVFIGNYSGPGPNSTAPPPLFRYCGDDDTFDIVF 217
           GILQLLRWYPD+IPDLDLMF+CEDQP +FIGNYSGPGPNS APPPLFRYCGDDDT DIVF
Sbjct: 128 GILQLLRWYPDKIPDLDLMFSCEDQPNIFIGNYSGPGPNSMAPPPLFRYCGDDDTLDIVF 187

Query: 218 PDWSFWGWPEINLKPWETEMKELKEANQRKKWIDRENYAFWKGNTFISMPRYQLLKCSRS 277
           PDWSFWGWPEIN+KPWET MKELKE N RKKWI+RENYA+WKGN FISMPRY+LLKCSRS
Sbjct: 188 PDWSFWGWPEINIKPWETLMKELKEGNGRKKWINRENYAYWKGNAFISMPRYKLLKCSRS 247

Query: 278 TQS--KLRVYMQDWQEEGKQGFKNSNLADQCFSRYKVYIEGIGWSVSLKYILACDSMTLM 337
           TQ   K RVYMQDW +E KQGFKNSNLADQCFSRYK+YIEGIGWSVSLKYILACDSMTLM
Sbjct: 248 TQHDWKARVYMQDWHKEVKQGFKNSNLADQCFSRYKIYIEGIGWSVSLKYILACDSMTLM 307

Query: 338 VKPHFYDFFTRSLVPMHHYWPIKDDDDMCKSIKFAVEWGTTHKQK 379
           VKPHFYDFFTRSLVPMHHYWPIKDDDDMCKSIKFAVEWG  HK++
Sbjct: 308 VKPHFYDFFTRSLVPMHHYWPIKDDDDMCKSIKFAVEWGNAHKKE 351

BLAST of CsaV3_3G017050 vs. ExPASy TrEMBL
Match: A0A6J1E6Y7 (O-glucosyltransferase rumi homolog OS=Cucurbita moschata OX=3662 GN=LOC111430515 PE=4 SV=1)

HSP 1 Score: 512.3 bits (1318), Expect = 1.9e-141
Identity = 235/340 (69.12%), Positives = 276/340 (81.18%), Query Frame = 0

Query: 43  RERELQNYPQK-EVEFSPINCTAYSRSEKWDSGIGPTTIEEEEEDGDGKNENTCPEYFRW 102
           R+ EL  YP + EV+   +NCTA+S SEK  +    + I  EE D D +N +TCPEYFRW
Sbjct: 17  RDAELHIYPHRSEVQLPSVNCTAFSWSEKCRT--RSSVIVNEEGDRDRQNFDTCPEYFRW 76

Query: 103 IHEDLKPWAETGITREMVERGRENATFRLVIVGGRAYVEKYSEVFQRRDVFTLWGILQLL 162
           IHEDL+PW  TGITREM+E GR  A FRLVI+ GRAYVEK+ + +Q RD FTLWGILQLL
Sbjct: 77  IHEDLRPWTRTGITREMLESGRPKAGFRLVIIDGRAYVEKFMDAYQSRDKFTLWGILQLL 136

Query: 163 RWYPDQIPDLDLMFACEDQPTVFIGNYSGPGPNSTAPPPLFRYCGDDDTFDIVFPDWSFW 222
           R YP +IPDLDLMF CED+P +FIG+YSGPGPNSTAPPP+FRYCGDDDT DIVFPDWSFW
Sbjct: 137 RLYPGKIPDLDLMFNCEDRPNIFIGDYSGPGPNSTAPPPVFRYCGDDDTLDIVFPDWSFW 196

Query: 223 GWPEINLKPWETEMKELKEANQRKKWIDRENYAFWKGNTFISMPRYQLLKC--SRSTQSK 282
           GWPEIN+KPW   M++LK+ N+R KW +RE +A+WKGN  +SM RY+LL C  SR    K
Sbjct: 197 GWPEINIKPWVPLMEDLKQGNKRTKWSNREAHAYWKGNIKVSMVRYKLLGCNLSREHDWK 256

Query: 283 LRVYMQDWQEEGKQGFKNSNLADQCFSRYKVYIEGIGWSVSLKYILACDSMTLMVKPHFY 342
            RV+MQDW +E +Q FKNSNLADQC  RYK+Y+EG+GWSVSLKYILACDS+TLMV P++Y
Sbjct: 257 ARVFMQDWDKEQEQRFKNSNLADQCVHRYKIYVEGVGWSVSLKYILACDSVTLMVNPYYY 316

Query: 343 DFFTRSLVPMHHYWPIKDDDDMCKSIKFAVEWGTTHKQKV 380
           DFFTRSLVPMHHYWPIKDDDDMC SIKFAV+WG TH+QKV
Sbjct: 317 DFFTRSLVPMHHYWPIKDDDDMCNSIKFAVDWGNTHQQKV 354

BLAST of CsaV3_3G017050 vs. ExPASy TrEMBL
Match: A0A1S3CSM5 (O-glucosyltransferase rumi homolog OS=Cucumis melo OX=3656 GN=LOC103504495 PE=4 SV=1)

HSP 1 Score: 487.6 bits (1254), Expect = 4.9e-134
Identity = 227/273 (83.15%), Positives = 244/273 (89.38%), Query Frame = 0

Query: 38  PMGARRERELQNYPQKEVEFSPINCTAYSRSEKWDSGIGPTTIEEEEED--GDGKNENTC 97
           P+GAR E ELQNYP+KEVEFSP+NCTAYSR EKW S  GP TIE+EEED  G+ +NENTC
Sbjct: 8   PIGARIEVELQNYPRKEVEFSPVNCTAYSRREKWHSRRGP-TIEKEEEDAIGERQNENTC 67

Query: 98  PEYFRWIHEDLKPWAETGITREMVERGRENATFRLVIVGGRAYVEKYSEVFQRRDVFTLW 157
           PEYF+WIHEDLKPWA TGITREMVERGR  ATFRLVIVGGR YVEKY E +QRRD+FTLW
Sbjct: 68  PEYFQWIHEDLKPWAGTGITREMVERGRGKATFRLVIVGGRVYVEKYLEAYQRRDIFTLW 127

Query: 158 GILQLLRWYPDQIPDLDLMFACEDQPTVFIGNYSGPGPNSTAPPPLFRYCGDDDTFDIVF 217
           GILQLLRWYPD+IPDLDLMF+CEDQP +FIGNYSGPGPNS APPPLFRYCGDDDT DIVF
Sbjct: 128 GILQLLRWYPDKIPDLDLMFSCEDQPNIFIGNYSGPGPNSMAPPPLFRYCGDDDTLDIVF 187

Query: 218 PDWSFWGWPEINLKPWETEMKELKEANQRKKWIDRENYAFWKGNTFISMPRYQLLKCSRS 277
           PDWSFWGWPEIN+KPWET MKELKE N RKKWI+RENYA+WKGN FISMPRY+LLKCSRS
Sbjct: 188 PDWSFWGWPEINIKPWETLMKELKEGNGRKKWINRENYAYWKGNAFISMPRYKLLKCSRS 247

Query: 278 TQS--KLRVYMQDWQEEGKQGFKNSNLADQCFS 307
           TQ   K RVYMQDW +E KQGFKNSNLADQCFS
Sbjct: 248 TQHDWKARVYMQDWHKEVKQGFKNSNLADQCFS 279

BLAST of CsaV3_3G017050 vs. ExPASy TrEMBL
Match: A0A803PQS7 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 455.3 bits (1170), Expect = 2.7e-124
Identity = 212/349 (60.74%), Positives = 259/349 (74.21%), Query Frame = 0

Query: 32  ISSQISPMGARRERELQNYPQKEVEFSPINCTAYSRSEKWDSGIGPTTIEEEEEDGDGKN 91
           I S  S    +++ E+   P+ ++E  P+NCT Y+ +    +   PTT   EE+D D  +
Sbjct: 58  ILSTTSHKYPQKQPEISKTPRPKIEI-PLNCTDYNPTHTCPTNY-PTTFNPEEDDQDRPS 117

Query: 92  ENTCPEYFRWIHEDLKPWAETGITREMVERGRENATFRLVIVGGRAYVEKYSEVFQRRDV 151
            +TCPEYFRWIHEDL+PWA+TGITREMV+R +  A FRLVIV G+AYVEKY + FQ RDV
Sbjct: 118 PSTCPEYFRWIHEDLRPWAQTGITREMVDRAKRTANFRLVIVKGKAYVEKYQKSFQTRDV 177

Query: 152 FTLWGILQLLRWYPDQIPDLDLMFACEDQPTVFIGNYSGPGPNSTAPPPLFRYCGDDDTF 211
           FTLWGILQLLR YP ++PDLDLMF C D P +    YS  GPN+T+PPPLFRYCGDD + 
Sbjct: 178 FTLWGILQLLRRYPGRVPDLDLMFDCVDWPVILSKAYS--GPNATSPPPLFRYCGDDSSL 237

Query: 212 DIVFPDWSFWGWPEINLKPWETEMKELKEANQRKKWIDRENYAFWKGNTFISMPRYQLLK 271
           DIVFPDWSFWGWPEIN+KPW   MKEL+E N R +W+DRE YA+WKGN  ++  R  LLK
Sbjct: 238 DIVFPDWSFWGWPEINIKPWVPLMKELEEGNNRARWVDREPYAYWKGNPVVAATRQDLLK 297

Query: 272 CSRSTQSK--LRVYMQDWQEEGKQGFKNSNLADQCFSRYKVYIEGIGWSVSLKYILACDS 331
           C+ S Q +   RVY QDW  E +QG+K SNLA QC  RYK+YIEG  WSVS KYILACDS
Sbjct: 298 CNVSDQQEWNARVYAQDWGRETQQGYKQSNLASQCVHRYKIYIEGSAWSVSEKYILACDS 357

Query: 332 MTLMVKPHFYDFFTRSLVPMHHYWPIKDDDDMCKSIKFAVEWGTTHKQK 379
           +TL+VKP +YDFFTR L+P  HYWPIK DD  CKSIKFAV+WG +H+++
Sbjct: 358 VTLLVKPRYYDFFTRGLIPGQHYWPIK-DDHKCKSIKFAVDWGNSHQKE 401

BLAST of CsaV3_3G017050 vs. TAIR 10
Match: AT5G23850.1 (Arabidopsis thaliana protein of unknown function (DUF821))

HSP 1 Score: 422.5 bits (1085), Expect = 3.7e-118
Identity = 192/345 (55.65%), Positives = 251/345 (72.75%), Query Frame = 0

Query: 36  ISPMGARRERELQNYPQKEVEFSPINCTAYSRSEKWDSGIGPTTIEEEEEDGDGKNENTC 95
           I+P   R    +   P+ E     ++C+A   +    S   PTT   E++D +     TC
Sbjct: 83  ITPKYPRPTTVITQSPKPEF---TLHCSANETTASCPSNKYPTTTSFEDDDTNHPPTATC 142

Query: 96  PEYFRWIHEDLKPWAETGITREMVERGRENATFRLVIVGGRAYVEKYSEVFQRRDVFTLW 155
           P+YFRWIHEDL+PW+ TGITRE +ER ++ ATFRL IVGG+ YVEK+ + FQ RDVFT+W
Sbjct: 143 PDYFRWIHEDLRPWSRTGITREALERAKKTATFRLAIVGGKIYVEKFQDAFQTRDVFTIW 202

Query: 156 GILQLLRWYPDQIPDLDLMFACEDQPTVFIGNYSGPGPNSTAPPPLFRYCGDDDTFDIVF 215
           G LQLLR YP +IPDL+LMF C D P V    ++  G N+ +PPPLFRYCG+++T DIVF
Sbjct: 203 GFLQLLRKYPGKIPDLELMFDCVDWPVVRATEFA--GANAPSPPPLFRYCGNEETLDIVF 262

Query: 216 PDWSFWGWPEINLKPWETEMKELKEANQRKKWIDRENYAFWKGNTFISMPRYQLLKCSRS 275
           PDWSFWGW E+N+KPWE+ +KEL+E N+R KWI+RE YA+WKGN  ++  R  L+KC+ S
Sbjct: 263 PDWSFWGWAEVNIKPWESLLKELREGNERTKWINREPYAYWKGNPMVAETRQDLMKCNVS 322

Query: 276 TQSK--LRVYMQDWQEEGKQGFKNSNLADQCFSRYKVYIEGIGWSVSLKYILACDSMTLM 335
            + +   R+Y QDW +E K+G+K S+LA QC  RYK+YIEG  WSVS KYILACDS+TL+
Sbjct: 323 EEHEWNARLYAQDWIKESKEGYKQSDLASQCHHRYKIYIEGSAWSVSEKYILACDSVTLL 382

Query: 336 VKPHFYDFFTRSLVPMHHYWPIKDDDDMCKSIKFAVEWGTTHKQK 379
           VKPH+YDFFTR L+P HHYWP++ + D C+SIKFAV+WG +H QK
Sbjct: 383 VKPHYYDFFTRGLLPAHHYWPVR-EHDKCRSIKFAVDWGNSHIQK 421

BLAST of CsaV3_3G017050 vs. TAIR 10
Match: AT3G48980.1 (Arabidopsis thaliana protein of unknown function (DUF821))

HSP 1 Score: 412.1 bits (1058), Expect = 5.1e-115
Identity = 201/389 (51.67%), Positives = 257/389 (66.07%), Query Frame = 0

Query: 17  YFFFYVLLF-VVGYFIISSQISPMGARRERELQNYPQKEVEFSP---------------- 76
           Y FF + LF ++G F+ +  +       E+E  +  ++E   SP                
Sbjct: 36  YAFFSIFLFLLLGAFLSTRLLLDPSVLIEKEAVSVTERETTQSPEYPQSTKLITEKPKEF 95

Query: 77  -INCTAYSRSEKWDSGI-----GPTTIEEE--EEDGDGKNENTCPEYFRWIHEDLKPWAE 136
            +NC A+S +   D+G       PT+      E + D     TCP+YFRWIHEDL+PW +
Sbjct: 96  TLNCAAFSGN---DTGTCPKDNYPTSFRSSAGEGESDRSPSATCPDYFRWIHEDLRPWEK 155

Query: 137 TGITREMVERGRENATFRLVIVGGRAYVEKYSEVFQRRDVFTLWGILQLLRWYPDQIPDL 196
           TGITRE +ER    A FRL I+ GR YVEK+ E FQ RDVFT+WG +QLLR YP +IPDL
Sbjct: 156 TGITREALERANATAIFRLAIINGRIYVEKFREAFQTRDVFTIWGFVQLLRRYPGKIPDL 215

Query: 197 DLMFACEDQPTVFIGNYSGPGPNSTAPPPLFRYCGDDDTFDIVFPDWSFWGWPEINLKPW 256
           +LMF C D P V    ++  G +   PPPLFRYC +D+T DIVFPDWS+WGW E+N+KPW
Sbjct: 216 ELMFDCVDWPVVKAAEFA--GVDQPPPPPLFRYCANDETLDIVFPDWSYWGWAEVNIKPW 275

Query: 257 ETEMKELKEANQRKKWIDRENYAFWKGNTFISMPRYQLLKCSRST--QSKLRVYMQDWQE 316
           E+ +KEL+E NQR KWIDRE YA+WKGN  ++  R  L+KC+ S     K R+Y QDW +
Sbjct: 276 ESLLKELREGNQRTKWIDREPYAYWKGNPTVAETRLDLMKCNLSEVYDWKARLYKQDWVK 335

Query: 317 EGKQGFKNSNLADQCFSRYKVYIEGIGWSVSLKYILACDSMTLMVKPHFYDFFTRSLVPM 376
           E K+G+K S+LA QC  RYK+YIEG  WSVS KYILACDS+TLMVKPH+YDFFTR + P 
Sbjct: 336 ESKEGYKQSDLASQCHHRYKIYIEGSAWSVSEKYILACDSVTLMVKPHYYDFFTRGMFPG 395

Query: 377 HHYWPIKDDDDMCKSIKFAVEWGTTHKQK 379
           HHYWP+K +DD C+SIKFAV+WG  H +K
Sbjct: 396 HHYWPVK-EDDKCRSIKFAVDWGNLHMRK 418

BLAST of CsaV3_3G017050 vs. TAIR 10
Match: AT1G63420.1 (Arabidopsis thaliana protein of unknown function (DUF821))

HSP 1 Score: 386.0 bits (990), Expect = 3.9e-107
Identity = 176/325 (54.15%), Positives = 236/325 (72.62%), Query Frame = 0

Query: 58  SPINCTAYSRSEKWDSGIGPTTIEEEEEDGDGKNENTCPEYFRWIHEDLKPWAETGITRE 117
           S ++C+++    +  SG    T++        ++  +CP+YF+WIHEDLKPW ETGIT+E
Sbjct: 135 SSVDCSSFLNQNR--SGSCSRTLQSGYNQNQTESNRSCPDYFKWIHEDLKPWRETGITKE 194

Query: 118 MVERGRENATFRLVIVGGRAYVEKYSEVFQRRDVFTLWGILQLLRWYPDQIPDLDLMFAC 177
           MVERG+  A FRLVI+ G+ +VE Y +  Q RD FTLWGILQLLR YP ++PD+DLMF C
Sbjct: 195 MVERGKTTAHFRLVILNGKVFVENYKKSIQTRDAFTLWGILQLLRKYPGKLPDVDLMFDC 254

Query: 178 EDQPTVFIGNYSGPGPN-STAPPPLFRYCGDDDTFDIVFPDWSFWGWPEINLKPWETEMK 237
           +D+P +    Y+        APPPLFRYCGD  T DIVFPDWSFWGW EIN++ W   +K
Sbjct: 255 DDRPVIRSDGYNILNRTVENAPPPLFRYCGDRWTVDIVFPDWSFWGWQEINIREWSKVLK 314

Query: 238 ELKEANQRKKWIDRENYAFWKGNTFISMP-RYQLLKCSRST--QSKLRVYMQDWQEEGKQ 297
           E++E  ++KK+++R+ YA+WKGN F++ P R  LL C+ S+      R+++QDW  EG++
Sbjct: 315 EMEEGKKKKKFMERDAYAYWKGNPFVASPSREDLLTCNLSSLHDWNARIFIQDWISEGQR 374

Query: 298 GFKNSNLADQCFSRYKVYIEGIGWSVSLKYILACDSMTLMVKPHFYDFFTRSLVPMHHYW 357
           GF+NSN+A+QC  RYK+YIEG  WSVS KYILACDS+TLMVKP++YDFF+R+L P+ HYW
Sbjct: 375 GFENSNVANQCTYRYKIYIEGYAWSVSEKYILACDSVTLMVKPYYYDFFSRTLQPLQHYW 434

Query: 358 PIKDDDDMCKSIKFAVEWGTTHKQK 379
           PI+ D D C+SIKFAV+W   H QK
Sbjct: 435 PIR-DKDKCRSIKFAVDWLNNHTQK 456

BLAST of CsaV3_3G017050 vs. TAIR 10
Match: AT2G45830.1 (downstream target of AGL15 2 )

HSP 1 Score: 378.6 bits (971), Expect = 6.2e-105
Identity = 165/285 (57.89%), Positives = 211/285 (74.04%), Query Frame = 0

Query: 93  NTCPEYFRWIHEDLKPWAETGITREMVERGRENATFRLVIVGGRAYVEKYSEVFQRRDVF 152
           +TCP YFRWIHEDL+PW ETG+TR M+E+ R  A FR+VI+ GR YV+KY +  Q RDVF
Sbjct: 117 STCPSYFRWIHEDLRPWKETGVTRGMLEKARRTAHFRVVILDGRVYVKKYRKSIQTRDVF 176

Query: 153 TLWGILQLLRWYPDQIPDLDLMFACEDQPTVFIGNYSGPGPNSTAPPPLFRYCGDDDTFD 212
           TLWGI+QLLRWYP ++PDL+LMF  +D+PTV   ++   G    APPPLFRYC DD + D
Sbjct: 177 TLWGIVQLLRWYPGRLPDLELMFDPDDRPTVRSKDFQ--GQQHPAPPPLFRYCSDDASLD 236

Query: 213 IVFPDWSFWGWPEINLKPWETEMKELKEANQRKKWIDRENYAFWKGNTFISMPRYQLLKC 272
           IVFPDWSFWGW E+N+KPW+  +  ++E N+  +W DR  YA+W+GN  ++  R  LL+C
Sbjct: 237 IVFPDWSFWGWAEVNIKPWDKSLVAIEEGNKMTQWKDRVAYAYWRGNPNVAPTRRDLLRC 296

Query: 273 SRSTQS--KLRVYMQDWQEEGKQGFKNSNLADQCFSRYKVYIEGIGWSVSLKYILACDSM 332
           + S Q     R+Y+QDW  E ++GFKNSNL +QC  RYK+YIEG  WSVS KYI+ACDSM
Sbjct: 297 NVSAQEDWNTRLYIQDWDRESREGFKNSNLENQCTHRYKIYIEGWAWSVSEKYIMACDSM 356

Query: 333 TLMVKPHFYDFFTRSLVPMHHYWPIKDDDDMCKSIKFAVEWGTTH 376
           TL V+P FYDF+ R ++P+ HYWPI+ D   C S+KFAV WG TH
Sbjct: 357 TLYVRPMFYDFYVRGMMPLQHYWPIR-DTSKCTSLKFAVHWGNTH 398

BLAST of CsaV3_3G017050 vs. TAIR 10
Match: AT3G61270.1 (Arabidopsis thaliana protein of unknown function (DUF821) )

HSP 1 Score: 377.1 bits (967), Expect = 1.8e-104
Identity = 165/293 (56.31%), Positives = 210/293 (71.67%), Query Frame = 0

Query: 88  DGKNENTCPEYFRWIHEDLKPWAETGITREMVERGRENATFRLVIVGGRAYVEKYSEVFQ 147
           +    +TCP YFRWIHEDL+PW +TGITR M+E     A FRLVI  G+AYV++Y +  Q
Sbjct: 90  NSSKSSTCPSYFRWIHEDLRPWKQTGITRGMIEEASRTAHFRLVIRNGKAYVKRYKKSIQ 149

Query: 148 RRDVFTLWGILQLLRWYPDQIPDLDLMFACEDQPTVFIGNYSGPGPNSTAPPPLFRYCGD 207
            RD FTLWGILQLLRWYP ++PDL+LMF  +D+P V   ++ G       PPP+FRYC D
Sbjct: 150 TRDEFTLWGILQLLRWYPGKLPDLELMFDADDRPVVRSVDFIG---QQKEPPPVFRYCSD 209

Query: 208 DDTFDIVFPDWSFWGWPEINLKPWETEMKELKEANQRKKWIDRENYAFWKGNTFISMPRY 267
           D + DIVFPDWSFWGW E+N+KPW   ++ +KE N   +W DR  YA+W+GN ++   R 
Sbjct: 210 DASLDIVFPDWSFWGWAEVNVKPWGKSLEAIKEGNSMTQWKDRVAYAYWRGNPYVDPGRG 269

Query: 268 QLLKCSRSTQSK--LRVYMQDWQEEGKQGFKNSNLADQCFSRYKVYIEGIGWSVSLKYIL 327
            LLKC+ +   +   R+Y+QDW +E K+GFKNSNL +QC  RYK+YIEG  WSVS KYI+
Sbjct: 270 DLLKCNATEHEEWNTRLYIQDWDKETKEGFKNSNLENQCTHRYKIYIEGWAWSVSEKYIM 329

Query: 328 ACDSMTLMVKPHFYDFFTRSLVPMHHYWPIKDDDDMCKSIKFAVEWGTTHKQK 379
           ACDSMTL VKP FYDF+ R ++P+ HYWPI+ DD  C S+KFAV WG TH+ K
Sbjct: 330 ACDSMTLYVKPRFYDFYIRGMMPLQHYWPIR-DDSKCTSLKFAVHWGNTHEDK 378
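
The Identity and Positives percentages in each HSP header are the matched counts divided by the alignment length (for AT3G48980.1, 201/389 and 257/389). A minimal Python sketch for re-deriving them from a header line is given here; the function name `recompute_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Matches the statistics line of an HSP header as printed on this page.
# "Pos\w+" tolerates spelling variants of the "Positives" label.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Pos\w+ = (\d+)/(\d+) \([\d.]+%\)"
)


def recompute_percentages(stats_line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(stats_line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % stats_line)
    identical, ident_len, positive, pos_len = (int(g) for g in match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )


if __name__ == "__main__":
    line = "Identity = 165/285 (57.89%), Positives = 211/285 (74.04%), Query Frame = 0"
    print(recompute_percentages(line))
```

Comparing the recomputed pair with the percentages printed in the header is a quick consistency check when copying values out of the report.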

The following BLAST results are available for this feature:
Match Name       E-value    Identity (%)  Description
KGN57403.2       3.9e-250   100.00        hypothetical protein Csa_011064 [Cucumis sativus]
XP_031737709.1   1.6e-232   100.00        O-glucosyltransferase rumi homolog [Cucumis sativus]
KAA0033638.1     4.7e-179   85.51         O-glucosyltransferase rumi-like protein [Cucumis melo var. makuwa] >TYK22937.1 O...
XP_038886324.1   1.6e-155   75.44         protein O-glucosyltransferase 1-like [Benincasa hispida]
KAG6576728.1     1.9e-143   65.53         Protein O-glucosyltransferase 1, partial [Cucurbita argyrosperma subsp. sororia]

Match Name       E-value    Identity (%)  Description
Q29AU6           1.5e-15    25.96         O-glucosyltransferase rumi OS=Drosophila pseudoobscura pseudoobscura OX=46245 GN...
Q8T045           5.8e-15    24.73         O-glucosyltransferase rumi OS=Drosophila melanogaster OX=7227 GN=rumi PE=1 SV=1
A0NDG6           9.9e-15    25.54         O-glucosyltransferase rumi homolog OS=Anopheles gambiae OX=7165 GN=AGAP004267 PE...
Q7ZVE6           1.5e-10    24.24         Protein O-glucosyltransferase 2 OS=Danio rerio OX=7955 GN=poglut2 PE=2 SV=1

Match Name       E-value    Identity (%)  Description
A0A0A0L8N9       1.6e-209   98.28         CAP10 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G182050 PE=4 ...
A0A5D3DGW6       2.3e-179   85.51         O-glucosyltransferase rumi-like protein OS=Cucumis melo var. makuwa OX=1194695 G...
A0A6J1E6Y7       1.9e-141   69.12         O-glucosyltransferase rumi homolog OS=Cucurbita moschata OX=3662 GN=LOC111430515...
A0A1S3CSM5       4.9e-134   83.15         O-glucosyltransferase rumi homolog OS=Cucumis melo OX=3656 GN=LOC103504495 PE=4 ...
A0A803PQS7       2.7e-124   60.74         Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1

Match Name       E-value    Identity (%)  Description
AT5G23850.1      3.7e-118   55.65         Arabidopsis thaliana protein of unknown function (DUF821)
AT3G48980.1      5.1e-115   51.67         Arabidopsis thaliana protein of unknown function (DUF821)
AT1G63420.1      3.9e-107   54.15         Arabidopsis thaliana protein of unknown function (DUF821)
AT2G45830.1      6.2e-105   57.89         downstream target of AGL15 2
AT3G61270.1      1.8e-104   56.31         Arabidopsis thaliana protein of unknown function (DUF821)

InterPro
Analysis Name: InterPro Annotations of Cucumber (Chinese Long) v3
Date Performed: 2021-10-25
IPR Term   IPR Description                    Source   Source Term     Source Description                         Alignment
IPR006598  Glycosyl transferase CAP10 domain  SMART    SM00672         cap10                                      coord: 167..405; e-value: 1.8E-83; score: 293.3
IPR006598  Glycosyl transferase CAP10 domain  PFAM     PF05686         Glyco_transf_90                            coord: 91..379; e-value: 2.3E-130; score: 435.1
None       No IPR available                   PANTHER  PTHR12203:SF74  GLYCOSYLTRANSFERASE                        coord: 23..379
None       No IPR available                   PANTHER  PTHR12203       KDEL LYS-ASP-GLU-LEU CONTAINING - RELATED  coord: 23..379
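
The `coord:` values give the start and end of each domain hit on the protein. A small Python sketch for turning them into span lengths is shown here. It assumes the coordinates are 1-based and inclusive, which this page does not state, and `span_length` is a hypothetical helper name.

```python
import re

# A "coord: START..END" field as it appears in the Alignment column.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_length(text):
    """Return the number of residues covered by the first coord field in text.

    Assumes 1-based inclusive coordinates, so length = end - start + 1.
    """
    match = COORD.search(text)
    if match is None:
        raise ValueError("no coord field found in %r" % text)
    start, end = int(match.group(1)), int(match.group(2))
    if end < start:
        raise ValueError("end precedes start in %r" % text)
    return end - start + 1


if __name__ == "__main__":
    for field in ("coord: 167..405", "coord: 91..379", "coord: 23..379"):
        print(field, "->", span_length(field), "residues")
```

Under that assumption the PFAM Glyco_transf_90 hit (91..379) spans 289 residues and the PANTHER hits (23..379) span 357.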

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CsaV3_3G017050.1  CsaV3_3G017050.1  mRNA