Csa1G044870 (gene) Cucumber (Chinese Long) v2

Name: Csa1G044870
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Multidrug resistance protein MdtK; contains IPR002528 (Multi antimicrobial extrusion protein)
Location: Chr1 : 4903764 .. 4906665 (-)
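
The location line can be turned into a feature length with a few lines of Python. This is a minimal sketch: `parse_location` is a hypothetical helper written for this page's layout, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Parse a location such as 'Chr1 : 4903764 .. 4906665 (-)'.

    Assumption: coordinates are 1-based and inclusive, so the
    length is end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)"
    match = re.match(pattern, text)
    if match is None:
        raise ValueError("unrecognised location: %r" % text)
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("Chr1 : 4903764 .. 4906665 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

The (-) strand means the gene sequence is read from the reverse complement of the chromosome over this interval.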
The following sequences are available for this feature:

Gene sequence (with intron)

TTATGCATCGTTTTCCGTTTAACCCAAAAGAATGATCTTTCTGTAATATGCATTATATATATACCGAAGAGTCTTGACCCTTATTTTGAATAAATCCGACGGTGACAACGGCGATCACCGAATAACGTGATTACAACATATCTGTACTGTTTAAAAGCTTGATTGTTGCCTAGTCCTCTGTTTCGCCGGAGATGGAGGCGACGGCGCCATTTCTCGGCGTCAACGATGGAGACTATCCTCCGGTGAAGACATTTCGGGAGTTGAAGGATGTGGTATGGAGTGAAACGGTGAAGACTTGGGCGATCTCCGGTCCGGTGATATTTCAGATCGTTTGTCAGTACGGAACGAACTCTGTTACGAATATTTTTGTGGGTCAACTTGGAGAAATAGAGCTCTCTGGAGTTTCCATTGCCATCTCTGTTATTGCAACTTTTGCTTTTGGCTTCATGGTCTGTATATTCTGTCTTTTCCTTCTTCTTTGTTTGATTATTTCTCTCTTTTTTGTGTGTAACCAAAGCCTAATCTTTTGTTTGAATGAAATCTTATCTCTCGAGGAAAATGTATACAAGTTAGTTGTTGACTGTCTAACTAAACAGTTGAAACAACAACTATAATAGAAATAGTAAAACGGTCCCCGAGTGTGTCTCGACTCCTGAGTTGGGTCTGTCATTCCACCCCGAGTTTGTATTACATCACTTTTTTAGTTTAATGGGTGATTAGAACTTCATTTTCTTATTGAGTTTTTTATTGTTTTATTGGATAATCGAAGTTTTTATGTAAGATGGGTGATTATTTCTATCTCTAATGTGGGTGTCTAAAAATTGTTGACAGTTTGGCATGGGAAGTGCAACAGAAACGCTGTGTGGGCAAGCATTTGGGGCTGGACAAATCGACATGCTGGGAGTTTATATGCAGAGATCATGGATTATAATGTTCTTATGTGCCTTAATAATCACACCAGTTTATGTTTTTACCACTCCCATTTTGAAGCTTTTGGGGCAACAAGATGATGTGGCTGAACTGGCTGGGAGTTTCTCATTGCTCATACTCCCACAACTGTTCTCCTTTGTTGTGGCTTTTCCAACCCAAAAGTTTCTTCAAGCACAAAGCAAAGTGTGGACATTGGCTTGGATTGGCTTTGGGGCCCTTTTGGCTCATGTTTTGATGCTATGGCTCTTCATTTTTCAGTTTGGTTGGGGAACTACTGGGGCTGCTTTGGCCTTGAACATCTCTGGTTGGGGGATTTCCATTTCTCAATGCATTTATGTGATTGGTTGGTGTAGAGATGCTTGGCATGGTTTCTCTTGGTTGGCTTTCAAAGATTTGTGGGGATTTGTTAAGCTCTCATTTTCCTCTGCTATTATGTTTTGTTTGGAGATTTGGTACATGAGTACTATCATTATTCTTGCTGGTCATCTTCCAAATGCTGTCATTTCTGTTGATTCTCTTTCCATTTGGTATGTTATTGGGAGTTCTTTATCAATGGATGTTTGTTTCTTATGTTCTTGCTTTTCTATGGATCTGAACTTTTCTTTGGCTTATTTCTTTCTGGTTTTCTCAGCATGAACTTGGATGGATGGGAAAATATCATTTTCATTGGAATCAATGTAGCCATGAGGTAATGAAACTTTACATCTTCTGATCTTTTAGTACTATTTAAAATCTTATAGGGCCGTTTGATGTTTTAAAATTTGCACATTTATCCAGTAACCCTGTCTTTATCAAAAAACTTCATCCAACAATCCTACACCTCAAACACAAACTTACTATCTCTATCATATTAACTTCAGTTAGTGGGTCGAACGACCCTTAGATCCTCTTCTTTTCATCTCAAAATCAATAGGATCCACTTAGGCCCAGGGTAACGAATGCGTGTTTTCTGTTATTTGTTTCAGTGTTAGGGTCTCCAATGAACTCGGAAAGGCACGGCCTCGAGCTGCAGAGTACTCTGTCTATGTGACGGTCGTACAGTCTCTTCTGCTTGGTCTCCTTTTCATGGT
CGCAATATTCTTTGCGAAGGATCATTTTGCTGTCATCTTCACAAGCAGTGTAACTGTGCAGAAATATGTTTCCAAATTAGCCTATCTTCTTGGCATAACCATGGTTCTCAACAGTGTCCAACCAGTCGTATCAGGTGAAGAACTAAAGTTCCATTCCCCCCATCCCCTTCTGTCTCTCTCTCTATATATATATTTATATATGTTGGTGAAGATATAATTTTGAATGACCAAATCCTGCGGTATTCTTGTGTTCAGGTGTGGCCATTGGAGCTGGATGGCAGACATTGGTGGCTTATATAAACTTAGGCTGCTATTACCTTTTTGGTCTCCCTCTTGGGATTATCTTAGGTTATGTAGCAAACTTTGGAGTGAAGGTATGTCAAAAAGCACCAAATCAAGATAGTATTTTCTAGATATAACTAACACACTGTCAGTTACTCACTCAAATCGATTGGTTACATGTAGGGGCTTTGGGGTGGAATGATAGCCGGGATTGCAATGCAGACGATTATGTTGCTGATTGTTCTGTACAAAACCAACTGGAAGAAAGAAGTGAGTGAAAGTAAAATGATTTTTCATTTTTAGTTGTTGTAGGGAAAGCAATGTTAATGGGGGGATTGTTGAATGATTATTATTGCAGGTAGAGGAAACTTCAGGAAGGCTGCAGAAATGGTCTGGACAAGGCAACAATAAGAGAGAAGAGACTAAAAGCTAAAAGAGATGCCATTAGGAGGAAAACTAGAAATAATTGCAACATTTTTCTTGCTTTATTTTATGTGTTTTTGGCTGCTCATTTCATTTTGAGGAAGAGGAATGCTTGAAGAAACTCGCTAGCTAATACGTGATTTTAACACAAATATTACAAACGAGATGAGTTGAATTGAGTTAATAAGTATTTAGAG

mRNA sequence

ATGGAGGCGACGGCGCCATTTCTCGGCGTCAACGATGGAGACTATCCTCCGGTGAAGACATTTCGGGAGTTGAAGGATGTGGTATGGAGTGAAACGGTGAAGACTTGGGCGATCTCCGGTCCGGTGATATTTCAGATCGTTTGTCAGTACGGAACGAACTCTGTTACGAATATTTTTGTGGGTCAACTTGGAGAAATAGAGCTCTCTGGAGTTTCCATTGCCATCTCTGTTATTGCAACTTTTGCTTTTGGCTTCATGTTTGGCATGGGAAGTGCAACAGAAACGCTGTGTGGGCAAGCATTTGGGGCTGGACAAATCGACATGCTGGGAGTTTATATGCAGAGATCATGGATTATAATGTTCTTATGTGCCTTAATAATCACACCAGTTTATGTTTTTACCACTCCCATTTTGAAGCTTTTGGGGCAACAAGATGATGTGGCTGAACTGGCTGGGAGTTTCTCATTGCTCATACTCCCACAACTGTTCTCCTTTGTTGTGGCTTTTCCAACCCAAAAGTTTCTTCAAGCACAAAGCAAAGTGTGGACATTGGCTTGGATTGGCTTTGGGGCCCTTTTGGCTCATGTTTTGATGCTATGGCTCTTCATTTTTCAGTTTGGTTGGGGAACTACTGGGGCTGCTTTGGCCTTGAACATCTCTGGTTGGGGGATTTCCATTTCTCAATGCATTTATGTGATTGGTTGGTGTAGAGATGCTTGGCATGGTTTCTCTTGGTTGGCTTTCAAAGATTTGTGGGGATTTGTTAAGCTCTCATTTTCCTCTGCTATTATGTTTTGTTTGGAGATTTGGTACATGAGTACTATCATTATTCTTGCTGGTCATCTTCCAAATGCTGTCATTTCTGTTGATTCTCTTTCCATTTGCATGAACTTGGATGGATGGGAAAATATCATTTTCATTGGAATCAATGTAGCCATGAGTGTTAGGGTCTCCAATGAACTCGGAAAGGCACGGCCTCGAGCTGCAGAGTACTCTGTCTATGTGACGGTCGTACAGTCTCTTCTGCTTGGTCTCCTTTTCATGGTCGCAATATTCTTTGCGAAGGATCATTTTGCTGTCATCTTCACAAGCAGTGTAACTGTGCAGAAATATGTTTCCAAATTAGCCTATCTTCTTGGCATAACCATGGTTCTCAACAGTGTCCAACCAGTCGTATCAGGTGTGGCCATTGGAGCTGGATGGCAGACATTGGTGGCTTATATAAACTTAGGCTGCTATTACCTTTTTGGTCTCCCTCTTGGGATTATCTTAGGTTATGTAGCAAACTTTGGAGTGAAGGGGCTTTGGGGTGGAATGATAGCCGGGATTGCAATGCAGACGATTATGTTGCTGATTGTTCTGTACAAAACCAACTGGAAGAAAGAAGTAGAGGAAACTTCAGGAAGGCTGCAGAAATGGTCTGGACAAGGCAACAATAAGAGAGAAGAGACTAAAAGCTAA

Coding sequence (CDS)

ATGGAGGCGACGGCGCCATTTCTCGGCGTCAACGATGGAGACTATCCTCCGGTGAAGACATTTCGGGAGTTGAAGGATGTGGTATGGAGTGAAACGGTGAAGACTTGGGCGATCTCCGGTCCGGTGATATTTCAGATCGTTTGTCAGTACGGAACGAACTCTGTTACGAATATTTTTGTGGGTCAACTTGGAGAAATAGAGCTCTCTGGAGTTTCCATTGCCATCTCTGTTATTGCAACTTTTGCTTTTGGCTTCATGTTTGGCATGGGAAGTGCAACAGAAACGCTGTGTGGGCAAGCATTTGGGGCTGGACAAATCGACATGCTGGGAGTTTATATGCAGAGATCATGGATTATAATGTTCTTATGTGCCTTAATAATCACACCAGTTTATGTTTTTACCACTCCCATTTTGAAGCTTTTGGGGCAACAAGATGATGTGGCTGAACTGGCTGGGAGTTTCTCATTGCTCATACTCCCACAACTGTTCTCCTTTGTTGTGGCTTTTCCAACCCAAAAGTTTCTTCAAGCACAAAGCAAAGTGTGGACATTGGCTTGGATTGGCTTTGGGGCCCTTTTGGCTCATGTTTTGATGCTATGGCTCTTCATTTTTCAGTTTGGTTGGGGAACTACTGGGGCTGCTTTGGCCTTGAACATCTCTGGTTGGGGGATTTCCATTTCTCAATGCATTTATGTGATTGGTTGGTGTAGAGATGCTTGGCATGGTTTCTCTTGGTTGGCTTTCAAAGATTTGTGGGGATTTGTTAAGCTCTCATTTTCCTCTGCTATTATGTTTTGTTTGGAGATTTGGTACATGAGTACTATCATTATTCTTGCTGGTCATCTTCCAAATGCTGTCATTTCTGTTGATTCTCTTTCCATTTGCATGAACTTGGATGGATGGGAAAATATCATTTTCATTGGAATCAATGTAGCCATGAGTGTTAGGGTCTCCAATGAACTCGGAAAGGCACGGCCTCGAGCTGCAGAGTACTCTGTCTATGTGACGGTCGTACAGTCTCTTCTGCTTGGTCTCCTTTTCATGGTCGCAATATTCTTTGCGAAGGATCATTTTGCTGTCATCTTCACAAGCAGTGTAACTGTGCAGAAATATGTTTCCAAATTAGCCTATCTTCTTGGCATAACCATGGTTCTCAACAGTGTCCAACCAGTCGTATCAGGTGTGGCCATTGGAGCTGGATGGCAGACATTGGTGGCTTATATAAACTTAGGCTGCTATTACCTTTTTGGTCTCCCTCTTGGGATTATCTTAGGTTATGTAGCAAACTTTGGAGTGAAGGGGCTTTGGGGTGGAATGATAGCCGGGATTGCAATGCAGACGATTATGTTGCTGATTGTTCTGTACAAAACCAACTGGAAGAAAGAAGTAGAGGAAACTTCAGGAAGGCTGCAGAAATGGTCTGGACAAGGCAACAATAAGAGAGAAGAGACTAAAAGCTAA

Protein sequence

MEATAPFLGVNDGDYPPVKTFRELKDVVWSETVKTWAISGPVIFQIVCQYGTNSVTNIFVGQLGEIELSGVSIAISVIATFAFGFMFGMGSATETLCGQAFGAGQIDMLGVYMQRSWIIMFLCALIITPVYVFTTPILKLLGQQDDVAELAGSFSLLILPQLFSFVVAFPTQKFLQAQSKVWTLAWIGFGALLAHVLMLWLFIFQFGWGTTGAALALNISGWGISISQCIYVIGWCRDAWHGFSWLAFKDLWGFVKLSFSSAIMFCLEIWYMSTIIILAGHLPNAVISVDSLSICMNLDGWENIIFIGINVAMSVRVSNELGKARPRAAEYSVYVTVVQSLLLGLLFMVAIFFAKDHFAVIFTSSVTVQKYVSKLAYLLGITMVLNSVQPVVSGVAIGAGWQTLVAYINLGCYYLFGLPLGIILGYVANFGVKGLWGGMIAGIAMQTIMLLIVLYKTNWKKEVEETSGRLQKWSGQGNNKREETKS*
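
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS codon by codon. The sketch below assumes the standard genetic code; `translate` is an illustrative helper, not a function of any particular library. It is applied only to the first and last few codons of the CDS listed above.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

def translate(cds):
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First eight codons of the CDS above, then its last four codons.
print(translate("ATGGAGGCGACGGCGCCATTTCTC"))
print(translate("ACTAAAAGCTAA"))
```

The two fragments should correspond to the start (MEATAPFL) and the end (TKS*) of the protein sequence shown above.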
BLAST of Csa1G044870 vs. Swiss-Prot
Match: DTX35_ARATH (Protein DETOXIFICATION 35 OS=Arabidopsis thaliana GN=DTX35 PE=2 SV=1)

HSP 1 Score: 682.9 bits (1761), Expect = 2.5e-195
Identity = 322/485 (66.39%), Positives = 393/485 (81.03%), Query Frame = 1

Query: 1   MEATAPFL---GVNDGDYPPVKTFRELKDVVWSETVKTWAISGPVIFQIVCQYGTNSVTN 60
           M+ TAP L   G  + DY P +++ ++K V+ +E+ K W I+ PV F I+CQYG +SVTN
Sbjct: 1   MDPTAPLLTHGGEVEEDYAPARSWTDVKRVLSTESAKLWMIAAPVGFNIICQYGVSSVTN 60

Query: 61  IFVGQLGEIELSGVSIAISVIATFAFGFMFGMGSATETLCGQAFGAGQIDMLGVYMQRSW 120
           IFVG +GE+ELS VSI++SVI TF+FGF+ GMGSA ETLCGQA+GAGQ++MLGVYMQRSW
Sbjct: 61  IFVGHIGEVELSAVSISLSVIGTFSFGFLLGMGSALETLCGQAYGAGQVNMLGVYMQRSW 120

Query: 121 IIMFLCALIITPVYVFTTPILKLLGQQDDVAELAGSFSLLILPQLFSFVVAFPTQKFLQA 180
           II+F+    + P+Y+F TP+L+LLGQ +++A  AG F+LL +PQLFS    FPT KFLQA
Sbjct: 121 IILFVSCFFLLPIYIFATPVLRLLGQAEEIAVPAGQFTLLTIPQLFSLAFNFPTSKFLQA 180

Query: 181 QSKVWTLAWIGFGALLAHVLMLWLFIFQFGWGTTGAALALNISGWGISISQCIYVIGWCR 240
           QSKV  +AWIGF AL  HV+MLWLFI +FGWGT GAALA NI+ WG +I+Q +YVIGWC 
Sbjct: 181 QSKVVAIAWIGFVALSLHVIMLWLFIIEFGWGTNGAALAFNITNWGTAIAQIVYVIGWCN 240

Query: 241 DAWHGFSWLAFKDLWGFVKLSFSSAIMFCLEIWYMSTIIILAGHLPNAVISVDSLSICMN 300
           + W G SWLAFK++W FV+LS +SA+M CLEIWYM +II+L G L NAVI+VDSLSICMN
Sbjct: 241 EGWTGLSWLAFKEIWAFVRLSIASAVMLCLEIWYMMSIIVLTGRLDNAVIAVDSLSICMN 300

Query: 301 LDGWENIIFIGINVAMSVRVSNELGKARPRAAEYSVYVTVVQSLLLGLLFMVAIFFAKDH 360
           ++G E ++FIGIN A+SVRVSNELG  RPRAA+YSVYVTV QSLL+GL+FMVAI  A+DH
Sbjct: 301 INGLEAMLFIGINAAISVRVSNELGLGRPRAAKYSVYVTVFQSLLIGLVFMVAIIIARDH 360

Query: 361 FAVIFTSSVTVQKYVSKLAYLLGITMVLNSVQPVVSGVAIGAGWQTLVAYINLGCYYLFG 420
           FA+IFTSS  +Q+ VSKLAYLLGITMVLNSVQPVVSGVA+G GWQ LVAYINLGCYY+FG
Sbjct: 361 FAIIFTSSKVLQRAVSKLAYLLGITMVLNSVQPVVSGVAVGGGWQGLVAYINLGCYYIFG 420

Query: 421 LPLGIILGYVANFGVKGLWGGMIAGIAMQTIMLLIVLYKTNWKKEVEETSGRLQKWSGQG 480
           LP G +LGY+ANFGV GLW GMIAG A+QT++LLIVLYKTNW KEVEET  R++KW G  
Sbjct: 421 LPFGYLLGYIANFGVMGLWSGMIAGTALQTLLLLIVLYKTNWNKEVEETMERMKKWGGSE 480

Query: 481 NNKRE 483
              ++
Sbjct: 481 TTSKD 485
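
The percentages in an HSP summary line are ratios of matching columns to alignment length. As a small sketch (the function name `percent` is only illustrative), the figures reported for the DTX35_ARATH hit above can be reproduced as follows:

```python
def percent(matches, aligned_length):
    """Return matches / aligned_length as a percentage, to two decimals."""
    if aligned_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / aligned_length, 2)

# Counts taken from the HSP summary of the DTX35_ARATH hit.
identity = percent(322, 485)
similar = percent(393, 485)
print(identity, similar)
```

Identity counts columns where both residues are the same; the second figure also counts conservative substitutions, which appear as "+" in the middle line of the alignment.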

BLAST of Csa1G044870 vs. Swiss-Prot
Match: DTX34_ARATH (Protein DETOXIFICATION 34 OS=Arabidopsis thaliana GN=DTX34 PE=2 SV=1)

HSP 1 Score: 607.1 bits (1564), Expect = 1.7e-172
Identity = 280/485 (57.73%), Positives = 374/485 (77.11%), Query Frame = 1

Query: 1   MEATAPFLG--VNDGDYPPVKTFRELKDVVWSETVKTWAISGPVIFQIVCQYGTNSVTNI 60
           + A +  LG    D D+PP+++FR+ K V   ET K W I+ P+ F I+C YG NS T+I
Sbjct: 56  IHAPSTLLGETTGDADFPPIQSFRDAKLVCVVETSKLWEIAAPIAFNILCNYGVNSFTSI 115

Query: 61  FVGQLGEIELSGVSIAISVIATFAFGFMFGMGSATETLCGQAFGAGQIDMLGVYMQRSWI 120
           FVG +G++ELS V+IA+SV++ F+FGF+ GM SA ETLCGQAFGAGQ+DMLGVYMQRSW+
Sbjct: 116 FVGHIGDLELSAVAIALSVVSNFSFGFLLGMASALETLCGQAFGAGQMDMLGVYMQRSWL 175

Query: 121 IMFLCALIITPVYVFTTPILKLLGQQDDVAELAGSFSLLILPQLFSFVVAFPTQKFLQAQ 180
           I+   ++ + P+Y++ TP+L LLGQ+ ++AE++G F+  I+PQ+F+  + FPTQKFLQ+Q
Sbjct: 176 ILLGTSVCLLPLYIYATPLLILLGQEPEIAEISGKFTTQIIPQMFALAINFPTQKFLQSQ 235

Query: 181 SKVWTLAWIGFGALLAHVLMLWLFIFQFGWGTTGAALALNISGWGISISQCIYVIGWCRD 240
           SKV  +AWIGF AL  H+ +L+LFI  F WG  GAA A ++S WGI+I+Q +YV+GWC+D
Sbjct: 236 SKVGIMAWIGFFALTLHIFILYLFINVFKWGLNGAAAAFDVSAWGIAIAQVVYVVGWCKD 295

Query: 241 AWHGFSWLAFKDLWGFVKLSFSSAIMFCLEIWYMSTIIILAGHLPNAVISVDSLSICMNL 300
            W G SWLAF+D+W F+KLSF+SA+M CLEIWY  TII+L GHL + VI+V SLSICMN+
Sbjct: 296 GWKGLSWLAFQDVWPFLKLSFASAVMLCLEIWYFMTIIVLTGHLEDPVIAVGSLSICMNI 355

Query: 301 DGWENIIFIGINVAMSVRVSNELGKARPRAAEYSVYVTVVQSLLLGLLFMVAIFFAKDHF 360
           +GWE ++FIGIN A+SVRVSNELG   PRAA+YSV VTV++SL++G++  + I   +D F
Sbjct: 356 NGWEGMLFIGINAAISVRVSNELGSGHPRAAKYSVIVTVIESLVIGVVCAIVILITRDDF 415

Query: 361 AVIFTSSVTVQKYVSKLAYLLGITMVLNSVQPVVSGVAIGAGWQTLVAYINLGCYYLFGL 420
           AVIFT S  ++K V+ LAYLLGITM+LNS+QPV+SGVA+G GWQ  VAYINL CYY FGL
Sbjct: 416 AVIFTESEEMRKAVADLAYLLGITMILNSLQPVISGVAVGGGWQAPVAYINLFCYYAFGL 475

Query: 421 PLGIILGYVANFGVKGLWGGMIAGIAMQTIMLLIVLYKTNWKKEVEETSGRLQKWSGQGN 480
           PLG +LGY  + GV+G+W GMI G ++QT++LL ++Y TNW KEVE+ S R+++W G G 
Sbjct: 476 PLGFLLGYKTSLGVQGIWIGMICGTSLQTLILLYMIYITNWNKEVEQASERMKQW-GAGY 535

Query: 481 NKREE 484
            K E+
Sbjct: 536 EKLEK 539

BLAST of Csa1G044870 vs. Swiss-Prot
Match: DTX33_ARATH (Protein DETOXIFICATION 33 OS=Arabidopsis thaliana GN=DTX33 PE=2 SV=1)

HSP 1 Score: 503.1 bits (1294), Expect = 3.5e-141
Identity = 238/480 (49.58%), Positives = 328/480 (68.33%), Query Frame = 1

Query: 2   EATAPFLGVNDGDYPPVKTFRELKDVVWS-----ETVKTWAISGPVIFQIVCQYGTNSVT 61
           + T P L   D   PP  T  +    VW+     E+ + W ++GP IF  + QY   ++T
Sbjct: 4   DKTLPLL---DPREPPELTGTKSASKVWAKEFGEESKRLWELAGPAIFTAISQYSLGALT 63

Query: 62  NIFVGQLGEIELSGVSIAISVIATFAFGFMFGMGSATETLCGQAFGAGQIDMLGVYMQRS 121
             F G+LGE+EL+ VS+  SVI+  AFG M GMGSA ETLCGQA+GAGQI M+G+YMQRS
Sbjct: 64  QTFSGRLGELELAAVSVENSVISGLAFGVMLGMGSALETLCGQAYGAGQIRMMGIYMQRS 123

Query: 122 WIIMFLCALIITPVYVFTTPILKLLGQQDDVAELAGSFSLLILPQLFSFVVAFPTQKFLQ 181
           W+I+F  AL + PVY++  PIL   G+   +++ AG F+L ++PQLF++   FP QKFLQ
Sbjct: 124 WVILFTTALFLLPVYIWAPPILSFFGEAPHISKAAGKFALWMIPQLFAYAANFPIQKFLQ 183

Query: 182 AQSKVWTLAWIGFGALLAHVLMLWLFIFQFGWGTTGAALALNISGWGISISQCIYVIGWC 241
           +Q KV  +AWI    L+ H +  WLFI  F WG  GAA+ LN S W I I Q +Y++   
Sbjct: 184 SQRKVLVMAWISGVVLVIHAVFSWLFILYFKWGLVGAAITLNTSWWLIVIGQLLYILITK 243

Query: 242 RD-AWHGFSWLAFKDLWGFVKLSFSSAIMFCLEIWYMSTIIILAGHLPNAVISVDSLSIC 301
            D AW GFS LAF+DL+GFVKLS +SA+M CLE WY+  ++++ G LPN +I VD++SIC
Sbjct: 244 SDGAWTGFSMLAFRDLYGFVKLSLASALMLCLEFWYLMVLVVVTGLLPNPLIPVDAISIC 303

Query: 302 MNLDGWENIIFIGINVAMSVRVSNELGKARPRAAEYSVYVTVVQSLLLGLLFMVAIFFAK 361
           MN++GW  +I IG N A+SVRVSNELG      A++SV V  + S L+G++ M+ +   K
Sbjct: 304 MNIEGWTAMISIGFNAAISVRVSNELGAGNAALAKFSVIVVSITSTLIGIVCMIVVLATK 363

Query: 362 DHFAVIFTSSVTVQKYVSKLAYLLGITMVLNSVQPVVSGVAIGAGWQTLVAYINLGCYYL 421
           D F  +FTSS  V    +++A LLG T++LNS+QPV+SGVA+GAGWQ LVAY+N+ CYY+
Sbjct: 364 DSFPYLFTSSEAVAAETTRIAVLLGFTVLLNSLQPVLSGVAVGAGWQALVAYVNIACYYI 423

Query: 422 FGLPLGIILGYVANFGVKGLWGGMIAGIAMQTIMLLIVLYKTNWKKEVEETSGRLQKWSG 476
            GLP G++LG+  + GV+G+WGGM+AGI +QT++L+ ++Y TNW KE E+   R+Q+W G
Sbjct: 424 IGLPAGLVLGFTLDLGVQGIWGGMVAGICLQTLILIGIIYFTNWNKEAEQAESRVQRWGG 480

BLAST of Csa1G044870 vs. Swiss-Prot
Match: DTX32_ARATH (Protein DETOXIFICATION 32 OS=Arabidopsis thaliana GN=DTX32 PE=3 SV=1)

HSP 1 Score: 480.7 bits (1236), Expect = 1.9e-134
Identity = 226/474 (47.68%), Positives = 320/474 (67.51%), Query Frame = 1

Query: 11  NDGDYPPVKTFRELKDVVWSETVKTWAISGPVIFQIVCQYGTNSVTNIFVGQLGEIELSG 70
           +D D PP+   R+      +E+ K W ++GP IF   CQY   +VT I  G +  + L+ 
Sbjct: 24  SDTDMPPISGGRDFIRQFAAESKKLWWLAGPAIFTSFCQYSLGAVTQILAGHVNTLALAA 83

Query: 71  VSIAISVIATFAFGFMFGMGSATETLCGQAFGAGQIDMLGVYMQRSWIIMFLCALIITPV 130
           VSI  SVI+ F+ G M GMGSA  TLCGQA+GAGQ++M+G+Y+QRSWII+  CAL++   
Sbjct: 84  VSIQNSVISGFSVGIMLGMGSALATLCGQAYGAGQLEMMGIYLQRSWIILNSCALLLCLF 143

Query: 131 YVFTTPILKLLGQQDDVAELAGSFSLLILPQLFSFVVAFPTQKFLQAQSKVWTLAWIGFG 190
           YVF TP+L LLGQ  ++++ AG FSL ++PQLF++ V F T KFLQAQSKV  +A I   
Sbjct: 144 YVFATPLLSLLGQSPEISKAAGKFSLWMIPQLFAYAVNFATAKFLQAQSKVIAMAVIAAT 203

Query: 191 ALLAHVLMLWLFIFQFGWGTTGAALALNISGWGISISQCIYVIGWCRD-AWHGFSWLAFK 250
            LL H L+ WL + +  WG  G A+ LN+S W I ++Q +Y+ G     AW G SW+AFK
Sbjct: 204 VLLQHTLLSWLLMLKLRWGMAGGAVVLNMSWWLIDVTQIVYICGGSSGRAWSGLSWMAFK 263

Query: 251 DLWGFVKLSFSSAIMFCLEIWYMSTIIILAGHLPNAVISVDSLSICMNLDGWENIIFIGI 310
           +L GF +LS +SA+M CLE+WY   +I+ AG+L N  +SV +LSICMN+ GW  ++  G 
Sbjct: 264 NLRGFARLSLASAVMVCLEVWYFMALILFAGYLKNPQVSVAALSICMNILGWPIMVAFGF 323

Query: 311 NVAMSVRVSNELGKARPRAAEYSVYVTVVQSLLLGLLFMVAIFFAKDHFAVIFTSSVTVQ 370
           N A+SVR SNELG   PR A++ + V ++ S+ +G++  V +   +D +  +F+    V+
Sbjct: 324 NAAVSVRESNELGAEHPRRAKFLLIVAMITSVSIGIVISVTLIVLRDKYPAMFSDDEEVR 383

Query: 371 KYVSKLAYLLGITMVLNSVQPVVSGVAIGAGWQTLVAYINLGCYYLFGLPLGIILGYVAN 430
             V +L  LL +T+V+N++QPV+SGVA+GAGWQ +VAY+N+GCYYL G+P+G++LGY   
Sbjct: 384 VLVKQLTPLLALTIVINNIQPVLSGVAVGAGWQGIVAYVNIGCYYLCGIPIGLVLGYKME 443

Query: 431 FGVKGLWGGMIAGIAMQTIMLLIVLYKTNWKKEVEETSGRLQKWSGQGNNKREE 484
            GVKG+W GM+ G  +QT +LL ++Y+TNWKKE      R++KW G  +NKREE
Sbjct: 444 LGVKGIWTGMLTGTVVQTSVLLFIIYRTNWKKEASLAEARIKKW-GDQSNKREE 496

BLAST of Csa1G044870 vs. Swiss-Prot
Match: DTX30_ARATH (Protein DETOXIFICATION 30 OS=Arabidopsis thaliana GN=DTX30 PE=2 SV=1)

HSP 1 Score: 474.9 bits (1221), Expect = 1.0e-132
Identity = 224/472 (47.46%), Positives = 315/472 (66.74%), Query Frame = 1

Query: 6   PFLGVNDGDYPPVKTFRELKDVVWSETVKTWAISGPVIFQIVCQYGTNSVTNIFVGQLGE 65
           PF  V D   PP+ T          E  K W ++GP IF  + QY   + T +F G +  
Sbjct: 22  PFSSVED--IPPITTVGGFVKEFNVEVKKLWYLAGPAIFMSITQYSLGAATQVFAGHIST 81

Query: 66  IELSGVSIAISVIATFAFGFMFGMGSATETLCGQAFGAGQIDMLGVYMQRSWIIMFLCAL 125
           I L+ VS+  SVIA F+FG M GMGSA ETLCGQAFGAG++ MLGVY+QRSW+I+ + A+
Sbjct: 82  IALAAVSVENSVIAGFSFGVMLGMGSALETLCGQAFGAGKLSMLGVYLQRSWVILNVTAV 141

Query: 126 IITPVYVFTTPILKLLGQQDDVAELAGSFSLLILPQLFSFVVAFPTQKFLQAQSKVWTLA 185
           I++ +Y+F  PIL  +GQ   ++   G FS+ ++PQ+F++ V +PT KFLQ+QSK+  +A
Sbjct: 142 ILSLLYIFAAPILAFIGQTPAISSATGIFSIYMIPQIFAYAVNYPTAKFLQSQSKIMVMA 201

Query: 186 WIGFGALLAHVLMLWLFIFQFGWGTTGAALALNISGWGISISQCIYVI-GWCRDAWHGFS 245
            I   AL+ HVL+ W  I    WGT G A+ LN S W I ++Q +Y+  G C +AW GFS
Sbjct: 202 AISAVALVLHVLLTWFVIEGLQWGTAGLAVVLNASWWFIVVAQLVYIFSGTCGEAWSGFS 261

Query: 246 WLAFKDLWGFVKLSFSSAIMFCLEIWYMSTIIILAGHLPNAVISVDSLSICMNLDGWENI 305
           W AF +LW FV+LS +SA+M CLE+WY+  +I+ AG+L NA ISV +LSICMN+ GW  +
Sbjct: 262 WEAFHNLWSFVRLSLASAVMLCLEVWYLMAVILFAGYLKNAEISVAALSICMNILGWTAM 321

Query: 306 IFIGINVAMSVRVSNELGKARPRAAEYSVYVTVVQSLLLGLLFMVAIFFAKDHFAVIFTS 365
           I IG+N A+SVRVSNELG   PR A++S+ V V+ S ++GL   +A+   +D +  +F  
Sbjct: 322 IAIGMNAAVSVRVSNELGAKHPRTAKFSLLVAVITSTVIGLAISIALLIFRDKYPSLFVG 381

Query: 366 SVTVQKYVSKLAYLLGITMVLNSVQPVVSGVAIGAGWQTLVAYINLGCYYLFGLPLGIIL 425
              V   V  L  +L +++V+N+VQPV+SGVA+GAGWQ +VAY+N+ CYY+FG+P G++L
Sbjct: 382 DEEVIIVVKDLTPILAVSIVINNVQPVLSGVAVGAGWQAVVAYVNIVCYYVFGIPFGLLL 441

Query: 426 GYVANFGVKGLWGGMIAGIAMQTIMLLIVLYKTNWKKEVEETSGRLQKWSGQ 477
           GY  NFGV G+W GM+ G  +QTI+L  ++ +TNW  E     GR+++W G+
Sbjct: 442 GYKLNFGVMGIWCGMLTGTVVQTIVLTWMICRTNWDTEAAMAEGRIREWGGE 491

BLAST of Csa1G044870 vs. TrEMBL
Match: A0A061G135_THECC (Protein DETOXIFICATION OS=Theobroma cacao GN=TCM_015227 PE=3 SV=1)

HSP 1 Score: 690.6 bits (1781), Expect = 1.3e-195
Identity = 324/467 (69.38%), Positives = 391/467 (83.73%), Query Frame = 1

Query: 10  VNDGDYPPVKTFRELKDVVWSETVKTWAISGPVIFQIVCQYGTNSVTNIFVGQLGEIELS 69
           + +GDY P ++F+E+K V W ETVK W I+GP+ FQI+CQYGT SVTNIFVG +G IELS
Sbjct: 18  LENGDYGPARSFKEVKSVFWIETVKMWKIAGPIGFQIMCQYGTMSVTNIFVGHIGNIELS 77

Query: 70  GVSIAISVIATFAFGFMFGMGSATETLCGQAFGAGQIDMLGVYMQRSWIIMFLCALIITP 129
            V+IA++VI TF+FGFM GMGSA ETLCGQAFGAGQI MLGVYMQRSWII+     II P
Sbjct: 78  AVTIALAVIGTFSFGFMLGMGSALETLCGQAFGAGQIHMLGVYMQRSWIILLSSCFIILP 137

Query: 130 VYVFTTPILKLLGQQDDVAELAGSFSLLILPQLFSFVVAFPTQKFLQAQSKVWTLAWIGF 189
            Y+F TP+LKLLGQ+D++A LAG F++LI+PQLFS  + FPTQKFLQAQSKV  LAWIGF
Sbjct: 138 FYIFATPLLKLLGQEDEIANLAGKFAILIIPQLFSLAITFPTQKFLQAQSKVNVLAWIGF 197

Query: 190 GALLAHVLMLWLFIFQFGWGTTGAALALNISGWGISISQCIYVIGWCRDAWHGFSWLAFK 249
             L+ HV +LWLF+F F WGTTGAA+A +I+ W I+++Q  YVI W  + WHGFSWLAFK
Sbjct: 198 VTLIFHVGILWLFLFVFDWGTTGAAIAYDITSWVIALAQVAYVIFWSNEGWHGFSWLAFK 257

Query: 250 DLWGFVKLSFSSAIMFCLEIWYMSTIIILAGHLPNAVISVDSLSICMNLDGWENIIFIGI 309
           ++W FV+LS SSA+M CLE+WYM ++I+L GHL NAVI+V SLSICMNL+GWE ++FIGI
Sbjct: 258 EIWAFVRLSISSALMLCLEVWYMMSMILLVGHLNNAVIAVGSLSICMNLNGWEAMLFIGI 317

Query: 310 NVAMSVRVSNELGKARPRAAEYSVYVTVVQSLLLGLLFMVAIFFAKDHFAVIFTSSVTVQ 369
           N AMSVRVSNELG   PRAA+YSVYVTV+QSLL+GLL MVAI   +DHFAVIFTSS  +Q
Sbjct: 318 NAAMSVRVSNELGLGHPRAAKYSVYVTVLQSLLIGLLCMVAIIITRDHFAVIFTSSEEMQ 377

Query: 370 KYVSKLAYLLGITMVLNSVQPVVSGVAIGAGWQTLVAYINLGCYYLFGLPLGIILGYVAN 429
           + V+ LAYLLG+TMVLNSVQPV+SGVAIG GWQTLVAYINLGCYY+FGLPLG +LGY AN
Sbjct: 378 RAVAHLAYLLGVTMVLNSVQPVISGVAIGGGWQTLVAYINLGCYYVFGLPLGFLLGYTAN 437

Query: 430 FGVKGLWGGMIAGIAMQTIMLLIVLYKTNWKKEVEETSGRLQKWSGQ 477
            GV GLWGGMIAGI +QT++LL+VL++TNW KEVE+T+ R++KW GQ
Sbjct: 438 LGVMGLWGGMIAGIGLQTLLLLLVLFRTNWNKEVEQTTERMKKWGGQ 484

BLAST of Csa1G044870 vs. TrEMBL
Match: B9RIU7_RICCO (Protein DETOXIFICATION OS=Ricinus communis GN=RCOM_1582850 PE=3 SV=1)

HSP 1 Score: 683.7 bits (1763), Expect = 1.6e-193
Identity = 314/471 (66.67%), Positives = 390/471 (82.80%), Query Frame = 1

Query: 11  NDGDYPPVKTFRELKDVVWSETVKTWAISGPVIFQIVCQYGTNSVTNIFVGQLGEIELSG 70
           +D DY PVK+F+++K V W+ETVK W I+ P++F I+CQYG NSVTNIFVG +G+ ELS 
Sbjct: 14  DDEDYTPVKSFKDIKSVFWTETVKIWKIATPIVFNIMCQYGINSVTNIFVGHIGDFELSA 73

Query: 71  VSIAISVIATFAFGFMFGMGSATETLCGQAFGAGQIDMLGVYMQRSWIIMFLCALIITPV 130
           V+I++SVI TF+FGFM GMGSA ETLCGQAFGAGQ+ MLG+YMQRSWII+++  + + P+
Sbjct: 74  VAISLSVIGTFSFGFMLGMGSALETLCGQAFGAGQVHMLGIYMQRSWIILWITCIFLLPI 133

Query: 131 YVFTTPILKLLGQQDDVAELAGSFSLLILPQLFSFVVAFPTQKFLQAQSKVWTLAWIGFG 190
           YVF TPILKLLGQ+D VA+LAG F++LI+PQLFS  V FPTQKFLQAQSKV  LAWIGF 
Sbjct: 134 YVFATPILKLLGQEDSVADLAGQFTILIIPQLFSLAVNFPTQKFLQAQSKVRVLAWIGFV 193

Query: 191 ALLAHVLMLWLFIFQFGWGTTGAALALNISGWGISISQCIYVIGWCRDAWHGFSWLAFKD 250
           A + H+ +LWL I+ FGWGT+GAA+A +I+ WG+SI+Q +YVIGWC++ W G S  AFK+
Sbjct: 194 AFILHIPLLWLLIYVFGWGTSGAAIAYDITNWGMSIAQVVYVIGWCKEGWTGLSSSAFKE 253

Query: 251 LWGFVKLSFSSAIMFCLEIWYMSTIIILAGHLPNAVISVDSLSICMNLDGWENIIFIGIN 310
           +W FV+LS +SA+M CLEIWYM +II+L GHL NAVI+V SLSICMN +GWE ++FIG+N
Sbjct: 254 IWAFVRLSLASAVMLCLEIWYMMSIIVLTGHLDNAVIAVGSLSICMNFNGWEAMLFIGVN 313

Query: 311 VAMSVRVSNELGKARPRAAEYSVYVTVVQSLLLGLLFMVAIFFAKDHFAVIFTSSVTVQK 370
            A+SVRVSNELG   PRAA+YSVYVT+ QS L+GLL MV I   KDHFA+IFT+S  +Q 
Sbjct: 314 AAISVRVSNELGSGHPRAAKYSVYVTIFQSFLIGLLSMVIILITKDHFAIIFTNSKAMQV 373

Query: 371 YVSKLAYLLGITMVLNSVQPVVSGVAIGAGWQTLVAYINLGCYYLFGLPLGIILGYVANF 430
            VSKLA+LLGITMVLNS+QPV+ GVAIG+GWQ LVAYIN+GCYY+FGLPLG  LGY    
Sbjct: 374 AVSKLAFLLGITMVLNSIQPVIGGVAIGSGWQALVAYINIGCYYIFGLPLGFFLGYKTKL 433

Query: 431 GVKGLWGGMIAGIAMQTIMLLIVLYKTNWKKEVEETSGRLQKWSGQGNNKR 482
           GV GLWGGMIAG A+QT++LLIVLY+TNW KEVE+TS R++KW GQ N ++
Sbjct: 434 GVAGLWGGMIAGTALQTLLLLIVLYRTNWNKEVEQTSERVRKWGGQENTEK 484

BLAST of Csa1G044870 vs. TrEMBL
Match: F4JTB2_ARATH (Protein DETOXIFICATION OS=Arabidopsis thaliana GN=DTX35 PE=3 SV=1)

HSP 1 Score: 682.9 bits (1761), Expect = 2.8e-193
Identity = 322/485 (66.39%), Positives = 395/485 (81.44%), Query Frame = 1

Query: 1   MEATAPFL---GVNDGDYPPVKTFRELKDVVWSETVKTWAISGPVIFQIVCQYGTNSVTN 60
           M+ TAP L   G  + DY P +++ ++K V+ +E+ K W I+ PV F I+CQYG +SVTN
Sbjct: 1   MDPTAPLLTHGGEVEEDYAPARSWTDVKRVLSTESAKLWMIAAPVGFNIICQYGVSSVTN 60

Query: 61  IFVGQLGEIELSGVSIAISVIATFAFGFMFGMGSATETLCGQAFGAGQIDMLGVYMQRSW 120
           IFVG +GE+ELS VSI++SVI TF+FGF+ GMGSA ETLCGQA+GAGQ++MLGVYMQRSW
Sbjct: 61  IFVGHIGEVELSAVSISLSVIGTFSFGFLLGMGSALETLCGQAYGAGQVNMLGVYMQRSW 120

Query: 121 IIMFLCALIITPVYVFTTPILKLLGQQDDVAELAGSFSLLILPQLFSFVVAFPTQKFLQA 180
           II+F+    + P+Y+F TP+L+LLGQ +++A  AG F+LL +PQLFS    FPT KFLQA
Sbjct: 121 IILFVSCFFLLPIYIFATPVLRLLGQAEEIAVPAGQFTLLTIPQLFSLAFNFPTSKFLQA 180

Query: 181 QSKVWTLAWIGFGALLAHVLMLWLFIFQFGWGTTGAALALNISGWGISISQCIYVIGWCR 240
           QSKV  +AWIGF AL  HV+MLWLFI +FGWGT GAALA NI+ WG +I+Q +YVIGWC 
Sbjct: 181 QSKVVAIAWIGFVALSLHVIMLWLFIIEFGWGTNGAALAFNITNWGTAIAQIVYVIGWCN 240

Query: 241 DAWHGFSWLAFKDLWGFVKLSFSSAIMFCLEIWYMSTIIILAGHLPNAVISVDSLSICMN 300
           + W G SWLAFK++W FV+LS +SA+M CLEIWYM +II+L G L NAVI+VDSLSICMN
Sbjct: 241 EGWTGLSWLAFKEIWAFVRLSIASAVMLCLEIWYMMSIIVLTGRLDNAVIAVDSLSICMN 300

Query: 301 LDGWENIIFIGINVAMSVRVSNELGKARPRAAEYSVYVTVVQSLLLGLLFMVAIFFAKDH 360
           ++G E ++FIGIN A+SVRVSNELG  RPRAA+YSVYVTV QSLL+GL+FMVAI  A+DH
Sbjct: 301 INGLEAMLFIGINAAISVRVSNELGLGRPRAAKYSVYVTVFQSLLIGLVFMVAIIIARDH 360

Query: 361 FAVIFTSSVTVQKYVSKLAYLLGITMVLNSVQPVVSGVAIGAGWQTLVAYINLGCYYLFG 420
           FA+IFTSS  +Q+ VSKLAYLLGITMVLNSVQPVVSGVA+G GWQ LVAYINLGCYY+FG
Sbjct: 361 FAIIFTSSKVLQRAVSKLAYLLGITMVLNSVQPVVSGVAVGGGWQGLVAYINLGCYYIFG 420

Query: 421 LPLGIILGYVANFGVKGLWGGMIAGIAMQTIMLLIVLYKTNWKKEVEETSGRLQKWSGQG 480
           LP G +LGY+ANFGV GLW GMIAG A+QT++LLIVLYKTNW KEVEET  R++KW G  
Sbjct: 421 LPFGYLLGYIANFGVMGLWSGMIAGTALQTLLLLIVLYKTNWNKEVEETMERMKKWGGSE 480

Query: 481 NNKRE 483
              ++
Sbjct: 481 TTSKD 485

BLAST of Csa1G044870 vs. TrEMBL
Match: A0A061G262_THECC (Protein DETOXIFICATION OS=Theobroma cacao GN=TCM_015227 PE=3 SV=1)

HSP 1 Score: 681.4 bits (1757), Expect = 8.1e-193
Identity = 324/480 (67.50%), Positives = 391/480 (81.46%), Query Frame = 1

Query: 10  VNDGDYPPVKTFRELKDVVWSETVKTWAISGPVIFQIVCQYGTNSVTNIFVGQLGEIELS 69
           + +GDY P ++F+E+K V W ETVK W I+GP+ FQI+CQYGT SVTNIFVG +G IELS
Sbjct: 18  LENGDYGPARSFKEVKSVFWIETVKMWKIAGPIGFQIMCQYGTMSVTNIFVGHIGNIELS 77

Query: 70  GVSIAISVIATFAFGFM-------------FGMGSATETLCGQAFGAGQIDMLGVYMQRS 129
            V+IA++VI TF+FGFM              GMGSA ETLCGQAFGAGQI MLGVYMQRS
Sbjct: 78  AVTIALAVIGTFSFGFMENCCCVDFTLSIMLGMGSALETLCGQAFGAGQIHMLGVYMQRS 137

Query: 130 WIIMFLCALIITPVYVFTTPILKLLGQQDDVAELAGSFSLLILPQLFSFVVAFPTQKFLQ 189
           WII+     II P Y+F TP+LKLLGQ+D++A LAG F++LI+PQLFS  + FPTQKFLQ
Sbjct: 138 WIILLSSCFIILPFYIFATPLLKLLGQEDEIANLAGKFAILIIPQLFSLAITFPTQKFLQ 197

Query: 190 AQSKVWTLAWIGFGALLAHVLMLWLFIFQFGWGTTGAALALNISGWGISISQCIYVIGWC 249
           AQSKV  LAWIGF  L+ HV +LWLF+F F WGTTGAA+A +I+ W I+++Q  YVI W 
Sbjct: 198 AQSKVNVLAWIGFVTLIFHVGILWLFLFVFDWGTTGAAIAYDITSWVIALAQVAYVIFWS 257

Query: 250 RDAWHGFSWLAFKDLWGFVKLSFSSAIMFCLEIWYMSTIIILAGHLPNAVISVDSLSICM 309
            + WHGFSWLAFK++W FV+LS SSA+M CLE+WYM ++I+L GHL NAVI+V SLSICM
Sbjct: 258 NEGWHGFSWLAFKEIWAFVRLSISSALMLCLEVWYMMSMILLVGHLNNAVIAVGSLSICM 317

Query: 310 NLDGWENIIFIGINVAMSVRVSNELGKARPRAAEYSVYVTVVQSLLLGLLFMVAIFFAKD 369
           NL+GWE ++FIGIN AMSVRVSNELG   PRAA+YSVYVTV+QSLL+GLL MVAI   +D
Sbjct: 318 NLNGWEAMLFIGINAAMSVRVSNELGLGHPRAAKYSVYVTVLQSLLIGLLCMVAIIITRD 377

Query: 370 HFAVIFTSSVTVQKYVSKLAYLLGITMVLNSVQPVVSGVAIGAGWQTLVAYINLGCYYLF 429
           HFAVIFTSS  +Q+ V+ LAYLLG+TMVLNSVQPV+SGVAIG GWQTLVAYINLGCYY+F
Sbjct: 378 HFAVIFTSSEEMQRAVAHLAYLLGVTMVLNSVQPVISGVAIGGGWQTLVAYINLGCYYVF 437

Query: 430 GLPLGIILGYVANFGVKGLWGGMIAGIAMQTIMLLIVLYKTNWKKEVEETSGRLQKWSGQ 477
           GLPLG +LGY AN GV GLWGGMIAGI +QT++LL+VL++TNW KEVE+T+ R++KW GQ
Sbjct: 438 GLPLGFLLGYTANLGVMGLWGGMIAGIGLQTLLLLLVLFRTNWNKEVEQTTERMKKWGGQ 497

BLAST of Csa1G044870 vs. TrEMBL
Match: D7MFY8_ARALL (Protein DETOXIFICATION OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_914011 PE=3 SV=1)

HSP 1 Score: 679.5 bits (1752), Expect = 3.1e-192
Identity = 323/482 (67.01%), Positives = 392/482 (81.33%), Query Frame = 1

Query: 4   TAPFL---GVNDGDYPPVKTFRELKDVVWSETVKTWAISGPVIFQIVCQYGTNSVTNIFV 63
           TAP L   G  + DY P +++ ++K V+ +E+ K W I+ PV F I+CQYG +SVTNIFV
Sbjct: 5   TAPLLTHGGEVEEDYAPARSWIDVKRVLSTESAKMWMIAAPVGFNIICQYGVSSVTNIFV 64

Query: 64  GQLGEIELSGVSIAISVIATFAFGFMFGMGSATETLCGQAFGAGQIDMLGVYMQRSWIIM 123
           G +GE+ELS VSI++SVI TF+FGF+ GMGSA ETLCGQA+GAGQ++MLGVYMQRSWII+
Sbjct: 65  GHIGEVELSAVSISLSVIGTFSFGFLLGMGSALETLCGQAYGAGQVNMLGVYMQRSWIIL 124

Query: 124 FLCALIITPVYVFTTPILKLLGQQDDVAELAGSFSLLILPQLFSFVVAFPTQKFLQAQSK 183
           F+  L I P+Y+F TP+L+LLGQ +++A  AG F+LL +PQLFS    FPT KFLQAQSK
Sbjct: 125 FVSCLFILPIYIFATPVLRLLGQAEEIAVPAGQFTLLTIPQLFSLAFNFPTSKFLQAQSK 184

Query: 184 VWTLAWIGFGALLAHVLMLWLFIFQFGWGTTGAALALNISGWGISISQCIYVIGWCRDAW 243
           V  +AWIGF AL  HV+MLWLFI  FGWGT GAALA NI+ WG +I+Q +YVIGWC + W
Sbjct: 185 VVAIAWIGFVALFLHVIMLWLFIIVFGWGTNGAALAFNITNWGTAIAQIVYVIGWCNEGW 244

Query: 244 HGFSWLAFKDLWGFVKLSFSSAIMFCLEIWYMSTIIILAGHLPNAVISVDSLSICMNLDG 303
            G SWLAFK++W FV+LS +SA+M CLEIWYM +II+L G L NAVI+VDSLSICMN++G
Sbjct: 245 TGLSWLAFKEIWAFVRLSIASAVMLCLEIWYMMSIIVLTGRLDNAVIAVDSLSICMNING 304

Query: 304 WENIIFIGINVAMSVRVSNELGKARPRAAEYSVYVTVVQSLLLGLLFMVAIFFAKDHFAV 363
            E ++FIGIN A+SVRVSNELG  RPRAA+YSVYVTV QSLL+GL+FMVAI  A+DHFA+
Sbjct: 305 LEAMLFIGINAAISVRVSNELGLGRPRAAKYSVYVTVFQSLLIGLVFMVAIIIARDHFAI 364

Query: 364 IFTSSVTVQKYVSKLAYLLGITMVLNSVQPVVSGVAIGAGWQTLVAYINLGCYYLFGLPL 423
           IFTSS  +Q+ VSKLAYLLGITMVLNSVQPVVSGVA+G GWQ LVAYINLGCYY+FGLP 
Sbjct: 365 IFTSSKVLQRAVSKLAYLLGITMVLNSVQPVVSGVAVGGGWQGLVAYINLGCYYIFGLPF 424

Query: 424 GIILGYVANFGVKGLWGGMIAGIAMQTIMLLIVLYKTNWKKEVEETSGRLQKWSGQGNNK 483
           G +LGY ANFGV GLW GMIAG A+QT++LLIVLYKTNW KEVEET  R++KW G     
Sbjct: 425 GYLLGYKANFGVMGLWSGMIAGTALQTLLLLIVLYKTNWNKEVEETMERMKKWGGSETTS 484

BLAST of Csa1G044870 vs. TAIR10
Match: AT4G25640.2 (AT4G25640.2 detoxifying efflux carrier 35)

HSP 1 Score: 682.9 bits (1761), Expect = 1.4e-196
Identity = 322/485 (66.39%), Positives = 393/485 (81.03%), Query Frame = 1

Query: 1   MEATAPFL---GVNDGDYPPVKTFRELKDVVWSETVKTWAISGPVIFQIVCQYGTNSVTN 60
           M+ TAP L   G  + DY P +++ ++K V+ +E+ K W I+ PV F I+CQYG +SVTN
Sbjct: 1   MDPTAPLLTHGGEVEEDYAPARSWTDVKRVLSTESAKLWMIAAPVGFNIICQYGVSSVTN 60

Query: 61  IFVGQLGEIELSGVSIAISVIATFAFGFMFGMGSATETLCGQAFGAGQIDMLGVYMQRSW 120
           IFVG +GE+ELS VSI++SVI TF+FGF+ GMGSA ETLCGQA+GAGQ++MLGVYMQRSW
Sbjct: 61  IFVGHIGEVELSAVSISLSVIGTFSFGFLLGMGSALETLCGQAYGAGQVNMLGVYMQRSW 120

Query: 121 IIMFLCALIITPVYVFTTPILKLLGQQDDVAELAGSFSLLILPQLFSFVVAFPTQKFLQA 180
           II+F+    + P+Y+F TP+L+LLGQ +++A  AG F+LL +PQLFS    FPT KFLQA
Sbjct: 121 IILFVSCFFLLPIYIFATPVLRLLGQAEEIAVPAGQFTLLTIPQLFSLAFNFPTSKFLQA 180

Query: 181 QSKVWTLAWIGFGALLAHVLMLWLFIFQFGWGTTGAALALNISGWGISISQCIYVIGWCR 240
           QSKV  +AWIGF AL  HV+MLWLFI +FGWGT GAALA NI+ WG +I+Q +YVIGWC 
Sbjct: 181 QSKVVAIAWIGFVALSLHVIMLWLFIIEFGWGTNGAALAFNITNWGTAIAQIVYVIGWCN 240

Query: 241 DAWHGFSWLAFKDLWGFVKLSFSSAIMFCLEIWYMSTIIILAGHLPNAVISVDSLSICMN 300
           + W G SWLAFK++W FV+LS +SA+M CLEIWYM +II+L G L NAVI+VDSLSICMN
Sbjct: 241 EGWTGLSWLAFKEIWAFVRLSIASAVMLCLEIWYMMSIIVLTGRLDNAVIAVDSLSICMN 300

Query: 301 LDGWENIIFIGINVAMSVRVSNELGKARPRAAEYSVYVTVVQSLLLGLLFMVAIFFAKDH 360
           ++G E ++FIGIN A+SVRVSNELG  RPRAA+YSVYVTV QSLL+GL+FMVAI  A+DH
Sbjct: 301 INGLEAMLFIGINAAISVRVSNELGLGRPRAAKYSVYVTVFQSLLIGLVFMVAIIIARDH 360

Query: 361 FAVIFTSSVTVQKYVSKLAYLLGITMVLNSVQPVVSGVAIGAGWQTLVAYINLGCYYLFG 420
           FA+IFTSS  +Q+ VSKLAYLLGITMVLNSVQPVVSGVA+G GWQ LVAYINLGCYY+FG
Sbjct: 361 FAIIFTSSKVLQRAVSKLAYLLGITMVLNSVQPVVSGVAVGGGWQGLVAYINLGCYYIFG 420

Query: 421 LPLGIILGYVANFGVKGLWGGMIAGIAMQTIMLLIVLYKTNWKKEVEETSGRLQKWSGQG 480
           LP G +LGY+ANFGV GLW GMIAG A+QT++LLIVLYKTNW KEVEET  R++KW G  
Sbjct: 421 LPFGYLLGYIANFGVMGLWSGMIAGTALQTLLLLIVLYKTNWNKEVEETMERMKKWGGSE 480

Query: 481 NNKRE 483
              ++
Sbjct: 481 TTSKD 485

BLAST of Csa1G044870 vs. TAIR10
Match: AT4G00350.1 (AT4G00350.1 MATE efflux family protein)

HSP 1 Score: 607.1 bits (1564), Expect = 9.8e-174
Identity = 280/485 (57.73%), Positives = 374/485 (77.11%), Query Frame = 1

Query: 1   MEATAPFLG--VNDGDYPPVKTFRELKDVVWSETVKTWAISGPVIFQIVCQYGTNSVTNI 60
           + A +  LG    D D+PP+++FR+ K V   ET K W I+ P+ F I+C YG NS T+I
Sbjct: 56  IHAPSTLLGETTGDADFPPIQSFRDAKLVCVVETSKLWEIAAPIAFNILCNYGVNSFTSI 115

Query: 61  FVGQLGEIELSGVSIAISVIATFAFGFMFGMGSATETLCGQAFGAGQIDMLGVYMQRSWI 120
           FVG +G++ELS V+IA+SV++ F+FGF+ GM SA ETLCGQAFGAGQ+DMLGVYMQRSW+
Sbjct: 116 FVGHIGDLELSAVAIALSVVSNFSFGFLLGMASALETLCGQAFGAGQMDMLGVYMQRSWL 175

Query: 121 IMFLCALIITPVYVFTTPILKLLGQQDDVAELAGSFSLLILPQLFSFVVAFPTQKFLQAQ 180
           I+   ++ + P+Y++ TP+L LLGQ+ ++AE++G F+  I+PQ+F+  + FPTQKFLQ+Q
Sbjct: 176 ILLGTSVCLLPLYIYATPLLILLGQEPEIAEISGKFTTQIIPQMFALAINFPTQKFLQSQ 235

Query: 181 SKVWTLAWIGFGALLAHVLMLWLFIFQFGWGTTGAALALNISGWGISISQCIYVIGWCRD 240
           SKV  +AWIGF AL  H+ +L+LFI  F WG  GAA A ++S WGI+I+Q +YV+GWC+D
Sbjct: 236 SKVGIMAWIGFFALTLHIFILYLFINVFKWGLNGAAAAFDVSAWGIAIAQVVYVVGWCKD 295

Query: 241 AWHGFSWLAFKDLWGFVKLSFSSAIMFCLEIWYMSTIIILAGHLPNAVISVDSLSICMNL 300
            W G SWLAF+D+W F+KLSF+SA+M CLEIWY  TII+L GHL + VI+V SLSICMN+
Sbjct: 296 GWKGLSWLAFQDVWPFLKLSFASAVMLCLEIWYFMTIIVLTGHLEDPVIAVGSLSICMNI 355

Query: 301 DGWENIIFIGINVAMSVRVSNELGKARPRAAEYSVYVTVVQSLLLGLLFMVAIFFAKDHF 360
           +GWE ++FIGIN A+SVRVSNELG   PRAA+YSV VTV++SL++G++  + I   +D F
Sbjct: 356 NGWEGMLFIGINAAISVRVSNELGSGHPRAAKYSVIVTVIESLVIGVVCAIVILITRDDF 415

Query: 361 AVIFTSSVTVQKYVSKLAYLLGITMVLNSVQPVVSGVAIGAGWQTLVAYINLGCYYLFGL 420
           AVIFT S  ++K V+ LAYLLGITM+LNS+QPV+SGVA+G GWQ  VAYINL CYY FGL
Sbjct: 416 AVIFTESEEMRKAVADLAYLLGITMILNSLQPVISGVAVGGGWQAPVAYINLFCYYAFGL 475

Query: 421 PLGIILGYVANFGVKGLWGGMIAGIAMQTIMLLIVLYKTNWKKEVEETSGRLQKWSGQGN 480
           PLG +LGY  + GV+G+W GMI G ++QT++LL ++Y TNW KEVE+ S R+++W G G 
Sbjct: 476 PLGFLLGYKTSLGVQGIWIGMICGTSLQTLILLYMIYITNWNKEVEQASERMKQW-GAGY 535

Query: 481 NKREE 484
            K E+
Sbjct: 536 EKLEK 539

BLAST of Csa1G044870 vs. TAIR10
Match: AT1G47530.1 (AT1G47530.1 MATE efflux family protein)

HSP 1 Score: 503.1 bits (1294), Expect = 2.0e-142
Identity = 238/480 (49.58%), Positives = 328/480 (68.33%), Query Frame = 1

Query: 2   EATAPFLGVNDGDYPPVKTFRELKDVVWS-----ETVKTWAISGPVIFQIVCQYGTNSVT 61
           + T P L   D   PP  T  +    VW+     E+ + W ++GP IF  + QY   ++T
Sbjct: 4   DKTLPLL---DPREPPELTGTKSASKVWAKEFGEESKRLWELAGPAIFTAISQYSLGALT 63

Query: 62  NIFVGQLGEIELSGVSIAISVIATFAFGFMFGMGSATETLCGQAFGAGQIDMLGVYMQRS 121
             F G+LGE+EL+ VS+  SVI+  AFG M GMGSA ETLCGQA+GAGQI M+G+YMQRS
Sbjct: 64  QTFSGRLGELELAAVSVENSVISGLAFGVMLGMGSALETLCGQAYGAGQIRMMGIYMQRS 123

Query: 122 WIIMFLCALIITPVYVFTTPILKLLGQQDDVAELAGSFSLLILPQLFSFVVAFPTQKFLQ 181
           W+I+F  AL + PVY++  PIL   G+   +++ AG F+L ++PQLF++   FP QKFLQ
Sbjct: 124 WVILFTTALFLLPVYIWAPPILSFFGEAPHISKAAGKFALWMIPQLFAYAANFPIQKFLQ 183

Query: 182 AQSKVWTLAWIGFGALLAHVLMLWLFIFQFGWGTTGAALALNISGWGISISQCIYVIGWC 241
           +Q KV  +AWI    L+ H +  WLFI  F WG  GAA+ LN S W I I Q +Y++   
Sbjct: 184 SQRKVLVMAWISGVVLVIHAVFSWLFILYFKWGLVGAAITLNTSWWLIVIGQLLYILITK 243

Query: 242 RD-AWHGFSWLAFKDLWGFVKLSFSSAIMFCLEIWYMSTIIILAGHLPNAVISVDSLSIC 301
            D AW GFS LAF+DL+GFVKLS +SA+M CLE WY+  ++++ G LPN +I VD++SIC
Sbjct: 244 SDGAWTGFSMLAFRDLYGFVKLSLASALMLCLEFWYLMVLVVVTGLLPNPLIPVDAISIC 303

Query: 302 MNLDGWENIIFIGINVAMSVRVSNELGKARPRAAEYSVYVTVVQSLLLGLLFMVAIFFAK 361
           MN++GW  +I IG N A+SVRVSNELG      A++SV V  + S L+G++ M+ +   K
Sbjct: 304 MNIEGWTAMISIGFNAAISVRVSNELGAGNAALAKFSVIVVSITSTLIGIVCMIVVLATK 363

Query: 362 DHFAVIFTSSVTVQKYVSKLAYLLGITMVLNSVQPVVSGVAIGAGWQTLVAYINLGCYYL 421
           D F  +FTSS  V    +++A LLG T++LNS+QPV+SGVA+GAGWQ LVAY+N+ CYY+
Sbjct: 364 DSFPYLFTSSEAVAAETTRIAVLLGFTVLLNSLQPVLSGVAVGAGWQALVAYVNIACYYI 423

Query: 422 FGLPLGIILGYVANFGVKGLWGGMIAGIAMQTIMLLIVLYKTNWKKEVEETSGRLQKWSG 476
            GLP G++LG+  + GV+G+WGGM+AGI +QT++L+ ++Y TNW KE E+   R+Q+W G
Sbjct: 424 IGLPAGLVLGFTLDLGVQGIWGGMVAGICLQTLILIGIIYFTNWNKEAEQAESRVQRWGG 480

BLAST of Csa1G044870 vs. TAIR10
Match: AT1G23300.1 (AT1G23300.1 MATE efflux family protein)

HSP 1 Score: 480.7 bits (1236), Expect = 1.1e-135
Identity = 226/474 (47.68%), Positives = 320/474 (67.51%), Query Frame = 1

Query: 11  NDGDYPPVKTFRELKDVVWSETVKTWAISGPVIFQIVCQYGTNSVTNIFVGQLGEIELSG 70
           +D D PP+   R+      +E+ K W ++GP IF   CQY   +VT I  G +  + L+ 
Sbjct: 24  SDTDMPPISGGRDFIRQFAAESKKLWWLAGPAIFTSFCQYSLGAVTQILAGHVNTLALAA 83

Query: 71  VSIAISVIATFAFGFMFGMGSATETLCGQAFGAGQIDMLGVYMQRSWIIMFLCALIITPV 130
           VSI  SVI+ F+ G M GMGSA  TLCGQA+GAGQ++M+G+Y+QRSWII+  CAL++   
Sbjct: 84  VSIQNSVISGFSVGIMLGMGSALATLCGQAYGAGQLEMMGIYLQRSWIILNSCALLLCLF 143

Query: 131 YVFTTPILKLLGQQDDVAELAGSFSLLILPQLFSFVVAFPTQKFLQAQSKVWTLAWIGFG 190
           YVF TP+L LLGQ  ++++ AG FSL ++PQLF++ V F T KFLQAQSKV  +A I   
Sbjct: 144 YVFATPLLSLLGQSPEISKAAGKFSLWMIPQLFAYAVNFATAKFLQAQSKVIAMAVIAAT 203

Query: 191 ALLAHVLMLWLFIFQFGWGTTGAALALNISGWGISISQCIYVIGWCRD-AWHGFSWLAFK 250
            LL H L+ WL + +  WG  G A+ LN+S W I ++Q +Y+ G     AW G SW+AFK
Sbjct: 204 VLLQHTLLSWLLMLKLRWGMAGGAVVLNMSWWLIDVTQIVYICGGSSGRAWSGLSWMAFK 263

Query: 251 DLWGFVKLSFSSAIMFCLEIWYMSTIIILAGHLPNAVISVDSLSICMNLDGWENIIFIGI 310
           +L GF +LS +SA+M CLE+WY   +I+ AG+L N  +SV +LSICMN+ GW  ++  G 
Sbjct: 264 NLRGFARLSLASAVMVCLEVWYFMALILFAGYLKNPQVSVAALSICMNILGWPIMVAFGF 323

Query: 311 NVAMSVRVSNELGKARPRAAEYSVYVTVVQSLLLGLLFMVAIFFAKDHFAVIFTSSVTVQ 370
           N A+SVR SNELG   PR A++ + V ++ S+ +G++  V +   +D +  +F+    V+
Sbjct: 324 NAAVSVRESNELGAEHPRRAKFLLIVAMITSVSIGIVISVTLIVLRDKYPAMFSDDEEVR 383

Query: 371 KYVSKLAYLLGITMVLNSVQPVVSGVAIGAGWQTLVAYINLGCYYLFGLPLGIILGYVAN 430
             V +L  LL +T+V+N++QPV+SGVA+GAGWQ +VAY+N+GCYYL G+P+G++LGY   
Sbjct: 384 VLVKQLTPLLALTIVINNIQPVLSGVAVGAGWQGIVAYVNIGCYYLCGIPIGLVLGYKME 443

Query: 431 FGVKGLWGGMIAGIAMQTIMLLIVLYKTNWKKEVEETSGRLQKWSGQGNNKREE 484
            GVKG+W GM+ G  +QT +LL ++Y+TNWKKE      R++KW G  +NKREE
Sbjct: 444 LGVKGIWTGMLTGTVVQTSVLLFIIYRTNWKKEASLAEARIKKW-GDQSNKREE 496

BLAST of Csa1G044870 vs. TAIR10
Match: AT5G38030.1 (AT5G38030.1 MATE efflux family protein)

HSP 1 Score: 474.9 bits (1221), Expect = 5.8e-134
Identity = 224/472 (47.46%), Positives = 315/472 (66.74%), Query Frame = 1

Query: 6   PFLGVNDGDYPPVKTFRELKDVVWSETVKTWAISGPVIFQIVCQYGTNSVTNIFVGQLGE 65
           PF  V D   PP+ T          E  K W ++GP IF  + QY   + T +F G +  
Sbjct: 22  PFSSVED--IPPITTVGGFVKEFNVEVKKLWYLAGPAIFMSITQYSLGAATQVFAGHIST 81

Query: 66  IELSGVSIAISVIATFAFGFMFGMGSATETLCGQAFGAGQIDMLGVYMQRSWIIMFLCAL 125
           I L+ VS+  SVIA F+FG M GMGSA ETLCGQAFGAG++ MLGVY+QRSW+I+ + A+
Sbjct: 82  IALAAVSVENSVIAGFSFGVMLGMGSALETLCGQAFGAGKLSMLGVYLQRSWVILNVTAV 141

Query: 126 IITPVYVFTTPILKLLGQQDDVAELAGSFSLLILPQLFSFVVAFPTQKFLQAQSKVWTLA 185
           I++ +Y+F  PIL  +GQ   ++   G FS+ ++PQ+F++ V +PT KFLQ+QSK+  +A
Sbjct: 142 ILSLLYIFAAPILAFIGQTPAISSATGIFSIYMIPQIFAYAVNYPTAKFLQSQSKIMVMA 201

Query: 186 WIGFGALLAHVLMLWLFIFQFGWGTTGAALALNISGWGISISQCIYVI-GWCRDAWHGFS 245
            I   AL+ HVL+ W  I    WGT G A+ LN S W I ++Q +Y+  G C +AW GFS
Sbjct: 202 AISAVALVLHVLLTWFVIEGLQWGTAGLAVVLNASWWFIVVAQLVYIFSGTCGEAWSGFS 261

Query: 246 WLAFKDLWGFVKLSFSSAIMFCLEIWYMSTIIILAGHLPNAVISVDSLSICMNLDGWENI 305
           W AF +LW FV+LS +SA+M CLE+WY+  +I+ AG+L NA ISV +LSICMN+ GW  +
Sbjct: 262 WEAFHNLWSFVRLSLASAVMLCLEVWYLMAVILFAGYLKNAEISVAALSICMNILGWTAM 321

Query: 306 IFIGINVAMSVRVSNELGKARPRAAEYSVYVTVVQSLLLGLLFMVAIFFAKDHFAVIFTS 365
           I IG+N A+SVRVSNELG   PR A++S+ V V+ S ++GL   +A+   +D +  +F  
Sbjct: 322 IAIGMNAAVSVRVSNELGAKHPRTAKFSLLVAVITSTVIGLAISIALLIFRDKYPSLFVG 381

Query: 366 SVTVQKYVSKLAYLLGITMVLNSVQPVVSGVAIGAGWQTLVAYINLGCYYLFGLPLGIIL 425
              V   V  L  +L +++V+N+VQPV+SGVA+GAGWQ +VAY+N+ CYY+FG+P G++L
Sbjct: 382 DEEVIIVVKDLTPILAVSIVINNVQPVLSGVAVGAGWQAVVAYVNIVCYYVFGIPFGLLL 441

Query: 426 GYVANFGVKGLWGGMIAGIAMQTIMLLIVLYKTNWKKEVEETSGRLQKWSGQ 477
           GY  NFGV G+W GM+ G  +QTI+L  ++ +TNW  E     GR+++W G+
Sbjct: 442 GYKLNFGVMGIWCGMLTGTVVQTIVLTWMICRTNWDTEAAMAEGRIREWGGE 491

BLAST of Csa1G044870 vs. NCBI nr
Match: gi|659067038|ref|XP_008437344.1| (PREDICTED: protein TRANSPARENT TESTA 12 [Cucumis melo])

HSP 1 Score: 931.8 bits (2407), Expect = 4.9e-268
Identity = 459/484 (94.83%), Positives = 471/484 (97.31%), Query Frame = 1

Query: 1   MEATAPFLGVNDGDYPPVKTFRELKDVVWSETVKTWAISGPVIFQIVCQYGTNSVTNIFV 60
           MEA AP LGV DGDY PVKTFRELKD+VWSETVKTWAISGPVIFQIVCQYGTNSVTNIFV
Sbjct: 1   MEAAAPLLGVEDGDYAPVKTFRELKDMVWSETVKTWAISGPVIFQIVCQYGTNSVTNIFV 60

Query: 61  GQLGEIELSGVSIAISVIATFAFGFMFGMGSATETLCGQAFGAGQIDMLGVYMQRSWIIM 120
           GQLGEIELSGVSIAISVIATFAFGFMFGMGSATETLCGQAFGAGQI MLGVYMQRSWIIM
Sbjct: 61  GQLGEIELSGVSIAISVIATFAFGFMFGMGSATETLCGQAFGAGQIHMLGVYMQRSWIIM 120

Query: 121 FLCALIITPVYVFTTPILKLLGQQDDVAELAGSFSLLILPQLFSFVVAFPTQKFLQAQSK 180
           F+CALIITP+YVF TPILKLLGQQDDVAELAGSFSLLILPQLFSFVVAFPTQKFLQAQSK
Sbjct: 121 FICALIITPIYVFATPILKLLGQQDDVAELAGSFSLLILPQLFSFVVAFPTQKFLQAQSK 180

Query: 181 VWTLAWIGFGALLAHVLMLWLFIFQFGWGTTGAALALNISGWGISISQCIYVIGWCRDAW 240
           VWTLAWIGFGALL HVLMLWLFIFQFGWGTTGAALALNISGWGISI+QCIYV+GWCRDAW
Sbjct: 181 VWTLAWIGFGALLIHVLMLWLFIFQFGWGTTGAALALNISGWGISIAQCIYVMGWCRDAW 240

Query: 241 HGFSWLAFKDLWGFVKLSFSSAIMFCLEIWYMSTIIILAGHLPNAVISVDSLSICMNLDG 300
           HGFSWLAF+DLWGFVKLSFSSAIMFCLEIWYMSTIIILAGHLPNAVISVDSLSICMNLDG
Sbjct: 241 HGFSWLAFRDLWGFVKLSFSSAIMFCLEIWYMSTIIILAGHLPNAVISVDSLSICMNLDG 300

Query: 301 WENIIFIGINVAMSVRVSNELGKARPRAAEYSVYVTVVQSLLLGLLFMVAIFFAKDHFAV 360
           WENIIFIGINVAMSVRVSNELGKARPRAAEYSVYVTVV+SLLLGLLFMVAIFFAKDHFAV
Sbjct: 301 WENIIFIGINVAMSVRVSNELGKARPRAAEYSVYVTVVESLLLGLLFMVAIFFAKDHFAV 360

Query: 361 IFTSSVTVQKYVSKLAYLLGITMVLNSVQPVVSGVAIGAGWQTLVAYINLGCYYLFGLPL 420
           IFTSSVTVQKYV+KLAYLLGITMVLNSVQPV+SGVAIGAGWQ LVAYINLGCYY+FGLPL
Sbjct: 361 IFTSSVTVQKYVAKLAYLLGITMVLNSVQPVISGVAIGAGWQALVAYINLGCYYIFGLPL 420

Query: 421 GIILGYVANFGVKGLWGGMIAGIAMQTIMLLIVLYKTNWKKEVEETSGRLQKWSGQGNNK 480
           GIILGYVANFGVKGLWGGMIAGIAMQTIMLLIVLYKTNW KEV ETSGRLQKW+GQ N  
Sbjct: 421 GIILGYVANFGVKGLWGGMIAGIAMQTIMLLIVLYKTNWNKEVAETSGRLQKWTGQDNKI 480

Query: 481 REET 485
           +EET
Sbjct: 481 KEET 484

BLAST of Csa1G044870 vs. NCBI nr
Match: gi|590673030|ref|XP_007038775.1| (Detoxifying efflux carrier 35 isoform 1 [Theobroma cacao])

HSP 1 Score: 690.6 bits (1781), Expect = 1.9e-195
Identity = 324/467 (69.38%), Positives = 391/467 (83.73%), Query Frame = 1

Query: 10  VNDGDYPPVKTFRELKDVVWSETVKTWAISGPVIFQIVCQYGTNSVTNIFVGQLGEIELS 69
           + +GDY P ++F+E+K V W ETVK W I+GP+ FQI+CQYGT SVTNIFVG +G IELS
Sbjct: 18  LENGDYGPARSFKEVKSVFWIETVKMWKIAGPIGFQIMCQYGTMSVTNIFVGHIGNIELS 77

Query: 70  GVSIAISVIATFAFGFMFGMGSATETLCGQAFGAGQIDMLGVYMQRSWIIMFLCALIITP 129
            V+IA++VI TF+FGFM GMGSA ETLCGQAFGAGQI MLGVYMQRSWII+     II P
Sbjct: 78  AVTIALAVIGTFSFGFMLGMGSALETLCGQAFGAGQIHMLGVYMQRSWIILLSSCFIILP 137

Query: 130 VYVFTTPILKLLGQQDDVAELAGSFSLLILPQLFSFVVAFPTQKFLQAQSKVWTLAWIGF 189
            Y+F TP+LKLLGQ+D++A LAG F++LI+PQLFS  + FPTQKFLQAQSKV  LAWIGF
Sbjct: 138 FYIFATPLLKLLGQEDEIANLAGKFAILIIPQLFSLAITFPTQKFLQAQSKVNVLAWIGF 197

Query: 190 GALLAHVLMLWLFIFQFGWGTTGAALALNISGWGISISQCIYVIGWCRDAWHGFSWLAFK 249
             L+ HV +LWLF+F F WGTTGAA+A +I+ W I+++Q  YVI W  + WHGFSWLAFK
Sbjct: 198 VTLIFHVGILWLFLFVFDWGTTGAAIAYDITSWVIALAQVAYVIFWSNEGWHGFSWLAFK 257

Query: 250 DLWGFVKLSFSSAIMFCLEIWYMSTIIILAGHLPNAVISVDSLSICMNLDGWENIIFIGI 309
           ++W FV+LS SSA+M CLE+WYM ++I+L GHL NAVI+V SLSICMNL+GWE ++FIGI
Sbjct: 258 EIWAFVRLSISSALMLCLEVWYMMSMILLVGHLNNAVIAVGSLSICMNLNGWEAMLFIGI 317

Query: 310 NVAMSVRVSNELGKARPRAAEYSVYVTVVQSLLLGLLFMVAIFFAKDHFAVIFTSSVTVQ 369
           N AMSVRVSNELG   PRAA+YSVYVTV+QSLL+GLL MVAI   +DHFAVIFTSS  +Q
Sbjct: 318 NAAMSVRVSNELGLGHPRAAKYSVYVTVLQSLLIGLLCMVAIIITRDHFAVIFTSSEEMQ 377

Query: 370 KYVSKLAYLLGITMVLNSVQPVVSGVAIGAGWQTLVAYINLGCYYLFGLPLGIILGYVAN 429
           + V+ LAYLLG+TMVLNSVQPV+SGVAIG GWQTLVAYINLGCYY+FGLPLG +LGY AN
Sbjct: 378 RAVAHLAYLLGVTMVLNSVQPVISGVAIGGGWQTLVAYINLGCYYVFGLPLGFLLGYTAN 437

Query: 430 FGVKGLWGGMIAGIAMQTIMLLIVLYKTNWKKEVEETSGRLQKWSGQ 477
            GV GLWGGMIAGI +QT++LL+VL++TNW KEVE+T+ R++KW GQ
Sbjct: 438 LGVMGLWGGMIAGIGLQTLLLLLVLFRTNWNKEVEQTTERMKKWGGQ 484

BLAST of Csa1G044870 vs. NCBI nr
Match: gi|255545210|ref|XP_002513666.1| (PREDICTED: protein DETOXIFICATION 35 [Ricinus communis])

HSP 1 Score: 683.7 bits (1763), Expect = 2.3e-193
Identity = 314/471 (66.67%), Positives = 390/471 (82.80%), Query Frame = 1

Query: 11  NDGDYPPVKTFRELKDVVWSETVKTWAISGPVIFQIVCQYGTNSVTNIFVGQLGEIELSG 70
           +D DY PVK+F+++K V W+ETVK W I+ P++F I+CQYG NSVTNIFVG +G+ ELS 
Sbjct: 14  DDEDYTPVKSFKDIKSVFWTETVKIWKIATPIVFNIMCQYGINSVTNIFVGHIGDFELSA 73

Query: 71  VSIAISVIATFAFGFMFGMGSATETLCGQAFGAGQIDMLGVYMQRSWIIMFLCALIITPV 130
           V+I++SVI TF+FGFM GMGSA ETLCGQAFGAGQ+ MLG+YMQRSWII+++  + + P+
Sbjct: 74  VAISLSVIGTFSFGFMLGMGSALETLCGQAFGAGQVHMLGIYMQRSWIILWITCIFLLPI 133

Query: 131 YVFTTPILKLLGQQDDVAELAGSFSLLILPQLFSFVVAFPTQKFLQAQSKVWTLAWIGFG 190
           YVF TPILKLLGQ+D VA+LAG F++LI+PQLFS  V FPTQKFLQAQSKV  LAWIGF 
Sbjct: 134 YVFATPILKLLGQEDSVADLAGQFTILIIPQLFSLAVNFPTQKFLQAQSKVRVLAWIGFV 193

Query: 191 ALLAHVLMLWLFIFQFGWGTTGAALALNISGWGISISQCIYVIGWCRDAWHGFSWLAFKD 250
           A + H+ +LWL I+ FGWGT+GAA+A +I+ WG+SI+Q +YVIGWC++ W G S  AFK+
Sbjct: 194 AFILHIPLLWLLIYVFGWGTSGAAIAYDITNWGMSIAQVVYVIGWCKEGWTGLSSSAFKE 253

Query: 251 LWGFVKLSFSSAIMFCLEIWYMSTIIILAGHLPNAVISVDSLSICMNLDGWENIIFIGIN 310
           +W FV+LS +SA+M CLEIWYM +II+L GHL NAVI+V SLSICMN +GWE ++FIG+N
Sbjct: 254 IWAFVRLSLASAVMLCLEIWYMMSIIVLTGHLDNAVIAVGSLSICMNFNGWEAMLFIGVN 313

Query: 311 VAMSVRVSNELGKARPRAAEYSVYVTVVQSLLLGLLFMVAIFFAKDHFAVIFTSSVTVQK 370
            A+SVRVSNELG   PRAA+YSVYVT+ QS L+GLL MV I   KDHFA+IFT+S  +Q 
Sbjct: 314 AAISVRVSNELGSGHPRAAKYSVYVTIFQSFLIGLLSMVIILITKDHFAIIFTNSKAMQV 373

Query: 371 YVSKLAYLLGITMVLNSVQPVVSGVAIGAGWQTLVAYINLGCYYLFGLPLGIILGYVANF 430
            VSKLA+LLGITMVLNS+QPV+ GVAIG+GWQ LVAYIN+GCYY+FGLPLG  LGY    
Sbjct: 374 AVSKLAFLLGITMVLNSIQPVIGGVAIGSGWQALVAYINIGCYYIFGLPLGFFLGYKTKL 433

Query: 431 GVKGLWGGMIAGIAMQTIMLLIVLYKTNWKKEVEETSGRLQKWSGQGNNKR 482
           GV GLWGGMIAG A+QT++LLIVLY+TNW KEVE+TS R++KW GQ N ++
Sbjct: 434 GVAGLWGGMIAGTALQTLLLLIVLYRTNWNKEVEQTSERVRKWGGQENTEK 484

BLAST of Csa1G044870 vs. NCBI nr
Match: gi|727551789|ref|XP_010448397.1| (PREDICTED: protein TRANSPARENT TESTA 12-like [Camelina sativa])

HSP 1 Score: 683.7 bits (1763), Expect = 2.3e-193
Identity = 320/485 (65.98%), Positives = 396/485 (81.65%), Query Frame = 1

Query: 1   MEATAPFL---GVNDGDYPPVKTFRELKDVVWSETVKTWAISGPVIFQIVCQYGTNSVTN 60
           M+ T P L   G  + DY P +++ ++K V+++E+ K W I+ PV F I+CQYG +SVTN
Sbjct: 1   MDPTTPLLTDSGQPEEDYAPARSWTDVKRVLYTESAKMWLIAAPVGFNIICQYGVSSVTN 60

Query: 61  IFVGQLGEIELSGVSIAISVIATFAFGFMFGMGSATETLCGQAFGAGQIDMLGVYMQRSW 120
           IFVG +GE+ELS VSI++SVI +F+FGF+ GMGSA ETLCGQA+GAGQ++MLGVYMQRSW
Sbjct: 61  IFVGHIGEVELSAVSISLSVIGSFSFGFLLGMGSALETLCGQAYGAGQVNMLGVYMQRSW 120

Query: 121 IIMFLCALIITPVYVFTTPILKLLGQQDDVAELAGSFSLLILPQLFSFVVAFPTQKFLQA 180
           II+F+  + + P+Y+F TP L+LLGQ +++A  AG F++L +PQLFS    FPT KFLQA
Sbjct: 121 IILFVSCICLLPLYIFATPALRLLGQAEEIAVPAGKFTILTIPQLFSLAFTFPTSKFLQA 180

Query: 181 QSKVWTLAWIGFGALLAHVLMLWLFIFQFGWGTTGAALALNISGWGISISQCIYVIGWCR 240
           QSKV  +AWIGF AL+ HV MLWLFI  FGWGT GAALA N++ WG +I+Q +YVIGWC 
Sbjct: 181 QSKVVVIAWIGFAALVLHVGMLWLFIIVFGWGTNGAALAFNVTNWGTAIAQIVYVIGWCN 240

Query: 241 DAWHGFSWLAFKDLWGFVKLSFSSAIMFCLEIWYMSTIIILAGHLPNAVISVDSLSICMN 300
           + W G SWLAFKD+W FV+LS +SA+M CLEIWYM +II+L GHL NAVI+VDSLSICMN
Sbjct: 241 EGWTGLSWLAFKDIWAFVRLSIASAVMLCLEIWYMMSIIVLTGHLDNAVIAVDSLSICMN 300

Query: 301 LDGWENIIFIGINVAMSVRVSNELGKARPRAAEYSVYVTVVQSLLLGLLFMVAIFFAKDH 360
           ++G E ++FIGIN A+SVRVSNELG  RPRAA+YSVYVTV++SLL+GL+FMVAI  A+DH
Sbjct: 301 VNGLEAMLFIGINAAISVRVSNELGLGRPRAAKYSVYVTVIESLLIGLIFMVAIIIARDH 360

Query: 361 FAVIFTSSVTVQKYVSKLAYLLGITMVLNSVQPVVSGVAIGAGWQTLVAYINLGCYYLFG 420
           FA+IFTSS  +Q+ VSKLAYLLGITMVLNSVQPV+SGVAIGAGWQ LVAYINLGCYY+FG
Sbjct: 361 FAIIFTSSKVIQRAVSKLAYLLGITMVLNSVQPVISGVAIGAGWQGLVAYINLGCYYIFG 420

Query: 421 LPLGIILGYVANFGVKGLWGGMIAGIAMQTIMLLIVLYKTNWKKEVEETSGRLQKWSGQG 480
           LP G +LGY ANFGV GLW GMIAG A+QT++LLIVLYKTNW KEVEET  R++KW G  
Sbjct: 421 LPFGYLLGYKANFGVMGLWSGMIAGTALQTLLLLIVLYKTNWNKEVEETMERMKKWGGSE 480

Query: 481 NNKRE 483
              ++
Sbjct: 481 TTSKD 485

BLAST of Csa1G044870 vs. NCBI nr
Match: gi|334186918|ref|NP_001190838.1| (detoxifying efflux carrier 35 [Arabidopsis thaliana])

HSP 1 Score: 682.9 bits (1761), Expect = 4.0e-193
Identity = 322/485 (66.39%), Positives = 395/485 (81.44%), Query Frame = 1

Query: 1   MEATAPFL---GVNDGDYPPVKTFRELKDVVWSETVKTWAISGPVIFQIVCQYGTNSVTN 60
           M+ TAP L   G  + DY P +++ ++K V+ +E+ K W I+ PV F I+CQYG +SVTN
Sbjct: 1   MDPTAPLLTHGGEVEEDYAPARSWTDVKRVLSTESAKLWMIAAPVGFNIICQYGVSSVTN 60

Query: 61  IFVGQLGEIELSGVSIAISVIATFAFGFMFGMGSATETLCGQAFGAGQIDMLGVYMQRSW 120
           IFVG +GE+ELS VSI++SVI TF+FGF+ GMGSA ETLCGQA+GAGQ++MLGVYMQRSW
Sbjct: 61  IFVGHIGEVELSAVSISLSVIGTFSFGFLLGMGSALETLCGQAYGAGQVNMLGVYMQRSW 120

Query: 121 IIMFLCALIITPVYVFTTPILKLLGQQDDVAELAGSFSLLILPQLFSFVVAFPTQKFLQA 180
           II+F+    + P+Y+F TP+L+LLGQ +++A  AG F+LL +PQLFS    FPT KFLQA
Sbjct: 121 IILFVSCFFLLPIYIFATPVLRLLGQAEEIAVPAGQFTLLTIPQLFSLAFNFPTSKFLQA 180

Query: 181 QSKVWTLAWIGFGALLAHVLMLWLFIFQFGWGTTGAALALNISGWGISISQCIYVIGWCR 240
           QSKV  +AWIGF AL  HV+MLWLFI +FGWGT GAALA NI+ WG +I+Q +YVIGWC 
Sbjct: 181 QSKVVAIAWIGFVALSLHVIMLWLFIIEFGWGTNGAALAFNITNWGTAIAQIVYVIGWCN 240

Query: 241 DAWHGFSWLAFKDLWGFVKLSFSSAIMFCLEIWYMSTIIILAGHLPNAVISVDSLSICMN 300
           + W G SWLAFK++W FV+LS +SA+M CLEIWYM +II+L G L NAVI+VDSLSICMN
Sbjct: 241 EGWTGLSWLAFKEIWAFVRLSIASAVMLCLEIWYMMSIIVLTGRLDNAVIAVDSLSICMN 300

Query: 301 LDGWENIIFIGINVAMSVRVSNELGKARPRAAEYSVYVTVVQSLLLGLLFMVAIFFAKDH 360
           ++G E ++FIGIN A+SVRVSNELG  RPRAA+YSVYVTV QSLL+GL+FMVAI  A+DH
Sbjct: 301 INGLEAMLFIGINAAISVRVSNELGLGRPRAAKYSVYVTVFQSLLIGLVFMVAIIIARDH 360

Query: 361 FAVIFTSSVTVQKYVSKLAYLLGITMVLNSVQPVVSGVAIGAGWQTLVAYINLGCYYLFG 420
           FA+IFTSS  +Q+ VSKLAYLLGITMVLNSVQPVVSGVA+G GWQ LVAYINLGCYY+FG
Sbjct: 361 FAIIFTSSKVLQRAVSKLAYLLGITMVLNSVQPVVSGVAVGGGWQGLVAYINLGCYYIFG 420

Query: 421 LPLGIILGYVANFGVKGLWGGMIAGIAMQTIMLLIVLYKTNWKKEVEETSGRLQKWSGQG 480
           LP G +LGY+ANFGV GLW GMIAG A+QT++LLIVLYKTNW KEVEET  R++KW G  
Sbjct: 421 LPFGYLLGYIANFGVMGLWSGMIAGTALQTLLLLIVLYKTNWNKEVEETMERMKKWGGSE 480

Query: 481 NNKRE 483
              ++
Sbjct: 481 TTSKD 485

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
DTX35_ARATH  2.5e-195  66.39         Protein DETOXIFICATION 35 OS=Arabidopsis thaliana GN=DTX35 PE=2 SV=1
DTX34_ARATH  1.7e-172  57.73         Protein DETOXIFICATION 34 OS=Arabidopsis thaliana GN=DTX34 PE=2 SV=1
DTX33_ARATH  3.5e-141  49.58         Protein DETOXIFICATION 33 OS=Arabidopsis thaliana GN=DTX33 PE=2 SV=1
DTX32_ARATH  1.9e-134  47.68         Protein DETOXIFICATION 32 OS=Arabidopsis thaliana GN=DTX32 PE=3 SV=1
DTX30_ARATH  1.0e-132  47.46         Protein DETOXIFICATION 30 OS=Arabidopsis thaliana GN=DTX30 PE=2 SV=1
Match Name        E-value   Identity (%)  Description
A0A061G135_THECC  1.3e-195  69.38         Protein DETOXIFICATION OS=Theobroma cacao GN=TCM_015227 PE=3 SV=1
B9RIU7_RICCO      1.6e-193  66.67         Protein DETOXIFICATION OS=Ricinus communis GN=RCOM_1582850 PE=3 SV=1
F4JTB2_ARATH      2.8e-193  66.39         Protein DETOXIFICATION OS=Arabidopsis thaliana GN=DTX35 PE=3 SV=1
A0A061G262_THECC  8.1e-193  67.50         Protein DETOXIFICATION OS=Theobroma cacao GN=TCM_015227 PE=3 SV=1
D7MFY8_ARALL      3.1e-192  67.01         Protein DETOXIFICATION OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_914011 ...
Match Name   E-value   Identity (%)  Description
AT4G25640.2  1.4e-196  66.39         detoxifying efflux carrier 35
AT4G00350.1  9.8e-174  57.73         MATE efflux family protein
AT1G47530.1  2.0e-142  49.58         MATE efflux family protein
AT1G23300.1  1.1e-135  47.68         MATE efflux family protein
AT5G38030.1  5.8e-134  47.46         MATE efflux family protein
Match Name                        E-value   Identity (%)  Description
gi|659067038|ref|XP_008437344.1|  4.9e-268  94.83         PREDICTED: protein TRANSPARENT TESTA 12 [Cucumis melo]
gi|590673030|ref|XP_007038775.1|  1.9e-195  69.38         Detoxifying efflux carrier 35 isoform 1 [Theobroma cacao]
gi|255545210|ref|XP_002513666.1|  2.3e-193  66.67         PREDICTED: protein DETOXIFICATION 35 [Ricinus communis]
gi|727551789|ref|XP_010448397.1|  2.3e-193  65.98         PREDICTED: protein TRANSPARENT TESTA 12-like [Camelina sativa]
gi|334186918|ref|NP_001190838.1|  4.0e-193  66.39         detoxifying efflux carrier 35 [Arabidopsis thaliana]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR002528   MATE_fam
Vocabulary: Biological Process
Term        Definition
GO:0006855  drug transmembrane transport
GO:0055085  transmembrane transport
Vocabulary: Molecular Function
Term        Definition
GO:0015238  drug transmembrane transporter activity
GO:0015297  antiporter activity
Vocabulary: Cellular Component
Term        Definition
GO:0016020  membrane
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006855 drug transmembrane transport
biological_process GO:0055085 transmembrane transport
biological_process GO:0009901 anther dehiscence
biological_process GO:0009812 flavonoid metabolic process
biological_process GO:0009555 pollen development
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0016020 membrane
cellular_component GO:0005774 vacuolar membrane
molecular_function GO:0015297 antiporter activity
molecular_function GO:0015238 drug transmembrane transporter activity
This gene is associated with the following unigenes:
Unigene Name  Analysis Name                        Sequence type in Unigene
CU087128      cucumber EST collection version 3.0  transcribed_cluster
CU095479      cucumber EST collection version 3.0  transcribed_cluster
CU098293      cucumber EST collection version 3.0  transcribed_cluster
CU143984      cucumber EST collection version 3.0  transcribed_cluster
CU145685      cucumber EST collection version 3.0  transcribed_cluster
CU162297      cucumber EST collection version 3.0  transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Csa1G044870.1  Csa1G044870.1  mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name  Unique Name  Type
CU098293      CU098293     transcribed_cluster
CU095479      CU095479     transcribed_cluster
CU145685      CU145685     transcribed_cluster
CU087128      CU087128     transcribed_cluster
CU162297      CU162297     transcribed_cluster
CU143984      CU143984     transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term   IPR Description                        Source    Source Term      Source Description            Alignment
IPR002528  Multi antimicrobial extrusion protein  PFAM      PF01554          MatE                          coord: 261..422, score: 3.4E-23; coord: 41..200, score: 1.7
IPR002528  Multi antimicrobial extrusion protein  TIGRFAMs  TIGR00797        TIGR00797                     coord: 41..436, score: 8.1
None       No IPR available                       PANTHER   PTHR11206        MULTIDRUG RESISTANCE PROTEIN  coord: 4..484, score: 2.6E
None       No IPR available                       PANTHER   PTHR11206:SF152  MATE EFFLUX FAMILY PROTEIN    coord: 4..484, score: 2.6E