Cp4.1LG05g05060.1 (mRNA) Cucurbita pepo (Zucchini)

NameCp4.1LG05g05060.1
TypemRNA
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionProtein DETOXIFICATION
LocationCp4.1LG05 : 2926635 .. 2931268 (-)
Sequence length2190
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATTAGCTTATAATTGTTAATAAATGCAAGGAAATTATAGGTAAATTACCGGCGGTGAAATTCGGTATCGAACACTGGCCGCTTTTCCTTAGCGGGAAGGACACTTGAGGCCTCTTCCCTTTGCCGCTTGCCTTTTGCCCGCCAACGCCAACGCCAACGCCAACGAATGCCTCACTTCCATTTGCTTCTCTTCGTCAACTGACCAACAAACTCGTCCTCTTCGCCTCCGACCCTCACAGGCTCCTACTAATCAAACCAATTTTTCAATACCTCAATTGGCAACTGGCTTCAGACAAAGACACCCTCTTCCCTCTTTCTTCCGTGCTTCCAATTTGAATTGGGGTTTCTTCCACGCCGTCGGAGTTGAGGTACCCAACTTGAAATGTCTTGGTGGCAATGTTTCAACCATGAGTCTCTTAGGAGGTTTATCTGTTTAGACTTTGGGTTGATTTTTTTTTTTTGTTTTCTTTTTTGGTCTAAAGGATTGTGGACCATGAGGGGCCTCCAGTTAGGAAGTTTTGCCCAAAGGTTAGGCTGTTTTTGGTGGAGAAGTTAATTTAGACTCTGTCTCTGGGTTAGTTTTCCATAGTTCTTCTTTATATATGTTGTCTTTGTTCCAACAAAGTTTTCACTTAGAGCTCAAACCCACTTCAATATTCTGTGTTGGGTTTTGAAACAATTGGATTATCGACGTTTTTCTGCAGGCTCTGGGTATCGATTTCGAACAGTTCAGGGGATGGCCTTCTCGATCATGTCTGACGAAGATGATCCTTATCCCTCTTGGGAAAAAACGAGAATGCCTATTCGTATTTTCTTCAAGGATGCCAGGTAAAGAAGCTGCTCAATATTTATCAAACTTTTTTGTTGTCTTTGTTTCGCTATGCTTTCTAGACACAGATTCAATGGGGGACTGTCTTTTAGCTTTCTGGATTATGATCCATTATGTTTGTAATAGGAAACATTCTTTCATGCGTGGTAATTCCTTTCTTGTTCTGTTTTTTGTGAACAGACATGTCTTTAAGTTGGATGAACTTGGTCGGGAAATAGCTCAGATTGCTTTGCCTGCCGCACTAGCTTTGGCAGCTGACCCTGTAGCTTCTCTGGTTGACACAGCATTCATTGGCCAAATAGGTAACTCTGCTTTTCTTTTCAAGATTTTTAGGAATCACGGATCTCTACAATTGTATGATATTGTCTACTTTAAGCATAAGCTCTCGTAACTTTGCTTTGGGCTTCCCCAAAAGGCCTCATACCAATGGACATGTATTCCTTACTTATTAACCCGTGATCAATCCCTAAATTAGCTAATATGCGACTCCCTCCCAACAATTCTCAACAAGGGTCGTGTTCATATAGCTATATCATGGATGCAGGTCCTGTGGAGCTTGCTGCCGTTGGAGTTGCTATTGCTTTATTCAATCAAGTTTCAAGAATTGCAATTTTCCCCCTTGTTAGTGTCACCACATCTTTTGTTGCTGAGGAAGATGCTATTGGAAGTGCTTGTAATGAAGCAAAGGATAATAACGATAAGGAGACAGGTTTATTCACAAATGATGAATCAAAATTGATGATCCCACACAATGGTATGCATTCCTTCATCCACTACTCAACGTTCTTCTGATTCATCATCTGATTCCATTTGTTATCCTTGACAGGGAAAACTGAAGAGAATGGAAGAAGATATATCCCATCGGCCTCTTCGGCTTTGGTTATCGGCGGTGTTCTTGGTCTCATACAAGCCATTTTCTTGATATCTGGAGCAAAACCTCTACTAAACTTCATGGGAGTCAAGTCAGTAAGTATTACCTATTGAAACTAATGCCTCAATCCTTTCTTGTACATGATTGTTGCTGCATAACATGAACGTAGGATTCGCCGATGATGACTTCTGCACAACAATACTTGACACTGCGGTCACTCGGTGCCCCGGCAGTTCTTCTCTCCTTGGCCATGCAAGGCGTCTTTCGCGGTTTTAAGGACACGAAAACTCCTTTATTTGCAACTGGTATACTTAAAACCACCTTTTCTTTTTGGAACCTCTGAATTATCCATGTTCATATTTCCCCCCTTGTTCTTCATCAGTGGCTGGAGATGCAACAAACATCATTCTAGACCCAATATTCATATTCATTTTTCGTTTAGGCGTCAGTGGTGCAGCCATTGCACACGTTATATCGCAGTAAGTGTTATGTTATACGTCGATTCCACGACTTCTCGAGCATTTTCTGGCTATTGTTTTTACATTTACTTCTTATTTTGTAGGTACCTAATAGCACTGATACTCTTTTGGAGATTAATGGGACAAGTTGATCTCTTACCTCCCAGTATCAAACATTTGCAATTTAGTCGGTTTCTGAAAAATGGTAAGAGATTTGTGTAAAGCATGCTGCAAAATCTGAAGATTTGATTAACCATGTTCATGCTTCTGTAGGCTTTCTATTATTAATGAGAGTCATTGCTGTGACGTTCTGCGTGACGCTTGCTGCGTCCCTGGCTGCACGCCAAGGATCCACATCAATGGCGGCATTTCAGGTCTGCTTGCAGGTCTGGTTGGCAATGTCTCTACTTGCCGATGGCTTAGCTGTTGCTGGGCAGGTATGTTAAAGATTTCTTTCAATCATTTGCATATGTATTCAGAACTACAAGCCTATCATGTTAAGTGTTCTTATTGTTCTTCAGACAATACTAGCAAGTGCATTTGCCCAAAACGACCATGATAAGGCAACGACTGCAGCATCACGAGTATTACAGGTTCCTATCGTTTTTTGAGAGATCTAATTACTTTTAGTAAAGGGAGTTTCTCTAATATATAGTAAACTATACTGCAGCTGGGATTGTTGCTGGGATTGGGGCTTGGAGTCTTCGTTGGAGCCGGGATGACATTCGGGGCAAAGTTATTTACAAGTGACGTCGATGTCCTCCACCTAATCGGCATAGGAACTCCAGTATGTCACCTTGTAACTCTGCCAACGCAAACTATAGGAATAAATCACTCATTTCAATATCAAACGGAAGCCAAAACTGAATTTTGTTTTAATCTGTACAGTTTATTGCTGCTATGCAACCAATCAATGCCTTGGCGTTTGTTTTTGATGGCATCAACTTTGGAGCTTCTGATTTTGCGTACTCAGCTTACTCCATGGTGAGTTCATAGTCCCCCTGTTTGCATACTCACCAATTATACAAAAAACAACTATGGTGGCTGGTTCTTGTCCTAAAATCTAGTCATGGTCGGGATCTTACAAATGTTGAGGATTGTTGGGAGATGAGTCCCACATCGGCTAAATAAGGGGTTGATCATGGGTTTATAAGTAAGGAACACTATCTTCATTTGTATGAGGCCGTTTGAGGAAACCAAAAGTAAAGCCACGAGAGCTTATGCTCAAAGTAGACAATATCATACAATTGTGGAGGTTCGTGGTTTCTGACATGGTATCAGAGCCATGCCCTTAACTTAGCCATGTCAATAGAATCCTCAAGTGTCGAACAAAGGGTGTACTTTGTTCGAAGGCTTCAGAATAAGAGTCGAGTCTCGATTAAGGGGAGGCTATTCGAGGGCTCCATAGGTCTTAGGGGAGGCTGTTCGAGGGCTCCATATGTCTTAGGGGAGGCTGTTCGAGGGCTCCATAGGTCTTAGGGGAGGCTTTATCGTGTACTTTGTTCGAGGGAAGGATGATTGAGGATTGTTGGGAGAGGAGTGTCACATCGGCTAATTAAGGGGTTGATCATGAGTTTATAAGTAAGGAACAGTATCTTTATTGGTATGAGACCTTTTGGGTAAACCAAAAGCAAAGCCACGAAAGCTTATGCTCAAAGTGAACAATATCATACCAATATGGAGGTCCGTGGTTCCTAACAACAATGATACGTATAGTAACTCGAGCCTAAAATGAACAAGATCGTAGCCTAAAATGAATTCTAGATGAGCTAAATGATGAGCTTTAATCATCTAACCATCCAAACTTACATTCCTTCTCTCCAGGTTCTGGTGGCTATTATCAGCATCTTCTGTTTGTTCATTCTCTCCTCAACTCAAGGATTCATCGGTATCTGGGTCGCCTTAACCATCTACATGAGCTTACGAACACTAGCCGGATTCTGGAGGTACTTCCATCTACACTCTATCTATCCGAACTCGTGTCATCCAACACTCGTTAACGAACGATCACAAATCAGCTACTATATTGCGGTGTTCATTACAAAAATCTTATGTTTTTGAACTGCACTGACCTTCGCTTACAGGGTCGGCACGGGAACAGGACCTTGGTATTTCCTCCAAAGCTAGATTCCAAGTCGTTTGTAGGACATGGATAAGCTGTTGGATGATGTTTTTTTCTTCCATTTATGTCCATACCTTCATATATAGGCTGGATTGCATCAGGACATTCAGCTTAAAACCTTAAAAAGTTGAAGCTTAGAGACCATGTACAGAAAAGACAAGAGAAGACAGCATGTTCTTGAACAAGAAGACAAGAAAACATAGAAGATTTGTACGAGTGTACGAGTGTTTTGTAGCTGTAAATCACAGTGTTCAGTTCAGTGTTACAAGATTTTATGCAGGTTGTTTGTTATATGCAATCCTCTCCTTTGAAACTATTTGTTTAAAAATGTATTTTTTTTTATTATTGGTAAAAA

mRNA sequence

ATTAGCTTATAATTGTTAATAAATGCAAGGAAATTATAGGTAAATTACCGGCGGTGAAATTCGGTATCGAACACTGGCCGCTTTTCCTTAGCGGGAAGGACACTTGAGGCCTCTTCCCTTTGCCGCTTGCCTTTTGCCCGCCAACGCCAACGCCAACGCCAACGAATGCCTCACTTCCATTTGCTTCTCTTCGTCAACTGACCAACAAACTCGTCCTCTTCGCCTCCGACCCTCACAGGCTCCTACTAATCAAACCAATTTTTCAATACCTCAATTGGCAACTGGCTTCAGACAAAGACACCCTCTTCCCTCTTTCTTCCGTGCTTCCAATTTGAATTGGGGTTTCTTCCACGCCGTCGGAGTTGAGACTCTGTCTCTGGGCTCTGGGTATCGATTTCGAACAGTTCAGGGGATGGCCTTCTCGATCATGTCTGACGAAGATGATCCTTATCCCTCTTGGGAAAAAACGAGAATGCCTATTCGTATTTTCTTCAAGGATGCCAGACATGTCTTTAAGTTGGATGAACTTGGTCGGGAAATAGCTCAGATTGCTTTGCCTGCCGCACTAGCTTTGGCAGCTGACCCTGTAGCTTCTCTGTTTCAAGAATTGCAATTTTCCCCCTTTGTCACCACATCTTTTGTTGCTGAGGAAGATGCTATTGGAAGTGCTTGTAATGAAGCAAAGGATAATAACGATAAGGAGACAGGTTTATTCACAAATGATGAATCAAAATTGATGATCCCACACAATGGGAAAACTGAAGAGAATGGAAGAAGATATATCCCATCGGCCTCTTCGGCTTTGGTTATCGGCGGTGTTCTTGGTCTCATACAAGCCATTTTCTTGATATCTGGAGCAAAACCTCTACTAAACTTCATGGGAGTCAAGTCAGATTCGCCGATGATGACTTCTGCACAACAATACTTGACACTGCGGTCACTCGGTGCCCCGGCAGTTCTTCTCTCCTTGGCCATGCAAGGCGTCTTTCGCGGTTTTAAGGACACGAAAACTCCTTTATTTGCAACTGTGGCTGGAGATGCAACAAACATCATTCTAGACCCAATATTCATATTCATTTTTCGTTTAGGCGTCAGTGGTGCAGCCATTGCACACGTTATATCGCAGTACCTAATAGCACTGATACTCTTTTGGAGATTAATGGGACAAGTTGATCTCTTACCTCCCAGTATCAAACATTTGCAATTTAGTCGGTTTCTGAAAAATGGCTTTCTATTATTAATGAGAGTCATTGCTGTGACGTTCTGCGTGACGCTTGCTGCGTCCCTGGCTGCACGCCAAGGATCCACATCAATGGCGGCATTTCAGGTCTGCTTGCAGGTCTGGTTGGCAATGTCTCTACTTGCCGATGGCTTAGCTGTTGCTGGGCAGACAATACTAGCAAGTGCATTTGCCCAAAACGACCATGATAAGGCAACGACTGCAGCATCACGAGTATTACAGCTGGGATTGTTGCTGGGATTGGGGCTTGGAGTCTTCGTTGGAGCCGGGATGACATTCGGGGCAAAGTTATTTACAAGTGACGTCGATGTCCTCCACCTAATCGGCATAGGAACTCCATTTATTGCTGCTATGCAACCAATCAATGCCTTGGCGTTTGTTTTTGATGGCATCAACTTTGGAGCTTCTGATTTTGCGTACTCAGCTTACTCCATGGTTCTGGTGGCTATTATCAGCATCTTCTGTTTGTTCATTCTCTCCTCAACTCAAGGATTCATCGGTATCTGGGTCGCCTTAACCATCTACATGAGCTTACGAACACTAGCCGGATTCTGGAGGGTCGGCACGGGAACAGGACCTTGGTATTTCCTCCAAAGCTAGATTCCAAGTCGTTTGTAGGACATGGATAAGCTGTTGGATGATGTTTTTTTCTTCCATTTATGTCCATACCTTCATATATAGGCTGGATTGCATCAGGACATTCAGCTTAAAACCTTAAAAAGTTGAAGCTTAGAGACCATGTACAGAAAAGACAAGAGAAGACAGCATGTTCTTGAACAAGAAGACAAGAAAACATAGAAGATTTGTACGAGTGTACGAGTGTTTTGTAGCTGTAAATCACAGTGTTCAGTTCAGTGTTACAAGATTTTATGCAGGTTGTTTGTTATATGCAATCCTCTCCTTTGAAACTATTTGTTTAAAAATGTATTTTTTTTTATTATTGGTAAAAA

Coding sequence (CDS)

ATGGCCTTCTCGATCATGTCTGACGAAGATGATCCTTATCCCTCTTGGGAAAAAACGAGAATGCCTATTCGTATTTTCTTCAAGGATGCCAGACATGTCTTTAAGTTGGATGAACTTGGTCGGGAAATAGCTCAGATTGCTTTGCCTGCCGCACTAGCTTTGGCAGCTGACCCTGTAGCTTCTCTGTTTCAAGAATTGCAATTTTCCCCCTTTGTCACCACATCTTTTGTTGCTGAGGAAGATGCTATTGGAAGTGCTTGTAATGAAGCAAAGGATAATAACGATAAGGAGACAGGTTTATTCACAAATGATGAATCAAAATTGATGATCCCACACAATGGGAAAACTGAAGAGAATGGAAGAAGATATATCCCATCGGCCTCTTCGGCTTTGGTTATCGGCGGTGTTCTTGGTCTCATACAAGCCATTTTCTTGATATCTGGAGCAAAACCTCTACTAAACTTCATGGGAGTCAAGTCAGATTCGCCGATGATGACTTCTGCACAACAATACTTGACACTGCGGTCACTCGGTGCCCCGGCAGTTCTTCTCTCCTTGGCCATGCAAGGCGTCTTTCGCGGTTTTAAGGACACGAAAACTCCTTTATTTGCAACTGTGGCTGGAGATGCAACAAACATCATTCTAGACCCAATATTCATATTCATTTTTCGTTTAGGCGTCAGTGGTGCAGCCATTGCACACGTTATATCGCAGTACCTAATAGCACTGATACTCTTTTGGAGATTAATGGGACAAGTTGATCTCTTACCTCCCAGTATCAAACATTTGCAATTTAGTCGGTTTCTGAAAAATGGCTTTCTATTATTAATGAGAGTCATTGCTGTGACGTTCTGCGTGACGCTTGCTGCGTCCCTGGCTGCACGCCAAGGATCCACATCAATGGCGGCATTTCAGGTCTGCTTGCAGGTCTGGTTGGCAATGTCTCTACTTGCCGATGGCTTAGCTGTTGCTGGGCAGACAATACTAGCAAGTGCATTTGCCCAAAACGACCATGATAAGGCAACGACTGCAGCATCACGAGTATTACAGCTGGGATTGTTGCTGGGATTGGGGCTTGGAGTCTTCGTTGGAGCCGGGATGACATTCGGGGCAAAGTTATTTACAAGTGACGTCGATGTCCTCCACCTAATCGGCATAGGAACTCCATTTATTGCTGCTATGCAACCAATCAATGCCTTGGCGTTTGTTTTTGATGGCATCAACTTTGGAGCTTCTGATTTTGCGTACTCAGCTTACTCCATGGTTCTGGTGGCTATTATCAGCATCTTCTGTTTGTTCATTCTCTCCTCAACTCAAGGATTCATCGGTATCTGGGTCGCCTTAACCATCTACATGAGCTTACGAACACTAGCCGGATTCTGGAGGGTCGGCACGGGAACAGGACCTTGGTATTTCCTCCAAAGCTAG

Protein sequence

MAFSIMSDEDDPYPSWEKTRMPIRIFFKDARHVFKLDELGREIAQIALPAALALAADPVASLFQELQFSPFVTTSFVAEEDAIGSACNEAKDNNDKETGLFTNDESKLMIPHNGKTEENGRRYIPSASSALVIGGVLGLIQAIFLISGAKPLLNFMGVKSDSPMMTSAQQYLTLRSLGAPAVLLSLAMQGVFRGFKDTKTPLFATVAGDATNIILDPIFIFIFRLGVSGAAIAHVISQYLIALILFWRLMGQVDLLPPSIKHLQFSRFLKNGFLLLMRVIAVTFCVTLAASLAARQGSTSMAAFQVCLQVWLAMSLLADGLAVAGQTILASAFAQNDHDKATTAASRVLQLGLLLGLGLGVFVGAGMTFGAKLFTSDVDVLHLIGIGTPFIAAMQPINALAFVFDGINFGASDFAYSAYSMVLVAIISIFCLFILSSTQGFIGIWVALTIYMSLRTLAGFWRVGTGTGPWYFLQS
BLAST of Cp4.1LG05g05060.1 vs. Swiss-Prot
Match: DTX42_ARATH (Protein DETOXIFICATION 42 OS=Arabidopsis thaliana GN=DTX42 PE=2 SV=2)

HSP 1 Score: 567.8 bits (1462), Expect = 1.1e-160
Identity = 323/503 (64.21%), Postives = 367/503 (72.96%), Query Frame = 1

Query: 20  RMPIRIFFKDARHVFKLDELGREIAQIALPAALALAADPVASLFQEL---QFSPF----- 79
           R P+ IFF D R V K DELG EIA+IALPAALAL ADP+ASL       Q  P      
Sbjct: 13  RNPLYIFFSDFRSVLKFDELGLEIARIALPAALALTADPIASLVDTAFIGQIGPVELAAV 72

Query: 80  --------------------VTTSFVAEEDAIGSACNEAKDNNDK-ETGLFTNDESKL-M 139
                               +TTSFVAEEDA  S  +  +D+ +  E G+    E  + +
Sbjct: 73  GVSIALFNQVSRIAIFPLVSITTSFVAEEDACSSQQDTVRDHKECIEIGINNPTEETIEL 132

Query: 140 IPHNGKTEENG-----------------RRYIPSASSALVIGGVLGLIQAIFLISGAKPL 199
           IP   K   +                  +R IPSASSAL+IGGVLGL QA+FLIS AKPL
Sbjct: 133 IPEKHKDSLSDEFKTSSSIFSISKPPAKKRNIPSASSALIIGGVLGLFQAVFLISAAKPL 192

Query: 200 LNFMGVKSDSPMMTSAQQYLTLRSLGAPAVLLSLAMQGVFRGFKDTKTPLFATVAGDATN 259
           L+FMGVK DSPMM  +Q+YL+LRSLGAPAVLLSLA QGVFRGFKDT TPLFATV GD TN
Sbjct: 193 LSFMGVKHDSPMMRPSQRYLSLRSLGAPAVLLSLAAQGVFRGFKDTTTPLFATVIGDVTN 252

Query: 260 IILDPIFIFIFRLGVSGAAIAHVISQYLIALILFWRLMGQVDLLPPSIKHLQFSRFLKNG 319
           IILDPIFIF+FRLGV+GAA AHVISQYL+  IL W+LMGQVD+   S KHLQF RF+KNG
Sbjct: 253 IILDPIFIFVFRLGVTGAATAHVISQYLMCGILLWKLMGQVDIFNMSTKHLQFCRFMKNG 312

Query: 320 FLLLMRVIAVTFCVTLAASLAARQGSTSMAAFQVCLQVWLAMSLLADGLAVAGQTILASA 379
           FLLLMRVIAVTFCVTL+ASLAAR+GSTSMAAFQVCLQVWLA SLLADG AVAGQ ILASA
Sbjct: 313 FLLLMRVIAVTFCVTLSASLAAREGSTSMAAFQVCLQVWLATSLLADGYAVAGQAILASA 372

Query: 380 FAQNDHDKATTAASRVLQLGLLLGLGLGVFVGAGMTFGAKLFTSDVDVLHLIGIGTPFIA 439
           FA+ D+ +A   ASRVLQLGL+LG  L V +GAG+ FGA++FT D  VLHLI IG PF+A
Sbjct: 373 FAKKDYKRAAATASRVLQLGLVLGFVLAVILGAGLHFGARVFTKDDKVLHLISIGLPFVA 432

Query: 440 AMQPINALAFVFDGINFGASDFAYSAYSMVLVAIISIFCLFILSSTQGFIGIWVALTIYM 476
             QPINALAFVFDG+NFGASDF Y+A S+V+VAI+SI CL  LSST GFIG+W  LTIYM
Sbjct: 433 GTQPINALAFVFDGVNFGASDFGYAAASLVMVAIVSILCLLFLSSTHGFIGLWFGLTIYM 492

BLAST of Cp4.1LG05g05060.1 vs. Swiss-Prot
Match: DTX43_ARATH (Protein DETOXIFICATION 43 OS=Arabidopsis thaliana GN=DTX43 PE=1 SV=1)

HSP 1 Score: 490.7 bits (1262), Expect = 1.8e-137
Identity = 284/507 (56.02%), Postives = 346/507 (68.24%), Query Frame = 1

Query: 18  KTRMPIRIFFKDARHVFKLDELGREIAQIALPAALALAADPVASLFQ------------- 77
           K  +P  + FKD RHVF  D  GREI  IA PAALALAADP+ASL               
Sbjct: 12  KKPIPFLVIFKDLRHVFSRDTTGREILGIAFPAALALAADPIASLIDTAFVGRLGAVQLA 71

Query: 78  -------------ELQFSPFV--TTSFVAEEDAIGSACNEAKDNN-----------DKET 137
                         +   P V  TTSFVAEED +     EA   N             E 
Sbjct: 72  AVGVSIAIFNQASRITIFPLVSLTTSFVAEEDTMEKMKEEANKANLVHAETILVQDSLEK 131

Query: 138 GLFT---NDESKLMIPHNGKTEENG--------RRYIPSASSALVIGGVLGLIQAIFLIS 197
           G+ +   ND ++   P    T+ N         +R I +AS+A+++G +LGL+QAIFLI 
Sbjct: 132 GISSPTSNDTNQPQQPPAPDTKSNSGNKSNKKEKRTIRTASTAMILGLILGLVQAIFLIF 191

Query: 198 GAKPLLNFMGVKSDSPMMTSAQQYLTLRSLGAPAVLLSLAMQGVFRGFKDTKTPLFATVA 257
            +K LL  MGVK +SPM++ A +YL++R+LGAPA+LLSLAMQG+FRGFKDTKTPLFATV 
Sbjct: 192 SSKLLLGVMGVKPNSPMLSPAHKYLSIRALGAPALLLSLAMQGIFRGFKDTKTPLFATVV 251

Query: 258 GDATNIILDPIFIFIFRLGVSGAAIAHVISQYLIALILFWRLMGQVDLLPPSIKHLQFSR 317
            D  NI+LDPIFIF+ RLG+ GAAIAHVISQY + LILF  L  +V+L+PP+   LQF R
Sbjct: 252 ADVINIVLDPIFIFVLRLGIIGAAIAHVISQYFMTLILFVFLAKKVNLIPPNFGDLQFGR 311

Query: 318 FLKNGFLLLMRVIAVTFCVTLAASLAARQGSTSMAAFQVCLQVWLAMSLLADGLAVAGQT 377
           FLKNG LLL R IAVTFC TLAA++AAR G+T MAAFQ+CLQVWL  SLL DGLAVAGQ 
Sbjct: 312 FLKNGLLLLARTIAVTFCQTLAAAMAARLGTTPMAAFQICLQVWLTSSLLNDGLAVAGQA 371

Query: 378 ILASAFAQNDHDKATTAASRVLQLGLLLGLGLGVFVGAGMTFGAKLFTSDVDVLHLIGIG 437
           ILA +FA+ D++K T  ASRVLQ+G +LGLGL VFVG G+ FGA +F+ D  V+HL+ IG
Sbjct: 372 ILACSFAEKDYNKVTAVASRVLQMGFVLGLGLSVFVGLGLYFGAGVFSKDPAVIHLMAIG 431

Query: 438 TPFIAAMQPINALAFVFDGINFGASDFAYSAYSMVLVAIISIFCLFILSSTQGFIGIWVA 475
            PFIAA QPIN+LAFV DG+NFGASDFAY+AYSMV VA ISI  +  ++ T GFIGIW+A
Sbjct: 432 IPFIAATQPINSLAFVLDGVNFGASDFAYTAYSMVGVAAISIAAVIYMAKTNGFIGIWIA 491

BLAST of Cp4.1LG05g05060.1 vs. Swiss-Prot
Match: DTX44_ARATH (Protein DETOXIFICATION 44, chloroplastic OS=Arabidopsis thaliana GN=DTX44 PE=2 SV=1)

HSP 1 Score: 330.1 bits (845), Expect = 4.0e-89
Identity = 198/447 (44.30%), Postives = 281/447 (62.86%), Query Frame = 1

Query: 36  LDELGREIAQIALPAALALAADPVASLFQELQFSPFVTTSFVAEEDAIGSACN----EAK 95
           + ++G EI  IALPAALALAADP+ SL      + FV     AE  A+G + +     +K
Sbjct: 73  IGKIGMEIMSIALPAALALAADPITSLVD----TAFVGHIGSAELAAVGVSVSVFNLVSK 132

Query: 96  DNND---KETGLFTNDESKLMIPHNGKTEENGRRYIPSASSALVIGGVLGLIQAIFLISG 155
             N      T  F  +E  +    +  + E  ++ +PS S++LV+   +G+ +AI L  G
Sbjct: 133 LFNVPLLNVTTSFVAEEQAIAAKDDNDSIETSKKVLPSVSTSLVLAAGVGIAEAIALSLG 192

Query: 156 AKPLLNFMGVKSDSPMMTSAQQYLTLRSLGAPAVLLSLAMQGVFRGFKDTKTPLFATVAG 215
           +  L++ M +  DSPM   A+Q+L LR+ GAP ++++LA QG FRGFKDT TPL+A VAG
Sbjct: 193 SDFLMDVMAIPFDSPMRIPAEQFLRLRAYGAPPIVVALAAQGAFRGFKDTTTPLYAVVAG 252

Query: 216 DATNIILDPIFIFIFRLGVSGAAIAHVISQYLIALILFWRLMGQVDLLPPSIKHLQFSRF 275
           +  N +LDPI IF+   G+SGAA A VIS+YLIA IL W+L   V LL P IK  + +++
Sbjct: 253 NVLNAVLDPILIFVLGFGISGAAAATVISEYLIAFILLWKLNENVVLLSPQIKVGRANQY 312

Query: 276 LKNGFLLLMRVIAVTFCVTLAASLAARQGSTSMAAFQVCLQVWLAMSLLADGLAVAGQTI 335
           LK+G LL+ R +A+    TLA SLAA+ G T MA  Q+ L++WLA+SLL D LA+A Q++
Sbjct: 313 LKSGGLLIGRTVALLVPFTLATSLAAQNGPTQMAGHQIVLEIWLAVSLLTDALAIAAQSL 372

Query: 336 LASAFAQNDHDKATTAASRVLQLGLLLGLGLGVFVGAGMTFGAKLFTSDVDVLHLIGIGT 395
           LA+ ++Q ++ +A      VLQ+GL  G GL   +       + LFT+D +VL +   GT
Sbjct: 373 LATTYSQGEYKQAREVLFGVLQVGLATGTGLAAVLFITFEPFSSLFTTDSEVLKIALSGT 432

Query: 396 PFIAAMQPINALAFVFDGINFGASDFAYSAYSMVLVAIISIFCLFILSSTQGFIGIWVAL 455
            F+A  QP+NALAFV DG+ +G SDF ++AYSMV+V  IS   + + + T G  GIW  L
Sbjct: 433 LFVAGSQPVNALAFVLDGLYYGVSDFGFAAYSMVIVGFISSLFMLVAAPTFGLAGIWTGL 492

Query: 456 TIYMSLRTLAGFWRVGTGTGPWYFLQS 476
            ++M+LR +AG WR+GT TGPW  L S
Sbjct: 493 FLFMALRLVAGAWRLGTRTGPWKMLWS 515

BLAST of Cp4.1LG05g05060.1 vs. Swiss-Prot
Match: DTX45_ARATH (Protein DETOXIFICATION 45, chloroplastic OS=Arabidopsis thaliana GN=DTX45 PE=2 SV=2)

HSP 1 Score: 293.9 bits (751), Expect = 3.2e-78
Identity = 171/404 (42.33%), Postives = 262/404 (64.85%), Query Frame = 1

Query: 72  VTTSFVAEEDAIGSACNEAKDNNDKETGLFTNDESKLMIPHNGKTEENGRRYIPSASSAL 131
           V TSFVAE+ A  +A + A +++  +            IP  G  E   R+ + S S+AL
Sbjct: 166 VATSFVAEDIAKIAAQDLASEDSQSD------------IPSQGLPE---RKQLSSVSTAL 225

Query: 132 VIGGVLGLIQAIFLISGAKPLLNFMGVKSDSPMMTSAQQYLTLRSLGAPAVLLSLAMQGV 191
           V+   +G+ +A+ L   + P L  MG++S S M   A+Q+L LR+LGAPA ++SLA+QG+
Sbjct: 226 VLAIGIGIFEALALSLASGPFLRLMGIQSMSEMFIPARQFLVLRALGAPAYVVSLALQGI 285

Query: 192 FRGFKDTKTPLFATVAGDATNIILDPIFIFIFRLGVSGAAIAHVISQYLIALILFWRLMG 251
           FRGFKDTKTP++    G+   + L P+FI+ FR+GV+GAAI+ VISQY +A+++   L  
Sbjct: 286 FRGFKDTKTPVYCLGIGNFLAVFLFPLFIYKFRMGVAGAAISSVISQYTVAILMLILLNK 345

Query: 252 QVDLLPPSIKHLQFSRFLKNGFLLLMRVIAVTFCVTLAASLAARQGSTSMAAFQVCLQVW 311
           +V LLPP I  L+F  +LK+G  +L R ++V   +T+A S+AARQG  +MAA Q+C+QVW
Sbjct: 346 RVILLPPKIGSLKFGDYLKSGGFVLGRTLSVLVTMTVATSMAARQGVFAMAAHQICMQVW 405

Query: 312 LAMSLLADGLAVAGQTILASAFAQNDHDKATTAASRVLQLGLLLGLGLGVFVGAGMTFGA 371
           LA+SLL D LA +GQ ++AS+ ++ D +      + VL++G++ G+ L + +G   +  A
Sbjct: 406 LAVSLLTDALASSGQALIASSASKRDFEGVKEVTTFVLKIGVVTGIALAIVLGMSFSSIA 465

Query: 372 KLFTSDVDVLHLIGIGTPFIAAMQPINALAFVFDGINFGASDFAYSAYSMVLVAIISIFC 431
            LF+ D +VL ++  G  F+AA QPI ALAF+FDG+++G SDF Y+A SM++V  IS   
Sbjct: 466 GLFSKDPEVLRIVRKGVLFVAATQPITALAFIFDGLHYGMSDFPYAACSMMVVGGISSAF 525

Query: 432 LFILSSTQGFIGIWVALTIYMSLRTLAGFWRVGTGTGPWYFLQS 476
           +    +  G  G+WV L+++M LR +AGF R+    GPW+F+ +
Sbjct: 526 MLYAPAGLGLSGVWVGLSMFMGLRMVAGFSRLMWRKGPWWFMHT 554

BLAST of Cp4.1LG05g05060.1 vs. Swiss-Prot
Match: DINF_ECOLI (DNA-damage-inducible protein F OS=Escherichia coli (strain K12) GN=dinF PE=2 SV=1)

HSP 1 Score: 71.2 bits (173), Expect = 3.3e-11
Identity = 92/355 (25.92%), Postives = 157/355 (44.23%), Query Frame = 1

Query: 139 LIQAIFLISGAKPLLNFMG----------VKSDSPMMTSAQQYLTLRSLGAPAVLLSLAM 198
           L+Q + L  GA  L+  +           V     ++  A+++L +R L APA L +L +
Sbjct: 107 LVQPLLLALGAGALIALLRTPIIDLALHIVGGSEAVLEQARRFLEIRWLSAPASLANLVL 166

Query: 199 QGVFRGFKDTKTPLFATVAGDATNIILDPIFIFIFRLGVSGAAIAHVISQY---LIALIL 258
            G   G +  + P+   V G+  NI+LD   +    + V GAA+A VI++Y   LI L++
Sbjct: 167 LGWLLGVQYARAPVILLVVGNILNIVLDVWLVMGLHMNVQGAALATVIAEYATLLIGLLM 226

Query: 259 FWRLMGQVDLLPPSIKHL---QFSRFLKNGFLLLMRVIAVTFCVTLAASLAARQGSTSMA 318
             +++    +    +K      F R L     +++R + +  C      L AR GS  +A
Sbjct: 227 VRKILKLRGISGEMLKTAWRGNFRRLLALNRDIMLRSLLLQLCFGAITVLGARLGSDIIA 286

Query: 319 AFQVCLQVWLAMSLLADGLAVAGQTILASAFAQNDHDK---ATTAASRVLQLGLLLGLGL 378
              V + +    +   DG A A +     A+   D  +      AA R  Q G++  L  
Sbjct: 287 VNAVLMTLLTFTAYALDGFAYAVEAHSGQAYGARDGSQLLDVWRAACR--QSGIVALLFS 346

Query: 379 GVFVGAGMTFGAKLFTSDVDVLHLIGIGTPFIAAMQPINALAFVFDGINFGASDFAYSAY 438
            V++ AG    A L TS   +  L      +   +  +    ++ DG+  GA+       
Sbjct: 347 VVYLLAGEHIIA-LLTSLTQIQQLADRYLIWQVILPVVGVWCYLLDGMFIGATRATEMRN 406

Query: 439 SMVLVAIISIFCLFILS-STQGFIGIWVALTIYMSLR--TLAGFWRVGTGTGPWY 472
           SM + A  + F L +L+    G   +W+ALT++++LR  +LA  WR     G W+
Sbjct: 407 SMAVAA--AGFALTLLTLPWLGNHALWLALTVFLALRGLSLAAIWRRHWRNGTWF 456

BLAST of Cp4.1LG05g05060.1 vs. TrEMBL
Match: A0A0A0KXA7_CUCSA (Protein DETOXIFICATION OS=Cucumis sativus GN=Csa_4G291900 PE=3 SV=1)

HSP 1 Score: 675.2 bits (1741), Expect = 5.6e-191
Identity = 385/520 (74.04%), Postives = 403/520 (77.50%), Query Frame = 1

Query: 1   MAFSIMSDEDDPYPSWEKTRMPIRIFFKDARHVFKLDELGREIAQIALPAALALAADPVA 60
           MAFSIMS+EDDPYPSW+KT+ PIRIFFK+ARHVFKLDELGREIAQIALPAALALAADPVA
Sbjct: 1   MAFSIMSEEDDPYPSWDKTKTPIRIFFKNARHVFKLDELGREIAQIALPAALALAADPVA 60

Query: 61  SLFQ--------------------------ELQFSPFV--TTSFVAEEDAIGSACNEAKD 120
           SL                             +   P V  TTSFVAEED IGS   EA+D
Sbjct: 61  SLVDTAFIGQIGSVELAAVGVAIALFNQVSRIAIFPLVSVTTSFVAEEDTIGSVSIEAED 120

Query: 121 NNDKETGLFTNDESKLMIPHNGKTE------------------ENGRRYIPSASSALVIG 180
           NND E+G FTNDE   MIP NGK E                  ENGRRYIPSASSALVIG
Sbjct: 121 NNDMESGFFTNDEKSSMIPQNGKGEDAHHSRKPLEKKFENSKVENGRRYIPSASSALVIG 180

Query: 181 GVLGLIQAIFLISGAKPLLNFMGVKSDSPMMTSAQQYLTLRSLGAPAVLLSLAMQGVFRG 240
           GVLGLIQAIFLISGA+PLLNFMGVKSDS MMT AQQYLTLRSLGAPAVLLSLA+QGVFRG
Sbjct: 181 GVLGLIQAIFLISGARPLLNFMGVKSDSLMMTPAQQYLTLRSLGAPAVLLSLAIQGVFRG 240

Query: 241 FKDTKTPLFATVAGDATNIILDPIFIFIFRLGVSGAAIAHVISQYLIALILFWRLMGQVD 300
           FKDTKTPL+ATVAGDATNIILDPIFIF+FRLGVSGAAIAHVISQYLIALILFWRLMGQVD
Sbjct: 241 FKDTKTPLYATVAGDATNIILDPIFIFVFRLGVSGAAIAHVISQYLIALILFWRLMGQVD 300

Query: 301 LLPPSIKHLQFSRFLKNGFLLLMRVIAVTFCVTLAASLAARQGSTSMAAFQVCLQVWLAM 360
           LLPPSIKHLQFSRFLKNGFLLLMRV                       AF VC       
Sbjct: 301 LLPPSIKHLQFSRFLKNGFLLLMRV----------------------IAFTVC------- 360

Query: 361 SLLADGLAVAGQTILASAFAQNDHDKATTAASRVLQLGLLLGLGLGVFVGAGMTFGAKLF 420
               +GL    Q ILA+AFAQNDHDKAT AASRVLQLGL LGL L VF+G GMTFGA+LF
Sbjct: 361 ----NGL----QAILATAFAQNDHDKATAAASRVLQLGLFLGLMLAVFLGVGMTFGARLF 420

Query: 421 TSDVDVLHLIGIGTPFIAAMQPINALAFVFDGINFGASDFAYSAYSMVLVAIISIFCLFI 475
           TSDVDVL LIGIG PF+AA QPINALAFVFDGINFGASDFAYSA SMVLVAIISIFCLFI
Sbjct: 421 TSDVDVLRLIGIGIPFVAATQPINALAFVFDGINFGASDFAYSACSMVLVAIISIFCLFI 480

BLAST of Cp4.1LG05g05060.1 vs. TrEMBL
Match: A0A061GU61_THECC (Protein DETOXIFICATION OS=Theobroma cacao GN=TCM_040698 PE=3 SV=1)

HSP 1 Score: 668.7 bits (1724), Expect = 5.3e-189
Identity = 364/515 (70.68%), Postives = 414/515 (80.39%), Query Frame = 1

Query: 5   IMSDEDDPYPSWEKTRMPIRIFFKDARHVFKLDELGREIAQIALPAALALAADPVASLFQ 64
           +M++EDDPY S  K R+PI IFFKD R+VFKLD+LG EIAQIALPAALAL ADP+ASL  
Sbjct: 1   MMAEEDDPYLSRVKMRLPIFIFFKDVRNVFKLDDLGSEIAQIALPAALALTADPIASLVD 60

Query: 65  EL---QFSPF-------------------------VTTSFVAEEDAIGSACNEAKDNNDK 124
                Q  P                          VTTSFVAEED IG   +EA+++   
Sbjct: 61  TAFIGQIGPVELAAVGVSIALFNQVSRIAIFPLVSVTTSFVAEEDTIGRVSSEAQESECL 120

Query: 125 ETGLFTNDESKLMIPHNGKTE-----------------ENGRRYIPSASSALVIGGVLGL 184
           ETG + N+ESK +IP    +E                 E  RR+IPSASSALVIGG+LGL
Sbjct: 121 ETGSYVNNESKELIPQKESSEGAYQPKTLGGSFDIVKFEPERRHIPSASSALVIGGILGL 180

Query: 185 IQAIFLISGAKPLLNFMGVKSDSPMMTSAQQYLTLRSLGAPAVLLSLAMQGVFRGFKDTK 244
           +QAIFLISGAKPLLNFMGV SDSPM+  AQQYLTLRSLGAPAVLLSLAMQGVFRGFKDTK
Sbjct: 181 LQAIFLISGAKPLLNFMGVSSDSPMLNPAQQYLTLRSLGAPAVLLSLAMQGVFRGFKDTK 240

Query: 245 TPLFATVAGDATNIILDPIFIFIFRLGVSGAAIAHVISQYLIALILFWRLMGQVDLLPPS 304
           TPL+ATVAGD TNIILDPIF+F+F LGVSGAAIAHVISQYLI++IL W+LM QVDLLPPS
Sbjct: 241 TPLYATVAGDVTNIILDPIFMFVFHLGVSGAAIAHVISQYLISVILLWKLMSQVDLLPPS 300

Query: 305 IKHLQFSRFLKNGFLLLMRVIAVTFCVTLAASLAARQGSTSMAAFQVCLQVWLAMSLLAD 364
           +KHLQFSRFLKNGFLLL+RV+AVTFC+TL+AS+AARQGSTSMAAFQVCLQVWLA SLLAD
Sbjct: 301 LKHLQFSRFLKNGFLLLIRVMAVTFCITLSASMAARQGSTSMAAFQVCLQVWLATSLLAD 360

Query: 365 GLAVAGQTILASAFAQNDHDKATTAASRVLQLGLLLGLGLGVFVGAGMTFGAKLFTSDVD 424
           GLAVAGQ ILASAFA+ DH+KAT  ASRVLQLGL+LGL L V +G G++FGAKLFT DV+
Sbjct: 361 GLAVAGQAILASAFAKGDHEKATATASRVLQLGLVLGLILAVVLGGGLSFGAKLFTKDVN 420

Query: 425 VLHLIGIGTPFIAAMQPINALAFVFDGINFGASDFAYSAYSMVLVAIISIFCLFILSSTQ 475
           VLHLIG G PF+AA QPIN+LAFVFDG+NFGASDFAYSA+S+VLVAI+SI CL ILSS++
Sbjct: 421 VLHLIGTGIPFVAATQPINSLAFVFDGVNFGASDFAYSAFSLVLVAIVSIICLSILSSSR 480

BLAST of Cp4.1LG05g05060.1 vs. TrEMBL
Match: M5WPF4_PRUPE (Protein DETOXIFICATION OS=Prunus persica GN=PRUPE_ppa004360mg PE=3 SV=1)

HSP 1 Score: 654.4 bits (1687), Expect = 1.0e-184
Identity = 362/515 (70.29%), Postives = 399/515 (77.48%), Query Frame = 1

Query: 6   MSDEDDPYPSWEKTRMPIRIFFKDARHVFKLDELGREIAQIALPAALALAADPVASLFQE 65
           M++EDD Y +  K++ PI I FKD R VF LD+LG EIA IALPAALAL ADP+ASL   
Sbjct: 1   MAEEDDSYSNGNKSKTPIYILFKDCRFVFNLDKLGLEIASIALPAALALTADPIASLVDT 60

Query: 66  L---QFSPF-------------------------VTTSFVAEEDAIGSACNEAKDNNDKE 125
               Q  P                          VTTSFVAEED IG+A  E   N+  E
Sbjct: 61  AFIGQIGPVELAAVGVSIALFNQASRIAIFPLVSVTTSFVAEEDTIGTASPEENQNDYLE 120

Query: 126 TGLFTNDESKLMIPHNGK-----------------TEENGRRYIPSASSALVIGGVLGLI 185
           TG   N E++ +IP  G                  T  + +RYIPSASSA+VIG +LGLI
Sbjct: 121 TGSSINGETRQLIPERGTDQNAYNSKPVGASFEIVTTNHQKRYIPSASSAMVIGSILGLI 180

Query: 186 QAIFLISGAKPLLNFMGVKSDSPMMTSAQQYLTLRSLGAPAVLLSLAMQGVFRGFKDTKT 245
           QAIFLIS AKPLLNFMGV SDSPM+  AQQYL LRSLGAPAVLLSLAMQGVFRGFKDTKT
Sbjct: 181 QAIFLISAAKPLLNFMGVSSDSPMLKPAQQYLILRSLGAPAVLLSLAMQGVFRGFKDTKT 240

Query: 246 PLFATVAGDATNIILDPIFIFIFRLGVSGAAIAHVISQYLIALILFWRLMGQVDLLPPSI 305
           PL+ATVAGD TNIILDPIF+F+FRLGV+GAAI+HVISQYLI +IL WRLM QVDLLPPSI
Sbjct: 241 PLYATVAGDVTNIILDPIFMFVFRLGVNGAAISHVISQYLICVILLWRLMAQVDLLPPSI 300

Query: 306 KHLQFSRFLKNGFLLLMRVIAVTFCVTLAASLAARQGSTSMAAFQVCLQVWLAMSLLADG 365
           KHLQF RFLKNGFLLLMRVIAVTFCVTLAASLAARQG T MAAFQVCLQVWLA SLLADG
Sbjct: 301 KHLQFGRFLKNGFLLLMRVIAVTFCVTLAASLAARQGPTPMAAFQVCLQVWLATSLLADG 360

Query: 366 LAVAGQTILASAFAQNDHDKATTAASRVLQLGLLLGLGLGVFVGAGMTFGAKLFTSDVDV 425
           LAVAGQ ILASAFA+ DHDKAT  ASRVLQLGL+LGL L V +G G+ +GA+LFT DVDV
Sbjct: 361 LAVAGQAILASAFAKKDHDKATATASRVLQLGLVLGLMLAVILGVGLQYGARLFTKDVDV 420

Query: 426 LHLIGIGTPFIAAMQPINALAFVFDGINFGASDFAYSAYSMVLVAIISIFCLFILSSTQG 476
           LHLI IG PF+AA QPINALAFVFDG+NFGASDFAYSA+SMV+VAI+SIF LFILSST G
Sbjct: 421 LHLISIGIPFVAATQPINALAFVFDGVNFGASDFAYSAFSMVMVAIVSIFVLFILSSTNG 480

BLAST of Cp4.1LG05g05060.1 vs. TrEMBL
Match: A0A067JDZ9_JATCU (Protein DETOXIFICATION OS=Jatropha curcas GN=JCGZ_25922 PE=3 SV=1)

HSP 1 Score: 653.7 bits (1685), Expect = 1.8e-184
Identity = 364/516 (70.54%), Postives = 405/516 (78.49%), Query Frame = 1

Query: 6   MSDEDDP-YPSWEKTRMPIRIFFKDARHVFKLDELGREIAQIALPAALALAADPVASLFQ 65
           M++EDD  YPS EK R+P+ IFFKD RHV K+DELG EIA IALPAALAL ADP+ASL  
Sbjct: 1   MAEEDDASYPSMEKKRIPLCIFFKDFRHVLKMDELGLEIASIALPAALALTADPIASLVD 60

Query: 66  EL---QFSPF-------------------------VTTSFVAEEDAIGSACNEAKDNNDK 125
                Q  P                          VTTSFVAEED IGS   E +D+   
Sbjct: 61  TAFIGQIGPVELAAVGVSIALFNQVSRIAIFPLVSVTTSFVAEEDTIGSVNPEVQDSESL 120

Query: 126 ETGLFTNDESKLMIPHNGKTE-----------------ENGRRYIPSASSALVIGGVLGL 185
           ETG   N ESK +IP N   E                 E+GRR+IPSASSALVIG +LG 
Sbjct: 121 ETGSVVNSESKELIPQNVSGEGAYKSKSAMSSFDIAKMESGRRHIPSASSALVIGAILGF 180

Query: 186 IQAIFLISGAKPLLNFMGVKSDSPMMTSAQQYLTLRSLGAPAVLLSLAMQGVFRGFKDTK 245
           IQAIFLISGAKPLLNFMGV SDSPM+  AQQYLTLRSLGAPAVLLSLAMQGVFRGFKDTK
Sbjct: 181 IQAIFLISGAKPLLNFMGVGSDSPMLRPAQQYLTLRSLGAPAVLLSLAMQGVFRGFKDTK 240

Query: 246 TPLFATVAGDATNIILDPIFIFIFRLGVSGAAIAHVISQYLIALILFWRLMGQVDLLPPS 305
           TPL+ATV GD TNIILDP+F+F+FRLGVSGAAIAHVISQYLI++IL WRLM +VDLLPPS
Sbjct: 241 TPLYATVTGDVTNIILDPVFMFVFRLGVSGAAIAHVISQYLISVILLWRLMEKVDLLPPS 300

Query: 306 IKHLQFSRFLKNGFLLLMRVIAVTFCVTLAASLAARQGSTSMAAFQVCLQVWLAMSLLAD 365
           +KHLQF +FLKNGFLLLMRVIAVTFCVTL+ASLAARQGST+MAAFQVCLQVWLA SLLAD
Sbjct: 301 VKHLQFGKFLKNGFLLLMRVIAVTFCVTLSASLAARQGSTAMAAFQVCLQVWLATSLLAD 360

Query: 366 GLAVAGQTILASAFAQNDHDKATTAASRVLQLGLLLGLGLGVFVGAGMTFGAKLFTSDVD 425
           GLAVAGQ ILA+AFA++D++KAT  ASRVLQLGLLLGL L + +G G++FG++LFTSDV+
Sbjct: 361 GLAVAGQAILATAFAKSDYEKATATASRVLQLGLLLGLMLAIILGVGLSFGSRLFTSDVN 420

Query: 426 VLHLIGIGTPFIAAMQPINALAFVFDGINFGASDFAYSAYSMVLVAIISIFCLFILSSTQ 476
           VL +I IG PF+A  QPINALAFVFDG+NFGASDFAYSAYSMVLVAIISI  L  LSST 
Sbjct: 421 VLRMISIGIPFVAGTQPINALAFVFDGVNFGASDFAYSAYSMVLVAIISILSLLFLSSTY 480

BLAST of Cp4.1LG05g05060.1 vs. TrEMBL
Match: A0A0D2SWN8_GOSRA (Protein DETOXIFICATION OS=Gossypium raimondii GN=B456_006G128800 PE=3 SV=1)

HSP 1 Score: 646.7 bits (1667), Expect = 2.1e-182
Identity = 355/515 (68.93%), Postives = 403/515 (78.25%), Query Frame = 1

Query: 5   IMSDEDDPYPSWEKTRMPIRIFFKDARHVFKLDELGREIAQIALPAALALAADPVASLFQ 64
           +M++EDD YPS  K R PI IFFKD RHVFKLDELG EIAQIALPAALAL ADP+ASL  
Sbjct: 1   MMTEEDDLYPSSVKMRYPIFIFFKDVRHVFKLDELGSEIAQIALPAALALTADPIASLVD 60

Query: 65  --------------------------ELQFSPFV--TTSFVAEEDAIGSACNEAKDNNDK 124
                                      +   P V  TTSFVAEED IG   +EA++++  
Sbjct: 61  TAFIGQIGAVELAAVGVSIALFNQVSRIAIFPLVSVTTSFVAEEDTIGRVSSEAQESDYV 120

Query: 125 ETGLFTNDESKLMIPHNGKTE-----------------ENGRRYIPSASSALVIGGVLGL 184
           ETG   + ES  +IP     E                 E  RR+IPSASSALVIGG+LGL
Sbjct: 121 ETGSCVDTESNELIPQKECIEGTYRPKTLGSSFDVVKIEPERRHIPSASSALVIGGILGL 180

Query: 185 IQAIFLISGAKPLLNFMGVKSDSPMMTSAQQYLTLRSLGAPAVLLSLAMQGVFRGFKDTK 244
           +QA+FLISGAKPLLNFMG+ SDSPM+  AQQYLTLRSLGAPAVLLSLAMQGVFRGFKDTK
Sbjct: 181 LQALFLISGAKPLLNFMGISSDSPMLNPAQQYLTLRSLGAPAVLLSLAMQGVFRGFKDTK 240

Query: 245 TPLFATVAGDATNIILDPIFIFIFRLGVSGAAIAHVISQYLIALILFWRLMGQVDLLPPS 304
           TPL+ATVAGD  NIILDPIF+F+FRLGVSGAAIAHVISQYLI++IL W+LM QVDLLPPS
Sbjct: 241 TPLYATVAGDVANIILDPIFMFVFRLGVSGAAIAHVISQYLISVILLWKLMSQVDLLPPS 300

Query: 305 IKHLQFSRFLKNGFLLLMRVIAVTFCVTLAASLAARQGSTSMAAFQVCLQVWLAMSLLAD 364
           +KHL F RFLKNGFLLL+RV+AVTFC+TL+AS+AAR GSTSMAAFQVCLQVWLA SLLAD
Sbjct: 301 LKHLYFGRFLKNGFLLLIRVMAVTFCITLSASMAARLGSTSMAAFQVCLQVWLATSLLAD 360

Query: 365 GLAVAGQTILASAFAQNDHDKATTAASRVLQLGLLLGLGLGVFVGAGMTFGAKLFTSDVD 424
           GLAVAGQ ILAS+FA+ D++KAT  ASRVLQLGL+LGL L V +G G++FGAKLFT D D
Sbjct: 361 GLAVAGQAILASSFARKDNEKATATASRVLQLGLVLGLILAVILGGGLSFGAKLFTKDAD 420

Query: 425 VLHLIGIGTPFIAAMQPINALAFVFDGINFGASDFAYSAYSMVLVAIISIFCLFILSSTQ 475
           VL LIG G PF+AA QPIN+LAFVFDG+NFGASDFAYSA+S+VLVAI SI CL ILSST 
Sbjct: 421 VLRLIGTGIPFVAATQPINSLAFVFDGVNFGASDFAYSAFSLVLVAIASIICLCILSSTH 480

BLAST of Cp4.1LG05g05060.1 vs. TAIR10
Match: AT1G51340.2 (AT1G51340.2 MATE efflux family protein)

HSP 1 Score: 567.8 bits (1462), Expect = 6.4e-162
Identity = 323/503 (64.21%), Postives = 367/503 (72.96%), Query Frame = 1

Query: 20  RMPIRIFFKDARHVFKLDELGREIAQIALPAALALAADPVASLFQEL---QFSPF----- 79
           R P+ IFF D R V K DELG EIA+IALPAALAL ADP+ASL       Q  P      
Sbjct: 13  RNPLYIFFSDFRSVLKFDELGLEIARIALPAALALTADPIASLVDTAFIGQIGPVELAAV 72

Query: 80  --------------------VTTSFVAEEDAIGSACNEAKDNNDK-ETGLFTNDESKL-M 139
                               +TTSFVAEEDA  S  +  +D+ +  E G+    E  + +
Sbjct: 73  GVSIALFNQVSRIAIFPLVSITTSFVAEEDACSSQQDTVRDHKECIEIGINNPTEETIEL 132

Query: 140 IPHNGKTEENG-----------------RRYIPSASSALVIGGVLGLIQAIFLISGAKPL 199
           IP   K   +                  +R IPSASSAL+IGGVLGL QA+FLIS AKPL
Sbjct: 133 IPEKHKDSLSDEFKTSSSIFSISKPPAKKRNIPSASSALIIGGVLGLFQAVFLISAAKPL 192

Query: 200 LNFMGVKSDSPMMTSAQQYLTLRSLGAPAVLLSLAMQGVFRGFKDTKTPLFATVAGDATN 259
           L+FMGVK DSPMM  +Q+YL+LRSLGAPAVLLSLA QGVFRGFKDT TPLFATV GD TN
Sbjct: 193 LSFMGVKHDSPMMRPSQRYLSLRSLGAPAVLLSLAAQGVFRGFKDTTTPLFATVIGDVTN 252

Query: 260 IILDPIFIFIFRLGVSGAAIAHVISQYLIALILFWRLMGQVDLLPPSIKHLQFSRFLKNG 319
           IILDPIFIF+FRLGV+GAA AHVISQYL+  IL W+LMGQVD+   S KHLQF RF+KNG
Sbjct: 253 IILDPIFIFVFRLGVTGAATAHVISQYLMCGILLWKLMGQVDIFNMSTKHLQFCRFMKNG 312

Query: 320 FLLLMRVIAVTFCVTLAASLAARQGSTSMAAFQVCLQVWLAMSLLADGLAVAGQTILASA 379
           FLLLMRVIAVTFCVTL+ASLAAR+GSTSMAAFQVCLQVWLA SLLADG AVAGQ ILASA
Sbjct: 313 FLLLMRVIAVTFCVTLSASLAAREGSTSMAAFQVCLQVWLATSLLADGYAVAGQAILASA 372

Query: 380 FAQNDHDKATTAASRVLQLGLLLGLGLGVFVGAGMTFGAKLFTSDVDVLHLIGIGTPFIA 439
           FA+ D+ +A   ASRVLQLGL+LG  L V +GAG+ FGA++FT D  VLHLI IG PF+A
Sbjct: 373 FAKKDYKRAAATASRVLQLGLVLGFVLAVILGAGLHFGARVFTKDDKVLHLISIGLPFVA 432

Query: 440 AMQPINALAFVFDGINFGASDFAYSAYSMVLVAIISIFCLFILSSTQGFIGIWVALTIYM 476
             QPINALAFVFDG+NFGASDF Y+A S+V+VAI+SI CL  LSST GFIG+W  LTIYM
Sbjct: 433 GTQPINALAFVFDGVNFGASDFGYAAASLVMVAIVSILCLLFLSSTHGFIGLWFGLTIYM 492

BLAST of Cp4.1LG05g05060.1 vs. TAIR10
Match: AT3G08040.1 (AT3G08040.1 MATE efflux family protein)

HSP 1 Score: 490.7 bits (1262), Expect = 9.9e-139
Identity = 284/507 (56.02%), Postives = 346/507 (68.24%), Query Frame = 1

Query: 18  KTRMPIRIFFKDARHVFKLDELGREIAQIALPAALALAADPVASLFQ------------- 77
           K  +P  + FKD RHVF  D  GREI  IA PAALALAADP+ASL               
Sbjct: 12  KKPIPFLVIFKDLRHVFSRDTTGREILGIAFPAALALAADPIASLIDTAFVGRLGAVQLA 71

Query: 78  -------------ELQFSPFV--TTSFVAEEDAIGSACNEAKDNN-----------DKET 137
                         +   P V  TTSFVAEED +     EA   N             E 
Sbjct: 72  AVGVSIAIFNQASRITIFPLVSLTTSFVAEEDTMEKMKEEANKANLVHAETILVQDSLEK 131

Query: 138 GLFT---NDESKLMIPHNGKTEENG--------RRYIPSASSALVIGGVLGLIQAIFLIS 197
           G+ +   ND ++   P    T+ N         +R I +AS+A+++G +LGL+QAIFLI 
Sbjct: 132 GISSPTSNDTNQPQQPPAPDTKSNSGNKSNKKEKRTIRTASTAMILGLILGLVQAIFLIF 191

Query: 198 GAKPLLNFMGVKSDSPMMTSAQQYLTLRSLGAPAVLLSLAMQGVFRGFKDTKTPLFATVA 257
            +K LL  MGVK +SPM++ A +YL++R+LGAPA+LLSLAMQG+FRGFKDTKTPLFATV 
Sbjct: 192 SSKLLLGVMGVKPNSPMLSPAHKYLSIRALGAPALLLSLAMQGIFRGFKDTKTPLFATVV 251

Query: 258 GDATNIILDPIFIFIFRLGVSGAAIAHVISQYLIALILFWRLMGQVDLLPPSIKHLQFSR 317
            D  NI+LDPIFIF+ RLG+ GAAIAHVISQY + LILF  L  +V+L+PP+   LQF R
Sbjct: 252 ADVINIVLDPIFIFVLRLGIIGAAIAHVISQYFMTLILFVFLAKKVNLIPPNFGDLQFGR 311

Query: 318 FLKNGFLLLMRVIAVTFCVTLAASLAARQGSTSMAAFQVCLQVWLAMSLLADGLAVAGQT 377
           FLKNG LLL R IAVTFC TLAA++AAR G+T MAAFQ+CLQVWL  SLL DGLAVAGQ 
Sbjct: 312 FLKNGLLLLARTIAVTFCQTLAAAMAARLGTTPMAAFQICLQVWLTSSLLNDGLAVAGQA 371

Query: 378 ILASAFAQNDHDKATTAASRVLQLGLLLGLGLGVFVGAGMTFGAKLFTSDVDVLHLIGIG 437
           ILA +FA+ D++K T  ASRVLQ+G +LGLGL VFVG G+ FGA +F+ D  V+HL+ IG
Sbjct: 372 ILACSFAEKDYNKVTAVASRVLQMGFVLGLGLSVFVGLGLYFGAGVFSKDPAVIHLMAIG 431

Query: 438 TPFIAAMQPINALAFVFDGINFGASDFAYSAYSMVLVAIISIFCLFILSSTQGFIGIWVA 475
            PFIAA QPIN+LAFV DG+NFGASDFAY+AYSMV VA ISI  +  ++ T GFIGIW+A
Sbjct: 432 IPFIAATQPINSLAFVLDGVNFGASDFAYTAYSMVGVAAISIAAVIYMAKTNGFIGIWIA 491

BLAST of Cp4.1LG05g05060.1 vs. TAIR10
Match: AT2G38330.1 (AT2G38330.1 MATE efflux family protein)

HSP 1 Score: 330.1 bits (845), Expect = 2.2e-90
Identity = 198/447 (44.30%), Postives = 281/447 (62.86%), Query Frame = 1

Query: 36  LDELGREIAQIALPAALALAADPVASLFQELQFSPFVTTSFVAEEDAIGSACN----EAK 95
           + ++G EI  IALPAALALAADP+ SL      + FV     AE  A+G + +     +K
Sbjct: 73  IGKIGMEIMSIALPAALALAADPITSLVD----TAFVGHIGSAELAAVGVSVSVFNLVSK 132

Query: 96  DNND---KETGLFTNDESKLMIPHNGKTEENGRRYIPSASSALVIGGVLGLIQAIFLISG 155
             N      T  F  +E  +    +  + E  ++ +PS S++LV+   +G+ +AI L  G
Sbjct: 133 LFNVPLLNVTTSFVAEEQAIAAKDDNDSIETSKKVLPSVSTSLVLAAGVGIAEAIALSLG 192

Query: 156 AKPLLNFMGVKSDSPMMTSAQQYLTLRSLGAPAVLLSLAMQGVFRGFKDTKTPLFATVAG 215
           +  L++ M +  DSPM   A+Q+L LR+ GAP ++++LA QG FRGFKDT TPL+A VAG
Sbjct: 193 SDFLMDVMAIPFDSPMRIPAEQFLRLRAYGAPPIVVALAAQGAFRGFKDTTTPLYAVVAG 252

Query: 216 DATNIILDPIFIFIFRLGVSGAAIAHVISQYLIALILFWRLMGQVDLLPPSIKHLQFSRF 275
           +  N +LDPI IF+   G+SGAA A VIS+YLIA IL W+L   V LL P IK  + +++
Sbjct: 253 NVLNAVLDPILIFVLGFGISGAAAATVISEYLIAFILLWKLNENVVLLSPQIKVGRANQY 312

Query: 276 LKNGFLLLMRVIAVTFCVTLAASLAARQGSTSMAAFQVCLQVWLAMSLLADGLAVAGQTI 335
           LK+G LL+ R +A+    TLA SLAA+ G T MA  Q+ L++WLA+SLL D LA+A Q++
Sbjct: 313 LKSGGLLIGRTVALLVPFTLATSLAAQNGPTQMAGHQIVLEIWLAVSLLTDALAIAAQSL 372

Query: 336 LASAFAQNDHDKATTAASRVLQLGLLLGLGLGVFVGAGMTFGAKLFTSDVDVLHLIGIGT 395
           LA+ ++Q ++ +A      VLQ+GL  G GL   +       + LFT+D +VL +   GT
Sbjct: 373 LATTYSQGEYKQAREVLFGVLQVGLATGTGLAAVLFITFEPFSSLFTTDSEVLKIALSGT 432

Query: 396 PFIAAMQPINALAFVFDGINFGASDFAYSAYSMVLVAIISIFCLFILSSTQGFIGIWVAL 455
            F+A  QP+NALAFV DG+ +G SDF ++AYSMV+V  IS   + + + T G  GIW  L
Sbjct: 433 LFVAGSQPVNALAFVLDGLYYGVSDFGFAAYSMVIVGFISSLFMLVAAPTFGLAGIWTGL 492

Query: 456 TIYMSLRTLAGFWRVGTGTGPWYFLQS 476
            ++M+LR +AG WR+GT TGPW  L S
Sbjct: 493 FLFMALRLVAGAWRLGTRTGPWKMLWS 515

BLAST of Cp4.1LG05g05060.1 vs. TAIR10
Match: AT4G38380.1 (AT4G38380.1 MATE efflux family protein)

HSP 1 Score: 293.9 bits (751), Expect = 1.8e-79
Identity = 171/404 (42.33%), Postives = 262/404 (64.85%), Query Frame = 1

Query: 72  VTTSFVAEEDAIGSACNEAKDNNDKETGLFTNDESKLMIPHNGKTEENGRRYIPSASSAL 131
           V TSFVAE+ A  +A + A +++  +            IP  G  E   R+ + S S+AL
Sbjct: 166 VATSFVAEDIAKIAAQDLASEDSQSD------------IPSQGLPE---RKQLSSVSTAL 225

Query: 132 VIGGVLGLIQAIFLISGAKPLLNFMGVKSDSPMMTSAQQYLTLRSLGAPAVLLSLAMQGV 191
           V+   +G+ +A+ L   + P L  MG++S S M   A+Q+L LR+LGAPA ++SLA+QG+
Sbjct: 226 VLAIGIGIFEALALSLASGPFLRLMGIQSMSEMFIPARQFLVLRALGAPAYVVSLALQGI 285

Query: 192 FRGFKDTKTPLFATVAGDATNIILDPIFIFIFRLGVSGAAIAHVISQYLIALILFWRLMG 251
           FRGFKDTKTP++    G+   + L P+FI+ FR+GV+GAAI+ VISQY +A+++   L  
Sbjct: 286 FRGFKDTKTPVYCLGIGNFLAVFLFPLFIYKFRMGVAGAAISSVISQYTVAILMLILLNK 345

Query: 252 QVDLLPPSIKHLQFSRFLKNGFLLLMRVIAVTFCVTLAASLAARQGSTSMAAFQVCLQVW 311
           +V LLPP I  L+F  +LK+G  +L R ++V   +T+A S+AARQG  +MAA Q+C+QVW
Sbjct: 346 RVILLPPKIGSLKFGDYLKSGGFVLGRTLSVLVTMTVATSMAARQGVFAMAAHQICMQVW 405

Query: 312 LAMSLLADGLAVAGQTILASAFAQNDHDKATTAASRVLQLGLLLGLGLGVFVGAGMTFGA 371
           LA+SLL D LA +GQ ++AS+ ++ D +      + VL++G++ G+ L + +G   +  A
Sbjct: 406 LAVSLLTDALASSGQALIASSASKRDFEGVKEVTTFVLKIGVVTGIALAIVLGMSFSSIA 465

Query: 372 KLFTSDVDVLHLIGIGTPFIAAMQPINALAFVFDGINFGASDFAYSAYSMVLVAIISIFC 431
            LF+ D +VL ++  G  F+AA QPI ALAF+FDG+++G SDF Y+A SM++V  IS   
Sbjct: 466 GLFSKDPEVLRIVRKGVLFVAATQPITALAFIFDGLHYGMSDFPYAACSMMVVGGISSAF 525

Query: 432 LFILSSTQGFIGIWVALTIYMSLRTLAGFWRVGTGTGPWYFLQS 476
           +    +  G  G+WV L+++M LR +AGF R+    GPW+F+ +
Sbjct: 526 MLYAPAGLGLSGVWVGLSMFMGLRMVAGFSRLMWRKGPWWFMHT 554

BLAST of Cp4.1LG05g05060.1 vs. NCBI nr
Match: gi|659097607|ref|XP_008449716.1| (PREDICTED: MATE efflux family protein 1 [Cucumis melo])

HSP 1 Score: 767.7 bits (1981), Expect = 1.2e-218
Identity = 423/521 (81.19%), Postives = 441/521 (84.64%), Query Frame = 1

Query: 1   MAFSIMSDEDDPYPSWEKTRMPIRIFFKDARHVFKLDELGREIAQIALPAALALAADPVA 60
           MAFSIMS+EDDPYP+W+KT+ PIRIFFKDAR+VFKLDELGREIA+IALPAALALAADPVA
Sbjct: 1   MAFSIMSEEDDPYPAWDKTKTPIRIFFKDARNVFKLDELGREIARIALPAALALAADPVA 60

Query: 61  SLFQ--------------------------ELQFSPFV--TTSFVAEEDAIGSACNEAKD 120
           SL                             +   P V  TTSFVAEED IGS   EA+D
Sbjct: 61  SLVDTAFIGQIGSVELAAVGVAIALFNQVSRIAIFPLVSVTTSFVAEEDTIGSVSIEAED 120

Query: 121 NNDKETGLFTNDESKLMIPHNGKTE------------------ENGRRYIPSASSALVIG 180
           NND ETG FTNDE   MIP NGK E                  ENGRRYIPSASSALVIG
Sbjct: 121 NNDTETGFFTNDEKSSMIPQNGKGEDAHHSRKPLDTTFENGKVENGRRYIPSASSALVIG 180

Query: 181 GVLGLIQAIFLISGAKPLLNFMGVKSDSPMMTSAQQYLTLRSLGAPAVLLSLAMQGVFRG 240
           GVLGLIQAIFLISGA+PLLNFMGVKSDS MMT AQQYLTLRSLGAPAVLLSLA+QGVFRG
Sbjct: 181 GVLGLIQAIFLISGARPLLNFMGVKSDSLMMTPAQQYLTLRSLGAPAVLLSLAIQGVFRG 240

Query: 241 FKDTKTPLFATVAGDATNIILDPIFIFIFRLGVSGAAIAHVISQYLIALILFWRLMGQVD 300
           FKDTKTPL+ATVAGDATNIILDPIFIFIFRLGVSGAAIAHVISQYLIALILFWRLMGQVD
Sbjct: 241 FKDTKTPLYATVAGDATNIILDPIFIFIFRLGVSGAAIAHVISQYLIALILFWRLMGQVD 300

Query: 301 LLPPSIKHLQFSRFLKNGFLLLMRVIAVTFCVTLAASLAARQGSTSMAAFQVCLQVWLAM 360
           LLPPSIKHLQFSRFLKNGFLLLMRVIAVTFCVTLAASLAARQGSTSMAAFQVCLQVWL  
Sbjct: 301 LLPPSIKHLQFSRFLKNGFLLLMRVIAVTFCVTLAASLAARQGSTSMAAFQVCLQVWLTT 360

Query: 361 SLLADGLAVAGQTILASAFAQNDHDKATTAASRVLQLGLLLGLGLGVFVGAGMTFGAKLF 420
           SLLADGLAVAGQ ILA+AFAQNDHDKAT AASRVLQLGL LGL L VF+G GMTFGAKLF
Sbjct: 361 SLLADGLAVAGQAILATAFAQNDHDKATAAASRVLQLGLFLGLMLSVFLGVGMTFGAKLF 420

Query: 421 TSDVDVLHLIGIGTPFIAAMQPINALAFVFDGINFGASDFAYSAYSMVLVAIISIFCLFI 476
           TSDVDVL  IGIG PF+AA QPINALAFVFDGINFGASDFAYSAYSMVLVAIISIFCLFI
Sbjct: 421 TSDVDVLRFIGIGIPFVAATQPINALAFVFDGINFGASDFAYSAYSMVLVAIISIFCLFI 480

BLAST of Cp4.1LG05g05060.1 vs. NCBI nr
Match: gi|449448721|ref|XP_004142114.1| (PREDICTED: MATE efflux family protein 1 [Cucumis sativus])

HSP 1 Score: 765.8 bits (1976), Expect = 4.5e-218
Identity = 422/520 (81.15%), Postives = 440/520 (84.62%), Query Frame = 1

Query: 1   MAFSIMSDEDDPYPSWEKTRMPIRIFFKDARHVFKLDELGREIAQIALPAALALAADPVA 60
           MAFSIMS+EDDPYPSW+KT+ PIRIFFK+ARHVFKLDELGREIAQIALPAALALAADPVA
Sbjct: 1   MAFSIMSEEDDPYPSWDKTKTPIRIFFKNARHVFKLDELGREIAQIALPAALALAADPVA 60

Query: 61  SLFQ--------------------------ELQFSPFV--TTSFVAEEDAIGSACNEAKD 120
           SL                             +   P V  TTSFVAEED IGS   EA+D
Sbjct: 61  SLVDTAFIGQIGSVELAAVGVAIALFNQVSRIAIFPLVSVTTSFVAEEDTIGSVSIEAED 120

Query: 121 NNDKETGLFTNDESKLMIPHNGKTE------------------ENGRRYIPSASSALVIG 180
           NND E+G FTNDE   MIP NGK E                  ENGRRYIPSASSALVIG
Sbjct: 121 NNDMESGFFTNDEKSSMIPQNGKGEDAHHSRKPLEKKFENSKVENGRRYIPSASSALVIG 180

Query: 181 GVLGLIQAIFLISGAKPLLNFMGVKSDSPMMTSAQQYLTLRSLGAPAVLLSLAMQGVFRG 240
           GVLGLIQAIFLISGA+PLLNFMGVKSDS MMT AQQYLTLRSLGAPAVLLSLA+QGVFRG
Sbjct: 181 GVLGLIQAIFLISGARPLLNFMGVKSDSLMMTPAQQYLTLRSLGAPAVLLSLAIQGVFRG 240

Query: 241 FKDTKTPLFATVAGDATNIILDPIFIFIFRLGVSGAAIAHVISQYLIALILFWRLMGQVD 300
           FKDTKTPL+ATVAGDATNIILDPIFIF+FRLGVSGAAIAHVISQYLIALILFWRLMGQVD
Sbjct: 241 FKDTKTPLYATVAGDATNIILDPIFIFVFRLGVSGAAIAHVISQYLIALILFWRLMGQVD 300

Query: 301 LLPPSIKHLQFSRFLKNGFLLLMRVIAVTFCVTLAASLAARQGSTSMAAFQVCLQVWLAM 360
           LLPPSIKHLQFSRFLKNGFLLLMRVIAVTFCVTLAASL+ARQGSTSMAAFQVCLQVWL  
Sbjct: 301 LLPPSIKHLQFSRFLKNGFLLLMRVIAVTFCVTLAASLSARQGSTSMAAFQVCLQVWLTT 360

Query: 361 SLLADGLAVAGQTILASAFAQNDHDKATTAASRVLQLGLLLGLGLGVFVGAGMTFGAKLF 420
           SLLADGLAVAGQ ILA+AFAQNDHDKAT AASRVLQLGL LGL L VF+G GMTFGA+LF
Sbjct: 361 SLLADGLAVAGQAILATAFAQNDHDKATAAASRVLQLGLFLGLMLAVFLGVGMTFGARLF 420

Query: 421 TSDVDVLHLIGIGTPFIAAMQPINALAFVFDGINFGASDFAYSAYSMVLVAIISIFCLFI 475
           TSDVDVL LIGIG PF+AA QPINALAFVFDGINFGASDFAYSA SMVLVAIISIFCLFI
Sbjct: 421 TSDVDVLRLIGIGIPFVAATQPINALAFVFDGINFGASDFAYSACSMVLVAIISIFCLFI 480

BLAST of Cp4.1LG05g05060.1 vs. NCBI nr
Match: gi|700199020|gb|KGN54178.1| (hypothetical protein Csa_4G291900 [Cucumis sativus])

HSP 1 Score: 675.2 bits (1741), Expect = 8.1e-191
Identity = 385/520 (74.04%), Postives = 403/520 (77.50%), Query Frame = 1

Query: 1   MAFSIMSDEDDPYPSWEKTRMPIRIFFKDARHVFKLDELGREIAQIALPAALALAADPVA 60
           MAFSIMS+EDDPYPSW+KT+ PIRIFFK+ARHVFKLDELGREIAQIALPAALALAADPVA
Sbjct: 1   MAFSIMSEEDDPYPSWDKTKTPIRIFFKNARHVFKLDELGREIAQIALPAALALAADPVA 60

Query: 61  SLFQ--------------------------ELQFSPFV--TTSFVAEEDAIGSACNEAKD 120
           SL                             +   P V  TTSFVAEED IGS   EA+D
Sbjct: 61  SLVDTAFIGQIGSVELAAVGVAIALFNQVSRIAIFPLVSVTTSFVAEEDTIGSVSIEAED 120

Query: 121 NNDKETGLFTNDESKLMIPHNGKTE------------------ENGRRYIPSASSALVIG 180
           NND E+G FTNDE   MIP NGK E                  ENGRRYIPSASSALVIG
Sbjct: 121 NNDMESGFFTNDEKSSMIPQNGKGEDAHHSRKPLEKKFENSKVENGRRYIPSASSALVIG 180

Query: 181 GVLGLIQAIFLISGAKPLLNFMGVKSDSPMMTSAQQYLTLRSLGAPAVLLSLAMQGVFRG 240
           GVLGLIQAIFLISGA+PLLNFMGVKSDS MMT AQQYLTLRSLGAPAVLLSLA+QGVFRG
Sbjct: 181 GVLGLIQAIFLISGARPLLNFMGVKSDSLMMTPAQQYLTLRSLGAPAVLLSLAIQGVFRG 240

Query: 241 FKDTKTPLFATVAGDATNIILDPIFIFIFRLGVSGAAIAHVISQYLIALILFWRLMGQVD 300
           FKDTKTPL+ATVAGDATNIILDPIFIF+FRLGVSGAAIAHVISQYLIALILFWRLMGQVD
Sbjct: 241 FKDTKTPLYATVAGDATNIILDPIFIFVFRLGVSGAAIAHVISQYLIALILFWRLMGQVD 300

Query: 301 LLPPSIKHLQFSRFLKNGFLLLMRVIAVTFCVTLAASLAARQGSTSMAAFQVCLQVWLAM 360
           LLPPSIKHLQFSRFLKNGFLLLMRV                       AF VC       
Sbjct: 301 LLPPSIKHLQFSRFLKNGFLLLMRV----------------------IAFTVC------- 360

Query: 361 SLLADGLAVAGQTILASAFAQNDHDKATTAASRVLQLGLLLGLGLGVFVGAGMTFGAKLF 420
               +GL    Q ILA+AFAQNDHDKAT AASRVLQLGL LGL L VF+G GMTFGA+LF
Sbjct: 361 ----NGL----QAILATAFAQNDHDKATAAASRVLQLGLFLGLMLAVFLGVGMTFGARLF 420

Query: 421 TSDVDVLHLIGIGTPFIAAMQPINALAFVFDGINFGASDFAYSAYSMVLVAIISIFCLFI 475
           TSDVDVL LIGIG PF+AA QPINALAFVFDGINFGASDFAYSA SMVLVAIISIFCLFI
Sbjct: 421 TSDVDVLRLIGIGIPFVAATQPINALAFVFDGINFGASDFAYSACSMVLVAIISIFCLFI 480

BLAST of Cp4.1LG05g05060.1 vs. NCBI nr
Match: gi|590583985|ref|XP_007015048.1| (MATE efflux family protein isoform 1 [Theobroma cacao])

HSP 1 Score: 668.7 bits (1724), Expect = 7.6e-189
Identity = 364/515 (70.68%), Postives = 414/515 (80.39%), Query Frame = 1

Query: 5   IMSDEDDPYPSWEKTRMPIRIFFKDARHVFKLDELGREIAQIALPAALALAADPVASLFQ 64
           +M++EDDPY S  K R+PI IFFKD R+VFKLD+LG EIAQIALPAALAL ADP+ASL  
Sbjct: 1   MMAEEDDPYLSRVKMRLPIFIFFKDVRNVFKLDDLGSEIAQIALPAALALTADPIASLVD 60

Query: 65  EL---QFSPF-------------------------VTTSFVAEEDAIGSACNEAKDNNDK 124
                Q  P                          VTTSFVAEED IG   +EA+++   
Sbjct: 61  TAFIGQIGPVELAAVGVSIALFNQVSRIAIFPLVSVTTSFVAEEDTIGRVSSEAQESECL 120

Query: 125 ETGLFTNDESKLMIPHNGKTE-----------------ENGRRYIPSASSALVIGGVLGL 184
           ETG + N+ESK +IP    +E                 E  RR+IPSASSALVIGG+LGL
Sbjct: 121 ETGSYVNNESKELIPQKESSEGAYQPKTLGGSFDIVKFEPERRHIPSASSALVIGGILGL 180

Query: 185 IQAIFLISGAKPLLNFMGVKSDSPMMTSAQQYLTLRSLGAPAVLLSLAMQGVFRGFKDTK 244
           +QAIFLISGAKPLLNFMGV SDSPM+  AQQYLTLRSLGAPAVLLSLAMQGVFRGFKDTK
Sbjct: 181 LQAIFLISGAKPLLNFMGVSSDSPMLNPAQQYLTLRSLGAPAVLLSLAMQGVFRGFKDTK 240

Query: 245 TPLFATVAGDATNIILDPIFIFIFRLGVSGAAIAHVISQYLIALILFWRLMGQVDLLPPS 304
           TPL+ATVAGD TNIILDPIF+F+F LGVSGAAIAHVISQYLI++IL W+LM QVDLLPPS
Sbjct: 241 TPLYATVAGDVTNIILDPIFMFVFHLGVSGAAIAHVISQYLISVILLWKLMSQVDLLPPS 300

Query: 305 IKHLQFSRFLKNGFLLLMRVIAVTFCVTLAASLAARQGSTSMAAFQVCLQVWLAMSLLAD 364
           +KHLQFSRFLKNGFLLL+RV+AVTFC+TL+AS+AARQGSTSMAAFQVCLQVWLA SLLAD
Sbjct: 301 LKHLQFSRFLKNGFLLLIRVMAVTFCITLSASMAARQGSTSMAAFQVCLQVWLATSLLAD 360

Query: 365 GLAVAGQTILASAFAQNDHDKATTAASRVLQLGLLLGLGLGVFVGAGMTFGAKLFTSDVD 424
           GLAVAGQ ILASAFA+ DH+KAT  ASRVLQLGL+LGL L V +G G++FGAKLFT DV+
Sbjct: 361 GLAVAGQAILASAFAKGDHEKATATASRVLQLGLVLGLILAVVLGGGLSFGAKLFTKDVN 420

Query: 425 VLHLIGIGTPFIAAMQPINALAFVFDGINFGASDFAYSAYSMVLVAIISIFCLFILSSTQ 475
           VLHLIG G PF+AA QPIN+LAFVFDG+NFGASDFAYSA+S+VLVAI+SI CL ILSS++
Sbjct: 421 VLHLIGTGIPFVAATQPINSLAFVFDGVNFGASDFAYSAFSLVLVAIVSIICLSILSSSR 480

BLAST of Cp4.1LG05g05060.1 vs. NCBI nr
Match: gi|802763325|ref|XP_012089999.1| (PREDICTED: MATE efflux family protein 1 isoform X1 [Jatropha curcas])

HSP 1 Score: 654.4 bits (1687), Expect = 1.5e-184
Identity = 364/517 (70.41%), Postives = 406/517 (78.53%), Query Frame = 1

Query: 5   IMSDEDDP-YPSWEKTRMPIRIFFKDARHVFKLDELGREIAQIALPAALALAADPVASLF 64
           +M++EDD  YPS EK R+P+ IFFKD RHV K+DELG EIA IALPAALAL ADP+ASL 
Sbjct: 1   MMAEEDDASYPSMEKKRIPLCIFFKDFRHVLKMDELGLEIASIALPAALALTADPIASLV 60

Query: 65  QEL---QFSPF-------------------------VTTSFVAEEDAIGSACNEAKDNND 124
                 Q  P                          VTTSFVAEED IGS   E +D+  
Sbjct: 61  DTAFIGQIGPVELAAVGVSIALFNQVSRIAIFPLVSVTTSFVAEEDTIGSVNPEVQDSES 120

Query: 125 KETGLFTNDESKLMIPHNGKTE-----------------ENGRRYIPSASSALVIGGVLG 184
            ETG   N ESK +IP N   E                 E+GRR+IPSASSALVIG +LG
Sbjct: 121 LETGSVVNSESKELIPQNVSGEGAYKSKSAMSSFDIAKMESGRRHIPSASSALVIGAILG 180

Query: 185 LIQAIFLISGAKPLLNFMGVKSDSPMMTSAQQYLTLRSLGAPAVLLSLAMQGVFRGFKDT 244
            IQAIFLISGAKPLLNFMGV SDSPM+  AQQYLTLRSLGAPAVLLSLAMQGVFRGFKDT
Sbjct: 181 FIQAIFLISGAKPLLNFMGVGSDSPMLRPAQQYLTLRSLGAPAVLLSLAMQGVFRGFKDT 240

Query: 245 KTPLFATVAGDATNIILDPIFIFIFRLGVSGAAIAHVISQYLIALILFWRLMGQVDLLPP 304
           KTPL+ATV GD TNIILDP+F+F+FRLGVSGAAIAHVISQYLI++IL WRLM +VDLLPP
Sbjct: 241 KTPLYATVTGDVTNIILDPVFMFVFRLGVSGAAIAHVISQYLISVILLWRLMEKVDLLPP 300

Query: 305 SIKHLQFSRFLKNGFLLLMRVIAVTFCVTLAASLAARQGSTSMAAFQVCLQVWLAMSLLA 364
           S+KHLQF +FLKNGFLLLMRVIAVTFCVTL+ASLAARQGST+MAAFQVCLQVWLA SLLA
Sbjct: 301 SVKHLQFGKFLKNGFLLLMRVIAVTFCVTLSASLAARQGSTAMAAFQVCLQVWLATSLLA 360

Query: 365 DGLAVAGQTILASAFAQNDHDKATTAASRVLQLGLLLGLGLGVFVGAGMTFGAKLFTSDV 424
           DGLAVAGQ ILA+AFA++D++KAT  ASRVLQLGLLLGL L + +G G++FG++LFTSDV
Sbjct: 361 DGLAVAGQAILATAFAKSDYEKATATASRVLQLGLLLGLMLAIILGVGLSFGSRLFTSDV 420

Query: 425 DVLHLIGIGTPFIAAMQPINALAFVFDGINFGASDFAYSAYSMVLVAIISIFCLFILSST 476
           +VL +I IG PF+A  QPINALAFVFDG+NFGASDFAYSAYSMVLVAIISI  L  LSST
Sbjct: 421 NVLRMISIGIPFVAGTQPINALAFVFDGVNFGASDFAYSAYSMVLVAIISILSLLFLSST 480

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
DTX42_ARATH1.1e-16064.21Protein DETOXIFICATION 42 OS=Arabidopsis thaliana GN=DTX42 PE=2 SV=2[more]
DTX43_ARATH1.8e-13756.02Protein DETOXIFICATION 43 OS=Arabidopsis thaliana GN=DTX43 PE=1 SV=1[more]
DTX44_ARATH4.0e-8944.30Protein DETOXIFICATION 44, chloroplastic OS=Arabidopsis thaliana GN=DTX44 PE=2 S... [more]
DTX45_ARATH3.2e-7842.33Protein DETOXIFICATION 45, chloroplastic OS=Arabidopsis thaliana GN=DTX45 PE=2 S... [more]
DINF_ECOLI3.3e-1125.92DNA-damage-inducible protein F OS=Escherichia coli (strain K12) GN=dinF PE=2 SV=... [more]
Match NameE-valueIdentityDescription
A0A0A0KXA7_CUCSA5.6e-19174.04Protein DETOXIFICATION OS=Cucumis sativus GN=Csa_4G291900 PE=3 SV=1[more]
A0A061GU61_THECC5.3e-18970.68Protein DETOXIFICATION OS=Theobroma cacao GN=TCM_040698 PE=3 SV=1[more]
M5WPF4_PRUPE1.0e-18470.29Protein DETOXIFICATION OS=Prunus persica GN=PRUPE_ppa004360mg PE=3 SV=1[more]
A0A067JDZ9_JATCU1.8e-18470.54Protein DETOXIFICATION OS=Jatropha curcas GN=JCGZ_25922 PE=3 SV=1[more]
A0A0D2SWN8_GOSRA2.1e-18268.93Protein DETOXIFICATION OS=Gossypium raimondii GN=B456_006G128800 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT1G51340.26.4e-16264.21 MATE efflux family protein[more]
AT3G08040.19.9e-13956.02 MATE efflux family protein[more]
AT2G38330.12.2e-9044.30 MATE efflux family protein[more]
AT4G38380.11.8e-7942.33 MATE efflux family protein[more]
Match NameE-valueIdentityDescription
gi|659097607|ref|XP_008449716.1|1.2e-21881.19PREDICTED: MATE efflux family protein 1 [Cucumis melo][more]
gi|449448721|ref|XP_004142114.1|4.5e-21881.15PREDICTED: MATE efflux family protein 1 [Cucumis sativus][more]
gi|700199020|gb|KGN54178.1|8.1e-19174.04hypothetical protein Csa_4G291900 [Cucumis sativus][more]
gi|590583985|ref|XP_007015048.1|7.6e-18970.68MATE efflux family protein isoform 1 [Theobroma cacao][more]
gi|802763325|ref|XP_012089999.1|1.5e-18470.41PREDICTED: MATE efflux family protein 1 isoform X1 [Jatropha curcas][more]
The following terms have been associated with this mRNA:
Vocabulary: Biological Process
TermDefinition
GO:0055085transmembrane transport
GO:0006855drug transmembrane transport
Vocabulary: Cellular Component
TermDefinition
GO:0016020membrane
Vocabulary: Molecular Function
TermDefinition
GO:0015297antiporter activity
GO:0015238drug transmembrane transporter activity
Vocabulary: INTERPRO
TermDefinition
IPR002528MATE_fam
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006855 drug transmembrane transport
biological_process GO:0055085 transmembrane transport
cellular_component GO:0016020 membrane
molecular_function GO:0015297 antiporter activity
molecular_function GO:0015238 drug transmembrane transporter activity

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cp4.1LG05g05060Cp4.1LG05g05060gene


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG05g05060.1:five_prime_utr:003Cp4.1LG05g05060.1:five_prime_utr:003five_prime_UTR
Cp4.1LG05g05060.1:five_prime_utr:002Cp4.1LG05g05060.1:five_prime_utr:002five_prime_UTR
Cp4.1LG05g05060.1:five_prime_utr:001Cp4.1LG05g05060.1:five_prime_utr:001five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG05g05060.1:cds:014Cp4.1LG05g05060.1:cds:014CDS
Cp4.1LG05g05060.1:cds:013Cp4.1LG05g05060.1:cds:013CDS
Cp4.1LG05g05060.1:cds:012Cp4.1LG05g05060.1:cds:012CDS
Cp4.1LG05g05060.1:cds:011Cp4.1LG05g05060.1:cds:011CDS
Cp4.1LG05g05060.1:cds:010Cp4.1LG05g05060.1:cds:010CDS
Cp4.1LG05g05060.1:cds:009Cp4.1LG05g05060.1:cds:009CDS
Cp4.1LG05g05060.1:cds:008Cp4.1LG05g05060.1:cds:008CDS
Cp4.1LG05g05060.1:cds:007Cp4.1LG05g05060.1:cds:007CDS
Cp4.1LG05g05060.1:cds:006Cp4.1LG05g05060.1:cds:006CDS
Cp4.1LG05g05060.1:cds:005Cp4.1LG05g05060.1:cds:005CDS
Cp4.1LG05g05060.1:cds:004Cp4.1LG05g05060.1:cds:004CDS
Cp4.1LG05g05060.1:cds:003Cp4.1LG05g05060.1:cds:003CDS
Cp4.1LG05g05060.1:cds:002Cp4.1LG05g05060.1:cds:002CDS
Cp4.1LG05g05060.1:cds:001Cp4.1LG05g05060.1:cds:001CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG05g05060.1:three_prime_utr:001Cp4.1LG05g05060.1:three_prime_utr:001three_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cp4.1LG05g05060.1Cp4.1LG05g05060.1-proteinpolypeptide


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002528Multi antimicrobial extrusion proteinPFAMPF01554MatEcoord: 277..430
score: 2.7E-12coord: 127..218
score: 1.4
IPR002528Multi antimicrobial extrusion proteinTIGRFAMsTIGR00797TIGR00797coord: 72..445
score: 2.3E
NoneNo IPR availablePANTHERPTHR11206MULTIDRUG RESISTANCE PROTEINcoord: 8..79
score: 1.5E-273coord: 114..474
score: 1.5E
NoneNo IPR availablePANTHERPTHR11206:SF128MATE EFFLUX FAMILY PROTEIN 1coord: 8..79
score: 1.5E-273coord: 114..474
score: 1.5E