Previous Article | Next Article ![]()
Applied and Environmental Microbiology, April 2006, p. 2905-2917, Vol. 72, No. 4
0099-2240/06/$08.00+0 doi:10.1128/AEM.72.4.2905-2917.2006
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Jaap Kingma,1
Michael Arand,2
Marcel G. Wubbolts,3 and
Dick B. Janssen1*
Biochemical Laboratory, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Nijenborgh 4, 9747 AG Groningen, The Netherlands,1 Institute of Pharmacology and Toxicology, University of Zürich, Winterthurerstrasse 190, CH-8057 Zürich, Switzerland,2 DSM Pharma Chemicals, Advanced Synthesis, Catalysis & Development, DSM Research, P.O. Box 18, 6160 MD Geleen, The Netherlands3
Received 5 September 2005/ Accepted 19 January 2006
|
|
|---|
/ß-hydrolase fold enzymes that possess a nucleophilic aspartate revealed that these enzymes can be classified into eight phylogenetic groups that all contain putative epoxide hydrolases. To determine their catalytic activities, 10 putative bacterial epoxide hydrolase genes and 2 known bacterial epoxide hydrolase genes were cloned and overexpressed in Escherichia coli. The production of active enzyme was strongly improved by fusion to the maltose binding protein (MalE), which prevented inclusion body formation and facilitated protein purification. Eight of the 12 fusion proteins were active toward one or more of the 21 epoxides that were tested, and they converted both terminal and nonterminal epoxides. Four of the new epoxide hydrolases showed an uncommon enantiopreference for meso-epoxides and/or terminal aromatic epoxides, which made them suitable for the production of enantiopure (S,S)-diols and (R)-epoxides. The results show that the expression of epoxide hydrolase genes that are detected by analyses of genomic databases is a useful strategy for obtaining new biocatalysts. |
|
|---|
Since many microbial genome sequences are available in the public domain, it is useful to screen these databases for genes that might encode new enzymes with interesting properties. Novel epoxide hydrolases can be identified by performing a BLAST search of the genomic databases, using amino acid sequences of known epoxide hydrolases as queries. This approach will result in putative epoxide hydrolases but also in amino acid sequences from structurally and mechanistically related enzymes, such as esterases and dehalogenases (33), which can be filtered out using conserved epoxide hydrolase sequence motifs that define the active site (Fig. 1). The putative epoxide hydrolase-encoding genes can subsequently be cloned and overexpressed in a host with no endogenous epoxide hydrolase activity, such as Escherichia coli, and the activity of the encoded proteins can be tested.
![]() View larger version (35K): [in a new window] |
FIG. 1. Structure and mechanism of /ß-hydrolase fold epoxide hydrolases. The active site shown in panels A and B is from the epoxide hydrolase from A. radiobacter AD1 (EchA) (31, 35, 36). (A) Tyrosine-assisted ring opening of the epoxide by the nucleophilic aspartate. (B) Hydrolysis of the alkyl-enzyme intermediate by an activated water molecule. (C) General topology and structure of /ß-hydrolase fold epoxide hydrolases. I, main domain with /ß-hydrolase fold; II, cap domain; 1, catalytic nucleophile (D107 in EchA); 2, histidine base (H275); 3, charge relay acid, which can be positioned at either 3a or 3b (D246 is at position 3b); 4, ring-opening tyrosines (Y152/Y215); 5, H-G-X-P motif (the X is usually an aromatic residue in epoxide hydrolases [W38]); 6, G-X-Sm-X-S/T motif. The region of helix 4a can be nonhelical or missing in some epoxide hydrolases. The length of the loop between helixes 6 and 7 can vary considerably. Extensions with additional domains of 40 to 300 amino acids can occur at both the N and C termini.
|
/ß-hydrolase fold family, to which lipases, esterases, and haloalkane dehalogenases also belong. These enzymes consist of a main domain that is composed of a central ß-sheet surrounded by
-helices and a variable cap domain positioned on top of the substrate binding site (Fig. 1) (33). They use a catalytic mechanism that involves an Asp/Ser/Cys-His-Asp/Glu (nucleophile-histidine-acid) catalytic triad located on top loops of the main domain. The positions of these residues are structurally conserved in the order nucleophile-histidine-acid (Fig. 1A and B) (33), but in the primary amino acid sequence the order is nucleophile-acid-histidine, with the acid located either in front of the cap domain (Fig. 1C, position 3a) or after the cap domain (Fig. 1C, position 3b).
In the case of
/ß-hydrolase fold epoxide hydrolases, the catalytic triad nucleophile is an invariable aspartate that opens the epoxide ring by nucleophilic attack (Fig. 1A) (4, 35). The ring-opening reaction is assisted by two conserved tyrosines that are located in the cap domain (Fig. 1C) (6, 36). The resulting alkyl-enzyme intermediate is subsequently hydrolyzed by a water molecule that is activated by a histidine that functions as a proton acceptor and, in turn, is assisted by the acidic residue (Fig. 1B). The negative charge that develops on the carbonyl oxygen of the nucleophilic aspartate during hydrolysis of the alkyl enzyme intermediate is stabilized by two backbone amides that are contributed by the residue following the catalytic nucleophile and residue X in a conserved H-G-X-P motif (Fig. 1C). This motif is located between strand ß3 and helix
1 (Fig. 1C, position 5) and is conserved in haloalkane dehalogenases and epoxide hydrolases. Residue X is usually an aromatic residue in epoxide hydrolases, whereas it is an asparagine or a glutamate in haloalkane dehalogenases. The side chain of the amino acid at this position lines the active site (31, 51). Between the H-G-X-P motif and the catalytic nucleophile, there is a conserved G-X-Sm-X-S/T motif of unknown function (Fig. 1C, position 6).
This paper describes the screening of various genomic databases for epoxide hydrolases of the
/ß-hydrolase fold family. Based on phylogenetic analysis of the resulting sequences and comparison to other
/ß-hydrolase fold enzymes that have a nucleophilic aspartate, the epoxide hydrolases were divided into different phylogenetic groups. Ten of these putative epoxide hydrolases, together with two known bacterial epoxide hydrolases, were cloned and overexpressed in E. coli and subsequently tested for their biocatalytic potential.
|
|
|---|
![]() View larger version (11K): [in a new window] |
FIG. 2. Epoxide substrates used in this study. 1, para-methylstyrene oxide; 2, styrene oxide; 3, para-chlorostyrene oxide; 4, meta-chlorostyrene oxide; 5, para-nitrostyrene oxide; 6, cis-stilbene oxide; 7, trans-stilbene oxide; 8, chalcone- ,ß-epoxide; 9, phenyl glycidyl ether; 10, para-nitrophenyl glycidyl ether; 11, epichlorohydrin; 12, epibromohydrin; 13, vinyl oxirane; 14, tert-butyloxirane; 15, 1,2-epoxybutane; 16, 1,2-epoxypentane; 17, 1,2-epoxyhexane; 18, 1,2-epoxyheptane; 19, 1,2-epoxyoctane; 20, cis-2,3-epoxybutane; 21, trans-2,3-epoxybutane; 22, fosfomycin; 23, cyclohexene oxide; 24, limonene-1,2-epoxide; 25, 3,4-epoxytetrahydrofuran.
|
/ß-hydrolase fold family and possess an invariable aspartate as the catalytic nucleophile. The resulting phylogenetic tree was displayed using TreeView and edited. The multiple sequence alignments were displayed as bit scores for each position, using the WebLogo sequence generator (http://weblogo.berkeley.edu).
Cloning of epoxide hydrolase genes.
Genes encoding putative epoxide hydrolases were amplified from respective sources of genomic DNA (see below), using the primer pairs described in the supplemental material, and subsequently cloned into the pMAL-c2x plasmid vector. Upon bacterial expression of these constructs, epoxide hydrolases were obtained as fusion proteins with a maltose binding protein (MalE) domain linked to the N terminus of the epoxide hydrolase domain via a polyasparagine linker and a small decapeptide. The following DNA sources served as templates: whole-cell material from an overnight culture (Bsueh, Bfueh1, Npueh1, Npueh2, and Ppueh), genomic DNA added up to 0.05 ng µl1 (Draeh, Rpaeh2, Scoeh6, and Tfueh), and plasmid DNA (AraEchA [43], Coreh [29], and MtuEphF). Primers were used at 0.4 nM in a reaction mixture with a 0.2 mM concentration of each deoxynucleoside triphosphate and 0.025 U µl1 Pwo DNA polymerase or, in the case of DNAs with high G+C contents (see the supplemental material), 0.05 U µl1 Pfu polymerase. The reactions with high-G+C-content DNA were performed in the presence of 5% (vol/vol) dimethyl sulfoxide (DMSO). For genes with no high G+C content, the temperature program used was 15 min at 94°C without polymerase, followed by 30 cycles of 60 s at 94°C, 45 s at 58°C, and 70 s at 72°C and a final step of 4 min at 72°C. For genes with high G+C content, the temperature program used was 15 min at 94°C without polymerase, followed by 30 cycles of 60 s at 95°C, 45 s at 68°C 0.5°C cycle1 (each cycle the temperature was lowered 0.5°C), and 120 s at 72°C and a final step of 4 min at 72°C. For PCR amplification of MtuEphF, the following program was used: 15 min at 94°C without polymerase, followed by 30 cycles of 60 s at 94°C, 45 s at 50°C, and 70 s at 68°C and a final step of 4 min at 68°C. The PCR products were digested with various restriction enzymes (see the supplemental material for details) and subsequently ligated into BamHI-HindIII-, EcoRI-HindIII-, EcoRI-PstI-, or XbaI-HindIII-digested pMAL-c2x plasmid DNA, using T4 DNA ligase. The ligation mixtures were transformed into E. coli TOP10 cells by electroporation. The transformants were plated on LB medium containing ampicillin. Colonies were checked for inserts by using PCR with Taq polymerase and with colony material as the template. Positive colonies were used to inoculate 5 ml of liquid LB medium and grown overnight at 37°C. Plasmid DNA was extracted using a plasmid purification kit from Roche, and the inserts were sequenced.
Protein expression and purification.
Induction conditions for the expression of fusion proteins were optimized by adding up to 0.4 mM IPTG to a 5-ml culture with an optical density at 600 nm (OD600) of
0.2, followed by overnight incubation at various temperatures ranging from 8 to 37°C. The cells were harvested and resuspended in 0.4 ml TEDANG buffer (20 mM Tris-HCl, pH 7.4, 1 mM EDTA, 1 mM dithiothreitol, 0.02% [wt/vol] NaN3, 200 mM NaCl, 10% [vol/vol] glycerol) at 4°C. After brief sonication, a cell extract (CFE) was obtained by centrifugation. Cell extracts were analyzed for expression by sodium dodecyl sulfate-polyacrylamide gel electrophoresis. The levels of expression were determined from Coomassie-stained gels with the GelPro Analyzer program. Total protein contents were determined using the Bradford reagent. For Bsueh, expression was optimized further by transforming the pMAL-c2x-Bsueh construct into E. coli BL21(DE3)/pLysS Rosetta, which has elevated levels of rare tRNAs that could enhance expression.
Preparative-scale production of proteins was achieved by induction of 1- to 3-liter cultures at an OD600 of
0.2 with 0.4 mM IPTG, followed by overnight incubation at temperatures ranging from 8 to 30°C. Cells were harvested by centrifugation, washed, and resuspended in TEDANG buffer at 4°C. The cells were lysed by sonication, and cell extracts were obtained by centrifugation at 200,000 x g for 90 min.
Purification of proteins from CFE was achieved using amylose resin to selectively bind the fusion proteins. Resin with bound fusion protein was subsequently poured into a column, and after the column was washed with TEDANG buffer, the fusion protein was eluted with the same buffer containing 10 mM maltose. Fractions containing active fusion protein were pooled and concentrated. The protein content of the purified enzyme fractions was determined by A280 measurements. The extinction coefficient was calculated from the amino acid sequences of the MalE-EH fusion proteins, using Lasergene-Protean.
Spectrophotometric epoxide hydrolase assays.
All continuous spectrophotometric measurements were performed on a Kontron Uvikon 930 UV/VIS spectrophotometer. Epoxide hydrolase activities toward para-nitrostyrene oxide (pNSO; epoxide 5 in Fig. 2) and para-nitrophenyl glycidyl ether (pNPGE; epoxide 10) were determined in 100 mM Tris-SO4 (pH 7.5) as described previously (43). Errors in values of initial activities calculated from the decrease (pNSO, 310 nm) or increase (pNPGE, 350 nm) in absorbance were <10%, as indicated by duplicate measurements. The detection limit for these initial activities was 0.002 U mg1 (units are defined in µmol min1).
The steady-state kinetic parameters of the purified novel epoxide hydrolases were determined by measuring progress curves for the conversion of enantiopure pNSO (epoxide 5) and pNPGE (epoxide 10). The substrate was added at a concentration of up to 0.5 mM, with a final concentration of DMSO of <1% (vol/vol). A suitable amount of purified protein was added to start the reaction. The spectrophotometric conversion traces for the separate enantiomers of epoxides 5 and 10 were directly fitted to Michaelis-Menten kinetics, as described before, to obtain kcat, Km, and kcat/Km values (36, 43). In case the Km was much higher than the substrate concentration used, the spectrophotometric traces were fitted according to first-order kinetics, and the first-order rate constant equals kcat/Km. In this case, only lower limits of kcat and Km were obtained. More details on the fitting procedures used can be found in the supplemental material. Errors in values for kcat, Km, and kcat/Km were <10%, as indicated by duplicate measurements.
Substrate profiling of the various epoxide hydrolases was done using the adrenaline test (46). Initial testing of various vicinal diols in a reaction with IO4 and adrenaline revealed that adenochrome was rapidly produced with all diols that are formed from the epoxides shown in Fig. 2, except for diols derived from glycidyl ethers (epoxides 9 and 10), which do not react fast enough with IO4 (<50% conversion in 45 min). Substrates were added at concentrations of up to 1 mM from 50 mM stock solutions in water (epoxide 22) or DMSO (all other substrates) to 100 µl 100 mM sodium phosphate (pH 8.0) in microtiter plates. Purified protein or CFE was transferred to 100 mM sodium phosphate (pH 8.0) by desalting over an Econopack desalting column in order to remove components that can be oxidized by IO4, such as glycerol and Tris. After 1 h of incubation of the enzyme with the substrate at room temperature, up to 1 mM NaIO4 was added from a 10 mM stock solution, and the mixture was incubated for 45 min, after which L-epinephrine was added to determine the amount of remaining NaIO4. The A490 value, which is a measure of the amount of adenochrome formed by the reaction with remaining IO4 (46), was determined for each well. After correction for incubations without substrate but with enzyme and visual inspection of the microtiter plates, the activities toward the different substrates were scored as high (++, 60 to 100% conversion), intermediate (+, 10 to 60% conversion), or low (, <10% conversion).
Kinetic resolution.
The enantioselectivities of the enzymes for various epoxides (Fig. 2) were determined by performing a kinetic resolution of 0.5 to 2 mM of racemic substrate in 12 to 20 ml of 100 mM Tris-SO4, pH 7.5, at room temperature or 30°C. Sampling, chiral gas chromatography (GC), and HPLC analysis were done as described before (43). The conditions for chiral GC and HPLC analysis are described in the supplemental material. The E values were calculated from the resulting data, as described before (26, 43).
Conversion of meso-epoxides.
The activities of the expressed epoxide hydrolases toward meso-epoxides 20 and 23 were determined at a 2 mM substrate concentration in 20 ml 100 mM Tris-SO4, pH 7.5. The reaction was started by adding a suitable amount of CFE or purified enzyme, and samples were taken in time and extracted with hexane containing an internal standard. The samples were analyzed by GC (see the supplemental material for the conditions used). Errors in values of initial activities were <10%, as indicated by duplicate measurements. The detection limit for these conversions was 0.002 µmol min1 mg1.
The diastereomeric composition of the product resulting from the conversion of epoxide 20 to 2,3-butanediol (diol 20a) was determined by dimethoxypropane extraction of a 2-ml sample from an incubation of 5 mM of substrate with enzyme. The organic phase was incubated with Amberlite/H+ resin for 1 h and subsequently dried over a short column of anhydrous MgSO4. The same procedure was performed with commercially available reference compounds of the three different diastereomers of diol 20a. Analysis was done by chiral GC (see the supplemental material for GC methods).
The diastereomeric composition of 1,2-cyclohexanediol (diol 23a) was determined by extracting 2 ml of a mixture of 10 mM substrate and enzyme with diethyl ether. The organic phase was dried over a short column of anhydrous MgSO4. Commercially obtained samples of the three different diastereomers of diol 23a were used as reference compounds. Analysis was done by chiral GC (see the supplemental material for the conditions used).
The E values for the formation of (R,R)- and (S,S)-diols from meso-epoxides were calculated from the ee of the product (eep) by using equation 1, as follows:
![]() | (1) |
|
|
|---|
/ß-hydrolase fold enzymes that have the conserved nucleophilic aspartate, including 42 epoxide hydrolases with known activity, haloalkane dehalogenases, and fluoroacetate dehalogenases. Analysis of the multiple sequence alignment revealed that 18 hydrolases that were annotated in the databases as epoxide hydrolases most likely are fluoroacetate dehalogenases (see below). The remaining 274 putative epoxide hydrolases originated from 91 taxonomically different organisms (Table 1). From this set, 35 sequences were removed before further phylogenetic analysis since they were 100% identical to sequences from closely related organisms. A multiple sequence alignment was done with the remaining 239 putative epoxide hydrolases found in sequencing projects, 6 putative epoxide hydrolases from other sources, 42 known epoxide hydrolases, 19 (putative) fluoroacetate dehalogenases, and 8 (putative) haloalkane dehalogenases. |
View this table: [in a new window] |
TABLE 1. Epoxide hydrolases in various classes of sequenced organisms
|
The phylogenetic tree that resulted from the final multiple sequence alignment identified eight different phylogenetic groups (Fig. 3). On average, the
/ß-hydrolase fold and cap domain sequences together (as indicated in Fig. 1C) were 290 to 310 amino acids long (Table 2). In some phylogenetic groups, the average total sequence length was significantly larger, which was, in most cases, the result of an N- or C-terminal extension. The nucleophilic aspartate, the histidine, and the H-G-X-P and G-X-Sm-X-S/T motifs aligned perfectly for all sequences. For all putative epoxide hydrolases, two ring-opening tyrosines were identified in the cap domain. The position of the first tyrosine (Y152 in EchA) (Fig. 1) especially varied considerably between the different groups, but it was conserved within each group. The charge relay acid was predominantly an aspartate, usually located at position b (Fig. 1; Table 2). The average amino acid sequence identity of the complete sequences was only 14%, or 19% if only sequences with known activity were included, so a low sequence identity appears to be normal for this class of enzymes. The relatively long nodes compared to the branches of the phylogenetic tree also indicate low overall identity, even within a group (20 to 35%) (Fig. 3). Apparently, only a few residues are crucial for proper performance of the various enzymes, and the overall
/ß-hydrolase fold and active-site geometry allow significant sequence variation, as observed in previous studies on phylogenetic relationships between functionally different
/ß-hydrolase fold enzymes (8, 9). This is most strikingly demonstrated by the fact that the amino acid sequences of the epoxide hydrolases from A. radiobacter AD1 (35) and A. niger (3) are <10% identical, but the
/ß-hydrolase fold part of their X-ray structures and their active sites are virtually superimposable (31, 51). Most of the phylogenetic groups of epoxide hydrolases consist of homologs from evolutionarily very distant organisms. Since the overall identity within each group is rather low, and since the high identities that do occur are between enzymes that originated from closely related organisms, the majority of these genes are probably orthologs that were separated during speciation. Since bacteria are biased toward deleting genes that are no longer needed (28), several specialized microorganisms such as lactic acid bacteria probably lost their epoxide hydrolase genes during evolution. Indeed, most epoxide hydrolases occur in organisms with large genomes (>8 Mb), even though the size of their collective genomes is similar to that of the small-genome organisms (<2 Mb) (Fig. 4). Lateral transfer may play a role in some cases, as suggested by the situation encountered in Mycobacterium tuberculosis, which contains 11
/ß-fold hydrolases (3 dehalogenases and 8 epoxide hydrolases) in a 4-Mb genome that harbors less than 4,000 genes in total.
![]() View larger version (86K): [in a new window] |
FIG. 3. Phylogenetic tree of /ß-hydrolase fold enzymes with a nucleophilic aspartate. Symbols: , enzymes for which an epoxide hydrolase function is described in the literature or databases; , enzymes identified in this study and active with one or more epoxides; , putative epoxide hydrolases for which no epoxide substrate was found. The black nodes are (putative) epoxide hydrolases, red nodes are (putative) haloacetate dehalogenases, green nodes are (putative) haloalkane dehalogenases, and orange nodes in group 3 are putative epoxide hydrolases with a C-terminal part that is homologous to short-chain dehydrogenase/reductase family proteins (SDR proteins). Groups 1 to 8 are described in Table 2.
|
|
View this table: [in a new window] |
TABLE 2. Phylogenetic groups of /ß-hydrolase fold enzymes with a nucleophilic aspartate
|
![]() View larger version (19K): [in a new window] |
FIG. 4. Occurrence of putative epoxide hydrolases in prokaryotes and archaea in relation to genome size. The number of organisms with a given genome size is indicated above each percentage bar, with the size of the collective genomes in this category shown in parentheses (in Mb). The percentages of organisms that have at least one putative epoxide hydrolase are indicated in black. The horizontal line indicates the average number of epoxide hydrolase-containing organisms (17.5%).
|
Characteristics of phylogenetic groups.
The members of phylogenetic group 1 epoxide hydrolases originated predominantly from proteobacteria (>70% of the group members). This group includes an epoxide hydrolase that was recently identified in an environmental gene library (49). The members of subgroup 1A are, on average, 40 amino acids longer than the other members of this group. Using SignalP software (http://www.cbs.dtu.dk/services/SignalP), these extra amino acids were identified as an N-terminal signal peptide for protein excretion. A more thorough analysis of the various characteristic signal peptide regions suggested that these proteins are secreted via a twin arginine translocation pathway (1). This pathway usually translocates completely folded proteins, including cofactor-dependent enzymes, across cellular membranes (14).
Group 2 contains epoxide hydrolases from bacterial, archaeal, and eukaryotic origins. Some of the eukaryotic epoxide hydrolases originated from multicellular organisms. Thus far, only two classes of eukaryotic epoxide hydrolases from multicellular organisms are known, which are the microsomal/juvenile hormone epoxide hydrolases (group 5) and the cytosolic/plant epoxide hydrolases (group 8). The putative eukaryotic epoxide hydrolases that cluster in group 2 have an N-terminal extension of unknown function and are therefore somewhat longer than the enzymes in this group that are of prokaryotic origin (Table 2). Group 2 contains a relatively large number of putative epoxide hydrolases that originated from cyanobacteria (20%).
Group 3 consists predominantly of putative epoxide hydrolases from actinobacteria, ß-proteobacteria, and fungi. About one-third of the group members consist of an epoxide hydrolase domain (N-terminal) and a C-terminal domain that is homologous to SDR proteins (Table 2). The epoxide hydrolase domain of the fungal epoxide hydrolase sequences is, on average, 34 amino acids longer than that of the other group members.
In group 4, both epoxide hydrolases and haloalkane dehalogenases appear to be present (Fig. 3; Table 2). These structurally related enzymes, which also have similar mechanisms (2), can be distinguished at the amino acid level by the presence of the ring-opening tyrosines, which are missing in the haloalkane dehalogenases. Since these tyrosines do not always align perfectly, the H-G-X-P motif, which is located between sheet ß3 and helix
1 (Fig. 1), is more suitable for distinguishing epoxide hydrolases from haloalkane dehalogenases. The main chain amide of the X residue is part of the oxyanion hole in both enzyme classes (Fig. 1B). The side chain is part of the wall of the active site and can therefore interact with the substrate (31, 51). In epoxide hydrolases, the X residue is always an aromatic amino acid, whereas in haloalkane dehalogenases it is a more hydrophilic or even charged residue such as asparagine or glutamate (Fig. 5A). Although the exact function of the motif is unknown, it can possibly be used to distinguish between epoxide hydrolases and haloalkane dehalogenases. The fungal enzymes present in this group consist predominantly of homologous epoxide hydrolase sequences of various Saccharomyces spp. (Fig. 3).
![]() View larger version (37K): [in a new window] |
FIG. 5. Multiple sequence alignments of conserved regions in functionally different /ß-hydrolase fold enzymes, represented as bit scores for each position, with the total height indicating the degree of conservation (maximum level, 4.32) and the relative heights of different symbols at the same position indicating the frequencies of the amino acids at that position. (A) H-G-X-P region in epoxide hydrolases and haloalkane dehalogenases. (B) Region around the nucleophilic aspartate in epoxide hydrolases and fluoroacetate dehalogenases.
|
/ß-hydrolase main domain. In contrast to the case for most epoxide hydrolases, the charge relay acid in this group of epoxide hydrolases is a glutamate in 75% of the cases instead of an aspartate (Table 2). Group 6 contains both fluoroacetate dehalogenases and epoxide hydrolases. For all members of this group, the charge relay aspartate is located at position a instead of the more common position b (Fig. 1; Table 2). The putative fluoroacetate dehalogenase sequences all grouped together with the fluoroacetate dehalogenase from Moraxella sp. (24) (Fig. 3). These sequences contain three consecutive arginines distal from the nucleophilic aspartate in a DRXXRXXXR motif, whereas epoxide hydrolases usually have a conserved aromatic residue distally flanking the catalytic nucleophile (Fig. 5B). It was proposed that one of these arginines plays a role in the binding of the acid part of the fluoroacetate substrate (25). Furthermore, all of these sequences have the fluoroacetate dehalogenase from Moraxella sp. (24) as their highest scoring BLAST hit for proteins that have known activity. These putative haloacetate dehalogenases have often erroneously been annotated as epoxide hydrolases, including in a recent phylogenetic study of epoxide hydrolases (9). The presence of a conserved tyrosine in the cap domains of haloacetate dehalogenases that aligns perfectly with one of the ring-opening tyrosines of epoxide hydrolases contributes to the confusion. The epoxide hydrolase sequences in group 7 are similar to those in group 6. The first conserved ring-opening tyrosine and the charge relay aspartate, however, are located at different positions (Fig. 1; Table 2).
Group 8 contains a large number of known epoxide hydrolases from plants and mammals, including the well-known mammalian cytosolic epoxide hydrolases (Fig. 3; Table 2). There are no sequences originating from other, lower eukaryotic organisms, such as fungi, in this group. The plant epoxide hydrolases occur only in this group. The cap domain of this group of epoxide hydrolases has a 30-amino-acid excursion between helixes
6 and
7 (Fig. 1), resulting in a larger size of the epoxide hydrolase segment (Table 2).
Overexpression and purification of putative epoxide hydrolases.
Ten putative epoxide hydrolase-encoding genes from different phylogenetic groups and the bacterial epoxide hydrolases from A. radiobacter (35) (AraEchA) and Corynebacterium sp. strain C12 (29) (Coreh) were selected to be cloned and overexpressed in E. coli (Table 3). Since initial attempts to overexpress the epoxide hydrolases from Mycobacterium tuberculosis H37v (MtuEphF) and Pseudomonas putida KT2440 (Ppueh) in E. coli using the pGEF system (38) resulted in inclusion bodies (data not shown), it was decided to overexpress the enzymes as maltose binding protein-epoxide hydrolase fusion proteins (MalE-EH). The MalE tag was expected to facilitate proper folding of the epoxide hydrolased in E. coli (37). Moreover, it serves as a convenient purification tag. The MalE-EH fusion proteins were produced in E. coli at levels of 10 to 60% of the total soluble protein in CFEs, including the enzymes for which expression yielded inclusion bodies when no MalE was fused to the epoxide hydrolase (Table 3). The CFE prepared from an expression culture of the MalE-Bsueh (originating from Bacillus subtilis) construct in E. coli TOP10 cells resulted in rather low specific activities (0.03 µmol min1 mg1), possibly due to poor expression levels caused by the large number of rare codons present in the Bsueh gene. Expression of the MalE-Bsueh protein in E. coli BL21(DE3)/pLysS Rosetta, an E. coli variant that has increased levels of rare tRNAs, resulted in a twofold improvement in specific activity in the cell extract.
|
View this table: [in a new window] |
TABLE 3. Properties of MalE-EH fusion proteins
|
1.5, with a total activity yield of 84%. The MalE-EH fusion proteins of the three epoxide hydrolases from phylogenetic group 2 (Fig. 3; Table 2) were also produced at a preparative scale and conveniently purified from cell extracts using the MalE tag. The enzymes from Deinococcus radiodurans (Draeh) and Nostoc punctiforme (Npueh1) were obtained at >95% purity with good yields. The B. subtilis enzyme could only be obtained at 50 to 70% purity with a single round of amylose resin purification. The enzyme was unstable under the purification conditions used and was therefore not purified further (see the supplemental material for details on enzyme purifications).
Activity of MalE-epoxide hydrolase fusion proteins.
CFEs and/or purified samples of the MalE-EH fusion proteins were tested for activity towards a range of epoxides, using either a colorimetric assay (substrates 5 and 10) (Fig. 2) or an indirect assay that measures periodate depletion by its reaction with a diol (substrates 1, 2, 4, 7, 8, 11 to 14, 16, 17, and 19 to 25). In the latter assay, the diol that is formed by epoxide hydrolase activity reacts with periodate, and the amount of remaining periodate is measured by incubating it with adrenaline to yield a colored product (46). Using these assays, it appeared that of the 12 epoxide hydrolases that were tested, 8 were active with one or more epoxides (Table 4). The substrate ranges of the enzymes were generally very broad and included styrene oxide derivatives, phenyl glycidyl ethers, and most terminal aliphatic epoxides. Only a few enzymes showed activity with the sterically more demanding
,ß-disubstituted epoxides (compounds 6 to 8, 20, 23, and 24). Among the proteins for which no activity was detected were MtuEphF and Ppueh. It is likely that the inactive proteins were correctly folded since they were expressed as soluble proteins. Activity was detected with proteins from different groups, and in combination with data in the literature, it was concluded that epoxide hydrolases occur in at least seven of the phylogenetic groups described above (Fig. 3; Table 2). The fact that Npueh2 is active toward pNPGE (compound 10) (Tables 3 and 4) shows that group 4 contains both haloalkane dehalogenases and epoxide hydrolases (Fig. 3). The nature of residue X in the H-G-X-P motif (motif 5 in Fig. 1), as described above, could indeed be used for distinction between the two functional classes (Fig. 5A).
|
View this table: [in a new window] |
TABLE 4. Substrate ranges of cloned bacterial epoxide hydrolases
|
An analysis of the genes carried in the vicinity of the six putative epoxide hydrolase genes for which activity was found did not clearly reveal a function for the enzymes. However, the genes encoding the epoxide hydrolases from B. subtilis, Burkholderia fungorum (Bfueh1), and R. palustris are located near genes encoding putative enzymes that may be involved in the detoxification of xenobiotics or antibiotics. The genes surrounding the epoxide hydrolase genes from D. radiodurans and N. punctiforme are of unknown or unrelated function. Thus, the biological function of the epoxide hydrolases identified here is uncertain.
Enantioselectivity and unusual enantiopreference of group 2 enzymes.
The enantioselectivity of the active epoxide hydrolase fusion proteins was tested with various chiral substrates. The enantioselectivities with styrene epoxides and phenyl glycidyl ethers were generally low (E < 10) to modest (10 < E < 50) (Tables 3 and 5). This could possibly be influenced by the fact that the tested enzymes were fusion proteins, but the MalE-AraEchA enzyme showed no reduction in stereospecificity compared to the native enzyme (43), so in this case the MalE protein did not influence the catalytic properties of the enzyme. The highest E values were found for para-nitrostyrene oxide, using the D. radiodurans enzyme (Draeh), and for para-nitrophenyl glycidyl ether, using the B. subtilis epoxide hydrolase (Bsueh).
|
View this table: [in a new window] |
TABLE 5. Activities and enantioselectivities of novel group 2 epoxide hydrolases for terminal epoxides
|
Determination of the steady-state kinetic parameters for pNSO (compound 5) and pNPGE (compound 10) (Table 6) revealed that Npueh1 has a rather low Km for either one of their enantiomers (<0.2 mM). Indeed, the kinetic resolution data for this enzyme for most substrates (Table 5) could only be fitted when the Km values for the preferred enantiomers were set lower than 0.2 mM, indicating that this enzyme has a low Km for the preferred enantiomer with most terminal epoxides. In contrast, with the Draeh enzyme, the Km was far above the substrate concentration used for pNPGE and pNSO (>0.5 mM), and the shapes of the kinetic resolution curves indicate that this is also the case for most other epoxides. These high Km values indicate that the natural substrate for Draeh is probably not among the epoxides tested.
|
View this table: [in a new window] |
TABLE 6. Steady-state kinetic parametersa for Draeh and Npueh1 acting on each enantiomer of pNSO (substrate 5) and pNPGE (substrate 10)
|
Conversion of meso-epoxides.
Five of the 12 recombinant epoxide hydrolases were active toward the meso-epoxides 20 and/or 23 (Table 4). The regioselective conversion of these (R,S)-configured meso compounds will result in optically enriched vicinal diols due to inversion of configuration at one of the two stereocenters. The activities and enantioselectivities of the MalE-EH fusion proteins toward meso-epoxides 20 and 23 were determined by following substrate depletion over time and measuring the enantiomeric composition of the resulting diol product (Table 7). The activities were rather low for most substrates (<0.05 µmol min1 mg1), but the enantiomeric excess of the product (eep) was in some cases very high (>95%). With Npueh1, the formation of (1S,2S)-1,2-cyclohexanediol from cyclohexene oxide occurred with an eep of >95%, which is a significant improvement over the 58% eep that was reported for an epoxide hydrolase (BD10090) obtained from an environmental gene library (49). Epoxide hydrolases with excellent enantioselectivities toward meso-epoxides could thus be obtained from genomic databases.
|
View this table: [in a new window] |
TABLE 7. Activities of different fusion proteins for meso-epoxides
|
The various phylogenetic groups of
/ß-hydrolase epoxide hydrolases harbor orthologs from phylogenetically very different organisms. Since the level of sequence identity within the groups is low, this is probably not caused by recent lateral gene transfer. A more likely explanation is that the common ancestor of epoxide hydrolases was present early in evolution and widespread among the various species. As a result of speciation, enzymes that cluster in a multiple sequence alignment are now present in phylogenetically unrelated organisms.
Active recombinant epoxide hydrolases could rapidly be obtained by cloning the genes as fusions to the C-terminal part of the maltose binding protein (MalE). This facilitated expression, prevented the formation of insoluble inclusion bodies, and served as a convenient purification tag. Preliminary experiments showed that the production of soluble protein failed for only 2 of the 12 enzymes that were synthesized as fusion proteins. This approach for obtaining new enzymes for biocatalysis is attractive since the gene of a positive hit is immediately available for further improvement by site-directed mutagenesis or directed evolution.
The cloning and characterization of the putative epoxide hydrolases led to several new active enzymes that can be evaluated as biocatalysts for enantioselective conversion. Three enzymes were (S) selective toward aromatic substrates, which is rather uncommon and has been observed only for a few other epoxide hydrolases (21, 27, 42, 50). Furthermore, the enzymes from B. fungorum (Bfueh1) and N. punctiforme (Npueh1) converted the prochiral substrate cyclohexene oxide to optically enriched (1S,2S)-1,2-cyclohexanediol with much higher enantioselectivities than previously reported (Table 7) (49). Thus, new epoxide hydrolases with interesting biocatalytic activities can be obtained using genomic databases, which defines genome analysis as a promising strategy to widen the scope of biocatalysts, as previously shown for carbohydrate-converting enzymes (17).
The work of B.V.L. was supported by a grant from DSM to the University of Groningen.
Supplemental material for this article may be found at http://aem.asm.org/. ![]()
Present address: Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, CB2 1GA Cambridge, United Kingdom. ![]()
|
|
|---|
/ß hydrolase fold. Protein Eng. 5:197-211.This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»