Previous Article | Next Article ![]()
Applied and Environmental Microbiology, February 2009, p. 823-836, Vol. 75, No. 3
0099-2240/09/$08.00+0 doi:10.1128/AEM.01951-08
Copyright © 2009, American Society for Microbiology. All Rights Reserved.
,
*
Valeria Cafaro,1,
Giuseppe Bozza,1 and
Alberto Di Donato1,3
Dipartimento di Biologia Strutturale e Funzionale, Università di Napoli Federico II, Complesso Universitario di Monte S. Angelo, Via Cinthia 4, 80126 Naples, Italy,1 Facoltà di Scienze Biotecnologiche, Università di Napoli Federico II,2 CEINGE-Biotecnologie Avanzate S.c.ar.l., Naples, Italy3
Received 21 August 2008/ Accepted 28 November 2008
|
|
|---|
|
|
|---|
As BMMs allow bacteria to grow on hydrocarbons or xenobiotics as the sole source of carbon and energy, several members of this protein family, including the soluble methane monooxygenases (MMOs) (15, 21), alkene monooxygenases (8), phenol hydroxylases (PHs)/toluene 2-monooxygenases (T2MOs) (3, 22), and toluene monooxygenases (TMOs) such as toluene 4-monooxygenase (T4MO) from Pseudomonas mendocina KR1 (38) and toluene/o-xylene monooxygenase (ToMO) from Pseudomonas sp. strain OX1 (6), have been characterized thoroughly.
All these enzymes possess defined substrate specificity, regioselectivity, and enantioselectivity properties. For example, TMOs and PHs perform two consecutive hydroxylation reactions with aromatic rings, but usually TMOs are more efficient in the first hydroxylation step, whereas PHs are more efficient in the second (3, 5). Moreover, each TMO and PH shows its own characteristic regioselectivity. T4MO produces more than 96% p-cresol from toluene (25), whereas ToMO produces a mixture of the three isomers of cresol (5). PHs usually produce a large excess of o-cresol—70 and 90% in the cases of PH from Pseudomonas sp. strain OX1 (5) and T2MO from Burkholderia cepacia G4 (22), respectively.
Thus, it appears that the BMM family constitutes an archive of powerful catalysts that could be used to construct new catalysts for the bioremediation of environmentally harmful substances and for industrial biosyntheses. Certainly, an understanding of the molecular determinants of BMM substrate specificity, regioselectivity and enantioselectivity properties is preliminary to the rational design of new, improved catalysts, as proved by the large number of studies on BMM catalytic mechanisms and on synthetic analogues capable of catalyzing reactions similar to those catalyzed by BMMs (14, 28, 39).
The results of several structural and functional studies suggest that the catalytic mechanisms of BMMs are very similar (13, 19). The major subunit (A in TMOs and
in MMOs) of the hydroxylase component (the H complex) contains a di-iron cluster bound to four glutamate and two histidine residues. These residues, and several other conserved hydrophilic residues, form an H bond network on one side of the iron ions (13, 19). On the other side, nonconserved hydrophobic residues form the substrate binding pocket (13, 19). The catalytically active diferrous form, interacting with dioxygen, produces a di-iron(III) intermediate (the peroxo intermediate) which, at least in the case of MMOs, turns into a di-iron(IV) form known as diamond core (19). The peroxo and diamond core intermediates each transfer one oxygen atom to the substrate (19). The possible intermediates involved in the transfer of oxygen to the aromatic ring (16, 19, 20) are shown in Fig. 1. The reactive species, likely a di-iron(III) intermediate, attacks the pi-electron system of the aromatic ring, forming epoxide 1 or delocalized carbocation 2. The opening of the epoxide ring eventually provides the delocalized carbocation. The migration of a hydride from the sp3-hybridized carbon to the adjacent atom then converts the carbocation to the more stable ketone 3. Finally, the dissociation of the ketone and its tautomerization yield phenolic product 4.
![]() View larger version (8K): [in a new window] |
FIG. 1. Possible intermediates in the aromatic hydroxylation reaction catalyzed by ToMO. Intermediates 1, 2, and 3 are an epoxide, a carbocation, and a ketone, respectively. R1 and R2 are hydrogen atoms or methyl groups. The geometrical features of the di-iron(III)-(hydro)peroxide intermediate and the details of the O-O bond cleavage reaction are not known.
|
In this paper, we report the effects of the substitution of six residues in the ToMO A active site on substrate specificity and regioselectivity. Furthermore, we present a detailed analysis of the molecular determinants of regioselectivity based on the docking of substrates and hypothetical intermediates of the aromatic hydroxylation reaction into the active site of the crystallographic structure of ToMO (Protein Data Bank [PDB] code 1T0Q [32]), followed by a Monte Carlo optimization. The results show that (i) the fine-tuning of TMO regioselectivity can be achieved through a careful alteration of the shape of the active site pocket and that (ii) the effects of mutations on regioselectivity can be quantitatively predicted using the procedure described herein.
|
|
|---|
ToMO A mutagenesis.
Plasmids for the expression of ToMO complexes with mutated ToMO A subunits were prepared by site-directed mutagenesis of plasmid pTOU as described previously (4). Sequences of the mutagenic oligonucleotides are reported in Table S1 in the supplemental material.
Determination of apparent kinetic parameters and identification of products.
Assays were performed as described previously (4, 5) using E. coli JM109 cells transformed with plasmid pBZ1260 or plasmid pTOU, which expresses wild-type ToMO or ToMO mutant enzymes, respectively.
All kinetic parameters were determined using whole cells (4, 5). Enzymatic activity on phenol was measured by monitoring the production of catechol in continuous coupled assays with recombinant catechol 2,3-dioxygenase from Pseudomonas sp. strain OX1 (5). The determination of apparent kinetic parameters for benzene, toluene, o-xylene, and naphthalene was carried out by a discontinuous assay (5). For the calculation of the kcat values, amounts of proteins were calculated as described previously (5). All the ToMO mutant enzymes showed expression levels similar to that of the wild-type enzyme.
Reaction products were identified as described previously (5). All the regiospecificity studies were performed using substrate concentrations higher than the Km values. Under these conditions, absolute yields of products were proportional to kcat values.
Modeling of substrates and intermediates into the active site of ToMO A.
Substrates and reaction intermediates were docked into the active site of ToMO A by using the Monte Carlo energy minimization strategy. The ZMM-MVM molecular modeling package (ZMM Software Inc. [http://www.zmmsoft.com]) was used for all calculations. This software allows conformational searches using generalized coordinates such as torsion and bond angles instead of conventional Cartesian coordinates (40).
Atom-atom interactions were evaluated using assisted model building with energy refinement force fields (37) with a cutoff distance of 8 Å. Conformational energy calculations included van der Waals, electrostatic, H bond, and torsion components. A hydration component was not included. Electrostatic interactions were assessed with a relative dielectric constant of 4.
Substrate and intermediate structures were prepared using the PyMOL software (DeLano Scientific LLC). Geometry was optimized using the Zl module of ZMM. Partial charges were attributed using the complete neglect of differential overlap method in the HyperChem software (HyperCube Inc. [http://www.hyper.com]).
The X-ray structure of the ToMO A-thioglycolate complex (PDB code 1T0Q) was used to build the models of the ToMO A-substrate and ToMO A-intermediate complexes. To reduce computational time, a double-shell model of the enzyme was built. The inner shell included 28 ToMO A residues surrounding the active site cavity. During energy calculation procedures, the side chain torsion angles—but not the backbone torsion angles—of these residues were allowed to vary. Due to the asymmetric shape of the cavity, which is flat with the di-iron cluster on one end and evolutionarily nonconserved hydrophobic residues on the opposite side, the residues of the inner shell were selected manually. They included the ligands of the iron ions, all residues with at least one side chain atom contributing to the hydrophobic part of the active site cavity, and all residues with at least one side chain atom less than 5 Å from the previous residues.
The outer shell included 139 residues, which did not belong to the inner shell and were located less than about 16 Å from the active site pocket. During energy calculation procedures, both the backbone and the side chain torsion angles of the outer shell residues were not allowed to vary.
To further restrict the conformational freedom of the iron cluster and of the protein-ligand complexes, two flat-bottom parabolic penalty functions—the so-called constraints—available in ZMM were used. These functions increase the conformational energy of the system if it deviates from specified parameters. The atom-atom distance constraint applies a force to the system when the distance between two specified atoms deviates from a specified value or interval. This type of constraint was used to fix the distances between the two iron ions and between each iron ion and the surrounding atoms, including the bridging water molecule and the terminal water molecule. Atom-atom distance constraints were also used to fix the conformation of the ligands of the di-iron cluster. A force constant of 1,000 kcal/mol/Å was used. The atom-atomic coordinate constraint applies a force to the system when an atom moves farther than a specified distance from particular Cartesian coordinates. This constrain was used to prevent the movement of the O
1 atom of the Glu103 side chain more than 0.5 Å from the original position observed in the crystallographic structure.
A +2 charge was arbitrarily assigned to each iron atom, both to account for electron density transfer from the ligands to iron ions and to avoid strong electrostatic attractive and repulsive interactions with negative and positive atoms, respectively, of substrates and intermediates. Similarly, both the bridging and terminal solvent molecules observed in the 1T0Q structure were modeled as neutral water molecules rather than OH– ions. van der Waals radii of iron ions and oxygen atoms of water molecules were arbitrarily set to 0.8 and 1.5 Å, respectively, in order to reduce steric hindrance inside the di-iron cluster.
Complexes with total energies of up to 8 kcal/mol higher than that of the lowest-energy complex were stored for the analysis of energy contributions. Total energy was partitioned into intrareceptor, intraligand, and receptor-ligand energies and energies of the constraints. Receptor-ligand energy was further partitioned into van der Waals, electrostatic, and H bond components. Moreover, receptor-ligand energy was also partitioned (i) per active site residue in order to evaluate the contribution of each residue to the binding of ligands and (ii) per ligand atom in order to evaluate the contributions of the ring and methyl substituents. Intrareceptor energies gave an estimation of the energy costs for receptor (ToMO A) conformational changes upon ligand binding.
The docking procedure is described in detail in the supplemental material. The PDB files for the initial manually generated complexes and the ZMM instruction files containing the lists of mobile residues, constraints, and parameters used during calculations are available upon request.
|
|
|---|
![]() View larger version (52K): [in a new window] |
FIG. 2. Active site pocket of ToMO A. Panels A and B show the active site of the crystal structure of ToMO A (PDB code 1T0Q). Only residues contributing to define the edge of the cavity are shown. In panel B, the grid cuts the cavity in such a way as to provide the largest section. Carbon atoms are shown in red (Glu134 and Glu231), green (Ala107), yellow (Met180), blue (Glu103), magenta (Phe176), cyan (Ile100), and orange (Phe205). (C) Superimposition of the PDB code 1T0Q crystal structure (colored as in panels A and B) onto the structure of the complex ToMO-CCI 2 for the reaction leading to phenol production from benzene (carbon atoms are shown in white). (D) Superimposition of the complex ToMO-CCI 2 (carbon atoms are shown in white) onto the complex ToMO-ketone 3 (carbon atoms are shown in green) for the reaction leading to phenol production from benzene. THG, thioglycolate. CCI indicates CCI 2 and KTI indicates ketone 3.
|
In a previous paper (4), we explained the regiospecificity of ToMO A for toluene by hypothesizing that there are at least three different positions in the active site pocket which can accommodate the methyl group of toluene. These three subsites can orient the methyl group of the substrate such that its ortho, meta, or para carbon is presented to the di-iron center. Thus, it is the difference in the affinities of the subsites for the binding of the methyl group which determines the relative abundances of three different enzyme-substrate complexes, which can account for the observed distribution of cresols produced by ToMO. Using a manual docking procedure (4), we mapped an ortho subsite located among Ala107, Met180, and Glu103, a meta subsite among Glu103, Phe176, and Ile100, and a para subsite located between Ile100 and Phe205.
Figure 3 shows a new, more complex kinetic model of ToMO regioselectivity. According to model i, toluene would bind to the active site in three different catalytically productive orientations, thus giving rise to three different enzyme-toluene (ET) complexes that lead to the production of o-, m-, and p-cresol isomers (complexes ETo, ETm, and ETp, respectively) through at least one enzyme-intermediate (EI) complex (EIo, EIm, or EIp, corresponding to o-, m-, or p-cresol, respectively). According to this model, the ET-EI conversion is the rate-limiting step. As the interactions between toluene and the active site cavity are limited to van der Waals interactions and as the active site cavity is larger than the toluene molecule, it is likely that the interconversion of the ET complexes is fast with respect to their transformation to cresols. The new model can be described by six equilibrium constants: Ko-m, Ko-p, and Km-p for the conversions ETo
ETm, ETo
ETp, and ETm
ETp, respectively, and K
o, K
m, and K
p for the conversion of each productive ET complex to the corresponding transition state (ET
) complex. Each ET
complex can turn into the other two activated complexes through the ET complexes. Therefore, we can define three equilibrium constants for the conversions ET
o
ET
m, ET
o
ET
p, and ET
m
ET
p, which will be the products of the constants defined above:
![]() | (1) |
![]() | (2) |
![]() | (3) |
G
o-m,
G
m-p, and
G
o-p are the free-energy differences for the three conversions, R is the gas constant, and T is the absolute temperature.
![]() View larger version (18K): [in a new window] |
FIG. 3. Kinetic models for the hydroxylation reaction of toluene and o-xylene. CCIs 5 to 12 are the CCIs deriving from toluene and o-xylene shown in Fig. 4. oC, mC, and pC, o-, m-, and p-cresols.
|
complexes at equilibrium.
Each ET
provides the corresponding EI, which in turn releases a cresol isomer. Each ET
-EI transformation should proceed with the same rate,
, which is given by the following well-known relation:
= kT/h, where k and h are the Boltzmann and Planck constants, respectively, and T is the absolute temperature. According to model i (Fig. 3), the relative abundances of o-, m-, and p-cresol isomers produced by the enzyme are determined by the relative abundances of the three transition state ET
complexes at equilibrium. Therefore, calculating the energy differences,
G
o-m,
G
m-p, and
G
o-p, should allow the prediction of the percentages of cresols formed.
Two components should contribute to these energy differences: (i) the covalent bond energy (bE
), which includes the energy of the bonds among ligand atoms and between ligand and protein atoms, for example, those between the oxygen atom transferred to the substrate and each iron of the cluster (Fig. 1), and (ii) the ligand-protein noncovalent bond energy (nbE
). Assuming that bE
is scarcely influenced by the position of the substituent, the
G
values should depend essentially on the nbE
contributions:
![]() | (4) |
![]() | (5) |
![]() | (6) |
complexes yield 2,3-dimethylphenol (2,3-DMP) and 3,4-DMP, respectively. In the following sections, we discuss several pieces of evidence which support these kinetic models.
Modeling toluene and o-xylene into the active site of ToMO A.
We have tried to dock substrates into the active site cavity of ToMO A by the Monte Carlo method, as it allows effective exploration of the conformational space with less central processing unit time than other, more time-consuming methods such as molecular dynamics. In all docking experiments, the backbone of ToMO A and the structure of the di-iron cluster were held rigid whereas at least two layers of side chains around the active site cavity were allowed to move to improve the fit of ligands inside the cavity. The Monte Carlo energy minimization of ToMO A without ligands in the active site cavity showed that only three residues contributing to the surface of the cavity, i.e., Ile100, Thr201, and Phe205, were particularly mobile, being able to assume several conformations. However, the mobility of Phe205 was limited to the
2 torsion angle. In the second layer of residues, Leu208, Leu272, Gln204 and, to a lesser extent, His96 were able to adopt different conformations.
When toluene and o-xylene were docked into the active site, more than 10 low-energy orientations for each substrate were found, giving rise to several different binding complexes (data not shown). These results suggest that aromatic substrates can assume several binding orientations inside the active site, in agreement with the models in Fig. 3.
Modeling the intermediates of benzene into the active site of ToMO A.
As it is well-known that active sites are complementary to activated transition states and to unstable intermediates rather than to substrates and products (some examples can be found in references 11, 12, 24, 27, and 33 and references therein), we tried to identify catalytically productive binding modes through the docking of the intermediates of the hydroxylation reaction. As shown in Fig. 1, two or three intermediates are supposed to be involved in the conversions of aromatic hydrocarbons to phenols (16): (i) an aromatic carbocation, (ii) an unsaturated ketone, and possibly (iii) an epoxide. The carbocationic intermediate (CCI) has a critical role in the regioselectivity of the reaction because, after its formation, the nature of the product is irreversibly defined. In contrast, the epoxide intermediate formed from toluene or from o-xylene can yield two different isomers, depending on which C-O bond of the epoxide ring undergoes cleavage.
We initially tested this procedure by docking the three possible intermediates of benzene hydroxylation into the active site of wild-type ToMO. Benzene was chosen instead of toluene or o-xylene because in this case a single molecular species exists for each intermediate. Docking was carried out by fixing the oxygen atom transferred to the aromatic ring at the same coordinates found for the bridging oxygen of the thioglycolate anion in the crystal structure of ToMO A (PDB code 1T0Q). The assumption that the oxygen atom is bound to the di-iron cluster even after its transfer to the substrate (Fig. 1) limits the degrees of freedom of the intermediates and provides a fixed point which can be used as a rotation center for the ligand.
Binding energy values for the ToMO-benzene intermediate complexes reported in Table 1 indicate that the CCI fits the active site better than the other intermediates. The main contribution to the tight binding of the CCI depends on electrostatic interactions (Table 1), but van der Waals contacts also play an important role. In the CCI, the oxygen atom is bound to a single carbon atom of the ring and the C-O bond forms an angle of
130° with the ring, which can thus be placed almost exactly in the plane of the cavity as shown in Fig. 2C, thus maximizing the steric interaction with the cavity. On the other hand, in the molecule of the epoxide intermediate, the six-atom ring and the epoxide ring form an angle of 105° and the oxygen atom lies above the central point of the C-C bond of the epoxide ring. As a consequence, when the oxygen atom is located at the bridging position of the di-iron cluster, too-close contacts between the six-atom ring and the active site cavity and between the three-atom ring and the di-iron cluster are generated (data not shown). As for the ketonic intermediate, the oxygen atom is in the same plane as the carbon atom ring. This geometry prevents the positioning of the ring in the plane of the cavity but, interestingly, pushes it toward the tunnel (Fig. 2D) which connects the active site to the exterior of the molecule.
|
View this table: [in a new window] |
TABLE 1. Interaction energy values for ToMO A-benzene reaction intermediates
|
Modeling the intermediates of toluene and o-xylene into the active site of ToMO A.
Given the increased stability of the ToMO-CCI complex relative to the ToMO-epoxide and ToMO-ketone complexes, in the case of benzene we decided to pay particular attention to the modeling of the ToMO-CCI complexes corresponding to toluene and o-xylene. Docking the CCIs of toluene and o-xylene is a complex procedure because several isomers exist. As shown in Fig. 4, toluene may generate up to five CCIs and four possible intermediates may be produced from reactions starting with o-xylene. Intermediate couples 5 and 6, 7 and 8, 10 and 11, and 12 and 13 are enantiomers. CCIs 5 and 6 yield o-cresol, CCIs 7 and 8 yield m-cresol, CCIs 10 and 11 yield 2,3-DMP, and CCIs 12 and 13 yield 3,4-DMP. CCI 9 yields p-cresol.
![]() View larger version (11K): [in a new window] |
FIG. 4. Chemical structures of the possible CCIs deriving from toluene and o-xylene. CCIs 5 and 6, CCIs 7 and 8, and CCI 9 are the possible intermediates for the transformation of toluene into o-, m-, and p-cresols, respectively. CCIs 10 and 11 and CCIs 12 and 13 are the possible intermediates for the transformation of o-xylene into 2,3- and 3,4-DMPs, respectively.
|
Figure 5A shows a model of the positioning of CCIs 10 and 12, which lead to 2,3-DMP and 3,4-DMP, respectively, into the active site. The models of CCIs 5 and 7, leading to o- and m-cresol, respectively, are completely superimposable onto the model of CCI 10 (data not shown), whereas CCI 9, which yields p-cresol, has an orientation similar to that of CCI 12 (data not shown). Figure 5B shows that the ring of CCI 12 is placed exactly in the plane of the grid in Fig. 2 and that it mimics the orientation of thioglycolate even better than the CCI of benzene shown in Fig. 2C. Thus, our data indicate that CCIs 5, 7, 9, 10, and 12 dock into the active site of ToMO A and place their methyl groups into subsites located on the border of the pocket, as hypothesized previously (4). However, the ToMO-CCI complexes suggest rather different positioning of the subsites for methyl groups from that in the original model. The model in Fig. 5A shows that only the ortho and para subsites are unambiguously defined. In this new model, the ortho subsite, located among residues Glu134, Leu192, and Ala107, is closer to the di-iron cluster than that in our previous model whereas the new para subsite is defined by residues Glu103, Phe176, and Ile100 (the meta subsite in the original model). Moreover, two alternative meta subsites (designated m1a and m1b) can be mapped. The existence of two alternative meta subsites may depend on the close proximity of the ortho and para subsites. This geometry is incompatible with the simultaneous docking of three adjacent methyl groups. Therefore, when CCI 10 is docked into the cavity, the ortho methyl groups block the ortho subsite whereas the meta methyl group partially fills the para subsite, thus defining the m1b subsite (residues Glu103, Phe176, and Met180). On the other hand, when CCI 12 is docked into the cavity, the para methyl group fills the para subsite, whereas the meta group partly occupies the ortho subsite, thus defining the m1a subsite (residues Ala107, Glu103, and Met180). The van der Waals contributions of the methyl groups of CCI 10 (subsites o and m1b) are –1.12 and –2.02 kcal/mol, respectively, whereas their contributions in the case of CCI 12 (subsites m1a and p) are –1.64 and –1.65 kcal/mol, respectively. Hence, it seems that a methyl group positioned into the m1b subsite gives a greater contribution to the binding energy than one positioned into the m1a subsite. An indirect confirmation of this observation comes from the docking of CCI 7, which leads to m-cresol. In this case, the single methyl group is predicted to occupy subsite m1b.
![]() View larger version (27K): [in a new window] |
FIG. 5. Structures of the ToMO-CCI 10 and ToMO-CCI 12 complexes. (A) Superimposition of the ToMO-CCI 10 complex (carbon atoms are in white) onto the ToMO-CCI 12 complex (carbon atoms are in green). (B) Superimposition of the PDB structure 1T0Q (carbon atoms are in magenta) onto the structure of the ToMO-CCI 12 complex (carbon atoms are in green). (C) Superimposition of the complexes ToMO-CCI 10 and ToMO-CCI 12 (colored as in panel A) onto the complex of ToMO with the CCI of the reaction m-xylene 2,4-DMP (carbon atoms are in magenta). The surface of the cavity of the complex ToMO-CCI 10 is shown as a mesh. The mesh is colored to show the contributions of residues Glu134, Ala107, Met180, Glu103, Phe176, and Ile100.
|
In conclusion, our docking data provide a very detailed map of the active site residues potentially involved in regioselectivity.
From binding energies to percentages of products.
In the hypothesis that the CCI is the first intermediate formed during the hydroxylation reaction, as the toluene
CCI reaction is endergonic, the ET
transition states should be similar to enzyme-CCI complexes. Therefore, the
nbE
o-m,
nbE
m-p, and
nbE
o-p values defined by equations 4 to 6 can be estimated through the Monte Carlo docking of the CCIs. Using equations 1 to 6 and the binding energy values provided by the ZMM software for toluene CCIs 5, 7, and 9, we predicted the percentages of cresols. Predicted percentages of o-, m-, and p-cresols (39.7, 19.9, and 40.4%, respectively) were very similar to experimentally determined percentages (36, 19, and 45%, respectively). Similarly, using the binding energy values for o-xylene CCIs 10 and 12, we found that predicted percentages of 2,3-, and 3,4-DMPs (17.9 and 82.1%, respectively) were very similar to experimentally determined percentages (19 and 81%, respectively). Thus, it may be concluded that the hypotheses of completely steric control of regioselectivity and the use of the ToMO A-CCI noncovalent bond energies for calculating the relative stabilities of the ET
or EX
complexes are essentially correct.
As a control, the docking procedure was repeated using the two possible epoxides deriving from toluene, i.e., toluene 2,3-epoxide and toluene 3,4-epoxide, which provide o-cresol/m-cresol and m-cresol/p-cresol, respectively. The binding energy of toluene 3,4-epoxide was found to be about 2 kcal/mol higher than that of toluene 2,3-epoxide. In a system at equilibrium, this energy difference would imply the formation of less than 5% 2,3-epoxide. Even assuming that the 2,3-epoxide intermediate converts entirely to o-cresol, this finding is not in agreement with the experimentally determined percentage of o-cresol (36%). Similarly, in the case of the ketonic intermediates, the isomer leading to m-cresol showed the higher binding energy (data not shown), in disagreement with the experimental data.
Modeling the CCIs of toluene and o-xylene in the active sites of ToMO A mutant forms.
To further test our hypothesis, all the residues located on the edge of the cavity (Ala107, Met180, Glu103, Phe176, Ile100, and Phe205) were selected for mutational studies to experimentally verify whether changes at these sites would affect the regioselectivity of the enzyme in a predictable way. These residues were all changed to hydrophobic residues to preserve the hydrophobic nature of the pocket.
Ala107 was changed to larger residues, such as Val and Ile, in order to hinder the ortho site. Met180 was changed to Ile in an enzyme already carrying the mutation E103G in order to obtain a double-mutant enzyme designated (E103G, M180I)-ToMO A, in which all the residues facing the active site pocket are identical to the corresponding residues present in homologous T4MO.
Glu103, previously mutated to Gly, Leu, and Met (4), was changed to the β-branched residues Val and Ile in order to increase hindrance in the region between the meta and para sites. Phe176 was changed to Leu and Ile in order to enlarge this region.
Ile100 contributes to defining the hypothetical para site, but it is also at the boundary between the active site pocket and the tunnel which connects the pocket to the surface of the protein. Moreover, it is less closely packed than the other residues of the active site and bulges from the surface of the pocket. The corresponding residue in MMOs, Leu110, has been defined previously as the gate that controls the access to the active site (2, 29). Therefore, to explore the entire range of side chain dimensions, Ile100 was changed to Ala, Val, Leu, Met, Phe, and Trp.
Phe205, which is located at the boundary between the active site pocket and the tunnel, like Ile100, was changed to Leu.
All mutant enzymes were assayed with phenol, benzene, toluene, o-xylene, and naphthalene. The majority of the mutations did not change the catalytic efficiency with respect to that of wild-type ToMO or caused only minor changes. Only mutations I100A, I100W, F205L, A107V, and A107I were found to reduce significantly the kcat values for all the substrates (Table 2). Possible explanations are discussed briefly in the supplemental material.
|
View this table: [in a new window] |
TABLE 2. Apparent kcat values of ToMO and ToMO mutant proteins on benzene, toluene, o-xylene, and naphthalene
|
Tables 3 and 4 report the results obtained after the docking of toluene- and o-xylene-derived CCIs, respectively, to ToMO A mutant enzymes at positions 107, 103, 180, and 176. The regioselectivities of all mutant enzymes, with the exception of those of (E103L)-ToMO for toluene and (F176L)-ToMO for o-xylene, were predicted with fairly good accuracy. Minor differences between experimental and predicted percentages may depend on the small differences in binding energy values (
G
) among ET
o, ET
m, ET
p, EX
3,4, and EX
2,3 complexes. In fact, our approach predicts
G
values of 0.1 to 0.3 kcal/mol, which are 1% or less of the calculated binding energy values (30 to 36 kcal/mol).
|
View this table: [in a new window] |
TABLE 3. Comparison between experimentally determined and calculated percentages of cresol isomers produced by wild-type ToMO and ToMO mutant proteins
|
|
View this table: [in a new window] |
TABLE 4. Comparison between experimentally determined and calculated percentages of DMP isomers produced by wild-type ToMO and ToMO mutant proteins
|
o) and to the stabilization of the para orientation (complex ET
p). Moreover, the ZMM software yields individual group contributions to the total binding energy (see Materials and Methods). Our data (not shown) indicate that the contribution of the methyl group in the ortho site to the binding energy, due essentially to van der Waals interactions, is barely influenced by the E
G mutation, changing from –1.34 kcal/mol for the wild-type protein to –1.47 kcal/mol for the E103G mutant form. Rather, the predicted destabilization of the ET
o complex (0.69 kcal/mol) depends on the loss of van der Waals contacts between the C-H groups at positions 3 and 4 of the ring (Fig. 4 numbering) and the larger active site pocket. On the other hand, the enlargement of the para subsite removes clashes between the para methyl group and the pocket and increases the contribution of the para methyl group to the binding energy (from –0.76 to –2.1 kcal/mol). In the case of o-xylene, the mutation E103G increases the relative stability of the EX
3,4 complex, which provides 3,4-DMP, both increasing the stability of the EX
3,4 complex and decreasing the stability of the EX
2,3 complex (Table 4). The values for the individual components of the calculated binding energy indicate that the increase in the stability of the EX
3,4 complex is due mainly to improved binding of the methyl groups and that the decrease in the stability of the EX
2,3 complex depends on the worse accommodation of the substrate ring.
Mutations F176I and F176L have effects similar to that of mutation E103G on the regioselectivity of ToMO A, but these effects depend on different factors. Indeed, these two mutations leave the stability of the ET
p complex (Table 3) almost unchanged with respect to that of the wild-type enzyme, whereas they decrease the stability of ET
o and ET
m complexes by 1.42 and 0.95 kcal/mol, respectively. Moreover, the values for the individual components of the binding energy indicate that the decrease in the stability of the ET
o complex is due mainly to a loss of van der Waals contacts between the active site pocket and the ring of the substrate (about 0.9 kcal/mol) and that the contribution of the methyl group to the binding energy is not affected by the mutation (the change is less than 0.1 kcal/mol). In contrast, in the case of the ET
p complex, the loss of van der Waals contacts between the active site pocket and the ring of the substrate (about 0.8 kcal/mol) is counterbalanced by an increased contribution by the methyl group of the substrate (about 1 kcal/mol).
As for mutations E103I and E103V, it should be noted that they insert β-branched residues at position 103. This insertion generates clashes between the side chain and the ring of the substrates, thus lowering the stability of all the ToMO-CCI complexes with respect to that of the wild-type enzyme (Tables 3 and 4). The greater destabilization of the ET
p complex than of ET
o and ET
m (Table 3) leads to increased production of o- and m-cresols, whereas slightly different conformations of the valine and isoleucine side chains (data not shown) make the (E103V)-ToMO-ET
m1a complex more stable than the (E103I)-ToMO-ET
m1b complex. These results give a molecular reason for the enhanced production of m-cresol by mutant (E103V)-ToMO with respect to that by mutant (E103I)-ToMO. Moreover, due to the slightly different orientations of the two side chains, mutation E103V decreases the stability of the EX
3,4 complex much more than that of the EX
2,3 complex and mutation E103I has the opposite effect (Table 4). As a consequence, mutation E103V increases the percentage of 2,3-DMP whereas mutation E103I decreases it with respect to that produced by wild-type ToMO. In both cases, the decrease in the stability of the EX
2,3 and EX
3,4 complexes is due to worse accommodation of both the substrate ring and the methyl groups.
The Ala107 side chain, according to our model, contributes to the surface of the ortho site. In agreement with the model, the mutation of Ala107 to Val or Ile, whose side chains fill the cavity of the ortho site, completely abolishes the production of o-cresol and 2,3-DMP.
Modeling intermediates in the case of Ile100 mutant enzymes.
Several mutations at position 100, which defines the para subsite, surprisingly increase the production of m-cresol (Table 5). Replacement with alanine, leucine, phenylalanine, and valine increases the production of m-cresol, whereas replacement with methionine and tryptophan decreases it. It is also interesting that the effects of mutations at positions 100 and 103 are additive, as shown by the behavior of the double-mutant enzymes (I100L, E103G)-ToMO A and (I100V, E103V)-ToMO A. The individual mutations I100V and E103V increase the percentage of m-cresol from 19 to 47 and 34%, respectively, whereas the corresponding double mutant produces 62% m-cresol. Furthermore, mutation I100L increases the percentage of m-cresol from 19 to 38%, whereas mutation E103G decreases it to 6%. In this case, the double-mutant enzyme (I100L, E103G)-ToMO A produces 17% m-cresol. As shown in Fig. 5A, a second hypothetical meta site (the m2 site) may exist between Ile100 and Phe205 (the para subsite in our previous model [4]) which is occupied by the methyl group when CCI 8 (Fig. 4) is docked into the active site. However, the binding of CCI 8 is different from the binding of the other intermediates because the hindrance of the side chains of Ile100 and Phe205 forces the ring of the intermediate into a plane which forms an angle of about 25° with the plane that contains intermediates 5, 7, 9, 10, and 12 (Fig. 6A). Thus, it may be that this different geometry of the wild-type ToMO A-CCI 8 complex—with the aromatic ring out of the plane of the grid in Fig. 2B—makes the complex catalytically unproductive. In fact, this geometrical constraint is the reason that, in the case of the wild-type protein, we have limited our docking analysis to intermediates 5, 7, 9, 10, and 12 only (see above).
|
View this table: [in a new window] |
TABLE 5. Comparison between experimentally determined and calculated percentages of cresol isomers produced by wild-type ToMO and ToMO variants mutated at position 100
|
![]() View larger version (53K): [in a new window] |
FIG. 6. Comparison between the orientation of CCI 10 docked into wild-type ToMO (WT) and the orientations of CCI 8 docked into wild-type and mutant ToMOs. (A) The surface of the cavity of the ToMO-CCI 10 complex is shown as a mesh. is the angle between the ring of CCI 10 docked into wild-type ToMO and CCI 8 docked into wild-type and mutant ToMOs. (B) Correlation between angle shown in panel A and exp(– bE /RT) values. Data for the mutation I100V were not used in the linear fit.
|
m2 (model iii in Fig. 3). Model iii includes three extra
G
values,
G
o-m2,
G
m1-m2, and
G
p-m2, which determine the relative abundance of the ET
m2 complex with respect to total ET
complexes and, hence, the amount of m-cresol produced by the pathway ETm2
ET
m2
enzyme-CCI 8.
As the orientation of CCI 8 inside the active site is different for each mutant protein and different from those of intermediates 5, 7, 9, 10, and 12, the hypothesis that the
G
values depend essentially on the noncovalent bond energy contributions is no longer valid. On the contrary, because of nonoptimal binding of the activated substrate to the cluster, the covalent bond energy component of the ET
m2 complex, bE
m2, should be generally higher than that of the other three ET
complexes, bE
:
![]() | (7) |
![]() | (8) |
![]() | (9) |
bE
= (bE
m2 – bE
)
0.
Therefore, the equilibrium constants which determine the relative abundance of the ET
m2 complex can be expressed as function of the
bE
values:
![]() | (10) |
![]() | (11) |
![]() | (12) |
nbE
m2-p/RT) x exp(–
bE
/RT)]? If not, please clarify relationship between these expressions; should space between them be deleted, or should a chem point or some other symbol be added in place of the multiplication sign?– strategy, we predicted nbE
o, nbE
m1, nbE
p, and nbE
m2 for each mutation at position 100. Then, using equations 1 to 12, we determined the exp(–
bE
/RT) values, which provided predicted cresol percentages similar to the experimentally determined ones. Table 5 shows that, except for mutations I100F and I100W, for each mutation at position 100, a single exp(–
bE
/RT) value exists which yields good agreement between experimental and predicted percentages of products. Moreover, Fig. 6B shows that there is good correlation between these exp(–
bE
/RT) values and the angle
formed between the plane in which intermediates 5, 7, 9, 10, and 12 lie (the plane of the grid in Fig. 2) and the plane where intermediate 8 is located (Fig. 6A). This interesting finding suggests that the energy of the transition state increases with the angle
and further supports the hypothesis that the predicted Monte Carlo-minimized complexes ToMO A-benzene-CCI and ToMO A-CCI 5, 7, 9, 10 or 12 illustrate the geometry of the catalytically productive orientation of aromatic substrates inside the ToMO A active site pocket. Moreover, we want also to underline that our model provides a simple explanation for the additive effects of mutations at positions 100 and 103. In fact, according to our model, the observed percentage of m-cresol is the sum of the percentages of m-cresol produced from toluene docked with its methyl group in subsite m1 and from toluene docked with its methyl group in subsite m2. Moreover, the residue at position 103 prevalently contributes to the m1 site, whereas the residue at position 100 contributes to the m2 site. Thus, the combined effect of a mutation at position 103, which improves interactions at subsite m1, and a mutation at position 100, which partially opens subsite m2, is the production of a percentage of m-cresol close to the sum of the percentages produced by the single mutations. This is the case for the double mutant (I100V, E103V)-ToMO. In the case of the double mutant (I100L, E103G)-ToMO, the effect of mutation E103G, which decreases the ability of the m1 site to anchor a methyl group, is counterbalanced by the effect of mutation I100L on the catalytic efficiency of the m2 site. Consequently, no apparent change in m-cresol production is observed. As for mutations I100F and I100W, the lower level of agreement between predicted and experimental data may be due to the fact that mutations I100F and I100W increase significantly the volume of the side chain at position 100. As described above, the crystallographic structure of the (I100W)-ToMO mutant enzyme (18) shows that the tryptophan side chain points toward the active site cavity, partially hindering it. Most likely, the simultaneous accommodation inside the active site cavity of the bulky side chain of tryptophan—or phenylalanine—and of the substrate may require changes in the conformation of the ToMO A backbone which cannot be predicted by our strategy, as all the docking experiments were carried out by holding the backbone of ToMO A in a rigid position and allowing the movement of only the side chains closer to the cavity.
As for o-xylene, our docking data indicate that there is no mutation at position 100 which allows the positioning of intermediates 11 and 13 in the active site with a conformation similar to those of intermediates 10 and 12 (data not shown). Thus, in the case of o-xylene, there is no need for hypothesizing a kinetic model which takes into account the involvement of an m2 subsite. Consequently, we used kinetic model ii and calculated the data reported in the lower part of Table 4. Also in this case, the predicted regioselectivities of the mutant enzymes were in good agreement with the experimental values.
Regioselectivity on naphthalene.
From a steric point of view, naphthalene can be described as an ortho disubstituted benzene derivative bearing groups larger than methyl groups. Four CCIs can be produced from naphthalene, similar to those shown in Fig. 4 for o-xylene. Our docking data indicate that only intermediates equivalent to CCIs 10, 12, and 13 in Fig. 4 can fit the active site (in Table 6, these complexes are named EN
, EN
β1, and EN
β2, respectively, and their models are shown in Fig. S5 in the supplemental material). However, none of these three intermediates can assume exactly the same orientation as the intermediates derived from benzene, toluene, and o-xylene. This finding is in agreement with the observation that the kcat value of wild-type ToMO for naphthalene is considerably lower than those measured for the more physiological substrates. Data in Table 6 show that in the case of naphthalene, our predictions are only qualitatively correct. In fact, for all the mutant enzymes we have studied, the calculated
-naphthol/β-naphthol ratio was higher than that observed experimentally. However, the model correctly predicts that mutations which reduce the volume of the residues at positions 100 and 176 (Ile and Phe, respectively) increase the percentage of β-naphthol. Ile and Phe side chains likely impair the correct positioning of the naphthalene reaction intermediates corresponding to CCIs 12 and 13 (i.e., the intermediates which form the complexes EN
β1 and EN
β2, respectively).
|
View this table: [in a new window] |
TABLE 6. Comparison between experimentally determined and calculated percentages of naphthol isomers produced by wild-type ToMO and ToMO mutant proteins
|
is not higher than the rate of the hydroxylation reaction, more β-naphthol than the amount predicted by a model based on fast equilibrium events will be formed.
Regioselectivity on polar substrates: the case of phenol.
An intriguing feature of several multicomponent monooxygenases is their specificity in the second hydroxylation step, which produces exclusively catechol from phenol and (di)methylcatechols from cresols and DMPs. It should also be remembered that Tao et al. and Vardar and Wood (34, 35), using random mutagenesis, were able to obtain several ToMO and T4MO mutant forms which produce hydroquinone in different amounts. Mutant (I100Q)-ToMO is particularly interesting, as it produces 80% hydroquinone and only 20% catechol, the physiological product of ToMO. Even if a detailed analysis is beyond the scope of this paper, we have tested our approach with this mutant enzyme and with wild-type ToMO.
We have docked the two possible enantiomeric CCIs of the phenol-to-catechol hydroxylation reaction (CCIs 14 and 15 in Fig. 7) into the ToMO active site on the hypothesis that, like those for methyl groups, subsites for the hydroxyl group should exist.
![]() View larger version (11K): [in a new window] |
FIG. 7. Chemical structures of the CCIs deriving from phenol. CCIs 14 and 15 are the possible intermediates of the phenol-catechol reaction. CCI 16 is the intermediate of the phenol-hydroquinone reaction. The positive charge is delocalized on both the ring and the OH group.
|
Conclusions.
BMMs have broad substrate specificities, coupled with specific regioselectivity properties, in the hydroxylation reaction of aromatic substrates. These features are metabolically relevant, because they are the basis for the capabilities of several microorganisms to grow on selected molecules. Moreover, given the catalytic potentials of BMMs, they may constitute a powerful tool for the bioremediation of harmful substances and may serve as specific biocatalysts in (regio)selective syntheses.
Results from several structural and functional studies suggest that the different regioselectivities of BMMs depend on differences in the shape of the active site pocket (2, 4, 7, 17, 26, 30, 34, 35). However, a detailed description of the molecular basis for the regioselectivities of these enzymes is still lacking. This situation is particularly inconvenient because it impairs the possibility to attempt rational modifications to produce new catalysts and/or new microorganisms endowed with specific, advantageous properties.
In this study, we have developed a procedure based on the docking of the intermediates of the hydroxylation reaction into the active site pocket of a specific monooxygenase, ToMO A. This approach allows for (i) a detailed analysis of the molecular determinants of the enzyme's regioselectivity, (ii) the prediction of the regioselectivity properties of mutant forms of the enzyme, in the absence of any experimental data, and (iii) the prediction of the catalytically productive orientation of a substrate inside the active site pocket. Thus, this procedure is a valuable tool for the design of mutant monooxygenases for use in biosynthesis and bioremediation procedures, and its applicability may also be extended to other kinds of substrates and other multicomponent monooxygenases.
Finally, the results of the docking experiments reported in this paper have very interesting implications for the catalytic mechanism of TMOs. The optimal fit between the ToMO active site pocket and the delocalized carbocation and the good agreement between experimentally determined regioselectivity and the regioselectivity predicted using the delocalized carbocations as ligands strongly suggest that the delocalized carbocation is a crucial intermediate in aromatic hydroxylation reactions.
This work was supported by grants from the Ministry of University and Research (PRIN/2002 and PRIN/2004).
Published ahead of print on 12 December 2008. ![]()
Supplemental material for this article may be found at http://aem.asm.org/. ![]()
Valeria Cafaro and Eugenio Notomista contributed equally to the paper. ![]()
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»