Previous Article | Next Article ![]()
Applied and Environmental Microbiology, October 2003, p. 5772-5781, Vol. 69, No. 10
0099-2240/03/$08.00+0 DOI: 10.1128/AEM.69.10.5772-5781.2003
Copyright © 2003, American Society for Microbiology. All Rights Reserved.
Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering, BioProcess Engineering Research Center, and Bioinformatics Research Center,1 Department of BioSystems, Korea Advanced Institute of Science and Technology, Daejeon 305-701,3 Korea Basic Science Institute, Daejeon 305-333, Republic of Korea2
Received 22 April 2003/ Accepted 15 July 2003
|
|
|---|
|
|
|---|
Recent advances in genomics and transcriptomics have allowed many functional genomic-based approaches to be taken toward understanding global metabolic changes caused by genotypic and/or environmental changes (26, 32, 36, 38, 39). Similarly, proteome profiling can also be employed to examine the changes in the expression levels of many proteins under particular conditions to be compared (3, 29, 39). Even though transcriptome profiling is truly useful, as it can generate a full spectrum of data on the expression levels of all of the genes at the mRNA level, there is growing evidence showing poor correlation between mRNA and protein abundances for a number of genes (2, 14). However, proteome profiling is presently limited by the fact that many fewer proteins than genes can be identified on a two-dimensional (2D) gel. Nonetheless, it is possible to identify protein spots showing altered expression levels, which may help us to understand metabolic and physiological changes and thereby to design metabolic and cellular engineering strategies. Several groups have carried out proteome analysis of E. coli cells overproducing recombinant proteins and have described metabolic changes under these conditions (19, 33). However, the results obtained from these studies have not been extended to actual engineering of cells to achieve enhanced recombinant protein production.
Here we report profiling of the proteome of recombinant E. coli during the overproduction of human leptin, identification of a target gene to be manipulated, and engineering of metabolic pathways to achieve increased leptin productivity. In order to understand how the overproduction of foreign proteins influences the synthesis of critical cellular proteins and thus to identify relevant target genes to be engineered, proteome analyses were performed (i) in the presence of the control plasmid only, (ii) before and after induction, and (iii) on inclusion body (IB) fractions.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Bacterial strains and plasmids used in this study
|
![]() View larger version (12K): [in a new window] |
FIG. 1. Schematic diagrams of plasmids pAC104CysK (A) and pEDIL-12p40 (B). The 966-bp PCR product which encodes the E. coli BL21(DE3) cysK gene was digested with EcoRV and ScaI and cloned into pACYC184 at the EcoRV site to make pAC104CysK. The 918-bp PCR product which encodes mature human interleukin-12 ß chain was digested with AseI and BamHI and cloned into pET21c at the NdeI and BamHI sites to make pEDIL-12p40.
|
Fed-batch culture conditions.
Fed-batch cultures were carried out in a 6.6-liter jar fermentor (Bioflo 3000; New Brunswick Scientific Co., Edison, N.J.) containing 1.8 liters of R/2 medium plus 20 g of glucose per liter. The R/2 medium (pH 6.8) contains, per liter, 2 g of (NH4)2HPO4, 6.75 g of KH2PO4, 0.85 g of citric acid, 0.7 g of MgSO4 · 7H2O, and 5 ml of a trace metal solution. The trace metal solution contains, per liter of 5 M HCl, 10 g of FeSO4 · 7H2O, 2.25 g of ZnSO4 · 7H2O, 1 g of CuSO4 · 5H2O, 0.5 g of MnSO4 · 5H2O, 0.23 g of Na2B4O7 · 10H2O, 2 g of CaCl2 · 2H2O, and 0.1 g of (NH4)6MO7O24. A seed culture was prepared in a 1-liter flask containing 200 ml of R/2 medium. Except for periods when the pH increased due to glucose depletion, it was kept at 6.8 by adding 28% (vol/vol) ammonia water. The dissolved oxygen concentration was kept at 40% of air saturation by automatically increasing the agitation speed to 1,000 rpm and by changing the percentage of pure oxygen. A nutrient feeding solution was added by using the pH-stat (with high limit) feeding strategy (25). The feeding solution contained 800 g of glucose per liter and 20 g of MgSO4 · 7H2O per liter. When the pH rose to a value of 0.08 greater than its set point (pH 6.8) due to the depletion of glucose, the appropriate volume of feeding solution was automatically added to increase the glucose concentration in the culture broth to 0.7 g/liter. Cell growth was monitored by measuring the optical density at 600 nm (OD600) (DU series 600 spectrophotometer; Beckman, Fullerton, Calif.). For the expression of leptin, G-CSF, and interleukin-12 ß chain, cells were induced with 1 mM IPTG at an OD600 of 30 or 90.
One-dimensional gel electrophoresis.
For protein quantification, cells at the same concentrations were harvested by centrifugation at 3,500 x g for 5 min at 4°C. Protein samples were analyzed by electrophoresis on sodium dodecyl sulfate-12% (wt/vol) polyacrylamide gels as described by Laemmli (22). The gels were stained with Coomassie brilliant blue R250 (Bio-Rad, Hercules, Calif.), and the protein bands were quantified with a GS-710 calibrated imaging densitometer (Bio-Rad).
2D gel electrophoresis and peptide mass fingerprinting.
2D gel electrophoresis experiments were carried out with a Protean II xi 2-D cell (Bio-Rad) by procedures described previously (39). Culture broth was centrifuged for 5 min at 3,500 x g and 4°C. The pellet was washed four times with TE solution (10 mM Tris-HCl, 1 mM EDTA [pH 8.0]) and suspended in double-distilled water, followed by four cycles of sonication (each for 10 s at 10% of maximum output with high-intensity ultrasonic liquid processors [Sonics & Material Inc., Newtown, Conn.]). By centrifugation of the cell extract at 10,000 x g and 4°C for 20 min, the supernatant containing soluble proteins and the pellet containing inclusion bodies were obtained. The pellet was washed twice with 1% (vol/vol) Triton X-100 and once with double-distilled water to remove contaminants. After protein quantification by the Bradford assay with bovine serum albumin as a standard (8), protein samples (200 µg) were dried by vacuum centrifugation, resuspended in 340 µl of isoelectric focusing denaturation buffer {9 M urea, 0.5% 3-[(3-cholamidopropyl)-dimethylammonio]-1-propanesulfonate [CHAPS], 10 mM dithiothreitol, 0.2% [wt/vol] Bio-lyte pH 3 to 10, 0.001% [wt/vol] bromophenol blue), and carefully loaded on immobilized pH gradient strips (pH 3 to 10; 17 cm) (Bio-Rad). The loaded immobilized pH gradient strips were rehydrated for 12 h and focused at 20°C for 15 min at 250 V, followed by 60,000 V · h with a Protean isoelectric focusing cell (Bio-Rad). The strips were exchanged in equilibration buffer and then were placed on sodium dodecyl sulfate-12% polyacrylamide gels prepared by the standard protocol (22). Protein spots were visualized with a silver staining kit (Amersham Biosciences, Uppsala, Sweden), and the stained gels were scanned with a GS-710 calibrated imaging densitometer (Bio-Rad). Melanie II software (Bio-Rad) was used to identify spots and to quantify spot densities on a volume basis (i.e., integration of spot optical intensity over the spot area). Matrix-assisted laser desorption ionization-time-of-flight (MALDI-TOF) mass spectrometer analysis was carried out as described previously (15).
|
|
|---|
![]() View larger version (41K): [in a new window] |
FIG. 2. Time profiles of cell growth and recombinant protein production during the fed-batch cultures. (A) Cell densities (OD600) of E. coli BL21(DE3) ( ) and E. coli BL21(DE3)(pACYC184) ( ); (B) cell density (OD600) of E. coli BL21(DE3)(pAC104CysK); (C to H) cell density (OD600) ( ), dry cell weight (DCW) ( ), and recombinant protein content ( ) without (C, E, and G) and with (D, F, and H) cysK coexpression for the production of leptin (C, D, E, and F) and interleukin-12 ß chain (G and H). The dashed lines indicate the time of induction. S1, S2, S3, S4, and S5 are the sampling points for proteome analyses.
|
|
View this table: [in a new window] |
TABLE 2. Proteins identified from 2D electrophoresis
|
S, encoded by the rpoS gene, is a central regulator for the expression of many stationary-phase-responsive genes (23). However, the above-mentioned proteins, except for HdeB, are not regulated by
S. These stationary-phase-responsive proteins are known to be induced under conditions of slow growth (16). Therefore, it seems that the presence of a plasmid itself resulted in slower growth, as shown by the observation that the growth rate of E. coli BL21(DE3)(pEDOb5) was less than half of that of E. coli BL21(DE3) and consequently induced expression of the stationary-phase-responsive proteins. To examine if this is truly due to the presence of a plasmid itself, we carried out pH-stat fed-batch culture of E. coli BL21(DE3)(pACYC184). The apparent specific growth rate was 0.22 h-1, which is less than a half of that of E. coli BL21(DE3) without the plasmid (Fig. 2A). These results suggest that plasmid presence itself can induce stationary-phase-responsive proteins during fed-batch culture. Other than these proteins, the levels of 14 proteins (AceF, NusA, RfaD, PotD, TalB, Bla, HisJ, FliY, AccB, DksA, SgaH, AroK, GreA, and YjgF) increased in the presence of the plasmid (Table 2).
![]() View larger version (108K): [in a new window] |
FIG. 3. 2D gel electrophoresis of samples for Fig. 2. The soluble fractions of samples S1 (A), S2 (B), S3 (C), S4 (E), and S5 (F), as well as the insoluble fraction of sample S3 (D), were analyzed. Identified proteins shown by numbers corresponding to those in Table 2. Proteins showing increased and decreased levels are indicated by circles and rectangles, respectively.
|
Effect of leptin overproduction.
In order to investigate the effect of overproducing leptin on the host cell physiology, we compared the proteomes of S2 and S3 (Fig. 3B, C, and D). Overproduction of leptin altered the levels of 47% (41 proteins) of the total identified proteins (Table 2). In agreement with previous studies (10, 19, 33), the levels of heat shock proteins such as DnaK and GroEL increased upon overproduction of leptin. Also, relatively large amounts of the small heat shock proteins IbpA and IbpB, which are good indicators for the formation of IBs (1, 24), were associated with the IB fraction (Fig. 3B). Production of these proteins is known to be regulated by
32 (12). This indicates that overproduction of recombinant protein (leptin) acts as a stress to the cells. The levels of nine proteins (AcnB, AceF, AtpD, SdhA, MalE, HdhA, AtpC, UspA, and OmpF) increased after leptin production (Table 2).
Twenty-eight proteins were repressed by the overproduction of leptin. The levels of proteins involved in protein synthesis, such as protein elongation factors (EF-Tu and EF-Ts) and 30S ribosomal protein (RpsF), decreased. It was interesting that the protein elongation factor EF-Tu was associated with IBs (Fig. 3D). This coincided with the decreased level of soluble EF-Tu, indicating that the association of EF-Tu with IBs may be one reason for the observed reduction in overall protein synthesis. This in turn shows that the overproduction of recombinant proteins negatively influences the capacity of the cellular translational machinery (37).
We also observed decreased levels of enzymes involved in the biosynthesis of serine family (GlyA and CysK), aromatic family (AroG, TrpA, and TrpD), and branched-chain family (LeuC) amino acids following induction of leptin overproduction. This suggests that the overproduction of leptin leads to an imbalance in free amino acid stocks and translational machinery (5, 13, 31). The levels of enzymes involved in the synthesis of serine family amino acids were most strongly downregulated by leptin overproduction. This can be explained by the fact that leptin contains much more serine (11.6% of total amino acids) than common E. coli proteins do (average of 5.6%). It should be remembered that the presence of a plasmid itself decreased the levels of GlyA and CysK by 40%. Leptin production resulted in a further decrease of their levels by ca. 60%.
Interestingly, the levels of OmpF and MalE were upregulated after overproduction of leptin, while those of GlyA, OppA, LivJ, and LivK were downregulated. All of these proteins are regulated by leucine-responsive regulatory protein (Lrp). Lrp activates transcription of the ompF and malE genes but represses that of the glyA, oppA, livJ, and livK genes (9, 28). Some metabolic pathways affected by Lrp are shown in Fig. 4A. Lrp activates 3-phosphoglycerate dehydrogenase (encoded by serA), which converts the glycolytic intermediate 3-phosphoglycerate to serine, while it represses serine hydroxymethyltransferase (encoded by glyA) and serine deaminase (encoded by sdaA) of the serine degradation pathway. It seems that leptin could not be produced immediately after induction due to the insufficient pool of available serine (Fig. 4A). As time passed, Lrp activated the pathways for the biosynthesis of serine and other amino acids. This led to an increase in the amino acid pool required for protein synthesis and consequently to leptin production that reached the maximum level in 8 h.
![]() View larger version (22K): [in a new window] |
FIG. 4. Metabolic pathway of recombinant E. coli overproducing leptin, quantified from the S3/S2 (A), S4/S2 (B), and S5/S2 (C) protein level ratios. Boldface arrows indicate that the protein levels were increased by more than 1.5-fold, while dotted arrows indicate that the protein levels were decreased to less than 0.6-fold. Some metabolic pathways affected by Lrp are also indicated in panel A. Reactions regulated by Lrp enzymes are marked + and - for positive and negative regulation, respectively. Abbreviations: 6PGL, 6-phosphogluconolactone; 6PG, 6-phosphogluconate; 2K3D6PG, 2-dehydro-3-deoxy-6-phosphogluconate; R5P, ribose-5-phosphate; RL5P, ribulose-5-phosphate; X5P, xylulose-5-phosphate; S7P, sedoheptulose-7-phosphate; E4P, erythrose-4-phosphate; F6P, fructose-6-phosphate; F1,6P2, fructose-1,6-diphosphate; G3P, glyceraldehydes-3-phosphate; DHAP, dihydroxyacetone-phosphate; G1,3P2, glycerate-1,3-diphosphate; 3PG, 3-phosphate-glycerate; 2PG, 2-phosphate-glycerate; PEP, phosphoenolpyruvate; PYR, pyruvate; AcCoA, acetyl coenzyme A; CIT, citrate; ICT, isocitrate; -KG, -ketoglutarate; Suc-CoA, succinyl coenzyme A; SUC, succinic acid; FUM, fumaric acid; MAL, malic acid; OAA, oxaloacetate.
|
Metabolic pathways affected by cysK coexpression.
In order to examine the changes in metabolism caused by cysK coexpression in leptin-producing cells, we analyzed proteomes of S4 (before induction) and S5 (after induction) of the cysK-coexpressing strain (Fig. 2G and H and 3E and F) and compared the profiles with that of S2 (Table 2). Two beneficial effects of cysK coexpression were evident. First, EF-Tu was no longer trapped in the IBs, and consequently more EF-Tu existed in soluble form (Table 2; Fig. 3E and F). This may be due to the increased levels of chaperones, including DnaK, GroEL, and HtpG, which helped correct folding of EF-Tu and prevented trapping of EF-Tu within IBs. The availability of more EF-Tu would have strengthened the protein biosynthesis capacity. Second, metabolic fluxes seemed to be altered to allow more efficient production of leptin. Before induction, the levels of TpiA, GapA, Pgk, and AceF were increased in cysK-coexpressing cells compared with control cells, while that of GpmA decreased. As shown in Fig. 4B, TpiA, GapA, and Pgk catalyze the reactions leading to be formation of 3-phosphoglycerate, which is further converted to serine. Through these metabolic changes, the cysK coexpression activated the serine biosynthetic pathway even prior to induction, which led to immediate production of leptin. After induction, the levels of these proteins were similar in cells with and without cysK coexpression (Fig. 4A and C). In summary, cysK coexpression resulted in the central metabolic pathway fluxes being adjusted to allow efficient production of leptin, a serine-rich protein. In addition to this positive effect, cysK coexpression made cells maintain a high protein biosynthetic capacity. All of these factors contributed to achieve high productivity and specific productivity of leptin. Even though the reasons for this phenomenon could not be explained, these results clearly demonstrate the effectiveness of the approach taken to improve the leptin productivity.
Enhanced productivity of another serine-rich protein by cysK coexpression.
To examine whether cysK coexpression can increase the specific productivity of other serine-rich proteins, interleukin-12 ß chain (11.1% serine) was selected as another model protein. The amino acid compositions of E. coli proteins show no dramatic deviations from those in the complete protein sequence database, with leucine and alanine being most abundant and cysteine and tryptophan being least abundant (20). We therefore selected G-CSF (18.84% leucine and 11.59% alanine) as a control protein having an amino acids composition similar to that of E coli proteins.
As expected, production of interleukin-12 ß chain was improved by cysK coexpression, while production of G-CSF was not affected. When recombinant E. coli BL21(DE3)(pEDIL-12p40) was induced at an OD600 of 30, the dry cell weight and the maximum interleukin-12 ß chain content reached at 7 h after induction were 23 ± 0.7 g/liter and 9% ± 0.5% of the total protein, respectively (Fig. 2E). On the other hand, when BL21(DE3)(pEDIL-12p40)(pAC104CysK) was induced at an OD600 of 30, the dry cell weight and the maximum interleukin-12 ß chain content at 2 h after induction were 24 ± 0.8 g/liter and 8% ± 0.3% of the total protein, respectively (Fig. 2F). The maximum G-CSF content (35% of the total protein) was reached at 2 and 3 h after induction with and without cysK coexpression, respectively. Therefore, cysK coexpression increased interleukin-12 ß chain productivity by threefold. These results suggest that cysK coexpression may improve production of any serine-rich protein.
Improved cell growth by cysK coexpression during high-cell-density culture.
Consistent with other studies (4, 7), the introduction of the plasmid caused changes in the levels of cellular proteins and adversely affected cell growth. Interestingly, it was found from proteome profiling that the levels of proteins involved in the biosynthesis of serine family amino acids decreased during the high-cell-density culture of E. coli BL21(DE3). If this is one of the factors negatively affecting cell growth, cysK coexpression should cause recovery of the growth rate. Therefore, we compared the growth rates of wild-type E. coli BL21(DE3), E. coli BL21(DE3)(pACYC184), and E. coli BL21(DE3)(pAC104CysK)(Fig. 2A and B). The presence of the plasmid decreased the growth rate by 50% compared with that of the wild-type strain. Surprisingly, cysK coexpression restored the growth rate of the plasmid-carrying strain to nearly that of the wild type, indicating that the reduced biosynthesis of serine family amino acids caused a lower growth rate during the high-cell-density culture of E. coli BL21(DE3). To see if this is a universal phenomenon in E. coli, we carried out high-cell-density cultures of E. coli W3110 harboring pACYC184 or pAC104CysK. It was found that cysK coexpression did not increase the specific growth rate of recombinant W3110. Therefore, the beneficial effect of cysK coexpression on the growth rate may be specific for E. coli BL21(DE3).
In summary, the growth of recombinant BL21(DE3) and leptin production could be enhanced by cysK coexpression. Coexpression of the cysK gene could increase the biosynthetic flux of serine family amino acids and indirectly repress EF-Tu aggregation by inducing the expression of heat shock proteins, leading to improved cell growth and a three- to fourfold increase in the productivity of serine-rich recombinant proteins. It should be mentioned that the high productivities of serine-rich proteins achieved by cysK coexpression are not solely due to the improved cell growth. As seen above, the growth rate was increased by approximately twofold by cysK coexpression, while the productivities of recombinant proteins increased by three- to fourfold. This is the first report on improving recombinant protein productivity by engineering the metabolic pathways based on the results of proteome analysis. Consequently, we propose a strategy for the rational engineering of metabolic pathways and cellular properties based on the results of proteome profiling. The procedure is to (i) obtain the proteome profiles of E. coli (or recombinant E. coli) under different conditions of interest, (ii) identify potentially limiting enzymes in the biosynthetic pathways, (iii) examine theoretically and/or experientially the possible flux changes that can be achieved by amplifying (or knocking out) the activities of the enzymes identified, (iv) select the final candidate enzymes to be amplified (or knocked out), (v) examine the effects of this metabolic and cellular engineering on achieving the desired objectives, and (vi) repeat steps ii to iv until the objectives are accomplished. This strategy may be extended beyond serine-rich proteins to increase the yield and productivity of other recombinant proteins in industrial bioprocesses.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»