Applied and Environmental Microbiology, February 2003, p. 769-778, Vol. 69, No. 2
0099-2240/03/$08.00+0 DOI: 10.1128/AEM.69.2.769-778.2003
Copyright © 2003, American Society for Microbiology. All Rights Reserved.
Elizabeth A. Edwards,1 Steven N. Liss,2 and Roberta Fulthorpe3*
Department of Chemical Engineering and Applied Chemistry, University of Toronto,1 Department of Chemistry, Biology, and Chemical Engineering, Ryerson University, Toronto,2 Division of Physical and Environmental Sciences, University of Toronto, Scarborough, Ontario, Canada3
Received 30 April 2002/ Accepted 7 November 2002
DNA microarrays have primarily been used in medical research to investigate gene expression patterns in eukaryotic cells such as human or yeast cells for which mRNA extraction, purification, and cDNA synthesis protocols are well established. The majority of prokaryotic microarray studies, however, have been focused on the genome of a single organism (17, 32), often Escherichia coli, which is of limited applicability to the complex microbial ecosystems found in wastewater treatment systems, soils, and groundwaters. The use of DNA microarrays for the monitoring of prokaryotic gene expression, especially in mixed communities, is less developed in part due to inherent difficulties related to extracting bacterial RNA and priming cDNA synthesis from bacterial mRNA that lacks a polyadenylated tail. Nonetheless, the use of microarrays for the detection of prokaryotic gene expression (4, 13, 23) and the quantitation of bacterial DNA (3) has been demonstrated. Arbitrary primers (7), random hexanucleotide primers (33), and species-specific C-terminal directed primers (6) have all been used with some success.
We describe here the manufacture and testing of a prototype DNA microarray composed of known microbial catabolic and metabolic genes from a variety of organisms. Our aim was to establish the methodology, sensitivity, and applicability of this technique for measuring gene expression in a complex environment; in particular, our focus was the study of pulp and paper waste water treatment systems. Maintaining good performance of these biological wastewater treatment systems is relatively difficult: contaminant removal rates can vary significantly between and within systems (19). Recently, the importance of analytical tools to better understand biological wastewater treatment processes, particularly for monitoring and modeling, has been emphasized (30). Although it is well established that different microbial species exist in different biological wastewater treatment systems, no clear correlation exists between species identity and system performance. Analysis of gene expression patterns will allow us to tease out the environmental factors that significantly impact the induction and repression of metabolic functions within a meaningful context independent of culture-based methods. This can lead to more enlightened approaches to the optimization of bioreactors and wastewater treatment systems. Furthermore, this technology could lead to advances in novel gene identification through the detection of differential gene expression under specific environmental conditions. Before the benefits of this technology can be realized, however, effective methods for mRNA extraction from complex systems, cDNA labeling, hybridization, and data standardization must be demonstrated. Moreover, the limits of our detection must also be understood.
Our first prototype microarray was composed of 64 genes from a number of organisms. The purpose of the present study was to demonstrate the feasibility of detecting gene expression patterns in wastewater by using DNA microarrays and to establish current detection limits.
TABLE 1. Genes on prototype DNA microarray
TABLE 2. PCR conditions used to amplify genes used on prototype microarray
RNA extraction and purification.
RNA was extracted from pure and mixed cultures by using Trizol LS Reagent (Life Technologies, Gaithersburg, Md.) according to the manufacturer's instructions. Liquid microbial cultures were centrifuged in 15-ml disposable tubes at 1,700 x g for 5 min to pellet the cells. Activated sludge RNA was also extracted by using Trizol and further purified with an RNeasy spin column (Qiagen). RNA was quantified by spectrophotometry at 260 nm. Treatment of RNA with RNase-free DNase (Roche) for 15 min at 37°C in 10 mM Tris (pH 7.5) and 10 mM MgCl2 had no significant impact on the microarray signal strength, confirming that RNA, not residual DNA, was the predominant template in labeling reactions.
Reverse transcription and labeling.
Total extracted bacterial RNA was labeled with a cyanine dye (either Cy3 or Cy5) in an "indirect" process by a modification of the two-step labeling method available from the University Health Network Microarray Centre (www.uhnres.utoronto.ca/services/microarray). First, cDNA was synthesized from RNA in a reverse transcription reaction mixture containing modified aminoacyl-dUTPs. After purification, the cDNA was labeled in a chemical reaction in which monofunctional cyanine dyes bind to the aminoacyl dUTPs. In comparison to "direct labeling" (in which cyanine-labeled dNTPs are used directly in the reverse transcription reaction), indirect labeling avoids problems of differential incorporation of the Cy3- and Cy5-labeled dNTPs, results in lower background fluorescence, and is more sensitive. We observed at least a twofold increase in fluorescence intensity with the indirect method versus the direct method; consequently, the indirect method was adopted for all experiments. The labeling procedure was carried out as follows. Bacterial RNA (0.5 to 10.0 µg) was combined in 1x first strand buffer (6.0 µl), random hexamer (4.0 µl), 10.0 mM dithiothreitol, 500 µM dNTP mix (dATP, dCTP, and dGTP), 150 µM dTTP (Life Technologies), and 150 µM aminoacyl-dUTP (Sigma, St. Louis, Mo.) in a final reaction volume of 40.0 µl. The mixture was incubated at 25°C for 5 min, after which 400 U of Superscript II reverse transcriptase (Life Technologies) was added, and incubation was then continued at 25°C for another 10 min. The reaction was warmed slowly to 37°C in an air incubator for 5 min and then transferred to a 42°C heating block for 2 h. After incubation, reverse transcriptase was inactivated by heating to 95°C for 5 min, followed by cooling on ice. NaOH (167 mM) was added, and the reaction was heated to 65°C for 15 min to degrade RNA, after which the solution was neutralized with 148 mM HCl and 70 mM Tris (pH 7.5).
cDNA purification.
The reverse transcription reaction volume was increased to 100 µl with sterile distilled water, and the cDNA was purified by using a QiaQuick spin column according to the manufacturer's instructions, except that the elution step was carried out twice with 50 µl of H2O for 5 min. cDNA was precipitated with 0.3 M sodium acetate (pH 4.8), glycogen (0.2 g/liter; Life Technologies), and 1 volume of 100% ethanol, followed by incubation at -70°C for 30 min and centrifugation for 10 min at top speed in a microcentrifuge (Eppendorf 5417C) at 4°C. The pellet was washed briefly with ice-cold ethanol (70%), centrifuged for 5 min at full speed, and air dried briefly.
cDNA labeling with reactive cyanine dyes.
After precipitation, cDNA was resuspended in 5.0 µl of H2O and then heated for 1 min at 42°C to dissolve the DNA. Then, 3.0 µl of dye solution was added to the cDNA solution, followed by mixing with a pipette. The dye solution consisted of 2.0 µl of either the Cy3 or the Cy5 monofunctional reactive dyes (Amersham Pharmacia, Baie d'Urfé, Quebec, Canada) mixed with 2.0 µl of 100% dimethyl sulfoxide in 0.3 M sodium bicarbonate (pH 9.0). Tubes with cDNA and dye solution were incubated for 1 h in the dark at room temperature to allow the chemical labeling reaction to proceed. In this reaction, the monofunctional dyes bind to the aminoacyl dUTP nucleotides previously incorporated during the reverse transcription reaction.
Purification of labeled cDNA.
After monofunctional dye labeling, the reaction volumes were increased to 100 µl and cDNA was purified by using QiaQuick spin columns according to the manufacturer's instructions except that washing was performed three times with 75% ethanol and elution was performed three separate times with 50 µl of elution buffer. After elution, cDNA was precipitated as described above, and the pellets were resuspended in 2.5 µl of water for use in hybridization.
Hybridization of cDNA to DNA microarrays.
A 37.0-µl mixture containing 30 µl of DIG Easy Hyb buffer (Roche), 1.0 µl of a 10.0-µg/µl mixture of salmon sperm DNA, 1.0 µl of yeast tRNA (Sigma), 2.5 µl of Cy3-labeled cDNA (from sample 1), and 2.5 µl of Cy5-labeled cDNA (from sample 2) was placed on a 24-by-30-mm coverslip (Corning). DNA microarrays were touched to the drop of hybridization mixture on the coverslip and quickly inverted. Microarrays were incubated overnight at 37°C in sealed plastic microscope slide boxes supported over DIG Easy Hyb buffer to maintain humidity. After hybridization, the arrays were washed three times in 650 ml of 0.1x SSC-0.1% sodium dodecyl sulfate, followed by three washes in 0.1x SSC at 22 to 35°C. Slides were immediately dried by centrifugation in microscope slide boxes lined with filter paper at 46 x g for 5 min.
Scanning of arrays and data analysis.
Arrays were scanned at excitation wavelengths of 532 and 635 nm to detect the Cy3 and Cy5 dyes, respectively, by using a GenePix 4000A microarray scanner (Axon, Foster City, Calif.). Typically, each microarray was scanned at two photomultiplier tube gain settings for analysis. Data were corrected for background attributed to nonspecific binding of the probes to the glass slide and the arrayed genes by using GenePix Pro 3.0 software (Axon).
Signal standardization.
Variability in signal intensity within and between arrays can result from differences in cDNA probe concentrations, RNA quality and quantity, labeling efficiency, scanner settings, different fluorescent properties of the dyes, hybridization conditions, and other random sources of variation. In order to make comparisons between experimental treatments on a given array or between different arrays, signals need to be standardized for all of these parameters. We compared different methods of standardization (see Results).
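One of the standardization schemes, the one named in the legend to Fig. 2, scales each channel by total microarray fluorescence while leaving ribosomal and tfd genes out of the total. The sketch below illustrates that idea only; the function name, the spot names, and all fluorescence values are hypothetical and are not taken from the study.

```python
def standardize_by_total(signals, excluded=()):
    """Scale spot signals by the summed signal of the non-excluded spots.

    signals:  dict mapping spot name -> background-corrected fluorescence
    excluded: spot names left out of the reference total
              (for example ribosomal genes or strongly induced genes)
    """
    total = sum(v for name, v in signals.items() if name not in excluded)
    if total <= 0:
        raise ValueError("reference total must be positive")
    return {name: v / total for name, v in signals.items()}


# Hypothetical fluorescence values, for illustration only.
cy3 = {"tfdA": 900.0, "rpoN": 200.0, "amoB": 300.0, "16S": 5000.0}
cy5 = {"tfdA": 100.0, "rpoN": 400.0, "amoB": 600.0, "16S": 9000.0}
excluded = {"tfdA", "16S"}

cy3_std = standardize_by_total(cy3, excluded)
cy5_std = standardize_by_total(cy5, excluded)

# Ratio of standardized signals for each spot (Cy3 over Cy5).
ratios = {gene: cy3_std[gene] / cy5_std[gene] for gene in cy3}
print(ratios)
```

In this made-up example the Cy5 channel is uniformly twice as bright as the Cy3 channel for the reference spots, so standardization brings their ratios to about 1 while the excluded, induced spot stands out.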
Experiment 1: detection of tfd genes in pure culture.
Parallel overnight cultures of JMP134 were grown at 30°C with shaking on minimal medium with 6.6 mM pyruvic acid as a carbon source. Total RNA was extracted from 50-ml subsamples from both cultures at time zero. Immediately after a sample was taken for the t = 0 h time point, one culture received 6.6 mM pyruvic acid (noninduced) while another received 2.8 mM 2,4-D (induced). RNA was extracted at 4, 7, and 24 h postinduction from both cultures. The control (noninduced) culture at each time point was labeled with Cy5, whereas the induced culture was labeled with Cy3. Four microarrays, one for each time point, were analyzed.
Experiment 2: detection of tfd genes in mixed cultures.
Four unidentified isolated organisms derived from a laboratory-scale sequencing batch reactor treating pulp mill effluent (29) were used to construct an artificial mixed microbial culture. Each of the isolates grew as distinctive colonies on agar containing glucose and could be easily distinguished from JMP134. This mixed culture was grown in liquid culture on minimal medium supplemented with 1.4 mM glucose overnight at 30°C with shaking. An overnight culture of JMP134 was also grown in minimal medium with 1.4 mM glucose as the carbon source. The JMP134 was serially diluted into four 50-ml cultures that were mixed with the constructed culture in a ratio of 1:1 such that the final JMP134 populations were 3.7 x 10^6, 3.7 x 10^5, 3.7 x 10^4, and 3.7 x 10^3 cells/ml and the constructed culture population was 1.0 x 10^8 cells/ml. In addition to these four dilution cultures, the pure culture of JMP134 and the constructed mixed culture without JMP134 were also included in this experiment. Two parallel sets of these six cultures were prepared. One set was amended with 2.0 mM 2,4-D (induced), whereas the other was amended with 1.4 mM glucose (noninduced). RNA extractions from both noninduced and induced cultures were performed at 6 h postinduction. Plate counts with minimal medium agar supplemented with 5 mM 2,4-D as the sole carbon source were performed at the time of RNA extraction to confirm the density of JMP134 in each culture. Six microarrays were used for this experiment. Labeled cDNA from 2,4-D (Cy3) and glucose-amended (Cy5) parallel cultures were compared on the same microarray.
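To make the scale of the dilution series concrete, the following minimal sketch recomputes the four JMP134 densities as a tenfold series and expresses each as a fraction of the total cells in the spiked culture. The function name is hypothetical; the densities are the ones stated above.

```python
def tenfold_series(start, steps):
    """Return `steps` densities, each one-tenth of the previous one."""
    return [start / 10 ** i for i in range(steps)]


background = 1.0e8                    # constructed mixed culture, cells/ml
jmp134 = tenfold_series(3.7e6, 4)     # JMP134 densities, cells/ml

for density in jmp134:
    fraction = density / (density + background)
    print(f"{density:.1e} cells/ml -> {fraction:.2e} of all cells")
```

At the lowest density, JMP134 makes up roughly 1 cell in 27,000, which is the abundance at which induction has to be detected against the background community.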
Experiment 3: detection of resin acid degradation (tdt) genes in mixed pulp mill bioreactor cultures.
A sample of untreated (primary) pulp mill effluent from a Kraft mill in Cornwall, Ontario, Canada, was inoculated with a bioreactor sludge sample and grown for 16 h with stirring and aeration. The inoculum (sludge sample) was taken from a bench-scale sequencing batch reactor fed pulp and paper mill wastewater described by Tripathi and Allen (29). At time zero this activated sludge culture was divided into two cultures that were amended either with 0.5 mM dehydroabietic acid (DHA; a resin acid) or with a 0.8 mM concentration of cellobiose as carbon sources. RNA was extracted from 7.5 ml of each culture by using Trizol or Trizol plus a Qiagen RNeasy purification step at 0, 3, 6, and 24 h after carbon source amendment. For each time point, extracted RNA from the cellobiose-amended culture was labeled with Cy3, and that from the DHA-amended culture was labeled with Cy5; both were hybridized onto a series of four microarrays, one for each time point.
Experiment 4: comparison of specific primers for cDNA to random hexamer primers for cDNA synthesis.
The potential for primers directed toward specific genes to increase the sensitivity of gene expression detection compared to nonspecific random hexamer priming was tested with an undefined mixed culture spiked with JMP134. Minimal medium (50 ml) with 1.4 mM glucose was inoculated with the constructed mixed culture and grown overnight at 30°C with shaking. An overnight culture of JMP134 was also grown in the same manner. At time zero, the mixed culture was subdivided into six subcultures, which were made up to 100 ml and amended with 2.0 mM 2,4-D (cultures 1 and 3 to 6) or 1.4 mM glucose (culture 2). Cultures 3, 4, 5, and 6 were spiked with 10 µl, 100 µl, 1 ml, and 10 ml of the JMP134 culture, respectively. Plate counts of the JMP134 spike culture and all of the cultures at the time of RNA extraction were performed on both 2,4-D agar and plate count agar. A mix of seven specific primers was hybridized to mRNA during the cDNA synthesis step in the indirect labeling approach utilizing Cy5 as described above. These primers were directed to carAB (5'-TCAGGATCCTTTCAGCCCGAAACGTGC-3'), rpoN (5'-GGTGGACACGTGGCTGCCGAAGAAGTA-3'), pdhA (5'-GCACGTACTTCTGGCCTTCTTGGTTCC-3'), manA (5'-GCTCCAAAATTAGTGAAATTGC-3'), tfdA (5'-ACGGAGTTCTGYGAYATG-3'), tfdB (5'-ATAGCGGTGRTTCATYTC-3'), and limC (5'-CGAGGATTGACAGGTTGTAGCT-3'). cDNA was also produced by using random hexamer-primed reverse transcription labeled with Cy3. The Cy3-labeled random-primed and the Cy5-labeled specific-primed cDNAs were then hybridized to microarrays to compare the signal intensities of the specifically primed genes with those obtained for the same genes by random priming.
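The tfdA and tfdB primers contain the IUPAC degeneracy codes Y (C or T) and R (A or G), so each is in practice a pool of sequences. The sketch below, with a hypothetical helper name and a lookup table limited to the codes that occur in these two primers, enumerates the individual sequences in each pool.

```python
from itertools import product

# Subset of the IUPAC nucleotide codes: only those used in the primers above.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "R": "AG", "Y": "CT"}


def expand_primer(seq):
    """Return every non-degenerate sequence encoded by a degenerate primer."""
    return ["".join(bases) for bases in product(*(IUPAC[b] for b in seq))]


tfdA_variants = expand_primer("ACGGAGTTCTGYGAYATG")
tfdB_variants = expand_primer("ATAGCGGTGRTTCATYTC")

for name, variants in (("tfdA", tfdA_variants), ("tfdB", tfdB_variants)):
    print(name, len(variants))
    for v in variants:
        print("  5'-" + v + "-3'")
```

Each primer has two degenerate positions with two possible bases, so each expands to four sequences.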
Distribution of signal data.
Fluorescence levels from all DNA spots after hybridization with cDNA derived from control (noninduced) cultures were not normally distributed, nor were the ratios of data from each fluorophore (Fig. 1). However, a log2 transformation of the ratios was sufficient to normalize the ratio data. Because a significant number of ratios were at or close to zero, we used a log2(x + 1) transformation [the log(x + 1) transformation is commonly used to normalize data that exhibit a positively skewed distribution and is necessary when zero values of x are common] (2, 35). Log10 transformations are normally used, but we used log2 here so that transformed values largely reflect the degree of expression increase of a treated culture over control. Thus, a ratio of 1 gives a value of 1, whereas a ratio of 2 gives a value of 1.6, and a ratio of 3 gives a value of 2. Beyond this point, the log2(x + 1) value becomes increasingly similar to log2(x), and the y axis can be read as the fold increase.
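The worked values quoted above can be reproduced with a few lines of code. This is a minimal sketch of the log2(x + 1) transformation only; the function name and the list of example ratios are illustrative and not part of the study.

```python
import math


def log2_ratio_plus_one(ratio):
    """Return log2(ratio + 1) for a non-negative fluorescence ratio."""
    if ratio < 0:
        raise ValueError("fluorescence ratios are expected to be non-negative")
    return math.log2(ratio + 1.0)


example_ratios = [0, 1, 2, 3, 7, 15]
transformed = [round(log2_ratio_plus_one(r), 2) for r in example_ratios]
print(transformed)  # [0.0, 1.0, 1.58, 2.0, 3.0, 4.0]
```

A ratio of zero maps to zero rather than to negative infinity, which is why the +1 offset is needed when many ratios are at or near zero.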
FIG. 1. Frequency distribution of data from a typical microarray experiment. The number of DNA spots giving signals within the intervals indicated on the x axes (raw fluorescence for each fluor, the ratio of Cy3 to Cy5 fluorescence, and the log2(ratio + 1)) are shown. There is a progressively better fit to normal distribution, as indicated by the Shapiro-Wilk statistic. The probability that a distribution differs from normality is indicated on each of the plots; only the log-transformed ratio did not differ significantly from normal.
FIG. 2. Gene expression in a pure culture of R. eutropha JMP134 measured at 0, 4, 7, and 24 h after induction with 2,4-D. The y axis shows the log2(ratio + 1), where the ratio is equal to the standardized fluorescence for induced culture divided by the standardized fluorescence for the control culture. Data were standardized on the basis of total microarray fluorescence, excluding ribosomal and tfd genes prior to averaging of replicate data. (A) Induction of JMP134 tfd genes; (B) induction of tfdA genes of different lengths and similarities to JMP134; (C) induction of tfdC-like genes; (D) expression patterns of non-tfd genes. The induction signal drops for all genes at 24 h as 2,4-D is depleted from the media.
Experiment 2: detection of tfd genes in mixed cultures.
The induction of the various full-length tfd genes was detected at JMP134 populations from 10^3 to 10^7 cells/ml in a mixed culture of 10^8 cells/ml (Fig. 3A). Induction of the tfdA gene was statistically detectable in JMP134 populations as low as 3.7 x 10^3 cells/ml (Student t test compared to control, P < 0.0197) (Fig. 3A, top panel). However, the signals for tfdA genes in the control cultures were unusually low (close to or below background) in this experiment, so this result was perhaps atypical. Detection of the tfdA genes was more significant above 3.7 x 10^4 cells/ml (P < 0.0002). The lowest significant detection of the tfdC gene occurred at 3.7 x 10^5 cells/ml (P < 0.01782). For tfdB and tfdE genes, the lowest detection levels were 3.7 x 10^6 cells/ml (P < 0.00004 and <0.00003, respectively). Results for tfdF are obscured because of high variability. Detection limits with different tfdA gene fragments and homologues were similar (Fig. 3B). Detection of the tfdA 300-bp PCR fragments from JMP134 and TFD41 was significant at 10^6 (P < 0.0239) and 10^5 cells/ml (P < 0.0357), respectively (Fig. 3B, middle panels). High variation in signals from the low similarity tfdA from RASC allowed no induction detection at any population (Fig. 3B, bottom panel). Detection of the tfdC homologues was significant only for the gene amplified from the chlorobenzoate degrader HH83 at 10^6 cells/ml (P < 0.001) but not for CLAB3 and WV71 (Fig. 3C). No induction was observed for control genes (not induced by 2,4-D [data not shown]). Overall, detection of tfd gene induction at populations of 10^5 cells/ml in a background of 10^8 cells/ml of other bacteria is achievable. Lower detection limits may be possible with longer gene fragments that are 100% similar, if backgrounds are low.
Since a great deal of sequence variation typically exists within catabolic gene families, it will be important for researchers to include as many sequence variants as possible for a given function.
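The detection limits in this section rest on Student t tests comparing spiked cultures with the control. As a rough illustration of that kind of comparison, the sketch below computes a pooled-variance two-sample t statistic from replicate log2(ratio + 1) values. The replicate values are hypothetical, and the text does not state whether a pooled-variance or unequal-variance form was used, so the pooled form here is an assumption.

```python
import math
from statistics import mean, variance


def student_t(sample_a, sample_b):
    """Return (t, degrees of freedom) for a pooled-variance two-sample t test."""
    na, nb = len(sample_a), len(sample_b)
    if na < 2 or nb < 2:
        raise ValueError("each sample needs at least two replicates")
    df = na + nb - 2
    pooled_var = ((na - 1) * variance(sample_a)
                  + (nb - 1) * variance(sample_b)) / df
    std_err = math.sqrt(pooled_var * (1 / na + 1 / nb))
    return (mean(sample_a) - mean(sample_b)) / std_err, df


# Hypothetical replicate-spot values of log2(ratio + 1), illustration only.
spiked = [1.9, 2.1, 2.0, 2.2]
control = [1.0, 1.1, 0.9, 1.0]

t_stat, df = student_t(spiked, control)
print(f"t = {t_stat:.2f}, df = {df}")
```

The resulting statistic would then be compared with the t distribution for the given degrees of freedom (the two-sided 5% critical value for 6 degrees of freedom is about 2.45) to obtain a P value of the kind reported above.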
FIG. 3. Gene expression in a bioreactor community spiked with JMP134 at different concentrations and induced with 2,4-D. The y axes are as described for Fig. 2. RNA was sampled at 6 h postinduction from induced and noninduced cultures. The control sample (bars labeled C) was the total bioreactor community population (10^8 cells/ml) with no JMP134 added. The populations of JMP134 were 3.7 times the amounts shown on the x axis, where the highest population was from a pure culture. (A) Induction of JMP134 tfd genes; (B) induction of tfdA genes of different lengths and similarities to JMP134; (C) induction of tfdC-like genes.
Experiment 3: detection of resin acid degradation genes in mixed pulp mill bioreactor cultures.
The resin acid DHA is a substance commonly found in pulp mill effluents (21, 34). The addition of DHA to pulp mill effluent bioreactor cultures led to slight but highly statistically significant increases in the expression of resin acid degradation genes tdtA (P < 0.0003), tdtB (P < 0.00014), and tdtL (P < 0.000014) (Fig. 4). There was no increase (after 1 day) of the regulatory tdtR gene, and there were no significant differences between time points in the levels of rpoN, amoB, or the overall average spot signal intensity ratios for the array.
FIG. 4. Detection of resin acid degradation genes in a pulp and paper treatment community spiked with DHA. tdtA, tdtB, and tdtL are functional resin acid degradation genes; tdtR is a regulatory gene. rpoN is a sigma factor protein gene (not expected to change), and amoB is the gene for ammonia oxidase small subunit (not expected to change). The average data for the whole microarray are also indicated. The y axis is as described for Fig. 2. Shown are bars giving the log2 ratio of DHA- to glucose-grown culture signals plus the standard deviations of the ratio (n = 4). The first (solid) bar of each pair indicates the time zero level; the second (shaded) bar is the culture sampled after 25 h. An asterisk indicates significant changes in expression at P < 0.0003.
TABLE 3. Ratios of specifically primed signals to randomly primed signals
Clearly, a number of challenges remain to be overcome before microarray technology can be applied most effectively to monitoring gene expression in complex microbial ecosystems. These include increasing the sensitivity (limit of detection) of the procedure, decreasing the amount of time required for performing each analysis (now 4 days), and increasing the number of microbial genes per array. Increasing the number of genes per array could be approached more easily through the use of oligonucleotide probes instead of the PCR products used in the present study. Oligonucleotide probes would obviate the need to obtain genetic material in the form of cloned genes or cultured isolates; instead, short regions of published gene sequences could simply be synthesized. Kane et al. (16) demonstrated that 50-mer oligonucleotides were useful as probes on DNA microarrays. An alternate approach to the use of known genes would be the production of libraries of genes derived from genomic DNA or cDNA from a specific environment (for example, a wastewater treatment system). Such an array could then be used in the identification of previously unknown genes active under various conditions; in addition, it might be possible to correlate gene expression patterns with phenotypic properties, which could be useful in monitoring the efficiency and health of treatment systems. Despite the challenges, monitoring in situ microbial gene expression in wastewater systems appears feasible; future studies will focus on optimizing the approach and more thoroughly assessing the capabilities of this promising technology.
We thank Pascale Macgregor, Ronit Andorn-Broza, Fred Betterman, and Jing Sung for technical assistance. We especially thank Neil Winegarden and others at the Microarray Centre at the University Health Network. We also thank the numerous researchers who sent genetic material used in the present study, including P. Barbieri, B. J. Berger, A. Chakrabarty, A. Cook, E. Diaz, R. Eaton, H. Engesser, D. Gibson, P. Hallenbeck, C. Harwood, B. Hedlund, J. Heider, W. Hillen, P. Hoffman, T. C. Huang, Y. Katayama, F. Kunst, A. Kolsto, K. Inatomi, G. Lloyd-Jones, S. Kaplan, A. Kulakov, T. M. Louie, H. Matusaki, H. Mori, C. Murrell, Y. Nagata, H. Nojiri, T. Omata, R. Parales, D. H. Pieper, A. Sorokin, H. Saeki, B. Witholt, R. Wittich, T. Wood, S. Vuilleumier, I. Yamamoto, G. Zylstra, and the late R. Cam Wyndham.
Present address: GeoSyntec Consultants, Guelph, Ontario N1G 5G3, Canada.