Previous Article | Next Article ![]()
Applied and Environmental Microbiology, April 2009, p. 2414-2422, Vol. 75, No. 8
0099-2240/09/$08.00+0 doi:10.1128/AEM.02270-08
Copyright © 2009, American Society for Microbiology. All Rights Reserved.
,
Laboratory of Microbiology, Fungal Genomics Group, Wageningen University and Research Centre, Dreijenplein 10, Building 316, 6703 HB Wageningen, The Netherlands
Received 2 October 2008/ Accepted 11 February 2009
|
|
|---|
|
|
|---|
Recent technologies such as global transcriptome analysis by DNA microarrays or quantitative real-time PCR (qPCR) require the use of replicate biological samples for high-quality data. Given the difficulties in culturing A. niger, obtaining transcript data without wasting resources requires proper experimental design. The key to designing such experiments is to determine how much replication is needed—the sample size (36). A larger amount of replicates leads to increased statistical accuracy of measurement, whereas insufficient replication impedes data analysis. The required number of independent samples depends on a variety of factors as follows: the organism under study and its biological variation, the magnitude of the expected gene response, the statistical power to detect the genuine gene response to a condition, and the false discovery rate (13, 36).
Consensus is emerging on what comprises a "proper microarray experiment" (13). According to this consensus, defining the experimental objectives and requirements is a necessity before actually starting an experiment (39). During the experiment, standardized protocols and methods reduce the variability that is introduced in each process step (27). A selected experimental design makes data analysis and interpretation as simple and as powerful as possible. Finally, for publication of the microarray results and data, the minimum information about a microarray experiment (MIAME) guidelines (4) are adopted. Notwithstanding this consensus, Jafari and Azuaje have published an extensive review on papers describing microarray methods and applications of microarray technology (15) and concluded that recently published gene expression data analysis studies often lack key information that is required to interpret and evaluate published data.
The goal of the present study was to minimize the variation in A. niger batch fermentations by optimization of protocols and procedures. The variation between fermentations was determined with an analysis of variance components of data obtained by qPCR. This relatively inexpensive technology is used to measure transcript levels for few genes in many samples simultaneously. Furthermore, qPCR is routinely used to validate microarray results (11, 21).
The effects of the optimization and quality control measures for our experimental setup were assessed by examination of the global transcriptional response toward induction with D-xylose. The xylanolytic system of A. niger is under the control of the transcriptional activator XlnR, and the genes under its control are well documented (34). Recently, the transcriptional response toward D-xylose was examined by microarray analysis for three Aspergillus species (2). The availability of these data on the transcriptional response toward D-xylose allows for validation of the biological response observed during our studies.
|
|
|---|
argB pyrA6 prtF28 goxC17 cspA1) is derived from CBS 120.49. All media were based on minimal medium (25), contained 100 mM sorbitol as the carbon source, and were supplemented with uridine and arginine. To obtain spores, 20 spores per mm2 were plated onto complete medium plates (25), incubated for 5 days at 30°C, and allowed to mature at 4°C for 24 h. The spore suspension was washed and stored at 4°C until use.
Fermentation.
Four 2.5-liter glass fermentors (Applikon) with 2.2 liters of minimal medium were kept at a constant temperature of 30 ± 0.5°C, while fermentor headplates were kept at 8°C. A total of 1.0 x 106 spores per ml were added to the fermentor. During germination, each fermentor was aerated through the headspace (50 liters/h) and stirred at 300 rpm. When dissolved oxygen levels dropped below 60% for over 5 min, the stirrer speed was set to 750 rpm, and aeration was switched to sparger inlet. This switching time point was defined as T equals 0 h. Each fermentor was induced with either 0.1 mM sorbitol or D-xylose at T equals 14 h. Samples were snap-frozen in liquid nitrogen directly after filtration, with less than 20 s between sampling and snap-freezing.
RNA isolation.
Frozen mycelium was ground for 40 s using a dismembrator (Braun Melsungen). A Trizol-chloroform extraction preceded total RNA extraction with RNeasy minicolumns (Qiagen), according to the manufacturer's protocol for yeast. RNA integrity was assessed with an Experion system (Bio-Rad) by visual inspection of the electropherograms. Graphs depicting RNA integrity categories were used as visual aids (28). Electropherograms approximating an RNA integrity number of 8 or lower or with a 28S/18S ratio below 1.8 were discarded.
qPCR measurements.
Variation in transcript levels was determined for seven A. niger genes and a synthetic control RNA transcript (Table 1). This synthetic control RNA transcript, a bacterial kanamycin synthetase-encoding gene fused to a eukaryotic poly(A) tail (Promega), is spiked to total RNA prior to cDNA synthesis and can correct for various efficiencies of reverse transcription or PCR itself (12). The first four genes of Table 1 were used as endogenous reference genes. These A. niger reference genes showed little variation in transcript levels on more than 100 microarrays that were run in our laboratory prior to this study (D. van der Veen, J. M. Oliveira, E. S. Martens-Uzunova, and L. H. de Graaff, unpublished data) and were selected using the method suggested by Lee and coworkers (18). No elevated expression levels are expected for these genes (10, 16, 19, 22). Expression levels for malate synthase, whose expression is not influenced by addition of D-xylose, were also measured. Finally, the transcriptional response upon the addition of D-xylose was measured by determining the transcript levels of two genes, xlnB and xlnD. Primers were designed using the Primer3 web interface (26) and are given in Table 1. Primer design criteria were as follows: length, 20 to 22 bp; melting temperature, 60 ± 1°C; and GC content, 50% ± 5%. Amplicons ranged from 125 to 150 bp and had a melting temperature of 80 ± 5°C.
|
View this table: [in a new window] |
TABLE 1. Primers used in this study
|
A Rotor-Gene 3000 qPCR machine (Corbett Life Science) was used for thermocycling under the following conditions: 15 min at 95°C, followed by 40 amplification cycles (denaturation, 15 s at 95°C; annealing, 15 s at 58°C; elongation, 20 s at 72°C). For each single run, nontemplate control samples for every primer pair used in that run were included. After the amplification cycles, a melting step was performed. Quality control was done by inspection of the melting curve, and samples with significant primer-dimer formation were not considered for analysis. The cycle threshold value and amplification efficiency were determined with the Rotor-Gene software using the comparative quantification method (the cycle threshold value corresponds to the Rotor-Gene software "take-off" value). The relative expression ratio of gene expression was calculated by following Pfaffl (24) as follows: ratio = (Egene)
Ct (pre – post)/(Ekana)
Ct (control – sample). In this formula, Ekana denotes the amplification efficiency of the synthetic kanamycin transcript, and Egene denotes that of the gene for which the ratio is determined.
Ct denotes the cycle threshold difference between pre- and postinduced fermentor samples for the gene and kanamycin transcripts.
Microarray processing.
cDNA and cRNA synthesis and labeling and array hybridization were performed by following the Affymetrix users' manual (1) using the one-cycle target labeling and control reagent kit and starting with 5 µg of total RNA as template material. Fifteen micrograms of fragmented and labeled cRNA was hybridized to custom-made Aspergillus niger arrays at 45°C for 16 h. Washing and staining were done using the hybridization, wash, and stain kit (Affymetrix), using a GeneChip FS-450 fluidics station and an Agilent G2500A gene array scanner. Scanned images were converted into .CEL files using Microarray Suite version 5 software (Affymetrix).
Data analysis.
For the 1,920 qPCR measurements obtained from the 5-week fermentor experiment, variance components were calculated by restricted maximum likelihood (REML) variance components analysis (REML sparse algorithm with average information optimization) (23) using GenStat 9.2 software (VSN International). Per gene, three REML analyses were run using each gene's cycle threshold, amplification efficiency, and expression ratio values as response variates. A random model, yw.f.b.r.d.q.s = µ +
week +
w.fermentor +
w.f.biomass +
w.f.b.RNA +
w.f.b.r.cDNA +
w.f.b.r.c.qPCR, was applied, using
w.f.b.r.c.qPCR as the residual term (subscripts are abbreviated after first usage; e.g.,
w.f.biomass is
week.fermentor.biomass).
For microarray data analysis, .CEL files of the individual array images were imported into GeneSpring 7.3 (Agilent Technologies) using its robust multichip average (RMA) preprocessor to obtain RMA-normalized signal values for all arrays (14). Probe sets with an RMA-normalized signal below 37.7—three times the lowest value detected—on all arrays were discarded, leaving 9,320 probe sets (64%). In comparison, when using the Affymetrix MAS 5.1 software-derived flag calls, an average of 5,948 probe sets (41%) are called "present" per array. For the six microarrays used in this study, statistically significant differentially expressed genes were determined using the limma package (29). A Student's two-sample t test between the sorbitol and D-xylose arrays was executed, using empirical Bayesian statistics to moderate within-gene standard errors, Benjamini and Hochberg's "false discovery rate" to correct for multiple testing (3), and an adjusted P value of <0.05 to discriminate between differentially expressed genes. To check for the influence of unequal sample size bias, testing was recalculated with the inclusion of four additional sorbitol-grown samples derived from cultures grown identically (our unpublished data), giving similar results.
Microarray data accession number.
Raw and RMA-normalized array data were deposited in NCBI's GEO database (9) under series entry GSE11405.
|
|
|---|
The observed transcript levels for xlnB and xlnD are given in Fig. 1. A D-xylose concentration of 0.1 mM is able to induce both genes, with an expression ratio for xlnD of 190-fold at 60 min after induction. This lowest concentration of 0.1 mM D-xylose also discriminates best between noninduced and induced states: for this concentration, transcript levels increase for 60 min and return to preinduced levels in the next hour. For both the 1-mM and 50-mM concentrations, elevated transcript levels for xlnB and xlnD could be detected up to 3 hours after induction. We decided to induce fermentor cultures with 0.1 mM D-xylose and to sample them 1 h after induction.
![]() View larger version (17K): [in a new window] |
FIG. 1. Expression ratios of xlnB and xlnD. The expression ratios of the xlnB and xlnD genes measured after induction with either 0.1 (black), 1 (dark gray), or 50 (light gray) mM of D-xylose. The sampling time in minutes is presented on the horizontal axis. No transcript levels were determined for the 50-mM fermentations for both genes at 30 min. At 60 min, a bad qPCR run prevented xlnD ratio calculation.
|
![]() View larger version (27K): [in a new window] |
FIG. 2. Experimental design. Schematic representation of hierarchical experimental design used. In a 5-week period (gray boxes), four fermentors were run. Three fermentors were induced to 0.1 mM D-xylose, and the fourth fermentor was induced to 0.1 mM sorbitol. Each fermentation run was sampled twice, and from each mycelial sample, total RNA was extracted twice independently. The total RNA obtained was split in two, and independent cDNA synthesis reactions were performed. Each cDNA sample that was made was used to measure eight genes in duplicate using qPCR. The gray arrow indicates an identical processing step.
|
|
View this table: [in a new window] |
TABLE 2. Descriptive statistics
|
|
View this table: [in a new window] |
TABLE 3. Relative variance components estimatesa
|
|
View this table: [in a new window] |
TABLE 4. RNA sample properties
|
![]() View larger version (24K): [in a new window] |
FIG. 3. Technical variation between samples. Scatter plot of microarray data. (A) Two RNA samples of sample 86 (86-1 and 86-2) were processed independently and hybridized to two Affymetrix arrays. The RMA-normalized data of all 14,554 probe sets are plotted in a scatter plot. (B) Total RNA from two independent noninduced control fermentations (sorbitol), 96 and 52, are hybridized, and RMA normalized-data are plotted. The lines above and below the diagonal line indicate twofold difference relative to the diagonal line.
|
|
View this table: [in a new window] |
TABLE 5. Differentially expressed genes on D-xylose induction
|
|
|
|---|
Important parameters for high-quality qPCR measurements are the specificity of a primer pair for its target, amplification efficiency, and reproducibility (5, 20). Typically, when a newly designed primer pair is tested prior to use in quantification, a range of different annealing temperatures or primer concentrations is examined to identify conditions where only a single DNA fragment is amplified. In this study, a multitude of target genes were measured using a single PCR profile and primer concentration. The possibility that this approach affects the robustness of our qPCR measurements was assessed by the examination of the variation that is introduced in the amplification efficiency values. Within-gene variation is minimal, even when the amplification efficiency per gene varies. The most variable amplification efficiency values are for gene An02g04120, with a mean value of 1.72 and a standard deviation of 0.04 (n = 326). The reproducibility of this generalized method can be also derived from the analysis of variance while looking at the variance component titled "qPCR measurement" (Table 3), which includes pipetting and the actual measurement. The median relative contribution is 6.1% of the total variation (Table 3). Transcript An02g04120 again shows the highest variation, 20.1%, in this step. When this percentage is placed in its biological context, 95% of all cycle threshold values will range between 21.3 ± 0.9, which translates to 1.6-fold differences. We conclude that our standardized qPCR method is both precise and accurate.
Before the actual determination of the sources of variation in our experiment, we hypothesized that most variation is introduced by differences in the day-to-day handling and processing. The next largest variation was anticipated to be the use of the individual fermentor vessels. The results of the analysis of variance components comply with our initial hypothesis: day-to-day variation contributes to about 70% of the total variation (Table 3). This step includes the growth and harvesting of spores, the preparation of fermentation media, and the assembly of fermentors. The large effect that the day-to-day variation has on the total variation can effectively be excluded by examination of the expression ratio. This ratio presents the relative change in a gene's transcript level between preinduced and postinduced fermentation conditions. As this ratio is calculated from expression data of samples taken from the same fermentor, this result effectively cancels out the contribution of day-to-day variation. When fermentor cultures are compared by this gene expression ratio, the analysis of variance components of the ratio-derived data shows that the three steps of fermentation, cDNA synthesis, and actual qPCR measurement each contribute about equally in the case of the four endogenous reference genes (Table 3).
The effect that the individual fermentation vessels have on the variation in transcript level is 60% and 80% for the two D-xylose-induced genes, xlnB and xlnD, respectively. For the malate synthase-encoding gene, the effect is 75%. Different amounts of fungal cells present in a fermentor cannot explain this effect, as only a weak correlation of cycle threshold values with the biomass concentration is found (i.e., for xlnB, R2 = 0.35). One explanation is that small differences between fermentor headplates and vessels result in unique mixture characteristics for each fermentor. These fermentor-specific effects are reflected in small physiological differences between cultures, which may account for the observed differences in transcript levels. Since samples are taken 1 h after induction with D-xylose, such differences may affect the actual D-xylose concentration per fermentor, leading to a high reproducibility of a gene's expression within a fermentor but variation of its expression between fermentors. For example, xlnD induction is on average 240-fold higher for fermentors with headplate "2" but only 100-fold higher for fermentors with headplate "4."
D-Xylose-induced genes assessed by DNA microarray analysis.
The qPCR measurements clearly showed elevated transcript levels of xlnB and xlnD after induction with D-xylose. For xlnB, the average increase for all 15 fermentations was by 16-fold compared to that of noninduced conditions, while for xlnD, the average increase was by 138-fold. For the six DNA microarrays analyzed, the average increase is by 6-fold for xlnB and by 110-fold for xlnD after induction with D-xylose. The results for a gene measured by both methods are in good agreement; for instance, the Spearman rank correlation coefficient for xlnD measurements obtained by qPCR and microarray technologies is 0.90.
Comparison of the noninduced and D-xylose-induced samples that were hybridized onto microarrays revealed 24 genes that are statistically differentially expressed after induction with D-xylose (Table 5). Degradation of complex polysaccharide substrates starts with the uptake of signal molecules such as D-xylose that activate specific induction pathways, resulting in the expression and secretion of enzymes necessary to degrade and metabolize the polysaccharide. However, since D-xylose is rarely found by itself under natural conditions, it is likely that A. niger interprets the presence of D-xylose as proxy for the availability of complex carbohydrate polymers, such as (hemi)cellulose. Not only is this heterogenic response reflected in the induction of secreted enzymes but also in the activation of multiple metabolic pathways (Fig. 4). For instance, genes encoding the second step in L-arabinose metabolism and enzymes of the classical D-galactose catabolic route are significantly upregulated as well.
![]() View larger version (14K): [in a new window] |
FIG. 4. Enzymes induced by D-xylose. Schematic of the enzymes encoded by significantly differentially expressed genes after induction by D-xylose. (Top) Extracellular enzymes acting on complex polysaccharides, arabinoxylan, and cellulose. (Middle) Sugar transporters. (Bottom) Metabolic routes of the degradation of D-xylose, L-arabinose, and D-galactose. The enzymes encoded by significantly differentially expressed genes are indicated by stars.
|
The transcriptional regulator XlnR activates the transcription of (hemi)cellulose-degrading enzymes (34) as well as the transcription of genes encoding metabolic enzymes (34, 33). A cis-acting element in the promoter region of D-xylose-induced genes, described as 5'-GGCTAAA-3' (34) and proposed to be generalized to 5'-GGNTAAA-3' after comparative transcriptome analysis of three Aspergillus species (2), was detected in all but five statistically significant differentially expressed genes (Table 5), and also, the GAL1p ortholog An16g04160 has this consensus motif present 153 bp upstream in its promoter region.
In this and other studies (2), it is observed that nonstrictly D-xylose metabolism-related genes have elevated transcript levels upon D-xylose induction. The most likely explanation for this is that the fungus fine-tunes its response toward the diversity of complex substrates it encounters by coordinated action of partly overlapping regulatory systems. Expression of the XlnR-controlled ferulic acid esterase A is greater when A. niger is induced with both D-xylose and ferulic acid relative to induction by D-xylose or ferulic acid alone (7). Aromatic compounds such as ferulic acid are not only part of arabinoxylan but are also part of pectin. No XlnR-related motif is present in the promoter region of the polysaccharide-degrading enzymes acting on cellulose, BglA, and EglA, nor is that present in the promoter region of one of the uncharacterized family 30 glycosyl hydrolases. The induction of these genes in the absence of the consensus motif suggests the action of other transcriptional activators besides the xylanolytic activator XlnR. Interestingly, one of the genes of unclear function encodes a hypothetical transcription regulatory protein, and its expression patterns correlate strongly with the three genes of the Leloir pathway.
Andersen and coworkers have described a conserved set of 23 genes for which transcription is elevated on D-xylose medium (2). Nine of these D-xylose-responsive genes are not present in our gene list, including three genes encoding sugar transporters, three glycosyl hydrolases, and three metabolic enzyme-encoding genes (encoding an aldose 1-epimerase, a short-chain dehydrogenase, and an aldehyde reductase). The difference between the two gene lists can be explained by the experimental approach chosen: Andersen and coworkers have grown A. niger with either D-glucose or D-xylose as the sole carbon source and have sampled at D-xylose levels around 12 mM. In this study, sorbitol-grown cultures were induced with minute concentrations of D-xylose only.
In conclusion, the work presented in this study has resulted in an improved method for the generation of high-quality qPCR and microarray data from fermentations of the filamentous fungus Aspergillus niger. The decreased variation improves data quality and eases the use of data analysis, which is an essential prerequisite to study transcript profiling and gene regulation.
This work provides new insights into the mechanisms following D-xylose induction. The data link xylose metabolism not only to L-arabinose metabolism but also to D-galactose metabolism.
Published ahead of print on 20 February 2009. ![]()
Supplemental material for this article may be found at http://aem.asm.org/. ![]()
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»