Environmental Sciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831,1 Department of Microbiology, Miami University, Oxford, Ohio 450562
Received 22 October 2004/ Accepted 23 March 2005
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Several previous studies indicated that a single 50-mer (11, 21, 29), 60-mer (9, 24), or 70-mer (2, 7) oligonucleotide probe per gene could produce hybridization signals comparable to those obtained with PCR amplicon arrays. Kane et al. (11) demonstrated that the sensitivity of microarrays was not substantially different for PCR probes (322 to 393 bases) and 50-mer oligonucleotide probes. Both probe types could reproducibly detect approximately 10 gene copies per cell and discern threefold changes in cDNA levels, but only three genes were targeted by three PCR probes and six oligonucleotide probes (two for each gene). Also, the PCR probes were 320 to 390 bp long, and it is not known how oligonucleotide probes compare to longer PCR probes (e.g., the 889- to 995-bp probes used in this study). In addition, various studies reported previously in the literature used oligonucleotide probes that ranged from 20 to 70 nt long, but a direct comparison of the performance of oligonucleotide probes of different lengths has not been reported previously.
In this study, microarrays that contained oligonucleotide probes of different lengths were compared in order to determine a better alternative to PCR amplicon microarrays. We systematically evaluated the hybridization conditions for oligonucleotide microarrays in terms of overall hybridization signal intensity, numbers of genes detected, specificity, sensitivity, and gene expression differentiation. The probes of different lengths were derived from the same 96 ORFs from Shewanella oneidensis MR-1. Our results indicated that the performance of the 70-mer probes was most comparable to the performance of PCR probes under the hybridization conditions used (45°C and 50% formamide), and thus a 70-mer oligonucleotide array is an alternative to a PCR amplicon microarray, although the sensitivity is lower than that of an array with PCR probes.
| MATERIALS AND METHODS |
|---|
|
|
|---|
75%. For genes that had a level of identity to other genes of >75%, a maximum internal region that had <75% identity was selected as a probe, or the cutoff value was increased to 85%. All 96 primer pairs were synthesized and then arrayed in 96-well plates by MWG Biotech Inc. (High Point, NC). The information about 96 S. oneidensis MR-1 ORFs, the PCR primer pairs, and the sizes of PCR products is summarized in Table S1 in the supplemental material. Each gene was amplified eight times in a 96-well plate. The amplified PCR products were pooled and purified using a QIAquick PCR purification kit (QIAGEN Inc., California) according to the protocol of the manufacturer. The concentrations of pooled PCR products were 200 to 500 ng/µl. The purified PCR fragments were visualized and checked for size by agarose gel electrophoresis in the presence of ethidium bromide.
Oligonucleotide probe design.
Oligonucleotide probes (30-, 40-, 50-, 60-, and 70-mers) for the 96 ORFs were designed using a modified version of the PRIMEGENS software (35). Each of the PCR-amplified DNA probe sequences selected was compared with the entire sequence database using BLAST (1) and was aligned with other sequences that exhibited more than 85% identity using dynamic programming. Based on the global optimal alignments, segments exhibiting <85% nucleotide identity to the corresponding aligned regions of any BLAST hit sequences were selected as potential probes. Among the segments identified as potential probes, one probe for each gene was selected based on the G+C content, melting temperature (Tm), and lack of self-complementary. The information about the 70-mer oligonucleotide sequences, melting temperatures, and G+C contents are summarized in Table S2 in the supplemental material. For comparison, the longer probe sequences contained the sequences of shorter probes. All of the oligonucleotides designed were commercially synthesized without modification by MWG Biotech Inc. (High Point, NC). The concentration of oligonucleotides was adjusted to 100 pmol/µl.
Microarray construction.
PCR probes and oligonucleotide probes prepared in 50% dimethyl sulfoxide (Sigma Chemical Co., Missouri) were spotted onto SuperAmine glass slides (Telechem International, California) using a PixSys 5500 robotic printer (Cartesian Technologies Inc., California). For each gene there were two replicates on a single slide. The PCR-oligonucleotide microarrays contained PCR products and 70-, 60-, 50-, 40-, and 30-mer oligonucleotides, and in total, there were 1,152 spots on a single slide. UV cross-linking (300 mJ) and washing were carried out according to the protocol of the manufacturer (Telechem International, California).
Preparation of PCR and artificial targets.
PCR targets were prepared as described above and labeled with random primers and the Klenow fragment (genomic DNA [gDNA] labeling methods). Different amounts of a PCR target mixture were used according to the experimental needs. Four artificial oligonucleotide targets (T1-SO1679, T2-SO1744, T3-SO2680, and T4-SO0848) that were complementary to the 70-mer perfect match oligonucleotide probes were synthesized at the Molecular Structure Facility at Michigan State University (East Lansing, MI). The oligonucleotides were labeled at the 5' end with Cy5 (T1-SO1679, T2-SO1744, and T3-SO2680) or Cy3 (T4-SO0848) fluorescent dye during synthesis. The different labeled oligonucleotide targets were normalized based on fluorescent dye concentration and mixed at equal ratios.
S. oneidensis MR-1 growth.
For general extraction of genomic DNA and RNA, S. oneidensis MR-1 cells were grown in LB medium at 30°C overnight in a shaker (180 rpm) and collected by centrifugation at 3,500 x g for 5 min. For the pH treatment experiment, S. oneidensis MR-1 cells were grown in LB medium at 30°C overnight in a shaker (180 rpm), and 1.0 ml of the cells was inoculated into 250 ml of new LB medium (pH 7) and grown for 5 h (at the mid-logarithmic phase with an optical density of
0.6). Cells were centrifuged and resuspended in 25 ml of LB medium at pH 4.0, 7.0 (control), or 10.0. There were three replicates for each pH treatment. Cultures (10 ml) were sampled at 30 min and 60 min, and cells were harvested by centrifugation at 3,500 x g for 5 min. Cell pellets were immediately frozen in liquid nitrogen and stored at 80°C until RNA was extracted.
Genomic DNA extraction, purification, and labeling.
Genomic DNA was isolated and purified from S. oneidensis MR-1 as described previously (37). The purified genomic DNA was fluorescently labeled by random priming using the Klenow fragment of DNA polymerase. Thirty-five microliters of mixture I containing 500 ng of genomic DNA and 20 µl of random primers (Invitrogen Life Technologies, California) was heated at 98°C for 3 to 5 min, quickly cooled on ice, and then centrifuged. Fifteen microliters of mixture II containing 1 µl of a solution containing 5 mM dATP, 5 mM dGTP, 5 mM dTTP, and 2.5 mM dCTP, 2 µl (80 U) of the Klenow fragment (Invitrogen Life Technologies, California), and 0.5 µl of Cy3-dUTP fluorescent dye (Amersham BioSciences, United Kingdom) was added to mixture I. A 50-µl (total volume) labeling reaction solution was incubated for 3 h at 42°C. The labeling reaction was terminated by heating the solution at 98°C for 3 min. The tubes were removed and quickly placed on ice. After a quick centrifugation, the sample was hydrolyzed in 50 mM NaOH at 37°C for 10 min and then neutralized with the same amount of HCl. The labeled cDNA probe was purified immediately using a QIAquick PCR purification column and was concentrated in a Savant Speedvac centrifuge (Savant Instruments Inc., Holbrook, NY).
RNA extraction, purification, and labeling.
Total cellular RNA was isolated using TRIzol reagent (Invitrogen Life Technologies, California). RNA samples were treated with RNase-free DNase I (Ambion, Austin, TX) and were purified using a Mini RNeasy kit (QIAGEN, Chatsworth, CA). The concentration and purity of RNA samples were estimated with a spectrophotometer using A260/A230 and A260/A280 ratios of >1.85, as well as by agarose gel electrophoresis. Total cellular RNA (10 µg) was incubated at 70°C for 10 min in the presence of 10 µg of random primers (Invitrogen Life Technologies, California). The labeling reaction was catalyzed by 200 U of Superscript II RNase H reverse transcriptase (Invitrogen Life Technologies, California) in the presence of 500 µM dATP, 500 µM dGTP, 500 µM dCTP, 25 µM dTTP, and the fluorophor Cy5-dUTP or Cy3-dUTP (Amersham BioSciences, United Kingdom) at a concentration of 25 µM. The reverse transcription reaction was allowed to proceed for 2 h at 42°C, and this was followed by RNA hydrolysis in 1 N NaOH at 37°C for 10 min. The labeled cDNA probe was purified immediately using a QIAquick PCR purification column and was concentrated in a Savant Speedvac centrifuge (Savant Instruments Inc., Holbrook, NY). For each biological sample, three slides were used. In addition to the duplicates of arrays on the same slide, three biological cell samples produced a total of 18 possible spots for each gene.
Microarray hybridization, washing, and scanning.
For determination of the overall hybridization signals, sensitivity, and the number of detected genes, a gDNA or RNA sample labeled with a single dye was used. For the pH stress study, the dye-swap method was used. Hybridization was performed using three replicates, with each slide containing two replicates on microarrays so that for each gene there was a total of six data points. The microarrays were hybridized at 45°C overnight in the presence of 50% formamide. The labeled cDNAs were resuspended in 20 to 25 µl of hybridization solution that contained 50% formamide, 1 mM dithiothreitol, 3x SSC (1x SSC is 0.15 M NaCl plus 0.015 M sodium citrate), 0.3% sodium dodecyl sulfate (SDS), and 0.8 µg/µl of herring sperm DNA (Invitrogen Life Technologies, California). The sample was incubated at 98°C for 5 min, centrifuged to collect condensation, and kept at 50 to 60°C. The sample was immediately applied to a microarray slide, and hybridization was carried out in a waterproof Corning hybridization chamber (Corning Life Science, New York) submerged in a 45°C water bath in the dark for 16 h. After hybridization, the coverslips were immediately removed, and the slides were washed in a buffer containing 1x SSC and 0.1% SDS for 5 min at 37°C. The microarrays were washed in a new buffer with 0.1x SSC and 0.1% SDS for another 5 min at room temperature. Finally, the microarrays were washed with distilled water for 30 s at room temperature and dried with compressed air or by centrifugation. Microarrays were scanned using the ScanArray 5000 microarray analysis system (Packard BioChip Technologies, Massachusetts). Normally, 95 to 100% laser power and 70 to 80% photomultiplier tube efficiency were selected for scanning.
Data normalization and analysis.
To determine signal intensities for each spot, 16-bit TIFF scanned images were analyzed using the software ImaGene 5.5 (Biodiscovery Inc., California). Prior to normalization, negative spots, empty spots, and poor spots flagged by ImaGene were removed in Excel (Microsoft). Any spots with values that were >2 standard deviations above the local background level in both channels were rejected. The resulting data files were loaded onto GeneSpring, version 5.1 (Silicon Genetics, Redwood, CA), and channel normalization was accomplished by the software with the Lowess method. The Cy5/Cy3 ratio was transformed to a logarithmic (base 2) value, and a gene was considered differentially expressed if the logarithmic value was greater than 1 (up-regulated) or less than 1 (down-regulated). The signal-to-noise ratio (SNR) was also computed for each spot to discriminate true signals from noise. The SNR ratio was calculated as follows: SNR = (signal mean background mean)/(background standard deviation). A commonly accepted criterion for the minimum signal (threshold) that can be accurately quantified is an SNR of >3.0 (31).
| RESULTS |
|---|
|
|
|---|
|
|
As expected, the number of spots detected decreased when labeled cDNA was used as the target to hybridize with the array (Fig. 2B) because not all ORFs were expressed. At 42°C, 45°C, 50°C, and 60°C, the PCR probes detected 65, 69, 72, and 17 ORFs, respectively, and the 70-mer probes detected 48, 53, 49, and 1 ORFs, respectively. A total of 29 to 39 ORFs were detected by the 60-mer probes and 10 to 23 ORFs were detected by the 50-mer probes at temperatures from 42°C to 50°C. The number of ORFs detected was not dramatically affected by hybridization temperatures ranging from 42°C to 50°C, but only a few ORFs (17 ORFs for PCR probes and 0 to 2 ORFs for oligonucleotide probes) could be detected at 60°C (Fig. 2B). The 40-mer and 30-mer probes did not have substantial signal intensities (Fig. 1) or gene coverage (Fig. 2). The results suggested that the optimal temperatures were 45°C for oligonucleotide probes longer than 50-mer and 50°C for PCR probes, although hybridization differences between 45°C and 50°C were not observed in each case.
Specificity.
In order to determine whether the signals detected were specific at 45°C, PCR-amplified targets or synthesized oligonucleotide targets of four loci (SO1679, SO1744, SO2680, and SO0848) were used for hybridization. Five picograms of PCR target per locus was hybridized with the array at 45°C in the presence of 50% formamide. All four targets were specifically detected by the PCR probes corresponding to the four loci, and no substantial signals were observed with other PCR probes at an SNR of >3.0 (data not shown).
To further evaluate the specificity of oligonucleotide probes, artificial targets of the 70-mer probes were synthesized. Three targets (T1-SO1679, T2-SO1744, and T3-SO2680) were end labeled with Cy5, and one target (T4-SO0848) was end labeled with Cy3. Each target (2.0 pg) was mixed and hybridized with the array at 45°C in the presence of 50% formamide. The results showed that all four targets specifically hybridized with the corresponding oligonucleotide probes (40-mer to 70-mer). The 30-mer oligonucleotide probes detected only two of the four targets at an SNR of >3.0. The SNR values for all other loci varied from 0 to 1.0 (data not shown). The data indicated that all signals detected by the PCR or oligonucleotide probes were specific under the conditions tested (45°C and 50% formamide).
Sensitivity of oligonucleotide arrays compared to sensitivity of the DNA array.
To assess sensitivity, different amounts (0.2, 1.0, 5.0, 25, 100, and 500 ng) of gDNA from S. oneidensis MR-1 were labeled with the Cy3 dye and hybridized with the array at 45°C in the presence of 50% formamide. The signal intensities detected by the PCR probes increased as the gDNA concentration increased (0.2 to 5 ng) and did not change dramatically from 25 to 500 ng (Fig. 3A). The 70-mer probes gave signal intensities similar to those observed with the PCR probes at relatively high concentrations of gDNA (100 and 500 ng), but decreased signals were detected at low target concentrations (0.2 to 25 ng). The 70-mer oligonucleotides had two- to threefold-stronger signals than the 60-mer and 50-mer oligonucleotide probes (Fig. 3A).
|
For simplicity and quantitative comparison, the sensitivity of the PCR-oligonucleotide microarray was evaluated by measuring the minimum amount of targets (gDNA or PCR products) required for detecting more than 50% of the ORFs with an SNR of >3.0. Based on this working definition, the numbers of ORFs detected (of a total of 96 ORFs) at an SNR of >3.0 are shown in Table 1 for different probes hybridized with different concentrations of gDNA. The sensitivity of the PCR probes was fivefold higher than that of the 70-mer probes, which was fourfold higher than that of the 60-mer and 50-mer oligonucleotide probes.
|
|
|
|
|
1.2-fold) when cells were exposed to pH 4.0 for 30 min, but it did significantly increase approximately threefold after the 60-min treatment. SO2743, one of the most significantly up-regulated loci during the pH 4.0 exposure, is predicted to encode an acetyl-coenzyme A synthetase (EC 6.2.1.1), which plays an important role in primary metabolism. SO0848 is predicted to encode a periplasmic nitrate reductase (EC 1.7.99.4), and it was significantly down-regulated at low or high pH. | DISCUSSION |
|---|
|
|
|---|
Previous experiments of Relógio et al. (20) indicated that 60-mer oligonucleotide probes provided significantly improved (10-fold) sensitivity compared with 25-mer oligonucleotide probes, but the specificity was significantly lower (
fourfold) than that obtained with 25-mer oligonucleotide probes. In this study, the signal intensities detected by 70-mer probes were significantly higher than those detected by 50-mer probes when labeled gDNA or cDNA templates were used as targets. The results suggested that the 70-mer oligonucleotide probes achieved sensitivity that was closest to that of the PCR probes and that the 50-mer and 60-mer oligonucleotide probes were able to provide reasonably good sensitivity (20-fold lower) compared to the sensitivity of the PCR probes. The data also indicated that 40-mer or 30-mer probes were not suitable for construction of oligonucleotide arrays using the methods described here when they were compared to 50-mer, 60-mer, or 70-mer probes.
Sensitivity is a critical parameter, particularly for environmental studies, when biomass can be low, or for low-level molecules in individual cells. When PCR-generated functional gene arrays were used, the detection limit for nirS (nitrite reductase) genes was approximately 1 ng of pure gDNA and 25 ng of soil community DNA (34). A previous study demonstrated that when oligonucleotide arrays with capture and detector probes were used, the detection sensitivity for Geobacter chapellei small-subunit rRNA gene sequences in soil extracts was approximately 500 ng of total RNA (26). Recently, studies with 50-mer functional gene arrays showed that the detection limit for some functional genes could be 5 to 10 ng of pure gDNA and 50 to 100 ng in a mixture of gDNA from different organisms (21, 29).
As expected, the PCR-generated probes had the best sensitivity, but the 70-mer probes had fourfold-greater sensitivity than the 60-mer and 50-mer probes (25 ng versus 100 ng). These results were in general agreement with the previous observations (21, 26, 29, 34) when the types of probes and samples were considered.
In order to determine the sensitivity of PCR probes for individual targets, the same array was used to hybridize with a mixture of four PCR targets generated from four loci (SO1679, SO1744, SO2680, and SO0848). The results showed that three loci (SO1744, SO2680, and SO0848) could be detected at a level of 0.05 pg of PCR target per locus and one locus (SO1679) could be detected at a level of 1.0 pg by the four PCR-generated probes, suggesting that the sensitivities of individual genes can differ to a large degree (data not shown). This gene-dependent phenomenon has been observed previously with 50-mer oligonucleotide arrays in bioremediation studies (21), but it has not been understood. Due to the gene-dependent nature, it is helpful to distinguish between microarray sensitivity and probe sensitivity. We define probe sensitivity as the minimum amount of target required for individual probes to obtain reproducible signals at a defined SNR threshold (e.g., 3.0). In the example described above, the probe sensitivities for SO1744 and SO1679 were 0.05 and 1.0 pg of PCR products, which are equivalent to 0.5 and 10 ng of gDNA, respectively. This may be a very important consideration for the assessment of individual genes in environmental studies. We define microarray sensitivity as the minimum amount of target required for a defined number of genes (e.g., 50%) to give reproducible signals at a defined SNR (e.g., 3.0). By this definition, the PCR, 70-mer, 60-mer, and 50-mer probes had sensitivities of 5.0, 25, 100, and 100 ng of gDNA, respectively. Microarray sensitivity may be more applicable to whole-genome arrays because the number of genes is generally known and most changes in transcript levels are expected to be detected.
Microarray hybridization and washing conditions greatly affect microarray sensitivity and specificity, so experimental conditions need to be optimized. For example, sensitivity can be increased by destroying stable secondary structures of probes and enhancing duplex formation (25). Specificity may be increased by preventing interactions between probes and noncomplementary targets and/or preventing nonspecific binding (25). Increased hybridization temperatures may improve the specificity of oligonucleotide probes, but the sensitivity may decrease substantially, especially for shorter oligonucleotides. In our experiments, all hybridizations were performed at 45°C in the presence of 50% formamide, and washing was at 37°C for 5 min.
Long oligonucleotide probes (50-mer to 70-mer) showed power for differentiating gene expression patterns under experimental conditions similar to that of PCR probes, but some differences in the expression patterns were also observed for the different probes (Table 3). Differences in sensitivity between PCR and oligonucleotide probes might explain this observation. For example, the mRNA levels of genes that showed different patterns were low and were just above the sensitivity level for PCR probes but below the sensitivity level of 70-mer, 60-mer, and 50-mer probes. The hybridization behavior of some ORFs listed in Table 3, such as SO0105, SO4180, SO2320, SO4661, SO2389, SO2953, SO2113, SO2654 and SO2930, could be explained by sensitivities that decrease in the following order: PCR, 70-mer, 60-mer, and 50-mer probes.
It is still difficult to explain why some ORFs (i.e., SO0017, SO1383, SO2743, SO3051, SO2697, SO1679, SO1157, SO1277, and SO2886) were detected by a shorter oligonucleotide probe instead of a longer oligonucleotide probe. A possible explanation could be related to probe or probe-target interactions on the microarray surface. Differences in signal intensity for individual probes are complicated by many factors, such as steric hindrance on the microarray surface (23) and stable secondary structures (25). To examine if any of the oligonucleotide probes shown in Table 3 have strong secondary structures, the oligonucleotides were checked with the MFOLD software (39). The predictions indicated that significant secondary structures were not present at 60°C (approximately equivalent to the conditions at 45°C with 50% formamide). As mentioned above, the Tm and G+C content may affect hybridization signals to some degree. For the 70-mer oligonucleotides in Table 3, the G+C contents and Tm values for SO0105 (60% and 80°C) and SO0017 (52% and 78°C) are higher and the G+C contents and Tm values for SO4180 (28% and 68°C), SO2697 (31% and 69°C), and SO1383 (35% and 70°C) are lower than the G+C contents and Tm values for other 70-mer probes (averages, 45% and 75°C). When the expression levels across a whole genome are analyzed, some individual probes may have to be reevaluated. Additional work is needed to discern the binding behavior for DNA molecules on glass surfaces and how these parameters can be incorporated into appropriate software algorithms.
Although detecting the responses to pH stress was not the focus of this study, some of the changes detected appear to be biologically meaningful for possible responses to pH stress. For example, in E. coli, the product of SO2743 is thought to be involved in acetate utilization as cells enter the stationary phase of growth (12). Perhaps the induction of a similar protein allows the consumption of acid in response to an acid shock in S. oneidensis MR-1. Another example is SO0848 encoding a periplasmic nitrate reductase, and the enzyme may be repressed by ammonium produced by degradation of some amino acids under pH stress conditions (27). The observations described above indicated that the differential gene expression data were correlated with physiological roles of S. oneidensis cells in response to pH stress and that changes in the expression patterns of the biologically relevant ORFs were detected with both the PCR and longer oligonucleotide probes.
Based on the concentrations of PCR and oligonucleotide probes, the copy number of oligonucleotide probes should be 20 to 500 times higher than that of PCR probes. This allows oligonucleotide probes to have a wider dynamic range than PCR probes so that different levels of RNA can be detected. If the amount of probes binding to the glass slide was saturated under our test conditions, as shown by Relógio (20), the copy number of the oligonucleotide probes was still more than 10 times higher than that of the PCR probes. Owing to the low sensitivity, oligonucleotide arrays may be useful for detecting RNA species at a high levels that may be saturated on PCR amplicon arrays.
The G+C content and Tm of an oligonucleotide are interesting. An oligonucleotide design program should constrict Tm values in a narrow range (e.g., 5°C), and it should also consider the overall G+C content of the whole genome. For example, the S. oneidensis MR-1 genome has a G+C content of 54%, and an oligonucleotide probe should have a G+C content close to the genome G+C content. In this study, the results showed that some oligonucleotides with lower G+C contents and Tm values did have low hybridization signals. Therefore, when probes for a genome with a substantially decreased G+C content are designed, the probe-target relationship may have to be reevaluated if stretches of more evenly distributed G+C content are not available.
| ACKNOWLEDGMENTS |
|---|
This research was supported by the United States Department of Energy under the Genomics:GTL, Microbial Genome, and Natural and Accelerated Bioremediation (NABIR) programs of the Office of Biological and Environmental Research, Office of Science. Oak Ridge National Laboratory is managed by University of Tennessee-Battelle LLC for the Department of Energy under contract DE-AC05-00OR22725.
| FOOTNOTES |
|---|
Supplemental material for this article may be found at http://aem.asm.org/. ![]()
| REFERENCES |
|---|
|
|
|---|
70 is the principal sigma factor responsible for transcription of acs, which encodes acetyl coenzyme A synthetase in Escherichia coli. J. Bacteriol. 182:551-554.
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| J. Bacteriol. | Microbiol. Mol. Biol. Rev. | Eukaryot. Cell | All ASM Journals |
|---|