Previous Article | Next Article ![]()
Applied and Environmental Microbiology, May 2008, p. 2717-2727, Vol. 74, No. 9
0099-2240/08/$08.00+0 doi:10.1128/AEM.02195-07
Copyright © 2008, American Society for Microbiology. All Rights Reserved.

Department of Microbial Ecology, Centre of Ecological and Evolutionary Studies, University of Groningen, Kerklaan 30, 9751 NN Haren, The Netherlands
Received 26 September 2007/ Accepted 20 February 2008
|
|
|---|
|
|
|---|
|
|
|---|
DNA extraction from soil.
DNA was extracted from soil with the UltraClean Soil DNA kit (Mo Bio Laboratories Inc.). DNA isolation was performed according to the manufacturer's instructions, modified as follows. Extra glass beads (0.15 to 0.30 g; bead size, 0.1 mm) were added to the soil samples, and the cells were disrupted by bead beating (mini-bead beater; Bio Spec Products) two times for 30 s. Final purification of the extracted DNA was performed with the Wizard DNA cleanup system (Promega).
Primers.
Primers for PCR amplification were designed to be specific for bacterial 16S rRNA gene targets. A 17-mer forward primer, designated F-968 (5'-AA CGC GAA GAA CCT TAC-3'), to which a 40-mer GC clamp (5'-CGC CCG GGG CGC GCC CCG GGC GGG GCG GGG GCA CGG GGG G) was attached at the 5' end was combined with 13 different reverse primers situated roughly around position 1400 (Table 1) to amplify the bacterial 16S rRNA gene fragments (22).
|
View this table: [in a new window] |
TABLE 1. Reverse primers used in PCR experiments in this study
|
PCR amplification.
PCR mixtures were composed as follows. Seven microliters of 10x Stoffel PCR buffer (Applera, Nieuwerkerk a/d IJssel, The Netherlands), 100 nmol MgCl2 (Applera), 0.5 µl formamide, 0.5 µg T4 gene 32 protein (Roche, Almere, The Netherlands), 10 nmol of each deoxyribonucleoside triphosphate, 10 pmol of each primer, and 3 U of 10 U/µl AmpliTaq DNA polymerase, Stoffel fragment (Applera), were combined with H2O (Applera) to 50 µl in a 0.2-ml Microfuge tube. After the addition of 5 ng of template DNA, the mixtures were incubated in a Gene Amp PCR system 9700 (Applera) programmed as follows: initial denaturation of double-stranded DNA for 5 min at 94°C; 10 (touchdown) cycles consisting of 1 min at 94°C, 1 min at 60°C, and 2 min at 72°C with a decrease in the annealing temperature of 0.5°C per cycle; 25 cycles consisting of 1 min at 94°C, 1 min at 55°C, and 2 min at 72°C; and extension for 30 min at 72°C. All amplification products were purified with the Wizard PCR DNA purification system (Promega, Madison, WI) and analyzed by electrophoresis in 1.0% (wt/vol) agarose gels, followed by ethidium bromide staining (1.2 mg/liter ethidium bromide in 1x Tris-acetate-EDTA) (19, 28).
DGGE.
Denaturing gradient gel electrophoresis (DGGE) was performed with the Ingeny Phor-U system (Ingeny International, Goes, The Netherlands). The PCR products were loaded onto a polyacrylamide gel (6% [wt/vol] acrylamide in 0.5x TAE buffer [2.42 g Tris base, 0.82 g sodium acetate, 0.185 g EDTA, 1 liter of H2O; pH adjusted to 7.8 with acetic acid]) with a 40 to 70% denaturant gradient (100% denaturant was 3.5 M urea and 32% [vol/vol] deionized formamide). The wells were loaded with roughly equal amounts of DNA (about 500 ng), and electrophoresis was carried out in 0.5x TAE buffer at 100 V for 16 h at 60°C. The gels were stained for 90 min in 0.5x TAE buffer with SYBR gold (final concentration, 0.5 µg/liter; Invitrogen, Breda, The Netherlands). Images of the gels were obtained by Imagemaster VDS (Amersham Biosciences, Buckinghamshire, United Kingdom) and stored as TIFF files. The DGGE patterns were compared by clustering the different lanes by Pearson's product-moment correlation coefficient with GelCompar II software (Applied Maths, Sint-Martens-Latem, Belgium) by the unweighted-pair group method with arithmetic mean, rolling-disk background subtraction, and no optimization (17, 27).
Preparation of clone libraries and sequencing.
Cleaned PCR products derived from the four reverse primers in conjunction with the forward primer with a GC clamp, F968-GC, were ligated into the pGEM-T easy vector (Promega, Madison, WI) and introduced into competent Escherichia coli MM294 cells (Sylphium Life Sciences, Groningen, The Netherlands) by transformation as described by the pGEM-T manufacturer's protocol. Blue-white screening was used, and white colonies were randomly picked. Colony PCR was then performed with pGEM-T primers T7F (5'-TAATACGACTCACTATAGGG-3') and SP6R (5'-GATTTAGGTGACACTATAG-3'). Clones that had yielded PCR products of the correct sizes, as determined by gel electrophoresis, were selected, and inserts were sequenced with standard primer T7F. Sequencing was performed on an ABI 377 machine (Applied Biosystems) (28).
Sequence analysis, construction of trees, and statistical analyses.
Analysis of the sequences was done with Chromas (Technelysium, Tewantin, Australia). Chimera check with Bellerophon was used to check for chimeric sequences (http://greengenes.lbl.gov/cgi-bin/nph-bel3_interface.cgi) (13). Bellerophon is a program for detecting chimeric sequences in a multiple-sequence data set by comparative analysis. It was specifically developed to detect 16S rRNA gene chimeras in PCR clone libraries but can be applied to other gene data sets. Chimeric sequences were not detected (13). The partial 16S rRNA gene sequences were compared with sequences in GenBank with nucleotide-nucleotide BLAST (BLAST-N) to obtain the nearest phylogenetic neighbors (www.ncbi.nlm.nih.gov/BLAST/). Sequences showing more than 97% similarity were considered to belong to the same operational taxonomic unit (OTU) (1). Trees were constructed from libraries obtained with each reverse primer by neighbor joining within the program MEGA 3.1 (The Biodesign Institute) and bootstrapped with 500 repetitions. These trees were used to obtain broader groupings—supported by checks with the Ribosomal Database Project (RDP) database—which served to prepare histograms revealing the compositions of the bacterial communities detected by each of the four reverse primers combined with the forward primer (18).
The program DOTUR (distance-based OTU and richness determination) was used to create rarefaction curves and to determine the Shannon diversity index, as well as the bias-corrected Chao1 estimator of richness (http://www.plantpath.wisc.edu/fac/joh/dotur.html) (14, 29). The Shannon index (H') was used to determine the diversity of bacteria present in the clone libraries created with the four reverse primers by the following equation:
![]() |
To compare the clone libraries for similarities, we used LIBSHUFF (30, 33). Differences between clone libraries were thus determined by calculating homologous and heterologous coverage curves and assessing the difference between the two curves, with the Cramér-von Mises statistic according to the method described by Singleton et al. (30, 33). For each comparison, if the lowest of the two P values calculated by LIBSHUFF is lower than or equal to the so-called critical P value (value for four libraries, 0.0043; relaxed, 0.0075) given by the program, a significant difference in the composition of the communities sampled in each library is indicated. The analyses were performed with the web-based LIBSHUFF program (version 0.96; J. R. Henriksen; http://libshuff.mib.uga.edu).
Nucleotide sequence accession numbers.
The nucleotide sequences obtained in this study have been submitted to the GenBank database and assigned accession numbers EU407826 to EU408221.
|
|
|---|
DGGE fingerprints.
High-molecular-weight DNA was recovered from all replicates of the two grassland soil samples, RG and G. The mean molecular size was 25 kb, and an average of about 15 to 20 µg DNA was obtained per g of soil, as indicated by electrophoresis on agarose gels and comparison to a marker. After Wizard DNA purification, the soil DNA preparations were colorless and sufficiently pure for PCR amplification, as evidenced by ready amplification with the canonical 16S rRNA gene-based PCR system. The electrophoresis of the PCR products obtained with the 13 different reverse primers under nondenaturating conditions in all cases revealed the expected bands of approximately 450 bp (data not shown).
To validate the differences caused by using the different reverse primers, a within-experiment check was needed to assess the method's reproducibility. Therefore, the reproducibility of the DGGE method was tested prior to the actual assessment of the profiles generated with the 13 different reverse primers. This was done by performing PCR on the basis of the 13 reverse primers in triplicate assays, running DGGE, and analyzing the resulting patterns by GelCompar. Analysis of the DGGE patterns thus obtained yielded a clear clustering among reverse primer types, as the within-treatment replicates clustered tightly together and were actually indistinguishable (data not shown). Hence, the reproducibility of the PCR-DGGE technique was very high and there were clear effects of the type of reverse primer used. We then compared the DGGE profiles generated with the 13 primer sets (one replicate each) in a single gel to circumvent the problem of gel-to-gel variation. The clustering of the patterns on this gel revealed the appearance of two broad clusters, one cluster consisting of the patterns produced by all of the reverse primers containing a G at position 14 (designated cluster a) and one cluster of those produced by reverse primers with an A at this position (designated cluster b). Since four primers contained an R degeneracy (A or G) at position 14 (designated cluster c), these were taken out of further analysis to remove any confounding effects of this degeneracy on the clustering. Figure 1A shows the DGGE patterns of PCR products produced with the nine remaining reverse primers (1378*, 1a, 2a, 2*a, 3a, 1b, 2b, 2*b, and 3b) in combination with the forward GC-clamped primer on DNA from grass rhizosphere sample RG. Analysis by GelCompar produced two main clusters, designated I and II, at low levels of similarity (Fig. 1B). The patterns produced by all reverse primers with a G at position 14 clustered together, and so did those generated with the reverse primers with an A at position 14. No other conspicuous pattern variations were discerned. Thus, the G/A switch at position 14 was apparently more determinative than the GC/CG alteration at the 5' end of the primer and/or the GGGAACG addition to the 3' end. To ascertain whether this clustering was consistent across samples, a similar analysis was performed on soil DNA taken from the grassland bulk soil (G). Analysis with GelCompar indeed supported the clustering that was also seen for sample RG. Calculation of the Shannon diversity indices (GelCompar) showed that primer R1401-1a yielded patterns with the highest diversity index (3.23), followed by R1401-1b (3.18), R1401-2a (3.16), and R1401-2b (2.68).
![]() View larger version (98K): [in a new window] |
FIG. 1. Comparison of bacterial DGGE patterns obtained from soil sample RG by PCR with canonical primer F968 (joined to a GC clamp) and reverse primers R1401-1a, -2a, -2*a, -3a, -1378*, -1b, -2b, -2*b, and -3b (A) and the corresponding dendrogram (clustering by the unweighted-pair group method with arithmetic mean) (B). M, molecular size markers; 1a, 2a, 2*a, 3a, 1378*, 1b, 2b, 2*b, and 3b, reverse primers used.
|
Clone libraries produced with four selected primer sets.
Clone libraries were successfully produced from the same soil sample, RG, with each of the four reverse primers (R1401-1a, R1401-2a, R1401-1b, and R1401-2b) in combination with the canonical forward primer. The clones obtained in the pGEM-T easy vector system were then checked for the presence of inserts of the expected size (about 450 bp) and a random set of about 100 clones with inserts per reverse primer was sequenced on the ABI377 apparatus. Rarefaction analysis (Fig. 2) of the sequences obtained revealed that coverage, as expected, was not complete; we estimated that a >10- to 30-fold increased sampling effort would be required to achieve fair coverage. The coverage obtained with primer R1401-1b was conspicuously lower than that produced with the other three primers, whereas that obtained with primer R1401-2b was the highest. On the other hand, the Shannon diversity index determined on the basis of the sequences (threshold, 97%) revealed rather similar values between the primers. As expected, the value produced with primer R1401-1b (4.12) was slightly higher than those produced with the other primers (R1401-2a, 4.07; R1401-2b, 3.99; R1401-1a, 3.93). The ChaoI species richness estimator was the highest for primers R1401-2a (232) and R1401-2b (221) and lower for the other primers (R1401-1b, 195; R1401-1a, 192).
![]() View larger version (8K): [in a new window] |
FIG. 2. Rarefaction curves of bacterial clone libraries of soil sample RG generated with primer sets consisting of canonical primer U968f combined with four different reverse primers (R1401-1a, -2a, -1b, and -2b). Sequences were compared to GenBank rRNA gene-based entries. The number of different OTUs (defined at a 97% similarity cutoff) in each sample is plotted versus the number of sequences sampled.
|
|
View this table: [in a new window] |
TABLE 2. Comparison of four 16S rRNA gene libraries using LIBSHUFFa
|
97% (R-1401-1a, 55.3%; R1401-1b, 42.7%; R1401-2a, 44.7%; R1401-2b, 50.5%). Moreover, 16.0% (R1401-1a), 15.6% (R1401-1b), 17.5% (R1401-2a), and 12.6% (R1401-2b) of the clones matched database entries between 95 and 97% and 25.5% (R1401-1a), 33.4% (R1401-1b), 34.0% (R1401-2a), and 32.0% (R1401-2b) of them matched database entries between 90 and 94%. Finally, low percentages of the clone libraries had matches in the database of 84 to 90% (R1401-1a, 3.2%; R1401-1b, 8.3%; R1401-2a, 3.8%; R1401-2b, 4.9%). The database entries with which matches were found at different similarity levels were often cultured organisms of defined taxonomic status, except for a few entries found in the phyla Acidobacteria, Verrucomicrobia, and Chloroflexi.
![]() ![]() View larger version (73K): [in a new window] |
FIG. 3. Phylogenetic trees constructed on the basis of partial 16S rRNA gene sequences (about 400 bp) generated from soil with canonical forward primer F968-GC and reverse primers R1401-1a (A, 94 sequences), R1401-2a (B, 103 sequences), R1401-1b (C, 96 sequences), and R1401-2b (D, 103 sequences). The trees were calculated with the neighbor-joining algorithm. To simplify the phylogenetic trees, clones are marked as follows: no symbol, sequences with 97% similarity; , 95 to 96% similarity; , 90 to 94% similarity; , <90% similarity. Grouping is according to phylum or class (Proteobacteria) or PN candidate divisions. Abbreviations: Acido, Acidobacteria; Actino, Actinobacteria; Bacter, Bacteroidetes; Chlam, Chlamydiae; Chloro, Chloroflexi; Chlorob, Chlorobi; Cyano, Cyanobacteria; Deino, Deinococcus; Ferri, Ferribacter; Firm, Firmicutes; Gemma, Gemmatimonadetes; -, β-, -, and -Prot, Alpha-, Beta-, Gamma-, and Deltaproteobacteria, respectively; TM7, candidate division TM7; Verruco, Verrucomicrobia.
|
![]() View larger version (25K): [in a new window] |
FIG. 4. Frequency histogram of the clones found in this study on the basis of 97%, 95% to 96%, 90% to 94%, and <90% matching OTU. The x axis shows the major existing or possibly novel phylogenetic groups (phylum or class) found in the clone libraries with each of the reverse primers used (R1401-1a, -2a, R1401-1b, and -2b), whereas the y axis shows the number of clones found in each phylogenetic group. *, these groups matched unculturable bacteria in the NCBI database.
|
The majority of the clones generated with all four primer sets fell into two main phyla, namely, Firmicutes (R1401-1a, 30.9%; R1401-1b, 18.8%; R1401-2a, 21.4%; R1401-2b, 32.0%) and the alpha, beta, gamma, and delta subgroups of Proteobacteria (R1401-1a, 27.6%; R1401-1b, 30.3%; R1401-2a, 28.1%; R1401-2b, 29.2%) (Table 3). The other phyla found with all four primers were Actinobacteria, Acidobacteria, and Chloroflexi. A main difference between the libraries generated with the four primer sets was the distribution of the Proteobacteria among the different subgroups. While 65.4% of the clones produced with primer R1401-1a were affiliated with the gamma subdivision of Proteobacteria, substantially lower frequencies of clones belonging to this subdivision were found with the other primers (R1401-1b, 24.1%; R1401-2a, 31.0%; R1401-2b, 43.3%). Also, while 37.9% of the clones obtained with primer R1401-1b fell within the delta subdivision of Proteobacteria, lower frequencies of this group were found with the other primers (R1401-1a, 3.8%; R1401-2a, 17.2%; R1401-2b, 16.7%). Members of Gemmatimonadetes and Verrucomicrobia were found with primers R1401-1a, R1401-1b, and R1401-2a, but primer R1401-2b did not yield any representative of these phyla. Further, primer R1401-1a stood out by having no clones related to the class Rubrobacteridae but having the highest number of clones associated with the class Actinobacteridae.
|
View this table: [in a new window] |
TABLE 3. Relative abundances of major phylogenetic groups found in clone libraries constructed with forward primer F968-GC and four different reverse primers in a single soil samplea
|
Sequences with low matches to the database.
Some clones, loosely affiliated with the phyla Chlorobi and candidate division TM7, had lower than 90% matches with EMBL database entries and, when considering the phylogenetic tree, might define novel groups. All of the clones with lower than 90% matches to the database were also compared to the RDP database. This analysis confirmed that the clones affiliated with the phyla Chlorobi and candidate division TM7 actually belonged to these two groups and not to a group outside of these phyla. Finally, 10 clones (produced with any of the four reverse primers) had no specific association with any of the known divisions or candidate divisions. Hence, these were phylogenetically divided into seven putatively novel (PN) division level groups named PN-1 through PN-7. Group PN-1, which was represented by two clones (2.2%), was found in the library produced with primer R1401-1a. Groups PN-2, PN-3, and PN-4 (primer R1401-1b) represented, respectively, two clones (2.1%), one clone (1%), and one clone (1%). Both primers R1401-1b and R1401-2b yielded one representative (1%) of group PN-5, and primer R1401-2b produced one clone (1%) that represented group PN-7. Group PN-6, represented by one clone (1%), was found in the library created with R1401-2a.
|
|
|---|
It has been shown before that bacterial community fingerprints obtained from soil are, to a great extent, reproducible, and so the levels of experimental variation due to methodological variation are often minimal (11). In this study, this high reproducibility of the PCR-DGGE method within methodological replicates was confirmed, providing the background for subsequent analyses in which the patterns generated with 13 primers (Table 1) were compared side by side on a single gel. Analysis of these patterns revealed the existence of two main clusters which grouped on the basis of the G/A alteration at position 14 of the reverse primers used. The existence of the two clusters was confirmed by excluding the reverse primers which carry an R (A or G) at position 14 of their sequences (R1401-1c, -2c, -3c, and -2*c) (Fig. 1A). Hence, the G-to-A switch at position 14 had a dramatic influence on the DGGE profiles obtained from the same soil sample. This was corroborated by the clear differences in the theoretical match percentages of the different primers with database sequences. In contrast, other changes in the reverse primers, such as the GGGAACG addition at the 3' end and the GC/CG alteration at the 5' end, had lesser effects. Statistical analyses of the DGGE profiles obtained showed that primer R1401-1a yielded patterns with the highest diversity, followed by primers R1401-1b, -2a, and -2b.
To further resolve the issue of reverse primer specificity effects, we produced clone libraries with primers R1401-1a, -1b, -2a, and -2b, which were examined with respect to the nature of the amplified sequences. First, rarefaction analysis indicated considerable incompleteness of sampling, an expected finding given the high bacterial diversity in most soils, including grassland (2, 5, 9). The difference in the rate of "exhaustion" of soil bacterial diversity between the different primers, notably, R1401-1b versus R1401-2b, was interesting, as it indicated that, indeed, different parts of the same soil bacterial community were being accessed. To further theoretically underpin these data, we analyzed the degree to which the four primers matched clones of the 139,340-member Waseca County farm soil metagenome available via the web. Primers R1401-1a, -1b, -2a, and -2b matched sequences present in seven, nine, two, and two clones in the library, respectively. These data are grossly consistent with an expected frequency of occurrence of the 16S rRNA operon in a soil DNA library of 1/104 to 105 clones but do not shed light on the exact prevalence because of statistical uncertainties.
Moreover, the analysis of the clone libraries by LIBSHUFF in several cases indicated a clear effect of the reverse primer type on the composition of the clone libraries, and hence we could confirm the initial observation—obtained by DGGE—that different parts of the bacterial community present in the same soil are targeted by different reverse primers, sometimes differing by only a single nucleotide.
However, the analysis of the different bacterial sequence types produced by the different primers, with a few exceptions, did not reveal major differences in the observed frequencies of specific sequence types (at >97% similarity). We attribute this finding to the diversity that is found within many bacterial taxa, allowing the amplification of apparently similar taxa with (slightly) different bacterial primers. An alternative explanation for this finding might lie in the nature of the primer-to-target annealing process, which is known to be determined, to a large extent, by the 3'-end sequence but may allow the production of amplicons resulting from sloppy annealing.
Overall and quite expectedly, the analysis of the sequences produced in the four libraries revealed the amplification, to different extents, of members of 14 different bacterial phyla from soil, namely, Firmicutes, Proteobacteria, Actinobacteria, Acidobacteria, Verrucomicrobia, Chloroflexi, Gemmatimonadetes, Chlorobi, Bacteroidetes, Cyanobacteria, Chlamydiae, Ferribacter, Deinococcus, and candidate division TM7, the dominant phyla being Firmicutes and Proteobacteria. The differences observed between the amplification rates of the different primers (Fig. 4) clearly indicate different annealing preferences. However, the finding of the 14 major groups is, overall, fairly consistent with data obtained in a range of other studies of different soils, and most of these phyla are now recognized as forming the numerically dominant parts of the bacterial communities in a majority of soils (36). For instance, Mummey et al. (20) recently found that members of Gemmatimonadetes, Actinobacteria (subdivision Rubrobacteridae), Acidobacteria, and the alpha subgroup of Proteobacteria were among the dominant members of the bacterial communities in two soils. In our study, the majority of the clones matched members of recognized bacterial divisions; however, a small fraction of the clones revealed the existence in the grassland soil of seven potential novel candidate divisions, PN-1 through PN-7, found to different extents in the libraries produced with all of the primers. This finding indicates that, in spite of the finding of a large fraction of sequences that match database entries with taxonomic status, our understanding of the extent of microbial diversity in soil is still incomplete, even at deeper phylogenetic levels. Moreover, even slightly different amplification systems may yield different phylogenetic novelty. However, given the scarcity of the current data (only a few clones found per putatively novel group), a more precise definition of the PN groups should await the production of more sequence information from other studies.
Although the data obtained from the clone libraries and the DGGE patterns indicated clear differences in the amplified bacterial communities, there was not an obvious connecting point between the data. On the basis of the matching diversity found within the RDP database (Table 1), the theoretically "best" reverse primers for use in amplifications would be R1401-1a or -2a, followed by R1401-1b and -2b. However, the numbers of matching RDP database entries do not necessarily represent the bacterial diversity typical for soil. Thus, the database does not (yet) contain categories of organisms such as the class "Spartobacteria" within Verrucomicrobia, the order "Solibacterales" within the phylum Acidobacteria, and the phylum Fibrobacteres. In our study, theoretically "suboptimal" primer R1401-1b gave high diversity indices on the basis of both DGGE patterns and clone libraries. Specifically, this primer yielded, among 100 clones, the highest numbers of different phyla (11) and novel candidate divisions (4) within the domain Bacteria. Thus, the G-to-A alteration at position 14 of the primer apparently strongly influenced the patterns generated, allowing primer R1401-1b to give high indices in both DGGE and clone libraries.
Considering the foregoing, we conclude that the present study, on the basis of the comparison of existing and newly designed primers, revealed a major effect of the sequence of the reverse primer used on the "apparent" (observed) soil bacterial diversity and community composition. Where the different primers thus provide different angles at the extant bacterial community in soil, if a single primer would have to be recommended on the basis of the observed levels of diversity, this might be R1401-1b. However, the concomitant use of primers R1401-1a and -1b might offer the best view of the diversity of bacterial communities in soil.
Published ahead of print on 29 February 2008. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»