Previous Article | Next Article ![]()
Applied and Environmental Microbiology, November 2006, p. 6965-6971, Vol. 72, No. 11
0099-2240/06/$08.00+0 doi:10.1128/AEM.00690-06
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Department of Biology, Southern Oregon University, 1250 Siskiyou Boulevard, Ashland, Oregon 97520
Received 25 March 2006/ Accepted 11 August 2006
|
|
|---|
100 km) in Northern California and Southern Oregon by producing 16S rRNA clone libraries from pairs of samples taken on either side of the serepentine-nonserpentine edaphic boundary at three geographical sites. At the division level, the serpentine and nonserpentine communities were similar to each other and to previous data from forest soils. Comparisons of both richness and Shannon diversity produced no significant differences between any of the libraries, but the vast majority of phylogenetically based tests were significant, even with only 50 sequences per library. These results suggest that most samples were distinct, consisting of a collection of lineages generally not found in other samples. The pattern of results showed that serpentine communities tended to be more similar to each other than they were to nonserpentine communities, and these differences were at a lower taxonomic scale. Comparisons of two nonserpentine communities generally showed differences, and some results suggest that the geographical site may control community composition as well. These results show the power of phylogenetic tests to discern differences between 16S rRNA libraries compared to tests that discard DNA data to bin sequences into operational taxonomic units, and they stress the importance of replication at larger scales for inferences regarding microbial biogeography. |
|
|---|
To date, investigations of serpentine soils have been largely culture based and have focused on the heavy-metal resistance of specific strains or the ecology of metal cycling (1). They have revealed that some bacteria resist a range of heavy metals common in serpentine soils by means of plasmid-borne genes (19, 24), and investigators have found that nickel-rich litter from the leaves of nickel-hyperaccumulating plants creates a high-nickel soil microenvironment, from which they isolated several strains of highly nickel-resistant bacteria. The goal of this paper was to compare bacterial communities from serpentine soils to communities from immediately adjacent nonserpentine soils by using 16S rRNA gene clone libraries. Furthermore, I replicated the serpentine-to-nonserpentine community comparison at three different sites over a large-scale landscape (
100 km).
The question of microbial biogeography in natural environments has become increasingly controversial in recent years. While there is evidence to support the traditional microbiological view that "everything is everywhere" and that the global microbial community consists of a relatively small number of cosmopolitan species (7-9), an increasing number of molecularly based studies have found evidence for biogoegraphical patterns in microbial distribution and diversity (7a, 10, 12, 16, 18, 25). However, studies specifically designed to compare microbial communities across large-scale landscapes and that incorporate replication of these comparisons are rare. Although Fierer and Jackson's results (7a) show the importance of edaphic factors in soil microbial diversity, their use of the coarse-resolution terminal restriction fragment length polymorphism technique provides little detail about the specific lineages that respond to particular edaphic factors. The dramatic chemical differences between serpentine and nonserpentine soils make plausible the existence and detection of consistent differences in the presence and diversity of specific sequence-defined lineages unique to serpentine soils across larger-scale regional geography, should such differences exist.
From an evolutionary perspective, we can state two general hypotheses concerning the bacterial communities of serpentine soils, defined by how they might have adapted to their unique chemical environment. This could either be (i) through long-term evolution of phylogenetically distinct lineages specially adapted to serpentine soils or (ii) through rapid evolution of a small number of adaptive genes, perhaps including horizontal gene transfer or plasmid-borne genes (24). The first hypothesis predicts that a comparison of serpentine to nonserpentine soils will reveal that each community is comprised of different sets of more deeply divergent lineages, while the second hypothesis predicts that essentially the same lineages exist in both communities, with differences being due to a small number of adaptive genes and not likely detectable by comparison of 16S rRNA genes.
An important aspect of this study is that I provide true replication, not pseudoreplication, of the serpentine-to-nonserpentine community comparisons over a geographical extent of
100 km. One can argue that true replication is accomplished, for example, by taking several soil samples from a 1-m2 plot and investigating each independently. However, inferences made from these samples are valid only for that 1-m2 plot, in part because of the large potential for soil heterogeneity, but also due to our ignorance of larger-scale microbial biogeography. Even when samples are taken over a larger area, such as an agricultural field or larger study plot, extrapolating the generality of the results to areas outside the study site remains speculation. The general lack of true replication to date in molecular studies is understandable, given the labor-intensiveness and expense of clone library construction and sequencing and the daunting potential for enormous soil heterogeneity over virtually every scale from the microscopic to the regional landscape. The desire to make such general inferences about larger-scale bacterial communities, however, necessitates replication of community profiles over an appropriately larger scale.
In this study, I constructed 16S clone libraries from three different subalpine forest sites in northern California and southern Oregon (Fig. 1). At each of these sites, pairs of soil samples were collected on either side of a sharp boundary between serpentine soil and an immediately adjacent nonserpentine soil, allowing me to make three independent serpentine-to-nonserpentine comparisons. This study design allowed me to (i) test whether there is a consistent pattern of differences in bacterial community composition across a large geographical area, (ii) identify which bacterial groups might be responsible for any observed differences, and (iii) characterize any observed differences in ways that will guide future investigations. The study design allowed independent comparison of the effects of soil type (serpentine versus nonserpentine), as well as the effects of a geographical site, on the microbial community composition.
![]() View larger version (46K): [in a new window] |
FIG. 1. Map of the study area showing the three geographical sampling sites, indicated by triangles, in the context of the serpentine outcrops in Northern California and Southern Oregon, indicated in black. The dashed lines show county borders. A paired set of soil samples was taken at each of these three sites for a total of six clone libraries. The Dutchman's Peak site is at 42.050°N, 122.846°W; Mt. Eddy is at 41.317°N, 122.490°W; and East Boulder Lake at 41.234°N, 122.785°W.
|
|
|
|---|
DNA extraction and PCR.
I performed the DNA extractions using the MoBio UltraClean Soil DNA Kit (MoBio Corp., Carlsbad, CA) according to the manufacturer's protocol and using 90 seconds of bead beating at 4,200/min. I performed PCR for the 16S rRNA gene using the universal bacterial 16S rRNA gene primers 8F (5'-AGRGTTYGATYMTGGCTCAG-3') and 1492R (5'-CGGCTACCTTGTTACGACTT-3'). PCR conditions were as follows: a buffer of 60 mM Tris-HCl, 15 mM ammonium sulfate, pH 8.5, with 2.5 mM MgCl2, 200 µM deoxynucleoside triphosphates, 0.4 µM of each primer, 1 unit of Taq polymerase, and 0.4 µl of a 101 dilution of the pooled DNA extract as the template in 20-µl reaction volumes. For each DNA extract, I performed 16 replicate PCRs and pooled the product prior to clone library construction in order to minimize the effect of any PCR drift in individual reactions during amplification. The PCR cycling conditions were 94°C for a 1-minute initial denaturation, followed by 35 cycles of 94°C for 30 seconds, 52°C for 1 minute, and 72°C for 1 minute and 30 seconds, with a final extension of 72°C for 10 min. The PCR products were run out on a 1% low-melting-point agarose gel, and the
1.5-kb fragments were excised from the gel and purified with the QIAEXII Gel Extraction Kit (QIAGEN Corp., Valencia, CA) in preparation for cloning.
Clone library construction and sequencing.
I used the pGEM-T Easy cloning vector system (Promega Corp., Madison, WI) according to the manufacturer's protocol to clone the pooled PCR products. I chose 50 random clones from each library for sequencing using the primer 926F (5'-AAACTYAAAKGAATTGACGG-3') and the ABI BigDye v3.0 cycle-sequencing kit (Applied Biosystems, Foster City, CA). I used an ABI 310 Genetic Analyzer for the actual sequencing. I manually checked and edited all sequencer chromatograms prior to alignment. The risk of sequencing chimeric sequences was reduced by the fact that, while I cloned almost the entire length of the 16S rRNA gene, I sequenced only the last third of it, thus missing a large percentage of potential chimeric breakpoints. Sequences were checked for possible chimeric origin by using the Ribosomal Database Project's CheckChimera program and the Mallard program, which is based on the Pintail algorithm by Ashelford et al. (2); I did not use the Bellerophon chimera detection software due to the relatively short sequence lengths. A few potentially suspicious sequences were discarded, and additional clones were sequenced until each library consisted of 50 nonchimeric sequences. The final data for each clone consisted of approximately 580 bases at the 3' end of the 16S rRNA gene, which contained the V6, V7, V8, and V9 hypervariable regions.
Phylogenetic analyses.
I performed manual sequence alignments based on the 16S rRNA structural diagrams of Cannone et al. (4a), including 38 known sequences representative of major bacterial divisions. I used a conservative sequence mask that resulted in an alignment of 568 base positions for phylogenetic analyses. Exploratory phylogenetic analyses revealed that Aquifex and Thermotoga were basal to all new sequences, so I used these two as outgroups in all further phylogenetic analyses rather than archaeal sequences. In order to compare two different libraries, I combined the 50 sequences from each library, added Aquifex and Thermotoga, and estimated their phylogeny with maximum likelihood using fastDNAml. This was repeated for all 15 possible pairwise comparisons of the six libraries. In order to directly compare sites, I combined the serpentine and nonserpentine sequences from both samples at a site and then estimated the phylogeny rooted with Aquifex and Thermotoga using fastDNAml. This was repeated for the three possible site-versus-site comparisons. Finally, to make an overall comparison of the pooled serpentine-versus-nonserpentine sequences, all 300 sequences were combined, and an enormous computational effort was devoted to estimating a maximum likelihood phylogeny rooted with Aquifex and Thermotoga using fastDNAml. I similarly constructed Jukes-Cantor distance matrices for all levels of hierarchical pairwise comparisons (sample versus sample, site versus site, and serpentine versus nonserpentine) by using DNADIST from the PHYLIP package.
Statistical analyses.
I performed the phylogenetic P tests described by Martin (15) at all levels of comparison (sample versus sample, site versus site, and serpentine versus nonserpentine) using the maximum likelihood trees. In these tests, the soil type was coded as a character on the tree, and the minimum number of changes between soil types required to explain the tree was computed by MacClade (14). The number of changes was compared to the distribution of changes on 1,0000 random trees in order to produce a P value for the likelihood of the observed number of changes. These P values were used to determine if phylogeny significantly covaried with soil type. I used Arlequin (22) to perform the F test as described by Martin (15). In this test, FST values between sets of sequences are based on the average pairwise distance of two sequences within a set compared to the average pairwise distance of two sequences from different sets. Significance is tested by randomly permuting the sequences into two sets 10,000 times and computing a null distribution of FST values to which the observed FST value can be compared for significance by calculating a P value. I used S-LIBSHUFF (21, 23) to make pairwise comparisons between clone libraries at all levels (sample versus sample, site versus site, and serpentine versus nonserpentine) by using the Cramer-von Mies statistic. These tests use distance matrices to estimate the coverage of a clone library over a range of taxonomic levels, enabling the detection of significantly different microbial communities (as defined by nonoverlapping coverage), as well as whether one library is a subset of another (defined by differences in reciprocal coverage). S-LIBSHUFF calculates P values by using a random-permutation procedure to establish the statistical significance of overlapping coverage.
To estimate diversity statistics, I used DOTUR (20) to compute the Shannon diversity and the Chao and ACE richness estimates at a range of taxonomic levels based on the Jukes-Cantor distance matrices. This program uses the furthest-neighbor method to collapse similar sequences into groups at arbitrary levels of taxonomic similarity and then computes the Shannon, Chao, and ACE statistics for that taxonomic level. I computed these statistics for each sample separately, for each site separately, and also for the overall serpentine and nonserpentine sequences. Bootstrapping procedures within the DOTUR program assess the confidence limits of these estimators.
Nucleotide sequence accession numbers.
The GenBank accession numbers for the 16S rRNA gene sequences from each of the six samples are DQ457699 through DQ457998.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Numbers of clones allied to known and candidate divisions from the 16S rRNA gene libraries from serpentine and nonserpentine soils
|
![]() View larger version (11K): [in a new window] |
FIG. 2. Results of phylogenetic P tests. (A) Matrix of P values comparing maximum likelihood phylogenetic trees from individual clone libraries using Martin's phylogenetic P test. The first letter of each sample code is either S for serpentine or N for nonserpentine, and the next two letters encode the specific site (DP for Dutchman's Peak, EB for East Boulder Lake, or ME for Mount Eddy). The four quadrants of the matrix are groupings of like tests (nonserpentine versus nonserpentine in the upper left, serpentine versus nonserpentine in the lower left, and serpentine versus serpentine in the lower right). Significant P values (those less than 0.05) are in boldface; nonsignificant results indicate that 16S phylogeny does not covary with the sampling site. (B) Comparisons of one site to another, with serpentine and nonserpentine samples from each site pooled. (C) Overall serpentine-nonserpentine comparison.
|
![]() View larger version (23K): [in a new window] |
FIG. 3. FST values based on average pairwise differences within and between 16S clone libraries and results of significance tests. (See the legend to Fig. 2 for site codes.) (A) FST values based on average pairwise differences within and between 16S clone libraries are given above the diagonal; below the diagonal are P values from significance testing by permutation tests using Arlequin. The four quadrants of the matrix are groupings of like tests (nonserpentine versus nonserpentine in the upper left, serpentine versus nonserpentine in the lower left, and serpentine versus serpentine in the lower right). P values of less than 0.05 are in boldface; nonsignificant results indicate that there is not significant population differentiation based on a matrix of pairwise distances. (B) Comparisons of one site to another, with serpentine and nonserpentine samples from each site pooled. (C) Overall serpentine-nonserpentine comparison.
|
![]() View larger version (13K): [in a new window] |
FIG. 4. Results of S-LIBSHUFF clone library comparisons. (See the legend to Fig. 2 for site codes.) (A) Matrix of P values produced by S-LIBSHUFF comparisons of clone library distance matrices. The values test the heterologous coverage CROW,COLUMN versus the homologous coverage, CROW. The four quadrants of the matrix are groupings of like tests (nonserpentine versus nonserpentine in the upper left, serpentine versus nonserpentine in the lower left, and serpentine versus serpentine in the lower right). P values of less than 0.05 are in boldface; nonsignificant results indicate overlapping coverage of the second library by the first. For cases in which CXY is not significant and CYX is, library X is considered to be a subset of library Y. The matrix suggests that SME and NME are both separate subsets of SDP and that NEB is a subset of both SEB and SME. (B) Comparisons of one site to another, with serpentine and nonserpentine samples from each site pooled. These results suggest that the Mount Eddy community is a subset of the Dutchman's Peak community. (C) Overall serpentine-nonserpentine comparison.
|
|
|
|---|
The combined results of the P tests, F tests, and library shuffling do suggest that serpentine communities are more similar to each other than serpentine communities are to nonserpentine communities. The pattern of significant results in the P tests (Fig. 2A), F tests (Fig. 3A), and library shuffling (Fig. 4A) all show a bias toward nonsignificant results in the quadrant comparing serpentine communities to other serpentine communities, with most of the serpentine-to-nonserpentine comparisons and nonserpentine-to-nonserpentine comparisons producing significant results. This suggests that the soil type or, more accurately, the parent rock type does help to determine microbial community composition across a broad geographic region at my sampling sites, which are all moderate-elevation forested sites with poorly developed soils. The fact that all of the nonserpentine-to-nonserpentine comparisons in all three tests were significant may be partially explained by the fact that I have used "nonserpentine" as a negative definition, which obscures a diversity of parent rock types among the nonserpentine sites. The geology ranges from quartz biotite schist at Dutchman's Peak to diorite at Mt. Eddy to quartz diorite at East Boulder Lake. If this edaphic diversity helps to determine community composition in these samples, then we might expect some significant differences among the comparisons (7a).
Although large differences in the ACE and Chao richness estimators were calculated from individual samples, there were no significant differences at the sample, site, or soil type level. Shannon diversities estimated by DOTUR produced a much narrower range among samples (from 3.5 to 3.8), with no significant differences. These results are consistent with those of studies that have determined that very large numbers of sequences must be obtained from clone libraries of complex environments in order to accurately estimate a number of parameters, such as species richness (5, 6, 17).
Comparisons by geographical site.
To make site-based comparisons, we pooled the serpentine and nonserpentine libraries at each site to create a larger sequence set that represented the community at that geographical site. The P tests, F tests, and library-shuffling tests based on site-versus-site comparisons showed conflicting results. The Dutchman's Peak-versus-East Boulder Lake comparison produced significant results in all of the statistical tests, while the other two comparisons produced a mix of significant and nonsignificant results (Fig. 2B, 3B, and 4B). These results suggest that the geographical site can play a role in determining soil bacterial communities. However, in doing the site-based tests, frequently two very distinctive communities were combined and compared to a combination of two other very distinctive communities, a process that could tend to produce spurious significant results and reduce the effective power of these statistical tests.
Overall comparison by soil type.
The relative abundances of different bacterial divisions from serpentine and nonserpentine communities were not significantly different from each other (Table 1) and are generally similar to those in other culture-independent studies of forest soils (3, 11). At the broadest taxonomic scale, there is little evidence for a marked change in community composition or unique dominant groups on serpentine soils (Table 1). However, the P tests, F tests, and library-shuffling results (Fig. 2C, 3C, and 4C) comparing overall serpentine to overall nonserpentine communities did all have significant results, suggesting that differences in the communities do exist at a finer taxonomic scale than the division level. In order to discover which phylogenetic groups might be contributing to the significant overall serpentine-to-nonserpentine P-test result, I attempted to remove the significance by pruning branches off the overall tree and recalculating the P test until the result was nonsignificant (data not shown). It took at least four separate prunings of a minimum of 57 sequences in order to remove the overall serpentine-to-nonserpentine significance, supporting the idea that there is no single major group that is largely responsible for this difference. There is some suggestion of higher-level groups that are unique to serpentine soils, such as a clade within the green nonsulfur bacterium-related bacteria and those related to the OP8 division (Table 1), but they are minor components of the overall communities, and the differences are not significant with my sample sizes.
Conclusions.
One hypothesis is that the chemistry of serpentine soils constrains the community composition to be drawn from a limited number of bacterial groups. This would predict a pattern similar to the one I observed, with most serpentine-to-serpentine results being nonsignificant, while the larger pool of groups from which other soil type communities may draw from would result in a larger number of significant differences. The library-shuffling data do not support the idea that serpentine communities are generally subsets of other communitiesthe overall serpentine-to-nonserpentine comparison (Fig. 4C) is significant in both directions, and of those comparisons which did identify one library as a subset of another (Fig. 4A), there are actually more cases of nonserpentine communities being subsets of serpentine communities than vice versa. The general pattern of more differences between serpentine and nonserpentine communities at the lower taxonomic levels is consistent with horizontal gene transfer of relatively few genes allowing adaptation to the serpentine environment.
Despite having only a tiny sample size of three geographical sites and a relatively small number of 50 sequences per library, a pattern does emerge from the results of the P tests, F tests, and library-shuffling tests that suggests that serpentine communities are phylogenetically more similar to each other than they are to nonserpentine communities, but at a lower taxonomic level than the division. I have controlled my samples to be from moderate-elevation poorly developed forest soils in southern Oregon and northern California, so I cannot infer similarities among serpentine soils of different types and from other geographical locations. However, my results suggest that future work devoted to expanding the range of soil types (e.g., low-elevation serpentine grasslands or Darlingtonia bogs) within the Klamath Mountains region, as well as expanding the geographical scope outside the region to similar elevations and qualities of forest soils, will be a fruitful avenue for defining the range of serpentine environments over which such a pattern of similarity may hold. Using true replication to determine the geographical and soil type limits over which bacterial communities in natural soil environments are phylogenetically similar would be an important advance in microbial biogeography.
Published ahead of print on 1 September 2006. ![]()
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»