Previous Article | Next Article ![]()
Applied and Environmental Microbiology, September 2005, p. 5544-5550, Vol. 71, No. 9
0099-2240/05/$08.00+0 doi:10.1128/AEM.71.9.5544-5550.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Department of Biology, Duke University, Durham, North Carolina 27708,1 Centre for Biodiversity and Conservation Biology, Royal Ontario Museum, and Department of Botany, University of Toronto, Toronto, Canada2
Received 10 November 2004/ Accepted 15 April 2005
|
|
|---|
|
|
|---|
Sequence-based studies of DNA obtained directly from environmental samples are revolutionizing our view of microbial diversity (18). This technique involves using PCR to amplify target regions of DNA extracted directly from environmental samples. These amplicons are then cloned to isolate individual fragments and sequenced. The application of this approach to a wide range of substrates has resulted in the identification of a large number of new bacterial species that cannot be cultivated, many of which constitute novel lineages or are dominant components of microbial communities (11). This technique has become an indispensable tool in bacterial soil ecology, but its use in studies of eukaryotic microorganisms has been limited. Sequence-based studies have been carried out on fungi from grass leaves (6), plant roots (10, 35, 38), and soil (30), but most studies have been restricted to the use of methods based on fragment length polymorphisms (13, 39) or denaturing gradient gel electrophoresis (2, 37). These methods collectively have the potential to improve our understanding of fungal biodiversity and facilitate ecological studies of soil communities.
Most recent studies of uncultured microbial diversity have sequenced genes coding for the small-subunit (SSU) rRNA. Sequencing of SSU rRNA is highly effective in bacteria, but this region does not evolve rapidly enough in eukaryotes for use in more than higher level taxonomic assignments (38). The internal transcribed spacers (ITS) that separate the SSU and large-subunit (LSU) rRNAs in eukaryotes evolve at a much faster rate and can be used to identify fungi to genus and often to species level. This region has become a critical marker for species identification from fungal vegetative structures such as mycorrhizae (14, 35). Due in large part to the use of ITS sequence data for fungal molecular systematic studies (5), a large database of publicly available ITS sequences from diverse fungal taxa now exists (40). If the taxonomic coverage of this database is sufficient such that all sequences from environmental samples will have a close match in the database, the ITS rRNA will provide high-level resolution among different members of fungal communities while simultaneously providing information on taxonomic composition.
Our objectives in this study were (i) to determine the feasibility of using sequence-based biotic inventories for soil fungi to quantify species richness and to detect correlations between the taxonomic composition of soil communities and environmental variables and (ii) to compare the utility of the ITS and SSU rRNA regions for characterizing soil fungi. We hypothesized that a sequence-based approach would detect higher levels of fungal diversity in the soil, representing a greater taxonomic breadth than is usually detected with traditional survey methods, and that distinct fungal communities could be detected at different depths in the soil profile. We expected the ITS region to provide finer-scale taxonomic resolution of community composition than the SSU while still permitting taxonomic placement of sequences because of the extensive sequence database available. If this approach is successful, it can be used to greatly increase our understanding of the dynamics of soil fungal communities through rapid and unbiased inventories of soil fungi.
|
|
|---|
At each site, 1-m2 quadrats were sampled at 5-m intervals along transects spaced approximately 5 m apart, for a total of 50 soil samples per plot. For each quadrat, all of the litter was collected and one soil core 2.5 cm in diameter and 20 cm deep was taken. In each soil core distinct zonation corresponding to O (organic material and detritus), A (organic matter-rich mineral soil), and B (clay-rich mineral soil) horizons could be visually detected. Each soil core was subdivided by zone, and all samples within a soil horizon type were composited to overcome any within-plot spatial heterogeneity. Litter samples were air dried for 24 h and homogenized by grinding in a Wiley mill (Thomas Scientific, Swedesboro, NJ). Organic and mineral soil samples were sieved using a 2-mm sieve to remove large rocks and roots. Samples were stored at 4°C for 1 week or less before processing. Subsamples (50 ml) were finely ground using a sterile mortar and pestle and liquid nitrogen and stored at 20°C until DNA was extracted. Samples were designated according to plot (P, pine; MH, mixed hardwood) and horizon (L, litter; O, organic; A, A horizon; and B, B horizon).
Molecular methods.
DNA was isolated from 0.5 g of the finely ground subsamples using the Mo Bio Ultraclean soil DNA extraction kit (Mo Bio Laboratories, Solana Beach, CA) according to the manufacturer's instructions. The ITS and SSU rRNA regions were amplified from these extracts using universal primers (ITS1, ITS4, NS1, and NS2) (42) as well as taxon-specific primers ITS1F (fungus-specific) and ITS4B (Basidiomycota specific) (16). PCR was performed using the following protocol: initial denaturation at 94°C for 3 min, followed by 35 cycles of 94°C for 1 min, 50°C for 30 s, and 72°C for 1 min, with a final extension at 72°C for 7 min. A modified PCR protocol consisting of an initial denaturation at 94°C for 5 min, followed by 20 cycles of 94°C for 1 min, 50°C for 30 s, and 72°C for 2 min, with a final extension at 72°C for 7 min was used for some PCRs to assess the effect of cycle number and extension time on the generation of chimeric sequences and other PCR artifacts (Table 1). Each 25-µl PCR consisted of 25 µg bovine serum albumin, 0.625 U Taq DNA polymerase (Abgene, Rochester, NY), 1.5 mM MgCl2, deoxynucleoside triphosphates (0.2 mM each), primers (0.5 µM each), and PCR buffer.
|
View this table: [in a new window] |
TABLE 1. Number of sequences and OTUs recovered from each library
|
White colonies were screened for inserts by adding whole cells directly to PCRs and amplifying the inserts using M13 F and R primers (Invitrogen, Carlsbad, CA) and a "colony PCR" protocol: initial denaturation of 94°C for 10 min, followed by 35 cycles of 94°C for 1 min, 55°C for 1 min, and 72°C for 1 min, with a final extension of 72°C for 10 min. PCR products containing the insert were purified by affinity chromatography using Qiaquick spin columns (QIAGEN, Valencia, CA). Cycle sequencing was performed with Big Dye chemistry version 3.1 (Applied Biosystems, Foster City, CA) and sequences were determined with an ABI 3700 (Applied Biosystems). All PCR and sequencing reactions were run on a Primus 96 Plus thermal cycler (MWG AG Biotech, High Point, NC).
Bioinformatics and phylogenetic analyses.
Forward and reverse sequence reads were assembled using Sequencher 4.0 (Gene Codes, Ann Arbor, MI). Conserved motifs were identified in the SSU and LSU regions flanking each ITS sequence and the sequences were trimmed to these motifs to ensure that all sequences had the same start and end point. Pairwise alignments were generated among all ITS sequences using the Needleman-Wunsch global alignment algorithm (26) as implemented in the program Needle in the EMBOSS software suite (28). This program calculates pairwise similarity values by dividing the number of matching nucleotides by the total length of the alignment.
Operational taxonomic units (OTUs) were defined as a group of sequences sharing at least 97% pairwise similarity, which is within the range of intraspecific ITS sequence divergence. A Perl script (written by J. E. Stajich using the Bioperl Toolkit [32] and available from the authors upon request) was used to cluster sequences using single-linkage clustering. Since terminal gaps were treated as mismatches in these analyses, pairwise similarity was calculated separately for sequences where the terminal motifs could not be identified by using only homologous parts of the sequences to avoid underestimating their similarity to the other full-length sequences.
Representative sequences for each ITS OTU were queried against GenBank using BLAST (1). All sequences for which the ITS1 and ITS2 regions matched unrelated sequences (i.e., different genera) from the database with an alignment score
200 were assumed to be chimeric sequences resulting from PCR recombination and were excluded from subsequent analyses. Each sequence was identified to the lowest taxonomic rank common to all of the top BLAST hits using the taxonomy of Kirk et al. (21). Due to the high degree of sequence conservation in the 5.8S rRNA (located between the ITS1 and ITS2 regions), all sequences had moderate to high BLAST scores for this portion of the sequence. Thus, a match was required in at least half of the ITS1 or ITS2 region to confidently identify sequences below kingdom or phylum. Most sequences with an alignment score over 600 could be identified to the level of genus, but this was not possible for some taxonomically problematic groups. We did not attempt to identify OTUs to the level of species. The taxonomic placement of sequences with high similarity to anamorphic fungi were determined either by looking at the best BLAST match for a teleomorph or by consulting phylogenetic studies which included that species (23, 31).
All unique SSU sequences were queried against GenBank using BLAST (1). Chimeric sequences were identified using the program Chimera Check from the Ribosomal Database Project II (25) and eliminated from subsequent analyses. The remaining sequences were subjected to automated multiple alignment using ClustalW (36) and the resulting alignment was edited by eye using MacClade 4.0 (24). Ambiguously aligned regions were excluded from the alignment. Phylogenetic analysis was conducted with the neighbor joining method using PAUP*4.0b10 (34).
Diversity analyses.
All diversity calculations were carried out using the program EstimateS 6.0b (9). Shannon-Weiner diversity indices were calculated for each clone library using OTUs as a proxy for species. Species-effort curves were used to determine how sensitive the observed richness was to the number of clones sampled. These curves were constructed by randomly resampling different numbers of sequences from the data set without replacement, and determining the number of OTUs in each sample. This resampling was repeated 1000 times for each sample size and the mean and variance of these richness estimators were calculated. Species-effort curves also were constructed for estimated richness using the ACE nonparametric richness estimator (7). Differences in community composition among soil horizons within and between plots were investigated by plotting relative abundances of different taxonomic groups at hierarchically nested taxonomic scales for each library.
Nucleotide sequence accession numbers.
All sequences were deposited in GenBank, with accession numbers AY969316 to AY970290 for the ITS sequences and AY969135 to AY969315 for the SSU sequences. Tables listing the BLAST identity of each OTU and their distribution across libraries can be found in the supplemental material.
|
|
|---|
The 975 nonchimeric sequences comprised 454 unique OTUs, 301 of which were singletons occurring only once in the entire data set. The remaining 153 OTUs ranged in abundance from 2 to 51 sequences (0.2% and 5.2% of the data set, respectively). Since primer specificity affects the breadth of phylogenetic diversity recovered, only libraries generated with the universal primer pair ITS1 and ITS4 were considered when comparing diversity and community composition among soil horizons and between plots.
One SSU library was generated for each plot, and 100 and 92 clones were sequenced for the P and MH plots, respectively. Eleven of these sequences were determined to be chimeric and removed from subsequent analyses. A total of 153 unique sequence types were recovered, with each occurring between 1 and 12 times in the data set. The phylogenetic analysis of SSU sequences confirmed the results of BLAST searches as all Ascomycota and Basidiomycota sequences formed monophyletic groups (Fig. 1). Four sequences that matched members of the Chytridiomycota and Zygomycota fell outside their corresponding clusters, but all fungal sequences grouped together.
![]() View larger version (25K): [in a new window] |
FIG. 1. Proportions of SSU sequences belonging to different fungal phyla and other eukaryotic kingdoms. The width of triangles corresponds to the number of sequences in each, and the height corresponds to the maximum amount of sequence divergence within clades (see scale bars).
|
![]() View larger version (23K): [in a new window] |
FIG. 2. Proportional distribution of different taxonomic groups in ITS clone libraries. The x axis indicates the proportion of sequences assigned to each taxonomic group, and bar width corresponds to the proportion of all sequences belonging to each library. A. Proportion of all ITS sequences belonging to each fungal kingdom. B. Proportion of Ascomycota ITS sequences belonging to each subclass. C. Proportion of Basidiomycota ITS sequences belonging to each order.
|
Composition of Ascomycota sequences.
Sixty-six percent of all Ascomycota ITS sequences recovered in this study could be identified to subclass or lower. The most abundant subclasses were the Leotiomycetidae and Dothidomycetidae. Members of the Sordariomycetidae, Chaetothyriomycetidae, Pezizomycetidae, and Eurotiomycetidae also were found in many samples. There were no clear compositional differences with depth at this taxonomic scale, but Dothidiomycetidae sequences were most abundant in the mixed hardwood plot, while Leotiomycetidae sequences dominated in the pine plot (Fig. 2B). Twenty-five percent of the Ascomycota ITS sequences could be identified to genus, with the most abundant genera including Phialophora (Leotiomycetidae), Hormonema (Dothidiomycetidae), and Alternaria (Dothidiomycetidae) (Table 2).
|
View this table: [in a new window] |
TABLE 2. Number of sequences recovered for the most common genera encountereda
|
Richness and diversity estimates.
Shannon-Weiner diversity values for clone libraries ranged from 3.58 for MH-O-6 to 5.61 for P-A (Table 1). Overall, diversity was positively correlated with sample size, except for library MH-A-1 (101 sequences, H = 3.62), for which 49% of the sequences belong to a single OTU. Species-effort curves for all samples have a positive slope throughout and show no evidence of approaching saturation, indicating that true richness is much higher than we have captured in our samples (19) (Fig. 3A and B). Similar results were obtained when we examined the data pooled by horizon (Fig. 3C) or plot (Fig. 4).
![]() View larger version (9K): [in a new window] |
FIG. 3. Species-effort curves of richness for fungal OTUs. A. Mixed hardwood plot. B. Pine plot. C. Pooled data from both plots.
|
![]() View larger version (16K): [in a new window] |
FIG. 4. Species-effort curves for observed and estimated fungal OTU richness in each plot.
|
|
|
|---|
The large range of intraspecific ITS variation reported in the literature complicates the determination of an appropriate sequence similarity cutoff. In some cases, different species have been found to have ITS similarity greater than 99% (12, 20) while intraspecific ITS similarity of 90% or lower has been reported for others (22, 27). The degree of ITS sequence variation within and between fungal species is an issue that deserves greater attention if we are to continue to rely on this gene region to delimit species, but clustering sequences with 97% or greater sequence similarity is likely a conservative measure of species richness, especially when sampling from a single locality.
Comparisons of ITS sequencing with traditional fungal survey methods.
Despite methodological limitations, this study demonstrates the potential of large-scale sequencing projects targeting the ITS rRNA for quantifying and characterizing soil fungal diversity. We recovered 412 unique OTUs from a total of 863 fungal sequences. This level of richness is comparable to that found in a 1,500-m2 forest plot in Switzerland, where 408 species were recovered from fruiting body surveys (33), and to richness levels found for endophytic fungi isolated from leaves of two tropical trees, where 418 morphospecies corresponding to an estimated 347 genetic species were recovered (3). However, the study in Switzerland required 21 years of sampling and identification of 71,222 fruiting bodies (33); likewise, the endophyte study involved examining 1,472 cultures that were isolated from 1,992 leaf segments (3).
We captured this level of richness from 863 fungal sequences isolated from only two 2,000-m2 forest plots in a single locale at a single point in time. Given the lack of saturation in our species-effort curves, we would have vastly exceeded their richness estimates if our sampling effort had been comparable to that of these other studies. Using DNA sequences as the primary sampling unit also requires much less processing of each sample than more labor-intensive methods such as fruiting body collection and culture-based census taking and is therefore amenable to automation of data collection and analysis, making large-scale biotic surveys conducted over short time periods feasible.
Comparisons of ITS sequencing with other molecular survey methods.
While sequencing rRNA clones is standard practice in environmental bacteriology, most fungal studies using environmental sampling have employed either fingerprinting methods based on banding patterns obtained from restriction site polymorphisms (13) or denaturing gradient gel electrophoresis profiles (2, 37). These methods can be useful for detecting overall community shifts in response to environmental variables and for tracking the presence or absence of individual OTUs in different samples. However, they are limited in their ability to enumerate species richness in complex communities and do not provide any information about the taxonomic affiliation of the OTUs.
When sequencing efforts have been conducted, most studies have targeted either SSU or LSU rRNA. These regions are useful for phylogenetic characterization of novel lineages (30, 38), but lack sufficient resolution to discriminate among closely related species, resulting in underestimation of diversity. While most fungal ITS sequences were identifiable, 12% could not be assigned to a phylum. Many of these unidentifiable sequences may represent novel lineages with no representatives known from cultures or fruiting bodies, but almost all SSU sequences cluster within known fungal groups in phylogenetic analysis, suggesting that they may represent known fungi which are not yet represented in the ITS database. Since the ITS region allows fine-scale taxonomic resolution but lacks sufficient conservation for phylogenetic analyses of unknown sequences, we recommend that future studies coamplify the ITS with either the SSU or LSU rRNA region as a single fragment.
Taxonomic breadth of libraries and compositional differences with depth in soil profile.
Comparing the relative importance of major groups of fungi and other soil microbes has been challenging for soil ecologists. Because the primers used to generate clone libraries for this study have been shown to have broad specificity across the fungi, metazoa, and protozoa, we would expect the composition of our libraries to be roughly proportional to the number of genomes of each group present in our sample. This is confirmed by the overall consistency found between ITS and SSU clone libraries obtained using independent priming sites despite differences between these regions in selective constraints. We did find the Chytridiomycota and Zygomycota to be underrepresented in the ITS libraries (0% and 1.5%, respectively) relative to SSU libraries (4% each). While this may reflect differences in primer specificity, it should be noted that 7% of our fungal ITS sequences could not be identified to phylum and may belong to these groups which are underrepresented in public databases relative to other fungal phyla (40).
While sample diversity was high relative to our sampling effort, we were able to detect compositional differences at higher taxonomic scales. Many of the taxa found in the litter layer, such as Mycena species, are known decomposers, while mycorrhizal Basidiomycota sequences, including Russula species, were most common in the O, A, and B layers. The B horizon samples contained a higher proportion of unidentifiable sequences and were considerably less diverse than the other horizons, which may reflect a reduction in niche breadth with soil depth.
Implications for fungal diversity.
We found a remarkably large number of species from just a few grams of soil, but this is only a fraction of the estimated number of fungal species worldwide. Based on extrapolations from the ratio of fungi to vascular plants in the British Isles to the a worldwide vascular plant species richness estimate of 270,000 species, global fungal richness has been estimated to be 1.5 million species (17). Based on our ITS-based estimates of soil fungal richness (491 and 616 for P and MH, respectively) and the vascular plant richness of these plots (26 and 48, respectively; R. Peet, personal communication), this calculation yields global soil fungal species richness estimates ranging from 3.5 to 5.1 million. These values are almost certainly underestimates because the richness estimators continue to increase with increasing sampling and because of the conservative 97% ITS similarity clustering used to designate OTUs.
Given the results of the present study, along with data suggesting that endophytic (3) and insect-associated (41) fungi also represent vast amounts of formerly uncharted fungal diversity, the true magnitude of fungal diversity could be orders of magnitude higher than previously suggested.
This work was supported in part by National Science Foundation grant MCB-00-84207 to R.V., J.M., and J. E. Johnson.
Supplemental material for this article may be found at http://aem.asm.org. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»