Previous Article | Next Article ![]()
Applied and Environmental Microbiology, May 2003, p. 2555-2562, Vol. 69, No. 5
0099-2240/03/$08.00+0 DOI: 10.1128/AEM.69.5.2555-2562.2003
Copyright © 2003, American Society for Microbiology. All Rights Reserved.
Max Planck Institute for Terrestrial Microbiology, D-35043 Marburg/Lahn, Germany
Received 8 November 2002/ Accepted 5 February 2003
|
|
|---|
|
|
|---|
The caveats of the cloning approach (30), namely, the lack of analysis of a statistically significant number of clones required for complex communities, has encouraged the use of molecular techniques, which map the diversity of the community structure by PCR-based fingerprinting. In contrast to cloning analysis, fingerprinting techniques such as denaturing/thermal gradient gel electrophoresis (DGGE/TGGE) (for a review, see reference 19), single-stranded site conformational polymorphism (SSCP) (12, 25), and terminal restriction fragment length polymorphism (T-RFLP) (4, 13) (for reviews, see references 11 and 17) analyses allow the physical separation of the total pool of amplified community gene products.
Typically, T-RFLP analysis involves amplification of target genes from whole-community DNA extracts by using specific primer pairs, one of which is fluorescently labeled. Subsequently, amplicons are digested with restriction enzymes (usually tetranucleotide recognizing) and fragments are size separated via gel electrophoresis on automated sequencers, whereby only the labeled terminal fragments (T-RFs) are detected and quantified. Individual T-RFs can be assigned presumptively to operational taxonomic units, which ideally correspond to phylogenetically related microorganisms, based on in silico search for matching restriction sites in sequences from clone libraries established in parallel from the same sample. The 16S rRNA gene has been used extensively as marker gene for T-RFLP analysis (for reviews, see references 11 and 17).
In general, the T-RFLP technique has proven to be a reproducible and accurate tool for community fingerprinting (4, 13, 18, 22). Since T-RFLP analysis is based on PCR amplification, all biases related to this technique apply (30) and a number of important parameters related to PCR have been identified; it has been found that initial DNA template concentration, number of PCR cycles, annealing temperature, and the choice of Taq DNA polymerase from different manufacturers may affect the composition of T-RFLP profiles (4, 22).
T-RFLP-based gene ratios were found to be influenced by preferential gene amplification of specific templates (PCR drift [23, 28, 29]) when degenerated primers for the amplification of the mcrA (methyl coenzyme M reductase) gene were used (14, 16). On the other hand, Lueders and Friedrich (16) demonstrated that PCR-T-RFLP analysis can accurately reflect template ratios of archaeal 16S rRNA genes in a model community with defined amounts of 16S rRNA gene copies from five different methanogens.
In addition to PCR factors, the composition of T-RFLP profiles can be influenced by factors related to the restriction digestion, such as partially digested PCR products observed in T-RFLP profiles of pure cultures (3, 4) or environmental samples (22, 27). Additional restriction fragments (RFs) in T-RFLP profiles of pure cultures were attributed to either incomplete digestion of the amplicons or sequence heterogeneity of the template, i.e., multiple copies of 16S rRNA genes in single species with different terminal restriction sites (4). If the occurrence of additional peaks originates from incomplete digestion, this may be revealed under limiting restriction enzyme concentration (22). At any rate, incompletely digested PCR products from a complex microbial community may result in additional T-RFs and, consequently, an overestimation of diversity (22).
The present study was initiated to systematically examine the frequent occurrence of unexpected RFs in addition to the expected T-RFs after in vitro digestion of individual environmental 16S rRNA gene clones. These additional, nonterminal RFs in T-RFLP profiles were designated pseudo-T-RFs. Our results indicate that partially single-stranded amplicons are involved in the formation of pseudo-T-RFs.
|
|
|---|
T-RFLP analysis.
16S rRNA genes were specifically amplified using the primer combination of 6-carboxyfluorescein (FAM)-labeled primers 27f (5'-AGA-GTT-TGA-TCC-TGG-CTC-AG-3') (5) and 907r (5'-CCG-TCA-ATT-CCT-TTR-AGT-TT-3') (20) for Bacteria and Ar109f (5'-ACK-GCT-CAG-TAA-CAC-GT-3') (7) and FAM-labeled Ar912r (5'-CTC-CCC-CGC-CAA-TTC-CTT-TA-3') (15) for Archaea. The standard reaction mixture contained, in a total volume of 50 µl, 1x PCR buffer II (Applied Biosystems, Weiterstadt, Germany), 1.5 mM MgCl2, a 50 µM concentration of each of the four deoxynucleoside triphosphates (Amersham Pharmacia Biotech, Freiburg, Germany), a 0.5 µM concentration of each primer (MWG Biotech, Ebersberg, Germany), and 1.25 U of Ampli Taq DNA polymerase (Applied Biosystems). In addition, 1 µl of a 1:30 dilution of Pachnoda gut DNA extract, 0.5 µl of a 1:10 dilution of clonal M13 product (including 16S rRNA gene sequence inserts), or 1 µl of pure-culture DNA extract was added as the template. All reaction mixtures were prepared at 4°C in 0.2-ml reaction tubes to avoid nonspecific priming. Amplification was started by placing the reaction tubes immediately into the preheated (94°C) block of a GeneAmp 9700 thermocycler (Applied Biosystems). The standard thermal profiles for the amplification of bacterial 16S rRNA genes were as follows: initial denaturation (94°C for 3 min) followed by 16 (clonal DNA templates) or 32 (environmental DNA templates) cycles of denaturation (94°C for 30 s), annealing (52°C for 30 s), and extension (72°C for 60 s). Thermal profiles for the amplification of archaeal 16S rRNA genes started with an initial denaturation (94°C for 3 min) followed by 16 (clonal and pure-culture DNA templates) or 35 to 38 (environmental templates) cycles of denaturation (94°C for 45 s), annealing (52°C for 45 s), and extension (72°C for 90 s). After terminal extension (72°C for 5 to 7 min), samples were stored at 4°C until further analysis. Aliquots (5 µl) of 16S rRNA amplicons were analyzed by gel electrophoresis on 1% agarose gels and visualized after being stained with ethidium bromide. PCR products were purified with the MinElute PCR purification kit (Qiagen, Hilden, Germany).
Prior to digestion, amplicon concentrations were determined photometrically. DNA (75 ng for amplicons from the gut DNA extract, 50 ng for clonal amplicons), 2.5 U of restriction enzymes (MspI, TaqI, and AluI [Promega, Mannheim, Germany]; MspI, HpaII, HhaI, HaeIII, and BstUI [New England Biolabs, Frankfurt am Main, Germany]; and BsiSI [Minotech Biotechnology, Heraklion, Crete, Greece]), 1 µl of 10x incubation buffer, and 1 µg of bovine serum albumin (if recommended) were combined in a total volume of 10 µl and digested for 3 h at 37°C (MspI, AluI, HpaII, HhaI, and HaeIII), 55°C (BsiSI), 60°C (BstUI), 65°C (TaqI), or 70°C (BsiSI). Fluorescently labeled T-RFs were size separated on an ABI 373A automated sequencer (Applied Biosystems) using an internal size standard (GeneScan-1000 ROX; Applied Biosystems). T-RFLP electropherograms were analyzed with GeneScan 2.1 software (Applied Biosystems) (15).
Mung bean nuclease digest.
The single-stranded DNA parts of 16S rRNA gene amplicons were digested using mung bean nuclease. Approximately 1,000 ng of PCR product was incubated for 1 h at 30°C with 5 U of mung bean nuclease (New England Biolabs) and 10 µl of 10x reaction buffer in a total volume of 100 µl. The digestion was stopped by phenol-chloroform-isoamyl alcohol (25:24:1) extraction, and DNA was recovered by ethanol precipitation. Digested amplicons were purified using the MinElute PCR purification kit.
|
|
|---|
![]() View larger version (24K): [in a new window] |
FIG. 1. Occurrence of pseudo-T-RFs in T-RFLP profiles of a single clone depending on the restriction enzyme used. 16S rRNA gene T-RFLP electropherograms were derived from clone PeM75 (affiliated with Lactobacillales). Numbers indicate restriction sites (RS) for the respective enzyme detected in the clonal sequence between bases 1 and 900 (length of the PCR product), counted from the labeled 5' end. Bold numbers indicate restriction sites with corresponding T-RFs in the electropherogram. RFU, relative fluorescence units.
|
![]() View larger version (30K): [in a new window] |
FIG. 2. Effect of mung bean nuclease digestion on the occurrence of pseudo-T-RFs in T-RFLP profiles (AluI digests) of environmental, clonal, and pure-culture samples. Insets show the T-RFLP profile after mung bean nuclease digestion. The number of PCR cycles used to produce the amplicons is indicated. Fragment lengths of pseudo-T-RFs are shown in bold. Clone PeMAr04 is affiliated with the kingdom Crenarchaeota. MB, Methanobactericeae; CR, Crenarchaeota; RFU, relative fluorescence units.
|
![]() View larger version (22K): [in a new window] |
FIG. 3. (A to C) T-RFLP analysis of clone PeH59 (affiliated with the CFB phylum) amplicons after restriction digestion with different enzymes, resulting in the expected T-RFs only (MspI [A]) or in the formation of pseudo-T-RFs (AluI [B] and HhaI [C]). (D) 16S rRNA gene secondary structure of clone PeH59 as predicted by the mfold software including the sequence stretches around detected pseudo-T-RFs. RS, restriction sites. Bold numbers indicate restriction sites with corresponding T-RFs in the electropherogram. RFU, relative fluorescence units.
|
The frequent occurrence of pseudo-T-RFs in T-RFLP profiles of clones suggested their likely occurrence also in T-RFLP profiles of complex microbial communities. In fact, potential pseudo-T-RFs were identified in T-RFLP profiles of environmental samples, i.e., gut DNA extracts of P. ephippiata larvae and soil which was used for feeding the larvae (Egert et al., unpublished), by comparing predicted T-RFs of clones to those present in the mixed-community T-RFLP profile.
For example, the T-RFLP profile of archaeon-specific 16S rRNA gene amplicons from midgut DNA extracts with AluI digestions was characterized by three T-RFs, two of which could be presumptively assigned to clonal sequences affiliated with the Methanobacteriaceae (T-RF of 64 bp; 6 clones) and Crenarchaeota (125 bp; 12 clones) (Fig. 2A). However, the prominent peak at 165 bp in the electropherogram was not reflected by any clone sequence; i.e., no clone sequence showed a primary, real terminal AluI restriction site of 165 bp. In vitro digestion of clonal PCR amplicons revealed that all clones related to Methanobacteriaceae and Crenarchaeota displayed an additional 165-bp RF (shown in Fig. 2B for the crenarchaeotal clone PeMAr04). Therefore, it was assumed that the 165-bp T-RF in the midgut T-RFLP profile was a pseudo-T-RF.
Involvement of partly single-stranded amplicons in pseudo-T-RF formation.
The occurrence of multiple RFs in T-RFLP profiles from single species has been reported for pure cultures (4, 6) and clonal PCR amplicons (6, 27), which were explained by 16S rRNA gene sequence heterogeneity, e.g., multiple rRNA operons in a single species (4), or partial digestion of the PCR products (4, 22, 27). Sequence heterogeneity of 16S rRNA genes can be excluded as a reason for the formation of pseudo-T-RFs, because we used amplicons of clonal origin. Nevertheless, a characteristic of all clones with pseudo-T-RFs was that the primary terminal restriction site was cleaved by the restriction enzyme for only a fraction of the amplicon pool; i.e., they were only partially digested (Fig. 1, 2B, and 3).
All efforts to overcome a bias related to partial digestion of amplicons were not successful. Use of twice as much enzyme (5 U) as in a typical digest (22) and extension of the digestion time (6 and 24 h) did not relieve the occurrence or the intensity of pseudo-T-RF peaks. It is noteworthy, though, that peaks with a size corresponding to full-length amplicons (
900 bp) were not present in T-RFLP profiles of clones with and without pseudo-T-RFs (e.g., Fig. 1, 3, and 4), which would have been indicative of incomplete digestion because of limiting enzyme concentration or suboptimal reaction conditions (22).
![]() View larger version (28K): [in a new window] |
FIG. 4. Effect of restriction digest temperature on the formation of pseudo-T-RFs of clone PeH59. Restriction digests were performed using BsiSI at 55°C (A) and 70°C (B) and by using a 3-min denaturation of the PCR amplicon prior to the addition of enzyme and incubation at 70°C (C). Bold numbers indicate restriction sites with corresponding T-RFs in the electropherogram. RFU, relative fluorescence units.
|
After mung bean nuclease digestion, pseudo-T-RFs were not detectable in environmental (Pachnoda gut), clonal, and pure-culture-derived T-RFLP profiles (Fig. 2). These data indicate clearly that the formation of pseudo-T-RFs results from the presence of at least partly single-stranded DNA amplicons. Single-stranded DNA is not a substrate for type II restriction endonucleases (21), and so the presence of single-stranded 5'-DNA ends of part of the amplicon poolon otherwise double-stranded PCR productsprovides an explanation of why the terminal restriction site was not cut. Similarly, mung bean nuclease treatment was used to remove single-stranded DNA artifacts prior to SSCP (10) and DGGE analysis (26).
By comparing T-RFLP patterns of individual clones with different restriction endonucleases, it became evident that the amplicon pool contains PCR products which are single stranded to different degrees. For example, BstUI digestions of clone PeM75 amplicons yielded pseudo-T-RFs of 413 and 538 bp (Fig. 1D), which suggests that a small part of the amplicon pool is single stranded, at least up to the second BstUI restriction site of PeM75 at bp 413.
The secondary structure of 16S rRNA gene sequences influences restriction digests. Some clones displayed pseudo-T-RFs with one enzyme but not with the other when amplicons from the same PCR batch were analyzed by T-RFLP. For example, clone PeH59 had a primary MspI restriction site at 81 bp and eight subsequent restriction sites as revealed by sequence data analysis (Fig. 3A), but pseudo-T-RFs were not formed. Digests with AluI (Fig. 3B), HhaI (Fig. 3C), and BstUI (data not shown) revealed pseudo-T-RFs up to 638 bp (AluI [Fig. 3B]), which suggests that some amplicons were single stranded at least up to bp 241. According to a model which involves the formation of transiently formed secondary structures composed of recognition sequences with twofold rotational symmetry ("canonical structures"), many type II restriction endonucleases cleave single-stranded DNA (21). Inspection of possible secondary structures as calculated with the program mfold (24) (M. Zuker; http://www.bioinfo.rpi.edu/applications/mfold/old/dna/) showed that the primary MspI restriction site of clone PeH59 was able to form a canonical structure (i.e., a local secondary structure) by folding back with an upstream single-stranded sequence (Fig. 3D). Although the secondary structures did not form a perfect palindrome, it is likely that the primary restriction site of single-stranded amplicons was indeed cleaved by MspI, since pseudo-T-RFs downstream from the primary restriction site were not detected. In contrast, the primary AluI restriction site of clone PeH59 most probably did not form a sterically sufficient secondary structure from single-stranded DNA, and thus AluI did not cleave single-stranded amplicons at the primary recognition site, which corroborates the presence of a pseudo-T-RF at 638 bp. It should be noted, however, that the predicted secondary structures represent the most thermodynamically stable structures according to the underlying model (24) as implemented in mfold; thus, these structures may actually not exist in the reaction mixture of the T-RFLP digest. However, restriction digests conducted at different temperatures provide experimental evidence that canonical structures in single strands might be the reason why some clonal amplicons do not show pseudo-T-RF formation with certain nucleases. Clone PeH59 did not show pseudo-T-RFs when digested with MspI at 37°C (Fig. 3A) or BsiSI at 55°C (Fig. 4A); BsiSI is an isoschizomer of MspI which is not inactivated by heat. However, at 70°C (Fig. 4B), pseudo-T-RFs occurred when BsiSI was used and were even more pronounced when the amplicons were denatured for 3 min at 94°C prior to digestion (Fig. 4C). At increased digestion temperature, canonical structures in single-stranded amplicons are likely to become unstable, rendering the restriction sites inaccessible to the nuclease, which in turn leads to the formation of pseudo-T-RFs. Interestingly, pseudo-T-RFs were not detectable using TaqI as the restriction endonuclease at a digestion temperature of 65°C, even with clones that possessed multiple TaqI restriction sites and displayed pseudo-T-RFs with other nucleases (tested only for archaeal clones [data not shown]). Possibly, TaqI cleaves single-stranded amplicons not involved in canonical structures at a higher rate than the other restriction enzymes analyzed, making TaqI suitable as an endonuclease that avoids formation of pseudo-T-RFs in T-RFLP analysis.
In general, the extent of pseudo-T-RF formation (for all restriction endonucleases tested) decreased with increasing distance of the terminal restriction site from the 5' labeled end of the amplicon (Fig. 5), which supports the hypothesis that amplicons are partly single stranded.
![]() View larger version (20K): [in a new window] |
FIG. 5. Effect of the position of the terminal restriction site on the extent of pseudo-T-RF formation, based on in vitro T-RF formation of 56 bacterial clones with MspI as the restriction endonuclease. The peak area of the pseudo-T-RF is compared to the peak area of the primary T-RF and given as a percentage. Clones were obtained from a 16S rRNA gene clone library derived from the midgut of cetoniid beetle larvae (Egert et al., unpublished).
|
100 to 10-3 ng µl-1), or (vi) higher (i.e., more stringent) annealing temperatures (55, 57, 59, 61, or 64°C) to determine whether the formation of single-stranded DNA could result from incorrectly annealed primers. Unexpectedly, at annealing temperatures of 61 and 64°C, the number of pseudo-T-RFs even increased. Lower annealing temperatures (50, 48, and 46°C) did not affect the formation of pseudo-T-RFs.
![]() View larger version (16K): [in a new window] |
FIG. 6. Effect of PCR cycle number on the extent of pseudo-T-RF formation observed with amplicons of clone PeM75 after MspI digestion. The peak area of the pseudo-T-RF is compared to the peak area of the primary T-RF and given as a percentage. Error bars (which represent standard deviation) are based on three replicates.
|
The formation of partly single-stranded 16S rRNA gene amplicons during PCR may result from template secondary structures (10), which causes the polymerase to pause or fall off the template (23). However, use of the PCR enhancer betaine at various concentrations, which had been shown to be effective in improving the amplification yield and the specificity of templates with high G+C content or secondary structures (9), did not prevent the formation of pseudo-T-RFs.
Model for the formation of pseudo-T-RFs.
Based on the above results, we propose the following model for the formation of pseudo-T-RFs during T-RFLP analysis of 16S rRNA genes. During PCR of 16S rRNA genes from clonal, pure-culture, and environmental DNA extracts, some of the amplicons formed are at least partly single stranded (as proven by mung bean nuclease digests [Fig. 2]). Since single-stranded terminal restriction sites cannot be cleaved by restriction endonucleases, "pseudo"-terminal restriction sites downstream from the expected primary restriction site can be detected by T-RFLP analysis. The ability of the 16S RNA molecule to backfold with itself (8) may result in an incomplete synthesis of a fraction of 16S rRNA gene amplicons during PCR (Fig. 7A). The involvement of PCR in the generation of (partly) single-stranded amplicons is corroborated by the strong dependence on the number of PCR cycles (Fig. 6). Similarly, the number of PCR cycles has been implicated as a controlling factor in a kinetic model which describes the reannealing of single-stranded templates as a source of the PCR bias (28). Accordingly, the formation of single-stranded amplicons may be viewed as an extension of the original kinetic model of template reannealing: when the amplicon concentration increases at greater PCR cycle numbers, the rate of interaction between single-stranded template molecules increases, which may result not only in interstrand reannealing as described by Suzuki and Giovannoni (28) but also in intrastrand annealing, hence the formation of local secondary structures. In turn, these temporary secondary structures of template molecules may cause the DNA polymerase molecules to fall off with higher frequency (23), thereby leaving the template strands (partially) unamplified. Furthermore, we hypothesize that single-stranded 16S rRNA gene amplicons can form local palindromic secondary structures, which in turn allow restriction enzymes to cut "single-stranded" DNA (21). This hypothesis helps explain why T-RFLP analyses with certain enzymes yield pseudo-T-RFs whereas others from the same PCR amplification do not (Fig. 7B): a secondary restriction site will be detected in T-RFLP analysis only if the primary restriction is not part of a canonical structure. We could show that higher temperatures (70°C, [Fig. 4B and C]) during restriction digestion resulted in the formation of pseudo-T-RFs, most probably because local secondary structures were unstable under these conditions and consequently were no longer substrates for the restriction enzyme. Thus, the sequence context around the primary restriction site most probably will determine whether even a single-stranded amplicon can be digested at its primary, real terminal restriction site.
![]() View larger version (25K): [in a new window] |
FIG. 7. Schematic model of pseudo-T-RF formation. (A) PCR-related parameters influencing the formation of partly single-stranded amplicons. (B) Involvement of the secondary structure of partly single-stranded amplicons in the formation of pseudo-T-RFs. dsDNA, double-stranded DNA; ssDNA, single-stranded DNA; solid triangles, restriction site cut (MspI); open triangle, restriction site not cut.
|
|
|
|---|
Since the extent of pseudo-T-RF formation is likely to be dependent on the species (gene) composition of the system under investigation and the chosen restriction endonuclease(s), it is advisable to perform T-RFLP analysis and cloning in parallel. Although this results in increased effort, the T-RF patterns of clones should be determined by in vitro T-RFLP analysis under the applied PCR and T-RFLP conditions, in particular when T-RFs are supposed to be quantitatively assigned to species or phylogenetic groups. Assigning T-RFs solely on the basis of in silico or database search is insufficient because of the potential occurrence of pseudo-T-RFs in T- RFLP profiles. In agreement with several other studies (22, 23, 28, 29), the number of PCR cycles should be limited to a minimum because pseudo-T-RF formation increases linearly with the cycle number. Beyond T-RFLP fingerprinting, the formation of (partly) single-stranded 16S rRNA gene amplicons during PCR may also affect other core techniques in microbial ecology, e.g., 16S rRNA gene cloning. In 16S rRNA gene clone libraries, sequences with a strong tendency to produce single-stranded amplicons are likely to be underrepresented because the single-stranded fraction of the amplicons cannot be ligated into the cloning vector.
We thank Bianca Wagner for excellent technical assistance, and we thank Gesche Braker and Tillmann Lueders for data on the in vitro T-RF formation pattern of cloned functional genes.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»