Previous Article | Next Article ![]()
Applied and Environmental Microbiology, November 2006, p. 6902-6906, Vol. 72, No. 11
0099-2240/06/$08.00+0 doi:10.1128/AEM.00849-06
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Department of Biotechnology Engineering,1 National Institute for Biotechnology in the Negev, Ben-Gurion University of the Negev, P.O. Box 653, Be'er-Sheva 84105, Israel2
Received 11 April 2006/ Accepted 23 August 2006
|
|
|---|
|
|
|---|
The choice of primers to be used in studies to assess the diversity of prokaryotes is not trivial (for reviews, see references 1 and 8). Primer complementarity to a large fraction of the gene sequences in a database, such as the ribosomal database project (RDP) database (16), does not necessarily mean that the primer is optimal. No database today represents the estimated total number of at least 10 million bacterial species, with possible high sequence divergence (4). Additionally, sequences in the database may be incomplete or corrupt. It may therefore be prudent to requestion the universality of these so-called "universal" primers (1, 8).
To enhance the universality of primers for the amplification of related sequences of 16S rRNA genes from different microorganisms, degenerate primers may be designed to have a number of nucleotide options at several positions in the internal primer sequence. This will allow annealing to and amplification of a variety of related sequences. When fourfold degeneracy is required for a given location, the natural base inosine may be used. Inosine is biologically found in the 5' nucleotide of the tRNA anticodon, known as the Watson-Crick wobble (3). The primers containing inosine compensate for the high rate of degeneracy of the targeted codons and can substantially reduce overall primer degeneracy as well as false priming and nontarget gene amplification (11, 20). When designing a specific primer, inclusion of degenerate bases or inosine at the 3' end of the primer is usually considered undesirable, as annealing of the last three bases on the 3' end can be enough to initiate PCR at the wrong sites (30). On the other hand, mismatched nucleotides at positions at the 3' terminus and 1 bp before the 3' terminus have previously been shown to be detrimental to the amplification process (13, 23). This is primarily due to a need for a perfect 3' base pair to allow enzymatic synthesis rather than to any thermodynamic effect on duplex formation (2).
A single base mutation at the 3'-end position homologous to that of a universal primer in a given organism will therefore have a much greater effect than mutations in neighboring bases. This may result in misrepresentation of that genotype in a PCR-based DNA library. Replacing the 3'-terminal position of a universal primer with the base inosine, thus lending the "one-eyed king" (8) an extra I, may compensate for such mutations, revealing higher microbial diversity. Here we demonstrate the utility of such primers in effective eubacterial 16S rRNA gene amplification and the subsequent study and analysis of the microbial diversity of saline industrial wastewater, compared to regular primers.
|
|
|---|
Primer design and PCR amplification.
Total DNA was amplified by PCR with a Mastercycler gradient thermocycler (Eppendorf, Westbury, N.Y.), using specific 16S rRNA primers for bacteria, namely, forward primer 8F (5'-GGATCCAGACTTTGATYMTGGCTCAG), as described by Felske et al. (7) but modified by being shortened from the 5' end, and reverse primer 907R (5'-CCGTCAATTCMTTTGAGTTT), as described by Lane et al. (15). Another pair of primers, 8F-I and 907R-I, was obtained by replacement with inosine at the 3' termini. The universal primers were tested to eliminate dimer or nonspecific annealing by using the Amplify 1.0 computer software package (Bill Engels, University of Wisconsin). The Probe Match program available at the Ribosomal Database Project II (RDP II) website (http://rdp.cme.msu.edu/index.jsp) (16) was used to assess the specificity and universality of the primers.
The primers used in the PCR amplifications were obtained from Sigma-Genosys. The reaction mixtures included 12.5 µl ReddyMix (PCR master mix containing 1.5 mM MgCl2 and a 0.2 mM concentration of each deoxynucleoside triphosphate; ABgene, Surrey, United Kingdom), 1 pmol each of the forward and reverse primers, 1 to 2 µl of the sample preparation, and water to bring the total volume to 25 µl. An initial denaturation hot start of 4 min at 95°C was followed by 30 cycles of the following incubation pattern: 94°C for 30 s, 50 to 54°C for 40 s, and 72°C for 70 s. A final extension at 72°C for 20 min concluded the reaction.
When primers with inosine at the 3' termini were used, a significant improvement in yield was observed when the MgCl2 concentration was increased to 5.5 mM with a supplement of 0.1 µg/µl analytical-grade bovine serum albumin (data not shown).
Clone library construction and sequencing.
The PCR products were purified by electrophoresis on an 0.8% agarose gel (Sigma), stained with ethidium bromide, and visualized with a UV transilluminator. The approximately 0.9-kbp heterologous 16S rRNA DNA products were excised from the gel, and the DNA products were purified from the gel slice by using a Wizard PCR prep kit (Promega, Madison, Wis.). The gel-purified PCR products were cloned into the pCRII-TOPO-TA cloning vector as specified by Invitrogen (Carlsbad, CA) and transformed into calcium chloride-competent E. coli DH5
cells according to the manufacturer's instructions and standard techniques (22).
Plasmid DNA was isolated from individual clones by using a Wizard Plus SV minipreps DNA purification system (Promega, Madison, Wis.). Aliquots from a subset of the samples of purified plasmid DNA were digested with the restriction enzyme EcoRI (MBI Fermentas) for more than 4 h at 37°C, and the digested product was separated by electrophoresis on a 1% agarose gel (agarose low electroendosmosis; Hispanagar, Spain). After being stained with ethidium bromide, the bands were visualized using a UV transilluminator to select clones containing the appropriately sized insert.
The clones with the correct plasmid insert were then used for sequencing. Sequencing (with M13-F and M13-R primers annealed to the plasmid) was performed using an ABI PRISM dye terminator cycle sequencing ready reaction kit with AmpliTaq DNA polymerase FS and an ABI model 373A DNA sequencer (Perkin-Elmer).
Sequence analyses.
MEGA (Molecular Evolutionary Genetics Analysis, version 3.1) (12) was used to dereplicate the libraries of 16S rRNA gene sequences for subsequent analyses by comparing all the sequences in a data set to each other, grouping sequences with
97% and
90% identity together, and outputting a representative sequence from each group.
The all-rRNA gene sequences of each group were first compared with those in the GenBank database using the basic local alignment search tool BLAST (http://www.ncbi.nlm.nih.gov/BLAST/BLAST.cgi). Classifier version 1.0 (to assign 16S rRNA sequences to a taxonomical hierarchy) and Library Compare (to compare two sequence libraries using the RDP classifier), available at the Ribosomal Database Project II website (16), were used to find diversity in different ranks of related sequences. A 97 to 100% match of the unknown clone with the GenBank data set was considered an accurate identification to the species level, 93 to 96% similarity was accepted as genus-level identification, and an 86 to 92% match was considered an accurate identification of a related organism (25). The sequences from appropriate libraries were aligned using ClustalW (EMBL-EBI Center for Research and Services in Bioinformatics; http://www.ebi.ac.uk/clustalw/index.html), and positions not sequenced for all isolates or with alignment uncertainties were removed. Phylogenetic trees were constructed by the neighbor-joining method (21) with the MEGA package (12). Bootstrap resampling analysis (6) of 100 replicates was performed to estimate the confidence levels of the tree topologies.
Statistical analysis.
The statistical significance of the difference between the primers' abilities to enhance different phyla and subclasses was determined by Kruskal-Wallis nonparametric analysis of variance, followed by post hoc multiple comparisons. Analyses were performed using STATISTICA (a data analysis software system) version 7.0 (StatSoft, Inc.). Calculation of the regression lines was performed using SigmaPlot 2000 V.6 (SPSS Science, Chicago, IL).
Nucleotide sequence accession numbers.
The sequences from this study have been deposited in the NCBI GenBank database under accession numbers DQ458316 to DQ458468.
|
|
|---|
The 8F and 907R primers were designed from sequences of culturable bacteria, mostly proteobacteria (15, 29). Using these primers and similar ones is likely to produce a highly biased database. The 8F primer was assessed by using Probe Match at the RDP II website (16) and was found to complement relatively few sequences, half of them belonging to the phylum Proteobacteria; this is disconcerting as this primer is commonly used. This may be attributed to the fact that sequences of the region complementary to 8F are often unknown or ambiguous (1, 8). The 907R primer was complementary to 97,600 of 197,833 bacterial sequences existing in the RDP II database at the time the test was performed, with 38,625 being complementary to sequences belonging to the phylum Proteobacteria. Furthermore, in a comparison with 500 bacterial sequences, the base homologous to the guanine at the 3' terminus of 8F was found to be variable, while the base homologous to the thymine at the 3' terminus of 907R is considered conserved (but not totally conserved) (1). Therefore, with the number of known sequences continuously increasing, the specificity and utility of primers that were previously developed using a much smaller data set should frequently be reassessed.
In the current study, the microbial diversity of a high-salinity industrial wastewater evaporation pond was chosen to assess the efficacy of replacing nucleotides with inosine at the 3' termini of universal primers 8F and 907R. The pond (pond 204) is part of an array of evaporation ponds located in the Ramat Hovav industrial area in the Negev Desert, Israel. The salt concentration in the pond at the time of sampling was around 12%. The ponds receive a mixture of high-strength industrial wastewaters from various industries in the area, making a unique habitat for various microorganisms (14). The 8F/907R universal primer pair was compared with the 8F-I/907R-I primer pair, where nucleotides at the 3' termini were replaced with inosine. The annealing temperatures of the two primer pairs were tested using genomic DNA from E. coli with a gradient PCR technique. The conventional universal primers 8F/907R displayed a maximum yield at about 49°C, with a maximal functional temperature of 58°C. The 8F-I/907R-I primers, with inosine at the 3' termini, displayed a maximum yield at about 47°C, with a maximal functional temperature of 56°C.
The total genomic DNA from the wastewater was amplified using the 8F/907R primers at a 54°C annealing temperature and the 8F-I/907R-I primers at 54°C and 50°C annealing temperatures to construct three libraries of 16S rRNA genes, with 48, 54, and 51 clones sequenced, respectively. From a plot of the cumulative number of different sequences against the number of clones, we could estimate the differences in microbial diversity yielded by the two primer sets at various annealing temperatures. We depicted approximations of the cumulative number of sequences versus the number of clones and estimated the total possible number of different sequences based on an infinite number of clones being obtained (Fig. 1), as proposed by Sekiguchi et al. (24). When the first 48 sequences of each library were grouped on the basis of
97% similarity, use of the 8F-I/907R-I primers at annealing temperatures of 54°C and 50°C and the 8F/907R primers at 54°C yielded 28, 29, and 23 separate 16S rRNA gene groups, respectively (Fig. 1A). Grouping on the basis of
90% identity returned a somewhat different image, with 20, 22, and 11 groups, respectively (Fig. 1B). From this estimation, it is suggested that the microbial diversity depicted by the use of 8F-I/907R-I with inosine at the 3' termini may be almost twice as high as that depicted by the primers without inosine at that position. The comparatively low number of sequence groups obtained by the 8F/907R primers implies a greater universality of the 8F-I/907R-I primers. When all 153 sequences (from the three clone libraries) were similarly reviewed on the basis of 97 and 90% identities, 56 and 27 different sequence groups, respectively, were obtained. These values are lower than the theoretical number of groups for 153 clones according to the regression line fitted to the cumulative number of sequences from primers with inosine at the 3' termini at either temperature (data not shown).
![]() View larger version (15K): [in a new window] |
FIG. 1. Estimation of microbial diversity in a wastewater environment as determined by using the 8F/907R primer set at an annealing temperature of 54°C ( , dotted line) and the 8F-I/907R-I primer set at 54°C ( , dashed line) and 50°C (, solid line).The cumulative number of identical 16S rRNA gene sequences is plotted against the total number of clones with either 97% (A) or 90% (B) identity. The regression lines were calculated using the modified hyperbolic equation y = x/(ax b), where y is the cumulative number of different sequences, x is the total number of clones, and a and b are the coefficients proposed by Sekiguchi et al. (24). R2 values are higher than 0.967 for all regression lines.
|
-Proteobacteria genus Desulfovibrio (204i-11, 204i-40, 204i-51, 204i-50-11, and 204i-50-64) (Fig. 2), a subclass completely undetected by the 8F/907R primer pair in the course of this study. However, the 8F-I/907R-I primers appeared to be biased against
-Proteobacteria at a 54°C annealing temperature, with only one sequence (204i-42) (Fig. 2) obtained representing that particular subclass. The results of the Kruskal-Wallis nonparametric analysis of variance test showed significant differences between the microbial community structures on the phylum level (P < 0.001). Multiple post hoc comparisons of the mean ranks for all groups showed significant differences (P < 0.02) between all the primer pairs. This excluded the possibility that these differences are due to an insufficient number of sequences obtained for each primer pair. The increased diversity obtained with primers 8F-I/907R-I indeed confirms the usefulness of the primers with inosine substituted for the nucleotides at the 3' termini for enhancing our knowledge regarding microbial diversity in environmental samples. The value of inosine at the 3' positions of primers has been previously demonstrated by Batzer et al. (2) for the detection of evolutionary mutations in primate species that could not be amplified using regular primers. |
View this table: [in a new window] |
TABLE 1. Diversity at the phylum level of three libraries of 16S rRNA genes obtained by use of universal primer sets with and without inosine at the 3' termini
|
![]() View larger version (18K): [in a new window] |
FIG. 2. Phylogenetic tree based on 16S rRNA gene sequences that were retrieved from industrial wastewater by use of primer set 8F/907R at an annealing temperature of 54°C (blue circles) and primer set 8F-I/907R-I at 54°C (red triangles) and at 50°C (green triangles). The tree was constructed by the neighbor-joining method (21) with the MEGA package (12), using partial sequences of 16S rRNA genes. The bar represents two substitutions per 100 nucleotide positions. Bootstrap probabilities (6) are indicated at branch nodes.
|
I · G > I · I (17). Watkins and SantaLucia (28) demonstrated the importance of the inosine nearest-neighbor parameters which have a large influence on the hybridization stability of I · X pairs. The stability trend for the base pair in the positions 5' and 3' of the I · X pair is G · C > C · G > A · T > T · A. This information may be useful in designing a probe/primer of optimal stability (17, 28). The problems associated with universal primer design have been discussed previously (1, 8). So-called "conserved regions" in the 16S rRNA genes used for the bacterial universal primers were aligned by Watanabe et al. (27) and found to include many mismatches in the cores and in the 3' ends of the primers. The introduction of inosine residues into the cores of the bacterial universal primers homologous to these regions enabled the amplification and detection by PCR/denaturing gradient gel electrophoresis of phylotypes that were not detected using the original primers with the same groundwater samples (27). Inosine-containing primers are therefore considered useful in detecting more-diverse populations in the environment.
The advantage of inosine at the 3' termini of the primers (8F-I/907R-I) is in its ability to match all four nucleotides; instead of 16 possible variations at the 3' terminus, one of a pair of degenerate primers is used. However, the limitation of inosine is in its different thermodynamic stabilities in relation to each of the four nucleotides. This can be partially overcome by using low annealing temperatures (as shown by the results at the 54°C and 50°C annealing temperatures [Table 1 and Fig. 2] and possibly also by altering the magnesium and primer concentrations. The capabilities of different DNA polymerases used in the PCR to synthesize DNA with an inosine-containing template should also be taken into consideration (9). Another disadvantage of inosine is its recognition by Taq polymerase as guanine, which prevents the study of nucleotide diversity by subsequent sequencing analysis.
New primers that are both universal and specific are thus required. Ideally, they must be specific to the domain in question while being complementary to sequences in all taxa within that domain. Detection of a base analog which is more discriminatory against mismatches than the normal bases (the ideal base analog would pair with the same stability to all bases) could be used to increase the selectivity of the probe/primer at positions of unambiguous sequence (17).
In terms of microbial community structure, the difference between the results obtained with the two primer pairs (8F-I/907R-I and 8F/907R) with and without the substitution of inosine for the nucleotides at the 3' termini was clear (Table 1 and Fig. 1 and 2). Using a pair of universal primers with inosine at the 3' termini can expand the observed diversity of a microbial community under study but is not guaranteed to amplify all species existing in the environment. When approaching a given environmental sample, it may be prudent to use primers with and without inosine at the 3' termini, thus increasing the richness of 16S rRNA gene PCR amplicons, to better reflect the true diversity of the microbial community.
We thank Esti Kramarsky-Winter for useful comments on the manuscript.
Published ahead of print on 1 September 2006. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»