Previous Article | Next Article ![]()
Applied and Environmental Microbiology, February 2008, p. 840-849, Vol. 74, No. 3
0099-2240/08/$08.00+0 doi:10.1128/AEM.01933-07
Copyright © 2008, American Society for Microbiology. All Rights Reserved.
,
Carlos Ríos-Velázquez,3
Jo Handelsman,1* and
Gary Ruvkun2
Departments of Bacteriology and Plant Pathology, University of Wisconsin—Madison, 1550 Linden Drive, Madison, Wisconsin 53706-1521,1 Department of Genetics, Harvard University, and Department of Molecular Biology, Massachusetts General Hospital, 185 Cambridge Street, Boston, Massachusetts 02114-2790,2 Department of Biology, University of Puerto Rico—Mayagüez, Mayagüez, Puerto Rico3
Received 22 August 2007/ Accepted 29 November 2007
|
|
|---|
|
|
|---|
20 to 30 nucleotides (nt), 16S rRNA gene primer design has been constrained to target conserved regions of those lengths. New thermostable polymerases have recently become available (40), opening the possibility of changes in primer design. Preliminary data suggested that some of these polymerases may be able to utilize primers shorter than the minimum length of
20 to 30 nt typically recognized by the standard enzyme used for PCR, Taq (Thermus aquaticus) DNA polymerase. We posited that decreasing the length of 16S rRNA gene PCR primers would broaden the scope of sequences detectable by PCR. In this study we employ a new polymerase, S-Tbr, and short primers to demonstrate the utility of a "miniprimer" PCR method, both computationally and with environmental samples.
|
|
|---|
Environmental sample preparation.
For PCR from soil, three
1-g samples were collected from the courtyard outside the Wellman building on the Massachusetts General Hospital campus (Boston, MA) and boiled in 1 ml sterile, nucleic acid-free water for 5 min. After centrifugation to pellet solid material, the supernatants were combined, diluted 1:10 in sterile nucleic acid-free water, and used for PCR. A boiling method was used to prepare DNA quickly to test miniprimers and was not intended to provide material for a comprehensive analysis of a soil sample. The microbial mat samples were collected from a mature mat community of the Candelaria lagoon, which is one of a series of evaporative salterns in Cabo Rojo, southwest Puerto Rico (17°56'N, 67°11'W), and immediately frozen for storage. The mat is a laminated structure of
2.0 to 2.5 cm in thickness and has distinctive green, pink, and black layers. For a detailed physical and geochemical description of similar mats from Cabo Rojo, see reference 11. To prepare microbial mat DNA, mat samples were thawed on ice and three
1-cm-wide vertical sections were cut so that all mat layers were present in the pieces removed. The sections were mixed to form a sample of
20 g and then mixed to homogeneity with 20 ml of sterile, nucleic acid-free water. After centrifugation to collect the solid material, the liquid was removed and three
1-g samples were removed from the top, middle, and bottom of the pellet. The nine samples were processed individually using a bead-beating method (MO BIO PowerSoil kit; MO BIO Laboratories, Carlsbad, CA); the protocol included the optional 70°C initial incubation in solution C1. Equal volumes of the nine DNA samples were combined, and this mixture was used as the template in the PCRs. For each PCR method tested, three separate PCRs were performed with the microbial mat template, and the products of the three PCRs were combined for cloning and subsequent sequencing. All sample handling was performed in a clean, sterile laminar flow hood.
Cloning and sequencing.
PCR products were extracted from
1.0% agarose gels and cloned using the TOPO cloning kit for sequencing (Invitrogen, Carlsbad, CA). Escherichia coli genomic DNA was prepared using the QIAamp DNA mini kit (Qiagen, Valencia, CA) or purchased from USB Corporation (Cleveland, OH); to prepare Halobacterium salinarum genomic DNA, H. salinarum strain MPK1 (24) was grown in culture as previously described (12), followed by genomic DNA preparation using the QIAamp DNA mini kit. Automated DNA sequencing of clones was performed by the Massachusetts General Hospital DNA Sequencing Core Facility (Cambridge, MA) or by the University of Wisconsin Biotechnology Center (Madison, WI) using sequencing primers located outside the bounds of the cloned 16S rRNA gene inserts. Sequence reads were assembled using Phred and Phrap as implemented in XplorSeq (D. Frank, unpublished data) or the Staden package, release 1.6.0. Soil study clones A01 and A04 were fully sequenced with two- to fivefold coverage by independent sequence reads and had at least threefold coverage of all highly conserved regions. Mallard was used as the primary tool to identify and remove anomalous or chimeric 16S rRNA gene sequences (7); for some ambiguous cases, Bellerophon (20), Pintail (6), BLAST, and manual checking were used to supplement results from Mallard analysis.
Informatics and computation.
Sequences were aligned using the NAST aligner (16) and imported into Arb version 05.10.26 (27), using the full greengenes database (see below) as a 16S rRNA reference database (15). The Arb database of Cabo Rojo and reference sequences is available at plantpath.wisc.edu/
isen/CR. Distance matrices calculated in Arb using the Olsen correction and Lane mask (25) were used to cluster sequences into operational taxonomic units (OTUs) by pairwise identity and a furthest-neighbor algorithm (33) and for calculating similarity indices to compare libraries (34). Microbial mat sequences were inserted into the Arb dendrogram using the parsimony tool and Lane mask. Primary taxonomic assignments were informed by the resulting Arb dendrogram and the Lawrence Berkeley National Laboratory (LBNL) greengenes classifier. Novel taxa identified with colloquial names and for which references are not otherwise indicated in the text include BD5-13 (23), BRC1 (13), FW129 (10), NB23 (28), OP11 (21), WS2, and WS6 (17); publications for AC1 and Eub6 were not identified by our search. Similarities to database records were calculated by a maximum likelihood method as implemented in PHYLIP, using F84 correction, a transition:transversion ratio of 2.0, empirically determined base frequencies, and the Lane mask (25). Sequences that were not placed into previously described taxa or that formed monophyletic groups of greater than 0.15 distance unit from the closest previously isolated sequence in the full LBNL greengenes database were subjected to additional characterization as follows. First, novel sequences were aligned with reference sequences representing major defined lines of bacterial descent (logical intersection of the "core" and "GOLD_stamp" sequence sets from greengenes.arb) and with their nearest-neighbor sequences. Then, the alignments were used to infer phylogenies by neighbor joining using evolutionary distances corrected by the Kimura two-parameter or maximum-likelihood correction using empirically determined gamma distribution and base frequencies (PAUP*4b10), maximum parsimony using heuristic search (PAUP*4b10), and maximum likelihood using the general time-reversible model (35). Bootstrap analysis was used to assess the robustness of inferred topologies. Finally, taxonomic identifiers CR1 to CR6 were assigned to monophyletic clusters that were reproducibly unaffiliated with previously described taxa in all topologies and which also contained at least three sequences greater than 1,000 nt from independent libraries.
Database searches.
The environmental sequence database "env_nt" was downloaded from NCBI on 9 May 2006 (1,047,779 sequences and 1,063,283,128 total nucleotides; the list of accession numbers is available on plantpath.wisc.edu/
isen/CR). The following 16S rRNA gene sequence databases were downloaded from LBNL (14): the nonredundant, nonchimeric ("revered") database of full-length rRNA gene sequences (11 November 2005 version) and the full 16S rRNA gene database (greengenes.arb; 27 April 2007 version). For searches of the environmental sequence database, putative PCR primer binding sites were identified with BLAST and putative amplicons were identified by Perl scripts (available at plantpath.wisc.edu/
isen/CR). For each primer search, pertinent BLAST parameters were as follows: no gaps, M = 1, N = –3, V = 1,000,000, and B = 1,000,000, and both the word size (W) and score threshold (S) were set to the length of the primer sequence used for the search. Putative 16S rRNA gene amplicons found in the environmental sequence database were confirmed by a BLAST score ratio (31) of greater than 0.80, computed for each sequence by dividing the BLAST score for its best match to the revered 16S rRNA gene database by the BLAST score of the sequence compared to itself; default BLAST parameters were used for these searches. All BLAST searches were implemented using WU-BLAST (blastn 2.0MP-WashU [10 May 2005]; W. Gish, personal communication).
Final note on primer sequences, nomenclature, and citation.
In reviewing the literature to prepare this work, we found many instances of ambiguity regarding 16S rRNA gene PCR primers. These ambiguities resulted primarily from using the same name (e.g., "27F") to refer to primers having different sequences. Because of this, we feel it is important to be cautious in reporting 16S rRNA gene primer sequences, especially as more variations of commonly used primers are developed and published. Though there may eventually be a need to agree on a more systematic method of naming primers, most ambiguities can easily be minimized in publications by reporting the sequences of primers instead of designating them by a common name and reference.
Nucleotide sequence accession numbers.
These sequence data have been submitted to GenBank under accession numbers EU245047 to EU246330.
|
|
|---|
49°C (Fig. 1C). To distinguish this new PCR method from traditional Taq PCR, we refer to the shortened primers as "miniprimers" and to PCR using them as "miniprimer PCR". |
View this table: [in a new window] |
TABLE 1. Common variants of 16S rRNA primersa
|
![]() View larger version (29K): [in a new window] |
FIG. 1. Establishing conditions for miniprimer PCR. (A) Minimum primer length. Beginning with a 20-nt primer, a series of shortened primers having a common 5' end was designed. These primers were used in combination with a 19-nt reverse primer (B) and a 10-nt primer (C) to demonstrate that 10-nt primers can be used for PCR. (B) Amplification using the forward primer series. When electrophoresed on an agarose gel, PCR with S-Tbr produces detectable amplification using primers as short as 10 nt (F10/R19). (C) Miniprimer PCR. Agarose gel electrophoresis of PCR products with S-Tbr and a range of annealing temperatures demonstrates robust amplification for a pair of 10-nt primers (F10/R10). In panels B and C, agarose gels are stained with ethidium bromide and arrows mark the expected size of PCR products.
|
10-nt regions within several long 16S rRNA gene primers. We based our designs on variants most often reported in the literature (Table 1). The 788F-10 and 797R-10 primers (Fig. 2A) were designed to target the 10-nt region found to be the longest string of totally conserved bases in an alignment of 500 bacterial 16S rRNA gene sequences (8). By use of several combinations of miniprimers, PCRs were performed to amplify 16S rRNA genes from Escherichia coli or Halobacterium salinarum genomic DNA (from the domains Bacteria and Archaea, respectively). First, we selected miniprimer pairs that produced amplicons of the expected sizes with a minimum of spurious products. Then, amplicons from several PCRs were sequenced to verify that they were regions of 16S rRNA genes. All sequences from amplicons of the expected sizes were from the correct regions of the 16S rRNA gene targeted by each particular miniprimer pair. This analysis yielded several pairs of miniprimers that performed well for the amplification of bacterial and archaeal 16S rRNA genes (Fig. 2); in particular, 27F-10, 524F-10, and 2F-10 paired with 1505R-10 have reproducibly performed well in our analyses.
![]() View larger version (22K): [in a new window] |
FIG. 2. Miniprimers designed and tested. (A) Sequences of miniprimers tested. (B) Performance of miniprimer pairs. Forward and reverse miniprimers were used to PCR amplify 16S rRNA genes from bacterial (Escherichia coli [Ec]) or archaeal (Halobacterium salinarum[Hs]) genomic DNA. PCR was determined successful if a band of the correct size was detected with minimum background. Success or failure with each template is indicated with + or –, respectively, in the appropriate column; a mark of +/– indicates that the product was in low abundance or contained a moderate amount of background but may be clonable. In several cases, amplicons were sequenced to confirm them to be the expected 16S rRNA sequences; confirmation is indicated by a + in the column labeled "16S." nd, not determined.
|
1,000 nt, lowering the probability of identifying long amplicons. To compare the abundances of putative amplicons defined by pairs of long primers and miniprimers, matched primer pairs were used to target the same regions of the 16S rRNA gene (Fig. 3). Miniprimers identified 1,648 putative amplicons and the standard primers identified 448; surprisingly, of the 1,648 miniprimer sequences, 971 were not identified by any long-primer pair tested. By BLAST comparison against a database of 30,312 nonredundant, nonchimeric bacterial and archaeal 16S rRNA gene sequences of at least 1,350 nt, 1,068 miniprimer and 301 long-primer sequences were verified to be regions of 16S rRNA genes (Fig. 3). Importantly, each miniprimer pair identified 2- to 10-fold more putative 16S rRNA gene amplicons (Fig. 3). This analysis predicted that miniprimer pairs can amplify more 16S rRNA gene sequences in environmental samples than can standard long primers.
![]() View larger version (32K): [in a new window] |
FIG. 3. Comparison of putative miniprimer and long-primer amplicons in the GenBank environmental sequence database. Pairs of long primers and miniprimers were used to identify putative 16S rRNA gene amplicons in the GenBank environmental sequence database. Horizontal lines separate aligned, matched pairs of long primers (gray shading) and miniprimers (no shading) that target the same 16S rRNA gene region. Sequences delimited by forward and reverse primer sequences separated by an appropriate distance were identified. The numbers of total sequences found (Sequences), unique sequences present within the total (Unique), and 16S rRNA gene sequences present within the unique sequences (16S) are indicated in the appropriate columns. 16S rRNA gene sequences were identified by a BLAST score ratio of >0.80 (see Materials and Methods). Degenerate positions are indicated using R for A or G, M for A or C, and W for A or T; na, not applicable.
|
Evaluation of miniprimer PCR with a soil sample.
To determine whether the computational analysis reflected experimental results, miniprimer PCR was used in a study to amplify 16S rRNA genes from soil by use of the primer pairs 27F-10/1505R-10, 12F-10/1505R-10, and 524F-10/1505R-10 (Table 1 and Fig. 2). In this study, we did not attempt a comprehensive analysis of a soil sample but only an assessment of the miniprimers and whether they could detect sequences with mismatches to the standard primers. After PCR and cloning, 32 clones from each primer pair were partially sequenced from one end to examine the primer binding regions. Products from the 27F-10/1505R-10 PCR were chosen for further study because 27F-10 comprises the first 10 nt from the 5' half of the 27F suite of primers (Table 1) and thus enabled the identification of amplicons whose sequences had mismatches to the 3' half of 27F (see S2 in the supplemental material). In addition, the 27F-10/1505R-10 miniprimer pair amplifies nearly the entire 16S rRNA gene, and it performed well in pilot experiments. Very-high-quality sequences were obtained for 10 of the 32 27F-10/1505R-10 sequences; of these 10 end sequences, 5 began with the 27F-10 sequence and 5 began with the 1505R-10 sequence. Two of the sequences anchored by the miniprimer 27F-10, A01 and A04, contained mismatches to the 27F-P binding sequence and also to the sequence of 27F-HT, a degenerate primer designed to be more general (Table 1; also see S2 in the supplemental material).
After A01 and A04 were fully sequenced, it was noted that the A01 sequence also had a mismatch within the 4 nt at the 3' end of the 1492R binding site that are not shared by 1505R-10 (see S2 in the supplemental material). The A01 and A04 sequences were aligned to the comprehensive 16S rRNA gene sequence database to assess the sequences within conserved regions (see S2 in the supplemental material). The alignment revealed that A01 and A04 contained mismatches within the 27F-P and 1492R-P primer binding regions and within other regions of the 16S rRNA gene that are very highly conserved across the Bacteria domain (see S2 in the supplemental material). Importantly, predictions of rRNA secondary structure based on the Arb alignments supported the presence of the divergent bases in the A01 and A04 sequences—of the nine divergent bases observed (see S2 in the supplemental material), seven are located in base-paired regions, and for five of these bases, the expected complementary changes are present at the proper locations to preserve Watson-Crick pairing, and in the remaining two cases, G:G and G:A base pairs are present for all members of the phylogenetic clusters in which A01 and A04 reside, respectively. An examination of 20 randomly selected divergent bases within conserved helical regions also revealed the expected complementary bases at the correct locations to preserve Watson-Crick pairing for all 20 sites chosen (data not shown). Thus, given the covariation present in the sequences, it is exceedingly unlikely that the divergent bases arose from PCR errors or chimeric sequences.
Miniprimer PCR characterization of the Candelaria microbial mat community.
We applied the method to a mature microbial mat community from an extreme environment, the hypersaline Candelaria lagoon of the Cabo Rojo salterns. This mat community was chosen for study because it was expected to have intermediate diversity and interesting phylotypes. To identify members of the mat community for this study, we constructed six 16S rRNA gene sequence libraries with either the miniprimer or standard long-primer techniques.
We constructed libraries with the miniprimer pair 27F-10/1505R-10 and two different long-primer pairs: 27F-P/1492R-P and 27F-HT/1492R-HT (Table 1). The primers 27F-P and 1492R-P are "first-generation" nondegenerate universal bacterial primers that remain in widespread use today (1, 5, 18, 19, 39), and 27F-HT and 1492R-HT are more recently published primers that are based on the original first-generation primers and include several degenerate positions to broaden their scope of targets (37). These primers produce nearly full-length 16S rRNA gene sequences and thus provide a fair assessment of the miniprimer method relative to standard methods in common use; duplicate libraries were constructed with each of the primer pairs (Table 2).
|
View this table: [in a new window] |
TABLE 2. Candelaria microbial mat 16S rRNA gene libraries constructed
|
![]() View larger version (17K): [in a new window] |
FIG. 4. Distribution of microbial mat library sequences in major bacterial divisions identified. The 10 taxonomic divisions with the highest representation in the libraries are shown as their relative proportions within the miniprimer (M) and two types of long-primer (P, 27F-P/1492R-10; H, 27F-HT/1492R-HT) libraries. The aggregate distribution of all libraries is also shown (all). Phylogenetic assignments were performed in Arb; the "other divisions" category comprises 31 divisions whose representation is less than 3 to 5% in each library. Proper Proteobacteria class names with standing have been shortened using Greek letters.
|
![]() View larger version (15K): [in a new window] |
FIG. 5. Distribution of microbial mat best database matches. The fractions of sequences whose best database matches fall into the designated distance intervals are shown for the long-primer (P, 27F-P/1492R-10; H, 27F-HT/1492R-HT) and miniprimer (M) libraries. Distances of 0.05, 0.10, and 0.20 approximately define the taxonomic levels of genus, order, and division, respectively (32). Plotted values and errors are the averages and standard deviations of six maximum likelihood calculations using six different taxonomies (15).
|
![]() View larger version (27K): [in a new window] |
FIG. 6. Microbial mat sequences with low-scoring database matches. Miniprimer (red), long-primer (green and blue), and reference (black) sequences were assembled into a phylogenetic tree by use of maximum likelihood (35). The best-scoring tree of 20 independent calculations is shown; branches supported by bootstrap values of greater than 50 (100 iterations) are labeled. Some bootstrap values near closely spaced terminal clusters have been removed for clarity; none of the removed values were less than 50 and most were greater than 80. Uncultured isolates are identified by their accession numbers and by the environment from which the sequences were cloned; cultured isolates are in italics. Taxonomic groups are indicated for clusters of three or more sequences that could be classified. Seven archaeal 16S rRNA gene sequences were used to root the tree.
|
To evaluate the reproducibility of the three library construction methods and to compare the methods, the 16S rRNA gene sequence libraries were compared to each other by calculating similarity indices (34) using 97% sequence similarity to define OTUs. Here, we focus on the classic incidence-based Sørenson similarity index, a measure of membership, and the Clayton
, a nonparametric maximum likelihood estimator of community structure that considers abundance (41). These calculations demonstrate two trends (Table 3): first, duplicate libraries constructed with the same primers show high levels of similarity in both membership and structure; and second, libraries constructed with long primers are typically more similar to each other than to libraries constructed with miniprimers. These trends are particularly evident when relative abundances are considered (
, Table 3). Importantly, the data suggest no significant dependence of library membership or structure on the particular polymerase used, as demonstrated by the high similarities of duplicate libraries constructed with the same primer pairs, 27F-P/1492R-P or 27F-HT/1492R-HT, but amplified with different polymerases (Tables 2 and 3). Moreover, the miniprimer method is as reproducible as either of the other methods tested here using long-primer pairs. Thus, these data suggest that differences in compositions of the libraries constructed with miniprimers and long-primer libraries are not due to the enzyme used for amplification.
|
View this table: [in a new window] |
TABLE 3. Estimation of library similaritiesa
|
Lastly, we examined the miniprimer clone sequences to quantify mismatches within the regions that would be targeted by the long primers 27F-P, 27F-HT, 1492R-P, and 1492R-HT (Fig. 7). For the 27F binding region, 309 and 266 of the 598 miniprimer sequences deviated from the 27F-P and 27F-HT primer sequences, respectively, at one or more nucleotides; sequences with two mismatches to 27F-P and 27F-HT numbered 20 and 10, respectively, and 1 sequence had three mismatches to the 27F-P sequence. Though the miniprimer sequence consensus largely agreed with the sequences of the long 27F primers, the nucleotide frequency at position 12 did not agree with the degeneracy present at this position in 27F-HT. Position 12 of 27F-HT is designed to match A or C; however, while in the cloned miniprimer sequence libraries C was present in the majority of sequences, G was present more than four times as often as was A, and T was observed twice. Similar analysis of the 1492R binding region identified one sequence from the miniprimer clone libraries having a single mismatch to 1492R-P and 1492R-HT (data not shown).
![]() View larger version (24K): [in a new window] |
FIG. 7. Mismatches of 27F-P and 27F-HT primers to microbial mat miniprimer library sequences. (A) Portions of long primers 27F-P and 27F-HT that bind past the 3' end of miniprimer 27F-10. (B) Bases represented by degenerate codes in 27F-HT. (C and D) The relative (C) and absolute (D) base frequencies observed within the 598 sequences of the miniprimer sequence libraries at each position within the 3' halves of the 27F-P and 27F-HT binding sites. Note that at some positions the quantities of bases shown in panel D are not large enough to be resolved by the representation shown in panel C.
|
|
|
|---|
Potential amplification of nontarget sequences.
One may expect that the shorter size of miniprimers could increase promiscuous binding to targets other than the 16S rRNA gene. However, analysis of environmental sequences suggests this may not be problematic. Of the 1,129 predicted miniprimer amplicons in the NCBI environmental sequence database (Fig. 3), 61 (
5.4%) were found not to be 16S rRNA gene sequences; a similar calculation for long primers yields a similar false-positive rate (19/320
5.9%). In addition, the rates of identifying mitochondrial and chloroplast sequences (see S1 in the supplemental material) were also the same (8/685
3/205
1%) for both methods. In support of this, the microbial mat libraries contained no eukaryote, chloroplast, or mitochondrial sequences, though eukaryotes are known to associate with the Candelaria mats (11). Overall, the results suggest that miniprimers can identify more 16S rRNA gene sequences without an increased rate of amplifying false positives. However, much additional testing and sequencing of amplicons isolated from environmental samples should be completed to test this prediction.
Benefits of miniprimer PCR.
Adding miniprimer PCR to the tools used for analyzing microbial communities may enable a more accurate measure of 16S rRNA gene sequences in environmental samples by expanding the sequences detectable by PCR. For example, in the 1 Gbp of randomly cloned sequence in the environmental sequence database, if one assumes a microbial genome size of 2 to 3 Mbp and an average of 2.5 16S rRNA genes per genome (2), it is expected that
800 to 1,200 16S rRNA genes are present in the sample. When totaled, the miniprimer computational searches identified 1,068 putative 16S rRNA gene amplicons (Fig. 3), in close agreement with the estimate, while the searches with the longer primers totaled 301 (Fig. 3). If one assumes that decreasing the length of a PCR primer by 1 nt can increase by fourfold the number of perfectly matched sequence targets, then reducing the length of a primer from 20 nt to 10 nt could theoretically increase the possible targets by a factor of a million (
410). Sequence variation within the regions targeted by these particular miniprimers may render the theoretical limit meaningless in this instance; nonetheless, this upper limit could inform other applications of the method.
In addition to expanding the range of detectable targets, combining miniprimer PCR with standard techniques might increase the accuracy of environmental sampling by enabling estimates of microbial diversity to reflect sampled communities more closely. When PCR is used to analyze samples containing multiple heterogeneous templates, many mechanisms can result in a biased distribution of amplicons that does not accurately represent the distribution of templates in the sample (3, 22, 30, 36). Miniprimers may reduce these kinds of biases in two ways. First, because miniprimers are shorter and can target the most conserved regions of the 16S rRNA gene, the potential for mismatch is reduced compared with long primers. The degeneracy of 27F-HT was not appropriate for the nucleotide frequencies of the miniprimer sequences (Fig. 7), suggesting that particular communities may be better analyzed with a miniprimer such as 27F-10. Second, because miniprimers target smaller regions, the requirement for degenerate positions in primers can be eliminated or minimized, thus removing or decreasing preferential binding of particular primer species within a degenerate primer mixture. Obviously, the use of shortened primers may introduce different biases into PCR-based analyses of microbial communities, and more study is required to understand that potential, perhaps using known mixtures of bacterial DNAs. However, despite the biases of any PCR method, miniprimer PCR may be useful to supplement traditional methods either to reduce or make apparent the biases that occur in analysis of environmental samples and allow a more accurate description of microbial communities.
Miniprimer design.
Though general rules have been established for Taq PCR primer design, it is not yet evident what parameters require optimization when primers are substantially shortened. Successful amplification was obtained using miniprimers whose G+C content covered a broad range from 30 to 90% (Fig. 2). Thus, product formation appeared unrelated to the G+C content of miniprimers. In our tests, a high rate of success has been obtained with 1505R-10 paired with many different forward primers (Fig. 2). Of the putative primer binding sequences identified in the environmental sequence database, the fewest sites were found for 1505R-10, suggesting that a productive miniprimer maximizes specificity for the target sequence in the context of the total template sequence present in the PCR. This may be a good rule with which to begin miniprimer design. It should be noted that the miniprimers used here are initial versions and that they should be optimized further. For instance, neither 27F-10 nor 27F perfectly matches Chlamydia, and 788F-10 does not match Dehalococcoides. Thus, some problems associated with long primers remain concerns in miniprimer design, and regularly checking, curating, and updating miniprimer sequences are as required for miniprimers as for the longer primers.
Future prospects for miniprimer PCR.
The use of miniprimers in PCR can benefit many areas of biological research in several ways. PCR with miniprimers may enable PCR strategies that are not possible when primer design is restricted to a minimum length of
20 nt. Some strategies might benefit from using even shorter primers—preliminary data indicate that miniprimers can be as short as 8 nt when paired with a primer of 15 to 20 nt (data not shown). Also, miniprimer sequences provide a small amount of additional sequence data that may be helpful for making fine discriminations among closely related sequences. Finally, the combination of highly processive enzymes and shorter primers may decrease the cost of PCR. The increased processivity enables PCR to be performed with smaller amounts of enzyme (40), and shorter primers are less costly to synthesize. Although cost savings may be minimal for small projects, these considerations become important for projects requiring very large numbers of PCRs.
This work was supported by grant number F32AI065067 from the National Institute of Allergy and Infectious Diseases, grant number NRA-01-01-ASTID-020 from the National Aeronautics and Space Administration, and grant numbers MCB-0132085 and MCB-0455620 from the National Science Foundation Microbial Observatory Program.
The content is solely our responsibility and does not necessarily represent the official views of the National Institute of Allergy and Infectious Diseases, the National Institutes of Health, the National Aeronautics and Space Administration, or the National Science Foundation.
Published ahead of print on 14 December 2007. ![]()
Supplemental material for this article may be found at http://aem.asm.org/. ![]()
Present address: Finney Capital, 489 Douglass St., San Francisco, CA 94114. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»