Previous Article | Next Article ![]()
Applied and Environmental Microbiology, April 2007, p. 2416-2422, Vol. 73, No. 8
0099-2240/07/$08.00+0 doi:10.1128/AEM.02474-06
Copyright © 2007, American Society for Microbiology. All Rights Reserved.
,
U.S. Environmental Protection Agency, Office of Research and Development, National Risk Management Research Laboratory, 26 West Martin Luther King Drive, MS-387, Cincinnati, Ohio,1 Department of Microbiology and Immunology and Department of Biology, University of Louisville, Louisville, Kentucky2
Received 23 October 2006/ Accepted 20 December 2006
|
|
|---|
|
|
|---|
The lack of accurate methods for identifying sources of fecal pollution has stimulated the recent development of a number of microbial source tracking (MST) methods. In general terms, these MST methods can be divided into culture-based and culture-independent techniques (23). Most culture-based methods for identification of human sources depend on matching panels of environmental bacterial isolates with known human fecal indicator type strains. A major limitation of this approach is its requirement for the development of large collections of isolates from both water and human fecal samples. Thus, MST methods that do not require cultivation, such as the direct detection of bacterial 16S rRNA gene sequences using PCR, are becoming more widespread (5, 9). While these assays have now been used in field applications (5, 9, 20), a recent study demonstrates understandable cross-reactivity when highly conserved genomic regions are targeted (7). Since ribosomal genes are not directly involved in microbe-host interactions, it is possible that other bacterial genetic markers encoding factors related to host specificity might be better candidates for MST assays. Although a significant number of bacterial genes have been identified as relevant to host-microbe interactions in the human gut (14, 15), the challenge remains to identify which of these genes are from bacteria that are truly restricted to this specific niche. We hypothesize that direct comparisons of the genetic coding capacities of entire human fecal bacterial communities can identify such factors involved in host-microbe interactions and that these would be the best targets for PCR assays designed to identify sources of human fecal contamination (21, 22).
We recently developed a nucleic acid analysis method called genome fragment enrichment (GFE) to identify differences in the genomes of phylogenetically related bacterial species and to identify differences in total microbial community DNA obtained from different sources (21, 22). Here we describe the extension of this approach to address the highly significant problem of identification of human fecal contamination in water, and we report important differences encountered relative to other animal sources. A large number of candidate marker sequences are described, with a major characteristic being that almost half are predicted to encode bacterially secreted or cell surface factors located at the interface with host cells. Four new human-specific PCR assays developed by this approach and potentially useful in diagnosing human fecal pollution in environmental samples are described.
|
|
|---|
GFE.
A single round of GFE was used to select potential human-specific fecal community genetic markers as described previously (22). Briefly, biotin-labeled sheared total fecal DNA from a single reference human specimen was first prehybridized with sheared DNA fragments from total fecal DNA from an individual reference pig specimen. This "blocked" biotin-labeled DNA was then hybridized to equilibrium in solution with additional DNA fragments from the original source (human total fecal DNA) that contained defined terminal sequence "tags" that had been added by primer extension as described previously (22). DNA hybrids were then isolated by streptavidin binding, and the captured tagged genomic fragments were amplified by PCR using a single primer complementary to the defined 5' fragment tags (13). The required specificity of the final PCR using either the K9-PCR or F9-PCR tag complementary primer (11) was verified using reference sheared human and pig total fecal DNA as templates. Five GFE PCRs were performed in individual tubes and then mixed to reduce sample variability in amplifying complex nucleic acid pools. The same pig fecal sample used as the GFE "blocker" in a previous report (22) was selected for this study to allow a direct comparison of animal-specific PCR assay development success rates between studies and because human and pigs share similar anatomies, physiologies, and diets.
DNA sequencing.
Final GFE products from five identical parallel PCRs were pooled and incorporated into the pCR4-TOPO plasmid vector as described by the manufacturer (Invitrogen; Carlsbad, CA). Individual Escherichia coli clones were then cultured in 300 µl of Luria broth containing ampicillin (10 µg/ml) and screened for inserts by PCR using M13F and M13R primers. Prior to sequencing, PCR products were purified using the QiaQuick 96 PCR purification kit (QIAGEN; Valencia, CA). Sequencing was performed on both strands at the Cincinnati Children's Hospital Medical Center Genomics Core Facility (Cincinnati, OH) by the dye-terminator method, using an ABI PRISM 3730XL DNA analyzer (Applied Biosystems, Foster City, CA).
Dot blot hybridizations.
Dot blot hybridizations with cloned GFE sequences and pig fecal DNA (GFE blocker) probe were used to identify any "false-positive" plasmid clone inserts obtained by GFE that were not unique to the original human total fecal DNA source. Probe preparation, hybridization conditions, and detection were performed as described previously (22).
Data analysis.
DNA sequence reads were assembled using SeqMan II (DNAstar, Inc.; Madison, WI) and used to search the National Center for Biotechnology Information (NCBI) RefSeq database using BLASTx software (1) (http://www.ncbi.nlm.nih.gov/BLAST/); BLASTx hits with expectation values of
1E03 were designated as homologous. Gene function attributes for DNA sequences were assigned based on annotations available in the Comprehensive Microbial Resource genome database (CMR) at The Institute for Genomic Research (http://www.tigr.org/tigr-scripts/CMR2/CMRGenomes.spl).
Selection of relevant markers for PCR assays.
Twenty-six DNA sequences were chosen for the development of PCR assays based on best BLASTx match annotated gene function attributes potentially relevant to interaction with hosts. Target DNA sequences selected were those where (i) the BLASTx best hit contained either a predicted transmembrane helix or a secretory signal region based on either SignalP (3) or TMHMM prediction (16) or where there was published experimental evidence of cell surface or extracellular activity; (ii) DNA sequences were annotated as encoding noncystolic protease or factors involved in capsular polysaccharide synthesis; or (iii) sequences were predicted to encode proteins with a function other than that associated with DNA metabolism, protein synthesis, antibiotic or multidrug resistance, or mobile element functions. PCR primers were designed using PrimerSelect (DNAstar, Inc.; Madison, WI) with default settings. Candidate primer sequences were aligned with homologous sequences (e-value of
e03) from the NCBI BLASTx analysis using ClustalW (24) with default settings (MegAlign; DNAstar, Inc., Madison, WI). Primer sets that aligned to variable DNA regions among homologous sequences were selected for optimization, host specificity, and limit-of-detection assays.
Primer optimization, host specificity, and limit of detection.
Optimal annealing temperatures were measured for each primer pair using thermal-gradient PCR as described previously (22). To assess specificity, each PCR assay was tested against DNA extracts (2 ng/reaction) from target (human) and nontarget animal fecal samples representing 106 individual specimens. Reference fecal samples represented 11 species of animals, including Bos taurus (cow), Gallus gallus (chicken), Anser sp. (Canadian goose), Canis familiaris (dog), Felis cattus (cat), Capra aegagrus (domestic goat), Sus scrofa (pig), Ovis aries (sheep), Equus caballus (horse), Odocoileus virginianus (whitetail deer), and Homo sapiens (human). The spatial robustness of each human-specific primer set was estimated by testing each PCR assay against a panel of wastewater samples (primary effluent and stabilization pond samples) representing 18 different human populations spanning 11 states. To test for nonspecific amplification of DNA from representative environmental microorganisms and explore the application of each human-specific primer set for water quality monitoring applications, PCR assays were performed using DNA extracted from recreational water (n = 6), storm water (n = 1), and treated wastewater effluent (n = 1) filtrates. The lower limit of detection for each primer set was estimated using serial dilutions of human total fecal DNA starting with a concentration of 10 ng/µl. All validation PCR assays were performed in duplicate. No-template, extraction blank, and water filtration blank PCR control assays were performed to test for the presence of extraneous DNA molecules introduced during laboratory experiments.
PCR assay controls.
Each sample tested yielded the expected PCR product when amplified with the Bacteroides-Prevotella 16S rRNA-specific primers 32F and 708R (4), indicating a lack of PCR inhibitors. To test for the presence of extraneous DNA molecules introduced during laboratory manipulations, no-template (n = 612), extraction blank (n = 35), and water filtration blank (n = 5) controls were included in PCR assays. In all cases the results were negative. Due to the requirement for the single-primer amplification step (13) amplifying DNA fragment pools via terminal tag sequences and the complex nature of metagenomic templates, we also tested the specificities of K9-PCR and F9-PCR primers. No amplification was seen when the K9-PCR primer was tested with sheared human metagenomic DNA, nor when the F9-PCR primer was tested against pig metagenomic DNA (data not shown).
|
|
|---|
![]() View larger version (37K): [in a new window] |
FIG. 1. Dot blot hybridization analysis of putative host-specific DNA fragments. PCR amplicons from all nonredundant clone sequences (93 shown) were transferred to nylon membranes and hybridized to a biotin-labeled human (A) or pig (B) fecal metagenomic DNA probe. Positive controls included 500 ng of human fecal metagenomic DNA (panel A, row A, column 11; panel B, row B, column 9) and 500 ng (panel A, row A, column 12; panel B, row B, column 12) of pig fecal metagenomic DNA. None of the "no DNA" controls (panel A, row B, column 12 and row H, column 12; panel B, row A, column 12 and row H, column 12) hybridized to the probe.
|
1 x 103 (see Table S1 in the supplemental material). The BLASTx hit with the lowest expectation value for each region was designated the best BLASTx hit (see Table S1 in the supplemental material). The top BLASTx hits for each region averaged only 57.6% sequence identity (ID) to the actual GFE plasmid clone sequences. Fifty-three of the 289 human fecal community DNA sequences (18.3%) (Fig. 2) showed no similarity to any previously reported gene sequences. All GFE sequence regions obtained were then assigned to 1 of the 18 functional groups described in the CMR based on annotations of best BLASTx sequence matches (Fig. 2) (see Table S1 in the supplemental material). The individual categories most frequently assigned were "unknown function" (14.8%) and "hypothetical proteins" (10.7%). Based on additional existing bioinformatic analyses of best-match genetic regions, 126 GFE sequences were annotated as likely to encode membrane-associated or putative extracellular proteins. Only two distinct DNA sequences were indicated by more than one nonredundant clone obtained by GFE. These sequences were similar to those encoding a hypothetical protein, DR1284 (28), isolated from Deinococcus radiodurans R1, and a previously uncharacterized DNA sequence of unknown function (see Table S1 in the supplemental material). Overall, DNA sequences obtained by GFE were therefore predominately representative of previously uncharacterized coding capacity of the bacteria that compose the human intestinal microbiota.
![]() View larger version (22K): [in a new window] |
FIG. 2. Functional group assignments for GFE nonredundant sequences. Functional groups are listed along the y axis, and the percentage of GFE sequences (total number = 290) for each group assignment is shown along the x axis.
|
|
View this table: [in a new window] |
TABLE 1. Putative human-specific PCR assay primers, optimal annealing temperatures, and target DNA annotation
|
|
View this table: [in a new window] |
TABLE 2. Summary of putative human-specific PCR assay specificity testsa
|
A series of initial studies was then performed to explore the future utility of these PCR assays for environmental monitoring of human fecal pollution. DNA extracts isolated from various wastewater facilities were used as a template to estimate the spatial robustness of each human-specific PCR assay. Three human fecal matter-specific PCR assays (assays 19, 22, and 30) yielded target products for all of the wastewater samples, while assay 7 showed the lowest spatial distribution, with amplification from only 61% of the samples.
To demonstrate the potential of each human-specific PCR assay for environmental monitoring, each primer set was tested with DNA isolated from recreational water, storm water, and treated wastewater effluent samples. Both the Heiserman Stream sample (Milford, OH), taken from an area situated 100 m downstream of a treated wastewater discharge pipe, and the treated wastewater sample (Arrowhead, OH) tested positive in PCR assays 19, 22, and 30. A river sample (9-Mile Creek, OH) taken from an area approximately 1,000 m downstream of a treated wastewater discharge pipe also tested positive in assays 19 and 30. All assays yielded no detectable PCR product for the remaining storm water and river samples, as shown in Fig. 3.
![]() View larger version (59K): [in a new window] |
FIG. 3. Gel electrophoresis of PCR products from reactions with human-specific PCR assays 19, 22, 30, and 7 (A, B, C, and D, respectively). Each PCR assay was tested against DNA extracts from recreational water, storm water, and treated wastewater samples. Sources: Miami Trails, OH, column 1 (lane 1); Lower East Fork I and II, OH (lanes 2 and 6, respectively); O'Bannon, OH, (lane 3); 9-Mile, OH (lane 4); Middle East Fork, OH (lane 5); Heiserman, OH (lane 7); Arrowhead, OH (lane 8); extraction blank (lane 9); no template controls (lanes 10-11); and human fecal DNA (lane 12). PCR products in lanes 10 and 11 from panel A are primer dimers.
|
|
|
|---|
Comparative sequence analysis of the GFE plasmid clone sequences obtained suggests that much of the genetic capacity of the reference human fecal microbial community not present in a pig specimen resided in previously uncharacterized microbial genes. Although many more GFE fragments would need to be sequenced to generate a complete assessment of the genetic differences between these two microbial communities, we were, perhaps surprisingly, able to obtain desired human bacterium-specific DNA marker sequences by analysis of only a relatively small number of plasmid clones and only two source specimens. Classification of a limited number of GFE clone sequences into functional groups indicated an abundance of genes that potentially encode bacterial membrane-associated or extracellular proteins. Among these, a striking 85% of 126 falling into the category of "surface-associated" factors (see Table S1 in the supplemental material) were predicted by SignalP (3) to encode secreted proteins. These findings suggest that a potential major difference between the reference human and pig fecal microbial communities is in their capacity for producing distinct secreted factors. We also previously observed this trend in the analyses of reference cow and pig fecal microbial communities (22). These findings are consistent with a hypothesis that highly specific bacterial markers may be found in genes related to host specificity, where surface and secreted factors are often involved in interactions with distinct types of host cells and tissues, in modifying the external bacterial cell surface, and in obtaining necessary nutrients from highly defined external environments.
In terms of the potential utility of the specific marker assays described for water quality monitoring, initial tests of the four human-specific PCR assays exhibited good spatial robustness across 11 states by consistently testing positive for almost all contaminated wastewater samples representing 18 different human populations. These preliminary experiments suggest that these PCR assays, particularly assays 19 and 30, may have a future utility in environmental monitoring and merit more extensive characterization. However, in order to realize the potential of these PCR assays for MST applications, several issues remain to be addressed. These include survival of target DNA molecules in the environment, relevance of each PCR assay to current culture-based fecal indicator methods used to monitor water quality, and establishing a link between the prevalence of genetic markers described and relevant public health risks. Both the broad distribution of these microbial genetic makers across human populations and the level of specificity established do encourage us to further explore the potential of the assays described for more accurately identifying human fecal contamination in our waters.
We are grateful to Mark Meckes, Janet Blannon, Matt Morrison, Sam Myoda, and Don Stoeckel for providing fecal and wastewater samples.
Any opinions expressed in this paper are those of the author(s) and do not necessarily reflect the official positions and policies of the U.S. Environmental Protection Agency, and any mention of products or trade names does not constitute recommendation for use.
Published ahead of print on 5 January 2007. ![]()
Supplemental material for this article may be found at http://aem.asm.org/. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»