Previous Article | Next Article ![]()
Applied and Environmental Microbiology, August 2003, p. 4942-4950, Vol. 69, No. 8
0099-2240/03/$08.00+0 DOI: 10.1128/AEM.69.8.4942-4950.2003
Copyright © 2003, American Society for Microbiology. All Rights Reserved.
Laboratory of Microbial Ecology and Technology, Ghent University, B-9000 Ghent, Belgium
Received 20 December 2002/ Accepted 29 May 2003
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
A variety of methods, such as phage susceptibility (27) and multiple antibiotic resistance (MAR) profiles (21, 34), have been proposed to identify fecal sources in water. However, these techniques have some inherent uncertainties. The susceptibility of bacteria to specific phages undergoes periodic cycling due to the natural adaptation-counteradaptation cycle (49), and the relationship between phage and bacterial numbers is still not clear (12). MAR profiles can distinguish between human and nonhuman sources of Escherichia coli (26) and streptococci (23). However, antibiotic resistance genes are typically encoded on plasmids. These mobile genetic elements are often lost or obtained in response to altered environmental conditions, raising questions as to the stability of these markers (16) and the susceptibility of MAR profiles to changing environmental conditions (19).
Presently, DNA fingerprinting techniques such as ribotyping (7, 25, 39), pulsed-field gel electrophoresis (38), and PCR analysis of the 16S-23S rRNA intergenic spacer region (ISR) (6) and of repetitive extragenic palindromic (rep) sequences of E. coli isolates (15) are used to discriminate between human and nonhuman sources of fecal material. These methods depend on a library which consists of a collection of fingerprints of microorganisms from different potential fecal sources. The aim of these methods is to compare the fingerprints of environmental isolates to the library, which would indicate if the fecal pollution in the environment is derived from a particular host group represented in the library.
Rep-genotyping uses primers complementary to interspersed conserved repetitive DNA elements, present in multiple copies throughout the genome (44). Three families of repetitive elements have been identified: the repetitive extragenic palindromic (REP) sequences (42), the enterobacterial repetitive intergenic consensus (ERIC) sequences (29), and the BOX sequences (35). These repetitive elements are thought to be highly evolutionarily conserved because rep sites are essential protein-DNA interaction sites or because these sequences may propagate themselves as "selfish" DNA by gene conversion (44). Amplification of the distinct genomic regions located between these repetitive elements results in a distinctive strain pattern (45).
rRNA (rrn) operons are typically arranged in the order 16S-ISR-23S-ISR-5S and are present in multiple copies. For example, E. coli has seven rrn operons (10). These ISRs are under minimal selective pressure and often vary among strains, whereas the flanking rRNA genes are highly conserved. Amplification of the ISR can therefore be performed with universal primers targeted to conserved sites in the 16S and 23S rRNA genes (31). Thus, ISR analysis can be performed on all organisms and yet has the ability to discriminate between species and strains (2). The 16S-23S ISR is believed to be involved in the processing of precursor rRNA and in some organisms, such as E. coli, contains coding sequences for tRNA molecules (10, 46).
The objective of this study was to find candidate source-specific E. coli fingerprints as potential genotypic markers for raw sewage, horses, dogs, gulls, and cows. We evaluated 16S-23S rRNA ISR-PCR and rep-PCR analysis of E. coli isolates as tools to identify nonpoint fecal sources. The BOXA1R primer, based on the boxA sequence, was used in rep-PCR analysis (35). E. coli was selected because it has long been used as an indicator of human enteric pathogens in aquatic systems and is present in most warm-blooded animals (3, 22). Recent studies have shown that both ISR-PCR and rep-PCR analyses can conceptually differentiate between human and nonhuman fecal sources (6, 15). However, questions remain on whether these techniques can discriminate between nonhuman sources such as agricultural runoff or canine fecal material. Moreover, the stability of BOX and ISR fingerprints during the transition of E. coli from its primary habitat (the intestines of warm-blooded animals) to a secondary habitat (a water body) is unknown (19, 47). If these genotypic fingerprints are unstable, then this approach is likely of little use. Finally, the practicality of these methods is unresolved. For example, sample size requirements are critical to field implementation of these approaches since high costs are involved in the isolation and analysis of microorganisms (23).
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
DNA extraction.
Half a plate of colonies grown on plate count agar was suspended in 500 µl of DNase- and RNase-free filter-sterilized water (Sigma-Aldrich Chemie, Steinheim, Germany). DNA was extracted according to the protocol from Hambly et al. (24). Fifty microliters of lysis buffer (1% [wt/wt] sodium dodecyl sulfate, 0.05 M EDTA, pH 8) was added to the cell suspension. After 15 min of incubation at room temperature, a 1:1 (vol/vol) phenol extraction was performed and followed by two 1:1 (vol/vol) phenol-chloroform extractions. Between each extraction step, the suspension was centrifuged for 10 min at 14,360 x g. DNA was precipitated by adding 10% (vol/vol) 3.3 M sodium acetate (pH 5.2) and 75% (vol/vol) isopropyl alcohol (100%), incubation for 1 h at -20°C, and centrifugation for 30 min at 14,360 x g. The pellet obtained was resuspended in 200 µl of TE buffer (10 mM Tris-HCl, 1 mM EDTA, pH 8.0). DNA concentrations of extractions of 10 isolates were measured with a spectrophotometer and found to be within 2 to 10 µg/µl.
Rep-PCR.
BOX-PCR fingerprints were obtained with the BOXA1R primer (5'-CTACGGCAAGGCGACGCTGACG- 3') (45). PCR was performed as described by Rademaker and de Bruijn (40) with 2 µl of E. coli DNA in a volume of 25 µl. The PCR mixture contained 0.4 µM primer BOXA1R, 5 mM each deoxynucleoside triphosphate, 20 µl of 5x Gitschier buffer (83 mM NH4SO4, 335 mM Tris-HCl [pH 8.8], 55.5 mM MgCl2, 33.5 mM EDTA, and 150 mM ß-mercaptoethanol in 200 ml of buffer), 8 U of Taq DNA polymerase (Promega, Madison, Wis.), 0.8 µl (20 mg/ml) of bovine serum albumin (Boehringer), 10 µl of dimethyl sulfoxide (Sigma-Aldrich Chemie), and DNase- and RNase-free filter-sterilized water (Sigma-Aldrich Chemie) to a final volume of 100 µl.
Each PCR was performed with a 9600 thermal cycler (Perkin-Elmer, Norwalk, Conn.). The PCR was initiated by incubating the reaction mixture at 95°C for 2 min, followed by 30 cycles of 3 s at 94°C, 30 s at 92°C, 1 min at 50°C, and 8 min at 65°C. The reaction was terminated with an extension step consisting of 8 min of incubation at 65°C. All PCR experiments contained a positive control (DNA of E. coli ATCC 25922) and a negative control (no DNA). DNA concentrations of PCR products of 10 isolates were measured with a spectrophotometer and found to be within 1.5 to 2 µg/µl. PCR samples (10 µl) were loaded on a 1.5% horizontal agarose gel (Gibco-BRL 15510-027; Invitrogen S.A., Merelbeke, Belgium) together with a 1-kb size ladder (0.5 µg/well; N.V. Invitrogen S.A.), and the positive control. All gels were run in 0.5x TAE (10 mM Tris, 5 mM acetate, 0.1 mM EDTA, pH 7.4) for 17.5 h at 4°C and 70 V, stained for 20 min in 0.5x TAE buffer containing 0.5 µg of ethidium bromide per ml, and immediately photographed on a UV transillumination table with a video camera module (Vilbert Lourmat, Marne-la-Vallée, France).
ISR-PCR and DGGE.
The two universal primers employed for amplification of the 16S-23S ISR were adapted from Jensen et al. (31). The forward primer, designated G1 (5'-GAAGTCGTAACAAAGG-3'), is located approximately 30 to 40 bp upstream of the spacer boundary. A GC-clamp (5'-CGCCCGCCGCGCCCCGCGCCGTCCCGCCGCCCCCCGCCCCC-3') was added to the 5' end of the G1 primer (36). The reverse primer, L1 (5'-CAAGGCATCCACCGT-3'), is located approximately 20 bp downstream of the spacer boundary.
PCR was carried out in a volume of 25 µl, which contained 1 µl of E. coli DNA. The PCR mixture contained 2 µM G1 and 0.2 µM L1, 200 µM each deoxynucleoside triphosphate, 1.5 mM MgCl2, 10 µl of thermophilic DNA polymerase 10x reaction buffer (MgCl2 free), 2.5 U of Taq DNA polymerase (Promega), 0.25 µl (20 mg/ml) of bovine serum albumin (Boehringer), and DNase- and RNase-free filter-sterilized water (Sigma-Aldrich Chemie) to a final volume of 100 µl. The PCR was initiated with 3 min at 94°C, followed by 25 cycles of 1 min at 94°C, 2 min at 55°C, and 2 min at 72°C. The final cycle was followed by an additional 7 min at 72°C. All PCR experiments contained a positive control (DNA of E. coli ATCC 25922) and a negative control (no DNA).
Denaturing gradient gel electrophoresis (DGGE) (36) was performed with the Bio-Rad D Gene System (Bio-Rad, Hercules, Calif.). Samples (10 µl) were loaded onto 6% (wt/vol) polyacrylamide gels with a denaturing gradient ranging from 35 to 50% (where 100% denaturant contains 7 M urea and 40% formamide). Electrophoretic charge was applied in the same direction as the denaturants. All gels were run in 1x TAE buffer for 16 h at 60°C and 45 V, stained for 20 min in 200 ml of 1x TAE containing 16 µl of SYBR Green I nucleic acid gel stain (1:10,000 dilution; FMC BioProducts, Rockland, Maine), and immediately photographed as described above.
Fingerprint stability experiment.
The stability of the BOX and ISR E. coli fingerprints was assessed at three different temperatures (4, 12, and 28°C) in natural water. Two E. coli isolates from one dog (D1 and D2) and two E. coli isolates from one cow (C1 and C2) were grown overnight in nutrient broth (Oxoid Ltd. CM1). Then 50 µl of the overnight cultures was inoculated into 5 ml of autoclaved canal water (21 ± 1 mg of total organic carbon liter-1, 7.5 ± 0.1 mg of NO3- liter-1, <0.1 mg of NO2- liter-1) (Coupure, Ghent, Belgium), resulting in an initial concentration of approximately 106 CFU/ml. The tubes were shaken every 2 days to supply the cultures with oxygen. The fingerprint stability experiment ran for 150 days, after which isolates were recovered on MacConkey agar and transferred to plate count agar. The BOX and ISR fingerprints of five colonies per E. coli isolate and per temperature were compared to the fingerprint of the original isolate.
BOX and ISR fingerprint analysis.
Gel images and data were analyzed with Bionumerics software (version 1.5; Applied Maths, Kortrijk, Belgium). For the BOX fingerprints, band positions on each gel were normalized with the 1-kb size ladder as an external reference standard. Only DNA fragments ranging from 2,036 bp to 506 bp were used in the analysis (17). The bands were assigned to band classes, which resulted in transformation of a fingerprint into a binary table (0, absent in band class; 1, present in band class). The position tolerance settings for assigning band classes were calculated by tolerance and optimization analysis with the band-based Dice coefficient (13). The ISR fingerprints were analyzed similarly except for normalization. The ISR fingerprint of the E. coli ATCC 25922 strain was used as the external reference standard for normalization.
A matrix of similarities between the band class binary tables of the fingerprints was calculated based on the Dice coefficient (13). Dendrograms were created with the unweighted pair group method with arithmetic averages (UPGMA) linkage. To differentiate between different BOX and ISR fingerprints, a dendrogram was constructed based on Dice similarities between the binary tables of the E. coli ATCC 25922 strain, used as a positive control in all PCR runs and loaded on all gels. The similarity value of the cluster that contained all E. coli ATCC 25922 strains served as the similarity cutoff to identify distinctive BOX and ISR fingerprints.
Richness estimation.
The richness of the E. coli population in each fecal source type was estimated by the Chao1 index (9) with the freeware program EstimateS (version 6.0; R. K. Colwell, http://viceroy.eeb.uconn.edu/estimates). For each fecal source, a dendrogram was created with UPGMA linkage based on the matrix of similarities between the binary tables of the E. coli isolates of one sample. Clusters with a similarity coefficient lower than the similarity cutoff, were considered distinctive BOX and ISR fingerprints. The amount of E. coli isolates present in the distinctive BOX and ISR fingerprints served as input for the nonparametric richness estimation.
Selection of candidate source-specific E. coli fingerprints.
To search for candidate source-specific E. coli fingerprints, a dendrogram was created with UPGMA linkage based on the matrix of similarities between the binary tables of the 267 E. coli isolates from the five fecal sources. Clusters with a similarity coefficient lower than the similarity cutoff were considered distinctive BOX and ISR fingerprints. Distinctive BOX and ISR fingerprints that contained E. coli isolates from multiple samples of one fecal source type were selected as candidate source-specific E. coli fingerprints for the technique used.
Internal stability of BOX and ISR library.
Discriminant analysis was used to verify the internal stability of each fecal source, i.e., to calculate how many of the 267 isolates would be assigned to the correct fecal source. This was done by jackknife analysis (maximum similarity) on the Dice similarity data of the BOX and ISR fingerprints individually and for the combination of both fingerprints of each E. coli isolate. The average rate of correct classification (ARCC) for the library was obtained by averaging the correct classification percentages for all sources (48).
| RESULTS |
|---|
|
|
|---|
BOX and ISR fingerprints.
Figure 1 shows typical fingerprints of E. coli isolates from a gull sample generated by rep-PCR with primer BOXA1R and by ISR-PCR. The BOX fingerprints were relatively complex and generally consisted of 15 to 25 bands. The ISR fingerprints were less complex and consisted of two to six bands.
|
Selection of candidate source-specific E. coli fingerprints.
Figure 2 gives the distribution of the 267 E. coli isolates from the different fecal sources between the distinctive BOX and ISR fingerprints. A high diversity in fingerprints was encountered: 267 isolates yielded 59 distinctive BOX fingerprints and 87 distinctive ISR fingerprints. One horse-specific E. coli fingerprint was found with BOX-PCR, and one cow- and raw sewage-specific E. coli fingerprint was obtained with ISR-PCR (Fig. 3) Most of the candidate source-specific E. coli fingerprints of the different isolates were identical, although we do not have sequence information to confirm this. These candidate source-specific E. coli fingerprints were present in only two of the three source samples. Neither the gull nor the individual dogs had any BOX and ISR fingerprints in common. In each individual gull and dog, multiple E. coli isolates had identical BOX and ISR fingerprints.
|
|
|
Richness estimation.
Figure 4 gives the Chao1 richness estimation curves for both fingerprinting techniques, based on one sample of each fecal source. In total, 139 isolates were used in the BOX-PCR analysis and 158 isolates in the ISR-PCR analysis. Nonparametric richness estimators have been developed to estimate species richness from samples of macroorganisms. Recently, interest is emerging in applying these tools to microorganisms (28). The operational taxonomic units for the nonparametric richness estimation for this study were defined not as species but as the number of distinctive BOX and ISR fingerprints. The Chao estimators (8, 9) compare the proportion of fingerprints that have been observed before, i.e., in another sampling, to those that were observed only once. In a very rich community, the probability that a fingerprint will be observed more than once is low, and most fingerprints are only represented by one individual in a sample. However, in a depauperate community, the probability that a fingerprint will be observed more than once is higher, and many fingerprints are observed multiple times in a sample (28). In this study, the Chao1 estimator was used to determine the number of E. coli isolates needed to reliably estimate the richness of the E. coli population in the different fecal sources. Since the number of samples per fecal source was limited to three individuals, the actual E. coli richness cannot be conclusively assessed from this study.
|
Fingerprint stability experiment.
All four isolates incubated at all three temperatures except C2 at 4°C yielded colonies after plating on MacConkey agar. The BOX and ISR fingerprints of five colonies per E. coli isolate and per temperature were compared to the fingerprint of the original isolate. Figure 5 shows the BOX and ISR fingerprints of C1 of the five colonies typed per temperature. No change in BOX and ISR fingerprint was observed for this isolate at the different incubation temperatures after prolonged incubation. The same result was found for all the other isolates.
|
| DISCUSSION |
|---|
|
|
|---|
As noted by others (11, 47), the E. coli isolates were highly diverse, with only two candidate source-specific E. coli fingerprints for cow and raw sewage, identified out of the 87 ISR fingerprints present in the 267 isolates typed. Similarly, there was only one candidate source-specific E. coli fingerprint for horse out of a total of 59 BOX fingerprints. Most of the candidate source-specific E. coli fingerprints in the different isolates were identical, although we do not have sequence information to confirm this. The use of the similarity cutoff to select candidate source-specific E. coli fingerprints was rather stringent and based on the reproducibility of DNA extraction, PCR, and electrophoresis of a standard E. coli strain. This approach limits the number of candidate source-specific E. coli fingerprints because a lower similarity cutoff results in fewer candidate source-specific E. coli fingerprints, whereas a higher similarity cutoff can increase the number of candidate source-specific E. coli fingerprints. For example, when the similarity cutoff for ISR-PCR analysis was increased to 80%, one additional candidate source-specific E. coli fingerprint for horse was found.
The observed high diversity suggests that more samples per fecal source type need to be analyzed to determine if the candidate source-specific E. coli fingerprints are indeed source specific. Instead of isolating E. coli from additional fecal samples, primers can be designed for the candidate source-specific E. coli fingerprints. Multiple fecal samples can then be analyzed with a multiplex PCR to determine the source specificity of these candidate source-specific E. coli fingerprints.
Questions also remain on whether the use of these candidate source-specific E. coli fingerprints is geographically limited and why the candidate source-specific E. coli fingerprints were present in only two of the three source samples. The latter could be explained if these candidate source-specific E. coli fingerprints are intrinsic to some source individuals or by temporal diversity of E. coli (18). Follow-up studies are needed to assess the temporal diversity of these source-specific E. coli fingerprints. It is possible that the candidate source-specific E. coli fingerprints are only present during a certain period of the year, which would limit their applicability as potential genotypic markers.
Another approach to evaluating if 16S-23S rRNA ISR-PCR and rep-PCR analyses have the potential to identify fecal sources is by means of jackknife analysis. Despite the limited number of candidate source-specific E. coli fingerprints found, we were able to create a fingerprint library with an average rate of correct classification (ARCC) for the five source categories of 83% for BOX-PCR analysis, 67% for ISR-PCR analysis, and 84% for BOX- plus ISR-PCR analysis. According to Harwood et al. (26), policy makers find correct classification rates of 60 to 70% very useful. Sometimes water quality managers are primarily interested in discriminating between animal and human contamination and consider determining the major source(s) of animal contamination a secondary objective. When the data were pooled into a human-derived (raw sewage) and an animal group, the ARCC value remained similar for BOX-PCR analysis (82%) but increased from 67 to 72% for ISR-PCR analysis.
BOX-PCR analysis resulted in excellent classification rates for all fecal sources except raw sewage, which still had a relatively good classification rate of 68%. ISR-PCR analysis resulted in low classification rates for raw sewage and cow and high classification rates for horse, gull, and dog. Raw sewage streams can contain small amounts of other fecal sources due to storm water runoff in the sewer system (30). This explains why, for both techniques, most raw sewage fingerprints were incorrectly classified during jackknife analysis. This inherent mixed character can be circumvented with human feces as the source for human fecal contamination instead of raw sewage (7, 15). Nevertheless, we used raw sewage because it is one of the most important human fecal contamination sources at the Belgian coast.
The high classification rates for gull and dog obtained with BOX- and ISR-PCR analysis should be interpreted with caution. During jackknife analysis each fingerprint present in the library is individually taken out of its fecal source group and reassigned to a fecal source group by comparing maximum similarity of the fingerprint with all the other fingerprints remaining in the library. It was found that the different source individuals for gull and dog did not have BOX and ISR fingerprints in common and each source individual contained multiple identical BOX and ISR fingerprints. Therefore it is more likely that gull or dog fingerprints are correctly classified since identical fingerprints are still present in the gull and dog fecal group. Jackknife analysis does not correct for such information. This demonstrates the necessity to use an external validation data set of E. coli BOX and ISR fingerprints out of an additional sample for each fecal source before this fingerprint library can be applied in the field.
The Chao1 estimator indicated that enough isolates were analyzed to characterize the richness of the E. coli population with BOX- and ISR-PCR for horse, gull, and raw sewage. For these sources, 32 isolates were sufficient to obtain a richness estimate independent of sample size. This number includes a safety margin for horse and raw sewage, since the Chao1 estimator leveled off before 32 isolates. However, for dog and cow the number of isolates differed according to the technique used. Additional BOX fingerprints for cow and ISR fingerprints for dog would be obtained when more isolates would be analyzed. In contrast to our approach, some studies obtained only one to three E. coli per individual, and multiple individuals were sampled to obtain a larger number of E. coli isolates (7, 15). With this approach, the composition of the E. coli population within one individual of a fecal source remains unknown, and isolates which are thought to be specific for one source might be common E. coli clones between several sources. In this study, generally 32 isolates of a fecal source were sufficient to characterize the richness of the E. coli population from that source. It is not clear how many individuals need to be sampled to characterize the richness of the E. coli population between different individuals of a fecal source.
The BOX and ISR fingerprints were stable in natural water over a 150-day period at all incubation temperatures. Since the water temperature at the Belgian coast varies from 4 to 20°C during the year, it was decided to incubate the E. coli isolates at 4, 12, and 28°C. The isolates were incubated for 150 days under low-nutrient conditions because Naas et al. (37) found a large amount of genetic variation in E. coli isolates under conditions of starvation, which further increased with the length of time spent in the stationary phase. Our results do not correspond with the findings of Wittam (47), who used multilocus enzyme electrophoresis. The results of the multilocus enzyme electrophoresis analysis indicated substantial changes in E. coli population composition during transition from the host to the external environment. Given the dependence of multilocus enzyme electrophoresis on loci and operating conditions (41), it is difficult to compare his results to our fingerprint results. Therefore, the stability of genotypic markers in E. coli during transition from the host to the external environment may be dependent on the microbial source tracking method used.
In conclusion, 16S-23S rRNA ISR-PCR and rep-PCR analyses of E. coli isolates have the potential to identify nonpoint sources of fecal pollution. A fairly small number of isolates was needed to find candidate source-specific E. coli fingerprints that were stable under the simulated environmental conditions. In view of automation, direct detection of source-specific genotypic markers in water samples without isolation holds promise. However, to make this approach widely applicable, the possible temporal diversity, geographical limitations, and survival of these source-specific genotypic markers in the environment need to be assessed.
| ACKNOWLEDGMENTS |
|---|
We thank Astrid Van Bossche for technical assistance and Wim De Windt, Nico Boon, Liesbeth De Vos, Sofie Dobbelaere, Lutgart Raskin, Kurt Sys, and Kristof Verthé for critically reading the manuscript.
| FOOTNOTES |
|---|
Present address: Department of Soil Science, University of Saskatchewan, Saskatoon, Saskatchewan S7N 5A8, Canada. ![]()
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| J. Bacteriol. | Microbiol. Mol. Biol. Rev. | Eukaryot. Cell | All ASM Journals |
|---|