Previous Article | Next Article ![]()
Applied and Environmental Microbiology, February 2005, p. 867-875, Vol. 71, No. 2
0099-2240/05/$08.00+0 doi:10.1128/AEM.71.2.867-875.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
National Research Council Plant Biotechnology Institute,1 Department of Animal and Poultry Science, University of Saskatchewan,2 Prairie Swine Centre, Inc., Saskatoon, Saskatchewan,3 UBC Centre for Disease Control and Department of Pathology and Laboratory Medicine, University of British Columbia, Vancouver, British Columbia, Canada4
Received 18 May 2004/ Accepted 16 September 2004
|
|
|---|
|
|
|---|
We have previously demonstrated that the chaperonin-60 gene is a useful alternative to the 16S rRNA gene as a target for microbial population studies (19). The chaperonin-60 gene (also known as groEL or hsp60) encodes the 60-kDa type I chaperonin found in all eubacteria and eukaryotes. Degenerate PCR primers have been designed that amplify an approximately 550-bp region of the gene corresponding to nucleotides 274 to 828 of the Escherichia coli chaperonin-60 sequence. This primer pair amplifies the chaperonin-60 sequence from eubacteria and eukaryotes and has been successfully applied to the problem of microbial species identification (13-15, 20) and in the creation of PCR product libraries for the characterization of complex microbial communities (19). One advantage of chaperonin-60 as a target is that between closely related taxa, the chaperonin-60 gene contains more phylogenetically informative sequence information than 16S rRNA genes of the same taxa, permitting discrimination of species (14, 21, 25) or subspecies and serotypes (4). A database of chaperonin-60 sequences (cpnDB) is available at http://cpndb.cbr.nrc.ca (18).
The need to identify alternatives to the antimicrobials currently used for growth promotion and prophylactic control of intestinal pathogens in pigs (12, 26) has led to increased interest in understanding the role of the normal intestinal flora in exclusion of epizootic pathogens and promotion of animal health. To this end, the pig gastrointestinal tract microbiota has been the focus of previous 16S rRNA sequence-based studies (22, 31) undertaken with the restricted objective of cataloguing member species of the intestinal tract. The next step in molecular profiling of complex microbial communities is to work toward characterization of population dynamics or community responses to perturbations, which was previously impossible with labor-intensive culture-based methods.
In the current study, we constructed and sequenced chaperonin-60-based PCR product libraries and used phylogenetic analysis to profile the microbial communities of pigs fed diets in which corn, wheat, or barley was the major cereal ingredient. These cereal grains vary markedly in chemical composition (8), significantly modify intestinal microbial colonization (28, 7), impact susceptibility to enteric infection (30), and may affect the efficacy of nonantibiotic feed additives such as direct-fed microbials (2), oligosaccharide prebiotics (9), and organic acids (6). Although the composition of microbial communities along the length of the intestine was of interest, we elected to sequence a large number of clones from a single location in order to maximize the representation of bacteria present in the community and to enable legitimate comparison of library composition among diets. The ileum was targeted as the major site of nutrient absorption in the pig, where microbial communities are less complex than in more distal regions and biologically relevant effects of diet might be more easily identified. For this comparative study, we also wanted to validate the extent to which microbial profiles derived from the library studies, particularly the quantitative value of the frequency of recovery of sequences within a taxon, are true representations of population. We used a quantitative PCR approach to develop molecular tools for the monitoring of taxonomic groups of interest within the pig ileum populations to validate representation of the targeted sequences in the cloned libraries.
|
|
|---|
Collection of ileal digesta.
Pigs were euthanized by CO2 asphyxiation and exsanguination, and their intestinal tracts were removed. The small intestine was dissected from the mesentery, and the mid-ileum was identified as 75% of the distance between the duodenum and the ileo-cecal junction. Digesta (approximately 2 to 3 g) from this location were collected and stored on ice until frozen at 80°C prior to analysis.
Genomic DNA isolation.
For each sample, 200 mg of digesta was placed in a bead-beating tube (FastDNA; Bio 101, Carlsbad, Calif.) with Lysis Matrix A (Bio 101) along with 30 µl of 10% (wt/vol) sodium dodecyl sulfate, 3 µl of proteinase K (20 mg/ml), and Tris-EDTA buffer (10 mM Tris-Cl, pH 8.0, 1 mM EDTA) to a total volume of 600 µl. Sample tubes were mixed by vortexing and incubated at 37°C for 2 h; 600 µl of phenol-chloroform-isoamyl alcohol (25:24:1, vol:vol:vol) was added to each tube, followed by processing in a FastPrep Instrument 120 (Bio 101) bead-beating apparatus 10 times for 20-s intervals with periodic cooling of sample tubes on ice. After bead beating, 0.1 volume of 3 M sodium acetate, pH 5.0, was added to each tube, lysates were mixed by inversion and cooled on ice for 10 min. Lysates were clarified by centrifugation at maximum speed in a microcentrifuge for 10 min. Clarified lysates were transferred to fresh tubes, and nucleic acids were precipitated by the addition of 1 volume of cold isopropanol. Nucleic acids were pelleted by centrifugation at maximum speed in a microcentrifuge for 10 min, after which the supernatant was removed and pellets were washed once with 70% (vol:vol) ethanol. Washed pellets were air dried for approximately 20 min and resuspended in 200 µl of Tris-EDTA buffer.
PCR and creation of PCR product libraries.
PCRs were performed for each of the three genomic DNA templates as described previously (19) with the following exceptions: four reactions were performed with each template at annealing temperatures of 42, 46.5, 50.4, or 56°C. PCRs at the four different annealing temperatures were done in parallel on a Stratagene Gradient-40 Robocycler. PCR products produced at each of the four annealing temperatures were pooled by volume, agarose gel purified, and ligated into T-A cloning vector pCR2.1-TOPO (Invitrogen). Ligation mixtures were used to transform E. coli strain JM109. The three resulting libraries, corn, barley, and wheat, were plated on Luria-Bertani (LB) with ampicillin and 5-bromo-4-chloro-3-indolyl-ß-D-galactopyranoside (X-Gal); 1,248 white colonies were picked randomly for each library. Colonies were picked into 96-well plates containing 100 µl of LB with 100 µg of ampicillin per ml and incubated overnight at 37°C. After overnight incubation, 100 µl of 30% (vol:vol) glycerol were added to each well and the cultures were stored at 80°C.
Plasmid DNA isolation, sequencing, and analysis.
Template preparation and sequencing reactions were performed robotically as described previously (19). Each template was sequenced with T7 and M13-RP primers. Sequencing reactions were resolved on an ABI 3700 capillary sequencing device. All sequence data assembly, analysis, and storage were done with software available through the Canadian Bioinformatics Resource (http://www.cbr.nrc.ca). Raw sequence data were assembled into contigs with PreGap4 (version 1.1) and Gap4 (version 4.6) in the Staden software package (release 2000.0: J. Bonfield, K. Beal, M. Betts, M. Jordan, and R. Staden, 2000). Contig sequences were compared to cpnDB (18), a database of approximately 3,000 chaperonin-60 sequences, with FASTA and BLASTp. Sequence data, template information, and similarity results were placed in an mySQL database for storage and further analysis. Sequence manipulations such as format changes and amino acid translations were done with the EMBOSS software suite (25). Sequence alignments were done with Clustalw (39) and viewed with GeneDoc (27).
Phylogenetic analysis was done with programs in the PHYLIP software package (J. Felsenstein, 1993, PHYLIP [Phylogeny Inference Package] version 3.5c, distributed by the author, Department of Genetics, University of Washington, Seattle). Specifically, alignments were sampled for bootstrap analysis with Seqboot, distances were calculated with the PAM option of Protdist (for peptide sequences) or the maximum-likelihood option of Dnadist. Dendrograms were constructed from distance data by neighbor joining with neighbor. Consensus trees were calculated with Consense and branch lengths were superimposed on consensus trees with Fitch. Completed trees were viewed with TreeView (28).
Quantitative PCR assays.
Quantitative PCR assays were developed to quantify clusters of bacteria possessing closely related (
95% identity among members) chaperonin-60 sequences. These were identified by phylogenetic analysis and designated B1 (70% sequence identity with Bacillales family), C1 (80% sequence identity with Clostridium spp.), S1 (95 to 100% identity with Streptococcus alactolyticus ATCC 43077), and L10 (95 to 100% sequence identity with Lactobacillus amylovorus ATCC 33198) as further described in the Results section. Signature Oligo software (LifeIntel Inc., Port Moody, Canada) was used to identify chaperonin-60 nucleotide sequence "signatures" unique to each target cluster compared with neighboring taxa.
Compatible PCR primers were designed to anneal at the location of signature sequences with Oligo 6 primer analysis software (Molecular Biology Insights Inc., Cascade, Colo.). Primer specificity was checked by comparison with corn, barley, and wheat library sequence data and typed strain reference data with BLAST (1) configured for short, nearly exact matches. Specificity was confirmed by PCR amplification of plasmids containing target cluster chaperonin-60 sequences but not closely related sequences.
The forward (f) and reverse (r) primer sequences for each selected cluster were B1f, 5'-TGCAGGAGCAAATCCAATGAT-3'; B1r, 5'-GCATGGCTTCGGCAATTAAA-3'; C1f,,5'-GCTGTTGATGTAGCAGTTGA-3'; C1r, 5'-ATAACCCCTTCGTTTCCTAC-3'; S1f, 5'-TTGACGTGGTTGAAGG-3'; S1r, 5'-GTTTTCAAGACTTCTTCAAGCAA-3'; L10f, 5'-AAGCTGCCGTTGATGAATTAC-3'; and L10r, 5'-AGCGTCAGCGATTAAGTCACC-3'. A TaqMan fluorescent probe was designed for L10 (5'-TTTAGATGCTGATGAAACAGCAGCTACGTT-3') that was labeled at the 5' end with 6-carboxyfluoecein and at the 3' end with Black Hole Quencher-1 (Integrated DNA Technologies, Inc., Coralville, Iowa).
For the C1, B1, and S1 assays, quantitative PCR was performed with the iCycler iQ real-time PCR detection system (Bio-Rad Laboratories, Hercules, Calif.) and a final reaction mix containing iQ SYBRGreen Supermix (Bio-Rad), 500 nM each primer, plasmid calibration standards or genomic DNA isolated from ileal contents, and adjusted to a final volume of 25 µl. Amplification conditions were 95°C for 2 min, followed by 40 cycles of 60 s at 95°C and 60 s at 62°C (B1), 58°C (C1), or 68°C (S1). In the case of the L10 diagnostic, the reaction contained Platinum Quantitative PCR SuperMix-UDG (Invitrogen, Carlsbad, Calif.), 500 nM each primer, 200 nM probe, and genomic calibration standard or ileal contents genomic DNA in a total volume of 25 µl. PCR conditions were 50°C for 3 min and 95°C for 2 min, followed by 40 cycles of 30 s at 95°C and 30 s at 61°C.
Calibration standards for the C1, B1, and S1 assays were developed with a 10-fold dilution series of plasmids containing chaperonin-60 sequence representatives of each target cluster. Plasmid copy number was calculated from plasmid molecular weight, and plasmid concentration was measured with Picogreen (Molecular Probes) with a Fluoroskan Ascent FL fluorometer (Thermo Labsystems). For the L10 assay, calibration standards were developed with a 10-fold dilution series of Lactobacillus amylovorus (ATCC 33198) genomic DNA; the number of target genomes was calculated with an estimated genomic weight of 1.2 x 109 g/mol (based on a genome size of 1,850 kbp) (11). The number of target bacteria genomes per gram of ileal contents was calculated by plotting threshold cycle value versus calibration standard copy number and interpolating the threshold cycle value obtained for genomic DNA isolated from ileal contents with the iCycler iQ optical system software (version 3.0a). Corrections were made for sample dilution during extraction of DNA and PCR. Calibration standards and unknown samples were assayed in triplicate.
Nucleotide sequence accession number.
The sequences of the cloned chaperonin-60 PCR products and relevant reference strain chaperonin-60 sequences have been deposited in GenBank and assigned accession numbers AY562569 to AY562944 inclusive. Sequences can also be retrieved from cpnDB (http://cpndb.cbr.nrc.ca) (18).
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Number of sequences generated for each library
|
![]() View larger version (17K): [in a new window] |
FIG. 1. Taxonomic composition of corn (C), barley (B), and wheat (W) clone libraries. A. Percentage of clones recovered from corn, barley, and wheat libraries classified by taxonomic group, determined by comparison to the reference database of chaperonin-60 sequences. Total library sizes were 909, 930, and 915 clones for corn, barley, and wheat, respectively. B. Comparison of log10 numbers of clones in each taxonomic category recovered from the corn, barley, and wheat libraries.
|
Table 2 summarizes the taxonomic assignments of sequences recovered at least 10 times from the library sequence data set. Although the five most abundant sequences were very similar to each other with pairwise similarities of 99% (maximum of five nucleotide differences between any pair), each one was amplified independently from all three genomic DNA templates. A multiple sequence alignment of these five sequences indicates that of the six variable positions, five are synonymous substitutions, resulting in identical amino acid translations. Of the 33 sequences shown in Table 2, 30 were recovered from all three libraries, one (represented by clone b02_b02) was recovered from barley and wheat only, one (represented by c02_e05) was recovered from corn and wheat only, and one (represented by c13_h07) was recovered from corn and barley only.
|
View this table: [in a new window] |
TABLE 2. Names, nearest reference sequences, and frequency of recovery for clones recovered at least 10 times
|
Pooling all sequence data from the three libraries resulted in the identification of 372 different nucleotide sequences. To further compare the contents of the corn, barley, and wheat libraries, a multiple sequence alignment and distance matrix were calculated for representatives of each of the 372 different nucleotide sequences. The resulting DNA distance matrix was neighbor joined to produce the tree in Fig. 2. Sequence clusters were defined as indicated in the figure, based on distance. The cluster name indicates the taxonomic assignment of the cluster: C, Clostridiales; B, Bacillales; G, gamma-proteobacteria; S, Streptococcus; and L, Lactobacillales. For each cluster, the library frequencies of each individual sequence in the cluster were combined to produce a corn:barley:wheat ratio for the cluster. For example, sequences belonging to the Lactobacillales-like cluster L2 were recovered twice from the corn library, 36 times from the barley library, and nine times from the wheat library. Clusters C1 (88:18:69), B1 (16:0:7), S1 (1:48:4), and L10 (506:686:616) were chosen as target groups for quantitative PCR with the goal of validating the clone ratios.
![]() View larger version (18K): [in a new window] |
FIG.2. Phylogenetic analysis of 372 unique nucleotide sequences recovered from the corn, barley, and wheat libraries. The tree was calculated with a maximum-likelihood distance formula and neighbor joining. Clusters of similar sequences within the tree are named according to the identity of the nearest database neighbors of the group (C = Clostridiales, B = Bacillales, G = gamma-proteobacteria, L = Lactobacillaceae, S = Streptococcus). The scale bar indicates 0.1 substitution per site. Numbers following the group names indicate the number of clones falling into that sequence group recovered from the corn:barley:wheat libraries. For example, C1 group sequences were recovered 88 times from the corn library, 18 times from the barley library, and 69 times from the wheat library.
|
|
View this table: [in a new window] |
TABLE 3. PCR quantification of target taxa B1, C1, S1, and L10 in pooled ileal contents of pigs fed corn (C), barley (B), or wheat (W) and frequencies of recovery of corresponding clones
|
![]() View larger version (24K): [in a new window] |
FIG. 3. Determination of the prevalence of L10 genomes in each individual animal fed diets based on corn (n = 15), wheat (n = 15), or barley (n = 15). A. Number of genomes (log10) detected in each individual per gram of ileal digesta. The means of the 15 observations within each group were: corn-fed individuals, 7.58; barley-fed individuals, 8.83; and wheat-fed individuals, 9.53.
|
|
|
|---|
The chaperonin-60 gene is an attractive target both for cataloging microbial diversity with clone libraries and for targeting specific taxa with quantitative PCR methods. The 549- to 567-bp region amplified with chaperonin-60 universal PCR primers is easily sequenced with a single sequencing reaction and is phylogenetically informative, often providing sufficient discriminating information to distinguish subspecies or serotypes of a species (4). Another important feature of the chaperonin-60 universal target region is that it is uniformly variable, lacking the alternating conserved and variable regions that characterize the 16S rDNA sequence, generating significant secondary structure and facilitating PCR artifacts such as chimeras which are a confounding factor in 16S rRNA-based studies (32, 42).
Sequence variation introduced by Taq polymerase infidelity is an unavoidable pitfall of PCR-based methods. However, most of the abundant sequences in the libraries were independently amplified from each of the three template pools (Table 2). Among the clusters of nearly identical sequences, variable positions are significantly skewed towards the third codon position (19; this study). Also, no in-frame stop codons were identified in any of the partial chaperonin-60 sequences. These three observations suggest that while Taq polymerase infidelity no doubt introduces some artificial diversity, it is not a major distorting factor and also that a protein-encoding target sequence offers advantages over structural RNA-encoding targets with respect to monitoring and assessing PCR artifacts.
The overall structure of the ileum microbial population described in this report corresponds to previous characterizations of ileal microflora. The most predominant taxa in this community are the low-G+C gram-positive organisms, particularly the Lactobacillales family, which includes Lactobacillus and Pediococcus spp. among others. Smaller numbers of other low G+C gram-positive bacteria, such as the Clostridiales and Bacillales, and yet smaller numbers of gamma-proteobacteria were also identified. The pattern of relative abundances of these groups illustrated in Fig. 1B corresponds to the colony enumeration results for the samples published previously (7), as well as to the observations made by Leser et al. (22). These results also support observations that the ileal microflora is distinct in composition from populations associated with the cecum, colon, or feces (19, 22, 31, 34), where microbial communities are more diverse and contain higher numbers of gram-negative bacteria, such as the Bacteroides class. The relative lack of microbial diversity in the ileum compared to the more distal regions of the gut is also illustrated by the fact that we observed a relatively small number of sequences (14% of total unique sequences) occurring at high frequency (accounting for 81% of clones examined).
One of the major potential pitfalls of PCR-based studies of microbial communities is biased amplification of different sequences during PCR. The results of quantitative PCR assays of four different targets identified in the ileum populations indicate that the relative abundances of these targets in the three targeted populations are accurately represented by comparison of the frequency of chaperonin-60 clone recovery among the libraries (Table 3). The quantitative PCR study results also give an indication of the sensitivity of the library method. Targets that were detected at levels of 1 to 10 clones in the libraries, such as the S1 group in the corn and wheat libraries, were determined to be present in the digesta samples at 2.1 x 105 and 7.4 x 105 S1 genomes/g of contents, respectively. Consistent with this, the B1 target was not found in the barley library, but 2.3 x 105 B1 genomes/g of ileum contents were detected by quantitative PCR. These results suggest that the sequencing of approximately 1,000 clones from each chaperonin-60 library is sufficient to detect organisms present at >105 CFU/g of sample. This sensitivity level is likely also the explanation for why organisms such as E. coli that are reported to be found in the small intestine at levels at or below this threshold were not detected.
One of the most intriguing results of this study is the observation of variability in the abundance of L10 group organisms in individual pigs. In the genomic DNA pools used to create the corn, barley, and wheat libraries, L10 sequences were by far the most abundant (Table 2 and Fig. 2). However, enormous variability (from undetectable to 4 x 1010 genomes/g of ileum contents) was detected in the corn library (Fig. 3). The L10 quantitative PCR primers were designed to specifically target this group and fail to amplify any related but different targets. Thus, it is possible that minor strain variations in the dominant flora of individual animals account for the differences in L10 detection. It is also possible that variation in sampling location, efficiency of extraction of DNA, and/or variation in copurification of PCR inhibitors could account for some animal-to-animal variation. Others have suggested that the major bacterial groups remain constant among individual pigs fed the same diet (23), but those observations were made based on qualitative profiling methods which, at 106 to 1010 genomes/g, would likely have detected L10 in all pigs in the current study with the exception of the one pig fed the corn diet.
Variation in gut microflora associated with lifestyle and diet in humans and other animals has been described (3, 10, 17, 29), but there is a lack of information regarding differences between individuals. In human subjects persistent individual variation in microbial profile, including individual variation in the dominant Lactobacillus sp., has been noted (37, 38). We were surprised, however, to observe significant individual variation in abundance of the dominant L10 Lactobacillus taxon in pigs of the same age and same genotype, housed in the same room, provided identical diets, and managed according to the same protocol. In agreement, however, considerable variation in denaturing gradient gel electrophoresis banding pattern among individual broiler chickens subject to similar constants of genotype, diet, and housing has been reported (40). It is intriguing to suggest that these differences contribute to animal variation in health or nutrient digestion (e.g., fat metabolism) (12, 37). The development of a broad range of molecular tools for profiling microbial populations is a major step in defining individual variation in microflora composition. These differences are deserving of focused research since they are potentially a major confounding factor in the development of methods and products for the manipulation of these populations for improved health, performance, and pathogen resistance.
We anticipated that diet composition would affect the ileum microflora of pigs and hypothesized that these differences could be detected and quantified with a combination of sequence-based and quantitative PCR-based methods. Differences in the relative abundances of taxa and, perhaps more importantly, the diversity within taxa were indeed detected, particularly within Streptococcus spp. and the Clostridiales. The small sample size and the restriction of sample collection to one time point in this study limit our ability to draw specific conclusions regarding the biological significance of the variation in microbial populations associated with diet composition. However, the sequence libraries and quantitative PCR tools which we continue to develop present the opportunity to conduct more focused in vivo studies aimed at relating microflora shift to specific environmental variables, animal performance, and health outcomes.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»