Previous Article | Next Article ![]()
Applied and Environmental Microbiology, May 2003, p. 2755-2764, Vol. 69, No. 5
0099-2240/03/$08.00+0 DOI: 10.1128/AEM.69.5.2755-2764.2003
Copyright © 2003, American Society for Microbiology. All Rights Reserved.
Biology and Biotechnology Research Program, Lawrence Livermore National Laboratory, Livermore, California 94551,1 Bioscience Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545,2 Department of Biological Sciences, Northern Arizona University, Flagstaff, Arizona 86011;,3 Center for Environmental Biotechnology, Lawrence Berkeley National Laboratory, Berkeley, California 947204
Received 15 October 2002/ Accepted 13 February 2003
|
|
|---|
|
|
|---|
Extensive genetic diversity among environmental isolates of B. cereus and B. thuringiensis has been demonstrated by pulsed-field gel electrophoresis (6), multienzyme electrophoresis (6, 15), and amplified fragment length polymorphism (AFLP) analysis (38). AFLP analysis proved sensitive enough to classify B. cereus and B. thuringiensis into five phylogenetic groups (38). Significantly, the American Type Culture Collection (ATCC) reference strains of B. cereus and B. thuringiensis did not seem to be represented in the collection of environmental isolates used in that study. There was little correlation between species designation and the five phylogenetic groups identified by AFLP analysis, indicating significant genetic variability within and between B. cereus and B. thuringiensis. Furthermore, in two separate studies, the B. anthracis strains appeared to cluster together with a group containing periodontal B. cereus pathogens (16, 38).
In contrast to the genomic diversity within B. cereus and B. thuringiensis, B. anthracis appears to be genetically clonal (19). Genetic identity among different B. anthracis isolates has necessitated fastidious analysis of their genomes for strain identification. Accurate discrimination is now possible by analysis of variable-number tandem repeats, which enumerates small tandem repeats at several locations in the B. anthracis genome (3, 20). A collection of 426 B. anthracis environmental isolates that contains representatives from worldwide origins was subdivided by variable-number tandem repeat analysis into just six genetically distinct groups (20).
Several strains of B. cereus and B. thuringiensis which appear to be genetically related to B. anthracis have been isolated (15, 16, 38). Isolation and characterization of the genome regions unique to B. anthracis will provide clues to its genetic relationship to these strains and ultimately direct pathogenicity studies. AFLP analysis provides a rapid method for measuring phylogenetic distances. AFLP analysis generates a strain-specific fingerprint of amplified DNA fragments that demonstrate genomic variations in a microbial population based on an analysis of a portion of their genome sequences and DNA fragment length polymorphisms. The great advantage of AFLP analysis is its ability to analyze rapidly many loci, resulting in a phylogenetic resolution higher than those obtained with other methods. However, AFLP analysis provides little information about the genetic differences responsible for these polymorphisms. Suppression subtractive hybridization (SSH) is a highly efficient technique for the isolation and characterization of the large genomic differences that often drive bacterial genome evolution (1, 2). SSH reveals DNA sequence differences that are responsible for many AFLPs.
Unique genomic differences can be exploited as "DNA signatures" for the discrimination of B. anthracis from its closest relatives. Plasmid-encoded toxin genes have proved a useful source of targets for rapid DNA-based assays (4). However, a plasmid-based test may not detect the occurrence of non-plasmid-containing strains of B. anthracis, which have been isolated from the environment (39). Furthermore, plasmids can be readily engineered and can be transferred to other bacteria for heterologous gene expression (5, 10, 37), and there is concern that pXO1 and pXO2 sequences that are present in other Bacillus strains may be encountered in previously uncharacterized genomes (25). There are many published examples of chromosomal regions that can be examined for B. anthracis identification, but all require time-consuming downstream analysis (3, 7, 18, 26). Two recent examples of real-time PCR targeting of the rpoB gene of B. anthracis were useful but reported examples of false-positive results with some strains of B. cereus and targeted only a single locus (11, 28).
This report demonstrates the phylogenetic relationships of a collection of non-anthrax-causing Bacillus species to each other and to B. anthracis, as determined by AFLP analysis. SSH with B. anthracis and three of these close relatives identified a set of unique DNA regions that represent genomic differences between B. anthracis and these non-anthrax-causing Bacillus species. One immediate product of these experiments was a robust set of chromosomal DNA signatures which are capable of quickly detecting all six genetically distinct groups of B. anthracis (20) and which can be used with any rapid DNA-based detection platform. A multiplex PCR analysis with four separate loci of the B. anthracis chromosome provides a rapid and highly specific means for the identification of B. anthracis.
|
|
|---|
Preparation of genomic DNA.
The bacterial strains used for genomic DNA preparation and their geographic origins, where known, are listed in Table 1. DNA was extracted from cell pellets from overnight cultures grown in L broth by using a MasterPure DNA purification kit (Epicentre, Madison, Wis.) according to the manufacturer's instructions. Genomic DNA was prepared from enteric bacteria by using Wizard Genomic DNA Preps (Promega, Madison, Wis.) according to the manufacturer's instructions. Human genomic DNA was purchased from Clontech (Palo Alto, Calif.), and bovine genomic DNA was purchased from Novagen (Madison, Wis.). Soil DNA was extracted by using an UltraClean soil DNA kit (MoBio, Solana Beach, Calif.) according to the manufacturer's instructions. DNA isolated from organisms present in the air was prepared from filters as previously described (31). Both soil and air filter DNAs were prepared from samples originating in Livermore, Calif. After ethanol precipitation, all genomic DNAs were dissolved in 10 mM Tris HCl (pH 8.0) to a concentration of approximately 0.2 µg/ml.
|
View this table: [in a new window] |
TABLE 1. Strains used in this study
|
DNA sequencing.
Nonpurified PCR products were cloned by using a pGEM-T Easy TA cloning kit (Promega). Recombinant clones were picked by using a BioPick automated colony picker (BioRobotics, Woburn, Mass.), and plasmid templates were prepared by boiling lysis and magnetic bead capture with a high-throughput procedure (35). Sequencing of plasmid templates was performed by using an Applied Biosystems Big-Dye Terminator system and either an ABI 377 or an ABI 3700 automated sequencer. The sequencing primers used were 5'-TGTAAAACGACGGCCAGT (forward) and 5'-CAGGAAACAGCTATGACC (reverse). The resulting data were analyzed by using ABI sequencing analysis software, version 3.2, and then assembled and edited by using Phred, Phrap, and Consed 7.0 (14). BLAST searches with the tester-specific DNA sequences were performed by using the National Center for Biotechnology Information website http://www.ncbi.nlm.nih.gov/. BLAST identity was considered significant only when the expected probability of a fortuitous match (E value) was less than 10-4. Comparison of the sequence candidates to plasmids pXO1 and pXO2 and determination of their coordinates on the published B. anthracis genome (32) were performed by using the cross_match program, which is part of the Consed software package (14).
PCRs.
Oligonucleotide primers were designed from the putative tester-specific sequences and were supplied by Sigma-Genosys (The Woodlands, Tex.) or Invitrogen (Carlsbad, Calif.). The primers were designed by using Primer3 software (33); they had a melting temperature of >65°C, contained no more than three identical consecutive nucleotides, and possessed a two-nucleotide 3' GC clamp. The primers were initially screened against genomic DNAs from both the tester and the driver. To determine whether a primer pair was tester specific, 1 ng each of tester- and driver-specific DNAs was used as a template in PCRs with Accuprime polymerase, primers at 10 µM (Invitrogen), and the following cycling parameters: 94°C for 15 s, 65°C for 15 s, and 72°C for 30 s for 32 cycles. The products were visualized on a 1.5% agarose gel run in 0.5x Tris-borate-EDTA; if a product was present with tester-specific DNA as a template and absent with driver-specific DNA, then that sequence was designated tester specific. The tester-specific oligonucleotides were then used to prime PCRs with the B. anthracis, B. cereus, and B. thuringiensis strains listed in Table 1 and the same reaction conditions as those described above. The integrity of the genomic DNA template was tested in all PCRs with primers specific for a region of the 23S gene conserved in Bacillus species: 5'-CTACCTTAGGACCGTTATAGTTAC and 5'-AGGTAGGCGAGGAGAGAATCC. Multiplex PCRs were performed as described above, except that template DNA was added to a final concentration of 10 ng/µl. The primers (see Table 5) were used at the following concentrations: 23S primers at 3 µM (288 bp), dhp73.017 (241 bp) at 20 µM, dhp73.019 (196 bp) at 10 µM, dhp61.183 (163 bp) at 5 µM, and dhp77.002 (133 bp) at 20 µM.
|
View this table: [in a new window] |
TABLE 5. Primers designed from 28 B. anthracis-specific DNA signatures and their coordinates on the B. anthracis A2012 genome (GenBank accession number NC_003995)
|
|
|
|---|
![]() View larger version (16K): [in a new window] |
FIG. 1. Phylogenetic tree of B. anthracis, B. cereus, and B. thuringiensis isolates. The phylogenetic tree was derived from fluorescent AFLP analyses of 12 B. anthracis (Ba), B. cereus (Bc), and B. thuringiensis (Bt) isolates. The phylogenetic tree was based on 34 to 40 amplified DNA fragments per sample (the number per sample is shown in parentheses) generated from EcoRI/MseI digestion of genomic DNAs. Jaccard coefficients for the fragments common in three replicate gels for each isolate were analyzed by using the unweighted pair-group method with arithmetic means to produce the dendrogram.
|
Isolation of tester-specific DNA sequences by SSH.
SSH identifies DNA sequences that are specific to one genome (tester) and absent from the other genome (driver). Typically these DNA sequences range in length from 200 to 1,000 bp and are referred to as difference products. Three subtractions were undertaken to increase the yield and representation of B. anthracis-specific regions. B. cereus ATCC 14579 was selected because it represents one of the two most distantly related strains in this study and would be most likely to yield difference products. B. cereus 3A was chosen because it represents one of the closest relatives of B. anthracis, while B. thuringiensis Al Hakam represents intermediate relatedness.
A total of 256 candidate sequences were generated from subtractions by using genomic DNA from B. cereus ATCC 14579 as the driver (Table 2). Sequences that do not occur on plasmids pXO1 and pXO2 of B. anthracis were of primary interest, since some plasmid sequences are conserved in closely related Bacillus species (25). Sequences identical to pXO1 and pXO2 were eliminated by computer comparisons to published nucleotide sequences (see Materials and Methods) (24). The remaining sequences were used to design oligonucleotide primers for PCR analysis, to determine their representation in the tester and driver genomes. PCR experiments with B. anthracis Ames and B. cereus ATCC 14579 DNA as a template identified 39 B. anthracis Ames-specific sequences (15% of the tester-specific candidates). Six of the tester-specific candidates (2%) did not amplify a product when DNA from the collection of non-anthrax-causing pathogens was used as a template. Similarly, 28 out of 48 sequences (58%) were absent from B. thuringiensis Al Hakam; 8 of these (16%) did not occur in the collection of closely related non-anthrax-causing pathogens (Table 2). Finally, 48 sequences each were generated from two separate subtractions by using genomic DNAs prepared from B. cereus 3A and B. thuringiensis Al Hakam as drivers. A total of 26 out of 48 sequences (54%) were present in B. anthracis Ames but absent from B. cereus 3A; 14 of these did not occur in the collection of closely related non-anthrax-causing pathogens (Table 2).
|
View this table: [in a new window] |
TABLE 2. Isolation of 28 B. anthracis-specific DNA signatures from three SSH experiments
|
Distribution of tester-specific loci in B. cereus and B. thuringiensis.
Genomic variations within the collection of non-anthrax-causing pathogens were observed as the presence or absence of 39 tester-specific regions isolated from the subtraction in which genomic DNA from B. cereus ATCC 14579 was used as the driver and DNA from B. anthracis Ames was used as the tester (Table 3). The tester-specific primers are listed in Table 3 in decreasing order of the number of products amplified from this panel of non-anthrax-causing pathogens. Since, by definition, the driver will always be negative, there will be a maximum of nine loci possible in the 10 non-anthrax-causing Bacillus species. Similarly, the tester will always be positive, so that the minimum number of products possible will be one. The data in Table 3 show that seven primer sets are present in nine templates (excluding the driver), while six candidates (shown in bold type) are represented in B. anthracis only. The columns of data for non-anthrax-causing pathogens are arranged from left to right in Table 3 according to the number of tester-specific sequences seen in their respective genomes. The maximum number of candidates possible for each non-anthrax-causing Bacillus species is 39 for the tester (B. anthracis), while the minimum is 0 for the driver (B. cereus ATCC 14579). Strains that have more markers in common are suggested to be more closely related to B. anthracis. The data in Table 3 show that B. cereus 3A is the most closely related (31 out of 39 sequences), followed by B. cereus S2-8 and D17 (29 out of 39 sequences); as expected, B. thuringiensis ATCC 10792 is the least closely related (10 out of 39 sequences).The presence or absence of these loci is not correlated with the current species designations of B. cereus and B. thuringiensis. Table 4 shows data obtained with primers designed from the subtraction with the most closely related DNA as a driver. Far fewer primer candidates detect the non-anthrax-causing Bacillus species, with 14 (shown in bold type) being found only for B. anthracis Ames.
|
View this table: [in a new window] |
TABLE 3. Presence or absence of B. anthracis Ames-specific nucleotide sequences in the genomes of 10 non-anthrax-causing Bacillus pathogens as determined by PCR with B. cereus ATCC 14579 as the driver
|
|
View this table: [in a new window] |
TABLE 4. Presence or absence of B. anthracis Ames-specific nucleotide sequences in the genomes of 10 non-anthrax-causing Bacillus pathogens as determined by PCR with B. cereus 3A as the driver
|
All but 1 of the 28 B. anthracis-specific DNA sequences can be mapped to the 5.23-Mb genome of B. anthracis A2012 (GenBank accession number NC_003995) (32). BLASTX analysis showed that 24 of the remaining 27 DNA sequences map to the open reading frames defined therein, while the remainder map to intergenic regions. The coordinates for nucleotide identity to B. anthracis A2012, open reading frame identity, and gene identity are listed in Table 5. BLASTX identities with E values of less than 10-3 are also listed. BLAST data for the candidates from the B. cereus ATCC 14579 subtraction show that three of the six DNA sequences have no previously ascribed function, based on similarity searches of the GenBank database. One of the remaining three shows identity with the S-layer protein of B. anthracis, and the other two show identity with a hypothetical phage protein from Streptococcus pyogenes and a galactosyltransferase-related protein from Clostridium acetobutylicum. BLAST data for the candidates from the B. thuringiensis Al Hakam subtraction show that four of the eight DNA sequences have no previously ascribed function. The remaining four share sequence identity with a hypothetical protein in B. halodurans, a penicillin binding protein of B. cereus, a putative phage terminase of C. perfringens, and a cytosine-specific methyltransferase of B. halodurans. BLAST data for the candidates from the B. cereus 3A subtraction show that 9 of the 14 DNA signatures have no previously ascribed function. The remaining five show sequence identity with the Staphylococcus aureus terminase large subunit, a hypothetical protein from the nanH region and an ATP binding cassette transporter of C. perfringens, a glucosamine synthetase of B. subtilis, and an unknown conserved protein of B. halodurans.
Figure 2 provides a visualization of the distribution of the tester-specific sequences for each of the three subtractive hybridization experiments. There are five regions that have more than two B. anthracis-specific loci that lie within 50 kb of each other, suggesting genomic islands that are found only in B. anthracis. Similar genomic islands have been observed when genomes of different strains of the same bacterial species have been compared (e.g., Y. pestis [8, 29] and E. coli [27]).
![]() View larger version (35K): [in a new window] |
FIG. 2. Graphic representation of the locations of the tester-specific clones on the B. anthracis A2012 genome. The outermost circle maps the locations of the clones isolated from the subtraction by using DNA from B. cereus ATCC 14579 as the driver (circles), the second circle maps clones isolated from the subtraction by using DNA from B. thuringiensis Al Hakam as the driver (diamonds), and the third circle maps clones isolated from the subtraction by using DNA from B. cereus 3A as the driver (squares). The innermost circle shows the coordinates of the B. anthracis genome (5.23 Mb); the arrow shows the location of the first nucleotide (32). The sequences that are seen in B. anthracis but that are not seen in any non-anthrax-causing pathogens are represented by closed symbols. Gray boxes A to E, adjacent to the innermost circle, indicate the five regions that contain more than two B. anthracis-specific DNA sequences within 50 kb of each other; these represent putative B. anthracis-specific genomic islands.
|
![]() View larger version (54K): [in a new window] |
FIG. 3. Multiplex analysis of four separate loci on the B. anthracis genome. Four B. anthracis-specific primers yielded predicted products of 133, 163, 196, and 241 bp. One internal positive control yielded a predicted product of 288 bp. The DNA templates used for the multiplex analysis were as follows: lanes 1, 12, and 13, size markers as indicated; lane 2, B. cereus ATCC 14579; lane 3, B. cereus ATCC 4342; lane 4, B. cereus D17; lane 5, B. cereus 3A; lane 6, B. cereus S2-8; lane 7, B. cereus F1-15; lane 8, B. thuringiensis 97-27; lane 9, B. thuringiensis Al Hakam; lane 10, B. thuringiensis ATCC 10792; lane 11, B. thuringiensis HD-571; lane 14, B. anthracis G3; lane 15, B. anthracis G20; lane 16, B. anthracis G25; lane 17, B. anthracis G29; lane 18, B. anthracis G38; lane 19, B. anthracis G62 (Ames); lane 20, B. anthracis G67; lane 21, B. anthracis G77 (Vollum); lane 22, B. anthracis G80; lane 23, B. anthracis G87; and lane 24, no-template negative control.
|
|
|
|---|
AFLP analysis was used to reveal genetic diversity among non-anthrax-causing Bacillus strains and to determine which are most closely related to the highly pathogenic B. anthracis. The phylogenetic tree shown in Fig. 1 indicates that B. cereus 3A, B. cereus S2-8, and B. thuringiensis 97-27 are the strains most genetically similar to B. anthracis of the 10 strains examined. B. cereus ATCC 4342 was shown previously by multienzyme electrophoresis (16) and AFLP analysis (38) to be very closely related to B. anthracis. The AFLP analysis presented here shows that seven strains are even more closely related to B. anthracis. The relationships do not correlate with the species designation for B. cereus or B. thuringiensis, providing another example of how the species designations for B. cereus and B. thuringiensis appear to cross the boundaries of various phylogenetic analyses (15, 38). It may be necessary to develop new criteria for species designations within this group, as information about more Bacillus genomes becomes available.
Although AFLP analysis provides a sensitive method for defining the genetic relationships between bacterial genomes, it provides no information regarding the genetic rearrangements responsible for these differences. Such information can be attained by in vitro genome comparison by SSH, a proven, efficient method for the identification of nucleotide sequences that differ between two genomes (1). The complementary techniques of AFLP analysis and SSH provide a powerful means of defining accurate phylogenetic models and characterizing their underlying genetic components. Three subtractions were performed to identify B. anthracis-specific nucleotide sequences that were absent from the non-anthrax-causing Bacillus species. B. cereus ATCC 14579 represents the most distantly related strain in this study. B. cereus 3A was chosen because it represents one of the closest relatives of B. anthracis. The third strain, of intermediate relatedness, was B. thuringiensis Al Hakam. It would be expected that strains with more sequence identity to B. anthracis (in this situation, B. cereus 3A) would yield more sequences that would not be found in the closely related B. cereus and B. thuringiensis. Indeed, the subtraction with B. cereus 3A yielded 29% B. anthracis-specific sequences, compared to 17 and 2% for B. thuringiensis Al Hakam and B. cereus ATCC 14579, respectively (Table 2), confirming the prediction from AFLP analysis.
The genetic diversity demonstrated within this strain collection by AFLP analysis was mirrored by the results of subsequent PCR analysis. Bacterial evolution is driven by rearrangements of large genomic islands associated with lateral gene transfer (21, 23). Analysis of the G+C contents of the difference products reveals an average of 35.4%, typical of B. anthracis. This finding suggests that any lateral gene transfer has ameliorated the G+C content over a long period of evolution or has been received from species with a similar G+C content. The presence or absence of each of the tester-specific sequences in the non-anthrax-causing pathogens was determined by PCR amplification with primers designed for the difference products. If the number of loci shared by B. anthracis indicates relatedness among the whole panel of non-anthrax-causing Bacillus strains, then B. cereus 3A is the most closely related (Table 3). This result is in agreement with the phylogenetic analysis shown in Fig. 1. The phylogenetic tree shows two pairs of isolates that cannot be distinguished by AFLP analysis: Bacillus strains Vollum and Ames and B. cereus strains 3A and S2-8. SSH with B. cereus ATCC 14579 as a driver yielded two primer sets (M.Ctg056 and M.Ctg007) that can distinguish B. cereus 3A from S2-8 (Table 3). Primers from the same subtraction cannot distinguish B. thuringiensis HD-571 from Al Hakam. Table 4 demonstrates that the number of loci that are shared between the non-anthrax-causing Bacillus species and B. anthracis decreases when the driver strain is much more closely related to B. anthracis. This subtraction also yielded two primer sets (dhp73.03 and dhp73.04) that can distinguish B. thuringiensis HD-571 from Al Hakam, which are indistinguishable by the primer sets listed in Table 3.
The most important criteria for effective DNA signatures are the absence of false-positive results with closely related organisms and their representation in all isolates of the target (i.e., no false-negative results). Given the monomorphic nature of the B. anthracis genome, it was not surprising to find that there was no variation in the signatures within the strains of the collection. Twenty-eight B. anthracis-specific candidates isolated in these experiments fulfilled these criteria and are listed in Table 5. None amplified a PCR product from any of the non-anthrax-causing Bacillus pathogens used in this study (no false-positive results). We were also able to exploit a collection of genetically distinct and geographically diverse isolates of B. anthracis. The twenty-eight DNA signatures amplified a PCR product of the predicted size for every isolate. A comparison of the DNA sequences against the completed genomes of B. halodurans and B. subtilis and the unfinished Bacillus genomes showed no significant sequence identity.
The non-anthrax-causing Bacillus species described here are so closely related to B. anthracis that they would be highly likely to give false-positive results in DNA-based identification assays based on chromosomal loci. The isolation of multiple B. anthracis-specific chromosomal regions allowed the development of a single multiplex assay for the rapid and highly specific detection of B. anthracis. The DNA signatures presented here have the advantage over previous detection methods that require time-consuming analysis (3, 7), are prone to false-positive results, and are based on few nucleotide differences at a single chromosomal locus (11, 28).
BLAST analysis of the nucleotide sequences of these DNA signatures shows that many are not represented in current DNA databases, other than the previously reported B. anthracis A2012 genome (Table 5) (32). The strongest identity seen was to the penicillin binding protein of B. cereus (dhp64.177; E value, 10-67). R.Ctg122 showed identity to the S-layer protein of B. anthracis (13). This cell surface protein is also seen in some isolates of B. cereus and B. thuringiensis (22). However, there is sufficient nucleotide sequence divergence at the oligonucleotide primer binding sites to allow for successful discrimination of B. anthracis from the other two species.
The genomes of several strains of B. anthracis are currently being sequenced (12), and these data will be extremely useful for strain attribution in forensic analyses. We envisage that these DNA signatures can be used for real-time specific detection of B. anthracis, the source of which may then be attributed by monitoring the small nucleotide differences identified by these sequencing projects. The B. anthracis-specific DNA sequences identified in this work provide the largest collection of chromosomal markers that distinguish B. anthracis from other closely related Bacillus species. There are five regions in the B. anthracis genome where several of the specific DNA sequences are located within 50 kb of each other. Such genomic islands may define B. anthracis as a species and distinguish it from the closely related species B. cereus and B. thuringiensis. Future detailed analysis of these B. anthracis-specific regions may ultimately identify chromosome-encoded virulence factors, provide starting points for possible vaccine candidates, and help to reveal the mode of pathogenicity of this important pathogen.
We are extremely grateful to Kimothy Smith for assistance with the B. anthracis collection used in this study and Nancy K. Montgomery for preparing genomic DNA. We also appreciate the technical contributions of Aubree Hubbell, Anne M. Erler, Silvia Gamez-Chin, Cheryl Strout, Julie R. Avila, and Linda L. Ott at Lawrence Livermore National Laboratory.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»