Previous Article | Next Article ![]()
Applied and Environmental Microbiology, November 2006, p. 7311-7323, Vol. 72, No. 11
0099-2240/06/$08.00+0 doi:10.1128/AEM.01179-06
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Volker Gürtler,2*
Michael Beer,1 and
Robert J. Seviour1
Biotechnology Research Centre, La Trobe University, Bendigo, Victoria 3552, Australia,1 Microbiology Department, Austin Hospital, Studley Road, Heidelberg, Melbourne, Victoria 3084, Australia2
Received 22 May 2006/ Accepted 7 September 2006
|
|
|---|
|
|
|---|
For C. difficile strain identification, essential for epidemiological studies, culturing and molecular characterization are necessary. Many different typing methods have been applied to strains of this organism, including repetitive sequence-based PCR using repetitive extragenic palindromic primers, multilocus sequence typing, and pulsed-field gel electrophoresis (37, 47, 60), but there seems to be general, if not universal, agreement that PCR ribotyping, where the variable intergenic 16S-23S rRNA spacer region (ISR) is amplified (26, 29, 31), is the most attractive method for strain identification and has been used most frequently for this purpose (7, 10, 21, 42, 62, 65).
Bacterial typing of both pure cultures and environmental samples by fragment analysis of the ISR (PCR-ribotyping) has become a popular method because the ISR is more variable in both its length and sequence than the 16S rRNA gene (1). However, PCR-ribotyping will not reveal the true extent of this ISR diversity, since identical-length ISRs of different sequences may produce identical ISR fragment patterns after electrophoresis on agarose gels (25). Sequence analysis of the ISR has revealed the mosaic nature of the ISR in Acinetobacter baylyi (12), Haemophilus parainfluenzae (23, 54), and Photobacterium damselae (48) typified by the presence or absence of sequence blocks up to
100 nucleotides in length between multiple ISR copies of a genome. Furthermore, the whole-genome sequence analysis of C. difficile 630 has also revealed a mosaic genome characterized by a larger number of mobile genetic elements than in other whole bacterial genomes (59).
Eleven copies of the rrn operon are known to occur in C. difficile strain 630 from whole-genome sequencing and annotation (59). In this study the sequence data of these 11 rrn operons from strain 630 were analyzed to confirm previous data which showed that considerable variations exist in both the sizes and sequences of ISRs found in different C. difficile strains, including 630 (26). However, little is known about rrn copy numbers and intragenomic ISR sequence variability in other strains of this organism. Therefore, the aim of this study was to determine the extent of ISR sequence diversity in three other strains of C. difficile compared to that in C. difficile 630 and to see if such diversity might provide some basis for more-precise determination of rrn copy number and identification of individual C. difficile strains.
|
|
|---|
DNA extraction.
Chromosomal DNA was extracted from colonies of C. difficile by using the UltraClean soil DNA kit (Mo Bio) according to the manufacturer's instructions. The DNA obtained was resuspended in Tris-EDTA buffer and electrophoresed on a 1% agarose gel to determine its integrity before being stored at 20°C until required.
Amplification of the 16S-23S rRNA intergenic spacer region.
PCR amplification of the ISR was carried out using two universal primers complementary to conserved regions in the 16S and 23S rRNA genes, as recommended previously (31). The forward and reverse primer sequences are located at nucleotide positions 1477 to 1493 on the 16S rrnA gene of C. difficile strain 630 (region 4, 5'-GGC TGG ATC ACC TCC TT-3'; region 5, 5'-TAG TGC CAA GGC ATC CGC CCT-3') complementary to positions 21 to 41 on the 23S rRNA gene, respectively (31). DNA templates were amplified in a total reaction volume of 50 µl containing 2.5 U of AmpliTaq Gold thermostable polymerase (Roche), 50 pmol of each primer, 200 µM of each deoxynucleotide, 1.5 mM MgCl2, 10 mM Tris-HCl (pH 8.3), and 50 mM KCl. Amplification was carried out in a GeneAmp 2400 thermal cycler (Applied Biosystems) with denaturation at 94°C for 10 min, followed by 30 cycles according to the following program: 94°C for 1 min, 54°C for 1 min, 72°C for 2 min, and a final extension of 10 min at 72°C to complete partial polymerizations. The resulting amplification products were purified with a rapid PCR purification kit (Marligen; Biosciences) according to the manufacturer's instructions and the products analyzed on a 2% agarose gel, stained with ethidium bromide, and viewed on a UV transilluminator.
Cloning, sequencing, and RFLP of the 16S-23S rRNA ISR.
The purified PCR amplification products were cloned into the pGEM-T Easy Vector System II (Promega) in accordance with the manufacturer's instructions. Recombinant colonies were picked off and grown in 3 ml of LB medium containing ampicillin (100 µg/ml) for 14 to 16 h at 37°C. Plasmid DNA was extracted from the clones using the QIAprep Spin Miniprep kit (QIAGEN), again following the manufacturer's instructions. A total of 126 clones were screened by digestion with the restriction endonuclease EcoRI or by PCR amplification using the M13f and M13r primers and subsequent digestion with TaqI. Digestion products from the PCR products were run on 1% agarose gels to check that clones contained an insert and to reveal any possible variations in sizes of these inserts. In the case of TacI digestion, products were separated on 10% polyacrylamide in TBE (40 mM Tris, 20 mM borate and 1 mM EDTA) at 200 V for 1 h. The resulting restriction fragment length polymerases (RFLPs) were analyzed by the database management program GelCompar II upgrade 3.5 (Applied Maths, Belgium).
Sequencing of selected clones was carried out using an ABI DNA sequencer model 377a (Applied Biosystems) with Big-Dye terminator kits (Applied Biosystems). Each sequencing reaction contained 6 µl of template DNA, 2 µl of primer (1.6 µM), and 6 µl of Big-Dye reaction mix in a total reaction volume of 20 µl. After an initial ramp of 96°C, samples were submitted to 25 cycles of 96°C for 10 s, 50°C for 55 s, and 60°C for 4 min.
Sequence data analysis.
The vector sequence was trimmed from all sequences using the program SeqMan 4.05 (DNASTAR, Inc.), and with the same program, the sequences were assembled and checked for any conflicts between the overlapping sequences generated with the forward and reverse primers. The Australian National Genomic Information Service was used to perform BLASTn (3), to do text searches for ISR DNA sequences and perform sequence alignments of ISR DNA sequences (between regions 4 and 5) using Clustal W (63), and to obtain restriction enzyme fragment sizes of the ISR (between regions 4 and 5) for TaqI using tacg (which analyzes a DNA sequence for restriction enzyme sites) (41). The ISR sequences were then annotated, separated, stored, and sorted using MacClade (39), FileMakerPro 8, Sequin 6.2 (http://www.ncbi.nlm.nih.gov/Sequin/index.html), and CLC Free Workbench 2.5.2 (www.clcbio.com). The sorted information was analyzed further with Microsoft Excel 2003. Maps of the ISRs were imported from CLC Free Workbench 2.5.2 into Illustrator 10. The C. difficile 630 genome sequence was generated by the Sanger Institute Pathogen Sequencing Unit in XBase (14, 59).
Nucleotide sequence accession numbers.
The sequences reported in this paper have been assigned GenBank accession numbers DQ487223 and DQ487224, DQ487228 to DQ487230, DQ487232 and DQ487233, DQ487236 to DQ487243, DQ487245 to DQ487249, DQ487251 to DQ487260, DQ487262, DQ487264 to DQ487268, and DQ487270 to DQ487276.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Properties of 43 ISR clones from isolates A, B, and ATCC 43593, 11 ISR sequences from the whole genome of isolate 630, and 1 ISR sequence from ATCC 43597 showing the distribution of variants in each of the 13 ISR groups (i to xiii) shown in Fig. 1 and 2a to da
|
|
View this table: [in a new window] |
TABLE 2. Estimate of the number of rrn operons per genome from numbers of allele variants detected in C. difficile isolates A, B, 630, and ATCC 43593a
|
![]() View larger version (28K): [in a new window] |
FIG. 1. Complete map and GenBank annotation of 16S-23S ISR group variants from four C. difficile isolates. The ISR has been divided into 13 groups (i to xiii) with the variants for each group (1 to n, listed in Table 1) shown together by length and sequence block. See Table 1 for variant numbering nomenclature. The map on the top shows the primer sites corresponding to regions 4 and 5 (31), annotations for groups i to xiii (green blocks), rRNA (left red block corresponds to 16S, and right left block corresponds to 23S), tRNAAla (blue block), and the TaqI sites in groups iv and xiii (see Fig. 3). The blue graph at the bottom of the figure shows the sequence conservation within each of the annotated groups.
|
![]() ![]() ![]() ![]() View larger version (414K): [in a new window] |
FIG.2. Sequence alignments of variants for ISR groups i to iv (a), v to viii (b), ix to xi (c), or xii and xiii (d), shown together in Fig. 1. The sequence names are the same as in Fig. 1 and Table 1, and the variant numbering (Table 1) is shown at the right side of each group. The sequence alignment is shaded in increasing levels of blue, corresponding to increasing homology above 50% conservation, and the decreasing shade of red corresponds to decreasing homology, less than 50% conservation. The numbering (shown at the top of the alignment) starts at the beginning of region 4 (Fig. 1), but only the ISR is included (from start to finish). In the sequence alignments, a dot represents nucleotide identity to the top consensus sequence and a dash represents a gap or absence of the respective nucleotide in the consensus sequence. The blue graph at the bottom of the figure shows the sequence conservation within each of the annotated groups.
|
ISR sequence differences within and between strains of C. difficile.
On the other hand, many ISRs of the same or very similar lengths had markedly different sequences within or between these strains (Table 1) and were thus designated as residing at different chromosomal locations. For example, the respective ISR sequence groups for rrnE and rrnE1 were xi-2 and xi-1; those for rrnF and rrnF1 were xi-1 and xi-2; those for rrnI1, rrnI2, rrnI3, and rrnI4 were iii-2, vi-3, and xi-2; iii-4, vi-4, and xi-1; iii-4, vi-4, and xi-2; and iii-5, vi-5, and xi-1; those for rrnK, rrnK1, and rrnK2 were iii-6, viii-2, and xi-4); iii-1, viii-1, x-1, and xi-1; and iii-1, viii-3, x-1, and xi-1; those for rrnG and rrnG1 were xi-1 and xi-2; and those for rrnN, rrnO, and rrnP were i-2 and xi-5; i-1 and xi-6; and i-1 and xi-5.
Mosaics of ISR sequence groups suggests operon number differences between strains.
The different sequence types within each sequence group (i to xiii) vary significantly in sequence and length (Table 1; Fig. 1 and 2). The differences are so great that the combinations of the sequence groups among the 55 ISR sequences suggest a unique chromosomal origin corresponding to 26 different rrn operons in four C. difficile strains (Table 2). An estimation of the number of operons in each of these strains has been made in Table 2 with the qualification that it is not certain that all operons have been isolated even though a large number of clones were screened, increasing the chance of complete detection. The number of operons could be an underestimate, because in some cases two or more sequences were detected for unique operons (e.g., three copies of rrnG1 in strain A and four copies of rrnJ in strain ATCC 43593). Therefore, the ISR mosaic and operon copy number differences between C. difficile strains needs further investigation by parallel whole-genome sequencing (59), oligonucleotide microarray analysis (11, 16, 18, 24, 59), or multiplex ligation-dependent probe amplification (32) from multiple C. difficile strains.
Presence of tRNA genes in the ISRs of C. difficile.
Most of the ISRs in the three strains examined here lacked tRNA genes. Those that contained them possessed just a single copy of the tRNAAla gene, as seen with ATCC strain 43593 with two ISRs (clones AT-6 and AT-13), strain A with three ISRs (clones A-3, A-5, and A-15), and strain B with two ISRs (clones B-11 and B-15). In contrast, in C. difficile strain 630, its whole-genome sequence data revealed that five rrn operons, rrnA, rrnC, rrnD, rrnE, and rrnF, each contained a single copy of the tRNAAla gene. These sequences were identical in 75 bp (groups iii-1 and iv-1 [positions 76 to 150]), and one had a single nucleotide difference (A-15 [position 96]). When a BLASTn search was performed with this sequence, many other Clostridium species (56-58), including C. tetani (11), were found to contain within their ISRs a sequence of high homology (90 to 98%).
Differentiating ISRs by RFLP typing.
The feasibility of distinguishing between C. difficile ISRs of identical size but differing in their sequences based on their restriction fragment length polymorphisms was examined (Fig. 3). Differences in the lengths and sequences of some of the ISRs seen in strains A and B were also often very small. For example, rrnE1 from clones A-3, A-15, and B-11 (strains A and B, respectively) had ISRs of 502, 502, and 503 nucleotides in length (Fig. 3). Close examination of the ISR sequences revealed that the size difference between clones from strains A and B was due to a nucleotide deletion at position 492 of group xiii in clones A-15 and A-3. This single-nucleotide difference enabled strains A and B to be distinguished by the ISR from allele rrnE1 with Taq1 restriction enzyme digestion, recognizing the cut site T'CGA, which was present in clone B-11 but was absent in A-3 and A-15 (Fig. 3). For comparison, digestion patterns from several other clones that could also be distinguished by size and absence of the TaqI site (clones A-5, A-6, and B-12) are included in Fig. 3.
![]() View larger version (6K): [in a new window] |
FIG. 3. The variable TaqI site within group variants xiii-1, iv--, and iv-1 for the differentiation of rrnE1 sequence variants. From left to right: phylogram; electrophoresis composite compiled in GelCompar II of TaqI ISR restriction fragments; strain, clone, group, allele, size (bp) of regions 4 and 5, and TaqI restriction fragment sizes in bp; maps of TaqI restriction fragments found in each of the six ISRs shown in the electrophoresis composite. Mutations in region xiii at position 492 responsible for the presence or absence of the TaqI site are shown as follows: *, deletion at 492 (TaqI site absent); , G at 492 (TaqI site present); ¶, A at 492 (TaqI site absent).
|
The mosaic nature of the ISR (Table 2) demonstrates 26 different rrn alleles in 5 different strains. When the 13 sequence blocks in these 26 unique alleles were examined, different combinations were together on different alleles (Table 2). The first class of sequence groups (Table 2) contained two or more variants that were linked only with two or more variants from one or more other groups (the only example of this class of sequence groups was the linkage of variants viii-3, vi-7, and ii-2 in all four strains on alleles rrnB, rrnJ, rrnJ1, and rrnK2). The second class of sequence groups (Table 2) contained variants linked with a number of different variants in another sequence group. This was most evident in the conserved or moderately conserved sequence groups (i, ii, vi, vii, xii, and xiii). The sequence group variant vi-7 is one of many examples that was linked to different sequence group variants (vii-1 or vii-2; viii-1, viii-2, or viii-3; x-2 or x-1; xi-1, xi-2, xi-4, xi-5, or xi-6; and xiii-1 or xiii-2). The variability revealed by this analysis of each of the sequence groups shows clearly the mosaic nature of the ISRs in C. difficile, which has been described for many other bacterial species (see Discussion for references). Apart from describing the widespread molecular biological phenomenon of mosaic 16S-23S ISR sequences of the rrn operon, the practical implications for these sequence group variants are in designing the following types of oligonucleotide for use in PCR and microarrays. (i) The conserved pairs of sequence variants show the most potential for revealing sequences that might be used as oligonucleotides for the detection of all C. difficile strains, because many of these sequences are highly conserved in C. difficile strains. (ii) Compared with highly variable alignment groups (e.g., iii, vi, and xi), there are fewer conserved sequence group pair combinations because fewer variants occur within these alignment groups, thereby reducing the number of possible combinations. (iii) The number of combinations between the highly variable alignment group variants will be much greater than the number of conserved sequence group pair combinations shown in Table 2. Combinations of these highly variable sequences can therefore be used for the detection of specific strains, groups of strains, specific operons, and groups of operons.
|
|
|---|
Bacterial rRNA (rrn) operons are generally organized in the order of 16S rrnA-ISR-23S rrnA-ISR-5S rrnA, and their copy numbers vary considerably between 1 and 15 (31, 45, 64, 67, 69). In addition, the 16S-23S ISRs are known to vary in both their length and sequence, and consequently this region has been used with some success to differentiate between closely related species and different strains within a single species (31). This study aimed to see to what extent ISR sequences differed among strains of C. difficile and between the different operons in a single strain. Three strains (ATCC 43593 and two clinical strains [A and B]) were analyzed and the data compared with those for C. difficile strain 630, whose whole-genome sequence contains 11 rrnA operons. Their sequences have been deposited in GenBank.
If each generated ISR clone of a different size, different sequence, or different sense is assumed to represent a single rrn operon, it can be deduced that strains A and B and ATCC strain 43593 contained 7, 8, and 11 operons, respectively. The rrn copy number is not always conserved in different strains of the same species. For example, Streptococcus thermophilus contains six rrn operons, but some strains have been reported to have only five rrn copies (49). Similarly, rrn copy numbers of 7, 8, and 9 occur within different Vibrio cholerae strains (17, 36). Recombination events between rrn operons may be the reason for rrn operon copy number variations among strains of the same species (1, 25, 28, 36). It is still not clear what benefit there might be to an organism in having multiple rrn copies. Klappenbach et al. (34, 35) thought it might facilitate an organism's ability to adapt better to environmental change, and some evidence has been presented suggesting that from soil organisms, those with higher rrn copy numbers dominated in soil communities exposed to complex carbon sources like pesticides. These strains also grew faster in complex media, while slow growers generally had lower rrn copy numbers, suggesting that rrn copy number is growth rate related (34).
Genes coding for tRNAAla are present only in the ISRs of operons rrnA, rrnC, rrnD, rrnE, and rrnF in C. difficile strain 630, and all are coded by the positive-sense DNA strand. They were detected also in ISR clones AT-6, AT-13, A-3, A-5, A-15, B-11, and B-15. The known number of tRNA genes occurring in bacterial rrn operons varies from zero to five (4, 5, 12, 17, 27, 44, 46, 48, 50, 51). However, most of the ISRs in these C. difficile strains did not contain any tRNA genes. In some bacteria, ISRs encoding different tRNAs have been reported, as seen for Vibrio cholerae, Vibrio mimicus, and Photobacterium damselae, where tRNAGlu, tRNALys, tRNAVal, and tRNAAla were revealed (17, 48).
The sequence data analyzed from the 55 ISRs within these 5 C. difficile strains revealed they had a mosaic-like structure, as was also reported for the ISRs in Salmonella enterica, Haemophilus parainfluenzae, Staphylococcus aureus, Vibrio cholerae, Vibrio mimicus, Vibrio parahaemolyticus, Myxobacterium spp., and Acinetobacter baylyi (4, 5, 12, 17, 25, 27, 44, 46, 48, 50, 51). The highly mosaic nature of the C. difficile ISR is entirely consistent with the recent finding that the complete genome of C. difficile is also mosaic in nature, with 11% of the genome comprised of mobile genetic elements that in many cases contain repetitive sequences (59).
Twenty-six different types of ISRs existed among these five strains of C. difficile. When the 16S rRNA and 23S rRNA flanking regions were excluded from the analyses, the resulting data showed that all ISRs shared sequences of 44 bp at the 5' end and of 136 bp at the 3' end (ii-1 was specific for rrnA, -C, -D, -E, and -F, and ii-2 was specific for the remainder of the alleles). In V. parahaemolyticus, all six different ISRs share a 180-bp sequence at the 3' end (40), whereas in P. damselae, a 21-bp sequence was shared at their 3' end (48). The mosaic nature of the ISR seen in all these organisms is analyzed in a unique way in this study, because (i) all the variants are analyzed together, (ii) mosaic ISRs are used to deduce operon variants, and (iii) operon variants are used to deduce rrn copy number within strains.
These sequence data suggest that intraspecific diversity in the ISRs in C. difficile is much greater than that reported for any other bacterial species (4, 5, 12, 15, 17, 20, 27, 30, 40, 50, 51). Many authors have compared ISRs between different species and strains of a single bacterial species (15, 17, 33, 44, 46). In some cases the ISR diversity between species of a genus is lower than the diversity seen between the strains of C. difficile examined here. Sequence differences between ISRs of the same size most likely arise from nucleotide substitution events (4, 5, 25, 30). Also, some ISRs were of similar length (e.g., rrnE and rrnF of S. aureus) but had sequence blocks with different nucleotide sequences, probably because of recombination events (25, 27, 28). Some of the ISRs were identical in length, and any differences in their sequences were due to single nucleotide deletions, insertions, or substitutions (4, 5, 25, 30).
Although PCR amplification of ISRs in C. difficile has been used for typing and differentiation of C. difficile strains, many strains give identical PCR band patterns on agarose gels (7, 13, 21, 26, 42, 55, 61). This study suggests that these apparent similarities may mask considerable levels of heterogeneity not revealed by these fingerprinting methods. Thus, both strains A and B gave similar PCR band patterns on agarose gels, but sequencing these revealed markedly different ISRs in the two strains. For example, ISR clones A-3, A-15, A-11, and A-12 were found only in strain A, while ISR clones B-11, B-12, and B-14 just existed in strain B. It was possible to differentiate between these ISR clones and hence strains which contain them. Examination of restriction endonuclease maps of the ISRs with Biomanager software indicated that the TaqI restriction enzyme could differentiate between B-11 of strain B and A-3 and A-15 of strain A on the basis of different size restriction fragments (Fig. 3). It should also be added that several other restriction enzymes could be used for the differentiation of other ISR clones of strain A from ISR clones of strain B.
Our work has shown that a high degree of diversity existed among ISR sequences from five different strains of C. difficile with respect to rrn copy number and sequence variation. Other studies have shown that significant ISR length variations exist between a large number of other C. difficile strains (13, 26, 61). The sequences of the ISRs in C. difficile are similarly diverse, and it is likely that more diversity will be uncovered when more strains are analyzed at the sequence level. Strain-specific oligonucleotide microarrays have been described for ISR sequences of streptococci (16), for industrial wastewater treatment plant microorganisms in the study of microbial diversity within complex environmental samples (18), and for the detection of different species of the genus Kitasatospora (24). The further characterization of ISR sequence mosaics in C. difficile by oligonucleotide-mosaic arrays is likely to uncover an even higher level of mosaic sequence block rearrangements as more strains of C. difficile are analyzed.
We are grateful to Christopher Maslunka for his assistance in locating the rrn operons in the whole-genome sequence of C. difficile strain 630.
Published ahead of print on 15 September 2006. ![]()
Present address: Microbiology Department, Ilam University of Medical Sciences, Ilam, Iran. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2010 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»