Applied and Environmental Microbiology, August 2008, p. 4898-4909, Vol. 74, No. 15
0099-2240/08/$08.00+0 doi:10.1128/AEM.02884-07
Copyright © 2008, American Society for Microbiology. All Rights Reserved.
,
Tina L. Fiedler,1
Jeanne M. Marrazzo,2 and
David N. Fredricks1*
Program in Infectious Diseases, D3-100, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue N, Seattle, Washington 98109-1024,1 Harborview Medical Center, 325 Ninth Avenue, Mailstop 359931, Seattle, Washington 981042
Received 20 December 2007/ Accepted 12 May 2008
BV is strongly associated with several adverse health outcomes, including preterm labor and delivery (13, 14), pelvic inflammatory disease (29), and the acquisition and transmission of sexually transmitted diseases, such as human immunodeficiency virus (23-25). BV is the most common cause of vaginal discharge among women of reproductive age and results in millions of health care visits annually in the United States (26). Because BV is fundamentally a result of changes to the vaginal microbial community, full understanding and successful treatment will require knowledge of how the healthy community is altered in its taxonomic composition, community structure, and, ultimately, function.
Historically, a search for single etiologic agents in BV and the constraints of traditional cultivation techniques limited appreciation of the full extent of vaginal microbial diversity in this syndrome. BV has long been recognized as a complex condition, but until recently, lactobacilli found in the healthy vagina were thought to be replaced by a relatively small suite of taxa typically including Gardnerella vaginalis, Prevotella spp., Porphyromonas spp., Mobiluncus spp., and Mycoplasma hominis (7, 27). The advent of molecular tools, such as broad-range bacterial 16S rRNA gene PCR, has enabled community assessments independent of cultivation-based techniques and has greatly expanded our knowledge of the phylogenetic range and taxonomic diversity of bacteria found in the vagina. For example, intensive sequencing has revealed extensive diversity, not only within the Lactobacillus species complex (15), but also within other taxa most closely related to Atopobium, Clostridium, Leptotrichia, Prevotella, Peptostreptococcus, and Peptoniphilus, genera not previously thought to be common in the human vagina (3, 9, 15, 33, 34).
Although there is agreement in the literature that no single agent likely causes BV, there is no consensus about what constitutes a pathogenic bacterial community in this syndrome. Comprehensive cultivation-independent comparisons of the vaginal bacterial communities from subjects with and without clinically defined BV have been rare. Without such fundamental data, our understanding of the pathogenesis of BV is constrained. In one study that compared broad-range bacterial community composition among patients with objectively defined BV status, subjects with BV had nearly four times as many taxa as subjects without BV, including uncultivated Clostridia-like bacteria, which were shown to be highly specific for BV (9).
In this paper, we extend this previous work to present a systematic phylogenetic analysis of vaginal bacterial community composition from subjects with defined BV status as determined by commonly accepted clinical criteria. We also synthesize current public 16S rRNA gene sequence data from bacteria found in the vaginal ecosystem (with inclusion of studies where BV status was not assessed) and place these data in a phylogenetic context. Our goal was to provide a comprehensive picture of our current knowledge of the community structure of the vaginal bacterial ecosystem.
Written informed consent was obtained from all participants in the study, which was also approved by the Institutional Review Boards at the Fred Hutchinson Cancer Research Center and the University of Washington.
Molecular methods.
DNA was obtained from swabs using the UltraClean Soil DNA extraction kit (MoBio, Carlsbad, CA), chosen after testing guanidinium-based lysis protocols with and without bead beating. Out of the 969 sequences generated, 3 were detected only in the protocol without bead beating and 7 were detected only in the bead-beating protocol. Accordingly, we did not find that bead beating substantially affected our results. For broad-range PCR, the primers 338F (5'-ACTCCTRCGGGAGGCAGCAG-3') and 1407R (5'-GACGGGCGGTGTGTRCA-3') were used with a thermal-cycling protocol of denaturation at 95°C, followed by 21 to 25 cycles of 95°C for 30 s, 55°C for 30 s, and 72°C for 90 s, with a final extension at 72°C for 7 min.
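Both primers contain the degenerate base R, which in the standard IUPAC nucleotide code stands for A or G. As an illustration only (not part of the published protocol), the following minimal Python sketch expands each degenerate primer into the concrete oligonucleotides it represents. The function name `expand_primer` and the variable names are hypothetical; the primer sequences are those given above.

```python
from itertools import product

# Standard IUPAC nucleotide codes; only a subset is needed for these primers.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC", "N": "ACGT",
}


def expand_primer(primer):
    """Return every non-degenerate sequence encoded by a degenerate primer."""
    choices = [IUPAC[base] for base in primer.upper()]
    return ["".join(combo) for combo in product(*choices)]


# Primer sequences as stated in the text (5' to 3').
PRIMER_338F = "ACTCCTRCGGGAGGCAGCAG"
PRIMER_1407R = "GACGGGCGGTGTGTRCA"

forward_variants = expand_primer(PRIMER_338F)
reverse_variants = expand_primer(PRIMER_1407R)

for name, variants in (("338F", forward_variants), ("1407R", reverse_variants)):
    print(name, variants)
```

Each primer carries a single R, so each expands to exactly two sequences.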
Successfully amplified products were cloned using the Topo blunt cloning kit (Invitrogen, Carlsbad, CA), and PCR-amplified inserts were sequenced using vector primers and BigDye version 3 (Applied Biosystems, Foster City, CA). Single reads were generated for 736 sequences, with an average length of 811 base pairs. Two reads were obtained for 233 sequences, resulting in about 1-kb sequence lengths.
Data analysis.
Raw sequence data were edited, and contigs were assembled for each clone using Sequencher (Gene Codes, Ann Arbor, MI). The 969 sequences obtained from our sequencing efforts and all publicly available sequences from vaginal bacteria (554 sequences, listed below) were then submitted to the GreenGenes 16S rRNA gene database (5) for alignment using the NAST algorithm (4) and taxonomic classification of each sequence. All sequences were checked for chimeric anomalies using the Mallard program (2).
To classify sequences based on self-similarities rather than matches to an external database, sequences were grouped into operational taxonomic units (OTUs) with cutoffs of 99%, 97%, and 95% sequence similarity using the DOTUR software package implemented with the furthest-neighbor option, in which all of the sequences within an OTU are at least x% similar to all of the other sequences within the OTU (22). The commonly accepted phylogenetic species definition of 97% 16S sequence similarity (32) was used to define a core data set of representative sequences that were used for phylogenetic analyses.
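The furthest-neighbor criterion can be illustrated with a toy example. The sketch below is not DOTUR's implementation; it is a minimal complete-linkage clustering in Python, assuming pre-aligned sequences of equal length and a simple fraction-of-identical-positions similarity. The function names and the four toy sequences are hypothetical.

```python
from itertools import combinations


def percent_identity(a, b):
    """Fraction of identical positions between two aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x == y for x, y in zip(a, b)) / len(a)


def furthest_neighbor_otus(seqs, cutoff):
    """Complete-linkage clustering: two clusters are merged only while
    every cross-cluster pair of sequences is at least `cutoff` similar."""
    clusters = [[i] for i in range(len(seqs))]
    sim = {(i, j): percent_identity(seqs[i], seqs[j])
           for i, j in combinations(range(len(seqs)), 2)}

    def linkage(c1, c2):
        # Furthest neighbor = similarity of the LEAST similar cross pair.
        return min(sim[(min(i, j), max(i, j))] for i in c1 for j in c2)

    while len(clusters) > 1:
        a, b = max(combinations(range(len(clusters)), 2),
                   key=lambda p: linkage(clusters[p[0]], clusters[p[1]]))
        if linkage(clusters[a], clusters[b]) < cutoff:
            break  # no remaining pair of clusters satisfies the cutoff
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]  # safe because a < b
    return clusters


# Hypothetical toy alignment (20 columns), not real 16S data.
toy = [
    "ACGTACGTACGTACGTACGT",  # reference
    "TCGTACGTACGTACGTACGT",  # 1 mismatch to the reference (95%)
    "ACGTACGTACGTACGTACGA",  # 1 different mismatch (95% to reference, 90% to the previous)
    "TTTTTTTTTTTTTTTTTTTT",  # unrelated
]

print(furthest_neighbor_otus(toy, 0.95))
print(furthest_neighbor_otus(toy, 0.90))
```

At the 95% cutoff the third sequence stays in its own OTU even though it is 95% similar to the first, because it is only 90% similar to the second; this is the property described above, in which every member of an OTU meets the cutoff against every other member.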
Sequences were incorporated into ARB (17) for phylogenetic analysis and tree construction. Sequences from a nonredundant list of three nearest-neighbor isolates for each sequence were also added to the database. Alignments were manually corrected based on the secondary structure of the 16S rRNA and tree construction performed using the maximum-likelihood method in ARB (FastdnaML) with a 75% similarity filter constructed for each phylum. Bootstrap values (100 resamplings) were generated for maximum-likelihood trees created in PHYLIP (8) using the DNAML algorithm and neighbor-joining trees created in ARB and superimposed on nodes supported by all three methods.
Publicly available sequences were collected from previous surveys of vaginal bacterial diversity, which included 439 sequences from Hyman et al. (15) (AY958774 to AY959212), 7 unpublished sequences from Zozaya-Hinchliffe et al. (EF120360 to EF120366), 25 sequences from Verhelst et al. (31) (AJ585206 to AJ619714), 15 unpublished sequences from Zhang et al. (DQ666091 to DQ666105), and 68 sequences from Zhou et al. (33) (AY267541 and -2, AY269020 to -34, AY271931 to -53, AY283264 to -75, AY335493 to -504, and DQ987868 to -9). Because subject BV status was not available for most of these studies, the corresponding sequences were excluded from comparisons of bacterial community structure in subjects with and without BV but were included in phylogenetic analyses of vaginal bacterial diversity.
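For readers who wish to retrieve these sequences, contiguous accession ranges can be expanded programmatically. The following Python sketch is an illustration only; the helper `expand_accession_range` is hypothetical and assumes that a range is contiguous and shares a single letter prefix. That assumption holds for the three ranges used here, whose expanded sizes match the counts given in the text, but evidently not for the range AJ585206 to AJ619714, which spans far more accession numbers than the 25 sequences it is stated to contain.

```python
import re


def expand_accession_range(first, last):
    """Expand a contiguous GenBank accession range such as AY958774 to AY959212."""
    m1 = re.fullmatch(r"([A-Z]+)(\d+)", first)
    m2 = re.fullmatch(r"([A-Z]+)(\d+)", last)
    if not m1 or not m2 or m1.group(1) != m2.group(1):
        raise ValueError("accessions must share one letter prefix")
    prefix, width = m1.group(1), len(m1.group(2))
    start, stop = int(m1.group(2)), int(m2.group(2))
    # Keep the original zero-padded width of the numeric part.
    return [f"{prefix}{n:0{width}d}" for n in range(start, stop + 1)]


hyman = expand_accession_range("AY958774", "AY959212")
zozaya = expand_accession_range("EF120360", "EF120366")
zhang = expand_accession_range("DQ666091", "DQ666105")

print(len(hyman), len(zozaya), len(zhang))  # 439 7 15
```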
Nucleotide sequence accession numbers.
Sequences from the 97 OTUs generated in this study were deposited in GenBank (AY738660, AY738684, AY738687, AY738691, AY738694, AY738697, AY738701 to -5, EF428974, and EU188937 to EU189021) and are listed in Table S1 in the supplemental material.
FIG. 1. (A) Numbers of taxa per subject using four different taxon definitions. Taxon definitions of 99%, 97%, and 95% OTUs were assigned using the DOTUR package, and NCBI definitions were based on taxonomic classifications using the GreenGenes 16S rRNA database, as described in the text. The data represent 13 subjects without BV and 28 subjects with BV. The shaded boxes encompass the 25th to 75th percentiles of the data, the boldface lines indicate means, the lightface lines indicate median values, and the whiskers span the 5th to 95th percentiles. The asterisks indicate significant (P < 0.001) differences in mean values for subjects with BV versus subjects without BV, as determined by t tests. (B) Total number of taxa found across all subjects that are unique (e.g., found only in subjects with BV) and the number that are shared for each clinical state.
TABLE 1. Richness and diversity values and best matches to taxon designations in NCBI
FIG. 2. Proportions of subjects for which each of the 97 OTUs classified at a 97% sequence similarity were encountered. The list indicates genus-level identification of each OTU based on the NCBI taxonomy. Note that sequences designated Bifidobacterium by NCBI correspond to sequences classified as Gardnerella by the RDP. The accession number for each OTU is listed in Table S1 in the supplemental material.
FIG. 3. Taxonomic affiliations and relative proportions of sequences summarized at the genus (A) and phylum (B) levels. The taxa in panel A represent the closest matches from the NCBI database, with the numbers in parentheses representing the mean percent match for each group of sequences for subjects with BV (829 sequences) and subjects without BV (140 sequences). Note that some sequences designated Bifidobacterium by NCBI correspond to sequences consistent with G. vaginalis; there is significant sequence heterogeneity within the G. vaginalis species complex. The y axes in both panels show the proportion of sequences from subjects with and without BV calculated separately. The numbers above the bars represent the number of OTUs within each group defined at a minimum sequence similarity of 97%; only numbers >1 are shown. Note the break in the axis for Lactobacillus-like sequences from subjects without BV, where lactobacilli comprise 86% of the sequences.
For several taxonomic groups in particular, the true extent of diversity is not reflected in the current GenBank database. For example, even though about 20% of sequences from subjects with BV were classified as Prevotella according to the best match in GenBank, this group of sequences actually contained 21 different taxa based on a species definition of 97% sequence similarity (Fig. 3A). Similarly, within the group of sequences most closely related to Lactobacillus in GenBank, there were actually 15 different phylotypes (Fig. 3A). Estimates of taxonomic richness and diversity based on best matches to named taxa may severely underestimate the true extent of bacterial diversity in the vagina.
Taxonomic designations of sequences using two different classification schemes, NCBI and the Ribosomal Database Project (RDP) (http://rdp.cme.msu.edu), were largely congruent (Tables 2, 3, and 4) but also demonstrated important discrepancies. For example, sequences classified as Bifidobacterium by NCBI belonged to Gardnerella in the RDP scheme (Tables 2 and 4). Similarly, sequences classified as Clostridium by NCBI were called Acetivibrio by RDP (Table 2). In general, NCBI designations split sequences into more groups, while the RDP scheme lumped sequences together: the 60 sequences classified as Acetivibrio by RDP belonged to four different taxonomic groups in the NCBI scheme, and 47 sequences classified as Coriobacteriaceae by RDP represented five taxonomic groups according to NCBI (Table 2).
TABLE 2. Comparison of NCBI and RDP taxonomic designations for Actinobacteria, Bacteroidetes, and Firmicutes sequences obtained from subjects with BV (n = 580)
TABLE 3. Comparison of NCBI and RDP taxonomic designations for Firmicutes and Fusobacteria sequences obtained from subjects with BV (n = 249)
TABLE 4. Comparison of NCBI and RDP taxonomic designations for sequences obtained from subjects without BV (n = 140)
Six sequences deposited in GenBank (AY959158, AY958981, AY959016, AY959073, AY958933, and AY959044) were found to be consistent with chimeric data (P < 0.001) and so were excluded from our analyses.
Phylogenetic analyses of BV-associated bacteria.
The true diversity and novelty of BV-associated bacteria were apparent when sequences generated in this study were compared to publicly available sequences from previous surveys of vaginal bacterial diversity and their nearest cultivated and uncultivated matches.
Within the phylum Bacteroidetes, sequences in our data set were most closely related to one of three genera: Prevotella, Bacteroides, and Porphyromonas (Fig. 4). Most (196/222) belonged to the Prevotella group but were only distantly related to their nearest relative and represented many novel sequence types not found in previous studies of vaginal bacteria. In the Prevotella species complex, we found 13 novel OTUs (Fig. 4). Two novel OTUs representing 24 sequences were most closely related to Porphyromonas uenonis (Fig. 4) and were found only in subjects with BV.
FIG. 4. Phylogenetic relationships among representative taxa belonging to the phylum Bacteroidetes. The tree was reconstructed using the maximum-likelihood method in ARB with a 75% similarity column filter. Representative taxa were defined using a 97% OTU definition, as described in the text. Additional taxa represent a nonredundant list of nearest neighbors current as of April 2008. Taxa in red represent OTUs unique to our sequencing efforts, OTUs in green were common to our study and at least one other, and OTUs shown in blue were detected in other studies but not encountered in our sequencing. The numbers after each OTU indicate the number of sequences with unknown BV status, the number of sequences from subjects without BV, the number of sequences from subjects with BV, and the total number of sequences within each OTU. OTUs shown in boldface were found only in subjects with BV. Triangles at nodes represent bootstrap values from 100 resamplings of maximum-likelihood (ML) trees built in PHYLIP and neighbor-joining (NJ) trees, as shown in the legend and described in the text. str., strain.
FIG. 5. Phylogenetic relationships among representative taxa belonging to the phylum Actinobacteria. The tree reconstruction methods, color scheme, and bootstrap representations are as described in the legend to Fig. 4.
FIG. 6. Phylogenetic relationships among representative taxa belonging to the phylum Firmicutes. The tree reconstruction methods, color scheme, and bootstrap representations are as described in the legend to Fig. 4.
For patients afflicted with BV, there is a profound shift in the types of bacteria present in the vagina and their absolute and relative abundances. Data presented here clearly demonstrate that BV is associated with a dramatic increase in the taxonomic richness and diversity of the vaginal bacterial community. Our analyses highlight three main points.
First, the true extent of vaginal bacterial diversity may be severely underestimated by attributing taxonomic identities to sequences simply according to their best match to currently known taxa. Although the use of the NCBI taxonomic classification scheme gives a familiar nomenclature for sequences obtained from any environment, in the case of the vaginal ecosystem, it also clearly hides a significant amount of diversity. Multiple OTUs within sequence groups matching a single taxon as defined by NCBI and low similarity scores to these taxa demonstrate that we are just beginning to reveal the true extent of the diversity of the vaginal flora.
Vaginal bacteria are generally underrepresented in the NCBI database, but for some groups in particular, the true extent of diversity is hidden by their poor representation in GenBank. For example, sequences classified as Prevotella actually contained 21 different OTUs using a 97% definition. Similarly, within sequences classified as Lactobacillus, there were 15 different OTUs. The diversity contained within the Lactobacillus species complex of the vagina has been noted previously by other researchers (15, 34), but the functional importance of this level of diversity remains unknown. Ecological theory suggests that each phylotype occupies its own niche in its environment and thus must be somehow unique, but the details of how BV-associated microbes partition niche space in the vaginal ecosystem and the correspondence between 16S rRNA-defined phylotypes and various functional capabilities remain relatively unexplored.
It is likely that additional sampling will reveal even greater richness and diversity in subjects with BV. When a taxon was defined using a 97% OTU definition, we observed a total of 61 taxa across all subjects with BV, but richness estimates of the true number of taxa present ranged from 63 to 111 (95% confidence interval), with an average estimate of 75 taxa (Table 1). For subjects without BV, our sampling effort was more complete; we encountered 25 taxa with a 97% OTU definition, and estimates of the total ranged from 20 to 30 (95% confidence interval), with an average of 21 (Table 1).
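This section does not state which richness estimator underlies the values in Table 1. Purely as an illustration of how such an estimate extrapolates beyond the observed count, the sketch below computes the bias-corrected Chao1 estimator, a commonly used nonparametric estimator chosen here as an assumption. The OTU abundance vector is hypothetical and is not data from this study.

```python
from collections import Counter


def chao1(otu_counts):
    """Bias-corrected Chao1 richness estimate from per-OTU sequence counts.

    S_est = S_obs + F1 * (F1 - 1) / (2 * (F2 + 1)),
    where F1 and F2 are the numbers of OTUs seen exactly once and twice.
    """
    counts = [c for c in otu_counts if c > 0]
    s_obs = len(counts)
    freq = Counter(counts)
    f1, f2 = freq[1], freq[2]  # singletons and doubletons
    return s_obs + f1 * (f1 - 1) / (2 * (f2 + 1))


# Hypothetical clone counts per OTU (10 observed OTUs, 4 singletons, 2 doubletons).
example_counts = [50, 20, 10, 5, 2, 2, 1, 1, 1, 1]
print(chao1(example_counts))  # 12.0
```

The more OTUs that are represented by a single sequence, the further the estimate rises above the observed richness, which is why a community with many rare taxa is expected to yield additional taxa on further sampling.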
The loss of information caused by lumping multiple OTUs into a single taxon and the subsequent underestimation of the true richness and diversity of the vaginal ecosystem highlight the second important result of our analyses. The application of an OTU-based analysis to the full data set of our sequences and those from previous surveys of the vaginal ecosystem revealed many novel phylotypes associated with BV. We found 38 OTUs not previously encountered in the vaginal ecosystem, including 31 OTUs that were found only in subjects with BV. These OTUs were found across all of the major groups of BV-related taxa (Fig. 4, 5, and 6), providing additional evidence that many more taxa are associated with BV than was previously thought.
Third, although the vaginal bacterial communities found in subjects with and without BV are distinctly different, the structures of these vaginal bacterial communities have high intersubject variability within each clinical group. For both groups, the total number of taxa encountered across all subjects was at least four times greater than the per-patient mean (Fig. 1), and many taxa were relatively uncommon (Fig. 2). This variability in community membership and structure has important implications for understanding the etiology of BV and for developing diagnostic tools. Why is the taxonomic composition of BV-associated bacteria so different for each patient? High interpatient variability has been observed previously for microbial communities in the vagina (3) and other areas of the human body (6, 11). Stochastic differences in colonization could generate this pattern, but so could other factors, such as differences in host immune response, expression of ligands for bacterial attachment to epithelial cells, the chemical and physical environments of the host, or intra- and interspecific microbial competition.
Although no single bacterium can be identified as uniquely associated with BV, aggregating our data at higher taxonomic levels made it clear that several taxonomic groups do have a strong association. Common practice has generally been to describe sampled communities using the finest level of taxonomic discrimination possible, but aggregating data into higher taxonomic groups could be more informative and may avoid currently unresolved problems in classifying many BV-related bacteria at the genus or species level. For example, when treated at the genus level, taxa such as Eggerthella, Mobiluncus, and Slackia were relatively rare in subjects with BV, but when considered as a phylum, Actinobacteria accounted for 26% of all sequences from subjects with BV versus 6% from those without. Actinobacteria and Bacteroidetes in particular were much more common in subjects with BV than in those without BV (4.5 times and 6.7 times, respectively). The ability to identify a set of signature taxa common to BV could have important clinical and diagnostic implications.
Aggregating taxonomic data may also reduce problems associated with the current poorly resolved systematics of many BV-associated bacteria. For example, under the NCBI naming scheme, sequences classified as Gardnerella were relatively rare in subjects with BV, which contradicts generally held notions of a strong association between BV and Gardnerella. However, the 110 sequences classified as Bifidobacterium by NCBI were all classified as Gardnerella by the RDP (Table 2). The discrepancies among different taxonomic classification schemes highlight the current state of systematics for many BV-associated taxa and emphasize the importance of OTU-based analyses to capture the true diversity within a pool of sequences.
The polymicrobial nature of BV clearly raises the possibility of interactions among vaginal microbes, including syntrophies, which might contribute to the pathogenesis of BV (see reference 20 for a review). Ammonia transfer from Prevotella bivia to G. vaginalis has been demonstrated (19), but much remains to be learned about interspecific interactions of vaginal microbes and their contribution to the etiology of BV. Subjects with BV have volatile amines in vaginal fluid (the basis for the "whiff test" used in the clinical diagnosis of BV) with elevated levels of trimethylamine, putrescine, cadaverine, and tyramine, but the microbes responsible for generating these metabolic products are not clearly identified. An interesting recent paper has shown the importance of direct competition between lactobacilli and G. vaginalis, apparently independently of pH and H2O2 production (21). Clearly, much work remains to be done to tease apart the microbial interactions, metabolic processes, and host factors that lead to BV; it is our hope that increased knowledge about the composition and structure of BV-associated bacterial communities will facilitate these studies.
Limitations.
The data presented here reflect different sample sizes for subjects with and without BV. However, initial screening of samples using restriction fragment length polymorphisms indicated that the extreme differences in taxonomic richness between the two patient populations justified the different sample sizes in order to achieve similar sampling depths. It is axiomatic that representative sampling of any community requires that the sampling effort be tied to the structure of the community; depauperate communities naturally require less sampling effort than taxon-rich and diverse communities to achieve equivalent sampling saturation. In the case of subjects without BV, fewer sequences were obtained due to many fewer restriction fragment length polymorphism patterns because of the dominance by Lactobacillus. Richness estimators (Table 1) indicated subjects without BV were adequately sampled, and additional diversity will be discovered in subjects with BV. Future work utilizing techniques such as pyrosequencing could provide greatly improved coverage of BV-associated genetic diversity but with more limited phylogenetic resolution due to limited sequence lengths.
Although the use of cloning to identify bacteria from complex samples has been criticized for potentially biased amplification (28), the clone library approach has also compared favorably to PCR-independent methods, such as fluorescence in situ hybridization, in its estimation of the relative proportions of various taxonomic groups (16). The partial 16S rRNA gene sequences obtained in our study (800 to 1,000 bp) may contribute to some of the discrepancies in taxonomic assignments between the NCBI and RDP classifications. Full-length sequences (1.5 kb) would likely improve phylogenetic resolution for some taxa, but in this study, we opted for a relatively high-throughput approach to compare a large number of sequences from subjects with and without BV.
Comparison to previous results.
In our phylogenetic analysis, it is apparent that novel OTUs found in our study generally belong to different phylogenetic clades than OTUs not encountered in our study. This is perhaps most striking for the Bacteroidetes (Fig. 4), for which we found 13 OTUs that had not been discovered in previous studies of vaginal bacteria. Similarly, for the Actinobacteria (Fig. 5), we found only two OTUs representing five sequences that did not group with Atopobium, Eggerthella, or Gardnerella/Bifidobacterium, while Hyman et al. found 15 OTUs outside of this main group. This pattern could reflect different study populations, the inherent variability of BV, PCR primer bias, or different sampling intensities across studies. Most (82/98) of the OTUs not found by us have two or fewer members represented, and all but six of these were found only by Hyman et al. (15), suggesting that their intensive sequencing effort did find uncommon members of the community. However, it is also possible that these sequences could originate from low-level PCR contaminants, which are more likely to become evident with more intensive sampling of clone libraries (10). Taq polymerase, used for PCR, is known to be contaminated with low levels of bacterial 16S rRNA genes. Demographic differences among patient populations have also been demonstrated (34) and may explain these differences.
Conclusions.
The structures of the vaginal bacterial communities differ dramatically between subjects with and without BV. BV is associated with increased taxonomic richness and diversity. At a species or genus level, the composition of the vaginal bacterial community has high interpatient variability, yet at higher taxonomic levels, several bacterial groups are strongly associated with BV, most notably Actinobacteria and Bacteroidetes. Our data describe a previously unrecognized extent of diversity in the vaginal ecosystem in general and of BV-associated bacteria in particular. The true extent of diversity within several key taxonomic groups is grossly underrepresented in the current NCBI database. The most prominent of these are Prevotella-like sequences, commonly found in subjects with BV, and Lactobacillus-like sequences, common in subjects without BV.
Using Web-based tools freely available to the research community, our analysis provides a comprehensive census of vaginal bacterial communities and their association with BV. It is our hope that the data presented here will stimulate the formulation of new hypotheses about the metabolic functions, syntrophic interactions, and niche partitioning of bacteria colonizing the vaginal ecosystem. Continuing investigations of BV will almost certainly reveal complex syntrophies, cell-to-cell signaling, and bacterial-host interactions that will shed light on how consortia of bacteria interact to form pathogenic communities in the human host.
We thank Andrew Millard for assistance with Perl scripting.
Published ahead of print on 16 May 2008.
Supplemental material for this article may be found at http://aem.asm.org/.
Present address: Microbiology Research Group, Department of Biological Sciences, University of Warwick, Coventry CV4 7AL, United Kingdom.