Previous Article | Next Article ![]()
Applied and Environmental Microbiology, December 2004, p. 7210-7219, Vol. 70, No. 12
0099-2240/04/$08.00+0 DOI: 10.1128/AEM.70.12.7210-7219.2004
Copyright © 2004, American Society for Microbiology. All Rights Reserved.
Instituto de Fermentaciones Industriales, CSIC, Madrid, Spain
Received 8 June 2004/ Accepted 20 July 2004
|
|
|---|
|
|
|---|
The genus Oenococcus (7) contains a sole species, O. oeni, with a restricted ecological niche (wine and related habitats); although its dissimilarity to the genus Leuconostoc is generally accepted, there is some controversy regarding both the evolution rate of the species and its diversity (6, 46). Despite the exhaustive phenetic and molecular studies that have been performed on O. oeni, little is known about its population genetics. Studies carried out by using different molecular techniques such as chromosomal DNA-DNA hybridizations (7), 16S and 23S rRNA sequence analysis (39), and 16S-23S rRNA gene (rDNA) intergenic spacer region (ISR) sequencing (30, 46) and also by DNA fingerprinting (29, 44), pulsed-field gel electrophoresis (26), and randomly amplified polymorphic DNA (RAPD) analysis (45) suggest that this species is homogeneous. However, metabolic or physiological criteria, such as lactate dehydrogenases (17), carbohydrate fermentation (18), and cellular fatty acid patterns (42), have shown considerable diversity among strains of O. oeni. Therefore, O. oeni has often been referred to as a highly heterogeneous species, and splitting it into two species (38) or subspecies (41) has been proposed.
Multilocus sequence typing (MLST), a method that is based on partial nucleotide sequences of multiple housekeeping genes, has recently been shown to be a powerful technique for bacterial typing (11). Housekeeping genes are preferred because an analysis of mutations in such genes is more likely to properly reflect the phylogeny of strains. MLST uses variation that accumulates slowly, which is expected to be selectively neutral, and achieves very high resolution by analyzing multiple loci. The MLST method was introduced in 1998 and since then has been used for phylogenetic analysis of many bacterial pathogens such as Neisseria meningitidis (31), Streptococcus pneumoniae (10), Haemophilus influenzae (35), etc. MLST has led to a better understanding of the mode of genetic variation within a bacterial species. Since 1998, the MLST scheme has been applied to important bacterial pathogens including several food-borne human pathogens such as Campylobacter jejuni (8), Vibrio cholerae (14), and Bacillus cereus (23); however, until now, MLST had not been applied to a nonpathogenic food production bacteria used in the food industry, where precise methods for characterizing isolates are required as well.
The present study was undertaken to evaluate the discriminatory power of MLST in O. oeni. The results of our analysis indicate that the O. oeni population exhibits high genetic diversity close to linkage equilibrium with a panmictic population structure. This study also constitutes the first step for the development of an MLST method for O. oeni. The sequence diversity of five genes, possible candidates to be included in a future MLST scheme, was analyzed.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Properties of O. oeni isolates analyzed and their allele profiles at each locus
|
Ribotyping.
Chromosomal DNA was digested with HindIII and EcoRI (Roche), and the products were separated by electrophoresis in 0.7% agarose gels in 1x Tris-acetate-EDTA buffer.
Digested DNA was transferred onto positively charged nylon membranes (Roche) by the Southern method. Probe 16S rDNA was obtained from O. oeni CECT 4100T by PCR by using the eubacterial universal pair of primers 63f and 1387r (32). The 16S rDNA probe was digoxigenin labeled and detected by chemiluminescence by using a DIG-High Prime DNA Labeling and Detection Starter Kit (Roche) according to the manufacturer's instructions.
PCR amplification and DNA sequencing.
Since the genome sequence of O. oeni had not been published when this study was performed, we identified a number of candidate loci by searching the O. oeni genome database (http://genome.jgi-psf.org/draft_microbes/oenoe/oenoe.home.html) with gene sequences from other bacteria. The following four loci were chosen for the sequence analysis scheme: gyrB (coding for the B subunit of DNA gyrase and located on contig NZ_AABJ02000001.1), pgm (coding for phosphoglucomutase and located on contig NZ_AABJ02000004.1), ddl (coding for D-alanine-D-alanine ligase, located on contig NZ_AABJ02000021.1), and recP (coding for transketolase and located on contig NZ_AABJ02000005.1). These genes were selected based on the criterion that they are presumptively widely separated on the chromosome. A gene (mleA) coding for the malolactic enzyme, an enzyme of technological interest in O. oeni, was also included in the study (Table 2). The mleA gene is located on contig NZ_AABJ02000005.1 but 85 kb apart from the recP gene in the O. oeni strain MCW chromosome. Primers were designed by using highly conserved DNA regions of these genes for O. oeni obtained from the GenBank database, with the exception of mleA, for which we used primers previously described by Divol et al. (9).
|
View this table: [in a new window] |
TABLE 2. Primers used for MLST of O. oeni strains
|
Phylogenetic analysis.
For each locus, the sequences obtained for all isolates were compared, and the different sequences were assigned arbitrary allele numbers. For each isolate, the combination of alleles obtained at each locus defined its allelic profile (Table 1). Each isolate was therefore designated by five numbers, constituting an allelic profile or sequence type (ST). We refer to a unique combination of alleles as an ST. The STs were identified by arbitrary numbers assigned in order of description.
Sequence alignments and comparison were done with the program BioEdit, version 4.8.10 (http://jwbrown.mbio.ncsu.edu/BioEdit/bioedit.html) (21), and converted into MEGA and NEXUS files with START (sequence type analysis and recombinatorial tests) (http://outbreak.ceid.ox.ac.uk/software.html). Phylogenetic trees were compiled with MEGA version 2.1 software (http://www.megasoftware.net) by using the unweighted pair group method with arithmetic averages (UPGMA) (28).
The method of split decomposition was used to assess the degree of tree-like structure present in the alleles found for each locus in the complete set of 18 isolates (24). The sequence alignments were converted to NEXUS files, and the split decomposition was performed with SplitsTree 2.0 (http://bibiserv.techfak.uni-bielefeld.de/splits/).
Three types of statistical analysis were applied to the data: the index of association (IA), homoplasy test, and Sawyer's run test. Multilocus linkage disequilibrium was estimated by measuring IA (34). The homoplasy test (33) and the Sawyer's run test (40) were performed according to methods described previously with the START program. Using the START program, we also performed a test to detect selection in our population, and we calculated the dN/dS ratio as described by Nei and Gojobori (37).
Nucleotide sequence accession numbers.
The sequences of all alleles have been deposited in the EMBL and GenBank databases under the accession numbers AJ618989 to AJ618996 (gyrB[r] fragment), AJ618997 to AJ619006 (ddl fragment), AJ619007 to AJ619018 (pgm fragment), AJ619666 to AJ619673 (recP fragment), and AJ61974 to AJ619675 (mleA fragment) (Table 2).
|
|
|---|
Ribopattern analysis with EcoRI or HindIII revealed three bands for all O. oeni strains. Two different groups of strains were found, corresponding to groups defined previously by Zavaleta et al. (47). Ribotype A showed bands of 1.6, 2.5, and 4 kb for EcoRI and 1, 5, and 12 kb for HindIII; ribotype B showed bands of 1.6, 2.5, and 8 kb for EcoRI and 1, 5, and 6 kb for HindIII. Table 1 lists the assignment of ribotype group A or B to each O. oeni strain. The ribotyping patterns of the EcoRI digest are shown in Fig. 1.
![]() View larger version (75K): [in a new window] |
FIG. 1. Ribotyping patterns of EcoRI digest of chromosomal DNAs from O. oeni strains. Lane 1, CECT 4028; lane 2, CECT 4029; lane 3, CECT 4100; lane 4, CECT 4721; lane 5, CECT 4725; lane 6, CECT 4728; lane 7, CECT 4758; lane 8, BIFI-1; lane 9, BIFI-9; lane 10, BIFI-21; lane 11, BIFI-26; lane 12, BIFI-86; lane 13, 5001; lane 14, Uvaferm ALPHA; lane 15, Uvaferm MLD; lane 16, Viniflora OENOS; lane 17, Viniflora CH35. The molecular sizes (in kilobases) of the labeled fragments are indicated on the left.
|
|
View this table: [in a new window] |
TABLE 3. Sequence variation at five loci
|
![]() View larger version (33K): [in a new window] |
FIG. 2. Polymorphic sites in each of the five gene fragments studied. Each of the sites where the sequence of one or more of the genes differs from a putative consensus sequence is shown (only sites that differ are shown; sites that are identical to those in the consensus sequence are indicated by periods). The number of strains possessing the allele is indicated in parentheses. Numbering of the polymorphic sites (vertical format) is from the first nucleotide position of the corresponding gene. Asterisks indicate informative polymorphic sites.
|
Table 1 summarizes the allelic profiles of the O. oeni strains used in this study. Each unique combination of allele numbers represents one ST. All STs were represented by a single strain. No strains had identical sequences for all five fragments, and all the strains could be distinguished from each other because they had unique combinations of alleles. Isolates that had been shown to be closely related by using highly discriminatory typing methods, such as O. oeni 51 and BIFI-21, which were isolated from the same wine fermentation and showed identical RAPD patterns, had identical sequences for all the gene fragments analyzed. Moreover, the stability of the gene sequences in O. oeni strains was checked by MLST analysis. This analysis confirmed that the gene sequences analyzed remain unaltered during generations.
The allelic profiles and a UPGMA tree based on these allelic profiles are shown in Table 1 and Fig. 3, respectively. All STs differed in various loci, except ST-11 and ST-13, which differ only in one locus. There were no significant clusters that correlated with the geographic origin of the strains. Concatenated gyrB, ddl, pgm, recP, and mleA gene sequence fragments were analyzed and graphically displayed with SplitsTree (Fig. 4). The relationship of O. oeni isolates based on this analysis is depicted as a star-like structure with rays of different lengths. This star phylogeny is also consistent with a recombinational population structure and with the observation that each strain carries its own allelic combination. Isolates O. oeni CECT 4028, CECT 4721, CECT 4725, CECT 4728, and 5001 are more closely related than the other strains and their tree branches are interconnected, suggesting recombinational events between them. Consistent with the lineage assignment by BURST, this placed the predicted O. oeni founder strains (CECT 4725, Uvaferm ALPHA, and Viniflora OENOS) at a central position of the split graph. The relationships among other members of the group were assessed by examining the number of nodes between two isolates.
![]() View larger version (22K): [in a new window] |
FIG. 3. UPGMA dendrogram showing the genetic relatedness of the 18 O. oeni strains examined in this study. The dendrogram was constructed from a matrix of the pairwise distances between the allelic profiles of 18 O. oeni strains.
|
![]() View larger version (7K): [in a new window] |
FIG. 4. Split decomposition analysis based on the allelic profiles of the 18 O. oeni strains examined in this study. The numbering refers to strain numbers.
|
Comparative results of ribotyping and RFLP-ISR versus MLST.
Only one RFLP-ISR type was identified among the 18 O. oeni strains analyzed in our study; similarly, the same strains were included in only two ribotypes. However, the number of sequence types (18) was larger (Table 1). This observation suggests that the discriminatory ability of MLST is better than that observed by the use of ribotyping and RFLP analysis of the ISR. A specific ribotype was not restricted to specific STs, and, for example, 11 STs contained isolates with the same ribotype B. These results show clearly that ribotype does not correlate with sequence type (Table 1).
Evidence for recombination in O. oeni.
Bacteria can differ widely in their population structures. While some of them have a clonal population structure, in which all sequence diversity has arisen from the sequential accumulation of point mutations, in many other sequences diversity is greatly increased by intraspecies recombination. The sets of sequences were tested with the homoplasy test (33). The homoplasy test requires a sufficient number of informative sites to yield interpretable results, and if a locus has fewer than 10 informative sites, the analysis will not be performed on it. In all of our data sets, there was insufficient sequence diversity to perform the homoplasy test.
There were two possible examples of a recombinational event in the gyrB and recP genes from unknown sources. The mean divergence between allele 7 of recP (6.66%) is much higher than the mean diversity within the other recP alleles (0.37%), with the exception of allele 3 (3.61%). Alleles 7 and 3 of recP seem to have originated from a different source. In gyrB, allele 3 showed a divergence of 6.42%, and alleles 2, 4, and 5 showed a mean diversity of 4.28%. Therefore, in gyrB alleles 1, 6, 7, and 8 (mean diversity, 1.28%) could have originated from a different source from alleles 3, 2, 4, and 5.
Sawyer's test revealed no detectable cases of intragenic recombination in the sample except in the case of the recP locus, in which there was evidence against the null hypothesis of no recombination (P = 0.04) (Table 4). This evidence disappeared when the maximum condensed fragment value (P = 1.00) was considered.
|
View this table: [in a new window] |
TABLE 4. Sawyer's test analysis for evidence of intragenic recombinationa
|
Comparison between gene trees.
In order to further analyze the population structure of O. oeni, we used split decomposition analysis. The algorithm used in this software is able to display conflicting results in the phylogenetic descent of sequences. A tree-like structure is created when the descent is clonal, but an interconnecting network or a bush-like structure will appear when recombination plays a role in the evolutionary history of O. oeni genes. Figure 5 shows the split graphs for all alleles of the five fragments analyzed. The structure of the split graphs varied substantially between the different loci. The SplitsTree graphs obtained with ddl, pgm, and recP loci present network-like structures. This indicates the presence of homoplasies, probably evolved by intragenic recombination. The split graphs of the other two loci showed no evidence of network-like evolution. The split graph of the gyrB gene displays a star- or bush-like structure consisting of a single origin in the center of the graph, from which single branches radiate. However, in gyrB an additional uncentered edge was observed, suggesting that the evolution of some of the gyrB genes has been initiated by a couple of parallel mutations originating from one ancestor. The split graph of the mleA gene displays a line because only two alleles are analyzed. The fit parameter was 100 for gyrB, ddl, recP, and mleA, indicating that all phylogenetic information in the sequences could be visualized in the graphs. The fit parameter for pgm was lower (40.4), indicating that not all information could be integrated into the graph. The differences in structure among the split graphs obtained for the five loci can be explained by recombination, because recombination can lead to the assembly of genes with different evolutionary histories within one strain.
![]() View larger version (12K): [in a new window] |
FIG. 5. Split decomposition analysis of alleles obtained from 18 O. oeni strains for five loci. All branch lengths are drawn to scale. The observation that in the ddl, pgm, and recP graphs several alleles in the sample are connected to each other by multiple pathways, forming an interconnected network, is suggestive of recombination. The numbering refers to allele numbers.
|
|
|
|---|
The internal fragments of the five loci that were selected could be amplified from all strains that we examined. The amplified internal fragments were sequenced, and from these sequences we were able to use fragments between 339 and 554 bp for analyses (Table 3). Only two alleles were found in the mleA locus (Tables 1 and 2 and Fig. 2). The lack of diversity among the mleA locus sequences results in less discriminatory power than found in the housekeeping loci. It was previously shown that the partial sequence of the O. oeni ATCC 23279T (CECT 4100) mleA gene was also exactly identical to the mleA gene sequence of O. oeni IOEB 8413. The authors postulated that the malolactic enzyme (MleA) appears to have an identical sequence at species level. They found that the percentage of changes in sequences between O. oeni and other LAB was lower for mleA than for 16S rRNA (19).
The number of alleles per housekeeping locus ranged from 8 to 12 (Table 3). The four loci were highly polymorphic, and all multilocus types were represented by a single strain. The percentage of variable sites (1.7 to 6.6%) in the housekeeping genes was comparable to that seen by Enright et al. (12) in their analysis of a related gram-positive bacteria, Streptococcus pyogenes. The percentage of variable sites was higher than that seen in group B streptococcus (1.2 to 2.5%) (25) and V. cholerae (1.1 to 3.5%) (14) and considerably less than that of C. jejuni (9.2 to 21.7%) (8) or the B. cereus group (9 to 26%) (23).
The percentage of variable sites observed in O. oeni indicates a considerably high degree of genetic diversity, and it is not consistent with some data obtained by other experimental methods. The analysis of ribotyping and RAPD patterns has revealed a high level of genetic homogeneity in O. oeni (47). This homogeneity is consistent with the results of DNA-DNA hybridization (7), total soluble cell protein analysis (6), and 16S-23S ISR sequences (46). The RAPD profiles discerned two main groups of strains coincident with clusters obtained by macrorestriction typing in previous work (41). Ribotyping of the 18 O. oeni strains analyzed in this study indicated that all strains are included in only two ribotypes, A and B, previously described by Zavaleta et al. (47). The study by Zavaleta et al. (47) included 37 culture collection strains, 17 of which fell in group A (which includes CECT 4100T), and 20 fell in group B (represented by CECT 4028 and CECT 4029). The investigators concluded that the two well-defined and consistent groups of strains shown by all these methods are indicative of two distinct main patterns of DNA arrangement in the genome of the species O. oeni. However, by using MLST we could not observe the two well-defined groups of strains previously described.
Although more extensive studies are necessary to assess the population structure of O. oeni, the UPGMA tree based on the allelic profiles (Fig. 3), the split decomposition analysis of concatenated gene sequences (Fig. 4), and the IA values were consistent with the conclusion that O. oeni represents a good example of panmictic structure. Panmictic populations (as found in Neisseria gonorrohoeae) may be so variable that identical strains are only found among isolates from direct contacts. Even panmictic populations can contain clonal groupings or geographical specialization (34). This was not the case in our study, but the number of isolates analyzed here is too low to allow a definitive conclusion. In previous studies based on RAPD, ribotyping, small-plasmid content, and sequencing of RAPD markers, Zavaleta et al. (47) hypothesized that O. oeni follows a closely clonal mode of evolution, similar to the case of some pathogenic bacterial clones such as those of V. cholerae. However, the analysis of the population structure of O. oeni presented here shows a substantial extent of recombination. In Porphyromonas gingivalis, Frandsen et al. (16) and Koehler et al. (27) found ISA values of 0.068 and 0.089, respectively, and suggested a nonclonal population structure characterized by recombination. In our study we calculated a similar ISA value (0.038) that confirms the importance of recombination in O. oeni. Moreover, the ISA value calculated is similar to that reported for N. gonorrohoeae and, therefore, supports our estimation that the genes investigated in O. oeni are close to linkage equilibrium.
The examination of the sequences of housekeeping genes can provide evidence for the significance of recombination, since the variation within these genes is likely to be selectively neutral. The homoplasy test (33) measures the importance of recombination between members of a population. It is only valid when sequences differ by <5% of the nucleotides, and the test requires a sufficient number of alleles and informative sites to yield interpretable results. In our case, all loci analyzed had insufficient numbers of alleles and informative sites to perform this analysis.
Sawyer's test did not detect clear evidence of recombination in any of the loci analyzed (Table 4). In our analysis this test failed to detect recombination, in a similar way as described previously by Farfán et al. (14) for V. cholerae. These authors suggest that the reason for this disagreement may reside in the definition of condensed fragments themselves, which are designed to detect gene conversion in which both source and target sequences are in the sample. From our data gyrB, pgm, and recP showed a net structure in the split graph, suggesting the existence of recombination.
Falush et al. (13) suggested that the panmictic structure in H. pylori may result from frequent recombination during mixed colonization by unrelated strains during chronic colonization of the gastric mucosa. Recombination between O. oeni strains might occur on the fermentation tanks, where several strains of O. oeni may exist simultaneously in a single fermentation and where a favorable environment for horizontal gene transfers could be created. Although it has been shown that O. oeni is able to receive foreign DNA by transformation in vitro (5) and by conjugation (49), there is no evidence for horizontal gene transfer in vivo. The existence of recombination is not unexpected, because the presence of plasmids (1, 46), bacteriophages, and insertion sequences (48) was previously shown in O. oeni. Zé-Zé et al. (48) reported that small differences observed in two O. oeni strains are apparently due to insertion or deletion events related to the presence of insertion sequences.
Recombination can be detected in the aligned sequences by a number of means, the most simple method being the detection either by eye or through statistical analysis of mosaic structure, where different regions of the gene appear to have different evolutionary histories. For example, most of a gene may be identical in sequence for two isolates of a species, whereas a 500-bp region in the middle may differ at 5% of nucleotide sites. Significant mosaic structure is indicative of recombinational exchange, usually among isolates of the same species but occasionally also between closely related species (15). A point mutation will generate a single nucleotide difference, whereas a recombinational exchange is likely to introduce multiple nucleotide differences. Enright and Spratt (10) found in S. pneumoniae a possible example of an interspecies recombinational event in gdh as the mean divergence between an allele and the other pneumococcal alleles was 4.58%, which was much higher than the mean diversity within the other pneumococcal alleles (0.95%) and similar to the mean divergence between these alleles and that of Streptococcus mitis (5.16%). In our study, two possible examples of recombinational events from unknown sources were found. The mean divergence between allele 7 of recP (6.66%) is much higher than the mean diversity within the other recP alleles (0.37%), with the exception of allele 3 (3.61%) (Fig. 2). This divergence is higher than the divergence observed by Enright and Spratt (10) in a possible example of an interspecies recombinational event in the gdh gene between S. pneumoniae and S. mitis (5.16%). In O. oeni, the recP gene of strain 5001 represents a possible example of a recombinational event, but in this case, the possible source remains unknown. As far as we know, this is the first example of a possible recombinational event described in O. oeni. The extension of the analysis to a higher number of strains might offer information about the possible source of this allele.
In summary, our data suggest strongly that the four housekeeping loci chosen were a suitable basis for an MLST scheme, as they could be amplified and sequenced from isolates obtained from a wide variety of sources, were unlinked on the O. oeni chromosome, and exhibited sufficient diversity to provide a high degree of resolution. This study also demonstrates that MLST discriminates among O. oeni isolates effectively and generates data that can be applied to the investigation of the population structure and evolutionary mechanism in this important food organism.
We thank R. González, A. V. Carrascosa, and E. García for their critical reading of the manuscript. The help of A. V. Carrascosa in providing the commercial starter strains was greatly appreciated.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»