Previous Article | Next Article ![]()
Applied and Environmental Microbiology, April 2009, p. 2079-2090, Vol. 75, No. 7
0099-2240/09/$08.00+0 doi:10.1128/AEM.02194-08
Copyright © 2009, American Society for Microbiology. All Rights Reserved.
,
Arnaud Delaherche,1,3,
Eric Bilhère,1,3
Antoine De Daruvar,1,4
Aline Lonvaud-Funel,1,3 and
Claire Le Marrec1,3*
Université de Bordeaux, Bordeaux, France,1 Université Bordeaux 2-LaBRI (CNRS-Université Bordeaux 1), 351 Cours de la Libération, 33405 Talence Cedex, France,2 UMR INRA 1219-Institut des Sciences de la Vigne et du Vin (ISVV) (Université Victor Segalen Bordeaux 2-Université Bordeaux 1), 210 Chemin de Leysotte, 33882 Villenave d'Ornon Cedex, France,3 CBiB, Université Victor Segalen Bordeaux 2, 146 rue Léo Saignat, 33076 Bordeaux Cedex, France4
Received 22 September 2008/ Accepted 2 February 2009
|
|
|---|
|
|
|---|
Oenococcus oeni has been described as a relatively homogeneous species, and a variety of molecular approaches failed to clearly differentiate strains on a molecular level (23, 39, 45, 48, 49). Discrimination was accomplished only recently, using fine techniques, such as differential display PCR, restriction endonuclease analysis-pulsed-field gel electrophoresis (18, 22, 50), and multilocus sequence typing (7, 8). Multilocus sequence typing clearly demonstrated an unexpectedly high level of allelic diversity in O. oeni. This was recently partially elucidated by Marcobal et al. (26), who demonstrated the hypermutable status in the genus, due to the absence of the mismatch repair (MMR) genes mutS and mutL. As reported for other bacteria, the lack of mut genes results in the accumulation of spontaneous errors in DNA replication, or in reduced stringency in recombination, thus generating high levels of polymorphism (35). More interestingly, the lack of MMR also facilitates the generation of isolates with beneficial mutations, resulting in increased fitness for the environment. This point is of tremendous importance in selecting commercial O. oeni starters. The considerable phenotypic variation in resistance to wine conditions among indigenous O. oeni isolates coexisting in tanks in the early stages of fermentation is well established and was simply thought to account for the unpredictability of MLF. Commercial starter strains are now widely used in winemaking for greater control over this conversion. However, although progress has been made in selecting and preparing O. oeni strains, commercial starters for direct inoculation in wines are not always effective (6).
A comprehensive catalogue of the genomic differences contributing to plasticity in O. oeni populations is now essential to provide relevant data on fundamental aspects of its evolution. A further aim is to identify genetic markers predictive of enological performance to guide industrial strain selection strategies. A first step in this direction identified correlations between enhanced enological properties and genomic variations in nine target genes (7). Another interesting avenue to explore is the possible contribution of high-level polymorphism to fitness. Genetic recombination creates rearrangements (deletions, duplications, and inversions) and mediates the integration of horizontally acquired mobile elements, such as plasmids, bacteriophages, genomic islands, and transposable elements. The introduction of genes with novel functions into genomes or their replacement with functionally similar orthologs or paralogs is recognized as an important mechanism for adaptation to specific niches in prokaryotes and LAB in particular (24, 25, 32).
The major aim of this study was to explore the genome diversity in O. oeni populations and extend our knowledge of the global gene repertoire of this species. Comparative genome analyses using subtractive hybridization (SH) were used to identify genetic differences between two O. oeni isolates with different tolerances for the wine environment. These data were combined with the publicly available genome sequence of the O. oeni PSU-1 strain, as well as that of the ATCC BAA-1163 strain, completely sequenced in our laboratory (A. Lonvaud, J. Guzzo, et al., unpublished data). We present the first comparative genomic analysis of O. oeni, showing great genetic diversity among the various strains and highlighting the role of insertion/deletion (indel) events in creating diversity. Furthermore, we propose two hypotheses: variable gene distribution may be related to enological potential and some of these sequences have been acquired on small islets through horizontal gene transfer (HGT).
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. O. oeni strains used in this work and their originsa
|
Genomic subtraction and cloning of fragments.
The isolation of genomic DNA and genomic subtraction was performed through the Genome Express proprietary protocol (Cogenics-Genome Express S.A., Meylan, France), adapted from the initial Straus and Ausubel SH protocol (43) in which strains IOEB Sarco 1491 and IOEB 8413 were used as the tester and driver, respectively (named hereafter SH-1491). The restriction enzymes Sau3A and RsaI were used for the digestion of genomic DNA. Similarly, the opposite SH experiment was conducted, in which strains IOEB 8413 and IOEB Sarco 1491 were used as the tester and driver, respectively (SH-8413). Subtractive PCR probes originating from SH-1491 and SH-8413 were amplified, respectively, from 1,508 clones and 1,403 clones (see Table S1 in the supplemental material) and sequenced in a single pass using the M13 primer according to standard protocols. All sequences were performed on Amersham Pharmacia Biotech Mega BACE4000 DNA sequencers.
Bioinformatics analysis.
(i) A set of 786 filtered sequences (466 and 320 for the Sau3AI and RsaI tester digestions, respectively) with a mean size of 616 ± 172 bp was provided by SH-1491 and assembled using the phred/phrap programs (14) with their default parameters. A total of 127 contigs and 55 singlets with mean sizes of 641 bp and 415 bp, respectively, were obtained from both digestions (see Table S1 in the supplemental material), totaling about 104 kb (i.e., 5.8% of the genome size) and with a GC content of 38.7%. The sequences of the 182 SH-1491 genomic fragments were then tested for their strain specificities, annotated, and ordered to reconstitute the original genome through an in-house bioinformatics strategy.
(ii) The 132 sequences obtained from SH-8413 were first trimmed of vector sequences and then tested by BLASTN and BLASTX cross-searches against the genome sequence of O. oeni ATCC BAA-1163 (not yet public at the time of this writing; a draft version has been released by A. Lonvaud, J. Guzzo, and coworkers under the GenBank accession number AAUV00000000.1). Since the 132 SH-8413 sequences were 99.8% identical to their ATCC BAA-1163 counterparts, both strains were then considered closely related isolates and made the complete genome sequence of strain ATCC BAA-1163 a useful reference for comparative analyses and the rapid and efficient detection of false positives. SH-1491 sequences exhibiting more than 93% identity to their ATCC BAA-163 counterparts were considered false positives and removed from the SH-1491-specific set. Because some of these discarded sequences may also reflect additional gene copies, i.e., paralogous genes in the IOEB Sarco 1491 strain with respect to the control strain, this collection of sequences was kept for further bioinformatic investigations.
(iii) The rest of the specific SH-1491 sequences were submitted to syntactic and functional annotation. BLASTX homology searches were performed against the nonredundant GenBank protein sequence database available at NCBI, and best BLAST hits (BBH) were recorded. A total of 14 SH-1491 orphan sequences, i.e., without any detectable homologs, were set apart and not submitted to the rest of the procedure. Mobile elements were identified using the ISfinder database (41) and manual expertise.
(iv) Because two restriction enzymes were used during SH, some genetic elements needed to be locally microassembled to reconstitute their original architecture as well as to reconstitute a given region. The analysis was completed with the aid of the genome annotation data from O. oeni PSU-1 strain (CP000411), Lactobacillus plantarum WCFS1 (AL935263), and Streptococcus agalactiae A909 (NC_007432). Sequences were orientated and ordered along the corresponding homolog sequences using either T-COFFEE (31) or CAP2 (19). The presence of at least three consecutive genetic elements was taken as a criterion to define the presence of a plasticity region in the IOEB Sarco 1491 O. oeni strain. All details are given in Tables S1 in the supplemental material.
Genomic PCR.
The validation of each subtracted sequence as tester specific was performed by PCR, using the DNA of the tester and driver strains as well as the DNA of the ATCC BAA-1163 strain as controls. PCR was also used to (i) assess SH-1491 sequence distribution in our collection of O. oeni isolates, using internal primers, and (ii) characterize the presence and gene arrangement in the plasticity regions when necessary. The primer design was achieved using the eprimer3 and Oligo Analyser 1.0.3 software. Oligonucleotides were purchased from Sigma-Aldrich. They are described in Table 2.
|
View this table: [in a new window] |
TABLE 2. Primers used in this study
|
Reverse transcription- and real-time PCR.
cDNA was synthesized using the iScript cDNA synthesis kit (Bio-Rad) as recommended. The 25-µl reaction mixture (4 µl of 5x iScript reaction mix, 1 µl of iScript reverse transcriptase, and 1 µg of RNA) was incubated for 5 min at 25°C, 30 min at 42°C, and 5 min at 85°C. Real-time PCR was carried out on a Bio-Rad iCycler with the iQ SYBR green supermix (Bio-Rad) in 96-well plates. A volume of 5 µl containing 10 pg of cDNA was added to 20 µl of PCR mixture (12.5 µl of iQ SYBR green supermix, 0.05 µl of each primer at 100 µM, and 6.4 µl of RNase-free water). Specific cDNA was amplified by real-time PCR with specific primers that were designed based on internal regions of the stress-responsive omrA gene (3) and of the eight SH fragments of interest identified in our study (see Table S1 in the supplemental material). In each run, negative and positive controls were included. Thermal cycling conditions were designated as follows: the reaction mixture was preheated for an initial denaturation at 95°C for 5 min, and 30 cycles were then carried out, each consisting of a denaturation step (30 s, 95°C), an annealing step (20 s, 52°C), and an extension step (20 s, 72°C). For each measurement, the threshold cycle value was determined with a baseline set manually at 100 relative fluorescence units (RFU). The differences in gene expression between the M-MRS standard condition and wine mimetic stresses (1-h shock in ethanol, acid, or heat) were analyzed using the comparative critical threshold method, in which the amount of target RNA is adjusted to a reference relative to an internal calibrated target RNA as previously described (13). The results were normalized by using the ldhD gene coding for lactate dehydrogenase, as its transcript level is weakly affected by the three tested stress conditions (10). Manipulations were done in duplicate. We considered that genes were significantly down- or upregulated if their relative expression levels were found to be at least twofold lower or higher than that of the control reference (10, 13).
Statistical analysis.
A chi-square test was computed considering the number of positive PCR amplifications obtained in the strain collection for each couple of primers and the total numbers of strains in each group of enological potential. A sequence was considered statistically overrepresented in the HP group if the chi-square test was significant at the 5% level.
Nucleotide sequence accession numbers.
The nucleotide sequence data corresponding to regions 4, 5, and 6 from the IOEB Sarco 1491 strain have been deposited under the accession numbers FJ625794, FJ613272, and FJ613273, respectively.
|
|
|---|
|
View this table: [in a new window] |
TABLE 3. Identification of sequences obtained by SH experiments
|
SH-1491 also revealed a second group of 45 sequences (29%) with best BLASTX matches outside the species, rather than with previously sequenced O. oeni genes/proteins (Table 3; see also Table S3 in the supplemental material). These novel gene sequences represented approximately 2% of the total number of ORFs in the genomes of O. oeni PSU-1 and ATCC BAA-1163 and were, therefore, added to the global gene repertoire of O. oeni. This group contained 29 sequences corresponding to chromosomally encoded proteins, mainly dedicated to sugar transport and metabolism, particularly incomplete transport systems for cellobiose (phosphotransferase system [PTS] subunit A) and fructose (PTS IIABC). The remaining novel sequences consisted of 14 phage- and two transposon-associated genes. The BBH of the 45 alien sequences were associated with other LAB usually present in wine environments, such as lactobacilli (55%), Pediococcus pentosaceus, and Leuconostoc mesenteroides (Fig. 1).
![]() View larger version (20K): [in a new window] |
FIG. 1. Origins of the BBH of the 45 alien sequences identified in O. oeni IOEB Sarco 1491. Lm, L. mesenteroides; Lp, L. plantarum; L, lactobacilli; P. pento, P. pentosaceus; GEs, genetic elements.
|
L10 (n = 3) and
10MC (n = 1) (16, 17, 38, 44). Another 14 phage-related sequences had best BLASTX matches (>30% identity) with structural proteins from phages infecting other LAB: a
C31-like phage (7 matching sequences),
AT3 (n = 4),
Sfi19 (n = 1),
bil309 (n = 1), and
Lp2 (n = 1), infecting L. mesenteroides, Lactobacillus casei, Streptococcus thermophilus, Lactococcus lactis, and L. plantarum, respectively. There was only one exception: a sequence matching a replisome protein from a more distantly related phage, infecting Clostridium beijerinckii (Fig. 1).
Among the phage-related sequences identified above, we focused on a DNA fragment related to a hypothetical protein from
A2, infecting L. casei, and found an orthologous sequence in the genome of strain PSU-1 (OEOE_0686). In this strain, the phage-related sequence was located upstream from four ORFs corresponding to a putative DNA invertase gene, recently identified on plasmids isolated from S. thermophilus (15) and Lactobacillus paracasei (9), and three putative cell surface proteins, whose sequences have few BLAST hits in the LAB family (Fig. 2). Remarkably, sequences corresponding to the latter three sequences were also subtracted from the genome of the IOEB Sarco 1491 strain (Table 4). The sizes and organizations of the locus referred to as region 1 were the same in the genomes of both PSU-1 and IOEB Sarco 1491 isolates, as revealed by an analysis of the 6.4-kb amplicon obtained from a PCR with flanking primers reg1F and reg1R, based in ORFs OEOE_0684 and OEOE_0691, respectively. Only a 778-bp amplicon was obtained from the genomic DNA of both IOEB 8413 and ATCC BAA-1163, indicating that region 1 was absent from these isolates. To better characterize this indel event, we compared the corresponding 778-bp intergenic region from the ATCC BAA-1163 strain with the boundaries of region 1 in the PSU-1 strain. Region 1 was, therefore, located between genes coding for a tRNA lysine and a hypothetical protein (Fig. 2). This analysis also led to the identification of (i) two imperfect
50-bp direct repeats (DR) flanking region 1 and (ii) a third copy in the 778-bp intergenic region in ATCC BAA-1163 (Fig. 2). Interestingly, the leftward DR was just downstream from the core attB site proposed for fOgPSU-1 prophage establishment in the PSU-1 strain (38). Despite the presence of five fOgPSU-1-related sequences among the O. oeni IOEB Sarco 1491-specific genes, amplification of a 6.4-kb amplicon with reg1F and reg1R primers showed that no lysogenization by any fOg-related phage occurred at this particular attB site in this isolate. A fOgPSU-1-related element is, therefore, likely to be integrated at another site in the IOEB Sarco 1491 strain. The existence of multiple loci as targets for prophage integration has been demonstrated in O. oeni (38). One potential target is attBfog44, located nearby, downstream from OEOE_0696. Considering all these data, the chromosomal region under investigation is, therefore, likely to be a spot of high prophage integration in O. oeni.
![]() View larger version (29K): [in a new window] |
FIG. 2. Organization of region 1, a possible tract of phage origin. (A) Schematic representation of region 1 in the IOEB Sarco 1491, PSU-1, and ATCC BAA-1163 strains. The core attB site used by fOgPSU-1 and overlapping the tRNA gene (OEOE_t0685) is represented by a black box. The gray hatched boxes represent the putative junctions (PSU-1) and core attachment sites (ATCC BAA-1163), considered to characterize the site-specific integration of region 1 in the O. oeni genome. (B) Alignments of the putative left, right and bacterial attachment sites found in the PSU-1 and ATCC BAA-1163 strains, respectively. DRL, leftward DR; DRR, rightward DR.
|
|
View this table: [in a new window] |
TABLE 4. Characteristics of the six plasticity regions identified in this study and prevalence of selected specific genomic sequences in a collection of strains with LP, IP, or HP to induce MLF
|
Other plasticity regions.
A total of 33 additional SH-1491 sequences (absent or highly divergent in the driver) mapped to five distinct plasticity regions, in which synteny was conserved in O. oeni (PSU-1 and/or ATCC BAA-1163) or other LAB (Table 4).
A set of 10 SH-1491 sequences mapped to region 2 (30 ORFs;
30 kb) in the genome of the PSU-1 strain (Table 4). This region, located between the OEOE_0060 and OEOE_0091 genes, is presented in Fig. 3. It consists of a variety of genes, including those coding for proteins involved in carotenoid biosynthesis. Various genes dedicated to sugar transport and metabolism are also interspersed in the region, including those encoding a galactoside acetyltransferase and a putative arabinose permease. The region also contains genes encoding hypothetical proteins whose functions have yet to be elucidated. Region 2 as a whole was absent from the genome of the ATCC BAA-1163 strain, replaced by a different set of 11 genes, including seven encoding hypothetical proteins (Fig. 3). PCR analysis revealed the distribution of seven subtracted sequences, indicating the prevalence of region 2 among our strain panel of 79 isolates. Positive strains were in the 73 to 85.9% range (Table 4). The GC content percentage of the region was characteristic of the O. oeni genome as a whole. Last, no common hallmarks of laterally transferred genes (unusual GC bias, anomalous codon usage, presence of tRNA genes at the 3' end, tendency to be flanked by DR, and inclusion of mobile genetic elements) were found in its vicinity in strain PSU-1. Region 2 may, therefore, result from an orthologous replacement event, occurring during the evolutionary history of the species (32).
![]() View larger version (27K): [in a new window] |
FIG. 3. Organization of region 2 in O. oeni PSU-1 and ATCC BAA-1163. Black arrows indicate genes common to O. oeni PSU-1 and ATCC BAA-1163, white arrows indicate genes specific to PSU-1, and hatched arrows indicate genes specific to ATCC BAA-1163. , pseudogene; , SH-1491-specific sequences.
|
Three additional plasticity regions (4, 5, and 6) were detected. They differed from the above loci as they harbored gene sequences with deduced protein sequences displaying BBH outside O. oeni. The most interesting of these was region 4, which comprised 11 ORFs in the IOEB Sarco 1491 strain, including four with a BBH in L. plantarum (Fig. 4). The locus, presenting a GC bias, was sandwiched between two copies of IS30, whose sequences were also partially subtracted. In addition, part of the intervening sequence exceeded 90% identity with L. plantarum DNA, while synteny and gene order were conserved between the corresponding orthologous regions (Fig. 4). All these data indicate a recent HGT event (Fig. 4). Identifiable functions corresponded to aquaglyceroporin (GlpF) and P1-type ATPase (CadA) dedicated, respectively, to the efflux of water and cadmium (Cd2+) in bacteria. The next gene corresponded to trxB, the second thioredoxin gene identified in O. oeni (20). Thioredoxins are key players in maintaining intracellular protein disulfides in reduced form. Region 4 also harbored a gene coding for a Dps-like protein in the mini-ferritin family. These proteins are particularly important during oxidative stress or the depletion of environmental nutrients (46). The next genes were a putative copper chaperone and a transcriptional regulator in the fumarate-nitrate-reductase (FNR) transcription regulator family. FNR-like regulators respond to a broad spectrum of intracellular and exogenous signals, including cyclic AMP, anoxia, redox state, oxidative stress, and temperature (21). Region 4 was present in the commercial O. oeni PSU-1 genome but displayed slight modifications at its leftward extremity. ORF1 and ORF2 were missing, while the left IS30 was replaced by an IS1223-related element (Fig. 4). IS1223 elements belong to the IS3 family and were recently identified in the genome of Lactobacillus salivarius UCC118 (41). Using Reg 4 F/R primers revealed that the strains in our collection that harbored region 4 all exhibited the structure described for the IOEB Sarco 1491 strain. The results of the distribution analysis of selected sequences internal to nine ORFs, showing that positive isolates varied from 14 to 36% (Table 2), also indicated polymorphism within the region.
![]() View larger version (20K): [in a new window] |
FIG. 4. Architecture of region 4 in O. oeni (IOEB Sarco 1491 and PSU-1) and L. plantarum. In the IOEB Sarco 1491 strain, the 7-kb region contains 11 genes flanked by two IS30 sequences. A statistically significant positive association was found between HP strains and the presence of sequences indicated by hatched arrows. Primers Reg 4 F and Reg 4 R are represented by black arrows.
|
5 kb and 4 kb, respectively. These loci included an acetyltransferase, a glucosidase, and a maltose phosphorylase, as well as genes involved in sugar transport (Table 4). Matching proteins were not found in O. oeni but were found in L. plantarum/Streptococcus pyogenes (region 5) and S. agalactiae (region 6), with identity levels from 47 to 76%. BLASTN searches revealed no hits in the databanks. An examination of the local and relative GC content of the O. oeni genome did not reveal any significant variations or hallmarks of horizontal transfer (using Island Finder). Significantly, distribution analysis of five sequences representing both regions in our collection revealed low scores, also indicating recent horizontal transfer (Table 4). This was striking for region 6, as only 5% of the 79 isolates screened shared the sequences corresponding to the putative PTS-fructose-specific IIC and IIA.
Plasticity regions and association with bacterial fitness.
To investigate the involvement of the plasticity regions identified in the increased fitness of O. oeni, the distribution of selected sequences among groups previously characterized for their enological potential was analyzed, as well as gene expression under enological stresses. The distribution of 28 selected sequences representing the six plasticity regions described above, as well as randomly selected singletons, was assessed using PCR (see Table 4). Their prevalence was determined in a strain collection including 19 LP, 44 IP, and 14 HP isolates (the 10 wild strains described above and the four commercial malolactic starters B1, PSU-1, VO, and VF). None of the SH-1491 sequences were unequivocally present in HP isolates. However, the prevalence of eight individual SH-1491 sequences was significantly higher in the HP group than in the rest of the O. oeni collection. The greatest statistically significant differences between the groups concerned the putative dps and copper chaperone sequences, both from region 4. These were present in 43% of HP and 11% of IP and LP isolates and 50% of HP and 17% of IP and LP isolates, respectively. Three additional SH-1491 fragments belonging to the same 7-kb region, related to the hypothetical protein NP_7876185, the P-type ATPase, and Fnr, were also prevalent. The three remaining prevalent sequences were related to the 3-oxo-acyl carrier protein reductase and hypothetical protein YP_809753, both from region 2, as well as a maltose phosphorylase from region 5.
Assuming that the eight genes were novel factors involved in adaptation to wine, we investigated whether any of the genes were actively expressed under normal (laboratory) culture conditions as well as during ethanol and acid shocks, two major stress conditions in winemaking identified during our selection of HP strains (see Materials and Methods). Heat shock was also tested, as it is also a frequent stress factor during winemaking. RNA was extracted from stressed and control cells during the exponential growth phase in MRS broth and the relative expression of the eight genes was determined by reverse transcription followed by quantitative PCR (Fig. 5). Six HP strains were tested, including five that were positive for the eight sequences of interest (IOEB Sarco 1491, 438, 433a, 384, and B1), while the commercial PSU-1 strain was positive for only six potential-associated fragments. Transcripts were obtained for all genes tested, in both stressed and control cells. Remarkably, each performance-associated fragment corresponded to a stress-responsive gene and was induced or repressed by a factor greater than two following at least one of the wine stresses tested. Variations in expression were observed following one (dpsB, copper chaperone, HP NP_786185, and oxo-acyl carrier protein reductase), two (HP YP_809753 and fnr), or all three stresses (maltose phosphorylase and P-ATPase) applied to the O. oeni strains.
![]() View larger version (23K): [in a new window] |
FIG. 5. Expression of the eight enological potential-associated fragments during acid (unfilled bars), ethanol (filled bars), and heat (hatched bars) shocks. 1, Dps; 2, copper chaperone; 3, HP NP_786185; 4, HP YP_809753; 5, maltose phosphorylase; 6, CadA; 7, 3-oxo-acyl carrier protein reductase; 8, Fnr.
|
|
|
|---|
From the analysis of the subtracted sequences, some hypotheses concerning the mechanisms generating plasticity in O. oeni could be proposed. The findings of numerous variable sequences support the role of recombination in creating diversity in O. oeni (8). SH identified major variations in singlets, and some also mapped to discrete areas. This is true for region 3, a small polymorphic locus related to polysaccharide biosynthesis. We believe that this locus is part of a larger region of plasticity, since it is immediately upstream from the recently described dpsA variable region (2).
Another important point highlighted by our study was the occurrence of indel events in O. oeni. Region 2 (30 ORFs) was present in strains IOEB Sarco 1491 and PSU-1 but absent in strain ATCC BAA-1163. No hallmarks of transfer were found, suggesting a possible loss of the region.
High abundance and diversity of sequences of foreign origin were found. A substantial proportion could be attributed to phage-related sequences (26% of the subtracted sequences). Some of the matching proteins were related to phages known to be active with O. oeni (
fog44,
fog30,
fogPSU-1,
L10, and
10MC), suggesting the lysogenization of the IOEB Sarco 1491 strain. However, matches with phages infecting other LAB, as well as more distantly related gram-positive bacteria, were also identified in this study. Similar findings were recently reported in O. oeni BIFI-83 (25). Phages evolve mainly by exchanging modules, and it is now widely accepted that these exchanges are not restricted to phages infecting the same bacterial species (5). The significant contribution of phage-related sequences to the dispensable gene reservoir of O. oeni is consistent with numerous reports of a high incidence of lysogeny in the species (1, 34). However, these findings contrast with the situation recently described in the genomes of the ATCC BAA-1163 and PSU-1 strains, which do not contain any intact temperate bacteriophages or large tracts (>10 kb) of obvious bacteriophage origin (27, 37, 38). The apparent discrepancy with the genome data may be related to the fact that the IOEB Sarco 1491 strain was only recently isolated from wine (2000) and moderately propagated in the laboratory. In contrast, two long-established laboratory strains have been chosen for sequencing projects, and this is also true of the IOEB 8413 driver strain. Many genetic changes occur during multiple laboratory subcultures, including the loss of mobile elements. These events have probably resulted in an underestimation of phage sequences in the previously available complete genome data of O. oeni. An assessment of the stability of phage-related sequences during the prolonged cultivation of bacterial strains on synthetic media, in the absence of any phage pressure, is now required.
Our results confirm previous findings, indicating the presence of strong, widespread pressure due to bacteriophage predation in the environment of O. oeni. This phenomenon has major repercussions on virus evolution. Thus, virulent bacteriophages have the potential to activate dormant prophages, leading to rapid evolution. The evolution of temperate bacteriophages via host recombination results in chimeric phages with novel phenotypes. Phages also exert considerable influence on the genomes of their bacterial hosts through various mechanisms, e.g., prophages also play a prominent role in shaping bacterial genome architecture. An interesting mechanism, by which Campylobacter jejuni bacteria were infected by virulent bacteriophage CP34, led to major genomic rearrangements in the surviving bacteria (40), leading to the hypothesis that these bacteria exist in vivo as families of related metagenomes generated to survive local environmental pressures.
Among the putative sequences of foreign origin, we identified 21 genes that were physically clustered in four loci (regions 1, 4, 5, and 6). Region 1 corresponded to a remnant prophage. Regions 2, 4, and 5 were particular as these sequences had best BLASTX matches outside the species, while gene order was conserved between the orthologous regions. In addition, sequences displayed a restricted distribution among our strain panel. These data may denote their acquisition through lateral gene transfer. Accordingly, region 4 had additional hallmarks of transfer (atypical patterns of codon usage and presence of flanking IS elements). An increase of HGT is one of the consequences of a defect in the MMR system in bacteria.
We identified six different regions of plasticity in this study, resulting from recombination or indel events. We next investigated their involvement in the increased fitness of O. oeni in wine. The bacterial species is not amenable to genetic transformations enabling gene inactivation. Therefore, we analyzed the distribution of selected sequences from each region among groups previously characterized for their enological potential (high, intermediate, or low). We also analyzed gene expression under enological stresses. There was a statistically significant positive association between HP strains and the presence of eight genes. The eight genes were also stress-responsive in wine in HP strains. Although these data do not represent formal proof, they suggest that the presence of these particular sequences contribute to the fitness of O. oeni in wine. All eight sequences belonged to regions 2, 4, and 5, expected to result from indel events. A sequence related to an oxo-acyl carrier protein reductase was part of region 2, a locus containing 30 ORFs that is proposed to be deleted in some O. oeni strains. Further work is now needed to understand how this sequence could benefit O. oeni in wine. Interestingly, this locus also contained a sequence related to an arabinose permease. Arabinose is one of the sugars detected in wines at the end of AF, when glucose and fructose have been degraded by yeasts. It is well-known that some O. oeni isolates do not utilize arabinose (A. Lonvaud-Funel, unpublished data). The correlation of this phenotype with the absence of region 2 is currently being investigated in the laboratory, even though no prevalence of this sequence among HP strains was observed in the current study. Other potential-associated sequences were found in regions 4 and 5, proposed to be acquired by HGT. Hence, five out the nine tested genes from region 4 were associated to fitness and were stress responsive in wine. Although their functionality and role remain to be assessed, it is noteworthy that one of these potential-associated genes was a Dps gene homolog. A first member of this family was recently identified on a genomic island in O. oeni and shown to protect E. coli from the deleterious effects of wine, copper, and ferric ions (2). Another remarkable potential-associated sequence is related to cadA, which encodes a putative cadmium efflux system. Its significant induction under acid stress was consistent with data from Penaud et al. (33), who demonstrated this trait for various P-type ATPases detected in the genome of Lactobacillus bulgaricus. Cadmium is a major environmental hazard and contributes indirectly to oxidative stress by affecting the cellular thiol redox balance of bacteria. Cadmium is not commonly found on grapes or in wine and originates mostly from environmental and technological contamination. However, the prevalence of this stressor in the environment of O. oeni is all the more questionable, as cadA was found close to a thioredoxin gene, trxB. Trx has been observed to play a protective role against Cd2+ in yeasts, while Trx from E. coli has recently been demonstrated to be a potent Cd2+ chelator upon acute Cd2+ exposure (36).
A prophage was recently shown to modify S. thermophilus fitness through lysogenic conversion (5). This was not the case for region 1, as the genes encoding the cell surface proteins present in this putative remnant prophage were not prevalent among HP strains. Similarly, we observed no potential-associated sequences in region 6, related to a putative PTS-fructose system in O. oeni. This region needs further study. Hence, fructose in combination with arginine was previously shown to trigger the expression of a subset of stress-responsive genes, such as arcR, omrR, and ftsH, while the cultivation of O. oeni in a fructose- and arginine-supplemented medium prior to wine exposure protected the bacteria from subsequent wine shock (4).
The acquisition of novel genes is proposed to be a strategy used by O. oeni to adapt to its environment. Our study raises additional questions about the mechanisms of transfer and the nature of the donors. An examination of region 4 provides some interesting clues. The locus is flanked by IS30 elements, suggesting a transfer through transposition. Members of this IS family have been widely transferred among LAB, where they are linked to genes involved in cold-shock adaptation, bacteriocin production, sugar utilization, and antibiotic resistance. It has, therefore, been suggested that IS30-related elements play a role in LAB genome plasticity and environmental adaptation (30, 32). While it is likely that this 7-kb locus is acquired, it is difficult to assess the relevancy of proposed donors, as the genes detected for potential horizontal transfer have generally undergone amelioration. Accordingly, the different structures observed in the PSU-1 and IOEB Sarco 1491 strains may be indicators of this type of rapid amelioration in the O. oeni species. As BBH were found for some sequences in Lactobacillus plantarum, frequently isolated from wine environments, this LAB is a candidate donor. Both species share physical as well as phylogenetic proximity, considered to influence lateral transfers. The locus is not flanked by IS elements in L. plantarum WFSC1. On the other hand, Molenaar and coworkers (28) did not identify this locus as a region of flexibility during exploration of L. plantarum genome diversity. However, the microarrays used only tested 81% of the total number of bases in the WCFS1 isolate. One alternative hypothesis is that O. oeni and L. plantarum are both recent recipients from an unknown donor. We observed that the BBH of 19 other SH sequences (singlets) were also associated with LAB present in the environment of O. oeni (lactobacilli and L. mesenteroides). The availability of complete genomic sequences will soon provide the opportunity to measure and compare the amount of laterally transferred sequences between O. oeni and the other bacteria found in the complex ecosystems present in must and wines.
This work provided some evidence for an association between the presence of particular sequences and the enological potential. However, it should be noted that O. oeni is present on the surface of grape berries. We cannot rule out that some of the subtracted sequences in this study may be relevant for the survival of O. oeni in a plant-associated environment. For example, divergence in the sequences harbored in region 3 may modulate the synthesis and/or structure of extracellular compounds, resulting in variations in surface interactions and, possibly, the ability to persist in the niche.
A.D. received a grant from the French Ministry of Education.
nologie, LMBA, UMR INRA 1219-Institut Supérieur de la Vigne et du Vin (ISVV) (Université Victor Segalen Bordeaux 2-Université Bordeaux 1), 210 Chemin de Leysotte, 33882 Villenave d'Ornon Cedex, France. Phone: 33 5 57 57 58 31. Fax: 33 5 57 57 58 13. E-mail: claire.le-henaff{at}u-bordeaux1.fr
Published ahead of print on 13 February 2009. ![]()
Supplemental material for this article may be found at http://aem.asm.org/. ![]()
E.B. and A.D. contributed equally to this work. ![]()
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»