,
Bernadet Renckens,1,3
Douwe Molenaar,1,2 and
Johan E. T. van Hylckama Vlieg1,2
NIZO food research, Kluyver Centre for Genomics of Industrial Fermentation, P.O. Box 20, 6710 BA Ede, The Netherlands,1 TI Food and Nutrition, P.O. Box 557, 6700 AN Wageningen, The Netherlands,2 Centre for Molecular and Biomolecular Informatics (CMBI 260), Nijmegen Centre for Molecular Life Sciences, Radboud University Medical Centre, P.O. Box 9101, 6500 HB Nijmegen, The Netherlands3
Received 9 August 2007/ Accepted 12 November 2007
| ABSTRACT |
|---|
|
|
|---|
-galactosides, β-glucosides, arabinose, xylose, galacturonate, glucuronate, and gluconate. Further niche-specific differences are found in genes for defense (nisin biosynthesis), stress response (nonribosomal peptide synthesis and various transporters), and exopolysaccharide biosynthesis, as well as the expected differences in various mobile elements such as prophages, plasmids, restriction-modification systems, and insertion sequence elements. Many of these genes were identified for the first time in Lactococcus lactis. In most cases good correspondence was found with the phenotypic characteristics of these two strains. | INTRODUCTION |
|---|
|
|
|---|
(Fermenting) plant material is a second important ecosystem occupied by L. lactis, where it typically occurs as an early colonizer that is later replaced by species that are more tolerant of low pH values (30, 31). Most plant-associated strains belong to Lactococcus lactis subsp. lactis, whereas Lactococcus lactis subsp. cremoris is typically found in dairy fermentations (30, 31). Fermenting plant material comprises a broad array of highly variable niches with respect to chemical composition, as for instance the availability of carbohydrates other than lactose as growth substrates. Moreover, protein concentrations are typically much lower than those observed in the dairy environment. As a result, strains isolated from fermenting plant material do not harvest amino acids through proteolysis but depend on amino acid biosynthesis and consequently exhibit fewer amino acid auxotrophies than do dairy isolates (2). Therefore, it can be anticipated that strains adapted to the plant ecological niche will exhibit large metabolic differences and their metabolic diversity will most certainly exceed that of dairy strains. Recently it has been shown that strains isolated from a nondairy environment exhibit flavor-forming activities that may be beneficial to dairy fermentation, as exemplified by the production of the key flavor fusel aldehydes as a result of a unique
-keto acid decarboxylase activity (58, 59). Moreover, it was shown that some nondairy L. lactis strains produce the enzyme glutamate dehydrogenase, which converts glutamate to
-ketoglutarate (62). This compound,
-ketoglutarate, is the acceptor of the amino group in aminotransferase reactions, the first step in the production of flavor compounds from amino acid, and present at rate-limiting concentrations in cheese (62).
Several studies have addressed the biodiversity of L. lactis using a variety of molecular approaches as well as extended phenotyping, and this has revealed an unusual population structure (30, 46, 50, 68). A recent extensive genotypic and phenotypic diversity analysis of a large strain collection of dairy and nondairy origins confirmed the existence of two major genomic lineages (50). Most nondairy isolates belong to the lineage containing strains of L. lactis subsp. lactis, and it was shown that nondairy isolates represent molecular diversity not found within the dairy strains. Therefore, our present view on genomic and metabolic diversity of L. lactis may be limited and is likely to be highly biased toward dairy isolates, as only the genomes of strains L. lactis subsp. lactis IL-1403 (7), L. lactis subsp. cremoris SK11 (39), and L. lactis subsp. cremoris MG1363 (71) of dairy origin have been sequenced. In the current study we report the low-coverage (threefold) diagnostic sequencing of two L. lactis subsp. lactis strains isolated from (fermenting) plant material. Unique genes and gene clusters found within these isolates are discussed, and predicted phenotypes are validated experimentally. The results provide a first view of the gene pool present within the species and yield insight into the molecular basis of adaptation to their propagation in fermenting plant material.
(A preliminary report of this work appeared in reference 68.)
| MATERIALS AND METHODS |
|---|
|
|
|---|
Fermentation tests.
The ability to ferment citrate was measured with whey permeate calcium citrate Castione agar as described previously (22) but with milk replaced by whey permeate. The fermentation substrate range with respect to mono/oligosaccharide range was determined with the API 50 CHL assay (bioMérieux, Marcy l'Etoile, France) as described in reference 50. In the current paper, these data were used for experimental validation of the phenotypes predicted from comparative genome analysis. The ability to ferment other carbohydrates and complex polysaccharides was tested by growing the strains at 30°C in 96-well microplates filled with 250 µl GM17 medium per well. Subsequently, these cultures were used to inoculate 250 µl M17 containing 0.5% or 1% of a carbohydrate carbon source. All incubations were carried out at 30°C and in quadruplicate. Growth was analyzed by determining the turbidity at 600 nm and scored as negative (–), weak (+/–), or positive (+), compared with GM17.
Nisin assay.
Nisin concentrations in supernatants of overnight culture were determined by a plate diffusion method (65) using Micrococcus flavus DSM1790 as the assay organism. Standards were prepared from nisin (Sigma; 2.5% pure nisin).
DNA isolation and sequencing.
Total DNA of the isolates KF147 and KF282 was isolated as described previously (55). The total DNA was sequenced using a minimal shotgun sequencing approach. Four different genomic libraries were constructed: three plasmid libraries with average insert sizes of 0.8, 1.8, and 5 kb, respectively, and a fosmid library with inserts of 30 to 40 kb. Over 8 Mb was sequenced from these libraries, corresponding to approximately threefold coverage, assuming a genome size of 2.75 Mb (GATC Biotech, Konstanz, Germany). Contigs were assembled using the Seqman Genome Software of DNAstar (Madison, WI). Contigs were mapped to the template genome of L. lactis IL-1403 using Projector 2 (http://bioinformatics.biol.rug.nl/websoftware/projector2/) (67). Projector 2 settings were as follows: BLAST analysis; size of chopped fragments, 250 nucleotides (nt); minimum size contig, 1,000 nt; minimum size remaining fragment, 250 nt; cutoff for repeats, 1E-20; maximum length deviation mapped versus original template, 20%. Repeats (e.g., rRNA and insertion sequence [IS] elements) are filtered out before template matching. Statistics of sequencing and template mapping are summarized in Table 1. Interesting novel genes/gene clusters from strain KF147 contigs were resequenced by primer walking and PCR on fosmid clones, to close gaps, improve sequence quality, merge contigs, and correct frameshifts (GATC Biotech AG, Konstanz, Germany).
|
Nucleotide sequence accession numbers.
The complete nucleotide sequences of selected novel regions of L. lactis KF147 (about 250 kb; summarized in Table S1 in the supplemental material) have been submitted to the EMBL/GenBank/DDBJ databases and are available under accession numbers EU255902 to EU255918. Detailed annotation of the contigs is available in Table S2 in the supplemental material.
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
|
Sequencing and comparative analysis of L. lactis genomes.
A low-coverage, diagnostic sequencing of the genomes of L. lactis strains KF147 and KF282 was performed (Table 1). The assembled contigs suggest that the sizes and G+C contents of the two genomes are similar (
2.5 to 2.6 Mb and 34.9%, respectively) and comparable to those of L. lactis strains IL-1403 (2.365 Mb and 35.4%, respectively) (7), SK11 (2.438 Mb and 35.8%, respectively) (39), and MG1363 (2.529 Mb and 35.8%, respectively) (71). The initial BLAST analysis showed that the large majority of contigs of both L. lactis plant isolates were highly similar to one another (>95% nucleotide sequence identity) and to the IL-1403 sequence (>95% identity) and that the genes showed conserved gene order. Therefore, contigs larger than 1,000 nt were first mapped onto the IL-1403 template genome using Projector 2.0 (for examples, see Fig. S1A and S1B for KF147 and KF282, respectively, in the supplemental material). This mapping provided a fast first estimate of genome coverage, contig order, gap sizes, and possible positions of large insertions and deletions relative to IL-1403.
Closer inspection showed that many unmatched contigs actually contained additional large segments that were highly matched (>90% identity) to the IL-1403 genome. These matches were not found by Projector because the contigs either (i) contained repeats (these are filtered out before mapping), (ii) contained a large deletion but were otherwise >90% identical to IL-1403 (exceeding maximum allowed length deviation in Projector), or (iii) contained a large insertion relative to IL-1403. These partially matching contigs provided further clues for positions of insertions and deletions. Together, at least 1.8 Mb (>75%) of the genome of each plant isolate was found to match the IL-1403 genome, and the majority of these matches were found in both plant isolates.
To characterize genotypic differences between plant and dairy strains in more detail, we subsequently focused on large regions/clusters that were absent on the contigs from strain KF147 or KF282 compared to strain IL-1403 and on functions encoded on contigs that did not map to IL-1403 (Table 3). Since these regions were highly similar in strains KF147 and KF282, we performed high-quality resequencing only of the relevant regions of the KF147 genome.
|
(i) Citrate/malate utilization.
Both plant L. lactis isolates (and strain SK11) lack a segment of about 21 kb corresponding to genes mae to ymcE of strain IL-1403 encompassing the complete citrate/malate utilization gene cluster mae-maeP- citRCDEFXG, followed by genes encoding many hypothetical proteins and transposases. This suggests that strains KF147, KF282, and SK11 cannot cometabolize citrate/malate, a trait that is typical only of dairy Lactococcus lactis subsp. diacetylactis strains. Indeed, these strains were unable to ferment citrate (Table 2).
(ii) TA biosynthesis.
Extracellular polysaccharides (EPS) and (lipo)teichoic acids (TA) of bacteria could play a role in attachment to host surfaces and interactions with other bacteria in biofilms. Both plant L. lactis isolates (and strain SK11) lack a segment of about 20 kb corresponding to genes yjdE to tagF of IL-1403, encompassing a large gene cluster for TA biosynthesis (tagRLHGZYXDF). However, different TA biosynthesis clusters appear to be inserted in the equivalent positions in the genomes of the L. lactis plant isolates (see below).
(iii) Transposable elements.
Most prophages of strain IL-1403 (i.e., pi1, pi2, pi3, ps1, ps2, and ps3) are found to be missing in the genomes of both L. lactis KF147 and L. lactis KF282. In some cases other prophages may be inserted in equivalent positions (see below). Numerous IS elements of strain IL-1403 were not found in the plant L. lactis isolates, as flanking genes separated by these IS elements in IL-1403 were often directly adjacent in strains KF147 and KF282. Also absent in these plant-derived strains is a fragment of about 17 kb, encoding an integrase (XerD), a type I restriction-modification system (HsdRMS), transposases, and many hypotheticals, located in strain IL-1403 between the eno and ygfA genes (which are adjacent in strain KF147).
Novel genes/gene clusters in plant L. lactis strains: complex sugar metabolism.
Plant polysaccharides can be divided into structural carbohydrates (cellulose, xylan, pectin, and arabinan) and storage carbohydrates (sucrose, starch, and fructan). A variety of genes and gene clusters were identified in the genomes of the L. lactis plant isolates that encode proteins assumed to be involved in utilization of plant polysaccharides for growth. The main examples are described below and summarized in Table 3. In addition, a comparison was made of predicted carbohydrate-active enzymes (glycoside hydrolases and esterases) in the genomes of L. lactis strains KF147, KF282, IL-1403, and SK11 based on the CAZy database families (11), as summarized in Table 4.
|
-glucuronidase and
-L-arabinofuranosidase to yield xylose oligomers, D-glucuronic acid, and arabinose, while the xylo-oligomers are hydrolyzed to xylose by β-xylosidase. Xylose is converted into xylulose-5-phosphate, which can enter the pentose phosphate cycle.
Both plant L. lactis strains have (i) a cluster of six genes encoding endoxylanase, polysaccharide deacetylase,
-glucuronidase (aguA), and a sugar ABC transporter (Fig. 1A); (ii) a separate gene for acetyl-xylan esterase (Table 4); and (iii) a cluster of three genes encoding a putative ABC transporter for (substituted) xylo-oligosaccharides (see Tables S1 and S2 in the supplemental material). The last transporter is very similar to a Bacillus clausii gene cluster that is directly downstream of xylan utilization genes, encoding xylan
-1,2-glucuronidase and β-xylosidase (Y. Takaki et al., unpublished data; accession code NC_006582). Both plant L. lactis strains also have uptake and utilization gene clusters for the breakdown products xylose and D-glucuronate (see below).
|
(1>4) bonds. Amylose is linear with few branches, while amylopectin has many branches. All four L. lactis strains contain the same large cluster (mapA-agl-amyY-maa-dexA-dexC-malEFG) for starch breakdown (by
-glucosidases and
-amylases) to dextrin and maltose and subsequent uptake (maltose transporter) and conversion. Only SK11 does not grow on starch, but it is not clear why, since there is no apparent gene inactivation.
(iii) Glucan breakdown (e.g., cellulose and xyloglucan).
Strain KF147 has a unique cluster of four genes encoding putative a UDP-N-acetyl-glucosamine-2-epimerase, a hypothetical membrane protein, an N-acetyl-glucosaminyl-transferase, and an endo-1,4-beta-glucanase/endocellulase of glycoside hydrolase family 8 (Fig. 1B). The best homologs (up to 66% identity) are encoded in similar conserved clusters in Streptococcus mutans and in Lactobacillus acidophilus group genomes. The putative function of this cluster could be breakdown of complex glucans by removal of N-Ac-glucosamine side chains and cleavage of the main glucan backbone.
(iv) Fructan/mannan breakdown.
The phenotypic assays show that some fructans can be degraded by one or both plant L. lactis strains (Table 2). Fructans are the dominant carbohydrate reserve in many plants. Fructans such as inulin, levan, and graminan are polymers of fructose. Some plants cell walls contain galactomannans and glucomannans (including mung bean [18]), which are polymers of primarily mannose (
-D-mannosides). Degradation of mannans to D-mannose can be followed by conversion to D-fructose by mannose isomerase (EC 5.3.1.7; no known sequences). Both KF147 and KF282 strains have a gene cluster encoding either a mannose or a fructose phosphotransferase system (PTS; three genes), a transcriptional antiterminator (BglG family), and a putative
-mannosidase/
-fructosidase of glycosyl hydrolase family 38 (Fig. 1C). This cluster is not present in other L. lactis genomes, but it is highly similar (40 to 60% amino acid identity) to gene clusters in Clostridium difficile and Listeria genomes. These proteins could contribute to degradation of fructans and/or mannans and uptake of monosaccharides for growth.
(v) Side chain removal.
Strain KF282 and strain SK11 encode a putative sialic acid-specific 9-O-acetylesterase (EC 3.1.1.53), which is not present in the genome of strain KF147 or IL-1403. The putative function of this enzyme is to cleave acetyl groups off sialic acid (=N-Ac-neuraminic acid) side chains of glycans. The best homologs are found in Bacteroides, Bifidobacterium, Caldicellulosiruptor, Pseudoalteromonas, and Flavobacterium species; in other genomes, including L. lactis SK11, this gene is typically located in a gene cluster encoding glucan/xylan-degrading enzymes.
Novel genes/gene clusters in plant L. lactis strains: simple sugar metabolism. (i) Arabinose metabolism.
Arabinose is a major sugar moiety in various hemicellulosic and pectic plant polysaccharides, such as arabinan (homoglycan), arabinoglycan, and arabinoxylan. Both plant L. lactis strains, KF147 and KF282, grow on L-arabinose, in contrast to L. lactis strains IL-1403 and SK11 (Table 2). A novel arabinose transport and metabolism gene cluster, araRPFTBDA, was found in both plant strains but not in IL-1403 and SK11 (Fig. 2A). These genes encode proteins for uptake of arabinose (and possibly oligomers) and conversion to D-xylulose-5P, which can enter the pentose phosphate pathway. The cluster also encodes a glycosyl hydrolase family 43 protein; this family contains members that are arabinanases or xylanases (http://www.cazy.org). Arabinanases hydrolyze the
-1,5-linked L-arabinofuranoside backbone of plant cell wall arabinans. Compared to strain IL-1403, this gene cluster is found to be inserted between the xylT and ptk genes (Fig. 2A), which are adjacent in IL-1403. Arabinose operons have not been described before in L. lactis, but they do occur in lactobacilli and bacilli (54). L. lactis strain KF147 also encodes an extra glycosyl hydrolase family 43 protein (Table 4). The best homologs (30 to 40% sequence identity) of this KF147 glycosyl hydrolase are found in Deinococcus, Myxococcus, and Bacteroides species, in which they are generally encoded together with other enzymes for xylan or arabinan degradation.
|
(iii)
-Galactoside metabolism.
-Galactosides such as stachyose, raffinose (=melitose), and melibiose are typical plant oligosaccharides.
-Galactosidase (encoded by gene aga) is required to cleave off terminal D-galactose moieties. Only strain KF147 was found to grow on
-galactosides (raffinose and melibiose were tested), in contrast to the other L. lactis strains (Table 2). In agreement with this phenotype, only strain KF147 was found to have a gene cluster for
-galactoside breakdown and subsequent D-galactose conversion by the Leloir pathway (Fig. 2B). This
-galactoside gene cluster, fbp-galR-aga-galK-galT, is highly similar (90 to 94% nucleotide identity) to that of Lactococcus raffinolactis ATCC 43920 (GenBank accession code AY164273.1) (8). The fbp, galK, and galT genes each have a paralog in strain KF147, each of which is also present in other L. lactis strains.
(iv) β-Glucoside metabolism.
β-Glucosides like cellulose, cellobiose, β-1,3-1,4-glucans (lichenin), and aryl-β-glucosides (e.g., arbutin, salicin, and amygdalin) are important components of plant cell walls. The L. lactis plant isolates KF147 and KF282 grow well on several of these β-glucosides (Table 2), and both genomes encode more β-glucosidases than strains IL-1403 and SK11 do (Table 4). In particular, an additional cluster of six genes is found encoding two sets of (6-P)-β-glucosidase, β-glucoside PTS, and a transcriptional antiterminator (Fig. 2C), all of them most similar to Listeria and Clostridium genes.
(v) Uronic acid metabolism.
Some strains of Bacillus subtilis can use glucuronate and galacturonate as primary carbon sources (42). All four L. lactis strains have a gene cluster, kdgR-uxuB-uxuA-uxuT-hypAE-uxaC-kdgK-kdgA, for uptake and degradation of D-glucuronate (building block of xylan), yet only the two plant isolates were observed to grow on glucuronate (Table 2).
Strain KF147 is found to have an additional gene cluster, exuR-uxaT-uxaC-uxaB-uxaA-hypAE-kdgK2-kdgA2, for uptake and degradation of D-galacturonate, the building block of pectin (Fig. 2D). Pectins are a family of complex polysaccharides containing 1,4-linked galacturonate moieties, with possible variations in the backbone (rhamnose units) or side chain substitutions (53, 72). Pectins can be degraded by extracellular pectate lyases and pectinases (polygalacturonase and pectin esterase), converting pectin into oligogalacturonate and subsequently into D-galacturonate (17). These pectinolytic enzymes were not found in the genomes of strains KF147 and KF282 or other lactococci. This galacturonate utilization cluster has not been described before in lactic acid bacteria and is most similar to gene clusters from Clostridium acetobutylicum, Flavobacterium johnsoniae, and Bacillus licheniformis; in the genomes of B. licheniformis and F. johnsoniae this cluster is linked to a gene for polygalacturonase (EC 3.2.1.15) for pectate degradation. Only strain KF147 grows well on D-galacturonate, in agreement with the genotype, suggesting that it can grow on pectin breakdown products generated by other bacteria or fungi on the plant surface.
(vi) Gluconate metabolism.
Both plant L. lactis strains, KF147 and KF282, grow on gluconate, in contrast to strains IL-1403 and SK11 (Table 2). The plant strains both have a gene cluster encoding a gluconate transporter (gntP), gluconate kinase (gntK) (EC 2.7.1.12), and 6-phosphogluconate dehydrogenase (EC 1.1.1.44). The same genes are found in the genomes of L. lactis IL-1403 and SK11 but may be nonfunctional, since the gntP gene of strain IL-1403 appears to be interrupted by a frameshift and/or transposase and the SK11 GntP transporter appears to be truncated at its C terminus.
(vii) Sucrose metabolism.
Both plant L. lactis isolates grow on sucrose, in contrast to strains IL-1403 and SK11. Only the KF147 and KF282 strains are found to have the sacBK and sacAR operons encoding a sucrose PTS, fructokinase, sucrose-6-phosphate hydrolase (EC 3.2.1.26), and a sucrose regulator, with >95% identity to the proteins encoded by the nisin-sucrose conjugative transposon of L. lactis NZ9800 (38) (data not shown). In addition, only strain KF147 encodes a putative sucrose phosphorylase (EC 2.4.1.7; GtfA), not found before in L. lactis strains, with the best homologs (65 to 70% identity) in Leuconostoc species, Listeria species, lactobacilli, and streptococci. The presence of this sucrose phosphorylase suggests that there should also be a permease for uptake of sucrose.
Nisin production/resistance.
Both plant strains are much more resistant than strains IL-1403 and SK11 to the lantibiotic peptide nisin; strains KF147 and KF282 survived >2,000 ng/ml nisin, while IL-1403 and SK11 survived at maximally 125 and 25 ng/ml, respectively. In a nisin biosynthesis assay, only strain KF282 showed a high growth inhibition of the indicator strain. The gene cluster nisZABTCIPRKFEG for biosynthesis of nisin Z and immunity to nisin Z (19, 28, 36) is found to be present in both KF147 and KF282, in contrast to strains IL-1403 and SK11. However, strain KF147 clearly has an internal deletion in parts of the nisB and nisC genes, which should lead to inactivation of the encoded nisin biosynthetic enzymes. Hence, the genotype predicts that both plant strains should be resistant to nisin (presence of immunity genes) but that only strain KF282 should be a nisin producer. This genotype agrees completely with the phenotypes found by us and Kelly et al. (31).
Survival/stress response. (i) NRPS.
Several contigs were identified in both L. lactis strain KF147 (total of
40 kb; data not shown) and L. lactis strain KF282 encoding enzymes which are characteristic of nonribosomal peptide or polyketide biosynthesis, such as phospho-pantetheine protein transferase, thioesterase, and multimodular peptide synthetases with adenylation, condensation, and acyl carrier protein domains (20, 70). Such systems are commonly found in environmental bacteria such as Bacillus, Pseudomonas, and Streptomyces spp., where they can play a role in survival, defense, signaling, or adhesion (20). This is the first identification of such a system in Lactococcus lactis. Studies are under way to sequence the complete gene clusters and identify their nonribosomal peptide synthesis (NRPS)/polyketide synthesis products.
A preliminary analysis shows that many predicted open reading frames had the best sequence identity (up to 40%) to Bacillus biosynthesis systems for bacitracin, bacillibactin, or bacillomycin (data not shown). The nonribosomally synthesized peptide of L. lactis could be involved in plant interactions (defense or adhesion) or could be a siderophore for binding of Fe ions (12). The production of low-molecular-weight Fe(III) chelators (siderophores) enables microorganisms to efficiently scavenge iron even in aerobic environments where iron exists primarily as insoluble hydroxides (47). Both plant L. lactis strains also have a unique cluster of five genes (Fig. 3A) encoding a FeuABC-like transporter, most similar (33 to 53% identity) to the B. subtilis uptake system for the catecholate siderophores enterobactin and bacillibactin (41, 44), supporting the hypothesis that the product of the L. lactis NRPS system could be a siderophore. Alternatively, the NRPS product could be a lipopeptide surfactant that promotes biofilm formation, like surfactin produced by Bacillus (34, 73).
|
50% identity) to those in lactobacilli but not found previously in L. lactis. This antiporter could play a role in pH regulation and osmoregulation.
(iii) Polyamine ABC transporter.
Polyamines (putrescine, spermidine, and spermine) are necessary for cell growth (9, 27, 61). They are among the major polycations in cells, together with Ca2+ and Mg2+. Polyamines and Mg2+ can bind to intracellular polyanions such as nucleic acids and ATP to modulate their function. A PotABCD spermidine/putrescine uptake system is present in L. lactis IL-1403 and SK11 and in both plant L. lactis genomes, with 99% identity. A second spermidine/putrescine ABC transporter together with a putrescine/ornithine aminotransferase is found only in both plant L. lactis strains, with >80% sequence identity (Fig. 3C). This system is most similar (44 to 55% identity) to PotABCD of Clostridium beijerincki and various bacilli.
Cell envelope.
The physicochemical environment in the natural habitat of L. lactis plant isolates is very different from that of dairy isolates. Plant-associated bacteria are often found in biofilms, using EPS and large proteins to adhere to plants (14). As a consequence, major adaptations in gene sets for biosynthesis of components involved in the physical interaction with the extracellular space are expected, such as in cell-bound polysaccharides, TA, and proteins.
(i) EPS/TA biosynthesis.
Both plant L. lactis strains have several large contigs (and scaffolds) with putative EPS/TA biosynthesis genes, in particular strain KF147, which has three large clusters totaling over 42 kb (see Tables S1 and S2 in the supplemental material). The genes in these clusters show highest sequence similarity to L. lactis, Streptococcus thermophilus, Streptococcus pneumoniae, and Enterococcus faecalis EPS/TA clusters, but gene order in these clusters is generally very different. The only exception appears to be the epsXABCD gene cluster, which is highly similar (>90% identity at amino acid sequence level) to that of L. lactis plasmid pNZ4000 (69). These epsXABCD genes in L. lactis KF147 are part of a larger EPS biosynthesis gene cluster of
13 kb (Fig. 4) that is flanked by two small integrase gene fragments and appears to be inserted in the chromosome between the ybeF and tgt genes compared to L. lactis IL-1403 (see Table S2 in the supplemental material). A TA biosynthesis gene cluster of
22 kb is present in strain KF147 in the same position as the TA cluster in the IL-1403 genome (between genes yjdF and deoB), but the composition and sequence similarity of these clusters differ considerably (see Fig. S2 and Table S2 in the supplemental material).
|
In contrast, the genomes of both L. lactis plant isolates are found to encode two novel putative extracellular proteins with an LPxTG anchor and a so-called collagen-binding domain, which we call LpxA (822 residues) and LpxB (790 residues; two paralogs found in strain KF282). Recently an LpxA ortholog of 815 residues (83% identity) has been identified in L. lactis MG1363 (71), and part of the lpxA gene has also been found in L. lactis MG1614 (51), but no homolog of LpxB has been identified in other bacteria.
These differences in extracellular proteins may relate to differences in niche interactions, e.g., biofilm formation and binding to plant hosts.
Transposable elements. (i) Phages.
While several phages of L. lactis IL-1403 are not found in the plant L. lactis strains (see above), numerous other putative phage-encoding genes were identified in both plant strains, with the best homologs either in other L. lactis strains or in other microorganisms (data not shown). Particularly striking is the presence in L. lactis KF282 of some new (pro)phages resembling known phages of Streptococcus pyogenes and Lactobacillus johnsonii. A gene cluster equivalent to prophage Lj965 of L. johnsonii NCC533 genes LJ0307 to LJ0324 (21.5 kb) was identified with conserved gene order and 24 to 54% amino acid sequence identity, as well as a gene cluster equivalent to phage SSI-1 of S. pyogenes serotype M3 genes SPs1133 to SPs1144 (8.2 kb) with conserved gene order and 26 to 59% amino acid sequence identity (data not shown).
(ii) Plasmids/transposons.
Both L. lactis plant isolates have putative plasmids or transposons, based on the presence of characteristic functions encoded on several contigs, e.g., replication, partitioning (ParA-ParB), conjugation protein, recombinase, excisionase, transposon protein, transfer protein, transposase, and IS elements (data not shown). The total size of the putative plasmids is estimated to be between 50 and 70 kb in each strain, with the largest continuous assembled fragment of 25 kb found in strain KF147; sodium dodecyl sulfate-polyacrylamide gels show at least one dominant band of >20 kb for strain KF147 (data not shown). Several of these putative plasmid fragments of the two strains are highly similar to each other, suggesting that the two strains have similar plasmids. Other putative plasmid-encoded functions on these contigs are, e.g., arsenate/arsenite resistance (arsR-arsD-arsA in strain KF282), abortive infection (abiN), metal-transporting ATPase, SOS response (umuC), cell division (ftsK/spoIIIE), and cell surface proteins.
Different parts of the nisin conjugative transposon were identified, in both strain KF147 and strain KF282 (see above), but these regions were not yet sequenced in detail to assess the completeness of the transposon.
Evolutionary aspects.
One of the questions that now arises is whether these unique genes in L. lactis plant isolates were acquired by recent horizontal gene transfer or whether they are more ancient, having been lost more recently in dairy isolates as an adaptation to the nutrient-rich milk environment, such as found for the yoghurt bacteria Streptococcus thermophilus (7) and Lactobacillus bulgaricus (66). To address this question, we calculated the G+C content of the unique genes and gene clusters of L. lactis KF147 and compared them to their best BLAST hits, as summarized in Table 5 (more details of individual genes can be found in Table S2 in the supplemental material). Strikingly, nearly all the unique gene clusters have a G+C content close to or slightly lower than the average G+C content of 35 to 36% for L. lactis genomes. In contrast, their best BLAST hits generally have a higher G+C content, with the exception of the clostridial best hits, which have a lower average G+C content, in all cases closer to the average G+C content for their own genomes (Table 5 footnote). This strongly suggests that most of these "unique" gene clusters of plant isolates are actually more ancient and were lost in many dairy isolates. This hypothesis is supported both by the fact that most of the best BLAST hits are from phylogenetically related Lactobacillales (see, for instance, Fig. 5 of reference 40), indicative of presence in a common ancestor, and by the fact that some gene clusters are still found in L. lactis subsp. cremoris strain SK11 or MG1363, but not in L. lactis subsp. lactis IL-1403 (Table 5).
|
-galactoside metabolism (contig z563tr), which is highly similar to that in Lactococcus raffinolactis (8), but in both bacteria the G+C content is much higher than expected for Lactococcus strains.
Conclusions and outlook.
The L. lactis strains KF147 and KF282 isolated from mung bean sprouts and mustard and cress, respectively, are found to have many adaptations to the plant environment, particularly for growth on plant carbohydrates. Mung bean (Phaseolus vulgaris) cell walls consist mainly of arabinose (most dominant), uronic acids, galactose, xylose, mannose, and glucose (25, 56). White mustard (Sinapsis alba) cell walls (mucilage) consist mainly of glucose (most dominant), galactose, mannose, rhamnose, arabinose, galacturonic acid, and xylose (13, 26, 52); their polysaccharides are mainly 1,4-linked β-D-glucan (branched cellulose), complex pectin, and xyloglucan. The adaptation to growth on substrates derived from these plant cell walls is evident from the presence of gene sets for the degradation of complex plant polymers such as xylan, arabinan, glucans, and fructans but also for the uptake and conversion of typical plant cell wall degradation products such as
-galactosides, β-glucosides, arabinose, xylose, galacturonate, glucuronate, and gluconate. Lactococci growing on plants generally live in synergy with other microbes in biofilms (14), including various bacteria and fungi, which could have similar and complementary enzymes (e.g., pectinases), allowing lactococci to grow on the plant cell wall breakdown products generated by other microbes.
Other plant niche adaptations include genes for defense (such as nisin biosynthesis and immunity) and stress response. The latter involves several extra putative transport systems for uptake of iron (possibly involving a siderophore), potassium, and polyamines. In most cases these genotypes agree well with the observed phenotypes (Table 3). Many of these genes and gene clusters have been identified for the first time in L. lactis. For instance, in addition to the new genes for plant sugar metabolism, the nonribosomal peptide biosynthesis gene cluster is new for lactococci, and the only other lactic acid bacterium with a known but unrelated NRPS cluster is Lactobacillus plantarum WCFS1 (32).
Our approach of low-cost diagnostic sequencing provides a first quick view of the gene pool present within lactococci from plant environments and yields insight into their molecular basis of adaptation. It also indicates that, to achieve insight into the pangenome of Lactococcus lactis, or any other microbe for that matter, it is essential to isolate and sequence strains from a wide variety of environments to allow for inclusion of all possible adaptation mechanisms. Based on this newly identified repertoire of genes in L. lactis, we have constructed a first-generation L. lactis pangenome microarray, using ultra-high-density DNA arrays (Nimblegen technology) for the assessment of genomic diversity within a large collection of Lactococcus lactis strains from dairy and nondairy origins, with the aim of correlating genomic makeup with phenotypic traits and better defining evolution of the Lactococcus lactis branch (G. Felis et al., unpublished results).
| ACKNOWLEDGMENTS |
|---|
This research was partly funded by the Kluyver Centre for Genomics of Industrial Fermentation, a Centre of Excellence of The Netherlands Genomics Initiative.
| FOOTNOTES |
|---|
Published ahead of print on 26 November 2007. ![]()
Supplemental material for this article may be found at http://aem.asm.org/. ![]()
Present address: Department of Biology, Utrecht University, Utrecht, The Netherlands. ![]()
| REFERENCES |
|---|
|
|
|---|
-keto acid decarboxylase involved in flavor formation. Appl. Environ. Microbiol. 71:303-311.