Previous Article | Next Article ![]()
Applied and Environmental Microbiology, May 2008, p. 3058-3068, Vol. 74, No. 10
0099-2240/08/$08.00+0 doi:10.1128/AEM.02863-07
Copyright © 2008, American Society for Microbiology. All Rights Reserved.

Institute for Pharmaceutical Biotechnology, Universität des Saarlandes, Saarbrücken, Germany,1 Bruker Daltonik GmbH, Bremen, Germany,2 Department of Zoology, University of Oxford, Oxford, United Kingdom,3 Max Planck Institute for Developmental Biology, Tübingen, Germany,4 Department of Biology, Indiana University, Bloomington, Indiana5
Received 19 December 2007/ Accepted 20 March 2008
|
|
|---|
|
|
|---|
Myxococcus xanthus has been thoroughly investigated due to its amazing social behaviors, which include coordinated swarming over surfaces and a complex life cycle that culminates in the formation of multicellular fruiting bodies under starvation conditions (23). However, little attention has been paid to secondary metabolite production in this model species. M. xanthus strain DK1622 is the genotype most commonly studied with respect to social interactions and development. This strain has been found to produce several secondary metabolites, including the myxochromids (27), myxalamids (13), the antibiotic myxovirescin (8), and the siderophore myxochelin (15) (Fig. 1A). The molecular basis of synthesis for these compounds has been studied in some detail (25, 26, 30) and has been greatly facilitated by the availability of the DK1622 genome sequence (10). Very recently, a novel secondary metabolite class unique to M. xanthus, the DKxanthenes, was characterized and was shown to be essential for viable-spore formation (18). Additional reports of secondary metabolites from M. xanthus are limited to cittilin (20) and the antibiotics saframycin (12) and althiomycin (16).
![]() View larger version (17K): [in a new window] |
FIG. 1. (A) Structures of the secondary metabolites reported to derive from Myxococcus xanthus strain DK1622 to date. 1, myxochromid A2; 2, myxalamid A; 3, myxovirescin A; 4, myxochelin A; 5, DKxanthene-534. (B) Global distribution of sampling sites from which the 98 M. xanthus strains under investigation in this study originated. The predominance of subtropical and tropical zones with moderate-to-warm temperatures reflects the worldwide abundance pattern of many myxobacterial species (5). Multiple isolates from regional or local spatial ranges were summarized as single spots.
|
To search for additional secondary-metabolite diversity in M. xanthus, we screened 98 M. xanthus strains isolated from locations worldwide (Fig. 1B). We sought to determine whether the metabolic inventory of this species is largely present in DK1622 or whether there is significant potential for novel compound discoveries in other strains.
Multiple approaches have been used to screen microbes for novel natural products. While activity-guided methods have the potential to directly deliver candidate compounds for a specific biological activity, they are inherently biased due to the choice of targets in the activity assays. Chemical screening can rapidly reveal the presence of diverse compound classes in a set of samples but is nonetheless constrained by immense differences in the chromatographic and spectroscopic properties of natural products. For example, the still widely used method of UV/visible light detection by diode array detector-coupled liquid chromatography disproportionately reveals compounds with the most eye-catching absorption characteristics.
Liquid chromatography-coupled mass spectrometry (LC-MS) is now considered the most informative technique for analysis of bacterial secondary metabolite profiles and promises to detect the largest possible range of actual chemical diversity in a given sample. We therefore implemented a combination of fast chromatographic separation (UPLC), electrospray ionization (ESI), and high-resolution time of flight mass spectrometry (TOF-MS) as our primary screening platform (14). In addition to in-depth manual data examination, we use principal component analysis (PCA) in characterizing secondary metabolite profiles for the first time. Intriguingly, our analysis reveals that from a set of 98 M. xanthus isolates, candidate metabolites for 37 novel natural products were identified. Statistical analysis suggests that these 37 compounds are likely to represent
80% of the total natural secondary metabolome diversity in M. xanthus.
|
|
|---|
UPLC-coupled high-resolution ESI-TOF-MS.
LC-MS measurements were performed using a micrOTOF ESI-TOF mass spectrometer (Bruker Daltonik GmbH, Bremen, Germany) coupled to a UPLC system (Waters, Milford, MA) controlled by HyStar chromatography software (Bruker Daltonik GmbH). A mobile phase system consisting of water and acetonitrile (each containing 0.1% formic acid) was used, and separation was performed with an RP-18 column (50 by 2.1 mm, 1.7 µm particle size; Waters Acquity BEH). Data sets were acquired in positive electrospray (ESI) mode in a scan range from 100 to 1,200 m/z at a sampling rate of 2 Hz. Sodium formate solution was used for calibration and injected at the beginning of each chromatographic run. Quality control samples and blank runs were interspersed between the samples under investigation.
Data analysis and statistical interpretation.
Data evaluation by PCA was performed with ProfileAnalysis, a program within the Metabolic Profiler software suite (Bruker Daltonik GmbH). The LC-MS data, commonly including retention time (RT), m/z, and intensity values, were prepared for PCA, using either one of two approaches: (i) classical bucketing, which involves the formation of rectangular "buckets," comprising m/z and RT windows of adjustable size (these buckets summarize the intensities of all signals in the respective window [Fig. 2B]), or (ii) advanced bucketing, which identifies a chromatographic peak by detecting the signals at a certain m/z value in successive mass spectra as well as the corresponding isotope peaks and multiply charged ions from the same parent mass within a narrow tolerance interval. These signals are then defined as corresponding to one compound rather than being treated independently. This approach will be described in detail elsewhere. Further steps involved in PCA model generation are the transformation of RT-m/z pairs and intensity values into a covariance-based coordinate system and the calculation of so-called principal components (PCs), the number of which is chosen as sufficiently high to explain >95% of the variation in the data set (17). A pairwise plot of PCs yields a graphical representation, the score-and-loading plot, which consists of two diagrams: the first one reveals a grouping pattern of samples, and the second one serves to derive the observations (here, RT-m/z buckets) which are responsible for this pattern.
![]() View larger version (29K): [in a new window] |
FIG. 2. Identification of strain-specific compounds in M. xanthus extracts by UPLC-coupled ESI-TOF-MS and PCA. (A) Scores & Loadings plot for PCs 1 and 2 from an analysis comprising 10 randomly chosen samples (squares, triangles) and 5 biological replicates (crosses) of the DK1622 standard strain. The spatial arrangement in the left diagram indicates significant variation among these samples. Circles off the center in the right diagram represent signals (m/z-RT pairs termed "buckets") which contribute to variation. Values along the axes represent the relative degrees and directions of correlation (dimensionless). (B) Illustration of the rectangular bucketing concept for preparation of LC-MS data prior to PCA. (C) Bucket statistics plot revealing that samples 1, 4, 9, and 10 contain the compound of 814.42 m/z with various abundances, while it is absent from the other samples. (D) EIC created for 814.42 m/z in four selected samples, demonstrating the qualitative and quantitative nature of the differences revealed by the PCA model (baselines in traces for samples 5 and 15 are upscaled by a factor of 10 for clarity). (E) BPCs in the m/z range of 800 to 900 for the same subset of samples. HR, high resolution. (F) Generation of a molecular formula proposal for a compound with an 814.4179 m/z by use of accurate m/z values from high-resolution mass spectra and evaluation by calculation of the m/z deviation and comparison to the simulated isotopic pattern.
|
m/z of 1 Da and a
RT of 20 s for conventional bucketing and a
m/z of 20 mDa for advanced bucketing. Each data set was normalized to the total intensity in an analysis. Target screening was performed using QuantAnalysis software (Bruker Daltonik GmbH) for automatic peak integration on extracted ion chromatograms (EICs) created at a 10-mDa mass accuracy. Molecular formulae were calculated using the built-in "generate molecular formula" function in the DataAnalysis software suite, which implements the SigmaFit approach to rank proposals according to both mass deviation and isotopic pattern accuracy (19). A diversity accumulation curve was calculated using the EstimateS software package (4). Compounds are regarded as chemical "species" in the context of this analysis, and a binary matrix listing their presence or absence in a sample (extract from one strain) was used as the input. The obtained curves represent the means of 100 accumulation randomizations. Estimates of the total number of compounds present were based on the abundance-based coverage estimator (3).
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Secondary metabolites known to be produced by Myxococcus xanthus and properties used for their identification via UPLC-coupled high-resolution ESI-TOF-MS in this study
|
By use of the accurate m/z values from high-resolution measurements, molecular formulae were proposed for putative distinct compounds suggested by the above-described analyses (Fig. 2F). Naturally, this results in disproportionately high numbers of proposals as molecular weights increase. The proposals were ranked according to the mass positions and intensity ratios of isotope peaks in comparison to those in a calculated spectrum derived from a suggested molecular formula, implementing the SigmaFit approach (19). Elemental formulae were subsequently inspected manually, and obvious false positives (as determined by chemical considerations) were disregarded.
Compound proposals retained after both statistical analysis and manual examination were compiled into a list of 37 accurate m/z values, corresponding putative molecular formulae, and RTs (Table 2), and this list was used for a targeted analysis of all 98 datasets. This was achieved by automated generation of EICs, using the accurate m/z values of compounds, followed by peak integration and calculation of peak areas. Due to the high dynamic range and overall sensitivity of the instrumentation used, the presence or absence of a compound in a sample could be determined with high confidence and the relative production yields from all strains could be estimated (Fig. 3). The resulting binary matrix of 40 nonubiquitous compounds observed in all samples (Table 3) forms the basis of this study, enabling conclusions regarding the set of metabolites produced by one strain (hereafter referred to as a strain's "chemotype") as well as the distribution of compounds across all samples.
|
View this table: [in a new window] |
TABLE 2. Putative novel compounds not reported to derive from M. xanthus before which are revealed to be produced by subsets of the 98 strains under investigation in this study and the proposed molecular formulae according to high-resolution ESI-TOF-MS measurementsa
|
![]() View larger version (10K): [in a new window] |
FIG. 3. Identification of cittilin A in extracts from M. xanthus Mxx100. (A) The sensitivity of the ESI-TOF screening platform enables reliable compound identification even in samples from very weak producers. HR, high resolution. (B) The obtained data are useful for a semiquantitative comparison of production yield in all 98 strains. Numbers in parentheses are yields relative to the average level from all 98 strains (which was set as 1).
|
|
View this table: [in a new window] |
TABLE 3. Metabolite profiles of all 98 strains, listing the nonubiquitous compounds detected in their extracts as well as the number of producing strains for every compound and the number of compounds found in every sample
|
![]() View larger version (12K): [in a new window] |
FIG. 4. (A) Slices represent percentages of compounds which are produced by less than 20% (black), between 20 and 80% (light gray), or more than 80% (dark gray) of the strains in the respective sample sets. (B) Diversity accumulation curves, based on the distribution of 40 compounds in 98 samples as revealed in Table 3. Each data point represents the average number of discovered compounds that would result if n samples were taken (calculated as the mean of 100 randomizations).
|
Diversity accumulation curves were generated to estimate what proportion of global M. xanthus secondary metabolite diversity is likely to have been represented by our samples (Fig. 4B) and to compare global diversity with the diversity present at the centimeter scale. An accumulation curve is the product of both the diversity of a population and the sampling effort (4). If enough samples are taken, the same chemotypes will be sampled repeatedly and the accumulation curve will reach saturation, indicating that no more chemotypes are likely to be present. The shape of the accumulation curves (Fig. 4B) shows that most samples contain a large proportion of identified compounds. For example, on average, random sampling of just 10 strains would yield 30 of the 40 nonubiquitous compounds present in the total set. Nonetheless, the slopes of the global and total curves remain positive as sample size nears its maximum, suggesting that continued sampling effort should reveal more compounds.
Chemotype diversity on worldwide and local scales.
Our analysis reveals two major patterns. First, chemotype diversity in the local population is lower than that found globally (Fig. 4B), despite the facts that 18 distinct chemotypes are present in the centimeter-scale local set and that differences among the local chemotypes extend to known compounds (myxovirescin is not produced by all 20 local isolates, and cittilin was detected in only 3 samples). Two strains (A98 and B5) (Table 3, rows 18 and 20) account for much of the diversity in the local 20-strain subset, being the exclusive producers of 11 out of 28 observed compounds. Thus, although spatial structuring was not detected at the meter scale (see above), this larger data set shows reduced diversity at the centimeter scale.
Second, our data indicate that intensive sampling at individual sites can substantially increase the estimate of the total number of metabolites beyond that obtained from analysis of single samples from many sites. To examine this point, we made one estimate of total global richness in which we included only one sample per site in all cases where multiple samples had been taken from individual sites (e.g., Poland, Niaux, Ferrara, etc.) as well as all samples which were the sole isolates taken from their respective origin sites (n = 62). This set of strains yielded a global richness estimate of 41.7 metabolites (standard deviation, 0.73). However, analysis of all isolates, including those originating from the same site (Conero, Ferrara, Gran Sasso, Iran, Malago, Massif, Nei, Niaux, NJ, Poland, Serengeti, Solanka, Tübingen, and Sulawesi), yields a total global richness estimate of 45.1 metabolites (standard deviation, 1.37).
In order to both assess the effects of biological variation in the fermentation process and evaluate the use of a PCA model for in-depth examination of largely similar extracts, replicate fermentations of 4 "A" strains isolated in close proximity were performed and compared to replicate samples from the DK1622 model strain. The observed grouping pattern of samples in the Scores and Loadings plot (Fig. 5A) indicates that variation between replicate cultivations of the same strain is sufficiently small relative to systematic differences between the metabolite profiles of distinct isolates to allow clear detection of the latter. The compound most affecting variation among the strains was identified as DKxanthene-534, which was present in all isolates but which varied quantitatively in the amount present in the different strains. (Fig. 5B).
![]() View larger version (16K): [in a new window] |
FIG. 5. PCA for extracts from four M. xanthus strains isolated in close proximity and the DK1622 wild-type strain, using samples from replicate fermentations. (A) Grouping of samples reveals clear systematic differences between their metabolite profiles. The strains are as follows: 1, A23; 2, A47; 3, A96; and 4, A9. (B) Variation in the data set is largely due to quantitative differences in the production of DKxanthenes; representative EICs for an m/z of 535.25 (DKxanthene-534, C29H34N4O6) from high-resolution ESI LC-MS measurements are shown (two replicates for each strain). Ret., RT.
|
|
|
|---|
Using ESI-TOF-MS for the analysis of fermentation extracts of 98 strains from globally distributed locations, this study has revealed a high level of intraspecific diversity in M. xanthus metabolite profiles. All observations (high-resolution m/z signals and the derived molecular formula at a specific RT in an LC-MS data set) constituting a distinct difference between the metabolite profiles of two or more strains were regarded as indicating putative compounds of interest, regardless of potential biological activity or function. Molecular formulae derived from accurate mass measurements were used for the identification of known compounds from an in-house database and served as descriptors for putative novel compounds (Table 2). PCA was successfully used to detect significant differences between LC-MS datasets even in the presence of significant background noise. To our knowledge, this is the first use of PCA for the analysis of complex metabolite samples, such as myxobacterial extracts.
While previous screening projects that included multiple M. xanthus strains reported only a small number of compounds (7), we are confident that the sample size and screening methods used in this study have captured a substantial fraction of global M. xanthus metabolite diversity. Seven of the eight previously known metabolites were present among our samples (Table 1), with only saframycin (already known to be rare [12]) absent. An impressive 37 additional compounds that represent putative novel metabolites could be defined. Some of these compounds may represent novel derivatives of previously classified natural product families, as families such as the DKxanthenes and the myxalamids occur in at least 11 and 4 slightly differing chemical forms, respectively (13, 18).
Analysis of the M. xanthus DK1622 genome has revealed a high number of putative PKSs, NRPSs, or hybrid biosynthetic pathways, the products of which are not yet known. A recent study has provided evidence that many of the encoded enzymatic natural product assembly lines are actually active and expressed, since the corresponding peptides could be identified by mass spectrometric proteome analysis (21). We expect that comparative mining of the M. xanthus secondary metabolome as performed in this study can facilitate the definition of potential candidate compounds and help to disclose the products delivered by these biosynthetic enzymes in strains that possess the respective gene clusters. Studies to elucidate the function of each secondary metabolite pathway in strain DK1622 are currently in progress.
It is unlikely that all of the putative novel compounds listed in Table 2 can be structurally elucidated and directly correlated with a biosynthetic gene cluster. In principle, some of these compounds may represent artifacts resulting from metabolic differences between strains, although the mass range chosen for our analysis should mitigate this possibility. Alternatively, proposed molecular formulae might be incorrect because the detected compound is actually a breakdown product of labile compounds or a product of chemical reactions taking place during the workup procedure. However, the existence of distinct compounds with very similar occurrence patterns plus putative elemental compositions suggests that at least some are likely to be real secondary metabolites synthesized by PKS and/or NRPS biosynthetic routes. For example, compounds c5802+ and c5872+ as well as c800 and c814 (which display an m/z difference of 14 Da) are suggestive of biosynthetically meaningful structural differences and also have similar RTs (Tables 2 and 3). Furthermore, with respect to the above-mentioned genomic potential of M. xanthus DK1622, we find it intriguing that this strain produces 10 candidates from the list of putative novel compounds which we report here (Table 2). DK1622 is additionally shown for the first time to produce the known myxobacterial natural product cittilin (20).
The genetic population structure of M. xanthus has recently been studied at the centimeter scale, utilizing 78 local isolates from a 16- by 16-cm plot of soil, and it was concluded that M. xanthus cells are "surrounded in the soil by a wide range of genetically distinct conspecifics" (28). The picture that emerges from our analysis of metabolite profiles from 19 of these local isolates is fully consistent with this statement. Although the metabolite profiles of the local Tübingen isolates (Table 3, rows 1 to 19) are more similar to each other than they are to most of the global isolates, they nonetheless show that the sampled centimeter-scale population houses substantial metabolite diversity. Another similarity between the genetic analysis of Vos and Velicer (28) and our results here is the finding that the metabolite profile of strain A98 (Table 3, row 18), which had been identified as phylogenetic outgroup of most of the local isolates, differs remarkably from the profiles of those other isolates. Analysis of genetic diversity within versus between pairs of meter-scale isolates did not reveal spatial structure in the distribution of chemotype diversity at this scale.
Comparative metabolite profiling of replicate extracts from strains isolated in close proximity by use of a PCA model (Fig. 5) revealed quantitative variation in the levels of identical compounds as a major determinant of systematic differences between their overall production patterns. One compound revealed by this analysis was identified as a derivative within the DKxanthene family, representatives of which are essential for developmental sporulation (18). Other recent studies have shown that M. xanthus strains can exhibit intense intraspecific antagonism, with some strains being capable of preventing conspecific competitors from sporulating altogether (6). We hypothesize that asymmetries in secondary metabolite production may contribute to such social incompatibilities.
A more advanced understanding of M. xanthus metabolite profiles and their functional significance will likely require consideration of not only the absolute presence or absence of compounds but also their quantitative degree of production. In addition to the classically proposed role of hindering the growth of heterospecific competitors, such compounds might also serve intraspecific biological functions (e.g., for cell-cell communication) that are not revealed by standard activity-based screening efforts.
In conclusion, we have shown that the diversity of the M. xanthus secondary metabolome is significantly greater than suggested by previously identified compounds. The large number of putative novel compounds revealed by our screen correlates well with the high number of genes of unassigned specific function in the DK1622 genome that are predicted to encode proteins involved in secondary metabolite synthesis. This high level of diversity also suggests that a significant number of natural products in M. xanthus have yet to be discovered and characterized, some of which are likely to have structures originating from PKS- and NRPS-type biosynthetic machinery. The isolation, purification, and chemical characterization of target compounds defined in this study as well as the identification of their corresponding biosynthetic gene clusters are the subject of ongoing research.
Research in the laboratory of R.M. was funded by Deutsche Forschungsgemeinschaft (DFG) and Bundesministerium für Bildung und Forschung (BMB+F).
Published ahead of print on 31 March 2008. ![]()
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»