Previous Article | Next Article ![]()
Applied and Environmental Microbiology, February 2003, p. 845-851, Vol. 69, No. 2
0099-2240/03/$08.00+0 DOI: 10.1128/AEM.69.2.845-851.2003
Copyright © 2003, American Society for Microbiology. All Rights Reserved.
Department of Food Science, Cook College, the New Jersey Agricultural Experiment Station, Rutgers, The State University of New Jersey, New Brunswick, New Jersey 08901-8520
Received 25 March 2002/ Accepted 1 November 2002
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
A simulation approach is much more suitable than an analytical approach when dealing with complicated problems with an inherent variance. Instead of using a model to calculate an exact result for a variable, a simulation calculates a value many times over, each time assuming new values for each input, to give an estimation of the real value (15). Monte Carlo simulation is the most widely used simulation method in biology (4, 16) and has been used to study the mechanism in such biochemical procedures as cell cycle control (9) and plasmid replication (10). While many plasmid replication models have been exploited using the molecular, modeling, and simulation methods (3, 12, 13, 20), Kuo and Keasling (10) found that the use of a Monte Carlo simulation to compare the current theories was advantageous, since the possible combinations of replication and partition theories make laboratory experiments cumbersome and time consuming.
Monte Carlo simulation was used in this research to investigate whether Clostridium botulinum strain 56A spores communicate with each other during the process of germination, outgrowth, and multiplication. Independence between the spores was assumed for all the simulations. Each simulation used a higher inoculum to simulate the time to detection (TTD) of a lower one, keeping all the environmental conditions the same. For example, results from 10,000 spores (at pH 6.5, 0.5% salt, and 30°C) were used to simulate the results for 1 spore (at the same pH, salt level, and temperatures). If the independence assumption is true, then the simulated results should resemble the observed results. If communication exists between spores or between cells and spores, however, then the simulated TTDs, by using data from a higher inoculum, should be shorter and more homogeneous than the observed TTDs.
| MATERIALS AND METHODS |
|---|
|
|
|---|
A series of equations was used to convert the maximum growth rates into doubling times at the inflection point of the Gompertz equation by using the calibration curve. Microbial lag times were also calculated from TTDs by subtracting the time used by germinated spores to reach the detection limit of the machine. These calculations were carried out for each growth curve and for all conditions.
Distributions for lag doubling times, TTD, and spore number.
The lag times, doubling times, and TTDs were input into BestFit version 4.0 (Palisade Corporation, Newfield, N.Y.). All continuous distributions were fit to the data, and the choice of the most suitable distribution was based on goodness-of-fit statistics, visual observations, and parsimony of the parameters.
Plate counts from
1 spore/well were pooled. A total of 1,032 plate counts were included, ranging from 0 to 10 CFU/well. All discrete distributions available in BestFit version 4.0 were fit to the data and evaluated using the chi-square statistic.
Plate counts from
100 spores/well were pooled. A total of 931 plate counts were included, ranging from 14 to 184 CFU/well. Both discrete and continuous distributions were fit to the data.
Simulation configuration.
All simulations were conducted with @Risk (Palisade Corporation), which is an add-in program for Excel (Microsoft Corporation, Seattle, Wash.).
Each condition was represented by a separate spreadsheet. Every recalculation of the spreadsheet represented one replicate. The number of spores per iteration was generated from the spore number distribution mentioned above. All the spores were assigned distinct lag times and multiplication rates. Each spore was represented by one column, and each time increment was represented by one row.
The simulated TTD, doubling and lag times, and total cell concentration were set as outputs. Simulations were monitored for the change in percentiles, means, and standard deviations (SDs) of each output. Simulation was halted when the results converged (i.e., all changes were lower than 1.5%). The doubling time, lag time, and spore number outputs were checked to ensure biological sensibility (i.e., doubling time and lag time were greater than zero) and computational accuracy (spore number did not surpass the maximum number of columns). The output for simulated and observed TTDs was saved in separate files and analyzed as described below.
Simulation procedure and data analysis.
Simulation mimics the practice of applying results at high inoculum sizes to systems with lower initial load. Both the lag time and the doubling time were from a higher inoculum. Three types of simulations were carried out: 10,000 spores simulating 1 spore, 10,000 spores simulating 100 spores, and 100 spores simulating 1 spore.
An array of spores was generated, each with a different lag time and doubling time. For each spore, the time increased until it was larger than its lag time and the spore then became a cell and the number of cells doubled according to an assigned doubling rate. The simulated number of cells in a well increased until it reached a value representing a minimum detection limit. The time when the detection limit was first reached was recorded as the simulated TTD. The observed TTD for the condition being simulated was represented as a distribution in the same spreadsheet, so the same number of observed and simulated TTDs was generated.
Point summaries for both simulated and observed TTDs were recorded, and ratios of the location and spread in simulated and observed data were calculated. Location indicators included ratios for the mean (Rmean = meanobs/meansim) and median (Rm = medianobs/mediansim), with the latter one being a robust estimate. Spread indicators included ratios for the SD (RSD = SDobs/SDsim) and 90% range (Rr = 90% rangeobs/90% rangesim), the latter one also being robust. Models were developed to describe the effect of the factors (inoculum difference, temperature, pH, and salt concentration) on the robust ratios since these estimations are more conservative, i.e., closer to one.
When kinetic data were not available at a lower inoculum for comparison, the simulated TTDs were summarized and the time to reach 95% growth was obtained (the 95th percentile of the simulated TTDs), defined as Day95% Sim. The observed percentage of positive wells at this time was obtained experimentally.
| RESULTS |
|---|
|
|
|---|
= 2.08 spores/well) as the distribution of target 1 spore/well. Both the discrete and continuous distributions were fit for target 100 spores/well. The distribution chosen for target 100 spores/well was normal (µ = 67.02,
= 24.44).
Maximum number of simulated columns in the simulation template.
Each spore was represented by one separate column. The maximum number of columns in the simulation was set up such that the maximum number of spores generated in all iterations did not exceed this number. Each simulation achieved convergence in fewer than 5,000 iterations. The 1 to 1/5,000 = 0.9998 = 99.98th percentile of the spore number distribution gives the value such that the maximum spore number would exceed it only 1 out of 5,000 times. These values were calculated in Splus 2000 (MathSoft, Inc., Seattle, Wash.) to be 153.5568 and 9 for 100 and 1 spore/sample, respectively. The maximum numbers of spreadsheet columns were set at 160 and 12 for slight additional safety factors, and none of the simulations produced spores numbers exceeding these values.
Simulation.
There were 50 simulations total, 26 with kinetic data at the target inoculum and 24 without. For some conditions, the simulated TTD agreed well with the observed TTDs, while in other cases, the two TTDs were very different from each other (Fig. 1). Spread ratios (Rr and RSD) were generally larger than the corresponding location ratios (Rm and Rmean) for the same environmental condition and same spore simulation scenario (Table 1). Rm and Rr were more conservative estimators than Rmean and RSD, respectively (Table 1); therefore, discussions will only be made on Rm and Rr from this point on. In all but four conditions, both ratios were greater than one (Fig. 2). The median of the observed and simulated TTDs were not dramatically different from each other, so the data points fell closely to the 45° line, although mainly below the line (Fig. 2A). The spread of the observed TTDs was much larger, however, as evidenced by the far-dispersed data points below the 45° line, even on the natural log scale in Fig. 2B. A general trend could be observed that, when 10,000 spores/well was used to simulate 100 spores/well, both ratios of the median and range were smaller than those in simulations that used 10,000 spores to simulate 1 spore. Using 100 spores to simulate 1 spore had a ratio of median and range similar to that when 10,000 spores were used to simulate 100 spores (Fig. 3).
|
|
|
|
![]() | (1) |
![]() | (2) |
In both equations, the categorical variables (Dummy1 and Dummy2) were indicators for spore scenarios. Dummy1 represents those simulations in which 10,000 spores simulated 1 spore. Dummy2 represents those simulations in which 100 spores simulated 1 spore. Using 10,000 spores to simulate 100 spores served as the baseline for comparison. Each of these two categorical variables had two possible values, 1 when the spore simulation scenario was the one it represented and 0 when it was not. Therefore, Dummy1 equaled Dummy2 equaled 0 when 10,000 spores were used to simulate 100 spores, Dummy1 equaled 1 and Dummy2 equaled 0 when 10,000 spores were used to simulate 1 spore, and Dummy1 equaled 0 and Dummy2 equaled 1 when 100 spores were used to simulate 1 spore. Simulating 1 spore with 10,000 spores resulted in a larger difference in the median of observed TTDs and that of simulated TTDs than the result obtained from using 10,000 spores to simulate 100 spores (equation 1). Using 100 spores to simulate 1 spore also resulted in a larger difference in Rm, but not one as large as the difference obtained when using 10,000 spores to simulate 1 spore. The P values observed for both dummy variables, however, were larger than 0.05 (0.109 for Dummy1 and 0.103 for Dummy2). Therefore, the difference in Rm between the three different spore scenarios was only marginally significant statistically. The effect of Dummy1 on Rr was very prominent (equation 2): e1.02 was
2.78 times larger when simulating 1 spore with 10,000 spores than that with simulating 100 spores with 10,000 spores. This term was also very significant (P < 0.00001). The increase over baseline in ln (Rr) when simulating 1 spore with 100 spores was about 0.12, but with a nonsignificant P value (0.282). The effect from the environmental factors was quite complicated and not the major focus in our research.
When kinetic growth was not observed for the target condition, the simulated TTDs were compared with the percent positive over time data (Table 2). The percent positive observed at the time of the 95th percentile of the simulated TTD (i.e., at this time, 95% of the simulated wells had shown growth) is presented. Only 2 of 24 conditions ever resulted in detectable turbidity changes in more than 10% of the wells. Thirteen conditions had no observed growth.
|
| DISCUSSION |
|---|
|
|
|---|
By using data collected for the growth of C. botulinum spores at different inoculum concentrations, pH, temperature, and salt levels, a computer simulation was carried out, assuming independence between spores. In all 50 simulations, the results from a higher inoculum size were used to simulate the TTD at a lower inoculum size while all environmental factors were kept the same. If spores do not interact with each other, the simulated results from high spore concentrations should resemble the results at lower spore concentrations. On the other hand, if spores do communicate with each other, it should result that the simulated TTDs are smaller and more homogeneous than the observed TTDs. Therefore, by comparing the simulated and observed TTDs, conclusions can be made on the validity of the independence assumption.
This independence assumption can be realized best with simulation. In a real microbial experiment, it is almost impossible to construct a system where spores stay reasonably close but certain physical barriers in between prevent them from signaling each other. Even if the theoretical design is possible, the complexity of the system makes it very expensive to build and operate. Running such a system requires a considerable amount of financial support and, in many cases, a very long time. On the other hand, simulation is the natural approach to carry out this methodology. Independence between spores and cells on the spreadsheet is the default setting. A recalculation of the spreadsheet (a replicate) takes less than 1 s. Any one simulation can be finished in 1 h when the simulation settings are optimized. Therefore, computer simulation is the best (and probably only) technique for realizing the assumption of independence.
Choice of the best distribution.
The choice of the best distributions was based on both the fitness of statistics and visual observations. For continuous data, the Anderson-Darling (A-D) and Kolmogorov-Smirnov (K-S) tests were given more weight than the chi-square test since these two statistics were more powerful and less arbitrary (17). The chi-square statistic is probably the oldest (14) and most widely used of all goodness-of-fit statistics. Sometimes the chi-square test can result in completely different conclusions depending on the bin number chosen. Presently, no prescription is guaranteed to produce a valid and unbiased result. When setting up the program parameters in BestFit, the equal probability option instead of the equal bin option was chosen for chi-square binning. With this specification, bins in chi-square test have equal probability intervals for each fitted distribution instead of equal lengths for the input data. To some extent, this method can remove the arbitrariness of bin selection in a chi-square test, which can be a major drawback (2, 11). While the chi-square test is essentially a comparison between a histogram of the data and the density function of the fitted distribution, the K-S method compares the empirical cumulative distribution derived from the data and the cumulative distribution of the fitted function. Therefore, no grouping of the data or bin size specification was needed. The K-S statistics find the largest difference between the two distributions, and thus, it focuses in the middle of the distribution. The A-D statistics are a modification of the K-S statistics obtained by putting more weight at the two tails (probability tends to 0 or 1); therefore, it amplifies the discrepancies at the two ends (1). In most situations, the A-D and K-S statistics gave quite similar results while the chi-square statistics resulted in very different findings. Visual observations were made with the histogram of the data overlaid with the fitted distribution. Special attention was paid to the mode of the fitted distribution compared to that of the histogram and the percentage covered by the two tails. Linearity of the quantile-quantile plot and probability-probability plot was also checked. The quantile-quantile plot highlights the difference at the tails of the data and the fitted distribution. The probability-probability plot, on the other hand, amplifies the difference in the middle (11). The distribution with fewer parameters was chosen when both fit the data very well.
Some conditions have less than 80% growth observed at the end of the continuous observation period. A distribution was found for these conditions, although the resulting distribution was seriously biased for the fast-germinating spores. Whenever used in the simulation or in a comparison of the observed and simulated TTDs, these conditions were given special attention.
Spore number distributions.
The Poisson parameter
is the mean of the distribution (also the variance). The fitted Poisson distribution with a
of 2.08 indicates that the average number of spores/well is 2 when the target inoculum size is 1 spore/well. This is a direct result of the modification in the experimental procedure to reduce the chance of a well getting no spore (21, 22).
When a variable is specified as discrete, each distinctive value of this variable is treated separately. Plate counts of target 100 spores/well have more than 100 distinctive values, resulting in a large number of categories and rendering all discrete distributions unacceptable. The central-limit theorem states that when the number of observations is very large, the Poisson distribution approaches the normal distribution (15). More than 900 plate count observations are included for a target of
100 spores/well; therefore, the normal distribution can be used in place of the Poisson distribution. While the normal distribution was not chosen as best by any fit statistics, when observed graphically, however, the normal distribution appeared to fit the histogram equally as well as the best choice (data not shown). Because of its universal application in many fields and its ease of use, the normal distribution was chosen for target 100 spores/well. In a recent publication, Corradini et al. studied the applicability of eight different probability distributions to bacterial plate counts in three different food products by using maximum likelihood estimation (6). They also assumed that a continuous distribution could be used to describe the plate count as long as the number of observations was large. Theoretically, the distribution for a nonnegative variable like plate count should be skewed to the right, but Corradini et al. found that the actual distributions are not extremely asymmetric. Occasionally, the normal distribution fit the observed values even better than those right-skewed distributions.
Meaning of the simulation results.
The lag times and doubling times calculated from turbidometric measurement at a higher spore concentration were used to simulate the TTDs of a lower spore concentration. The results show that simulated TTDs are smaller and more homogeneous than the observed TTDs, indicating that interactions exist between the spores or between cells and spores. Additionally, there are differences between the three spore scenarios: 10,000 spores simulating 1 or 100 spores and 100 spores simulating 1 spore. The largest difference between the variance of the simulated and observed TTDs occurs when 10,000 spores are used to simulate 1 spore, which is also the largest difference between the simulating inoculum and the target inoculum. The other two spore scenarios have the same ratio of simulating inoculum and target inoculum (simulating inoculum/target inoculum = 100 in both cases), and the difference between the variance of the simulated and observed TTDs is also similar (equation 2). Rm was the largest again when using 10,000 spores to simulate 1 spore, followed by that when using 100 spores to simulate 1 spore, and the smallest was obtained when using 10,000 spores to simulate 100 spores. The effect of the categorical variable (Dummy1 and Dummy2) on the ratio of median, however, is only marginally significant at 0.1 levels. It is nonetheless interesting that the ratios of the median and 90% range are larger when using 100 spores to simulate 1 spore than those when using 10,000 spores to simulate 100 spores, since both use results from an inoculum level 100 times higher than that of the target inoculum. This suggests that the possible cutoff point of the low and high inoculum concentrations is a number lower than 100 spores. Since the 100:1 spore simulation crosses the boundary, the difference between the simulated and observed TTDs is larger. The cutoff point implied here is different from that by the modeling approach, which suggests a level between 100 and 1,000 spores (21).
Although this research disapproves the independence assumption between spores, it is still premature to conclude at this point that signaling exists between spores during germination without any physical evidence of the putative signaling molecule. More research using the microbiological and biochemical approaches is required to further the investigation before a mechanistic explanation of the germination process can be reached.
Conclusions.
The simulation approach was used to investigate the hypothesis that communication exists between spores or between spores and cells during germination, outgrowth, and growth. Growth parameters at a higher inoculum were used to simulate the TTD at a lower inoculum size, assuming spores were independent from each other. The simulated TTDs were smaller and more homogenous than the observed TTDs. When the difference between the simulating and target inoculum was the largest, the difference in the variance of the simulated and observed TTDs was also the largest. Results from this approach agree with our previous results by using mathematical models that spores are not independent from each other during germination, outgrowth, and growth.
| ACKNOWLEDGMENTS |
|---|
This work was supported by the U.S. Department of Agriculture CSRS NRI Food Safety Program (grant no. 9601541), the Center for Advanced Food Technology (CAFT), and other state and federal support provided by the New Jersey Agricultural Experiment Station.
| FOOTNOTES |
|---|
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| J. Bacteriol. | Microbiol. Mol. Biol. Rev. | Eukaryot. Cell | All ASM Journals |
|---|