Previous Article | Next Article ![]()
Applied and Environmental Microbiology, January 2005, p. 227-239, Vol. 71, No. 1
0099-2240/05/$08.00+0 doi:10.1128/AEM.71.1.227-239.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.

Center for Limnology, University of Wisconsin-Madison, Madison, Wisconsin,1 Department of Microbiology and Cell Science, University of Florida, Gainesville, Florida2
Received 7 April 2004/ Accepted 24 August 2004
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Geography and spatial autocorrelation can impart structure to ecological data, and this structure may coincide with other sources of environmental variability, leading to spurious correlations among biological and environmental variables (6, 42). Untangling the effects of environmental variation from those due to autocorrelation (i.e., purely spatial effects) represents a major challenge for microbial ecologists investigating patterns in BCC. Lindström and Leskinen (47) have suggested that regional differences among lakes can influence the community composition of the abundant bacteria. Recently Whitaker et al. (80) attributed all "Sulfolobus islandicus" genetic variance in hot springs to geographic variation rather than to environmental sources. These studies suggest that a regional perspective may be an important consideration for the assessment of BCC and its patterns and environmental correlates.
A geographical perspective may also be valuable within individual lake districts. Borrowing ideas from landscape ecology, limnologists have begun to consider how a lake's position in the overland and groundwater flow system (i.e., landscape position [LP]) can impact various features of lake ecology (41, 61, 64, 67). These studies have shown that a wide range of biological and environmental variables appear to correlate with a lake's LP and that some of these patterns may be general across different lake districts (61, 64, 67). In particular, it was shown that lakes higher in the landscape tend to be more isolated in terms of direct overland connections to other lakes, and these lakes express a greater temporal range of variation in a number of environmental characteristics than lakes lower in the landscape (40, 49). These patterns may be reflected in the composition and dynamics of lake bacterial communities.
Geological and ecological forces operate at a variety of hierarchical scales (1, 56) and can generate complicated associations and interactions that are reflected in community composition. The present study uses data from 30 Wisconsin lakes to evaluate the spatial, temporal, and environmental influences on BCC. The principal goals were to determine the extent to which similar bacterial communities could be found in lakes with similar environments and to identify the features that best explain variation in lake BCC. The study design was intended to allow for investigation at broad (i.e., regional) and local spatial scales and to capture some of the temporal variation that lake bacterial communities have been shown to exhibit over the course of the year (12, 16, 30, 66, 82). In addition, because conclusions of microbial ecological studies can be influenced by biases inherent in molecular methods (78) and by numerical artifacts arising from data transformations (28, 55), this study investigated the impact of various treatments and transformations applied to the bacterial data set. The questions addressed here are as follows. Do the detection limits of molecular analytical techniques or the data transformation applied to community data (or both) influence the conclusions and, if so, how much? Do similar environmental conditions produce similar bacterial communities? What are the most important environmental controls of BCC? Do regional, landscape level, or seasonal factors produce similar bacterial communities, and to what extent do these factors correlate to environmental controls?
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
|
The Secchi depth, depth to the bottom of the lake, surface water temperature, and pH were recorded at each site. Up to 1 liter of water from each of the triplicate integrated samples was filtered through glass fiber filters (Whatman) to collect phytoplankton for determination of chlorophyll a concentration. Subsamples for analysis of DOC, color, specific absorbance of UV radiation A (SUVA) (79), dissolved nitrogen (DN), dissolved phosphorus (DP), and concentrations of nitrate plus nitrite (nitrite was oxidized to nitrate prior to detection [see below]) and ammonia were filtered from integrated water samples through 0.4-µm-pore-size polycarbonate membrane filters (Osmonics), placed immediately on ice, and transported back to the laboratory. Glass fiber filters and water samples for nitrate plus nitrite were kept frozen and in the dark until analyzed. Water samples for total nitrogen and phosphorus were acidified with 1 ml of Optima HCl and refrigerated. All other samples were refrigerated until analysis, which was performed no more than 60 days after collection.
Laboratory analyses. (i) ARISA.
BCC was assessed by automated ribosomal intergenic spacer analysis (ARISA) (7, 23). ARISA is a molecular technique that utilizes the length heterogeneity of the intergenic transcribed spacer (ITS) region of bacterial rRNA operons to construct bacterial community "fingerprint" profiles (Fig. 2) (23). Treating the elements of ARISA profiles as operational taxonomic units allows for "whole-community" ecological comparisons. It should be pointed out, however, that fingerprint-based assessments of BCC may overlook certain community members and may also misclassify community members by assigning ecologically identical organisms (e.g., members of the same species) to different operational taxonomic units or by assigning ecologically distinct organisms to the same operational taxonomic unit (7, 23). For the present study, ARISA profiles were assumed to be indicative of BCC, and differences in ARISA profiles were assumed to reflect variation in the composition of the respective bacterial communities.
|
To investigate the relative impacts of smaller and larger peaks in the data analyses, four different ARISA data sets were created, each differing from the other in regard to the level of sensitivity applied to ARISA fragment analysis. These sensitivity levels were defined as follows: ARISA peaks were considered for analyses only if the normalized peak height exceeded 50, 100, 150, or 200 fluorescence units (FU). Following the application of these sensitivity levels, profiles from three replicate filters for each lake-date combination were merged into a single consensus profile by averaging the peak area (in FU) of each ARISA peak across the replicates. These consensus profiles were used as the basis for all data analyses.
(ii) Environmental data.
Analyses were conducted as outlined by the North Temperate Lakes Long-Term Ecological Research site (50), and more details about these protocols are available via the "online datasets" link at http://lter.limnology.wisc.edu/index.html. DOC was analyzed by high-temperature combustion on a Shimadzu TOC-5000 analyzer using potassium hydrogen phthalate standards. Color was determined with a Kontron 930 spectrophotometer, as was SUVA at 254 nm. DN and DP were analyzed on a segmented flow calorimeter following potassium persulfate-sodium hydroxide digestion. Ammonium and nitrate plus nitrite were analyzed on a segmented flow colorimeter with a copper-cadmium column to oxidize nitrite to nitrate for colorimetric detection. Chlorophyll a was extracted from glass fiber filters with Optima methanol following maceration and homogenization with a miniblender. Chlorophyll a concentration was determined with a Kontron 930 spectrophotometer based on the absorbance (of acidified extract) at 665 and 750 nm.
Numerical methods. (i) Data sets and data transformations: explanatory (environmental) variables.
The environmental data collected resulted in a data set with 11 quantitative environmental variables: water temperature, Secchi depth, pH, chlorophyll a, SUVA, color, DOC, dissolved phosphorus, dissolved nitrogen, ammonia, and nitrate plus nitrite. All of these variables except for water temperature, pH, and SUVA were log10 transformed to better conform to normality. In addition, all 11 variables were standardized by subtracting the mean and dividing by the standard deviation (SD; the so-called z-score standardization [43]). This had the effect of setting the mean of each variable equal to 0 and setting the SD (and hence variance) to 1 and making all quantitative variables dimensionless (43). All analyses utilized these z-scores.
In addition to the 11 quantitative variables, 3 qualitative variables were used in the analyses. These qualitative variables were coded as seven binary dummy variables (43). The study design yielded two sets of dummy variables: region (with two values, northern Wisconsin and southern Wisconsin) and month (with three values, May, July, and October). To assess the effect of lake LP (41), a third dummy variable was determined for each lake, and the lake order concept of Riera et al. (64) was applied to divide the lakes into two categories. Lakes with lake order
0 were classified as seepage lakes. This category included headwater lakes and lakes not connected to other lakes by permanent overland streams; lakes in this category are presumed to receive most of their water inputs through precipitation and groundwater flow. Lakes with positive lake order were classified as drainage lakes, and this category included all lakes possessing both inflowing and outflowing streams. Note that the drainage lakes in this study are not necessarily connected to each other. For example, Trout and Tomahawk are two northern Wisconsin drainage lakes, but they are located in different watersheds, the former draining into the Chippewa-Flambeau river system and the latter draining into the Wisconsin River (Fig. 1).
The effects of these three qualitative variables on ARISA profile richness were tested by three-way analysis of variance using the R-1.8.0 language and environment (R Foundation for Statistical Computing, 2003). Separate tests were performed for each ARISA sensitivity level, with Bonferroni correction for multiple testing applied.
(ii) Data sets and data transformations: response variables (ARISA).
PCR amplification is known to introduce bias in the ratio of amplified products in mixed-template reactions (60, 68, 69) and is not generally trusted to yield quantitative estimates of bacterial abundance in methods such as ARISA (23, 78). However, several studies have revealed that fingerprinting methods produce remarkably replicable results, even when profiles are produced from different replicate extractions or when they are based on different nucleic acids (Fig. 2) (23, 54, 62). Additionally, it is well known that different transformations of ecological data can lead to different conclusions, e.g., presence-absence of species versus species abundance (14, 15).
To assess the impacts of these different views of molecular data, three different transformations were applied to ARISA profiles to generate quantitative, semiquantitative, and presence-absence data sets. Quantitative ARISAs used the normalized peak areas (FU) of ARISA peaks as direct estimates of abundance. For semiquantitative ARISAs, the normalized peak areas (FU) of ARISA fragments were "relativized" through division by total (normalized) profile signal strength (FU), mimicking transformations for percent cover or percent dominance. Finally, presence-absence analyses were conducted by scoring ARISA peaks as 1 if they were present in a given profile and 0 if they were absent but observed elsewhere in the data set. These three transformations were applied to data from each of the four sensitivity levels previously mentioned. Thus, a total of 12 different ARISA data sets were used as the bases for analyses.
For each ARISA data set, the Bray-Curtis similarity (coefficient S17 = 1 D14) of Lengendre and Legendre (43) was used to assess the degrees of similarity among ARISA profiles obtained from different lakes and times (hereafter referred to as sites) and to produce a site-by-site similarity matrix, Si (where i [1
i
12] refers to the sensitivity level or transformation of the particular ARISA data set). Note that for presence-or-absence-transformed data, the Bray-Curtis similarity is identical to the Sørensen's similarity (coefficient S8 of Lengendre and Legendre [43]). To determine the influence of ARISA sensitivity and data transformation on the structure of BCC data, a nonparametric form of the Mantel test (43) was implemented. The various matrices Si, which summarized the relationships in BCC observed between sites, were compared with the Spearman r rank correlation coefficient, and the results of these comparisons were summarized by a complete-linkage clustering. The production of Si, calculation of Spearman's r, and clustering were all performed with the software package PRIMER 5 for Windows, version 5.2.7 (PRIMER-E Ltd., 2001; routines SIMILARITY, TWO-STAGE, and CLUSTER).
(iii) CCA.
Canonical ordination can be regarded as an extension of linear regression analysis to the multivariate case (34, 43). To determine which environmental variables best explained patterns of similarity in ARISA profiles among sites, canonical correspondence analysis (CCA) (71, 73) was applied. CCA is an ordination technique that seeks the most prominent linear gradients in multivariate data sets, under the constraint that the gradients are linear combinations of a set of explanatory variables. Like multiple linear regression, CCA can use forward (or backward) selection to generate the most efficient model from a set of potential explanatory variables.
CCA was performed using the software package Canoco for Windows, version 4.51 (Biometris-Plant Research International; 1997 to 2003) according to the recommendations of the authors (72). The 11 quantitative and 7 dummy environment variables constituted the initial pool of explanatory variables, and the 12 different sensitivity levels or transformations of ARISA were used as response variables in separate runs of CCA. To determine the combination of explanatory variables that described the most influential gradients of ARISA profiles, the forward selection method outlined by ter Braak and Verdonschot (73) was used. Explanatory variables were added until the addition of further variables failed to contribute significant improvement to the model's explanatory power, as assessed by permutation test (499 permutations under the full model) with sequential Bonferroni adjustment applied to significance tests. Each CCA was conducted under the following additional conditions: biplot scaling with a focus on relationships among sites and down-weighting of rare ARISA fragments, which reduced the abundance of rare ARISA fragments (those occurring with a frequency less than 20% of the frequency of the most common ARISA fragment) in proportion to their frequency (72).
To further explore the patterns in BCC revealed by ARISA, partial CCA was employed. This type of analysis can discern patterns related to one set of variables while controlling for a different set of variables, called the covariables (43). For partial CCA, the analysis was first carried out for the covariables, and then the residual variation was subjected to a second CCA where the axes were constrained by the remaining variables (73). Partial CCA was done using Canoco for Windows, version 4.51, with the dummy variables for region and LP used as covariables.
| RESULTS |
|---|
|
|
|---|
|
|
|
|
|
|
| DISCUSSION |
|---|
|
|
|---|
ARISA provides no specific phylogenetic information about bacterial communities, so it is not possible to definitively address the ecological relevance of changes in OTUs described by ARISA profiles. The answer to this question will ultimately depend on the ability of ARISA-based OTUs to distinguish different species and also on whether bacterial species (as opposed to strains or polyphyletic functional groups) are the units responding to ecological change. Efforts are ongoing by the authors and their colleagues to link specific ARISA peaks from profiles to sequences of 16S rRNA genes from environmental clones, and these data should be invaluable in determining which bacterial species respond to different sources of environmental variation. In the meantime, it is possible to roughly gauge the relationship of ARISA-based OTUs to separate bacterial species. In a survey of intergenic transcribed spacer sequences from GenBank, Fisher and Triplett (23) found that more than 90% of species within the same genus, and 80% of species from different genera, would contribute unique ARISA peaks to profiles and that fewer than 30% of surveyed taxa would generate multiple ARISA peaks from the same individual. Thus, differences in ARISA profiles are likely to provide reasonable approximations of ecologically relevant variation in bacterial communities.
Another concern in the use of genetic fingerprints relates to the quantitative use of post-PCR data to describe the natural abundances of the organisms generating these patterns. It has been well documented that the application of PCR to mixed-template reactions, such as those from whole-community DNA extracts, can bias the ratio of products with respect to their starting ratio (60, 68, 69) and to the ratio of the organisms in the original community (19). There are numerous other potential sources of bias in PCR that could prevent its use in quantitative studies (78). It appears that a conservative application of PCR-based data for molecular ecology would involve simply looking at the presence or absence of the elements comprising profiles. However, presence-absence data analyses also have problems. First of all, these analyses are completely insensitive to community patterns arising from differences in species evenness and dominance, and this may distort ecological relationships in the data set. By simulating different sources of experimental error, including error due to PCR bias and error due to presence-absence data transformations, Muylaert et al. (55) found that the errors introduced by presence-absence data transformations were the most likely to obscure relationships between species and environment. Additionally, similarity indices such as Bray-Curtis (S17) can be disproportionately influenced by rare species when presence-absence transformations are applied, as these transformations weight all species identically (14, 15). This can result in analyses that are dramatically influenced by organisms generating signals right around the detection limits of molecular techniques (28). This effect is demonstrated in the present study by the influence of ARISA sensitivity levels on the presence-absence analyses and the lack of the same for the semiquantitative and quantitative analyses (Fig. 4 and 5; Table 3). This phenomenon illustrates the point that presence-absence studies are not immune to the effects of PCR bias, because this bias almost certainly plays a role in determining whether or not certain elements exceed the detection threshold of the analysis in the first place.
Ultimately, microbial ecologists should strive to use methods that are legitimately quantitative (70) or should strive to determine precisely how much PCR bias affects the results of molecular ecology studies. If PCR bias is such that it can make rare species appear to be very abundant or vice versa, then it has the potential to greatly influence the results of ecological studies involving molecular data. If, however, it is only the ratio of templates (or species) to products that is affected, then a semiquantitative approach (e.g., one that scores high-signal [FU] elements as abundant and weights them accordingly in analyses) may be appropriate.
The three data transformations applied to the datasets of the present study influenced the resulting CCA models (Fig. 4; Table 3). This was not entirely unexpected. The different transformations emphasized different aspects of the profiles, and these differences may be instructive in regard to the nature of the ecological response. For example, while DP and DN were significant variables for quantitative analyses, they were generally not significant in the presence-absence and semiquantitative analyses (Fig. 4 and 5; Table 3). This could indicate that these two variables were more important influences on the absolute dominance of different organisms between profiles, because this information would have been lost in both the presence-absence and semiquantitative transformations. However, the outstanding feature of this approach is that the various models had a set of explanatory variables in common, and this may indicate that the strategy adopted here is a useful one for dealing with issues arising from PCR-based barriers to quantitative analysis as well as the difficulties associated with presence-absence data sets. By analyzing samples with a variety of data transformations microbial ecologists may be able to identify the common features of the data structure that are insensitive to biases introduced by the data analyses and they may also be able to use the differences to identify how different parts of the community respond to different sources of variation.
Sources of variation in BCC.
In spite of the differences in data and model structure produced by different sensitivity levels and transformations, there were some remarkably consistent features of the CCA solutions. Five variables (region, LP, month [May], Secchi depth, and pH) were common to all models (Table 3). In addition, CCA plots (Fig. 5 and 6A and C) revealed the same overall patterns. Sites from drainage lakes and those from seepage lakes were arrayed along two parallel lines, with sites from northern lakes at one end and seepage lakes at the other. The quantitative environmental variables combined to describe gradients, also running from north to south or vice versa, that paralleled the major trends within the groupings of drainage and seepage lakes. The robustness of this pattern with respect to the original sensitivity level or data transformation applied to ARISA strongly suggests that these patterns describe predominant structures within the pelagic bacterial communities of these study lakes.
(i) Geographic sources of variation.
The main patterns revealed in all CCA plots were related to the statewide regional distribution of lakes and to the relative position of the lakes in the local overland and groundwater flow system (Fig. 5 and 6A and C; Table 3). Thus, bacterial communities responded to ecological variation on at least two different geographical scales. At the larger, regional scale, bacterial communities from northern Wisconsin lakes tended to be different from those in southern Wisconsin. This result echoes several recent studies demonstrating the important influence of regionalization on BCC (47, 80, 83). In addition, ARISA profiles from southern Wisconsin had, on average, more peaks than those from northern Wisconsin (Fig. 3; Table 2), which suggests that southern Wisconsin bacterial communities were more species rich. This was also noted in a previous study of Wisconsin lakes (83). There are several factors that could account for these regionalized effects. Northern Wisconsin and southern Wisconsin differ in many important aspects, including geology (51, 67), climate and vegetation (17), land use and land cover (63, 65), and anthropogenic impacts on lakes (4, 5, 26), and it is reasonable that these factors could influence lake BCC through a variety of mechanisms. It is also possible that regional differences in BCC reflect the biogeography of bacteria or of organisms that interact with bacteria. There is some current controversy concerning whether microbes even show biogeographical patterns of distribution (3, 27, 31, 80). Because ARISA provides no phylogenetic information (7, 23), it is not possible for the present study to address this question directly. However, depending on the ARISA sensitivity level, from 83 to 94% of all ARISA fragments in the present study were detected in both northern and southern lakes (data not shown). Under the assumption that each unique ARISA fragment represented a different species, most of the bacteria in the present study were distributed statewide, and thus it is not likely that bacterial biogeography provides a good explanation for the observed regional effect.
At the landscape scale, the drainage-versus-seepage classification of the lake of origin represented another prominent source of geographic variation in BCC of sites (Fig. 5 and 6A and B; Table 3). Lake LP is related to a variety of variables not measured in the present study, including concentrations of silica and major ions, susceptibility to acidification through acid precipitation, lake size and shape, richness and abundance of several vertebrate and invertebrate taxa, and intensity of human exploitation (61, 64), and any of these factors could potentially impact bacterial communities. This study used the drainage-versus-seepage classification of lakes as a rough surrogate for lake LP. Limnologists have long recognized the distinction between drainage lakes and seepage lakes (35, 67), and, even though the lake order concept of Riera et al. (64) represents a substantial refinement of the landscape context for lakes, the largest differences in variables noted by Riera et al. tended to be between seepage lakes and drainage lakes (64). These differences appear to be reflected in the composition of bacterial communities in Wisconsin lakes.
Another potential explanation for the landscape-level effect is related to the hydrology of these systems. In a study of two Swedish lakes, Lindström and Bergström (46) found that bacterial communities in the lake with short hydraulic retention time were similar to communities in the inlet and outlet streams, while those in the lake with long hydraulic retention time were distinct from the stream communities. Thus, bacterial communities in drainage lakes can be influenced by organisms washing in from streams, particularly when the water retention time of the lake is low. Unfortunately, the water retention time is not known for the lakes in the present study, nor were stream communities characterized for comparison. However, it is reasonable to generalize that water residence time in these regions of Wisconsin is higher for seepage lakes than for drainage lakes (9). Thus BCC in drainage lakes may be more susceptible to the influences of bacteria that wash in from outside, while seepage lakes may have a longer time in which to develop indigenous communities. ARISA richness was higher in drainage lakes than in seepage lakes (Fig. 3; Table 2), which could indicate the presence of greater numbers of "foreign" bacteria in these lakes. The influence of transient "wash-in" communities and water residence time on BCC bears further investigation.
(ii) Environmental sources of variation.
Two quantitative environmental variables, pH and Secchi depth, were consistently found to be significant in the CCA models (Table 3). These two variables tended to be strongly associated with the axis of regional variation (Fig. 5 and 6A and B), reflecting the fact that southern Wisconsin lakes tended to have a higher pH and northern Wisconsin lakes tended to have greater water clarity (Table 1). However, partial CCA, from which the effects of region and LP had been removed, showed that, within each region of Wisconsin, Secchi depth and pH were significantly related to variation in BCC (Fig. 6C and D; Table 3).
Secchi depth is a rough measure of water clarity, which itself represents a combination of several different causes. Thus, the identification of Secchi depth as important in these models does not suggest an immediate mechanistic explanation for variation in BCC. Secchi depth may indicate that light levels or spectral composition had a direct influence on BCC in these lakes, as shown in several studies (76, 77). However, water clarity can also be influenced by particulate (e.g., organisms) and dissolved (e.g., DOC) matter in the water. Water color, chlorophyll a, DOC, and SUVA were also measured in this study, and none of these variables was significantly correlated with Secchi depth (data not shown). However, when CCA was performed with Secchi depth removed from the pool of available variables, it was consistently replaced in the models by DOC and/or SUVA (data not shown). Thus, it is likely that the effect of Secchi depth properly reflected a composite effect of all of these sources of variation, but it was particularly associated with variables relating to the quantity and quality of carbon in the water. Regardless of the specific mechanism(s) responsible, it is clear that bacterial communities in these lakes were responding to environmental variation expressed along a water clarity gradient.
Another important source of variation in this study was pH, which has been implicated by other workers in studies of aquatic bacterial communities (44, 47, 53) and communities of organisms likely to influence BCC (21, 24, 25, 38). pH may reflect the influence of geology on water chemistry and is itself an important control of the biogeochemical transformations which can take place in a given environment. pH can also mediate the availability of ions and trace metals, which can have both inhibitory and growth-enhancing effects. Thus, pH may affect BCC through direct biological mechanisms and may also reflect the indirect influences of other unmeasured factors.
Previous work in Wisconsin lakes suggested that BCC might be structured by two principal ecological forces, one related to lake primary productivity and one related to organic carbon (83). It is well known that phosphorus or nitrogen or both are often limiting to bacterial and phytoplankton growth (10, 11, 18, 20, 57), and it has been demonstrated that BCC can also be influenced by nutrient enrichment (22). Both primary production and DOC in lakes represent sources of energy available to bacteria, and chlorophyll a and DOC may also to some extent represent potential interactions with phytoplankton, which have been identified by other workers as important influences on bacterial growth, production, and BCC (2, 16, 30, 36, 45, 52, 75, 81). Therefore it is somewhat surprising that these variables were not consistently identified as important in the present study (Table 3). Quantitative analyses identified DN, DP, and DOC as important variables (Fig. 6B; Table 3), and this may indicate that these variables determine which bacteria dominate particular communities. Additionally, the effects of these variables may have been subsumed by other explanatory variables in these models. In particular, nitrogen and phosphorus concentrations, and to some extent primary producer biomass (as assessed by measuring chlorophyll a content), were higher in southern lakes (Table 1), and, as has been discussed, DOC appeared to explain some of the variance associated with Secchi depth. That partial CCA did not enhance the significance of these variables (Table 3) demonstrates that they did not represent significant sources of variation within the two study regions, but they may have been influential in determining variation in BCC between northern and southern lakes.
(iii) Temporal variation in BCC.
Many studies have demonstrated that lake bacterial communities show considerable variation in time and may exhibit seasonal patterns (8, 30, 59, 82). Temporal variation was also apparent in the ARISA profiles of the present study (Fig. 2A to C). However, the role of sample month, especially as revealed by CCA, was not straightforward. There was no consistent temporal pattern of ARISA fragment richness (Table 2). While May was always determined to be an important descriptor in models, the other 2 months were only included in models based on presence-or-absence-transformed data (Table 3). Thus, samples collected in May were consistently different from samples collected in other months regarding both the set of ARISA fragments detected (i.e., presence or absence) and the dominance of communities by particular ARISA fragments, and this distinction is readily apparent from CCA plots (Fig. 5 and 6). Samples collected in July and October may have been unique in a subset of the ARISA fragments detected in these samples, as evidenced by the presence-or-absence analyses that weight all species equally (Fig. 5; Table 3). Therefore, while different ARISA fragments were consistently present and absent in these lakes at different times, the dominance of profiles by ARISA fragments varied from lake to lake in a fashion that was not consistent in July and October. If the semiquantitative and quantitative analyses are to be trusted, this indicates that the dominant community members in these lakes were determined by lake-specific factors that were not seasonally coherent (48).
This conclusion, however, is not consistent with another interpretation of the data. Table 3 indicates that water temperature was a significant explanatory factor in all of the models that did not include the full complement of months. July sample dates were consistently warmer than other sample dates (data not shown), and thus water temperature may mask some of the variation explained by the July dummy variable. Thus, if May was significant and water temperature was a proxy for July, then these models suggest that October samples were also consistently different given the colinearity of these three variables. This interpretation suggests that ARISA profiles showed coherent temporal variation in both the composition (presence or absence) and dominance of bacterial communities. The present study was conducted on a very coarse temporal scale, given the evidence that lake BCC can radically change over the course of a few weeks (30, 36, 82). This week-to-week variation in lakes may have made it difficult to detect any temporally coherent patterns in BCC in these lakes. However, the inclusion of either water temperature or the full complement of months in CCA models suggests that such temporally coherent behavior may exist, and studies with better temporal resolution over a large set of lakes could help sort this out.
Coherent behavior of BCC seems unquestionably apparent in the case of May samples (Fig. 5 and 6; Table 3). The dummy variable May was not related to any other variable measured in the present study (data not shown), so a mechanism is not readily apparent here. Previous work in Wisconsin lakes has demonstrated that, within the same lake, spring BCC is remarkably stable (82), and this suggests that the springtime ecology of bacterioplankton is controlled by a tight set of constraints. These constraints may include lower water temperatures; the presence or absence of large populations of grazers (i.e., the clear-water phase); the activity of primary producers; and the concentrations of limiting nutrients as determined by wintertime recycling, springtime biological uptake, and/or subsidy by elevated spring runoff. Additionally, temperate lakes stratify in the spring, and mixing dynamics may also help to structure bacterial communities. Thus, the significant influence of May samples here may be due to the proximity of the clear-water phase, the onset of stratification, or a variety of other factors that conspire to place bacterial communities in different lakes under a similar set of ecological pressures (30, 82).
Concluding remarks.
As microbial ecologists increasingly utilize numerical approaches to explain patterns in community composition, it will become more and more important to understand how the biases inherent in a molecular view of the world can influence these analyses. The present work showed that data transformations can affect the outcomes of multivariate analyses, but, despite the differences in models, it may still be possible to discern patterns that are insensitive to transformation-induced error. These robust patterns likely reflect real ecological variation in the underlying microbial communities.
This study revealed that lake BCC displayed coherent geographical variation on both regional and landscape scales, and bacterial communities in different lakes may also display coherent temporal variation. This highlights the importance of spatial and temporal structure in the community ecology of pelagic lake bacteria. Significant variation along water clarity and pH gradients both between northern and southern lakes and within each region of Wisconsin was also detected. It is to be expected that the considerable amount of unexplained variance in ARISA profiles (Table 3) represents individualistic community responses to grazing pressure, antagonistic and consortial interactions, and other factors that were not measured here but that have been shown to be important in other studies. Nevertheless, the present study demonstrates that geographic and temporal coherence, coupled with environmental influences, can generate patterns in BCC detectable across a variety of lake types.
| ACKNOWLEDGMENTS |
|---|
We are grateful to A. Kent, K. Novakofski, T. Kratz, J. Rusak, G. Lauster, K. McMahon, R. Newton, and J. Thoyre for their valuable assistance with data collection and with the preparation of the manuscript. Special thanks are also offered to J. Chipman for assistance with the spatial data set and for graciously producing the graphics for Fig. 1 and other high-quality maps.
| FOOTNOTES |
|---|
This is Journal Series no. R-10604 of the Florida Agricultural Experiment Station. ![]()
Present address: Institute of Marine Sciences, University of North Carolina-Chapel Hill, Morehead City, NC 28557. ![]()
| REFERENCES |
|---|
|
|
|---|
milauer. 2002. CANOCO reference manual and CanoDraw for Windows user's guide: software for canonical community ordination, version 4.5. Microcomputer Power, Ithaca, N.Y.