Previous Article | Next Article ![]()
Applied and Environmental Microbiology, May 2007, p. 2982-2989, Vol. 73, No. 9
0099-2240/07/$08.00+0 doi:10.1128/AEM.02611-06
Copyright © 2007, American Society for Microbiology. All Rights Reserved.

Biotechnology Center for Agriculture and the Environment,1 Department of Biochemistry and Microbiology,3 Department of Environmental Sciences,4 Institute of Marine and Coastal Sciences, Cook College, Rutgers University, New Brunswick, New Jersey 08901-8521,5 Uzbekistan Academy of Sciences Institute of Microbiology, Tashkent, Uzbekistan 7001282
Received 8 November 2006/ Accepted 21 February 2007
|
|
|---|
|
|
|---|
An important corollary of the "everything is everywhere" hypothesis is that all types of bacteria are found in all environments where their growth requirements are met. This prediction has significant implications for bioprospecting for novel secondary metabolites or enzymatic processes of industrial interest. If most bacterial species are found everywhere (i.e., are cosmopolitan), then only a limited number of samples from a particular type of environment need to be surveyed intensely to obtain a large proportion of all microbes associated with that environment. However, if there are endemic populations in similar environments at different locations, then it would be prudent to survey the greatest possible geographic breadth. In several surveys workers have investigated the levels of endemism of specific groups of microbes. For example, in an investigation of the biodiversity and endemism of cyanobacteria in thermal hot springs researchers discovered that some thermophilic strains from temperate zone North American springs are not present in hot springs in Alaska and Iceland (7). In a similar study, using 150 3-chlorobenzoate-degrading isolates from six regions on five continents, workers observed that more than 91% of the strains had unique genotypes and were present at a single site (13). In a more in-depth molecular fingerprinting survey of 248 fluorescent Pseudomonas strains isolated from 10 locations on four continents, Cho and Tiedje detected 85 unique genotypes for which there was no overlap in the sites and continental regions of the collection sites (8). Nucleic acid-based methods have also revealed geographic structuring of denitrifying bacteria in coastal sediments (25), nitrifying bacteria in the ocean (1), soil microbial populations (12), and sulfate-reducing bacteria (24).
It has been hypothesized that the bacteria that are more easily dispersed are better suited for colonizing new environments and are more cosmopolitan (29). For example, certain gram-positive bacteria, particularly Bacillus spp. and Streptomyces spp., are exceptionally well adapted for dispersal, because they produce spores that are highly resistant to desiccation and heat. The purpose of this study was to extend our inquiries of actinomycete communities and secondary metabolite gene diversity found in New Jersey soils (34) to soils collected in Central Asia in order to investigate whether different populations of this important group of microorganisms could be found in Uzbekistan. The degree of actinomycete cosmopolitanism was investigated by terminal restriction fragment length polymorphism (TRFLP) analysis of actinomycete rRNA genes. An analogous analysis was performed for an important class of secondary metabolite genes, the type II polyketide synthase (PKS) genes, which are frequently found in actinomycetes. Environmental fingerprint data demonstrated that there are distinct differences between actinomycete communities present in soil collected in Uzbekistan and actinomycete communities present in soil collected in New Jersey. Evidence of globally distributed actinomycete phylotypes, as determined by TRFLP analysis, was obtained, but these phylotypes appeared to constitute only a small proportion of the phylotypes at each location. These findings indicate that it should be useful to sample remote areas of the globe to enhance bioprospecting endeavors.
|
|
|---|
DNA extraction.
DNA extraction and PCR were performed as previously described (34). DNA was extracted from 0.5 g of soil using a MO BIO UltraClean soil DNA kit as recommended by the manufacturer. DNA yields were determined by comparison of band intensities to the band intensity for 250 ng of phage
DNA digested with HindIII on a 1% agarose gel stained with ethidium bromide and photographed using a Kodak EDAS 290 gel imaging system.
PCR and cloning.
Actinomycete 16S rRNA genes were PCR amplified from 5 ng of soil community DNA using the actinomycete group-specific forward primer 243F (15) (5'-GGATGAGCCCGCGGCCTA-3') and the eubacterial reverse primer 1401R (5'-CGGTGTGTACAAGACCC-3'). The more frequently used ribosomal reverse primers 1525R and 1492R in combination with primer 243F produce nonspecific banding patterns in agarose gels and were not suitable for TRFLP analysis of actinomycete communities (34). Actinomycete TRFLP patterns were generated by labeling primer 243F with 6-carboxyfluorescein (FAM). The PCR mixtures (50 µl) contained 200 nM of each PCR primer, 2.5 mM MgCl, 5 U Taq DNA polymerase (Promega, Madison WI), and 0.4 µl of a 10-mg ml1 bovine serum albumin solution (Promega, Madison WI). The amplification reactions were performed for 30 cycles consisting of 1 min at 95°C, 1 min at 60°C, and 1.5 min at 72°C, followed by extension at 72°C for 15 min. Partial ketoacyl synthase (KS) genes of type II PKS pathways were amplified from soil community DNA by using degenerate PCR primers 540F (5'-GGITGCACSTCIGGIMTSGAC-3') and 1100R (5'-CCGATSGCICCSAGIGAGTG-5') (34). The PCR mixtures (50 µl) contained 5 U of Taq DNA polymerase, 1 µM of each primer, 2.5 mM MgCl, and 0.4 µl of a 10-mg ml1 bovine serum albumin solution (Promega, Madison WI). Each PCR cycle consisted of 1 min at 95°C, 1 min at 68°C, and 1.5 min at 72°C. Clone libraries were generated by performing 40 PCR cycles, followed by 15 min of extension at 72°C. PCR products were excised from 1% agarose gels. The agarose was removed by using a QIAquick gel extraction kit (QIAGEN GmbH, Germany), and the DNA was cloned into the pCR4-TOPO vector (Invitrogen, Carlsbad CA) according to the manufacturer's recommendations. The four most diverse soil samples from Uzbekistan, as determined by TRFLP analysis of PKS gene PCR products, were chosen for cloning, and 24 colonies were picked from each transformation into a 96-well microtiter plate (to obtain a total of 96 individual clones). Plasmid DNA was purified using a QIAGEN plasmid extraction kit, and inserts were sequenced by using the M13 forward and reverse primers. PKS TRFLP patterns were generated by first amplifying 5 ng of soil DNA for 30 cycles using unlabeled primers. Five microliters of the reaction mixture was then used for another PCR performed with FAM-labeled primer 540F. Ten additional cycles were performed as described above to label PKS gene PCR products.
TRFLP and data analysis.
FAM-labeled actinomycete 16S rRNA and KS gene PCR products were diluted to obtain a concentration of 5 ng µl1. Samples that produced no PCR product visible in an ethidium bromide-stained 1% agarose gel were not included in subsequent analyses. A total of 30 ng of labeled DNA was digested with 2 U of MnlI (NEB, Beverly, MA) for 4 h at 37°C. Digests were precipitated using glycogen as a carrier (18). The DNA was suspended in 20 µl deionized formamide, denatured at 95°C, and separated with an ABI 310 automated sequencer (Applied Biosystems, Foster City, CA). The data for fragment lengths between 50 and 500 nucleotides were then exported in a tabular, binary (presence/absence) format to Matlab (Natick, MA) for analysis. A minimum of 50 fluorescence units was used, and only the peaks accounting for 95% of the area were considered. The data were then analyzed using two different algorithms, one of which binned the data into discrete nucleotide length and one of which used a search window of ±0.5 bp to compare all peaks in the data set. Binning can sometimes be problematic (16, 34) during data analysis (peaks that are only a fraction of 1 bp apart are occasionally binned differently, due to rounding artifacts) but is necessary to generate peak frequency histograms (Fig. 1). Dendrograms were generated using both binning and search window algorithms (Fig. 2). The two methods produced analogous results. Pairwise similarity values were calculated using two different methods, as previously described (33, 34). The Sorenson index was calculated as follows: Cs = 2Nab/(Na + Nb), where Nab is the number of shared peaks and Na and Nb are the numbers of peaks in individual samples (25). Alternatively, similarity values were calculated as follows: Ps = Sa/Na, where Sa is the number of peaks in sample A that are also found in sample B and Na is the total number of peaks in sample A (33). The top left and bottom right portions of similarity matrices were averaged and exported as 1 Cs or 1 Ps. Binned data were exported into MEGA (Molecular Evolutionary Genetics Analysis Software, version 2.1) (17) to generate neighbor-joining dendrograms. Similar topologies were obtained regardless of which type of similarity matrix was calculated (data not shown). A bootstrap analysis (10,000 replicates) was performed as described previously (33) to test the robustness of the dendrograms.
![]() View larger version (34K): [in a new window] |
FIG. 1. Histograms showing the frequency distributions of fragment lengths in the data set. The x axis shows the percentages of samples (TRFLP fingerprints) that contained different percentages of the fragments detected in the data set (y axis). We expected our data to exhibit one of three distributions. (A) Three models for the frequency distribution of TRFLP data. (B) Frequency distribution of all actinobacterial 16S rRNA gene TRFLP fingerprints generated in this study. (C) Frequency distribution of actinobacterial 16S rRNA gene TRFLP fingerprints obtained for samples collected in New Jersey (n = 23). (D) Frequency distribution of actinobacterial 16S rRNA gene TRFLP fingerprints obtained for samples collected in Uzbekistan (n = 61). (E) Type II PKS TRFLP fingerprints of samples collected in New Jersey (n = 16). (F) Type II PKS TRFLP fingerprints of samples collected in Uzbekistan (n = 56).
|
![]() View larger version (69K): [in a new window] |
FIG. 2. Cluster analysis of actinobacterial 16S rRNA gene and type II PKS gene TRFLP data. The dendrograms are neighbor-joining dendrograms obtained from a similarity matrix (Ps) by using a sliding search window of ±0.5 bp. A distance of 0.1 indicates a 10% difference between samples. Bootstrap values were obtained by binning the data and calculating a consensus tree using 1,000 bootstrap replicates. With the exception of the central node in panel C, only the bootstrap values that are >65% are shown. The arrows indicate bootstrap values for specific nodes.
, soil samples collected in the Greenwood Forest Wildlife Management Area;, soil samples collected in Picatinny Arsenal. (A) Actinobacterial 16S rRNA gene TRFLP data. (B) Type II PKS gene TRFLP data. (C) Actinobacterial 16S rRNA gene TRFLP data when only the peaks that were found in more 10% of all samples were considered. A bootstrap value is shown for the central node, which divides New Jersey samples from Central Asian samples, even though the value is less than 65%. (D) Actinobacterial 16S rRNA gene TRFLP data when only the peaks that were found in less than 10% of all samples were considered.
|
Nucleotide sequence accession numbers.
PKS gene sequence data obtained in this study have been deposited in the GenBank database under accession numbers EF068379 to EF068414.
|
|
|---|
A total of 355 different actinomycete terminal 16S rRNA gene fragments were detected in the TRFLP data set. Most of the peaks observed were present in only a small proportion of the soil samples, regardless of whether the data were compared at a global scale (Central Asia and North America [Fig. 1B]) or regionally (different locations in New Jersey [Fig. 1C] or in the area near Tashkent [Fig. 1D]). More than two-thirds of the TRFLP peaks (82.5, 69, and 90.3% in Fig. 1B, C, and D, respectively) were observed in less than 20% of all actinomycete profiles. More than one-quarter of the peaks (40, 25, and 43% in Fig. 1B, C, and D, respectively) were observed in <5% of the samples. Sixty-four peaks (18%) were obtained with only one soil sample, while 112 peaks (32%) were obtained with less than three soil samples. Only two peaks were obtained with >50% of the samples (Fig. 1B) (fragment lengths, 96 and 216 bp). The 96-bp fragment was present in nearly all fingerprints. To obtain more information about these two highly cosmopolitan peaks, the TRFLP size and priming site data were used to screen sequences found in the Ribosomal Database Project (RDP) database (http://rdp.cme.msu.edu/). For this purpose all 49,249 16S rRNA sequences in the RDP database at the time of the analysis (sequences from isolates for which good-quality sequences more than 1,200 nucleotides long were available) were downloaded. All sequences that matched our forward primer were identified, which yielded a total of 2,857 16S rRNA gene sequences. Of these, only one sequence produced the 96-bp fragment in silico (Streptomyces sp. strain 1A01503; accession no. EF056489). The 216-bp fragment was produced in silico by eight sequences in the database, which originated from Cryptosporangium sp. (accession no. AB006166 and AB006168), Streptomyces lividans (accession no. X95968, X95969, and X86354), Streptomyces microstreptospora (accession no. AB006159), Streptomyces caelestis (accession no. AJ508062), and Streptomyces sp. strain EN9 (accession no. AY148087). These data indicate that the microorganisms that produce the 96- and 216-bp peaks are not frequently observed in culture or in 16S rRNA gene libraries of actinomycetes, despite their wide geographic distribution.
TRFLP analysis of PKS PCR products (Fig. 1E and F) yielded results similar to the results obtained for the actinomycete 16S rRNA genes. More than 80% of the 191 discernible terminal restriction fragments (TRFs) were detected in less than 20% of the soil samples (84 and 91% in Fig. 1E, and F, respectively). More than 31% of the TRFs (60 peaks) were found in only a single soil sample. These data are consistent with model 1 (Fig. 1A), suggesting that cosmopolitanism of actinomycetes and their secondary metabolite genes may be an exception and not the rule. Most TRFs appeared infrequently, and the distribution of actinomycetes appears to be highly variable.
It has been postulated that bacterial phylogeny should correlate with bacterial biogeography, if species are endemic (29). No geographic clustering of closely related species would be expected for taxa that are cosmopolitan (29). By the same token, it would be expected that the distribution of peaks in TRFLP fingerprints would reflect the geographic origin of samples if bacteria exhibit regional ranges and/or endemism. Geographic clustering of TRFLP fingerprints should not occur if the bacteria examined are cosmopolitan. We compared fingerprint profiles by cluster analysis to determine if geographic clustering of actinomycete communities occurred (Fig. 2). The analysis revealed a strong tendency of actinomycete TRFLP patterns to cluster by country of origin (Fig. 2A). This division was retained whether the data were analyzed by binning or by using a search window approach (see Materials and Methods). Similar dendrograms were obtained using the unweighted-pair group method using average linkage (data not shown) and the neighbor-joining method. A bootstrap analysis was performed to confirm the significance of the branching pattern observed. Only three significant nodes (bootstrap values, >65%) were observed (Fig. 2A). However, the node that indicates a division between North American and Central Asian samples was among the significant branch points of the dendrogram (bootstrap value, 73%). The same analysis revealed no significant clustering of fingerprints generated from samples collected in different parts of New Jersey (northern New Jersey versus southern New Jersey). Weaker regional clustering of PKS TRFLP patterns (Fig. 2B) was observed but was not empirically supported by bootstrap analysis. For more in-depth analysis, we divided the actinomycete 16S rRNA gene TRFLP data into the fragments sizes found frequently in the fingerprints (>10% of all fingerprints) and the fragment sizes found infrequently (<10% of all samples). This allowed us to determine whether the branching patterns shown in Fig. 2A and B were due to differential distribution of abundant or rare peaks in the data set. Each subset of the data was analyzed independently, as indicated above. The analysis of frequently observed peaks produced a dendrogram that was almost identical to the dendrogram generated by considering the complete data set (Fig. 2C), albeit with slightly weaker bootstrap support. On the other hand, regional clustering was not observed for peaks found infrequently in the data set (74% of all fragments) (Fig. 2D). These data indicate that the distribution of actinomycete phylotypes (TRFs) has a regional imprint, supporting the idea that some actinomycete species can have restricted distributions. However, only 9 of the 91 fragment sizes that were detected in more than 10% of all samples appeared to be present in only one country (all 9 were detected in Uzbekistan). Similar results were obtained when the PKS fingerprints were considered. Of 30 fragments that were detectedin more than 10% of all samples, only 2 were present only in New Jersey (none was present only in Uzbekistan). These data suggest that there is significant overlap between the widely distributed actinomycete populations and PKS genes found in Central Asia and New Jersey.
Four Uzbek soil samples were selected based on the number of peaks in their PKS fingerprints (the two samples with the highest total number of peaks and the two samples with the highest number of unique peaks). The TRFLP fingerprints for these four samples contained 54 fragments that were different sizes, accounting for 50.4% of the fragments found in the Uzbek PKS data set. Community DNA was amplified using PKS gene PCR primers, and the PCR products were TA cloned. Twenty-four of the clones from each collection site (a total of 96 clones) were sequenced. All sequences exhibiting more than 95% sequence identity were grouped into clades. Sequence analysis revealed 37 novel sequence clades, all of which exhibited the highest level of similarity to type II KS genes found in the database (Fig. 3). Only one clade (Uzbekistan clade 19) (Fig. 3) matched PKS gene sequences that were obtained from New Jersey soil samples (New Jersey clade 6) (Fig. 3) in our previous study (34). All of the remaining sequences were unique to Uzbekistan and not similar to sequences in the GenBank database. The DNA sequences of cloned PKS genes were between 68 and 92% identical to sequences for the PKS pathways found in public databases. These findings indicate that the majority of type II PKS gene sequences occur infrequently in the environment but that some sequences are present in both Asia and North America, indicating that they have a cosmopolitan distribution.
![]() View larger version (26K): [in a new window] |
FIG. 3. Neighbor-joining dendrogram of DNA sequences showing relationships of sequenced PKS gene fragments from Uzbek soils. The four most diverse PKS PCR products (as determined by TRFLP) were cloned, 96 white colonies (24 colonies from each reaction) were picked, and their inserts were sequenced. All sequences exhibiting more than 95% sequence identity were grouped into clades and numbered. The most closely related sequences found in the GenBank database, as well as sequences that were recovered from the New Jersey soil samples (34), are also included. The numbers indicate bootstrap values calculated using 10,000 bootstrap replicates. Only bootstrap values greater than 65% are shown. Each clade consists of sequences that are >95% identical. Open hexagons, sequences recovered from Uzbek soil samples; shaded boxes, samples recovered from New Jersey soil samples.
|
|
|
|---|
Our findings imply that actinomycete distributions are exceedingly patchy and/or that the actinomycetes are highly endemic. "Patchy" refers to a pattern of distribution in which species are found over a large geographic range but individuals are concentrated (i.e., occur in groups). Many individuals of a species are present at specific locations, while a large proportion of the range of the species is not populated at any one time. Patchiness leads to a pattern in which there is a high degree of differentiation in the actinomycete communities in soil samples obtained at locations that are relatively close to each other (meters to kilometers). Highly diverse and patchily distributed actinomycetes could thus yield frequency distribution curves similar to those shown in Fig. 1. The results of studies of the distribution of other microbes in the environment support the notion that bacterial distributions can be highly patchy, particularly in soil and sediment. For example, in a study of sulfate-reducing sediment bacteria collected on four continents, a patchy distribution that had evolved in a homogeneous background was proposed as one mechanism to explain the geographic structure observed for biodegradative populations (24). Similarly, in a study of the spatial distribution of crenarchaeal assemblages in mesophilic soil habitats it was found that different phylotypes dominated patches of soil in a uniform agricultural field, producing a mosaic distribution of different crenarchaeal phylotypes (27). When the biogeography of the purple nonsulfur bacterium Rhodopseudomonas palustris was investigated, it was observed that individual genotypes of R. palustris were detected only locally and exhibited a lognormal distribution along a 10-m transect (21). The workers concluded that R. palustris in freshwater marsh sediment is composed of locally distinct ecotypes with highly patchy distributions. In a subsequent study researchers found that the distribution of R. palustris assemblages can be patchy even at the 1-m scale (3). However, patchiness alone cannot entirely account for the distribution of actinomycetes observed in this study. Patchy distributions do not preclude species from having global and cosmopolitan ranges; i.e., everything could still be everywhere. If truly random patchiness were the only factor contributing to a high level of variability in actinomycete communities, then no significant clustering of samples by geographic origin should occur (Fig. 2A). The clustering of TRFLP community profiles by continent implies that certain actinomycete species may be found only in certain regions of the world (endemism). The hypothesis that actinomycetes can be endemic is further supported by the observation that almost one-fifth of all peaks in our data set were detected in only one sample.
The mechanisms leading to bacterial strain differentiation and strain endemism in the environment have not been established, but several hypotheses have been put forth. Processes such as dispersal, physical isolation, and genetic drift (similar to the theory of island biogeography) have been hypothesized for the differentiation of local microbial populations (11, 22). Other authors have suggested that competitive interactions and microhabitat variability might contribute to the high degree of genotype endemism that can be detected in microbial populations (3). The degree to which these factors contribute to the distribution of actinomycetes in the environment is difficult to establish using current methods, even if a relatively large number of soil samples is analyzed (91 samples were analyzed in our study). Analysis of infrequently observed peaks (peaks found in less than 10% of samples) revealed no significant regional signal (Fig. 2D), suggesting that sampling density or the resolution of the method was not sufficient to study the distribution of the infrequently observed actinomycete phylotypes and shed light on the processes structuring the community.
In conclusion, the data shown here suggest that only minor components of actinomycete populations and secondary metabolite genes in the environment are cosmopolitan. Only a single actinomycete 16S rRNA terminal fragment was found in virtually all samples that were analyzed. Likewise, only a single clade of 36 type II PKS genes that were detected in Uzbek soil was also detected in New Jersey soils. Based on these data, we propose that distinct populations of actinomycetes (and perhaps bacteria as a whole) can be found in soils collected from different regions of the world. Therefore, bioprospecting that targets actinomycetes and their PKS pathways would benefit from sampling soils from a wide range of geographic locations.
We gratefully acknowledge David Zaurov for his assistance in Uzbekistan with travel arrangements and for collection of field samples and Lora McGuinness for assistance with the TRFLP analysis.
Published ahead of print on 2 March 2007. ![]()
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»