Previous Article | Next Article ![]()
Applied and Environmental Microbiology, May 2005, p. 2232-2238, Vol. 71, No. 5
0099-2240/05/$08.00+0 doi:10.1128/AEM.71.5.2232-2238.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Biotechnology Center for Agriculture and the Environment,1 Institute of Marine and Coastal Sciences,2 Department of Environmental Sciences,3 Department of Microbiology and Biochemistry, Rutgers University, Cook College, New Brunswick, New Jersey4
Received 2 September 2004/ Accepted 30 November 2004
|
|
|---|
(ketosynthase) genes. Twenty-one soil samples were collected from diverse sources in New Jersey, and their bacterial communities were compared by terminal restriction fragment length polymorphism (TRFLP) analysis of PCR products generated using bacterial 16S rRNA gene primers (27F and 1525R) as well as an actinomycete-specific forward primer. The distribution of actinomycetes was highly variable but correlated with the overall bacterial species composition as determined by TRFLP. Two samples were identified to contain a particularly rich and unique actinomycete community based on their TRFLP patterns. The same samples also contained the greatest diversity of KS
genes as determined by TRFLP analysis of KS
PCR products. KS
PCR products from these and three additional samples with interesting TRFLP pattern were cloned, and seven novel clades of KS
genes were identified. Greatest sequence diversity was observed in a sample containing a moderate number of peaks in its KS
TRFLP. The nucleotide sequences were between 74 and 81% identical to known sequences in GenBank. One cluster of sequences was most similar to the KS
involved in ardacin (glycopeptide antibiotic) production by Kibdelosporangium aridum. The remaining sequences showed greatest similarity to the KS
genes in pathways producing the angucycline-derived antibiotics simocyclinone, pradimicin, and jasomycin. |
|
|---|
A large number of biologically active molecules are synthesized by polyketide synthase (PKS) pathways. Polyketides are structurally diverse secondary metabolites that have already found widespread application as pharmaceuticals (20), in particular as antibiotics. Industrially important polyketides include rapamycin (immunosuppressant), erythromycin (antibiotic), lovostatin (anticholesterol drug), and epothilone B (anticancer drug). Molecules are synthesized from simple two-, three-, and four-carbon building blocks such as acetyl-coenzyme A, propionyl-coenzyme A, and butyryl-coenzyme A (14) by means of a stepwise, decarboxylative condensation pathway analogous to fatty acid synthesis. Unlike in fatty acid synthesis, however, polyketide maturation is not limited to a rigid sequence of ketoreduction, dehydration, and enoyl reduction (14). Instead, modification may include some, all, or none of these steps as well as other modifications such as cyclization and aromatization (among others). Because different starter and chain elongation units can be used and multiple stereo-centers are formed during synthesis, a tremendous structural diversity of PKS products can be achieved.
Several classes of PKS genes are commonly recognized (12-14, 31). Type I PKS, also known as modular PKS, are large, multidomain enzymes which carry a series of functional sites for stepwise polyketide synthesis. Type II PKS are composed of three or more mono- or bi-functional enzymes, which act in an iterative manner during synthesis. A core of three enzymes referred to as the minimal polyketide synthase is shared by all type II pathways: a ketoacyl-synthase (KS
), a chain length factor (KSß) and an acyl-carrier protein. This study focuses on the actinomycete type II minimal PKS.
Many actinomycetes contain PKS gene clusters and this group of bacteria has long been recognized as an important source of bioactive molecules. Screening of actinomycete cultures has for decades yielded novel industrially important products and pharmaceuticals. However, the rate of discovery has not kept pace with resource expenditure and advances in technology and has even declined in recent years (19, 34). This unfortunate circumstance arises because previously described strains and molecules are being rediscovered and redescribed (19, 33). Better methods are clearly needed for the more efficient identification and screening of actinomycete cultures and soils containing potentially interesting secondary metabolites. Most importantly, it is hoped that culture-independent methods may prove worthwhile. Such methods may not only prevent reinvestigation of previously described bioactive molecules, but also alleviate some of the biases introduced by the use of standard cultivation techniques.
In this context Metsä-Ketelä (20) obtained the partial sequences of ketosynthase genes from six known and 29 unidentified actinomycete soil isolates using degenerate PCR primers. Pigment- and antibiotic-producing PKS gene clusters could be clearly distinguished based on their KS
sequences, indicating that functional information may be derived from PKS sequence information. A later study (19) found that the phylogenies of 16S rRNA and PKS genes in actinomycete soil isolates were not congruent, indicating that the phylogenetic grouping of actinomycetes is an inadequate predictor for the type of secondary metabolites these strains produce.
The purpose of this study was to develop a culture-independent method to identify genes encoding novel and unique PKS pathways. Specifically, we hypothesized that soil containing a large diversity of actinomycetes, as indicated by terminal restriction fragment length polymorphism (TRFLP), also contains a large diversity of PKS gene sequences. In addition, we hypothesized that soil with unique TRFLP patterns for both the actinomycete 16S rRNA and PKS genes would yield particularly novel PKS gene sequences.
|
|
|---|
The 3S samples (n = 6) were taken near Pakim Pond in Lebanon State Forest (N.J. Ocean County). Samples were collected roughly 10 m apart. The first sample was taken from Pakim Pond sediment. The remaining samples were collected at progressively greater distance from the pond. Soil was mostly composed of leaf litter and roots, with virtually no mineral (sand or clay) content. The 4S samples (n = 5) were collected from several sites in the Picatinny Arsenal (Morris County). Samples were frozen at 80°C upon return to the laboratory (ca. 2 h) and thawed for DNA extraction. Sample labels throughout the text are shown as NSX, where NS indicates the sampling collection site (1S, 2S, 3S, and 4S) and X indicates the sample number.
DNA extraction and PCR.
DNA was extracted using the Ultraclean Soil DNA kit (MO BIO Laboratories, Carlsbad, CA). This method was first compared to several other extraction protocols. The MO BIO kit produced better DNA yields and copurified less humic contaminants than a freeze-thaw approach, an SDS boiling lysis method, and the DNeasy tissue kit (QIAGEN, Valencia CA) (data not shown). Soil samples (0.2 g) were extracted as recommended by the manufacturer. DNA was analyzed on a 1% agarose gel and quantified using a Kodak EDAS 290 Gel Imaging system by comparing band intensities to known quantities of lambda HindIII markers. Five ng of DNA was then added to 50-µl PCRs.
The total bacterial community was analyzed using the following 16S rRNA gene PCR primers: 27F, 5'-AGAGTTTGATCMTGGCTCAG-3' and 1525R, 5'-AAGGAGGTGATCCAGCC-3'. The 27F primer was carboxyfluorescein (FAM) labeled for TRFLP analysis. The actinomycete community was analyzed using a FAM-labeled specific 16S rRNA gene forward primer, 243F (9): 5'-GGATGAGCCCGCGGCCTA-3'. Neither 1525R nor 1492R were suitable as reverse primers for TRFLP analysis, since multiple and unspecific banding was observed in combination with 243F. 1401R (5'-CGGTGTGTACAAGACCC-3') was instead utilized, since it produced no unspecific amplification. PCRs contained 200 nM of each PCR primer, 2.5 mM MgCl, 5 U Taq DNA polymerase (Promega, Madison WI), and 0.4 µl 10 mg ml1 acetylated bovine serum albumin (Promega). PCR conditions were as follows: (for 27F/1525R) 5 min at 95°C followed by 30 cycles of 1 min at 95°C, 1 min at 55°C, and 1.5 min at 72°C. The annealing temperature for the 243F/1401R primer pair was optimized to 60°C (the highest temperature where no significant loss of PCR product formation was observed in an ethidium bromide stained gel), while the remaining parameters remained identical. A 15-min extension step at 72°C was applied at the end of the PCR program.
PKS cloning and library generation.
The sequences of 69 actinomycete type II KS
genes were obtained from GenBank and aligned. Conserved regions were located and primers designed for a 554 bp fragment located in the 5' portion of the gene. The primer sequences are 540F, 5'-GGITGCACSTCIGGIMTSGAC-3', and 1100R, 5'-CCGATSGCICCSAGIGAGTG-3'. PCRs contained 5 ng DNA template, 1 µM primer each, 2.5 mM MgCl, 5 U Taq DNA polymerase (Promega) and 0.4 µl of 10 mg ml1 BSA. PCR conditions were as follows: 5 min at 95°C followed by 40 cycles of 1 min at 95°C, 1 min at 64°C and 1.5 min at 72°C followed by a 15 min extension step at 72°C. PCR products were gel purified using a gel extraction kit (QIAGEN) and cloned into pCR4-TOPO vector using a TOPO-TA cloning kit (Invitrogen, Carlsbad CA). Forty white colonies from each transformation were grown in LB medium containing 50 µg ml1 kanamycin and ampicillin and plasmids extracted using QIAprep spin miniprep columns (QIAGEN). Plasmids were screened for inserts by digestion with the restriction enzyme EcoRI and insert containing plasmids were sequenced. To obtain TRFLP patterns of the KS
PCR products, 5 ng of template DNA were first amplified for 30 cycles as indicated above using unlabeled primers. Five µl of this reaction was then transferred to a new PCR containing the FAM-labeled forward (540F) primer and amplified for an additional 10 cycles.
TRFLP and data analysis.
FAM-labeled PCR products were diluted to 5 ng µl1 and 30 ng were independently digested for 4 h using 2 units of MnlI or AluI (NEB, Beverly, MA). Digested DNA was precipitated with glycogen as a carrier, resuspended in 20 µl of deionized formamide and denatured at 95°C for 5 min. Samples were then analyzed using an ABI 310 automated sequencer (Applied Biosystems, Foster City, CA) to obtain the TRFLP patterns. For the soil samples in this study (n = 23) this procedure generated 16S TRFLP patterns with average maximum peak heights of 4.36 x 103 fluorescence units (FU) and average cumulative peak areas of 3.92 x 105 FU2.
TRFLP data were exported to Matlab (Natick, MA) and analyzed by considering the presence/absence of peaks only. Data were not binned, because two peaks that are only a fraction of a base pair apart in two different TRFLP patterns, can sometimes be binned into different nucleotide length, even though they may represent the same organism (e.g., 120.48 is rounded to 120 and 120.52 is rounded to 121). This can bias and skew similarities calculated from TRFLP data (15). Instead, a search window of ± 0.5 bp was applied for analysis and each peak was individually compared to all peaks of all other TRFLP pattern in the data set. The minimum peak height was set at 3% of the highest peak in each individual TRFLP. In addition, only those peaks, which cumulatively accounted for < 97% of the total peak area were considered. Pairwise similarities were calculated as previously described (28) using the Sorenson index: Cs = 2Nab/(Na + Nb), where Nab = number of shared peaks and Na and Nb are the number of peaks in each sample (18, 21). Distance data for both restriction enzymes was averaged. This should help reduce biases introduced by particular enzymes (uneven distribution of cut sites). Data were then exported as 1 Cs into MEGA (Molecular Evolutionary Genetics Analysis Software, version 2.1) (16) to generate neighbor-joining dendrograms.
Phylogenetic analysis.
DNA sequences obtained from the PCR product cloning experiments were trimmed by removing all vector and primer sequences from the ends. They were then aligned with their closest match in GenBank (obtained by translated and untranslated BLAST searches) as well as sequences used to design the PCR primers. Neighbor-Joining trees were generated using MEGA by using the Tajima-Nei distance model. One thousand bootstrap replicates were performed and consensus trees are shown.
Nucleotide accession numbers.
DNA sequences obtained in this study were deposited in GenBank under the accession numbers AY684435 to AY684585.
|
|
|---|
![]() View larger version (12K): [in a new window] |
FIG. 1. TRFLP dendrograms. Labels indicate the identity of soil samples (NSX) where X is the sample number and NS (1S, 2S, 3S, and 4S) indicates the sampling collection site (see Materials and Methods). (A) Bacterial 16S rRNA genes (27F/1525R). (B) Actinomycete 16S rRNA genes (243F/1401R). A distance of 10% (scale bar) indicates a dissimilarity of 10% between two samples as calculated by the Sorenson index.
|
Table 1 shows the total number of peaks observed in the actinomycete-specific 16S TRFLP as well as the number of unique peaks (not observed in any other TRFLP in the data set). A significantly larger number (Student's t test: P < 0.001) of peaks was observed in the TRFLP of samples from collection 2S (average = 47) than in the remaining samples (average = 17), indicating that these samples contained particularly rich actinomycete communities. Samples 2S4 and 2S6 produced no amplification product using our KS
primers, eliminating them from further analysis. The TRFLP of sample 2S2 contained a small number of peaks (n = 21) similar to samples from other collections (S1, S3, and S4). Sample 2S3 only contained one unique peak. Samples 2S1 and 2S5 were thus selected for PCR clone library generation and further analysis because of the high level of diversity observed.
|
View this table: [in a new window] |
TABLE 1. Actinomycete and KS TRFLP dataa
|
PCR product. These samples amplified well using 16S rRNA gene primers, suggesting that PCR inhibitor was not preventing amplification of KS
sequences. Two interpretations of these data are possible. Either no PKS gene sequences were present or our PCR primers did not match the ones that were. For the remaining samples, however, a relatively good correlation between the number of peaks in the actinomycete and KS
TRFLPs was observed (R2 = 0.71). Samples 2S1 and 2S5, which contained the most diverse actinomycete community, contained the largest number of total and unique peaks in their PKS TRFLP patterns. 2S1 and 2S5 were thus targeted for further analysis and PCR library generation. Three additional samples, one from each of the remaining three collections (1S, 3S, and 4S), were also chosen. 3S6 and 4S4 contained the largest number of total and unique peaks among samples from their collections (3S and 4S). Sample 1S2 was chosen at random among the three samples from collection 1S.
Despite the large number of peaks in the actinomycete and KS
TRFLP patterns of samples 2S1 and 2S5, only limited sequence diversity was detected in the KS
PCR clone libraries generated from both samples (Fig. 2). All sequences originating from 2S1 and 2S5 formed tightly linked monophyletic clades. With the exception of clone 2S1-1' all sequences from sample 2S1 were > 90% identical, with the majority of percent identities ranging between 97% and 99%. Only one pair of clones contained identical sequences (2S1-3' and 2S1-16'). Percent identity of sequences obtained from sample 2S5 ranged between 88 and 99% (the majority was >95%). PCR and sequencing error are likely to have accounted for less than 1% of the sequence heterogeneity we observed, suggesting that there was considerable microdiversity within the PKS bearing actinomycete population in these samples.
![]() View larger version (22K): [in a new window] |
FIG. 2. Phylogenetic analysis of DNA sequences obtained by cloning of KS PCR products from samples 2S1 and 2S5, which were chosen because they contained a large number of peaks in their actinomycete 16S TRFLPs. Species names indicate sequences obtained from GenBank. A subset of GenBank sequences used to design the PCR primer set as well as GenBank sequences most similar to cloned sequences (Actinomadura hibisca for 2S5 and Streptomyces antibioticus for 2S1 clones) was used for analysis. Numbers indicate bootstrap percentages. Only bootstraps >65% are shown. A star (*) after a sequence identifier indicates that this sequence and its identity to other sequences are shown in Table 2.
|
The DNA sequences within the 2S1 and 2S5 groups of sequences are highly similar to each other, yet they are no more than 75% and 80% identical (Table 2) to sequences found in GenBank (as of July 2004). Sequences from sample 2S1 are most similar to the simocyclinone D8 biosynthetic gene cluster of Streptomyces antibioticus Tü6040 (GenBank accession no. AF324838). Simocyclinone D8 is composed of four structural elements, an angucyclinone (polyketide) core, a deoxysugar, an octatetraene dicarboxylate, and aminocoumarin moiety (10). The molecule has been shown to have antibiotic activity against gram-positive bacteria and to exhibit cytostatic activity against human tumor cells (29, 36).
|
View this table: [in a new window] |
TABLE 2. Percent identity of KS PCR product nucleotide sequences and best GenBank BlastN search resulta
|
]naphthacenequinone aglycone core as well as a D-alanine and sugar moieties (11). These data suggest that sequences recovered from samples 2S1 and 2S5 likely originated from PKS pathways involved in antibiotic synthesis. The products of the 2S1 and 2S5 pathways can however not be predicted using the data at hand. Simocyclinone and pradimicin are greatly dissimilar molecules and their KS
genes in the region of our PCR product are about as similar to one another (ca. 76% identical) as they are similar to the 2S1 and 2S5 clusters (Table 2), suggesting that products of the 2S1 and 2S5 pathways will be chemically unique and quite different from simocyclinone and pradimicin. It is troubling that in silico digestion of sequences obtained from sample 2S1 revealed only 3 (out of 33, Table 1) fragments corresponding to peaks detected by TRFLP analysis of PCR products. This suggests that sequencing 40 clones at random vastly under-samples clone libraries generated from PCR products composed of a large number of different sequences. How to overcome this cloning bias, which has also been encountered in other studies (4, 22), is unfortunately not known. Sequencing a large enough number of clones to obtain all PCR product components may not be feasible and screening clones by other methods for selective sequencing may not be viable as sequencing cost likely approximates the time and effort spent in screening. Better methods are clearly needed for cloning and/or identifying rare PCR product components.
Five additional clusters of sequences were recovered from samples 1S2, 3S6 and 4S4, which were chosen based on their KS
TRFLP patterns (Fig. 3). Sequences from samples 3S6 and 4S4 formed two nonexclusive clades most similar to the KS
of the pradimicin biosynthetic gene cluster of Actinomadura hibisca (ca. 79 to 80% identical, Table 2). However, the 3S6 and 4S4 sequence clusters (Fig. 3) are only 86 to 96% and 79 to 90%, respectively, identical to sequences from the 2S5 clade (Table 2) and there was good bootstrap support to indicate that these sequences are phylogenetically distinct from the sequences of the 2S5 cluster (Fig. 3).
![]() View larger version (21K): [in a new window] |
FIG. 3. Phylogenetic analysis of DNA sequences obtained by cloning of KS PCR products from samples 1S2, 3S6, and 4S4, which were chosen because they contained a large number of peaks in their KS TRFLPs. Only the most divergent subset (n = 35) of the sequences we obtained (n = 97) for these three samples are shown. Also shown is a subset of sequences from the 2S1 and 2S5 samples. Species names indicate sequences obtained from GenBank. A subset of GenBank sequences used to design the PCR primer set as well as GenBank sequences most similar to cloned sequences (Actinomadura hibisca for 3S6 and 4S4, Streptomyces venezuelae for 1S2 A, and Kibdelosporangium ardium for 1S2 B and C clones) was used for analysis. Numbers indicate bootstrap percentages. Only bootstraps >65% are shown. A star (*) after a sequence identifier indicates that this sequence and its % identity to other sequences are shown in Table 2.
|
sequences was recovered from sample 1S2, a sample with a moderate number of peaks in its KS
TRFLP pattern (Fig. 1). One sequence originating from 2S1 (2S1-11 A, Fig. 3) was also most similar to the KS from Actinomadura hibisca (79% identical, Table 2), but was less than 78% identical to any other sequence in our data set indicating its phylogenetic distinctness. The remaining 1S2 sequences formed two clades (1S2 B and 1S2 C, Fig. 3) most closely associated with the KS
of the jadomycin and ardicin pathways in Streptomyces venezuelae (GenBank accession no. AF126429) and Kibdelosporangium aridum, respectively (GenBank accession no. L24518.1). Jadomycin B is a angucycline-derived benzoxazolophenanthridine antibiotic containing a L-digitoxose moiety (5, 24). Jadomycins have a wide spectrum of biological activity and are effective against gram-positive and gram-negative bacteria and even yeasts. Ardacin is a glycopeptide antibiotic shown to be effective against gram-positive bacteria (23, 41). As with the 2S1 and 2S5 clades of sequences, their percent identity to known KS
genes was less than 80%, indicating that the organisms containing these genes have yet to be isolated or have not been analyzed for their PKS gene pathways. The emergence of antibiotic-resistant pathogenic bacteria over the past decades has become an important public health threat, and options for confident chemotherapy in the clinical setting are becoming limited (1, 38). The spread of drug-resistant bacteria has been exacerbated by inappropriate antibiotic use such as improper prescription, inconsistent patient use and feed amendment in domestic livestock (3, 8, 35, 39). To compound the problem, the rate of discovery of new antibiotics is slowing (26) as traditional methods for antibiotics discovery rely on standard isolation techniques and are thus limited to the small fraction of bacteria that can be cultured on standard media. Novel means for the identification and isolation of antibiotic producing microbes are clearly needed. Culture-independent methods will allow direct access to the genetic material, i.e., the metabolic capabilities of hitherto uncultured microbes.
The purpose of this study was to develop a method for identifying soil samples most likely to contain novel and unique PKS pathways and to ascertain whether KS
gene diversity is reflected by actinomycete 16S rRNA gene diversity. We made the assumption that it is most sensible to select for soils containing dissimilar bacterial communities. This should allow a greater diversity of bacteria to be screened with fewer samples. We hypothesized that samples containing more diverse and unique actinomycete communities are most likely to contain novel and unique PKS genes. Our data provides only partial support for this hypothesis. Among the samples that were amplified using our KS
primers, the two samples with the richest and most unique actinomycete community in our sample set (2S1 and 2S5) also contained the largest diversity of KS
sequences as indicated by TRFLP.
Statistical support for the linkage between actinomycete species and KS
sequence diversity based on TRFLP data was weak, mainly because eight samples produced no KS
PCR product using the primers developed in this study. Also, the largest sequence diversity obtained by cloning PCR products came from a sample with only an average number of peaks in it's KS
TRFLP. Overall, KS
sequences obtained in this study belong to seven novel clades, none of which were >81% identical at the DNA level to sequences already in GenBank (as of August 2004). In particular there are six clades with high similarity to the KS
genes in pathways for the production of angucycline-derived antibiotics. These data indicate that group-specific TRFLP analysis of bacterial communities and functional genes can be a practical tool for prescreening environmental samples for diverse and unique gene sequences, but cannot be used to predict which sample will yield the most diverse sequence data in cloning experiments. Prescreening methods, such as the one described here, would be of very large benefit to projects aiming to identify novel bioactive molecules, because they increase throughput and efficiency, reduce cost, and reduce the probability of reinvestigating previously described targets.
We thank Laurie Seliger for DNA sequencing assistance.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»