Previous Article | Next Article ![]()
Applied and Environmental Microbiology, April 2005, p. 2026-2035, Vol. 71, No. 4
0099-2240/05/$08.00+0 doi:10.1128/AEM.71.4.2026-2035.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Botanisches Institut, Universität zu Köln, Cologne, Germany
Received 26 August 2004/ Accepted 10 November 2004
|
|
|---|
|
|
|---|
In addition to this phylogenetic assignment method for the analysis of tRFLP profiles of the 16S rRNA gene, the present work describes, for the first time, the development of a similar tool for N2 fixation and denitrification. For N2 fixation, nifH coding for dinitrogenase reductase was chosen because this gene is highly conserved among microorganisms (22) and because some 2,000 sequences are deposited in GenBank. For denitrification, nosZ, encoding nitrous oxide reductase, with only about 180 entries in GenBank, was selected. A further disadvantage of this gene is that it does not occur in denitrifying bacteria which do not reduce N2O. However, this gene had to be chosen because all of the other steps of denitrification (reduction of nitrate to nitrite, nitrite to nitric oxide, and nitric oxide to nitrous oxide) are catalyzed by at least two different enzymes in each case. Moreover, the sequence information for all these genes is even scarcer than that available for nitrous oxide reductase.
To demonstrate the potential of the present approach, the bacterial population of a forest soil from the vicinity of Cologne (Germany) was analyzed for its content with respect to the 16S rRNA, nifH, and nosZ genes. For this goal, a soil was selected with a high carbon/nitrogen ratio, which led us to assume that both denitrifying and N2-fixing bacteria occurred there simultaneously and with a high abundance. DNA was extracted from samples of the soil, and segments of the three genes were amplified by PCR and subjected to tRFLP analysis. Additionally, PCR products generated from the DNA of the forest soil were cloned and sequenced. The species list obtained by the TReFID software was compared with the sequence data from the clone libraries. Data were also evaluated by various control procedures which will be shown in detail. It is suggested that tRFLP analysis employing multiple restriction enzymes will become a useful tool to analyze complex microbial communities and one which will improve as more sequence information for the different enzymes becomes available.
|
|
|---|
Triplicate soil samples were taken from two 4-m2 patches in October 2002. Soil cores from the upper 20 cm, excluding nondecomposed litter, were homogenized. Total DNA was then extracted using an UltraClean Soil DNA kit (MoBio, Solana Beach, Calif.). The DNA preparation obtained was used as a template for amplifying the 16S rRNA, nifH, and nosZ genes by PCR. For the 16S rRNA gene, the primers 63F (10) and 778R (AGG GTA TCT AAT CCT GTT TGC) were routinely used for tRFLP analysis. This new primer, 778R, was tested with both laboratory cultures and clone libraries and provided amplicons from a wide range of organisms (C. Rösch, diploma thesis). To construct clone libraries, the additional primer combinations 27F-1495R (20) and 63F-1387R (10) were employed. Segments of nifH for tRFLP were amplified with the following primers (wobbling bases are underlined): nifHF (AAA GGY GGW ATC GGY AAR TCC ACC AC) and nifHRb (TGS GCY TTG TCY TCR CGG ATB GGC AT). For the clone libraries, the alternative reverse primer nifHR (ATG ATG GCS ATG TAY GCS GCS AAC AA) or nifHRc (TGG GCY TTG TTY TCR CGG ATY GGC AT) was used. The nosZ segments were obtained using nosZFb (AAC GCC TAY ACS ACS CTG TTC) and nosZRb (TCC ATG TGC AGN GCR TGG CAG AA). The choice of the primers was as described in reference 16; however, some minor modifications were made for improvements. PCR products of nifH and nosZ were about 400 and 700 bp in length, respectively. Hot start PCRs in a 25-µl volume were performed using a MasterTaq kit (Eppendorf, Hamburg, Germany) followed by touch-down time programs of 40 cycles in a Personal Cycler (Biometra, Göttingen, Germany). Annealing temperatures decreased stepwise from 66 to 56°C for the 16S rRNA gene amplifications and from 65 to 50°C in the case of nifH and nosZ.
To construct a clone library, PCR products were purified with a MinElute gel extraction kit (QIAGEN, Hilden, Germany), cloned using a pGEMT Easy Vector system (Promega, Mannheim, Germany), and sequenced with a BigDye Terminator cycle sequencing kit version 1.1 (Applied Biosystems, Weiterstadt, Germany) and an ABI 3100 automatic sequencer (Applied Biosystems). Raw sequences were processed in BioEdit 5.0.9 (5) and verified by BlastN (1), ClustalX alignments (18), and ChimeraCheck (8). The phylogenetic affiliation of the novel clones was deduced by means of the RDP II Phylip Interface (http://rdp.cme.msu.edu/cgis/phylip.cgi). Additionally, neighbor-joining phylogenies of 100 replicate trees were constructed with ClustalX and visualized with TreeView 1.6.1 (14). Gap positions were excluded from the analysis, but corrections for multiple substitutions were applied. Corrected sequences were deposited in GenBank (www.ncbi.nlm.nih.gov [accession no. AY723961 to AY724250]).
For tRFLP analyses, 5' fluorochrome-labeled PCR primers were used: 63F-6-carboxyfluorescein or 63F-6-carboxy-4,5-dichloro-2',7'-dimethoxyfluorescein (JOE) for the 16S rRNA gene and nifHF-6-carboxyfluorescein, or nosZR-6-carboxytetramethylrhodamine (MWG Biotech, Ebersberg, Germany). Purified PCR products from a reaction with nonlabeled primers were reamplified in a second PCR with labeled primers up to a total volume of 1,000 µl of PCR product. Products of this second reaction were purified (QiaQuick gel extraction kit [QIAGEN] or Ultrafree-MC columns [Millipore, Bedford, Mass.]), which also showed that PCR products had been obtained in sufficient quantities. The preparation was then partitioned into up to 13 aliquots and digested overnight at 37°C in a 100-µl volume by separately using one of the following restriction endonucleases (MBI Fermentas, Leon-Rot, Germany) per tube: AluI (AG/CT), Bme1390I (CC/NGG), Bsh1236I (CG/CG), Cfr13I (G/GNCC), HaeIII (GG/CC), Hin6I (C/GCG), HinfI (G/ANTC), MboI (/GATC), MspI (C/CGG), or RsaI (GT/AC). Digestions with TaiI (ACGT/), TacI (T/CGA), and TasI (/AATT) were performed at 65°C. After ethanol precipitation, the fragment mixtures were dissolved in 10 µl of sterile deionized water. Prior to gel loading, the samples (2.5 µl) were mixed with formamide (1.0 µl), loading buffer (Applied Biosystems) (1.0 µl), and a GeneScan 500 ROX size standard (Applied Biosystems) (0.5 µl). The analysis of 1.6 µl of this mixture was performed by 3 h of electrophoresis on a 36-cm-long 4.5% polyacrylamide gel at 2,700 V (ABI 377 automatic sequencer equipped with GeneScan 3.1.2; Applied Biosystems). The sizes of fragments were determined by using the local Southern method implemented in GeneScan 3.1.2 and a GS-500 ROX size standard. Only fragment lengths in the range of 30 to 500 nucleotides (nt) were considered for analysis. The noise threshold for signal processing was set as low as possible (generally about 20 relative fluorescence units) to cover a broad range of prominent and weak signals at the same time. Thus, peak heights for single tRFLP profiles were distributed over 2 orders of magnitude (
20 to
2,000 relative fluorescence units). The results were exported as tabulated-delimited text files (GeneScan 3.1.2) and further processed with TReFID (Fig. 1).
![]() View larger version (70K): [in a new window] |
FIG. 1. A flow chart describing the steps in the identification procedure of the TReFID program.
|
Preparation of the databases.
DNA sequence data for the three genes examined (the 16S rRNA, nifH, and nosZ genes) were downloaded in GenBank format by use of the Entrez nucleotide database query form (www.ncbi.nlm.nih.gov). In the cases of nifH and nosZ, all available sequences (October 2003) were analyzed using multiple sequence alignments (ClustalX 1.81) (18). Sequences including the binding sites for the fluorochrome-labeled primers (nifHF and nosZR) were incorporated in the respective TReFID databases. Those sequences had to be modified to fit the 5' end of the primer for nifHF or the 3' end of that for nosZR. Overhang nucleotides were cut, while missing nucleotides (up to 30) in the primer binding site were completed with "N" to assure correct fragment sizes. This modification was automatically accomplished by use of our own GBSD program (available under http://www.trefid.net) and a table of the number of nucleotides (up to 30) to be removed from or added to the reference sequences. Alignments of all sequences were manually done with ClustalX. Only 16S rRNA gene sequences deposited before January 2003 were used and similarly analyzed. A single GenBank sequence can be characterized by its set of theoretically derived tRFs. A graphical representation of this would be a polygon in a spider web graph (Fig. 2), where each axis represents a different restriction enzyme; the respective fragment sizes are dots on these axes, the origin being in the center.
![]() View larger version (39K): [in a new window] |
FIG. 2. Graphical representation of the tRFs obtained for nifH of N2-fixing bacteria and nosZ of denitrifying microorganisms. (I) For nifH, tRFs derived from the sequences deposited in GenBank (a) and tRFs experimentally obtained from the DNA isolated from the Dünnwald soil (b) are shown. (II) For nosZ, tRFs derived from the sequences deposited in GenBank (a) and tRFs experimentally obtained from the DNA isolated from the Dünnwald soil (b) are shown. The shaded polygon symbolizes the tRFs from Azospirillum brasilense Sp7. The lengths of the tRFs are given on the axes in the range between 0 and 500 bp.
|
2/3 matched to the tRFs of a polygon in TReFID were assigned a score sum. The procedure is described in detail under Results for cases in which the following defined conditions were met.
Polygon.
A graph is formed by all tRFs of one GenBank entry (thus for one organism or one sequence deposited) by use of all restriction enzymes (Fig. 2). For evaluation of the polygons, only those formed by the tRFs with a threshold of
2/3 are taken for analysis of the polygons deposited in the databank.
Match.
tRF is retrieved from the TReFID databank, with a deviation between expected and experimentally obtained lengths of not more than 1.5%.
Score.
Deviations between determined and theoretical tRF lengths as found in the TReFID bank are scored as follows: results of
0.5% receive a score of 1.0; results between 0.5 and 1.5% receive a score of 0.5; results between 1.0 and 1.5% receive a score of 0.25; and results of >1.5% receive a score of 0.
Score sum.
The sum for the tRFs with all the restriction enzymes employed referred to a single database entry (a polygon for an organism).
Threshold.
For the threshold value,
2/3 of restriction enzymes have to provide a tRF with a score of >0 for an analyzed sequence.
TReFID.
TReFID represents the terminal restriction fragment identifying program.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Summary of the total sequences and tRFs obtained for the three 16S rRNA, nifH, and nosZ genes
|
Unfortunately, the sequence information for nosZ is limiting. Only 85 out of 181 GenBank sequences available in October 2003 were suitable for constructing tRFs by use of the nosZR primer which binds near the 3' terminus of the gene. Together with the 43 sequences of the clone library, these GenBank sequences provided 1,038 tRFs, with only 129 being unique (Table 1). The present meager sequence information for nosZ is reflected by the few dots in the corresponding spider web graph (Fig. 2II, panel a) and the low number (128) of polygons obtained. In contrast, the sequence information for the 16S rRNA gene is much more than can be shown in a spider web graph similar to those for nifH and nosZ, because an axis for any restriction enzyme would contain too many dots (tRFs) to be graphically resolved.
Demonstration of the applicability of the approach by using DNA from an environmental sample.
The applicability of the method was tested with forest soil from the vicinity of Cologne. The loamy sand soil of the Dünnwald forest was selected because this soil was not rich in nitrogen, in contrast to many other locations in the Cologne area which are N saturated (16). Therefore, we expected a relatively high abundance of both denitrifying and N2-fixing bacteria which would enable us to retrieve tRFs of both groups. DNA isolated from this soil was used for PCR amplifications with the fluorochrome-labeled nifHF primer. Restriction digests with the eight different enzymes yielded 284 tRFs in total (Fig. 2I, panel b). These tRFs were then examined for related entities (organisms) with the tRF pattern in the nifH databank by use of the TReFID program (Table 1). When a tRF for a given restriction enzyme from the sample matched with one of the databank within 0.5% of the total nucleotide length, the score was set to 1.0. When the similarity between the determined length of a Dünnwald tRF and that of the closest one from the TReFID databank differed up to 1%, the score was set to 0.5. When such a difference was even 1.5%, the score was defined as 0.25. Any value outside of this range was defined as 0. Those restriction enzymes which did not provide a tRF for a segment because of the absence of a restriction site within the 30- to 500-bp sequence were not considered any further.
The polygon for Vibrio diazotrophicus was selected to illustrate the method. The enzymes AluI and MboI had no restriction site within the 400 bp between the nifHF and nifHR motifs and therefore could not be employed for analysis. Among the tRFs of the residual six restriction enzymes used, five gave the maximal score, meaning that the size of the best matching tRF showed less than 0.5% deviation from the predicted one. However, one enzyme (HinfI) gave a fragment of nearly the same size (being maximally 1.5% larger or smaller than that predicted fragment for V. diazotrophicus). The score sum for V. diazotrophicus in the Dünnwald soil is thus 5.25 out of 6 (88% identity). Thus, the program indicates with high fidelity that an organism occurs in the Dünnwald soil which is closely related to V. diazotrophicus.
As another example, in the case of the noncultured clone DUN1+B26, all eight restriction enzymes provided a tRF, but the similarity value was 6.75 out of 8 due to results that included six scores of 1.0, one score of 0.5, and one score of 0.25. The overall similarity between the pool of tRFs from the Dünnwald soil and DUN1+B26 was 6.75/8 (84%), indicating that tRFs of a bacterium closely related to the deposited clone from Dünnwald, DUN1+B26, had been retrieved. The method does not permit one to decide whether this bacterium retrieved by the tRF pattern was identical to that from the clone library. Clearly, the lower the score sum (in percent similarity), the lower is the probability that an organism with sequence similarity to any bacterium of the TReFID bank is present in the environmental sample. A match in only two cases, for example, could have been due to restriction sites at the same sequence position in two totally unrelated bacteria and would therefore be meaningless.
A threshold value was arbitrarily set for all three genes. In the case of the 16S rRNA gene, the tRFs, obtained from at least 9 restriction enzymes out of the 13 maximally used and providing a tRF within the range of 30 to 500 bp, had to give a score of 1.0, 0.5, or 0.25 (and thus matches of at least two of three) to be considered for further analysis (Table 2). Thus, the tRFs of the DNA of an environmental sample also form a polygon. To be used for the further analysis, at least 9 tRFs of the DNA of this sample had to match the 13 tRFs of one polygon present in TReFID. Cases of matches giving results of <2/3 were discarded. If one (or more) restriction enzyme(s) had no site within the PCR fragment analyzed, the tRF for this enzyme was left out. Matches were then referred to 12 (or less) enzymes, but the required threshold value was also kept at
2/3.
|
View this table: [in a new window] |
TABLE 2. Similarities of the tRFs polygons obtained from DNA of the Dünnwald soil with those of the TReFID databank
|
2/3 were further analyzed for their score sum by the TReFID program. An overall similarity value (i.e., the score sum/number of restriction enzymes with a restriction site in the sequence range analyzed) between 94 and 100% was regarded as being highly indicative of similarity to a polygon consisting of tRFs for a specific bacterium of the TReFID databank. In the case of the 16S rRNA gene, the majority of sequences of the TReFID results (1,037) listed for the Dünnwald soil had similarity values between 73 and 79% (Table 2). In the case of nifH, five out of eight restriction enzymes had to give a match with a score of 1.0, 0.5, or 0.25 (i.e., a match of two out of three at least) for the sequences to be considered any further. Remarkably, the average similarity value was about 10% less in the case of nifH than that for the 16S rRNA gene (Table 2).
The same protocol was tried for nosZ (Fig. 2II). The threshold was set to six out of nine restriction enzymes utilized (i.e., a match of at least two out of three). The highest matches of a polygon of Dünnwald soil DNA were to Pseudomonas stutzeri, with a score sum of 7.0 out of 8 restriction enzymes that had a restriction site inside the fragment analyzed, Paracoccus pantotrophus, with a score sum of 6.0 out of 7, and Azospirillum lipoferum, with a score sum of 6.0 out of 8. Thus, organisms closely related to the denitrifying bacteria mentioned occurred in the Dünnwald soil. The observed similarity value of the Dünnwald sequences for nosZ was as low as that for nifH (Table 2), although the size of the databank for nosZ was too small to allow us to draw a definitive conclusion here.
Controls to assess the quality of the TReFID program and to ascertain the feasibility of the approach. Control A.
The 16S rRNA gene tRFs obtained from the Dünnwald soil DNA were examined for matches with any of the 135 polygons of the Dünnwald clone library as part of the TReFID databank (Fig. 3a). For this examination, the stringency in the two parameters used to compare the tRFs from the soil DNA and Dünnwald clone library was varied. The ordinate in the figure indicates the percentage of hits of polygons (i.e., the sum of tRFs for each bacterium deposited) obtained from the DNA isolated from the Dünnwald soil (i.e., entity 1) within the total entries of the Dünnwald clone library (i.e., entity 2). The percentage of matches (polygon similarities) between the two entities ranging between one of two and three of four is given in one abscissa, whereas the percentage of deviations between the lengths of the tRFs from entity 1 and 2 between 0.33 and 1.0% is shown in the other. Figure 3a reveals that a high percentage of tRFs retrieved from the soil DNA are represented in the Dünnwald clone library. However, the correlation between both entities cannot be perfect, since the 135 clones deposited into the library cannot always be retrieved with different DNA preparations from a soil. In addition, DNA isolation, PCR amplification, and sequencing may have caused false negatives which are particularly obvious in cases of extreme stringency where three-fourths of the tRFs have to be present in both entities with a maximal deviation of 0.33% (Fig. 3). Taking all these difficulties into account, the proportion of sequences matching between both entities is remarkably high. Clearly, direct sequencing of the clones obtained provided more accurate data but is much more time consuming and expensive than TReFID analysis.
![]() View larger version (51K): [in a new window] |
FIG. 3. Identification of polygons representing organisms in the Dünnwald soil in two libraries. DNA was isolated from the Dünnwald soil, and the lengths of the tRFs obtained with all 13 restriction enzymes were experimentally determined for the 16S rRNA gene. The lengths of the tRFs were then compared with those calculated using the sequences from the Dünnwald clone library (a) and from GenBank (b). The ordinate represents the percentages of polygons of the soil DNA among the total polygons of either the Dünnwald clone library (a) or the GenBank sequences (b). One abscissa (front side) denotes the proportions of enzymes providing a tRF with a match corresponding to the total number of enzymes employed and providing tRFs for analysis. The other abscissa (right side) denotes the deviations in the lengths of the tRFs. The percentage of sequences retrieved was low when the deviation was restricted to 0.33% and was high at a deviation of 1.00%. For the further calculations, the values 0.66 (for the number of matches) and 0.5% (for the deviation) were selected (dark column in both parts of the figure). However, score values of 1.0, 0.5, or 0.25 (for definitions, see text) were taken for the calculations and this figure.
|
From the Dünnwald soil DNA, 1,373 polygons formed by tRFs scoring above the threshold (i.e.,
2/3 matches) in the total 17,327 entries of the 16S rRNA gene TReFID databank could be retrieved altogether. The corresponding values for the nifH and nosZ sequences were 130 out of 824 and 23 out of 97 total entries, respectively. To demonstrate the specificity of the TReFID bank, tRFs of the Dünnwald soil DNA were screened in the TReFID bank for the heterologous, false genes. For nifH tRFs, only three matching 16S rRNA gene polygons were detected in the 17,360 sequences of the 16S rRNA gene database, whereas all other combinations (tRFs for nifH in the nosZ bank, for the 16S rRNA gene in either the nifH or nosZ banks, and for nosZ in either the 16S rRNA gene or nifH banks) gave negative results.
Control B.
The TReFID data allowed the construction of synthetic tRFLP profiles of the genes in the Dünnwald soil DNA assayed on the basis of the tRF entries in the TReFID result list, as exemplified in Fig. 4, for the 16S rRNA gene by the use of Bsh1236I (a), MboI (b), and RsaI (c). The different heights in the peaks of the profile constructed from the TReFID database reflect the abundances of tRFs at distinct nucleotide lengths. The 16S rRNA gene tRFs of the Dünnwald soil were obtained by PCR using fluorochrome labeling, and their nucleotide lengths were determined experimentally. They corresponded well with those of the reconstructed profile. However, the peak height was small in some cases and was slightly above the peak detection threshold value, set at 20 relative fluorescence units (see peaks at 135, 325, or 380 nt in Fig. 4b). Thus, it was not clear in these cases whether such small peaks indicated a hit for a tRF. Despite this drawback, this procedure for constructing a synthetic tRF profile allows verification of the composition of a bacterial population for any environmental habitat with respect to the 16S rRNA, nifH, and nosZ genes or any other gene with a tRF data bank when this approach is applied to the tRFs obtained for all restriction enzymes.
![]() View larger version (27K): [in a new window] |
FIG. 4. Comparison in the tRF profiles obtained experimentally with DNA from the Dünnwald soil and predicted from the sequence information in the TReFID result list. DNA was isolated from the Dünnwald soil probe, and the tRF lengths were determined experimentally. By using the polygon constructions from the tRFs of all 13 restriction enzymes, a list of bacteria in the Dünnwald soil which are closely related to organisms with entries in the TReFID databank could be compiled. The tRFs of these closest relatives in the databank were then taken to construct the predicted tRF profile of the DNA from the Dünnwald soil.
|
![]() View larger version (36K): [in a new window] |
FIG. 5. The occurrence of the 16S rRNA gene tRF polygons related to those from Azospirillum brasilense, Escherichia coli, or Rhizobium leguminosarum. The TReFID program provided 192 polygons related to those of the three organisms and deposited in the TReFID databank (8 for A. brasilense, 111 for E. coli, and 64 for R. leguminosarum). Their sequences were used to construct the phylogenetic tree by use of the neighbor-joining method. The tree shows that only six artifacts were obtained (see text). The average sequence homologies between the three clusters of organisms (A. brasilense, E. coli, and R. leguminosarum) were 70.2, 80.8, and 71.1%, respectively. The numbers in the figure refer to the following: 1, Roseomonas fauriae (AF533354); 2, Roseomonas genomospecies (AY150050); 3, Azospirillum sp. (AB049110); 4, Devosia riboflavina (AF501346); 5, Agrobacterium tumefaciens (AF508094); 6, Agrobacterium tumefaciens (AF406666); 7, Sinorhizobium kummerowiae (AF364067); 8, Sinorhizobium meliloti (AF533685); 9, Rhizobium tropici (U89832); 10, Mesorhizobium plurifarium (AF516882); 11, Ochrobactrum sp. (AF452128); 12, Ochrobactrum anthropi (AF501340); 13, Idiomarina baltica (AJ440214); 14, Xenorhabdus nematophila (AF522294); 15, Pectobacterium carotovorum (AF373189); 16, Serratia quinivorans (AJ279050); 17, Serratia sp. (AF511524); 18, Serratia odorifera (AF286870); 19, Pectobacterium carotovorum (AF373184); 20, Erwinia amylovora (AF141892); 21, Escherichia albertii (AJ508775); 22, Escherichia coli (AY319394); 23, Obesumbacterium proteus (AY077753); 24, "Escherichia senegalensis" (AY217654); 25, "Dickeya dadantii" (AF520707); 26, Shewanella gelidimarina (AF530149); 27, Pseudoalteromonas sp. (AB055788); 28, Pseudoalteromonas prydzensis (U85855).
|
-proteobacteria. However, other major groups of soil bacteria that included acidobacteria and actinobacteria also occurred. Others (about 10%) consisted of spirochetes, firmicutes, bacteroidetes, fusobacteria, fibrobacteres, planctomycetes,
-proteobacteria, and cyanobacteria; these latter were Planktothricoides raciborskii, "Lyngbya hieronymnsii," "Planktothrix agardhii," "P. rubescens," "Trichodesmium havanum," Anabaena compacta, Nostoc sp., and several noncultured forms, among which some have been described only for marine habitats as yet. For nifH, the amount of sequences retrieved by this approach was not so large due to the limited number of sequences in the TReFID data bank (Table 3). Surprisingly, the proportion of sequences related to unclassified bacteria possessing this gene amounted to 194 (87%) and was thus high. This was an unexpected, new result obtained by the use of the TReFID program. However, sequences related to members of the classical N2-fixing groups (Rhizobiales, Rhodobacterales, and Pseudomonadales) were also detected. The file obtained for bacteria with nosZ was small, but the highest percentage of these again belonged to the unclassified organisms. The actinobacteria and acidobacteria retrieved from the 16S rRNA gene sequences were not present in the list of bacteria with nifH or nosZ. Only a few bacteria, Bradyrhizobium japonicum, Mesorhizobium ciceri, Methylosinus trichosporium, and Rhodovulum strictum, were found in the lists for both the 16S rRNA gene and nifH. Bacteria included in the list for the 16S rRNA gene and nosZ were Paracoccus denitrificans, P. pantotrophus, and Pseudomonas fluorescens. One bacterium, Pseudomonas stutzeri, possessed both nifH and nosZ but was not included in the list of the 16S rRNA gene. |
View this table: [in a new window] |
TABLE 3. Results for the groups of organisms retrieved from the Dünnwald soil by using the TReFID databank
|
|
View this table: [in a new window] |
TABLE 4. Taxa retrieved from the Dünnwald soil by using the TReFID databank
|
|
|
|---|
The tRF analysis also does not permit the determination of the abundance of single sequences in an environmental sample. This is because DNA templates differ in their primer homologies and because bacteria can contain different copy numbers of the 16S rRNA gene (4). In addition, PCR can hardly be exactly reproducible with different DNA preparations from the same environmental habitat unless extensive and careful calibrations are performed. Despite drawbacks, the 17,327 16S rRNA gene polygons in the TReFID database allow characterization of the population structure in an environmental sample at the DNA sequence level. The relative percentages of sequences retrieved which are related to a specific bacterium, as exemplified for E. coli, Azospirillum brasilense, and Rhizobium leguminosarum (Fig. 5), depend on the number of entries in the databank but do not reflect the situation in an environmental sample. As in other investigations, a sequence completely identical to one deposited in GenBank is rarely retrieved from a soil or other environmental sample. This might reflect the existence of a sequence continuum among the 104 ribotypes per g of soil (19), as was also inferred from other investigations (16).
The present report seems to be the first attempt to characterize N2-fixing and denitrifying bacterial profiles in an environmental sample by a tRF analysis. The nifH gene coding for nitrogenase reductase is highly conserved among N2-fixing bacteria (22), and more sequences are available for this gene than for the other two structural (nifDK) genes or for any other nif gene. The 822 polygons deposited in the TReFID databank are sufficient to allow characterization of the composition of a population of N2-fixing bacteria in environmental samples, as shown for the Dünnwald soil, but the resolution will improve as more nifH sequences become available. Clearly, the situation is presently in infancy for nosZ or any other gene coding for a gene of denitrification. Thus, the present communication merely indicates that the technique can theoretically also be employed for denitrification.
The TReFID program presented here and the phylogenetic assignment tool recently published (6) should complement each other in characterizations of members of a bacterial community by their 16S rRNA gene tRF profiles. The two methods differ in several respects. The approach by Kent et al. (6) employs a hierarchical algorithm which allows the identification of organisms by analysis of the tRFs provided by the restriction enzymes in a consecutive order (see Fig. 1 of reference 6). Their tRF data have been automatically generated by MiCA (http://mica.ibest.uidaho.edu). The TReFID program utilizes polygons of tRFs of defined bacterial sequences obtained from databanks or a clone library. The TReFID program is presently more comprehensive for analysis of 16S rRNA gene sequences. It contains many sequences from bacteria not yet cultured. The program by Kent et al. (6) utilizes the 8F primer, whereas TReFID employs the 63F region, which is strongly conserved among the 16S rRNA gene sequences deposited in the databanks (13). In addition, many sequences of GenBank contain the 63F but not the 8F region. Only the use of the 63F region as a primer binding site enabled us to develop the more than 17,000 polygons used here for the 16S rRNA gene. The program described in reference 6 could not easily install such a detailed phylogram as that shown in Fig. 5 (C. Rösch, unpublished).
Previous molecular analyses of soil or marine samples suggested a relatively low species richness with respect to nifH but not the nosZ or the 16S rRNA gene (15, 16, 17, 21). The present study with DNA from the Dünnwald soil indicated by the use of the TReFID program that most nifH sequences retrieved were related to those of noncultured bacteria whose sequences have been deposited in the GenBank. It is presently not clear whether this result represents a special feature of the Dünnwald soil or whether the tRFLP analysis more readily accesses noncultured bacteria than other molecular approaches tried so far. To resolve this issue, a more detailed analysis of the bacterial population and its dynamics after N fertilization is presently under way with soil samples from Dünnwald and also from other locations. This is now possible, since the tRFLP TReFID program allows rapid analysis of the bacterial composition of an environmental sample and avoids time-consuming and expensive cloning and sequencing.
This study was kindly supported by grants from the GEW Stiftung (Cologne, Germany) and the Deutsche Bundesstiftung Umwelt (Osnabrück, Germany).
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»