Previous Article | Next Article ![]()
Applied and Environmental Microbiology, May 2003, p. 2657-2663, Vol. 69, No. 5
0099-2240/03/$08.00+0 DOI: 10.1128/AEM.69.5.2657-2663.2003
Copyright © 2003, American Society for Microbiology. All Rights Reserved.
Marine Science Center, Northeastern University, Nahant, Massachusetts 01908
Received 20 November 2002/ Accepted 6 February 2003
|
|
|---|
|
|
|---|
This debate is a natural reflection of how limited our knowledge is on microeukaryotic diversity. The data available are derived primarily from cultivation studies and manual collection of cells from environmental samples. The proportion of cultivable microorganisms is likely to be under 1% (4, 24, 26), and this fact alone suggests that there may be an enormous reservoir of protistan diversity yet to be discovered (35).
The rRNA approach bypasses the problems of microbial uncultivability and has revolutionized the diversity studies. Its application to prokaryotic diversity resulted in some of the most interesting discoveries in environmental microbiology of the past decade (2, 6, 9, 23, 25, 28, 30, 37, 40, 41). In contrast, there are surprisingly few works exploring eukaryotic diversity in a similar manner (5, 7, 31, 34, 36). These studies reported a number of novel protistan lineages, some at the kingdom level. They also abundantly showed that the study of real microeukaryotic diversity is still in its infancy. Apart from identifying substantial gaps in our knowledge, this has an important implication for molecular phylogeny. Dawson and Pace (5) showed that the currently available collection of rRNA sequences is heavily biased toward only a few phylogenetic kingdoms. The accuracy with which both known as well as novel lines of descent are resolved can only be improved through a substantial growth of the sequence collection. This paper is a further contribution to the survey of marine microeukaryotic diversity and improvement of the resolution of the eukaryotic tree. We applied the 18S rRNA approach to samples from several suboxic and anoxic marine systems. We chose these environments because some of the most deeply divergent known eukaryotic lineages contain microaerobic and anaerobic organisms (15). Anoxic environments have occurred continuously throughout the Earth's history and promise to harbor a diversity of novel sequences potentially relevant to the early evolution of eukaryotes.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Physicochemical characteristics of samples used to generate clone libraries of eukaryotic 18S rRNA genes
|
Nucleic acid isolation.
High-molecular-weight DNA was obtained from the frozen filters and 5-g aliquots of sediment samples as described by Zhou et al. (43). In short, the samples were heated to 65°C for 2 h in a high-salt extraction buffer (see above) with sodium dodecyl sulfate (20%), cetyltrimethylammonium bromide (1%), and proteinase K (0.1 mg ml-1 final concentration). The lysates were purified twice by extraction with an equal volume of chloroform-isoamyl alcohol (24:1) and precipitated with a 0.6 volume of isopropanol. Humic acids and other potential inhibitors of downstream applications were removed by DNA purification with the resin-based Wizard DNA clean up system (Promega, Madison, Wis.). The integrity of the total DNA was checked by agarose gel electrophoresis (0.8%), and the DNA yield was quantified photometrically (Biophotometer; Eppendorf, Hamburg, Germany). The molecular size of extracted genomic DNA was between 11 and 13 kb (as determined by comparison to a molecular size standard). On average, we extracted 4.5 µg of crude DNA from a filter (1-liter sample). The benthic DNA yield was 9.5 and 52 µg of DNA per g of interface and sediment samples, respectively. After Wizard purification, we recovered approximately 40% of the crude DNA. Humic materials in crude extracts are assumed to interfere with DNA binding to the resin (43), resulting in these low recovery rates.
PCR amplification of eukaryotic rRNAs.
The nearly full-length 18S rRNA genes were PCR amplified by using a universal eukaryotic primer set (EukA 1 to 21 and EukB 1795 to 1772) (33). Each PCR mixture contained 10 to 20 ng of DNA template, 2.5 U of HotStar Taq DNA polymerase (Qiagen, Valencia, Calif.) in the manufacturer-provided reaction buffer [Tris-HCl [pH 8.7], KCl, and (NH4)2SO4], 1.5 mM MgCl2, 200 µM concentrations of each deoxynucleoside triphosphate, and 0.5 µM concentrations of each oligonucleotide primer. The final volume was adjusted to 50 µl with sterile water. The PCR protocol for 18S rRNA gene amplification consisted of an initial hot start incubation (15 min at 95°C) followed by 30 identical amplification cycles (denaturation at 95°C for 45 s, annealing at 55°C for 1 min, and extension at 72°C for 2.5 min) and a final extension at 72°C for 7 min. Negative control reactions included bacterial (Escherichia coli) or archaeal (Halobacterium salinarium ATCC 19700) DNA as a template.
Cloning, ARDRA, and sequencing.
Fresh PCR products were used to construct a clone library by using the TA cloning kit (Invitrogen, Carlsbad, Calif.) according to the manufacturer's instructions. Positive clones were identified by blue-white screening. Plasmids were isolated from overnight cultures by using the Qiaprep Spin Miniprep kit (Qiagen). The presence of an 18S rRNA gene insert was confirmed by PCR reamplification with the primer set and PCR protocol described above. Between 200 and 400 ng of positive amplification products of the target size were digested with 7.5 U of the restriction endonuclease HaeIII (New England Biolabs, Beverly, Mass.) for 60 min at 37°C, followed by an inactivation step for 20 min at 80°C. The resulting bands were separated by electrophoresis in a 2.5% low-melting-point agarose gel at 80 V for 2 to 3 h. Clones with identical amplified ribosomal DNA restriction fragment analysis (ARDRA) patterns were considered members of the same operational taxonomic unit (OTU). At least one clone of each OTU was sequenced at the University of Maine DNA Sequencing Facility with an Applied Biosystems (ABI) 373 DNA stretch sequencer, with the XL Upgrade and the ABI Prism BigDye, Terminator version 3.0, cycle sequencing ready reaction kit. Approximately 300 clones with unique ARDRA patterns were sequenced for the Sippewisset clone library. Low-quality sequence reads (7 sequences) and nontarget metazoan sequences (<10% in the water body and about 50% in each of both sediment layers) were excluded from the phylogenetic analyses. The clones from the water body are referred to as CCW (Cape Cod, water column), the clones from the sediment-water interface are referred to as CCI, and the clones from the deep anoxic layer are referred to as CCA.
Phylogenetic analyses.
The environmental 18S rRNA gene sequences obtained were compared to those in a GenBank by using gapped BLAST analysis (1) and to over 5,000 prealigned eukaryotic SSU rRNAs in the Ribosomal Database Project (32). The closest matches were aligned together with the Sippewisset clone sequences by using ClustalX (39), and the resulting multiple alignments were manually refined by using phylogenetically conserved secondary structures. The RNA secondary structure prediction and comparison tools of the Vienna RNA Package (27) were used to construct secondary structure models, which were verified against known conserved secondary structure regions (3). Environmental gene sequences were tested by using the RDP CHECK_CHIMERA program (32) to detect potential chimeric gene artifacts (38). Additionally, we tested 50 random 18S rRNA gene sequences by constructing sets of phylogenetic trees by using different regions of the same gene and examining the identity of the resulting trees' topology (16, 38). Further phylogenetic analyses included only conserved nucleotides at unambiguously aligned positions. Evolutionary trees were constructed by a distance-based (evolutionary distance) and by a character-based (maximum parsimony) method by using the PAUP* software package, version 4.0b10 (D. L. Swofford, Sinauer Associates, Sunderland, Mass.). All heuristic searches were performed by using random, stepwise addition of taxa with the tree bisection reconnection (TBR) branch-swapping algorithm. For each evolutionary distance tree we evaluated several models of character change: a Tamura-Nei minimum evolutionary distance model, a Kimura 2-Parameter model, an HKY85 model, and a general time-reversible substitution model under maximum-likelihood conditions (with two substitution models, a Ti/tv ratio of 2 or a six-category estimated general time-reversible model with individual base frequencies determined empirically). We assessed the relative stability of tree topologies by using 1,000 bootstrap replicates. Heuristic searches for bootstrap analyses employed stepwise addition, starting trees with simple addition of sequences and TBR branch swapping. Relative similarities between two different clones are based on ClustalX alignments of homologous regions of these two clones.
Nucleotide sequence accession number.
The gene sequences reported in this study have been deposited in the GenBank database and assigned GenBank accession numbers AY179969 to AY180047 and AY230211.
|
|
|---|
Several OTUs were excluded from analyses either because they represented nontarget organisms (2 crustaceans, 2 nematodes, and 2 polychaetes) or due to low quality or chimeric nature of their 18S rRNA gene sequences (21 OTUs). The final phylogenetic analyses thus included sequences of 42 unique protistan and fungal OTUs. This analysis showed (Fig. 1) that the CCW library was dominated by stramenopiles and alveolates (26% of the total number of unique OTUs each), followed by green algae (14%) and cercozoans (12%), fungi (10%), and euglenoids (2%). Four sequences showed no clear affiliation. Only 17% of sequences were closely related (>97% similarity) to sequences reported previously, mainly restricted to stramenopiles (e.g., CCW25 and CCW27) and green algae (CCW13). Most alveolate, cercozoan, and fungal sequences obtained were divergent from the closest known relatives by at least 5%. Several sequences could not be identified with any established protistan group, i.e., the three independent clades CCW8, CCW100, and CCW75, as well as CCW88, all of which branched in the base of the tree. Some of the deeply rooted sequences showed affiliation with uncultured clones from previous surveys of microeukaryotic SSU rRNA in aquatic environments. These included an independent clade CCW100/CCW75, a alveolate clade CCW77/CCW111, and a cercozoan clade CCW29/CCW83.
![]() View larger version (33K): [in a new window] |
FIG. 1. Phylogenetic tree for nearly complete SSU rRNA sequences of environmental clones from oxygen-depleted water in the Great Sippewisset salt marsh, Cape Cod, Mass., and the most closely related cultured organisms and environmental 18S rRNA signatures. Environmental clones are indicated by boldface type, and each clone is designated by the library designation (CCW) followed by a number. The tree topology was obtained by a heuristic search with stepwise addition of taxa (TBR branch swapping) under a Tamura Nei minimum evolutionary distance model, with gapped and ambiguously aligned positions excluded. Distance bootstrap values over 50% from an analysis of 1,000 bootstrap replicates are given at respective nodes. Sequences are followed by GenBank accession numbers.
|
![]() ![]() ![]() ![]() View larger version (139K): [in a new window] |
FIG. 2. Phylogenetic trees for nearly complete SSU rRNA sequences of environmental clones in the Great Sippewisset salt marsh, Cape Cod, Mass., and the most closely related cultured organisms and environmental 18S rRNA signatures. Environmental clones are indicated by boldface type, and each clone is designated by the library designation (CCI for water-sediment interface; CCA for subsurface sediment [10 cm]) followed by a number. The tree topologies were obtained by a heuristic search with stepwise addition of taxa (TBR branch swapping) under a Tamura Nei minimum evolutionary distance model (trees a, b, and c) and maximum-likelihood (tree d) trees, with gapped and ambiguously aligned positions excluded. Distance bootstrap values over 50% from an analysis of 1,000 bootstrap replicates are given at the respective nodes. Sequences are followed by GenBank accession numbers.
|
|
|
|---|
As expected, the sequences obtained from the clone libraries encompass all major protistan lineages, including alveolates, stramenopiles, cercomonads, euglenoids, and green algae, as well as fungi and nontargeted metazoan groups, such as invertebrates. Although PCR and cloning reactions tend to produce a biased outcome, screening of only ca. 300 clones yielded representatives of all these major groups expected in the environment under study.
The sequences of our SSU rRNA libraries (76 unique protistan OTUs) fall into three different categories, according to their relatedness to the ribosomal sequences reported earlier. The first category (29%) is the sequences closely related to the named organisms known from prior taxonomic surveys and/or available in culture collections. In general, the sequences from this group are 95 to 99% similar to those of known organisms, which indicates relatedness at the specific to generic levels (R. Droste and S. Epstein, submitted for publication). The closest matches (up to 99% similarity and above) included widely distributed ciliates (e.g., Plagiopyla, and Euplotes), green algae (e.g., Micromonas, Chlorococcum, and Chlamydomonas), and stramenopiles (e.g., Navicula, Amphora, Fragilaria, Synura, and Paraphysomonas).
The second category of sequences (59%) represents sequences that are not closely related to any named organisms but are branching within well-established protistan clades. As a rule, the depth of branching of these sequences is comparable to class level differences. This group contains the novel golden-brown algae CCI40 and the diatom clone CCA72. We also include in this category clones CCA42, CCA67, CCI32, CCW77, and CCW111, which have unclear taxonomic affiliations but are related to the A2- and BOLA-type sequences obtained in molecular surveys of other marine oxygen-depleted environments (5, 11).
The third group of sequences (12%) cannot be assigned to any established clade and seem to represent novel groups at the kingdom level. Four of the clones (CCW8, CCW88, CCI78, and CCI73) form early branches on the eukaryotic tree that precede the euglenozoa. To the same category belong several clones that are equally divergent from the known organisms but are affiliated with previously reported novel 18S rRNA-based kingdom level clades. Among these are clones CCA38, CCI7, and CCI31, which are related to two sequences recovered from a deep-sea hydrothermal vent site (11). These clones support and increase the diversity in what appears to be a novel protistan kingdom branching off the eukaryotic tree preceding trichomonads. Additionally, clones CCW75 and CCW100 are affiliated with a recently reported sequence DH145-EKD11 (31). Together, these three clones form a tight protistan clade whose origin seems to precede the crown radiation of eukaryotes. Interestingly, the CCW and DH clones originated from samples collected in different hemispheres and drastically different environments, yet their sequences are similar at the level approaching specific (>97%). This indicates that the novel clade may be ubiquitous and probably contains cosmopolitan representatives. Collectively, these findings illustrate how decidedly incomplete is the eukaryotic evolutionary tree and our view on protistan diversity.
It is important to know how well the sequence diversity in the clone libraries represents the organism diversity in the target community. Ciliated Protozoa may serve as a convenient model to address this issue because they have been surveyed by classical taxonomists for over a century (8, 10, 12, 19, 20, 21, 22, 29). This makes it possible to conduct a direct comparison between the molecular and classical inventories.
Taxonomic and ecological research of the past indicate that suboxic and anoxic sediments and waters are characterized by a range of facultative and obligate anaerobes, such as Plagiopyla, Metopus, Caenomorpha, and others (13, 14, 15, 18). Using the rRNA approach, we detected sequences related to all of the above genera (e.g., clone CCW92 reveals 99% similarity to Plagiopyla frontata) as well as to other typical benthic ciliates such as Euplotes, Aspidisca, and Spathidium. In addition, we detected a large number of lineages within the ciliate clade whose phylogenetic position suggest relatedness to anaerobic and microaerophilic ciliates (e.g., CCA11, CCA31, CCI74, CCI32, and CCA46). Therefore, even limited clone libraries such as ours with only ca. 300 clones processed contain the signatures of those genera that are expected to dominate the sampling site. This identifies the 18S rRNA approach as useful in identifying at least some of the key elements in the ciliate assemblage in question. The latter is undoubtedly more diverse than the collection of ciliate-related sequences we obtained. However, the ciliate sequence diversity should increase dramatically as the clone library is sampled more systematically, perhaps reaching, and likely exceeding, the level of ciliate diversity known from classical research. The relatively good coverage of the ciliate dominant taxa exhibited by our clone libraries tentatively indicates that these libraries may contain a reasonable diversity record of other protistan taxa from the Sippewisset salt marsh.
In conclusion, the 18S rRNA approach seems to be a valuable tool in assessing the global diversity of aquatic eukaryotes. The first applications of this approach to microeukaryotes in aquatic environments (5, 7, 11, 31, 34, 36, 42) uncovered a substantial number of novel clades from the specific to the kingdom level. This work supports some of the previously reported clades and identifies novel deeply branching lineages in the eukaryotic evolutionary tree. The oxygen-depleted habitats seem to be particularly promising in a search for such new lineages, which calls for an extended SSU RNA sequence collection in this and other extreme environments as part of surveying the diversity of life and in an attempt to develop models of the early Precambrian microbial communities.
We are indebted to Edward Jarroll (Marine Science Center, Northeastern University) for invaluable support throughout this study in too many areas to list. We thank Ted Maney (Marine Science Center, Northeastern University) for lending us a hand in many challenging situations. Our acknowledgments extend to Rita Droste and Victoria Alla (Marine Science Center, Northeastern University) for help in the field as well as in the lab. We are grateful to Virginia Edgcomb (Woods Hole Oceanographic Institution) for stimulating discussions on protistan phylogeny.
This is contribution no. 245 of the Marine Science Center of Northeastern University. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»