Previous Article | Next Article ![]()
Applied and Environmental Microbiology, March 2008, p. 1546-1554, Vol. 74, No. 5
0099-2240/08/$08.00+0 doi:10.1128/AEM.02103-07
Copyright © 2008, American Society for Microbiology. All Rights Reserved.

Department of Marine Sciences, University of Connecticut, Groton, Connecticut 06340,1 Department of Biological Sciences and Roy J. Carver Center for Comparative Genomics, University of Iowa, 446 Biology Building, Iowa City, Iowa 52242-1324,2 Graduate School of Oceanography, University of Rhode Island, South Ferry Road, Narragansett, Rhode Island 028823
Received 13 September 2007/ Accepted 20 December 2007
|
|
|---|
|
|
|---|
Given the challenge of obtaining pure Dinophysis samples from natural microbial assemblages for DNA and RNA extraction, isolation of these cells can be made more efficient with the use of flow cytometry (6). Furthermore, dinoflagellate-specific primers can significantly reduce the amplification of gene homologs in other organisms still present in the sorted cell samples. In this study, we combined these two methods to maximize the chance of obtaining gene sequences from the target species. We chose to analyze three genes: the small-subunit (SSU) rRNA (18S rDNA), mitochondrial cytochrome b (cob), and cytochrome c oxidase I (cox1). The 18S rDNA gene was included because a large data set is available for this conserved gene. Mitochondrial genes were selected because of their higher mutation rates and a more clock-like behavior relative to 18S rDNA (see references 10 and 45 and references therein). In addition, mRNA editing characteristics of mitochondrial genes can potentially act as species-specific molecular markers (20). Thus, taking advantage of dinoflagellate-specific gene primers developed in recent years (19, 42), we sequenced these three genes and cDNAs of cob and cox1 from Dinophysis acuminata collected in 2005 from a field population from Narragansett Bay, RI. We then characterized mRNA editing of cob and cox1 for this and other dinoflagellate species. Our main goal was to develop an approach that would lead to an efficient genetic analysis of environmental samples for ecological and phylogenetic purposes and allow for the study of gene expression in a natural dinoflagellate population.
|
|
|---|
Dinoflagellate cultures and sample collection.
The dinoflagellate taxa used in this study are listed in Table 1. All photosynthetic species were grown in f/2-Si medium except D. acuminata collected from Narragansett Bay and the heterotrophic species grown with an algal prey (Rhodomonas sp. strain CCMP768) (46). Salinity was adjusted to 28 PSU for most species and to 15 PSU for Rhodomonas, Karlodinium veneficum, and heterotrophic taxa (46). Cultures were maintained at 20 ± 1°C under a 12 h:2 h light-dark cycle with a photon flux density of ca. 100 µmol photons·m–2·s–1. The growth rate was monitored by microscopic cell counts using a Sedgwick-Rafter chamber.
|
View this table: [in a new window] |
TABLE 1. Dinoflagellate species included in the phylogeny and mitochondrial cob and cox1 editing analyses
|
Cloning and sequencing of 18S rDNA, cob, and cox1.
DNA and total RNA were extracted and cDNA synthesized as previously described (21). The resulting cDNAs and genomic DNA were subjected to PCR amplification with dinoflagellate specific primer sets 18ScomF1 (5'-GCTTGTCTCAAAGATTAAGCCATGC-3')-Dino18SR1 (5'-GAGCCAGATRCDCACCCA-3') and Dino18SF1 (5'-AAGGGTTGTGTTYATTAGNTACARAAC-3')-18ScomR1 (5'-CACCTACGGAAACCTTGTTACGAC-3') for 18S rDNA (17), Dinocob1F (5'-ATGAAATCTCATTTACAWWCATATCCTTGTCC-3')-Dinocob1R (5'-TCTCTTGAGGKAATTGWKMACCTATCCA-3') for cob, and Dinocox1F (5'-AAAAATTGTAATCATAAACGCTTAGG-3')-Dinocox1R (5'-TGTTGAGCCACCTATAGTAAACATTA-3') for cox1 (21, 44). PCRs comprised 35 cycles of 25 s at 94°C, 30 s at 52°C, and 40 s at 72°C, followed by 10 min at 72°C. PCR products were directly sequenced as reported by Zhang and Lin (43).
Inference of RNA editing and analysis of editing characteristics.
Genomic DNA and corresponding cDNA sequences of cob and cox1 were aligned using CLUSTAL W (1.8) (http://clustalw.ddbj.nig.ac.jp/top-e.html). Differences between the genomic and cDNA sequences were identified as an indication of mRNA editing. Editing density, or the percent nucleotides of the gene sequence that are edited, was determined for each species. Editing densities were compared between groups of taxa that were clustered on phylogenetic trees. Furthermore, types of nucleotide and amino acid changes as a result of editing were analyzed. The percent edited sites that were mapped to the same location (location of editing [LOE]) between two species and that underwent the same type of nucleotide substitution (type of editing [TOE]) between the two species were also calculated. In addition, the frequency of editing at each position of the codon was analyzed.
Phylogenetic analysis.
The DNA sequences encoding COB and COX1 were aligned using REVTRANS (http://www.cbs.dtu.dk/services/RevTrans/) with the default values. The cob and cox1 cDNA sequences were combined to produce a two-gene data set (1,963 nucleotides [nt]) and then with SSU rDNA (1,714 nt) to generate a cob-cox1-SSU rDNA three-gene (3,677-nt) data set. The apicomplexans Plasmodium yoelii and Plasmodium berghei were used to root the COB and COX1 protein tree, whereas the early diverging dinoflagellate Oxyrrhis marina was used to root the three-gene DNA tree (46). In the previous study (45), the congruence of COB, COX1, and rDNA data from dinoflagellates was tested (with the D. acuminata data not included) using the partition homogeneity test (ILD test in PAUP*; 1,000 replicates). An absence of significant incongruence between the COB and COX1 protein alignments (P = 0.136) and significant incongruence between the DNA data from these two partitions (P = 0.003) were noted. However, the single protein and DNA trees did not differ substantially from each other in these analyses, suggesting that gene concatenation would not mask a clear topological conflict (for details, see Fig. 1 and 3 in reference 45), but rather increased overall bootstrap support in the phylogenies. In this study we repeated the analysis for each gene pair in the three-gene alignment. These ILD test results again showed that P = 0.001, 0.001, and 0.003 for the rDNA-cob, rDNA-cox1, and cob-cox1 data, respectively (see also reference 45). Although these were significant P values, we chose to combine these data because of our previous work (45) and because there was considerable debate about the usefulness of the ILD test (4, 14). We also did a preliminary maximum-parsimony (MP) analysis with each data partition to determine whether the position of D. acuminata was resolved in any of the single-gene trees with an MP bootstrap support value of
70%. This analysis revealed that this taxon emerged as an independent lineage in all three gene trees, leading us to concatenate the data in the hope of achieving greater phylogenetic resolution.
For the three-gene DNA data set, the "best" tree was inferred using PAUP* and the site-specific GTR model (ssGTR) (31) with different evolutionary rates for each amino acid codon position and for the rDNA data. Bootstrap analyses were done using PHYML (200 replicates) with the GTR + I +
model over all nucleotide positions. Bayesian posterior probabilities for the ssGTR tree were calculated using MrBayes and the ssGTR +I +
model over the four data partitions. These analyses were run as described by Zhang et al. (45). Unweighted MP bootstrap analyses were also performed with the three-gene data set (2,000 replications) using heuristic searches and TBR branch-swapping to find the shortest trees (38). The number of random-addition replicates was set to 10 for each bootstrap tree search, and best-scoring trees were held at each step.
For the COB-COX1 data set, ProtTest V1.3 (1) was used to identify the best-fit model with "Fast" optimization and a BIONJ tree. The ProtTest parameter values were then used in maximum-likelihood analyses with the RAxML (VI-HPC, v2.2.1) computer program using the hill-climbing algorithm (36). The results of a PHYML V2.4.3 (12) bootstrap analysis (200 replicates) with tree optimization were used to assess the robustness of monophyletic clades in the RAxML tree. The protein data set was also analyzed using Bayesian inference (MrBayes V3.0b4) (15). The ProtTest best-fit evolutionary model was used in this analysis with Metropolis-coupled Markov chain Monte Carlo from a random starting tree. Four chains were run simultaneously, of which three were heated and one was cold, and the nrun = 2 command was used to monitor tree standard deviations. To increase the probability of chain convergence, trees were sampled after the standard deviations of the two runs were <0.05 to calculate the posterior probabilities (i.e., after 56,200 generations). The remaining phylogenies were discarded as burn-in. For the COB-COX1 data set, an unweighted MP bootstrap analysis was also performed as described above.
Testing the tree topology.
To assess the position of D. acuminata in the three-gene DNA tree, we generated a backbone phylogeny that was identical to the "best" ssGTR topology but excluded this species. Using this backbone tree, D. acuminata was then added individually using MacClade V4.05 to each branch in the tree to generate a set of topologies that addressed all possible positions for this taxon (23). The site-by-site likelihoods for the trees in this analysis were calculated using the three-gene data set and TREEPUZZLE (V5.2) (33) with the GTR + I +
evolutionary model (the alpha value for the gamma distribution was identified using RAxML) and the default settings. The approximately unbiased (AU) test was implemented using CONSEL V0.1i (35) to identify the pool of probable trees in this test and to assign their probabilities. The phylogenetic relationship of LOE and TOE was analyzed for 17 dinoflagellates. Each edited site was designated as a character with presence (1) or absence (0) of editing in each species as its state. For the TOE-based MP analysis, each of the six TOEs that was observed was treated as a character, each of which was assigned a state value of 1 or 0 for presence or absence of that editing type in each species. The six characters for each species were recorded and concatenated into a single alignment for unweighted MP bootstrap analysis as described above. In addition, Bayesian analysis was conducted for the LOE and TOE concatenated data set in an alignment that also included the COB and COX1 protein sequences. We used an "unlinked" analysis in which independent model parameter estimates were calculated for each data partition, i.e., independent gamma parameter estimate for the editing data and independent invariants-gamma model parameter estimates for the protein data (i.e., cpREV + I +
as chosen by ProtTest V 1.3 [1]). The Bayesian analysis was done as described above, and the trees were sampled after the standard deviations of the two runs were <0.01 to calculate the posterior probabilities (i.e., after 59,000 generations).
|
|
|---|
18S rDNA, cob, and cox1 sequences.
The use of two primer sets of the 18S rDNA yielded a longer gene fragment than possible with a single primer set, resulting in a 1,741-bp fragment. Genomic sequences of cob (926 bp) and cox1 (1,338 bp) and corresponding cDNA sequences (cob, 926 bp; cox1, 1,338 bp) were also obtained. The 18S rDNA was identical to that reported for D. acuminata (accession no. AJ506972) and the cox1 sequence obtained here was similar to those of dinoflagellates reported previously (21, 43). These sequences were colinear with homologs in other dinoflagellates, with no indels detected. When translated to amino acid sequences, the critical histidines conserved in COB (4 His; ligands for the heme b group in apocytochrome b) and COX1 (6 His; ligands for heme a, CuB, and heme a3) of other organisms were identified in the D. acuminata proteins.
mRNA editing characteristics.
Comparison of the colinear genomic and corresponding cDNA sequences of D. acuminata revealed edited sites for cob and cox1. Five TOEs were detected for cob; A
G, U
C, and C
U were predominant, followed by G
C and G
A (Table 2). For cox1, six TOEs were observed, with additional A
C edits in comparison to cob. Similar to the case for cob, cox1 editing was dominated by A
G and U
C, followed by G
C and C
U. There were 31 editing events in cob and 30 in cox1, accounting for 3.35% and 2.24%, respectively, of the sequence length covered in the study. These edited sites were distributed in four (Fig. 1) and three (Fig. 2) discrete clusters for cob and cox1, respectively, as found in other dinoflagellates. Unique to D. acuminata was a silent U
C change that occurred in the gap between clusters II and III (Fig. 1). In cox1, there were two unique editing sites (site 3 and 17, both A
G) in D. acuminata; however, they resided in clusters I and II, respectively. Similar to the case for other dinoflagellates documented previously (21, 43), editing in both cob and cox1 of D. acuminata occurred predominantly in the first and second positions of the affected codons, resulting in changes in the encoded amino acid residues. These changes did not result in an increase of similarity in amino acid sequences between Dinophysis and other lineages (not shown). Rather, editing caused some changes in the proportions of several amino acid residues (Fig. 3). For COB in D. acuminata, leucine and alanine increased and threonine and glycine decreased most markedly. For COX1, the greatest increase occurred in valine, leucine, and alanine whereas, isoleucine, threonine, and glycine decreased. These changes led to only a small increase in the average hydrophobicity for both COB and COX1 (from 8.4 to 9.1 and from 9.0 to 9.3, respectively, as estimated using PepTool [BioTools, Inc., Alberta, Canada]).
|
View this table: [in a new window] |
TABLE 2. Distribution of number and type of editing in cob and cox1 mRNAs in dinoflagellates
|
![]() View larger version (39K): [in a new window] |
FIG. 1. Schematic diagram of the distribution of editing events in mitochondrial cytochrome b (cob) in dinoflagellates. The sites at which editing increases similarity between dinoflagellate and nondinoflagellate species are designated by either a closed circle (editing to an amino acid identical to the consensus) or a pentagon (editing to an amino acid chemically similar to the consensus). A triangle denotes dinoflagellate-specific editing. Brackets with Roman numerals indicate discrete clusters of editing events. Species abbreviations are as listed in Table 1, and those underlined are taxa whose gene sequences were analyzed in this study.
|
![]() View larger version (30K): [in a new window] |
FIG. 2. Schematic diagram of distribution of editing events in mitochondrial cytochrome c oxidase subunit I (cox1) in dinoflagellates. Symbols are as in Fig. 1. Brackets with Roman numerals indicate discrete clusters of editing events. Species abbreviations are as listed in Table 1, and those underlined are taxa whose gene sequences were analyzed in this study.
|
![]() View larger version (20K): [in a new window] |
FIG. 3. Editing-mediated increase (+) or decrease (–) in the content of the 20 amino acid substituents of COB and COX1. Amino acids are denoted by their standard one-letter abbreviations.
|
![]() View larger version (39K): [in a new window] |
FIG. 4. Maximum-likelihood tree inferred from 18S rDNA-cob-cox1 three-gene concatenated data set (A) and mapping of editing density (B). (A) The three-gene maximum-likelihood tree was inferred using PAUP with an ssGTR model (four categories of nucleotide substitution rates). Bootstrap support was assessed using PHYML with a GTR + I + model (numbers above branches) and MP analysis (numbers below branches). Posterior probabilities from Bayesian analysis when 0.95 are indicated as thick branches. The tree is rooted with Oxyrrhis, the lineage that is consistently placed at the base of all dinoflagellates when trees are rooted with the apicomplexan Plasmodium. (B) Editing density is plotted against taxa (abbreviations of species names are as in Table 1), with filled bars of cob stacked by open bars of cox1. Note that in the phylogenetic tree, there is a well-supported early-to-later branching order from Oxyrrhis, to Amphidinium, to Heterocapsa, to Symbiodinium, to the clade of Akashiwo/Karlodinium/Karenia. Accordingly, editing density increases in this order. For the remaining lineages, phylogenetic resolution is less clear, and editing densities among those lineages are similar.
|
Editing-based phylogeny.
The Bayesian consensus tree based on the concatenated data (i.e., six categories) of editing information is generally congruent with the three-gene DNA tree although providing far less resolution of branching order among derived dinoflagellate clades due to the paucity of informative editing characteristics for these taxa (Fig. 5A versus Fig. 4A), However, again we found no significant Bayesian support for a specific affiliation of D. acuminata (Fig. 5A). These data showed that D. acuminata diverged somewhere in the region occupied by gonyaulacoid, gymnodinioid, prorocentroid, and peridinioid lineages that formed a multifurcation (Fig. 5A). Similarly, the Bayesian consensus tree (burn-in = 9,000 generations) inferred using the unlinked models for editing (+
) and for each protein partition (cpREV + I +
) revealed the affiliation of D. acuminata in the neighborhood of the prorocentroid, gonyaulacoid, and peridinioid clusters (Fig. 5B).
![]() View larger version (24K): [in a new window] |
FIG. 5. (A) Bayesian consensus tree that was inferred from the concatenated editing data. The posterior probabilities that were calculated after the standard deviation between the two runs was <0.05 (i.e., 11,000 generations) are shown at the branches. (B) Bayesian consensus tree (burn-in = 9,000 generations) inferred using the unlinked models for editing (+ ) and for each protein partition (cpREV + I + ). The average standard deviation between two runs was <0.01. Posterior probabilities of 0.95 are indicated by thick lines.
|
|
|
|---|
Amplifying multiple genes (nuclear and mitochondrial) and deciphering characteristics of editing density proved useful in characterizing dinoflagellates, albeit still not allowing us to elucidate the phylogenetic relationships of D. acuminata. Low taxon sampling within this genus may explain this result. It has already been demonstrated that a combination of cob and nuclear genes provides robust phylogenetic trees for alveolates and other organisms (30, 34) and that cob and cox1 can be used for inferring dinoflagellate phylogeny (44, 45). Our study indicates that the characteristics of editing density bolster the results of the multigene phylogeny approach even for a natural population.
In our analysis of the editing of cob and cox1, the putative basal lineages exhibited low (Amphidinium) or no (Oxyrrhis and Heterocapsa) editing, whereas more-derived lineages (including Dinophysis) consistently had higher levels of editing. This result extends the previous observation of cob mRNA editing density increasing from basal to derived lineages of dinoflagellates (43, 46). The editing alone and editing-plus-protein phylogenetic trees are generally similar to the gene sequence tree, in that Oxyrrhis, Heterocapsa, Noctiluca, and Amphidinium occupied basal positions in the tree. The clustering of Pfiesteria and related taxa with Scrippsiella is in full agreement with our current understanding that Pfiesteria and related lineages are peridinioid, and the affiliation of Karenia with Karlodinium in all of the trees is also consistent with our current understanding of their phylogenetic positions (see, e.g., references 9, 41, and 45). These data allowed us to draw two important conclusions. First, D. acuminata is not related to any of the basal lineages identified in this study based on the phylogenetic analyses and the frequency and distribution of edited sites in the cob and cox1 genes. In support of this conclusion are the intermediate level of editing density and the fact that D. acuminata was never placed near the root in any of the phylogenetic trees inferred using various combinations of data.
Second, D. acuminata is no more closely related to Prorocentrales than to Gonyaulacales or Gymnodiniales. This is despite the fact that Dinophysiales and Prorocentrales share major synapomorphies. Both lineages have a theca divided into lateral halves and have two apical pores, one large and the other small, with both flagella arising from the larger pore (39), and many Dinophysis and some Prorocentrum species (e.g., P. lima) produce polyether-type toxins (27). The apparently close relationship between the two dinoflagellate orders based on these features is not strongly supported by phylogenies based on multiple genes and editing characteristics. This weak correspondence stresses the need to include as many analytic approaches as possible and to use caution when inferring relationships between groups of organisms. Indeed, increasing evidence shows that algal lineages that produce the same type of metabolites can be phylogenetically distantly related. For instance, some cyanobacteria (e.g., Anabaena circinalis) and Alexandrium species produce saxitoxin (3, 29). Amphidinium and Karlodinium produce linear polyketide toxins (2, 8, 17). Our study provides another piece of evidence that similar toxin-producing attributes do not necessarily indicate a close phylogenetic affiliation, in our case between Prorocentrum and Dinophysis. The AU test rejects the positioning of D. acuminata at the base of the Prorocentrales (P = 0.007) or as sister to any of its members (P < 0.01).
In summary, our study presents a novel approach that could prove successful in characterizing environmental samples, especially when harmful algal bloom species are considered. Currently, our approach is time-consuming and requires highly skilled workers, and it thus can be used only for research purposes. Nevertheless, it can contribute to the development of molecular markers specific to certain target organisms, especially when genes involved in toxin production are incorporated in the approach.
This research was supported by NSF grants DEB-0344186 (to S.L. and H.Z.), EF-0629624 (to S.L.), DEB-0107754 (to D.B.), and MCB-0236631 (to D.B.).
Published ahead of print on 28 December 2007. ![]()
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»