Previous Article | Next Article ![]()
Applied and Environmental Microbiology, February 2009, p. 991-997, Vol. 75, No. 4
0099-2240/09/$08.00+0 doi:10.1128/AEM.02024-08
Copyright © 2009, American Society for Microbiology. All Rights Reserved.
,

Department of Earth and Ocean Sciences, University of British Columbia, Vancouver, British Columbia, Canada V6T 1Z4,1 Departments of Earth and Ocean Sciences, Botany, and Microbiology and Immunology, University of British Columbia, Vancouver, British Columbia, Canada V6T 1Z42
Received 1 September 2008/ Accepted 11 December 2008
|
|
|---|
|
|
|---|
Many viruses infecting eukaryotic phytoplankton are members of the Phycodnaviridae (3), a family of large (100 to 220 nm in diameter), polyhedral, double-stranded DNA viruses (29). Exploration of the genetic richness of Phycodnaviridae in environmental samples has employed PCR-based methods and degenerate primers (algal-virus-specific 1 and 2 [AVS-1 and AVS-2]) that amplify
-like DNA polymerase (pol) gene fragments (6, 7, 18). Combining this method with denaturing gradient gel electrophoresis (PCR-DGGE) and phylogenetic analysis has shown that some closely related phycodnaviruses are cosmopolitan in distribution (19-21), while others appear to be restricted to specific environments (18). Moreover, phylogenetic analysis has demonstrated that Phycodnaviridae infecting the same species are typically much more closely related than those that infect different species (22).
In our study, we used PCR-DGGE and phylogenetic analysis to examine the genetic richness and relationships among Phycodnaviridae within several lakes in the Experimental Lakes Area (ELA) of western Ontario, Canada. These data were analyzed using genetic distance and multivariate statistics to infer the number and identities of potential host species.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Environmental parameters for samples from the ELA
|
80-fold from 18 liters of 0.45-µm-filtered (142-mm-diameter polyvinylidine difluoride Durapore filters; Millipore) lake water (26). The concentrates were stored at 4°C in the dark until they were processed within 2 to 7 months.
PCR-DGGE.
Viral DNA was extracted from each concentrate using the MoBio (Carlsbad, CA) Ultra Clean Soil DNA extraction kit and stored frozen until it was used for PCR. The volume of concentrate extracted was standardized to 50 ml of lake water.
DNA polymerase gene (pol) fragments from Phycodnaviridae were amplified using two rounds of PCR with AVS-1 and -2 primers (6, 7). In the first round, 5 µl of DNA template was added to a 45-µl PCR mixture containing 5.0 µl of 10x PCR buffer, 1.5 µl of 50 mM MgCl2, 1.0 µl of each of the 2.0 mM deoxynucleotide triphosphates, 1.0 µl of 10 nM AVS-1, 3.0 µl of 10 nM AVS-2, 0.625 U of Platinum Taq DNA polymerase (Invitrogen, Burlington, ON, Canada), and water. The negative control was prepared as described above but without a DNA template, while Micromonas pusilla virus SP1 (MpV-SP1) was used as a positive control. The PCR conditions were as follows: initial denaturation at 95°C for 90 s, followed by 40 cycles of denaturation at 95°C for 45 s, annealing at 45°C for 45 s, and extension at 72°C for 45 s, and a final extension at 72°C for 7 min. To confirm amplification, 10 µl of PCR product and 2 µl of 6x loading buffer were loaded onto a 1.5% agarose gel flanked by a 100-bp ladder (Invitrogen). The gel was run at 90 V for 60 min in Tris-borate-EDTA buffer (0.5x), stained with ethidium bromide, and viewed with a gel documentation system (AlphaImager 3400; Alpha Innotech, San Leandro, CA). The bands (
800 bp) were plugged with a clean Pasteur pipette and placed in sterile 0.5-ml microcentrifuge tubes. DNA was eluted by adding 100 µl of 1x Tris-acetate-EDTA (TAE) and heating it to 65°C for 60 min. Two microliters of the eluted DNA was used in a second-round PCR prepared as described above, except the number of cycles was reduced to 26. Positive and negative controls were prepared as before; however, the negative control from the first-round reaction was also plugged and included as an internal control in the second-round reaction. The second-round PCR products were stored at –20°C.
DGGE was used to separate the AVS-amplified products (21). Second-round PCR products (40 µl) and 6x loading buffer (10 µl) were loaded onto a 6 to 7% polyacrylamide gel, which had a 20 to 40% gradient of denaturant (100% denaturant is defined as 7 M urea and 40% deionized formamide). Samples were run at 60 V for 15 h in a 60°C 1x TAE buffer, using a D-code electrophoresis system (Bio-Rad, Hercules, CA). Upon completion, the gel was stained in a 1x SYBR green I solution (Invitrogen) for >3 h and then visualized and photographed with a gel documentation system.
Cloning and sequencing.
Several randomly selected bands from each lake sample were excised from the DGGE gel with a Pasteur pipette and placed in sterile microcentrifuge tubes, and the DNA was eluted by adding 100 µl of 1x TAE and heating it to 95°C for 15 min. Two microliters of the eluted DNA was used as the template in a PCR with AVS primers under the conditions described above for the second-round AVS PCR. The amplified DNA was purified (MinElute PCR purification kit; Qiagen, Germantown, MD), ligated, and transformed into Escherichia coli strain TOP10 using a TOPO TA Cloning kit (Invitrogen), and following an overnight incubation (37°C), the inserted pol fragments were amplified according to the manufacturer's recommendations. The amplified PCR products (
800 bp) were cleaned with a MinElute PCR purification kit, diluted, and sequenced using Applied Biosystems BioDye v3.1 Terminator Chemistry (Applied Biosystems, Foster City, CA) at the University of British Columbia's Nucleic Acid and Proteins Services Facility.
Sequence analysis.
Phylogenetic analysis was used to compare phycodnavirus DNA pol sequences from the ELA lakes (n = 20) with those from other freshwater phycodnaviruses (n = 24) (18) and those from several isolates (n = 28) (Tables 2 and 3), while a second analysis included environmental phycodnavirus sequences from the ELA lakes (n = 20), other lakes (n = 6), and oceans (n = 44), including the top 20 matches from the Global Ocean Survey (GOS) data (15) to each of the 20 ELA lake sequences (n = 35) (Tables 2 and 3).
|
View this table: [in a new window] |
TABLE 2. Phycodnaviridae isolatesa
|
|
View this table: [in a new window] |
TABLE 3. Environmental samplesa
|
Specificity of AVS primers.
To determine if AVS-amplified gene fragments represent the overall richness of the Phycodnaviridae, DNA pol sequences from the phycodnaviruses Paramecium bursaria Chlorella virus 1 (PBCV-1) and Emiliana huxleyi virus 86 (EhV) (accession numbers U42580 and AJ890364, respectively) were queried against the GOS data (15). As the virus and GOS sequences were obtained without AVS amplification, they were free of primer bias. The top 20 matches to PBCV and EhV that contained the conserved motif YGDTDS (Asp-Asp), along with sequences from the isolates, were phylogenetically analyzed (as described above) to determine if the tree topology was the same for amplified pol sequences as for sequences obtained from whole genomes and environmental shotgun libraries.
Inferring the number of potential hosts.
An ML tree of phycodnavirus isolates was generated with sequences from isolates of PBCV, EhV, MpV, Phaeocystis globosa virus (PgV), and Chrysochromulina brevifilum virus (CbV) (Table 2). The maximum genetic distances between sequences from viruses infecting the same host species (within), and the minimum genetic distances between sequences from viruses infecting different host species (between) were determined. A discriminant analysis of the genetic-distance data was used to predict the likelihood that an "unknown" sample belonged in a particular group and therefore measured the robustness of the grouping criteria (SYSTAT v. 11). Finally, the within- and between-group means, standard errors, standard deviations, and 95% upper and lower confidence intervals were calculated and compared (by analysis of variance [ANOVA]) to determine the threshold genetic distance separating viruses infecting the same species (SYSTAT v. 11). Sequences separated by distances larger than this threshold value were assumed to have originated from viruses that infect different phytoplankton species.
Inferring the identities of potential hosts using multivariate statistics.
The eukaryotic phytoplankton community composition was determined on each sampling date by D. L. Findlay at the Freshwater Institute, Winnipeg, Manitoba, Canada. A portion (125 ml) of the integrated (0- to 16-m) water samples were fixed in Lugol's fixative, and 10-ml subsamples were allowed to settle for 24 h in Utermöhl settling chambers (27) as outlined by Nauwerck (14). The cells were counted and identified to species level using an inverted microscope with phase contrast. The composition of the phytoplankton community on each sampling date was converted into a presence/absence binary matrix.
Sequences from viruses thought to infect different hosts based upon the genetic-distance analysis were traced back to specific plugged DGGE bands. On each sampling date, the presence or absence of each of these bands was converted into a binary matrix using GelCompar II (Applied Maths). For the purpose of this analysis, each band was treated as a single genotype, although different sequences can have the same melting temperature and migrate to the same point on the gel.
A monotonic multiple-dimensional scaling (MDS) analysis with the presence/absence data for phytoplankton species and viral sequences from each lake was conducted to infer potential hosts (SYSTAT v. 11). Phytoplankton species and viral pol sequences that occupied the same space in an MDS analysis cooccurred 100% of the time (see Fig. S1 in the supplemental material). Although the cooccurrence of viral DNA pol fragments and phytoplankton species suggests that the two are associated, it does not necessarily mean that the cooccurring taxa are the viral hosts. Moreover, the analysis does not take into account time lags that may obscure relationships between viruses and phytoplankton.
Nucleotide sequence accession numbers.
The sequences obtained in this study have been deposited in GenBank under accession numbers EU408225 through EU408244.
|
|
|---|
99% of the marine and freshwater environmental sequences were more closely related to viruses infecting M. pusilla than other Phycodnaviridae isolates. Third, genetic-distance analysis indicated that the ELA freshwater environmental sequences likely originated from viruses that infect at least nine different species of phytoplankton. The significance and ecological importance of these findings are discussed below.
Phylogenies of freshwater pol sequences.
DNA pol sequences from the ELA DGGEs (see Fig. S2 in the supplemental material) clustered in several clades within the Phycodnaviridae that are distinct from isolate sequences (Fig. 1). The exception was L227September2a, which clustered with MpV-SP1. Seventy-five percent of the ELA sequences clustered within the previously identified groups (18) Freshwater and Marine and Freshwater I and II, while 25% fell outside these clades. There were no obvious temporal or spatial patterns among the ELA sequences, aside from Freshwater II, which was mostly comprised of sequences from Lake 239 (Fig. 1). Similar to previous results (18), the sequences were more closely related to those from viruses (MpV) infecting the marine phytoplankton M. pusilla than to sequences from the PBCV group of viruses, which infect freshwater Chlorella-like algae.
![]() View larger version (22K): [in a new window] |
FIG. 1. NJ tree of freshwater phycodnavirus inferred amino acid pol sequences. NJ bootstrap values (10,000 replicates) and ML support values of >50 are indicated at the nodes (ML/NJ). The three freshwater clades identified by Short and Short (18) are indicated. Boldface annotations indicate the lake and collection date. Groups of phycodnavirus isolates are indicated on the right of the tree (Table 2). FsV, Feldmannia sp. virus; ESV, Ectocarpus siliculousus virus; ASFV, African swine fever virus (Asfarviridae). The scale bar represents the number of amino acid substitutions per residue.
|
![]() View larger version (16K): [in a new window] |
FIG. 2. NJ tree of environmental phycodnavirus inferred amino acid pol sequences. The tree includes sequences from some Phycodnaviridae isolates, freshwater and marine environmental samples, and GOS BLAST hits (Tables 2 and 3). NJ bootstrap values (10,000 replicates) and corresponding ML support values of >50 are indicated at the nodes (ML/NJ). The ELA and other freshwater sequences (boldface), phycodnavirus isolates (boxed), and the African swine fever virus (ASFV) outgroup are as described in Table 2 and the legend to Fig. 1. Marine environmental sequences (italics) are from several locations, including the Gulf of Mexico and the Pacific and southern oceans (Table 3). The top 20 matches to each of the freshwater DNA pol sequences in the GOS are indicated by their accession numbers. The underlined sequences are the two GOS sequences that cluster nearest to freshwater sequences (see the text). The numbers indicate the potential host species determined from a genetic-distance analysis between viral pol sequences. Distances between AVS-amplified sequences of >0.081 amino acid substitution per residue were used to infer freshwater and marine host species. The scale bar represents the number of amino acid substitutions per residue. FsV, Feldmannia sp. virus; ESV, Ectocarpus siliculousus virus.
|
Specificity of the AVS primers.
The AVS primers do not amplify some marine phycodnavirus isolates, including EhVs and Herterosigma akashiwo viruses (HaVs). This suggests that only a subset of Phycodnaviridae sequences are represented in data sets generated using these primers. The fact that 99% of the environmental pol sequences were more closely related to MpV than to other phycodnavirus isolates (Fig. 2) suggested that the primers might preferentially amplify MpV-like sequences. This was tested by BLASTing the same region of DNA pol from the complete genomes of EhV-86 and PBCV-1 (11, 29) against the GOS database. Not only did the EhV and PBCV pol sequences fall outside of the clade containing MpV (Fig. 1), they were also obtained without the use of the AVS primers. Moreover, as previously mentioned, EhVs do not amplify with the AVS primers. If the AVS amplicons are representative of the natural richness of the Phycodnaviridae, then the BLAST searches with the EhV and PBCV sequences should also be more closely related to MpV than to other isolates. All of the retrieved GOS sequences (n = 20) were most closely related to MpV (data not shown), indicating that primer bias was not responsible for AVS-amplified environmental sequences being most closely related to MpV.
Number of potential host species.
Since phycodnavirus isolates infecting the same host species typically cluster in monophyletic groups (Fig. 1) (4, 6), the number of host species should be reflected in the number of discrete clades (23), which have a genetic distance greater than that which separates viruses infecting the same species. However, there is evidence that exceptions occur (30). Distances within and between monophyletic groups of phycodnavirus isolates were determined from branch lengths (Table 4), and as there was a significant difference between the within- and between-group distances (Fig. 3 and Table 5) (ANOVA, F1,19 = 43.966; P < 0.0001), both groups were used as predictors in a discriminant analysis. Overall, unknown distances were placed in the correct group 95% of the time; however, within-group distances were always placed correctly. As a result, the upper 95% confidence interval around the mean within-group distance (0.081 amino acid substitution per residue) was used as the threshold to distinguish among discrete clades of viruses. Therefore, if the distance between two viral sequences was <0.081 substitution per amino acid, the sequences were assumed to have originated from viruses that infect the same host species. Applying this threshold suggests that the environmental DNA pol sequences came from viruses infecting 13 different freshwater and 20 different marine hosts. The 20 ELA sequences are calculated to have come from viruses infecting nine different hosts (Fig. 2). The within-group distances are similar for all six isolate groups (Fig. 3); however, this distance may have to be adjusted as more sequences from isolates become available. An exception to this relationship is PgV-102P, which reportedly clusters with CbVs rather than other PgVs (30), suggesting that lateral gene transfer of a polymerase gene has occurred or that this is an example of an expansion in host range for CbV.
|
View this table: [in a new window] |
TABLE 4. Genetic distances within and between phycodnavirus groupsa
|
![]() View larger version (12K): [in a new window] |
FIG. 3. Genetic distances within and between phycodnavirus groups. In the box plots of within- and between-host group distances, means, quartiles, and outliers are indicated, as are the ANOVA statistics from a comparison between the groups (Table 5).
|
|
View this table: [in a new window] |
TABLE 5. DA results and basic statistics from genetic-distance analysisa
|
Implications.
The results of this study have evolutionary and ecological implications. Phylogenetic analyses place most freshwater and marine phycodnaviruses in discrete evolutionary groups. Even though most of these clusters are not statistically well supported, the fact that they are clustered by environment suggests that this interpretation is valid. Furthermore, distance and MDS analyses allowed the number and identities of host phytoplankton species to be inferred. Such information can focus research on particular phytoplankton species and help unravel the interactions between viruses and hosts and the impact viruses have on structuring phytoplankton communities.
This work was supported by the Natural Sciences and Engineering Research Council of Canada through a graduate student fellowship to J.L.C. and a Discovery Grant to C.A.S., as well as by a grant from the Tula Foundation for sequencing and analysis.
Published ahead of print on 16 December 2008. ![]()
Supplemental material for this article may be found at http://aem.asm.org/. ![]()
Present address: Department of Ecology and Evolutionary Biology, University of California, Irvine, CA 92697. ![]()
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»