Previous Article | Next Article ![]()
Applied and Environmental Microbiology, March 2002, p. 1290-1296, Vol. 68, No. 3
0099-2240/02/$04.00+0 DOI: 10.1128/AEM.68.3.1290-1296.2002
Copyright © 2002, American Society for Microbiology. All Rights Reserved.
Departments of Earth and Ocean Sciences,1 Microbiology and Immunology,1 Botany, University of British Columbia, Vancouver, British Columbia, Canada, V6T 1Z41
Received 29 August 2001/ Accepted 10 December 2001
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
These findings inspired the development of the degenerate algal-virus-specific PCR primers AVS1 and AVS2, which amplify a 700-bp fragment of algal virus DNA polymerase (pol) genes (5). To permit unambiguous identification of PCR products amplified with the AVS primers, they were designed to span a short sequence encoding a highly conserved amino acid motif (YGDTDS) found in DNA polymerases (18). Phylogenetic analysis of amplified pol fragments revealed that cultured algal viruses formed a monophyletic group compared to double-stranded DNA viruses (viruses belonging to several families) (6). Recently, known algal viruses were assigned the virus family name Phycodnaviridae. Subsequently, the AVS primers were used to amplify unknown algal virus pol fragments from natural virus communities, demonstrating that molecular techniques can be used to study algal virus diversity (7).
Denaturing gradient gel electrophoresis (DGGE) is widely used to examine the diversity of PCR products. The technique separates DNA fragments based on sequence rather than length (12, 13) and has been widely used to examine the diversity of gene sequences in complex microbial communities (e.g., see references 1, 11, 21, and 34). Studies have indicated that DGGE analysis of PCR products amplified with AVS1 and AVS2 can be used to resolve genetically distinct algal viruses in artificial mixtures and to examine the diversity of natural algal virus communities (26, 27).
In general, it is agreed that viruses exert significant control on microbial communities and likely influence host community composition and succession (14, 29, 35). For example, Peduzzi and Weinbauer (23) demonstrated that virus enrichment influenced microbial community dynamics and succession, while Hennes et al. (17) used fluorescently labeled viruses to demonstrate that viruses could control the abundance of susceptible hosts. In addition, recent experiments with Pseudoalteromonas sp. revealed that the growth of virus-resistant cells was correlated to the lysis of sensitive cells (20). Furthermore, the virus-bacterium population dynamics model analyzed by Thingstad predicts that viruses ensure the coexistence of competing bacteria by infecting only the most abundant hosts or "killing the winner" (31). Similarly, the model of virioplankton control of community diversity proposed by Wommack and Colwell (36) predicts that virus community composition should be dynamic and that dominant viruses should be ephemeral. Examination of natural virus communities using pulsed-field gel electrophoresis supported these hypotheses. However, it should be noted that pulsed-field gel electrophoresis permits determination of genome sizes but not sequence identities of individual viruses. While it seems apparent that viruses influence host community composition, the effect of viruses on phytoplankton communities remains, for the most part, unexamined.
A long-term goal of our research is to understand the dynamics and effects of marine phytoplankton viruses. A critical part of this understanding requires examining the genetic composition of natural virus communities. Thus, the purpose of the present study was to use PCR and DGGE to recover and compare unknown algal virus sequences from the natural environment.
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
|
DGGE of second-stage PCR products was conducted using 20 to 55% (100% denaturant is defined as 7 M urea and 40% deionized formamide) linear denaturing gradient 6% polyacrylamide gels. Thirty microliters of products from SIB99 M and 50 µl from all other reactions were loaded into wells with 5 µl of 10x loading buffer (50% [vol/vol] glycerol, 0.33 M EDTA [pH 8.0], and 0.08% [wt/vol] bromophenol blue). Electrophoresis was carried out for 15 h in 1x TAE buffer (40 mM Tris-base, 20 mM sodium acetate, 1 mM EDTA [pH 8.5]) at 135 V and a constant temperature of 58°C using the D-code electrophoresis system (Bio-Rad Laboratories, Hercules, Calif.). Gels were stained in a 0.1x SYBR Green I (Molecular Probes, Eugene, Oreg.) solution overnight and were visualized and photographed as described above for agarose gels.
Sequencing.
To obtain sequence information from the denaturing gradient gels, individual bands were excised, reamplified, cloned, and sequenced. In total, 31 bands were excised and sequenced from the gradient gel. Excised bands are referred to by sample abbreviation and position in the gel from top to bottom; e.g., band BSB99-2 refers to the second band from the top of the lane loaded with sample BSB99. After excision, the gel slices were placed in sterile microcentrifuge tubes with 200 µl of 1x TAE buffer and were heated to 95°C for 5 min to elute the DNA band. One microliter of the resulting eluant was PCR amplified as described above for second-stage reactions. DNA fragments amplified from DGGE bands were cloned into the vector pGEM-5zf(+) (Promega, Madison, Wis.) by TA cloning. Subsamples from ligation mixtures were used to transform competent Escherichia coli JM109. To check if multiple sequences could be obtained from one DGGE band, cloned PCR fragments in three colonies from one transformation were sequenced (BSA99-1a, BSA99-1b, and BSA99-1c). Plasmid DNA was harvested from the cultures using a QIAprep spin miniprep kit according to the manufacturer's recommendations (Qiagen, Valencia, Calif.). Plasmid DNA (300 to 500 ng) was added to sequencing reactions using AmpliTaq FS BIGDYE Terminator cycle sequencing chemistry (Applied Biosystems, Foster City, Calif.) according to the manufacturer's recommendations. Excess Dye-Terminators were removed from the completed sequencing reactions using CENTRI-SEP spin columns (Princeton Separations, Adelphia, N.J.), and reactions were run in ABI Model 373 Stretch or ABI Prism 377 automated sequencers (Applied Biosystems) at the University of British Columbia sequencing facility.
Sequence analysis.
Sequences obtained from DGGE bands were compared to each other and known virus pol genes. The GenBank accession numbers for all sequences used in this study are listed below. All sequences were edited and translated using BioEdit (version 5.0.7) (15). Using the same program, a pairwise DNA identity matrix showing the proportion of identical residues was constructed from DGGE band sequences; only one sequence from any group of sequences with >98% identity was included in further analysis. Inferred amino acids of the unknown sequences were aligned with virus pol sequences from GenBank using the multiple sequence alignment program CLUSTAL W and the protein weight matrix BLOSUM (32). The alignment was then edited so that all sequences, with gaps included, were the same length. The alignment of the resulting 337 amino acid positions was used to construct maximum-likelihood and neighbor-joining trees using the programs TREE-PUZZLE (version 5.0) (28) and PHYLIP (version 3.57c) (10), respectively. Finally, phylogenetic trees were drawn and visualized using the program TreeView (Win32; version 1.6.1) (22).
Virus sequences used in phylogenetic analysis.
The designations and GenBank accession numbers of other virus sequences used in the phylogenetic analysis are as follows: Prasinovirus (MpV)-SP1, U32975; MpV-GM1, U32977; MpV-PB8, U32980; MpV-PL1, U32982; MpV-SG1, U32981; Prymnesiovirus (CbV)-PW1, U32983; CbV-PW3, U32984; Chlorovirus PBCV-1, M86836; Chlorovirus NY-2A, M86837; Chlorovirus CVA-1, U32985; Operational Taxonomic Unit 1 (OTU1), U36931; OTU2, U36932; OTU3, U36933; OTU4, U36934; OTU5, U36935; herpes simplex virus type 1, X04771; herpes simplex virus type 2, M16321; human cytomegalovirus, M14709; murine cytomegalovirus, M73549; Epstein-Barr virus, V01555; Autographa californica nuclear polyhedrosis virus (NPV), M20744; Bombyx mori NPV, D16231; Lymantria dispar NPV, D11476; African swine fever virus, X73330.
Nucleotide sequence accession numbers.
The 33 sequences obtained in this study were deposited in the GenBank database. The accession numbers for the sequences SO98-1, SO98-2, SO98-3, SO98-4, SO98-5, BSA99-1, BSA99-2, BSA99-3, BSA99-4, BSA99-5, BSA99-6, BSA99-7a, BSA99-7b, BSA99-7c, BSA99-8, BSB99-1, BSB99-2, BSB99-4, SIA99-1, MIB99-1, MIB99-2, PSB99-1, PSB99-2, PSB99-3, PSB99-4, PSC99-1, PSC99-2, BSA99-9, BSA99-10, BSB99-3, PSA99-1, PSC99-3, and SIB99 M-1 are AF405572 through AF405604, respectively.
| RESULTS |
|---|
|
|
|---|
|
The first sequences in each of the seven groups and the seven unique sequences were phylogenetically analyzed. Translation of the 14 DNA sequences revealed, in 12 instances, the presence of the conserved pol motif YGDTDS, and in one case the presence of a very similar sequence, HGDTDS (sequence BSA99-5). All of the sequences in the group starting with BSA99-9 did not contain amino acids resembling the conserved motif. As expected, this group of sequences without the pol motif included the band excised from lane M, which was loaded with products from the PCR with the primer AVS1 alone (SIB99 M). As stated previously, only 12 of the 14 analyzed sequences coded the pol motif YGDTDS, and one contained a similar sequence, HGDTDS. Not surprisingly, these sequences aligned at this motif with 24 virus pol sequences catalogued in GenBank. An amino acid alignment also revealed extensive sequence conservation among the unknown sequences and pol fragments from viruses belonging to the family Phycodnaviridae.
Phylogenetic analysis of aligned amino acids revealed robust relationships between sequences recovered from the gradient gel and other groups of known viruses (Fig. 3). Both algorithms (maximum-likelihood and neighbor-joining) used for phylogenetic reconstruction produced identical tree topology at major nodes. The topologies varied only at nodes near branch tips. Furthermore, the likelihood support and bootstrap values at nodes separating families of viruses were greater than 93%. All of the sequences recovered by DGGE were closely related to and clustered among pol sequences of known phycodnaviruses when compared to virus pol sequences from three other families of double-stranded DNA viruses, including Asfarviridae, Baculoviridae, and Herpesviridae. Several sequences from DGGE (SO98-1, SO98-2, SO98-3, BSA99-2, BSA99-5, SIA99-1, MIB99-2, and PSC99-1) were closely related to pol sequences from the genus Prasinovirus; prasinoviruses (abbreviated MpV) infect the prasinophyte alga M. pusilla. Four sequences (SO98-5, BSA99-1, PSB99-1, and PSC99-2) formed their own clade yet were also closely related to prasinoviruses, and one sequence (BSB99-2) formed a separate branch between prasinoviruses and viruses of the genus Prymnesiovirus (abbreviated CbV). Finally, viruses from the families Baculoviridae and Herpesviridae all clustered with viruses of the same family.
|
| DISCUSSION |
|---|
|
|
|---|
PCRs produced expected and unexpected products. The unexpected 550-bp products were the result of amplification with the primer AVS1 only, suggesting that they were PCR artifacts or the result of amplification of extant targets in the environment. PCR artifacts are often produced during reactions with mixed templates. For example, chimeric sequences form through recombination of similar PCR products and therefore have regions with different phylogenetic affinities. Because of this, many researchers suggest that nearest-neighbor analysis of different domains of suspect sequences can reveal PCR chimeras (19, 25, 33). To determine if any sequences recovered from DGGE were chimeric, phylogenetic trees were constructed from upstream, middle, and downstream regions of all sequences. The topology of phylogenetic trees constructed from the different regions did not differ, indicating that the sequences were likely not chimeric (data not shown). To check for the possibility of amplification of other targets, and to determine their identity, the 550-bp sequence SIB99 M-1 was compared to sequences in the public databases GenBank, DDBJ, and EMBL using the Basic Local Alignment Search Tool (BLAST) at the National Center for Biological Information web site (http://www.ncbi.nlm.nih.gov). This comparison revealed that the closest relatives of SIB99 M were phycodnavirus pol sequences; however, only the upstream primer region was related to the phycodnavirus sequences, and therefore no conclusions about the identities of the small amplicons were reached. Although the origins and identities of the small PCR products remain unknown, they did not interfere with DGGE analysis of desired PCR products.
DGGE facilitated the resolution of heterogeneous DNA fragments amplified from natural environments. However, one must be careful inferring diversity or richness from DGGE fingerprints. This is due to the fact that different sequences can migrate similarly and similar sequences can migrate differently. For example, SO98-1 and SO98-4 were 98% identical, yet SO98-1 migrated one-third of the gel length while SO98-4 migrated two-thirds of the gel. On the other hand, SO98-3 and BSB99-2 were only 36% identical, yet they migrated the same distance (Fig. 2B). Nonetheless, the community fingerprints obtained in this study were reproducible; multiple independent PCRs of the same sample analyzed by DGGE revealed the same fingerprint (data not shown). Furthermore, because DGGE resolved multiple bands from the PCR products of each reaction, qualitative comparison of PCR products was possible. For example, the banding patterns or "fingerprints" of samples from Salmon Inlet (SIA99 and SIB99) were identical and easily distinguished from other fingerprints. These samples were collected at the same time and location but different depths, suggesting that PCR templates were homogenously distributed with depth. This is reasonable considering the salinity and temperature of Salmon Inlet did not vary greatly with depth, indicating that the water and viruses were well mixed. On the other hand, the fingerprint of pol fragments amplified from surface water at Pendrell Sound (Fig. 2B, lane 12) is different than the fingerprints from greater depths (Fig. 2B, lanes 10 and 11). At this location the salinity of surface water was half of that at depth, indicating the water and viruses were stratified with depth. In general, environmental parameters agreed with qualitative lane comparison, reinforcing the idea that DGGE can be used to distinguish pol fragments amplified from natural algal virus communities.
Preliminary analysis of sequences recovered from DGGE revealed many that were more than 98% identical. On the other hand, the sequences BSA99-7a, BSA99-7b, and BSA99-7c from three different clones of PCR products reamplified from a single DGGE band were not identical. Although some of the nearly identical sequences may represent strain variation, the methods used in this study cannot distinguish the possibility that some were due to PCR and/or sequencing error. Therefore, only one sequence from each group of sequences with >98% identity was included in phylogenetic analyses.
To confirm that the sequences from DGGE were pol gene fragments, they were translated and searched for the conserved amino acid sequence corresponding to the putative pol catalytic site, YGDTDS (18). Of the 14 sequences analyzed 1 (BSA99-9) was not a recognizable pol fragment. This sequence was from the group that included the 550-bp sequence SIB99 M-1 mentioned previously. Twelve sequences encoded the pol catalytic motif, and one sequence (BSA99-5) encoded the unusual motif HGDTDS. There are two possible explanations for this sequence variation. First, HGDTDS is not an algal virus pol sequence and was the result of PCR and/or sequencing error. Second, HGDTDS may represent natural sequence variation in B-family pol genes. The amino acids histidine (H) and tyrosine (Y) are both polar, and therefore it is possible that this substitution does not affect pol function. Future studies will support the latter argument if they reveal that this unusual motif is a feature of some algal virus pol genes. On the other hand, if the HGDTDS sequence is not observed again, it is likely this pol motif was an artifact.
Phylogenetic inference revealed that the putative pol fragments recovered in this study were closely related to pol genes from the Phycodnaviridae. High likelihood support and bootstrap values indicated that the deep branches of the maximum-likelihood tree were robust (Fig. 3). For example, viruses belonging to the family Baculoviridae branched from the Asfarviridae outgroup with a likelihood support value of 98%. Similarly, viruses belonging to the closely related families Herpesviridae and Phycodnaviridae branched from the Asfarviridae and Baculoviridae with a support value of 96%. The Herpesviridae and Phycodnaviridae formed their own clusters with support values of 96 and 93%, respectively. In addition to having high support values, the phylogeny from this study resembled previously published pol phylogenies (3, 6). Therefore, it is likely that relationships obtained in this study provide accurate relationships of pol genes. Chen and Suttle (6) demonstrated that pol phylogeny reflects the phylogeny of total genomic DNA in M. pusilla viruses (MpV). Interestingly, the results of a previous phylogenetic analysis of phycodnavirus pol sequences based on nucleotide sequences (7) do not agree with the results presented in this study, yet previous phylogenies based on inferred amino acids (6) do. For example, the nucleotide-based phylogeny placed the Chlorella sp. viruses (CVA-1, PBCV-1, and NY-2A) more distant from MpV isolates than Chrysochromulina sp. viruses (CbV-PW1 and CbV-PW3) (7), whereas in our study CbV isolates were more distant from MpV isolates than Chlorella viruses. An explanation of this minor discrepancy may be that for the sake of reproducibility no eye refinement was performed on the present amino acid alignment. While the unrefined alignment presented in this work differed slightly from and may not be as accurate as previous alignments, a strong argument for unrefined alignments can be made because of the inherent irreproducibility of eye refinement. Nonetheless, these subtle differences do not affect the interpretation of our main result; all pol sequences obtained in this study clustered within the Phycodnaviridae, although outside of established genera.
Several sequences obtained in this study were not closely related to any known phycodnaviruses, yet many others (SO98-1, SO98-2, SO98-3, BSA99-2, BSA99-5, SIA99-1, MIB99-2, and PSC99-1) were most closely related to viruses in the genus Prasinovirus (MpV). However, because none of these sequences clustered among MpV, their identity remains unknown. Nonetheless, it is remarkable that sequences from the Southern Ocean were closely related to sequences from coastal British Columbia, in some cases being >98% identical, even though the environments were very different. This is the first time nearly identical viruses have been shown to occur in the Southern Ocean as well as in the temperate waters of the Northeast Pacific Ocean. It should be noted that although the samples were collected across the planet, the sample locations were all coastal and may have supported similar algal communities. In addition, the results of this study agree with previous work demonstrating the cosmopolitan nature of some algal viruses and their hosts (8). Five of the analyzed sequences obtained in this study formed two distinct groups within the Phycodnaviridae. These sequences may represent unknown genera within the Phycodnaviridae; further study will be necessary to refine the phylogeny of these unknown sequences. Previously, only one sequence (OTU5) has been recovered from the environment that was not closely related to other prasinoviruses (7). This study is the first to examine sequence diversity of algal viruses in more than one natural sample and provides further evidence that the PCR primers AVS1 and AVS2 can amplify unknown algal viruses from marine environments.
This study demonstrated that PCR and DGGE can be used to recover and identify unknown algal virus pol sequences from the natural environment and that very similar sequences can be recovered from different oceanic provinces across the planet. As more viruses are isolated and cultured from the environment and more sequences become known, we may find viruses that cluster with currently unidentified viruses. This will permit putative identification of uncultured virus sequences. In addition, we demonstrated that DGGE fingerprints can be used for qualitative comparison of virus communities. Therefore, the molecular methods described in this study can be used to relate the genetic composition of marine algal virus communities obtained from different environments. Furthermore, the ability to track changes in virus community composition may lead us closer to understanding the effects of viruses on host community structure. For example, it may be possible to relate changes in host populations to changes in the virus community. This study clearly demonstrates that the genetic composition of algal virus communities can be examined using the molecular techniques presented. Therefore, we are confident our long-term goal to understand the dynamics and effects of marine algal viruses can be obtained, in part, by examining the genetic composition of phytoplankton virus communities.
| ACKNOWLEDGMENTS |
|---|
This research was supported in part by an NSERC postgraduate scholarship to S. M. Short and NSERC research grants to C. A. Suttle.
| FOOTNOTES |
|---|
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| J. Bacteriol. | Microbiol. Mol. Biol. Rev. | Eukaryot. Cell | All ASM Journals |
|---|