Previous Article | Next Article ![]()
Applied and Environmental Microbiology, September 2007, p. 5937-5944, Vol. 73, No. 18
0099-2240/07/$08.00+0 doi:10.1128/AEM.01065-07
Copyright © 2007, American Society for Microbiology. All Rights Reserved.
,
Department of Oceanography, University of Hawaii at M
noa, 1000 Pope Road, Marine Sciences Building, Honolulu, Hawaii 96822
Received 12 May 2007/ Accepted 10 July 2007
|
|
|---|
|
|
|---|
Most efforts to characterize genetic diversity among the virioplankton have focused on DNA viruses. The only published surveys of RNA viral diversity so far have been surveys of temperate coastal waters of British Columbia (5, 7). The first of these studies targeted viruses belonging to the picorna-like virus "superfamily," which is a group of positive-sense ssRNA viruses that have similar genome features and share conserved regions in several nonstructural genes, including the RNA-dependent RNA polymerase (RdRP) gene (P. Christian, C. M. Fauquet, A. E. Gorbalenya, A. M. G. King, N. Knowles, O. LeGall, and G. Stanway, presented at Microbes in a Changing World, San Francisco, CA, 23 to 28 July 2005). Based on the analysis of conserved RdRP sequences amplified from marine viral assemblages, it was demonstrated (5) that a distantly related group of novel picorna-like viruses are detectable and persistent in the marine environment and that the RdRP gene is a useful molecular marker for the determination of marine RNA virus diversity.
In a subsequent study of the same location, whole-genome shotgun libraries were constructed from the RNA virioplankton of two samples (7). While single-gene surveys can reveal novel variants of known genes, the whole-genome shotgun approach provides a more global assessment of the diversity in a community. Libraries were dominated by RdRP sequences that formed a diverse but monophyletic clade along with homologous sequences from the protistan viruses Heterosigma akashiwo RNA virus (HaRNAV) (10), Rhizosolenia setigera RNA virus (RsRNAV) (12), and Schizochytrium sp. ssRNA virus (SssRNAV) (15). The repeated detection of picorna-like viruses from coastal British Columbia with two independent methods suggests that picorna-like viruses are a consistent component of the RNA virioplankton in that area.
In this study, we wished to determine whether RdRP genes from picorna-like viruses are also detectable in coastal subtropical waters and, if so, how the sequences compare to those from the highly productive temperate waters of British Columbia. Our investigations showed that novel RdRP sequences are readily detectable in Hawaiian waters. Phylogenetic analysis suggested that these gene sequences represent new species and genera of RNA viruses, presumably infecting marine protists.
|
|
|---|
neohe Bay, a reef-protected embayment on the windward side of Oahu, Hawaii; Ala Wai Canal, an estuarine urban waterway in Honolulu, HI; and Monterey Bay, a productive cold water embayment located on the central California coast. Samples were collected from three sites in K
neohe Bay on each of two occasions and analyzed, and a single sample each from the Ala Wai Canal and Monterey Bay was analyzed (Table 1). |
View this table: [in a new window] |
TABLE 1. Station details
|
neohe Bay in July 2006 were pressure filtered (7 mm Hg), using a peristaltic pump, through a 0.22-µm-pore-size polyether sulfone membrane filter cartridge (Sterivex; Millipore, Billerica, MA), followed by a 0.02-µm aluminum oxide filter (Anotop; Whatman, Middlesex, United Kingdom). In the latter case, filtration was continued until the filtration rate decreased dramatically or dropped to zero (200 to 550 ml). Whole seawater samples collected from stations SB, D, and E in K
neohe Bay in March 2006 were filtered directly onto 0.02-µm aluminum oxide filters in the same manner but were limited to 50 ml. After processing, the filter inlets and outlets were sealed, and the filters were stored at –80°C until extraction.
Extraction.
Total nucleic acids were extracted from the aluminum oxide filters using a Masterpure complete DNA and RNA purification kit (Epicenter, Madison, WI) with a protocol slightly modified from the manufacturer's instructions. Briefly, 400 µl of 2x T+C lysis buffer containing 50 µg/µl proteinase K was added to a 3-ml syringe. The syringe was locked to the filter inlet, and the lysis buffer was gently pushed into the filter until the buffer just reached the outlet. A flame-sealed, sterile, 1,000-µl pipette tip was firmly inserted into the filter outlet, and the entire assembly was incubated for 10 min at 65°C in air. Afterward, the filter tip was removed, and the extracted material was gently evacuated through the outlet into a sterile microcentrifuge tube by applying pressure with the syringe. Extracts were immediately placed on ice. Salt-induced precipitation of protein-detergent complexes followed by alcohol precipitation and washing of total nucleic acids was carried out according to the manufacturer's instructions.
DNase treatment.
Precipitated total nucleic acids were resuspended in 10 µl Tris-EDTA buffer and then incubated with 1 U DNase I (Invitrogen, Carlsbad, CA) and 1x DNase I buffer for 15 min at room temperature to remove DNA from the sample. The reaction was terminated by adding 2.5 mM (final concentration) EDTA and incubating the preparation for 10 min at 65°C.
Reverse transcription (RT)-PCR.
(i) Primer design. Conserved regions of the putative RdRP sequences from the marine picorna-like viruses (5) were aligned using CLUSTAL X v1.83 with the Gonnet series protein matrix (16). The alignment was used to design four degenerate primer sets with various specificities. Three of the primer sets were designed to target subclades 1 to 3 (Mpl.sc1, Mpl.sc2, and Mpl.sc3) within the larger cluster but overlapped in their target ranges. A fourth primer set (Mpl.cdh) was designed based on the consensus degenerate hybrid oligonucleotide (CODEHOP) strategy (13) with the aim of capturing broader diversity across, and possibly outside, the cluster as it was defined at the time.
(ii) cDNA synthesis.
cDNA was synthesized using reverse transcriptase (Superscript III; Invitrogen, Carlsbad, CA) primed with random hexamers, Mpl.sc1R, Mpl.sc2R, Mpl.sc3R, or Mpl.cdhR. The reaction mixtures (total volume, 13 µl) contained 8 µl of extracted DNase-treated RNA template, 0.2 mM of each deoxynucleoside triphosphate, and either 100 ng (N6) or 5 pmol (all other primers) of primer. Samples were denatured at 65°C and then cooled on ice, at which point the reaction mixtures were supplemented with dithiothreitol (final concentration, 0.5 mM), 40 U RNase OUT (Invitrogen), and 200 U Superscript III (Invitrogen) to obtain a final reaction mixture volume of 20 µl. The reaction mixtures were mixed and brought to 25°C for 5 min, and this was followed by incubation at 50°C for 55 min (N6-primed reactions) or at the primer-specific annealing temperature (Table 2) for 45 min. All samples were incubated at 70°C for 15 min as a final termination step.
|
View this table: [in a new window] |
TABLE 2. RT and PCR primers
|
(iv) Cloning and sequencing.
Purified PCR products were cloned directly into the TOPO-TA vector (Invitrogen) and transformed into TOP10 DH5
competent cells (Invitrogen) according to instructions provided with the vector and cells. Clones were screened for inserts by PCR amplification with the universal M13 forward (–20; 5'-GTAAAACGACGGCCAGT-3') and M13 reverse (–27; 5'-CAGGAAACAGCTATGAC-3') primers. Amplified products of the correct size were sequenced by the ASGPB sequencing service at the University of Hawaii at M
noa with the M13 reverse primer.
(v) Phylogenetic analyses.
Sequences that were more than 98% identical at the nucleotide level were considered a single phylotype. Phylotypes were compared to sequences in the NCBI database with tBLASTx (3). Details of the viruses and environmental sequences used in phylogenetic analyses are shown in Table S1 in the supplemental material. Translated sequences of viruses were aligned using CLUSTAL X v1.83 with the Gonnet series protein matrix (16). Alignments were transformed into likelihood distances by using Mr Bayes v3.1.2 (2) and 1,000,000 generations.
Nucleotide sequence accession numbers.
Sequences have been deposited in the GenBank database, and the accession numbers are listed in Table S1 in the supplemental material.
|
|
|---|
A Bayesian phylogenetic analysis based on an amino acid sequence alignment resulted in single large clade with a Bayesian support value of 87 that contained all of the environmental RdRP sequences (phylotypes) from this study, RdRP sequences obtained from seawater samples collected in coastal British Columbia, Canada, and the RdRP sequences of three RNA virus isolates that infect marine protists (Fig. 1). Established families of picorna-like viruses that infect plants and animals were all resolved into monophyletic clusters with Bayesian support values of 77 to 100 (Fig. 1). The levels of amino acid identity of phylotypes to their nearest neighbors in the tree ranged from 38 to 98%, and the two phylotypes recovered in this study that were most divergent from one another (KB15 and KB23) shared only 24% amino acid identity.
![]() View larger version (31K): [in a new window] |
FIG. 1. Phylogenetic tree of picorna-like virus RdRP sequences: Bayesian maximum likelihood tree for RdRP amino acid sequences in the picorna-like virus superfamily. Regions of the tree are colored coded to indicate host type where known. The designations for sequences retrieved in this study are in bold type and begin with "KB" (for K neohe Bay sequences) or "MB" (for Monterey Bay sequences). Sequences designated B to D, sequences designated with numbers only, and the sequences designated JP-A and JP-B are environmental sequences from previously published investigations (5, 7). Amino acid alignments were transformed into likelihood distances with Mr Bayes v3.1.1 (2) using 1,000,000 generations. Bayesian clade credibility values are shown for relevant nodes. The Bayesian scale bar indicates a distance of 0.1. RTSV, rice tungro spherical virus; PYFV, parsnip yellow fleck virus; SDV, satsuma dwarf virus; NIMV, naval orange infectious mottling virus; BBWV, broad bean wilt virus; CPMV, cowpea mosaic virus; ALSV, apple latent spherical virus; CRLV, cherry rasp leaf virus; ERBV, equine rhinitis B virus; FMDV, foot-and-mouth disease virus; HRV, human rhinovirus; PV, poliovirus; DWV, deformed wing virus; VDV, Varroa destructor virus; DCV, Drosophila C virus; CrPV, cricket paralysis virus.
|
neohe Bay, a site characterized by high salinity and low nutrients, and from the Ala Wai Canal, a eutrophic estuarine drainage canal in the city of Honolulu (Fig. 2A). No identical phylotypes were retrieved from samples obtained in K
neohe Bay on two different occasions. The two most similar sequences, KB21 and KB22 from these samples, shared 86% identity. Phylotype MB1, amplified from a sample collected in Monterey Bay in California in 2006, shared 97% amino acid identity with the homologous region of an RdRP recovered from coastal British Columbia in 2000 (7).
![]() View larger version (26K): [in a new window] |
FIG. 2. Relationship between the distribution of phylotypes recovered and the sample source or the primers used for RT or PCR amplification. The Mpl clade, clipped from the tree constructed in Fig. 1, is displayed with each phylotype recovered in this study coded by shape and color to indicate the location and date of each sample (A), the RT primer(s) used that produced the phylotype (B), and the PCR primer set(s) that yielded the phylotype (C).
|
PCR amplification with the clade-specific primers Mpl.sc1 and Mpl.sc2 resulted in two distinct clusters of sequences, which together accounted for 22 of the 24 phylotypes detected (Fig. 2C). Mpl.sc3 failed to amplify a product in all of the samples tested. The remaining two phylotypes were detected only by the Mpl.cdh primers (Fig. 2C).
The relative abundances of different phylotypes recovered were compared for two clone libraries produced from the same sample (K
neohe Bay station SB), using the same PCR primers (Mpl.sc1) but different methods of cDNA synthesis (primer N6 or Mpl.cdhR). Phylotype KB3 dominated both libraries, accounting for 45% (10/28) of the sequences in the N6-primed library and 36% (8/22) of the sequences in the Mpl.cdhR-primed sample. Of the 16 phylotypes detected, 4 were present in both libraries, while 6 were unique to each library. Of the 12 unique phylotypes, 10 were represented by one sequence.
|
|
|---|
There is considerable diversity within the Mpl clade, suggesting that a wide range of marine protists are infected with picorna-like viruses. If we assume that the relationship between RdRP gene sequence divergence and phylogenetic affiliation is roughly consistent across the picorna-like viruses, then the distances between known phylogenetic groups can be used to infer the level of taxonomic diversity represented among the Mpl viruses. Such an analysis suggests that, at present, the Mpl clade consists of at least three families composed of 16 genera and 40 species (Fig. 3). This level of diversity is comparable to the existing taxa of picorna-like viruses that are officially recognized at present (Christian et al., presented at Microbes in a Changing World, 23 to 28 July 2005).
![]() View larger version (29K): [in a new window] |
FIG. 3. Assessment of phylogenetic divergence within the marine picorna-like clade based on Bayesian distances: cumulative frequency distribution of the minimum Bayesian distances between each unique, partial RdRP sequence in the Mpl clade and its nearest neighbor. For each distance step of 0.01, the number of sequences in the Mpl clade at that or a greater distance from its nearest neighbor is plotted. The distribution is divided to show the contribution of the new sequences from this study to the total number of known sequences within the Mpl clade. Labeled regions indicate the ranges of Bayesian distance that constitute family, genus, and species level divergence within currently established taxa of picorna-like viruses recognized by the ICTV. Specifically, the indicated ranges are based on the calculated Bayesian distances among all available homologous partial RdRP sequences for members of three families of picorna-like viruses (Comoviridae, Picornaviridae, and Sequiviridae). These three families were selected for analysis because each of them also includes ICTV-recognized genera and species. The upper value for each range is the shortest calculated distance between any two RdRP sequences derived from viruses that are considered to be members of different taxa. The lower value for each range is the greatest distance calculated between any two RdRP sequences derived from viruses that are still considered to be members of the same taxon. Multiple-sequence alignment and calculation of Bayesian distances were carried out as described in Materials and Methods.
|
The detection of an RdRP sequence from Monterey Bay in California that is very similar to a sequence recovered 6 years earlier from English Bay in British Columbia ca. 1,500 km to the north (5) suggests the presence of at least one picorna-like virus species that is persistent and relatively widespread in temperate coastal waters. In contrast, the lack of common sequences in samples from subtropical coastal waters of Oahu and temperate coastal waters of the North Pacific most likely reflects differences in the plankton community composition between these two habitats. The sample coverage and sequence coverage are still too low, however, to draw firm conclusions about the biogeography of picorna-like viruses.
The failure of the previously described RT-PCR protocol described by Culley et al. (5) to amplify RdRP sequences from a coastal Oahu sample suggested initially that picorna-like viruses were either absent or substantially different from those recovered from coastal British Columbia. Since the primers used by Culley et al. (5) were designed from the very limited number of sequences available at the time, we opted to design new sets of RT and PCR primers with various specificities, incorporating new information about marine picorna-like RdRP sequences, rather than to further test the original primer sets. The results obtained from use of these new primer sets in various combinations provide some insight into the most effective strategies for detecting RdRP genes from the environment.
The use of broad-specificity (Mpl.cdh) or nonspecific (N6) primers for the RT step resulted in recovery of the greatest sequence diversity. Some unique sequences were obtained with each of these primers, but they retrieved sequences over similarly broad phylogenetic ranges with substantial overlap. The data are insufficient to conclude whether the minor differences between them are significant. The Mpl.cdh primer, designed to prime RT of diverse RdRP sequences, might be useful for boosting the signal-to-noise ratio in cases where there is a very high background of nontarget RNA. However, given that the completely nonspecific N6 primer was effective even with samples that had not been prefiltered, this may be the best primer to use for the RT step, particularly since the range of RdRP diversity in the environment is still so poorly known.
Since it was not practical to design a single degenerate PCR primer set that could encompass the diversity within the entire Mpl clade, we used two strategies to maximize the range of sequences retrieved. The use of degenerate, subclade-targeted PCR primer sets Mpl.sc1 and Mpl.sc2 resulted in selective amplification of sequences that clustered in different portions of the Mpl clade, as was expected based on their design. The failure of Mpl.sc3 to amplify any targets suggests that the HaRNAV- and SssRNAV-like sequences were not common in our samples.
The consensus degenerate hybrid primer set, Mpl.cdh, retrieved only two distinct phylotypes, neither of which was recovered with the subclade primers. The hybrid primers are capable, in principle, of retrieving the sequences of novel, highly divergent members of a protein family because the 3' end is fully degenerate for short, conserved amino acid motifs. However, because the 5' portion of the primers is not degenerate, these primers are also likely to exhibit significant amplification bias in the presence of mixed targets. These properties may explain why the Mpl.cdh PCR primers recovered rather divergent sequences but very few sequences overall. The apparent lack of bias when the Mpl.cdhR primer was used for RT might be attributable to the fundamental differences between the kinetics of an RT reaction and those of a PCR. Since each of the four PCR primer sets appeared to target a different subset of the Mpl clade, we recommend screening samples with all four of the primer sets in order to increase the diversity of Mpl RdRP sequences recovered from environmental samples.
A major processing bottleneck and possible source of bias in previous studies of viral RdRP diversity has been the use of prefiltration followed by tangential flow ultrafiltration. Here we demonstrated that it is possible to recover RdRP sequences from virioplankton collected by direct filtration of relatively small volumes of seawater onto 0.02-µm-pore-size filters. In one case (Monterey Bay sample), amplification was successful only when the sample was prefiltered, perhaps due to the presence of PCR inhibitors in the whole-water extract. However, successful amplifications of RdRP were achieved for a number of other samples with no prefiltration. This suggests that direct filtration of whole seawater is a reasonable sample collection strategy that avoids the pitfalls associated with prefiltration, although in some cases additional steps to remove PCR inhibitors may be required.
Direct filtration involves less sample manipulation and is faster than tangential flow ultrafiltration, which should help to minimize some sources of bias, such as selective losses of viruses due to trapping on prefilters or decay of labile viruses during prolonged processing of concentrates. This procedure also avoids the economic and operational drawbacks of tangential flow filtration, namely, the expense of the equipment and the use of filters that must be cleaned between samples. The rapidity and simplicity of direct filtration should allow investigations of marine RNA viral diversity at greater temporal and spatial resolution than is possible when tangential flow ultrafiltration is employed.
In summary, we successfully applied a simplified virus sampling protocol along with a suite of new primers to recover novel picornavirus-like RdRP gene sequences from samples of subtropical and temperate waters. The results extend the known geographic range of picorna-like viruses in seawater and contribute substantially to the known RdRP sequence diversity. Phylogenetic analysis suggested that the recently described and very diverse clade of "marine picorna-like" viruses most likely represents a wealth of protistan viruses which have yet to be isolated and described but which are likely to have a significant influence on the composition and dynamics of the marine eukaryotic plankton.
neohe Bay and the participants in the ONR-funded program Layered Organization in the Coastal Ocean and Jim Christian, Captain of the R/V Shana Rae, for their assistance with sampling in Monterey Bay. This work was supported by grants OCE 06-0026 and EF 04-24599 from the National Science Foundation and by grant UAF06-0026 from the NOAA West Coast & Polar Regions Undersea Research Center.
noa, 1000 Pope Road, Marine Sciences Building, Honolulu, HI 96822. Phone: (808) 956-8629. Fax: (808) 956-9516. E-mail: aculley{at}hawaii.edu
Published ahead of print on 20 July 2007. ![]()
Supplemental material for this article may be found at http://aem.asm.org/. ![]()
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»