Previous Article | Next Article ![]()
Applied and Environmental Microbiology, January 2006, p. 252-260, Vol. 72, No. 1
0099-2240/06/$08.00+0 doi:10.1128/AEM.72.1.252-260.2006
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Biology Department, Woods Hole Oceanographic Institution, Woods Hole, Massachusetts 02543,1 Marine Biological Laboratory, Woods Hole, Massachusetts 025432
Received 23 June 2005/ Accepted 21 September 2005
|
|
|---|
|
|
|---|
Nitrogen (N) and phosphorus (P) are two critical macronutrients for E. huxleyi growth, and their availability can impact when and where E. huxleyi blooms are able to occur (20). Further, N and P starvation can influence CO2 exchange by changing rates of photosynthesis and calcification (24). For example, P starvation typically increases calcification rates relative to photosynthesis (25). In short, N and P availability in the field may influence bloom dynamics, calcification, and their concomitant impact on C cycling and on the ocean's ability to buffer changing CO2 concentrations in the atmosphere.
To cope with low macronutrient availability in nature, marine phytoplankton have evolved inducible systems that enable them to efficiently scavenge dissolved inorganic N (DIN) and dissolved inorganic P (DIP), the concentrations of which are often growth limiting in marine systems. Phytoplankton also have the ability to utilize N and P from a diverse suite of dissolved organic N (DON) and P (DOP) compounds (1, 5). The concentrations of DON and DOP often exceed those of DIN and DIP in surface waters, so these organic compounds can be an important nutrient source in DIN- or DIP-depleted environments, such as the oligotrophic oceans. Understanding the complexity of phytoplankton nutrient scavenging systems and how they are expressed in response to depletion of N or P in the ocean is an ongoing area of study for biological oceanographers. Previous work with E. huxleyi cultures suggests that this coccolithophore has the ability to scavenge nitrogen from diverse sources. For example, it is able to grow on several DON substrates as a sole N source, including formamide, hypoxanthine, and urea (28). E. huxleyi is also able to scavenge P from diverse sources, expressing the enzyme alkaline phosphatase under low-DIP conditions allowing for the hydrolysis of certain DOP compounds (13, 32). In fact, E. huxleyi is known for being a good competitor relative to other algae in low-DIP systems and elevating phosphate uptake at growth-limiting DIP concentrations (32). Although some N and P starvation-inducible proteins have been identified for E. huxleyi (13, 29), our transcriptional understanding of E. huxleyi biology and particularly nutrient scavenging and nutrient starvation responses is limited.
While genomic research with marine cyanobacteria is rapidly advancing our understanding of their role in the sea (12, 27), there are few genome sequences (3), differential gene expression studies (2, 22, 37), and transcriptome analyses with eukaryotic marine algae. In the case of coccolithophores, fundamental gaps in our molecular-level understanding of calcification and even basic N and P scavenging mechanisms remain. Gene expression analyses are one way to work towards closing these gaps, providing a dynamic link between the E. huxleyi genome and its cellular functioning in the ocean.
Methods for assessing gene expression on a genomic scale include DNA microarrays, massively parallel signature sequencing, differential display reverse transcriptase PCR (RT-PCR), subtraction hybridization, and serial analysis of gene expression (SAGE) (38). In long SAGE (33), a short, 21-bp sequence tag from the most poly(A) proximal NlaIII restriction site of an mRNA molecule is used to uniquely identify the source gene from within the genome. Short sequence tags are sampled from all NlaIII-positive transcripts in an mRNA sample and are linked together to form long concatenated molecules that are cloned and sequenced. Quantification of all tags provides a relative measure of gene expression (i.e., mRNA abundance). SAGE thus provides both the identities of expressed genes and the levels of their expression. Detection level and sensitivity in SAGE are a function of sampling depth, i.e., the more tags sampled, the more likely detection of rare transcripts and the stronger statistical resolution of differential abundance of transcripts among mRNA samples. Of the aforementioned approaches, SAGE is advantageous in that it requires no a priori knowledge of the genome, responsive genes, or gene sequences and directly samples transcript abundance. The material benefits of SAGE are its simplicity, low cost, and efficiency. Perhaps the most common approach to expression studies with eukaryotic marine algae involves expressed sequence tag (EST) sequencing (34, 39, 40). In contrast to EST sequencing, SAGE can provide a much deeper sampling of the transcriptome, is not subject to the difficulties involved in normalization, and is quantitative in nature.
Despite its clear value, SAGE is typically applied to model systems with available genome sequences (e.g., mouse, Arabidopsis thaliana) and its utility in nontraditional models has only recently been addressed (11). Here we used long SAGE to examine genes responsive to N or P starvation by sampling 21-bp sequence tags from three E. huxleyi libraries: replete, nitrogen starved (N), and phosphorus starved (P). Because P starvation resulted in increased calcification, we were also able to screen the P library for genes potentially responsive to changes in calcification. To our knowledge, these are the first transcriptome analyses of N and P starvation for a coccolithophore.
|
|
|---|
Total RNA extraction.
Approximately 2 x 107 cells were harvested (8,000 x g for 10 min at 22°C), and the RNA was extracted using TRI reagent (Sigma, St. Louis, MO) according to the manufacturer's instructions. Briefly, cell pellets were resuspended and lysed in 0.75 ml TRI reagent. After complete lysis of the cells, 0.15 ml chloroform was added. Total RNA was precipitated with 0.35 ml of isopropanol and washed with 75% (vol/vol) ethanol. The RNA pellets were solubilized in 50 µl diethyl pyrocarbonate-treated water. RNA concentrations were obtained with a UV spectrophotometer. Integrity of the total RNA was assessed by 1% (wt/vol) agarose gel electrophoresis.
SAGE.
SAGE libraries were constructed using
30 µg RNA isolated from extractions of replete, P, and N E. huxleyi cells following the I-SAGE Long kit protocol (Invitrogen, Carlsbad, CA). Recombinant pZEro1 clones produced by SAGE were purified using GeneMachines RevPrep Orbit (Genomic Solutions, Ann Arbor, MI) and were sequenced on an ABI 3730xl DNA sequencer (Applied Biosystems, Foster City, CA). Sequences collected were analyzed with software created at the Marine Biological Laboratory specifically for E. huxleyi SAGE analysis. The SAGE software extracts ditag sequences from the ABI 3730xl results according to the SAGE sequence grammar, parses out individual SAGE tags, excludes tags with sequence ambiguities, and reduces all SAGE tags to a look-up table of unique SAGE tag sequences and their observed frequencies among all of the E. huxleyi SAGE libraries. SAGE tags not found more than once in at least one SAGE library were excluded from analyses as putative sequencing errors, unless the tag sequence had a perfect match to available genomic and EST data. The unique tag sequences were mapped to all available E. huxleyi DNA sequences to determine the identities of expressed genes. For this, 11,880 genome and EST sequences were obtained from NCBI on 1 April 2005 and assembled with the computer program PHRAP (15). The 21-bp SAGE tags were mapped to assembled contigs and singletons based on exact sequence matches. Tags were then visualized with a generic model organism database using the Gbrowse interface (35) in the context of EST sequences and predicted open reading frames (ORFs). As SAGE tags are generated from the most 3' NlaIII restriction site of transcripts, tags with significant differential expression among the SAGE libraries were manually assigned to predicted genes based on their positions relative to ESTs and predicted ORFs. These genes were annotated by BLAST and other homology detection methods. SAGE tags were scored for differential expression among the three libraries by using the R statistic of Stekel et al. (36), a log likelihood ratio statistic which scores tags by their deviation from the null hypothesis of equal frequencies given the tag sampling depth for each SAGE library. Higher scores represent a greater deviation from the null hypothesis, while scores close to zero represent near constitutive expression. To reduce the effects of sampling error in highlighting differential expression, only tags with an R value of 2 or greater were considered for RT-PCR and other post-SAGE analysis.
RT-PCR.
RNA was extracted from cultures as described above. Genomic DNA contamination was removed from the replete, N, and P RNA samples with a Turbo DNA-free kit (Ambion, Austin, TX) according to the directions provided. Equivalent concentrations of total RNA from each condition were reverse transcribed to single-stranded cDNA by using an iScript cDNA synthesis kit (Bio-Rad, Hercules, CA) and amplified using gene-specific primers (Table 1). All genes and conditions were tested for genomic contamination using no RT controls. PCR reaction conditions were as follows: 15-µl total volume of 1x PCR buffer, 0.25 mM deoxynucleoside triphosphates (Bio-Rad), 0.3 µM primers, 1 U Taq DNA polymerase (Bio-Rad) or Optimase high-fidelity polymerase (Transgenomic, Cambridge, MA), and 1 ng cDNA. PCR conditions were as follows: 1 cycle at 95°C for 5 min; 30 cycles at 95°C for 1 min, x°C for 1 min, and 72°C for 1 min; and 1 cycle at 72°C for 7 min. See Table 1 for annealing temperatures and amplicon sizes. PCR products were resolved on a 2% agarose gel, stained with ethidium bromide, and imaged with a ChemiImaging system (Alpha Innotech, San Leandro, CA).
|
View this table: [in a new window] |
TABLE 1. PCR primers and amplification conditions for the genes used in the RT-PCR validation of the long SAGE analysisa
|
|
|
|---|
![]() View larger version (9K): [in a new window] |
FIG. 1. Growth curves of E. huxleyi under replete, N, and P conditions. Graphs are plotted as cell number (filled symbols) and nutrient concentration (open symbols) versus day in culture, noting the difference in scale of the y axes. Error bars denote standard errors of the means (n = 4). Filled arrowhead indicates point of harvest.
|
|
View this table: [in a new window] |
TABLE 2. Summary of library characteristics, including calcification data and long SAGE analyses
|
2) to reduce the effects of sampling error, we identified a total of 38 unique tags that were up-regulated twofold or greater in the N library and 74 unique tags that were up-regulated twofold or greater in the P library (Table 2). In addition, a total of 4 tags were uniquely present in the N library, and 54 tags were uniquely present in the P library (Table 2). Approximately 48% of all differentially expressed tags could be mapped to publicly available sequence data. Expression data for tags that map to genes with a putative function are presented in Table 3. The remaining tags either map to E. huxleyi sequence data with no database homology, match more than one EST or genome sequence (unresolved tags), or do not have a sequence match (orphan tags) and apparently represent nascent E. huxleyi sequences (Table 4) (also see the supplemental material). The five most highly expressed (i.e., greatest change [n-fold]) tags encompass these tag categories (Table 4). The complete data set including all of the differentially expressed genes is available in Table S1 in the supplemental material. Additional information on all the gene tags, including those that are not differentially expressed, is available on our website (http://gmod.mbl.edu/emiliania_huxleyi). |
View this table: [in a new window] |
TABLE 3. Successfully annotated differentially expressed (R 2) SAGE tags
|
|
View this table: [in a new window] |
TABLE 4. Differentially (R 2) expressed tags with the greatest change (n-fold) in the N and P libraries
|
![]() View larger version (25K): [in a new window] |
FIG. 2. (A) Differential expression of long SAGE tag 285. Frequency of tag 285 in the SAGE libraries generated from replete (control) and N E. huxleyi (top). RT-PCR amplification of the fucoxanthin chlorophyll a/c binding protein gene that maps to tag 285 (115 bp) and type 1 actin (116 bp) from control and N conditions (bottom). (B) Differential expression of long SAGE tag 45. Frequency of tag 45 in the SAGE libraries generated from control and N E. huxleyi (top). RT-PCR amplification of the EST sequence which maps to tag 45 (267 bp) and type 1 actin (116 bp) from control and N E. huxleyi (bottom). (C) Differential expression of long SAGE tag 12112. Frequency of tag 12112 in the SAGE libraries generated from control and P E. huxleyi (top). RT-PCR amplification of a 286-bp fragment of the putative inorganic pyrophosphatase gene that maps to tag 12112 and a 116-bp fragment of type 1 actin from control and P samples (bottom). All PCRs were performed with equal template concentrations.
|
|
|
|---|
Validation.
Nearly half of the long SAGE tags could be mapped to gene sequence data from an E. huxleyi strain. Although this mapping is most straightforward with an available genome sequence, we show herein that it is tractable to map these sequence tags to existing sequence information in the absence of a genome. It is worth noting that the data mapped to multiple strains, suggesting that analysis of these SAGE data are not be restricted to strain 1516. Using RT-PCR, we examined the relative expression patterns of three tags (285, 45, and 12112), which mapped to publicly available sequence data and were either up- or down-regulated in one of the libraries. In all cases, RT-PCR generated an expression pattern similar to that observed with the long SAGE analysis. Although these tags were consistently differentially regulated, type 1 actin was constitutively expressed across the conditions tested in this study. Particularly striking are the results for tag 12112, which amplified from the P sample but not the replete sample and is uniquely present (four copies) in the P library. In this case, long SAGE results could be confirmed even for a low copy number tag with a small R value. Overall, these validation results demonstrate the value of SAGE libraries as a tool for transcriptome profiling, where subtle changes in expression, not just presence/absence, can be examined across multiple conditions. With the increasing number of whole-genome sequences or EST collections for diatoms, dinoflagellates, and other groups of eukaryotic marine phytoplankton, long SAGE is a promising new approach for transcriptome profiling with these different taxa. However, long SAGE is equally promising for phytoplankton lacking available EST or genome information, as high-throughput methods exist to identify full transcripts from SAGE tag sequences (6, 7). In some cases, SAGE tag sequences have been used to prime 5' rapid amplification of cDNA ends for isolation of nearly complete transcripts (A. G. McArthur, unpublished data).
Nitrogen responses.
Little is known about transcriptional responses to N starvation in E. huxleyi. However, many algae become chlorotic, with a decrease in light-harvesting pigments, under N starvation, and the N library was characterized by the down-regulation of genes mapping to multiple fucoxanthin chlorophyll a/c binding proteins (fcp) of the E. huxleyi light-harvesting complex. RT-PCR validation confirmed the regulation of one fcp (tag 285) (Fig. 2). The expression of this gene is also down-regulated by N conditions in Pleurochrysis carterae (10), suggesting this may be a common transcriptional response to N starvation. Of the 38 up-regulated tags (e.g., 255, a putative 21-hydroxylase-like P450 protein) in the N library, only 6 could be assigned a putative function (Table 3). Many of the genes up-regulated in the N library were either orphan tags or mapped to sequence with no database homology (e.g., Table 4). This observation emphasizes how SAGE can be an important tool for the identification of new genes as well as for the regulation of genes with no putative function. For example, tags 1083, 294, 827, and 1041 (Table 4) are all orphan tags which do not map to available sequence data but are strongly up-regulated eightfold or more in the N library. Tag 128 maps to an EST sequence without a significant BLASTX hit and thus has an unknown function (Table 4). Although the function of the gene is unclear, the specific N regulation observed with our long SAGE analysis (Table 4) suggests an involvement with N scavenging or N homeostasis. Here long SAGE has identified several new genes or gene targets for more-directed study of N scavenging in this organism.
Phosphorus responses.
E. huxleyi blooms in many oligotrophic, low-DIP areas, such as the North Atlantic (17), and has a strong physiological response to P starvation in culture, inducing several phosphate-regulated surface-associated proteins, as well as increasing alkaline phosphatase activity and phosphate uptake rate (13, 32). As was the case with the N library, we have identified many orphan tags and tags which map to sequences with no database homology that are up-regulated in the P library. These tags warrant further study, yet the overall results from the P library suggest a robust transcriptional response to P starvation, with the up-regulation of several genes likely to be involved in P scavenging or P metabolism. One previously observed response to P deficiency in eukaryotic algae is the up-regulation of genes encoding phosphate-repressible permeases, allowing cells to transport phosphate with high affinity (8). In our analysis, tag 9469 maps to a putative E. huxleyi phosphate-repressible permease (9) that is up-regulated over fourfold in the P library. Although this putative phosphate-repressible permease was previously identified (9), this is the first data confirming its regulation by phosphate supply. E. huxleyi can increase phosphate uptake under P-deficient conditions (32), and this gene is likely involved with this process.
Tag 12112 is uniquely present in the P library (Table 3), and its expression was confirmed with RT-PCR. This tag has been successfully mapped to a complete ORF with homology to soluble inorganic pyrophosphatases (sPPase) (data not shown). Transcriptional activation of sPPase genes has been observed for P-starved cyanobacteria (14), and genes for sPPase have been identified for a number of protists (30), but the presence and P regulation of an sPPase gene has not previously been demonstrated with a coccolithophore.
An additional tag (24557) mapped to a gene with a putative role in polyphosphate metabolism. Polyphosphate is ubiquitously found in microorganisms and is either demonstrated to be or thought to be involved in diverse biological functions, including acting as a reservoir for inorganic phosphate (Pi) (18). In Saccharomyces cerevisiae, genes for polyphosphate synthesis and polyphosphate hydrolysis are both up-regulated in response to low extracellular phosphate as part of the PHO pathway involved in the acquisition of phosphate (23). In E. huxleyi, tag 24557, a putative polyphosphate synthetase, was up-regulated over ninefold in the P library (Table 3). As highlighted above, we observed the up-regulation of a putative sPPase (tag 12112) which releases Pi from pyrophosphate, but we were unable to map any tags to a putative exopolyphosphatase, a processive enzyme that releases the terminal orthophosphate group from linear polyphosphates and would be required for the degradation of polyphosphate. It is important to note that a transcript encoding an exopolyphosphatase could be present in the P library but not up-regulated sufficiently to be detected in this analysis. The seemingly paradoxical increase in the cell's ability to convert Pi into polyphosphate in response to P starvation might represent a strategy for accumulating and preserving Pi. Studies with yeast suggest that polyphosphate accumulation is required, presumably as a sink, to sustain a high rate of Pi uptake (23), and Escherichia coli has been shown to transiently accumulate polyphosphate in P-deficient medium (31). In E. huxleyi, where P deficiency results in the up-regulation of a phosphate permease (tag 9469) and increased rates of P uptake (32), the production of polyphosphate may be an important short-term storage pool for Pi. Ultimately the ability to both store and mobilize polyphosphate might help explain why E. huxleyi is such a good competitor for P in the oceans, where a given cell may transiently experience fluctuations in DIP concentration.
The annotation of the long SAGE data yielded some unexpected results. Notably, there was up-regulation of a tag (5002) which maps to a putative sulfate adenylyltransferase (ATP-sulfurylase) in the P library. In plants, this enzyme mediates the first step in the metabolism of sulfate (21), and the up-regulation of this gene would be a potential mechanism for E. huxleyi cells to accumulate S during P deficiency. E. huxleyi is known to produce high levels of dimethylsulfoniopropionate (DMSP), and in diatoms there can be an accumulation of intracellular DMSP in nutrient-limited batch cultures (4). This accumulation of DMSP may be fueled by increased sulfate metabolism. We are in the initial stages of understanding E. huxleyi S metabolism, and additional expression studies of the putative ATP-sulfurylase and the extent to which its expression is linked to DMSP accumulation are certainly warranted. Also, we identified SAGE tags from rRNA in our libraries. This has been observed with other organisms from the genera Giardia and Trypanosoma (McArthur, unpublished), as rRNA contains stretches of poly(A). In most cases, these tags are not differentially expressed, but results from the P library suggest that mitochondrial and plastid rRNA transcripts are more abundant in the P condition (Table 3). There was some up-regulation of nuclear rRNA genes in the P library as well, which is sometimes observed for Trypanosoma spp. at different cell densities (McArthur, unpublished). At this juncture, the mechanisms driving increased abundance of mitochondrial rRNA and plastid rRNA are unclear; however, this may be related to increased calcification in the P condition. Additional targeted genome expression studies in this organism over a greater range of conditions would address the consistency of this observation.
Calcification.
Calcification was enhanced in the P condition, and nearly twofold more genes were up-regulated in the P library than in the N library. In addition, a total of 54 tags were detected only in the P library. The high number of unique and up-regulated tags in the P library suggests that we may be observing the up-regulation of genes involved in calcification or calcium homeostasis (e.g., tag 11952 [calmodulin]) in addition to those involved in P scavenging. Substantial efforts have been made to identify the genes involved in calcification, and the most recent approaches have compared E. huxleyi EST libraries (40) or used suppressive subtractive hybridization with calcifying and noncalcifying cells (22). Of the five tags most highly expressed in our long SAGE P library (Table 4), none clearly map to the most prevalent ESTs in the calcifying library analyzed by Nguyen et al. (22). Moreover, of the tags up-regulated or unique to the P library, over 60 are orphan tags, which do not map to any of the genes identified to date in calcifying cells (22, 40). Considering the sensitivity of the transcriptome to culture conditions and the differences between EST, suppressive subtractive hybridization, and long SAGE analyses, these inconsistencies are not surprising; rather, these data underscore the value of using multiple genomic approaches over a range of conditions to identify the genes involved in uncharacterized metabolic pathways such as calcification. Further analysis of the tags up-regulated in our P library will be helpful for future studies of transcriptional responses associated with increased calcification, and these data emphasize the potential of long SAGE analyses for building our understanding of the genes involved in calcification as well as other poorly understood pathways, such as alkenone biosynthesis.
Conclusions.
Genomic tools such as SAGE hold great promise for phytoplankton ecology and biological oceanography, where the molecular genetics of key marine organisms and the biogeochemical processes they mediate are still poorly understood. With our initial SAGE analyses, we have identified many new E. huxleyi sequences, assigned regulation data to sequences of unknown function, and highlighted previously uncharacterized aspects of E. huxleyi P scavenging, including polyphosphate synthesis. As discussed in the introduction, both N and P bioavailability can have a substantial influence on the timing, magnitude, and C cycling of E. huxleyi blooms. Our data demonstrate the nutrient regulation of a number of potential targets for RT-PCR assays in field populations of E. huxleyi, where the expression patterns of key genes relative to actin could be used to identify the N or P status of a population. As sequence data from the E. huxleyi genome project are released, an increasing number of E. huxleyi long SAGE tags will be resolved to genes in our public database (http://gmod.mbl.edu/emiliania_huxleyi), thus providing a dynamic tool for future gene discovery, transcriptome profiling, and genome annotation efforts. As scientists work toward an understanding of how E. huxleyi influences the marine C cycle and global climate, further analysis of the genes examined herein will help identify the molecular underpinnings that drive the cellular functioning of this organism in the sea.
This work was supported by the Woods Hole Oceanographic Institution Ocean Life Institute, the J. Lamar Worzel Assistant Scientist Fund, and the Frank and Lisina Hoch Endowed Fund. A.G.M., S.R.B., and M.J.C. were supported in part by the Marine Biological Laboratory's Program in Global Infectious Diseases, funded by the Ellison Medical Foundation. Computational resources were provided by the Josephine Bay Paul Center for Comparative Molecular Biology and Evolution (Marine Biological Laboratory) through funds provided by the W. M. Keck Foundation and the G. Unger Vetlesen Foundation.
Supplemental material for this article may be found at http://aem.asm.org/. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»