Previous Article | Next Article ![]()
Applied and Environmental Microbiology, April 2006, p. 2730-2737, Vol. 72, No. 4
0099-2240/06/$08.00+0 doi:10.1128/AEM.72.4.2730-2737.2006
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Department of Biological Sciences, University of Warwick, Gibbet Hill Road, Coventry CV4 7AL, United Kingdom
Received 31 October 2005/ Accepted 2 February 2006
|
|
|---|
|
|
|---|
Genome sequencing has highlighted some of the underlying genomic differences that underpin the physiology of specific niche-adapted groups in the closely related genus Prochlorococcus, in which there are lineages that are adapted to niches in the surface or depths of a water column (12, 33), but much less is known about the more widespread genus Synechococcus. In order to begin to define more specifically differences at the genomic level that have allowed marine Synechococcus lineages to adapt to specific niches, we utilized a suppression subtractive hybridization (SSH) approach to identify genes that might be considered clade or strain specific.
SSH was developed and has been mostly used to identify genetic differences between virulent and avirulent strains of various pathogens (2, 11, 17, 29, 45), but more recently it has also been used to identify genes unique to the saxitoxin-producing cyanobacterium Anabaena circinalis (31), as well as to identify genetic diversity in an environmental metagenome (15). In this technique pools of genomic DNA from a bacterial strain of interest (tester) are depleted, by hybridization and PCR, by sequences that are also in a reference strain (driver). The remaining highly enriched fraction of tester-specific sequences is then cloned and sequenced. We chose this technique over, for example, DNA-DNA hybridization using a microarray, because the latter technique identifies only genes that are present in the reference genome and not genes that are absent.
In order to gauge the utility of the SSH approach for assessing genomic differences in the marine genus Synechococcus, Synechococcus strains WH8103 (clade III) and WH7803 (clade V) were used as the tester and driver (and vice versa), respectively, to identify genes "unique" to each strain. Strain WH8103 is closely related to another clade III strain, strain WH8102. The complete genome of Synechococcus sp. strain WH8102 was recently published (28). Hence, our SSH approach should also identify the genes that are potentially clade specific (i.e., present in both WH8103 and WH8102 but not in WH7803). Synechococcus sp. strains WH7803 and WH8103 are axenic, well-characterized laboratory strains that represent different Synechococcus clades (13, 44). These strains have very similar DNA base ratios, which helps rule out genomic differences due to G+C content. Furthermore, several physiological differences between these strains with regard to the composition of the light-harvesting pigments and the ability to swim or use urea have already been identified (9, 44). Together, these features should allow rigorous testing of the SSH technique for identifying genomic differences which ultimately could define niche-specific genes (i.e., genes important in open-ocean regions or in coastal or up-welling regions) in these important photoautotrophs.
|
|
|---|
SSH.
Genes unique to either Synechococcus sp. strain WH8103 or Synechococcus sp. strain WH7803 were identified by SSH using first WH8103 as the tester and WH7803 as the driver (to identify WH8103-specific sequences) and then WH7803 as the tester and WH8103 as the driver (to identify WH7803-specific sequences). SSH was performed using a CLONTECH PCR-Select bacterial genome subtraction kit (CLONTECH Laboratories, Inc., Palo Alto, CA) according to the manufacturer's instructions but with the following modifications.
A combination of AluI and HaeIII was used to digest Synechococcus sp. strain WH8103 and Synechococcus sp. strain WH7803 chromosomal DNAs, which yielded fragments that were less than 1 kb long. Adaptor ligation was performed according to the kit protocol, and this was followed by addition of 1 µl T4 DNA ligase and reincubation at 15°C overnight to ensure sufficient ligation of adaptors.
To determine the efficiency of adapter ligation when Synechococcus sp. strain WH8103 was used as the tester, PCR primers were designed to amplify a 1,146-bp fragment of nirA (accession number AF065403), a gene encoding nitrate reductase (5), containing no AluI or HaeIII recognition sites. The gene-specific primers were SSHLEW8103F (5'-CGA CAT CAC CAC AAG GCA AA-3') and SSHLEWH8103R (5'-TGA CCA ATA GTT GGG TTG CG-3'), while SSHPRIMER1 (5'-CTA ATA CGA CTC ACT ATA GGG C-3') was the adapter-specific primer. PCRs were carried out in 25-ml mixtures containing each deoxynucleoside triphosphate at a concentration of 200 µM, 2 mM MgCl2, 10 pmol of each primer, 1 µl of template (prepared as instructed in the kit manual), and 0.75 U of Taq polymerase in 1x enzyme buffer (Invitrogen, Carlsbad, CA). The amplification conditions comprised a denaturation step of 94°C for 3 min; 80°C for 1 min, at which time Taq polymerase was added; and 72°C for 2 min, followed by 30 cycles of 94°C for 1 min, 55°C for 1 min, and 72°C for 1.5 min and a final extension for 5 min at 72°C. The reaction mixtures were stored at 4°C prior to analysis. Products (10 µl of 25 µl) were resolved by gel electrophoresis on a 1.5% (wt/vol) agarose gel at 100 V. DNA was stained with ethidium bromide (0.5 µg ml1), visualized under short-wavelength UV, and photographed using a gel documentation system (UVP Inc., Upland, CA).
When Synechococcus sp. strain WH7803 was used as the tester, a 307-bp fragment of napA (accession number AAG45172), a gene encoding nitrate permease (A. F. Post, D. Lindell, A. Moyal, S. Solomon, and Q. Wang, unpublished data), containing no AluI or HaeIII recognition sites was used to perform a ligation efficiency analysis. The primers used were SSHLEW7803F (5'-GCT TGG CGA ACT TTG GTC ATT T-3') and SSHLEWH7803R (5'-CTG ATT GAA GTC CTG AGC AGA T-3'). PCRs were performed as described above, but the amplification conditions were 94°C for 3 min; 75°C for 1 min, at which time Taq polymerase was added; and 72°C for 2 min, followed by 27 cycles of 94°C for 30 s, 60°C for 30 s, and 68°C for 1 min and a final extension for 2 min at 68°C.
For the primary PCR using the adapter-specific primer SSHPRIMER1 (5'-CTA ATA CGA CTC ACT ATA GGG C-3'), PCRs were performed as described above, but the amplification conditions were 94°C for 3 min; 75°C for 1 min, at which time Taq polymerase was added; and 72°C for 5 min, followed by 30 cycles of 94°C for 1 min, 66°C for 1 min, and 72°C for 1.5 min.
For the nested PCR, primers SSHnestpri1 (5'-TCG AGC GGC CGC CCG GGC AGG T-3') and SSHnestpri2 (5'-AGC GTG GTC GCG GCC GAG GT-3') were used, and PCRs were again performed as described above, except that the template DNA was 1 µl of a 1:40 dilution of the PCR products from the primary PCR. The amplification conditions consisted of 94°C for 3 min, 75°C for 1 min, at which time Taq polymerase was added, and 72°C for 5 min, followed by 15 cycles of 94°C for 1 min, 68°C for 1 min, and 72°C for 1.5 min.
DNA sequencing.
The nested PCR products were TA cloned into the pCR2.1TOPO vector, and plasmid DNA was extracted using a QIAprep Spin Miniprep kit (QIAGEN GmbH, Hilden, Germany). PCR products were sequenced bidirectionally at the Warwick University sequencing facility using Big Dye Terminator version 3.1 chemistry (Applied Biosystems, Foster City, Calif.) and were examined with a 3100 genetic analyzer. The M13R primer (5'-CAG GAA ACA GCT ATG AC-3'), which annealed to the plasmid 74 bp upstream of the insert, was used as the sequencing primer.
Computer analysis.
SSH-enriched sequences from Synechococcus sp. strains WH8103 and WH7803 were initially analyzed by BLASTX searches (3) against the closed WH7803 genome (F. Partensky, personal communication). Sequences derived from WH8103 were aligned with the complete genome sequence of WH7803 using Washington University BLASTN (W. Gish, 1996 to 2004; http://BLAST.wustl.edu) with gapped alignments. Sequences with more than 85% nucleotide identity to the WH7803 sequence were considered false positives and were removed from the WH8103-specific set. The rest of the specific sequences were analyzed by BLASTX against the GenBank nonredundant database (http://www.ncbi.nlm.nih.gov). Analyses of WH7803 sequences were completed with the aid of the draft annotation of the WH7803 genome based on Glimmer, Critica, and GeneMarks software predictions with manual annotation. All of the WH7803-specific sequences exhibited 100% identity with the WH7803 genome sequence.
Nucleotide sequence accession numbers.
The sequences reported in this paper have been deposited in the GenBank database under the following accession numbers: for WH8103 SSH-enriched fragments, DU635204 to DU635313 and DU635372; and for WH7803 SSH-enriched fragments, DU635093 to DU635203.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Summary of Synechococcus sp. strain WH8103 SSH-enriched DNA fragments
|
Fifty-seven of the remaining WH8103 and WH8102 enriched fragments had significant hits with gene sequences in the database, and 55 of the hits were best hits in WH8102. Another 22 sequences had no significant hits when tBLASTX was used, indicating that they were derived from intergenic spacer regions or were entirely novel sequences. Thus, the majority of the differences found correspond to genes rather than noncoding DNA. The WH8103 and WH8102 enriched fragments were found to be evenly distributed around the Synechococcus sp. strain WH8102 genome, as shown in Fig. 1. The genes to which the fragments correspond are shown in Table 1. Several of the DNA fragments exhibited homology with the sequence encoding the ABC urea transporter, SYNW2442 (fragments A029, A055, and A097), or the sequence encoding SwmB (fragments A048, A062, A113, A130, A146, and A169); the former is required for high-affinity urea transport (42), and the latter is required for swimming motility (20). Both of these physiological traits (i.e., urea utilization and swimming motility) are known to be absent in WH7803 (9, 41, 44). SYNW2442 appears to be a homolog of urtA, a component of the urtABCDE gene cluster, and hence this component of the high-affinity urea transport system can be added to the urease structural genes as missing genes required for urea utilization by this strain. The genes for several other potential ABC transporter components appear in these WH8103- and WH8102-specific sequences, including the gene encoding a potential nitrate-like substrate binding protein for an ABC transporter, SYNW1415. The presence of such a gene is interesting given the fact that WH8102 contains a reported deletion after nucleotide position 617 (resulting in a frameshift that produces a stop codon 26 codons downstream [see EMBL entry CAE08977]) in the napA (or nrtP) gene encoding a nitrate permease belonging to the major facilitator superfamily (5, 35, 43). Certainly, in WH7803 the napA product seems to be the sole nitrate transporter, since interposon mutagenesis of the gene revealed a strain that could not grow on nitrate (A. F. Post, D. Lindell, A. Moyal, S. Solomon, and Q. Wang, submitted for publication). This leads to speculation about whether nitrate uptake by the napA product is a functional system in Synechococcus sp. strain WH8102 and whether the SYNW1415 product is part of a functional nitrate transport system together with other ABC transporter components encoded by SYNW1416 and SYNW1417 adjacent to SYNW1415 on the genome. Certainly, thus far, nitrate transport via an ABC-type system, encoded by the nrtABCD genes, appears to be restricted to freshwater cyanobacteria (24, 25).
![]() View larger version (44K): [in a new window] |
FIG. 1. Genomic distribution of SSH-enriched DNA fragments. SSH-enriched DNA sequences from Synechococcus sp. strain WH8103 were mapped on the complete genome of Synechococcus sp. strain WH8102 (outer circle), while SSH-enriched sequences from Synechococcus sp. strain WH7803 were mapped on the complete genome sequence of this strain (inner circle). The origin of both sequences is indicated by an arrow.
|
Succinate dehydrogenase (EC 1.3.99.1), encoded by sdhA (SYNW0591), a citric acid cycle enzyme, catalyzes the conversion between succinate and fumarate. It has been known for a long time that cyanobacteria have an incomplete citric acid cycle and lack
-ketoglutarate dehydrogenase (37). Interestingly, several other central carbon metabolism genes appear to be missing entirely in marine Synechococcus and Prochlorococcus genomes; these genes include genes encoding malate dehydrogenase, glucose-1-phosphate dehydrogenase, 6-phosphofructokinase, succinyl coenzyme A synthetase, and NAD- and NADP-dependent alcohol dehydrogenases (16). Succinate dehydrogenase appears to be an exception to this, however; the data presented here show that this enzyme is present in Synechococcus sp. strains WH8103 and WH8102 and Prochlorococcus spp. strains MIT9313 and SS120 but is not present in Synechococcus sp. strain WH7803 and Prochlorococcus sp. strain MED4.
Asparagine synthetase (EC 6.3.5.4), encoded by asnB (SYNW2453), is responsible for the following reaction, which results in the synthesis of asparagine: ATP + L-aspartate + L-glutamine
AMP + diphosphate + L-asparagine + L-glutamate. Several bacteria have been shown to possess a tRNA-dependent transamidation pathway in which aspartyl-tRNAAsn is converted to asparaginyl-tRNAAsn (4, 10). It has been demonstrated that in Deinococcus radiodurans (which lacks asparagine synthetase) this tRNA-dependent transamidation pathway is the sole route for asparagine synthesis (21). Roy et al. (34) have proposed that a truncated archaeal asparaginyl-tRNA synthetase that is responsible for transamidation of aspartyl-tRNAAsn was introduced into bacteria via lateral gene transfer, where it became the evolutionary ancestor of bacterial asparagine synthetase. The presence of asnB (encoding asparagine synthetase) in Synechococcus sp. strain WH8103 but not in strain WH7803 suggests that Synechococcus sp. strain WH7803 is forced to rely on a tRNA-dependent transamidation pathway for the synthesis of asparagine, whereas Synechococcus sp. strain WH8103 possesses an alternative tRNA-independent pathway for asparagine synthesis.
(ii) Genes specific to WH7803.
A total of 150 colonies were cloned into the pCR2.1TOPO vector, and 37 of the clones gave insufficient sequence data or had no insert. Of the remaining 113 sequences, 27 were found to be duplicates (i.e., they had sequences identical to the sequence[s] of another fragment[s]), which left 86 sequences for analysis (Table 2). The lengths of the fragments sequenced ranged from 114 bp to 897 bp. These sequences were identified by BLASTX comparison to the nonredundant database, as well as to the recently derived Synechococcus sp. strain WH7803 genome sequence (A. Dufresne, M. Ostrowski, P. Wincker, D. J. Scanlan, and F. Partensky, unpublished data). Forty of these sequences had significant hits to gene sequences in the database, and 20 had best hits to WH8102. These Synechococcus sp. strain WH7803-specific fragments were found to be evenly distributed around the genome (Fig. 1), suggesting that there are not distinct "islands" of genomic differences for both organisms. Similarly, only two of the fragments were found to consist of only noncoding sequences, while 14 fragments were a mixture of noncoding sequences and coding sequences and 35 fragments consisted of only coding sequences. Thus, the vast majority of the differences found corresponded to genes rather than to noncoding DNA. Many of the unique genes encode hypothetical or conserved hypothetical proteins, while 13 have an assigned function (Table 2). Among the "identifiable" strain-specific genes are genes that may be involved in deterring grazing or phage infection. For example, the fact that fragment B139 encodes a permease component of an ABC-type polysaccharide/polyol phosphate export system suggests that WH7803 may produce an extracellular polysaccharide, although this extracellular polysaccharide as well as being unpalatable to grazers may also be useful as a sink for excess carbon under nitrogen limitation conditions (26); alternatively, one of the fragments, fragment B114, potentially encodes a component of an ABC-type bacteriocin/antibiotic exporter, which may be important in alleviating bacterial competition via production of an antibacterial compound.
|
View this table: [in a new window] |
TABLE 2. Summary of Synechococcus sp. strain WH7803 SSH-enriched DNA fragments
|
Conclusions.
We describe here identification of several interesting candidate genes that may represent clade-specific characteristics that allow occupation of a specific environmental niche by a particular marine Synechococcus lineage. Although we are cautious about the existence of a plethora of clade-specific traits, it would not be unexpected that some specific physiological traits are shared by clades, given that it is the total suite of traits that correlates with the environmental niche occupied.
Certainly, it is interesting that genes potentially related to predator avoidance have been identified using the SSH approach, and each marine Synechococcus strain apparently has distinct genetic predator avoidance capabilities. It is known that resistance to protist grazing can be influenced by a variety of factors, such as high motility, reduced or increased size, and cell surface masking (for a review see reference 19), while modification of the cell surface is also clearly important with respect to cyanophage infection. Hence, it is not unreasonable to suggest that the high numbers of cyanophage in marine waters (38) and also potentially the grazers themselves are involved in a "biological arms race" that results in strong selective pressures for the host cells to evolve specific mechanisms that make them resistant to predation.
Clearly, the SSH approach used here is a cheap and relatively fast approach to identify strain-specific genes in the marine genus Synechococcus. These organisms differ from most pathogenic organisms that have been targeted previously with the SSH approach because they lack defined "islands" of genomic differences. Certainly, the strain-specific sequences could augment information contained in microarrays, expanding the scope of gene content or gene expression analysis and allowing multistrain chips to be developed. Clearly, strain variation also provides valuable insights into evolutionary processes, while finding sequences that are variable but common in strains should facilitate more precise and reliable taxonomy beyond the 16S rRNA gene nomenclature, as has been initiated using rpoC1 (39). Finally, mutagenesis studies (for an example, see reference 20) of the many conserved hypothetical genes and hypothetical genes identified here as strain specific should lead to a more fundamental understanding of the processes involved in niche adaptation.
H.J. was the recipient of a BBSRC-funded Ph.D. studentship. This work was also supported in part by the EU FP5 program MARGENES (grant QLRT-2001-0226 to D.J.S.).
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»