Previous Article | Next Article ![]()
Applied and Environmental Microbiology, March 2007, p. 1514-1524, Vol. 73, No. 5
0099-2240/07/$08.00+0 doi:10.1128/AEM.01900-06
Copyright © 2007, American Society for Microbiology. All Rights Reserved.
,
Peter Burghout,2,
Tomas G. Kloosterman,1,
Hester J. Bootsma,2,
Anne de Jong,1
Peter W. M. Hermans,2 and
Oscar P. Kuipers1*
Department of Molecular Genetics, University of Groningen, Haren, The Netherlands,1 Laboratory of Pediatric Infectious Diseases, Radboud University Nijmegen Medical Centre, Nijmegen, The Netherlands2
Received 9 August 2006/ Accepted 30 December 2006
|
|
|---|
|
|
|---|
Several methods have been used to determine which genes are needed by S. pneumoniae in the various niches it occupies in the host (conditionally essential genes), such as transcriptome analysis (29), differential fluorescence induction (22), and signature-tagged mutagenesis (STM) (7, 16, 30). Of these methods, STM is the only one that enables negative screening and thus directly addresses the essentiality of a gene under a certain condition. However, STM is hampered by various drawbacks: every mutant has to be grown and stored separately, which makes it hard to generate large libraries with a more than 1x coverage of the genome; detection of the tags is cumbersome; and identification of transposon insertion sites is labor intensive (2). In the three in vivo STM screens performed with S. pneumoniae, no more than 8,000 mutants in total have been tested. Furthermore, there is little overlap in the identified genes in each study, indicating that the screens were not saturated, and more importantly, that not all existing conditionally essential genes have been identified (7, 16, 30).
Therefore, we set out to develop genomic array footprinting (GAF), a high-throughput method to identify conditionally essential genes in S. pneumoniae by using a combination of random transposon mutagenesis and microarray technology (Fig. 1). GAF detects the transposon insertion sites in a library by amplifying and labeling the chromosomal DNA adjacent to the transposon and subsequent hybridization of these probes to a microarray. Identification of transposon insertion sites in mutants that have disappeared from the library due to selection, which represent conditionally essential genes, is achieved by differential hybridization of the probes generated from the library grown under two conditions to an array (Fig. 1). We anticipated that the most critical step in the whole procedure is the specific amplification and detection of the DNA adjacent to the transposon insertion sites, as this determines the rate of false positives. Recently, several methods for identifying transposon insertion sites using microarrays have been described (6, 32, 33). In this paper, we describe (i) the pitfalls of several methods of amplification when applied to S. pneumoniae libraries and (ii) the successful development and validation of a specific and reproducible method for the detection of transposon insertion sites in S. pneumoniae. Finally we show, with a biological screen for genes essential for surviving zinc stress, that GAF is a reliable and effective method to identify conditionally essential genes.
![]() View larger version (8K): [in a new window] |
FIG. 1. Schematic representation of the GAF procedure. A large S. pneumoniae transposon library is grown under nonselective and selective conditions. Subsequently, chromosomal DNA containing transposons (gray rectangle) with outward-facing T7 RNA polymerase promoters (arrow with T7) is isolated from each population. The DNA is digested, and the DNA adjacent to the transposon insertion site is amplified using in vitro transcription with T7 RNA polymerase. The RNA is used in standard microarray procedures. Hybridization of these probes to a microarray will reveal which genes were disrupted in the mutants that disappeared during selection: those spots to which only material derived from the nonselective condition will hybridize (gray spots).
|
|
|
|---|
F was used as a host for the pR412 plasmid and the generation of pR412T7. All strains were routinely cultured in tryptone yeast broth or M17 broth containing glucose (GM17) as described previously (10). General DNA techniques were performed as described previously (10). Sequencing of the DNA adjacent to the insertion sites was performed using a single-primer PCR essentially as described previously (9). An in-frame deletion of czcD (SP1857) was constructed using pORI280 as described before (10) with primer pairs czcD-KO-1/czcD-KO-2 and czcD-KO-3/czcD-KO-4 (Table 1), which removed 852 bp of the open reading frame (ORF); a detailed description of its construction is provided elsewhere (T. G. Kloosterman, M. M. Pol, J. J. E. Bijlsma, and O. P. Kuipers, submitted for publication). |
View this table: [in a new window] |
TABLE 1. Primers used in this studya
|
The backbone of transposon donor plasmid pR412 (23), including the inverted left repeat and inverted right repeat of the transposon, was amplified with Pwo DNA polymerase (Roche) with a single phosphorylated primer PBMrIRPi (Table 1); PCR cycling conditions were as follows: 94°C for 4 min; 30 cycles of 94°C for 1 min, 55°C for 1 min, and 68°C for 5 min; and 68°C for 5 min. The spectinomycin-resistance cassette was PCR amplified with primers containing a T7 tag, i.e., PBMrStartT7 (Table 1) and primer PBMrEndT7 (Table 1); PCR cycling conditions were as follows: 94°C for 4 min; 30 cycles of 94°C for 1 min, 55°C for 1 min, and 68°C for 1.5 min; and 68°C for 4 min. The two PCR products were ligated to obtain the transposon donor plasmid pR412T7.
Generation of transposon libraries.
Random transposon libraries were routinely generated in S. pneumoniae as described before, using the in vitro mariner transposon (1) and the in vivo pGh:ISS1 system (21). Recombinant HimarC9 transposase was purified from the E. coli strain BL21(DE3) (Novagen) containing plasmid pET29C9 (13) essentially as described before (14). However, to prevent DNase contamination of the transposase stock, the DNase I treatment of the purification step was omitted from the protocol. For in vitro transposition, 1 µg of purified chromosomal DNA was incubated with 0.5 µg pR412T7, which contains the marinerT7 transposon, and 0.5 µg purified recombinant HimarC9 transposase in a 20-µl reaction mixture. After repair of the transposition reaction with T4 DNA polymerase and E. coli DNA ligase (7), the DNA was used for transformation of strain R6 (1, 13). After overnight growth on selective plates containing 120 mg liter1 spectinomycin, colonies were scraped, pooled, and stored at 80°C. For the construction of pGh9T7:ISS1 mutant libraries (21), the R6 and D39 strains were transformed with the pGh9T7:ISS1 plasmid and grown at 30°C on selective plates containing 0.25 mg liter1 erythromycin. Several colonies of S. pneumoniae containing the pGh9T7:ISS1 plasmid were pooled, grown overnight in GM17 broth with erythromycin at 30°C, and stored at 80°C. From this stock, bacteria were inoculated for overnight growth in GM17 broth at 30°C with the appropriate antibiotic, and the next day the cultures were diluted 1 to 50 in GM17 broth without antibiotics and after three hours of growth shifted to 37°C. After another three hours of growth at the nonpermissive temperature, 103 and 104 dilutions were plated on selective medium. Approximately 20,000 to 30,000 CFU were scraped from the plate, pooled, and stored at 80°C for further use. The same marinerT7 and pGh9T7:ISS1 transposon libraries were used for all experiments to determine the reproducibility and specificity of each method. New libraries were generated for the comparison of the marinerT7 and pGh9T7:ISS1 transposon libraries and the GAF zinc screen.
Microarray construction, labeling, hybridization, and analysis.
S. pneumoniae microarrays were produced essentially as described previously (12, 40) and contained amplicons of on average 600 bp representing 2,087 ORFs of S. pneumoniae TIGR4, each of which are present twice on the array (36) as an internal control to monitor variations in hybridization efficiency per slide. Purified RNA obtained after in vitro transcription was used to generate fluorescent DNA probes by direct or indirect labeling using standard methods. Hybridization, washing, and acquisition of array images were performed as described previously (11). Spots were screened visually to identify those of low quality. The amount of low-quality spots varied per slide but was never more than 10%; they were removed from the data set prior to analysis. Slide data were processed and normalized using MicroPreP (41). A net mean intensity filter based on hybridization signals obtained with amplicons representing open reading frames unique for S. pneumoniae strain TIGR4 (36) was applied in all experiments. For one-array experiments (Fig. 2 and 3), only genes that had reliable measurements (i.e., passed filters) for both duplicate spots present on the array were analyzed. For the other experiments, microarray data were analyzed when at least three measurements were available, using a CyberT implementation of the Student's t test (20). False discovery rates were calculated as described previously (40). The correlation between normalized signal intensities per gene in both channels was determined by linear regression analysis.
![]() View larger version (26K): [in a new window] |
FIG. 2. Detection of transposon insertion sites using PCR-based methods. (A) Scatterplot of the signal intensities of two parallel reactions on the chromosomal DNA of a genome-wide library using MATT. There is no correlation between the signal intensities for each gene (r2, 0.036), indicating that the procedure is nonreproducible. (B) Scatterplot of the signal intensities of two parallel reactions on the chromosomal DNA of a genome-wide library using a TOPO-isomerase adapter combined with a nested PCR. There is no correlation between the signal intensities in each gene (r2, 0.010), indicating that the procedure is nonreproducible. (C) Scatterplot of the signal intensities of two parallel reactions on the chromosomal DNA of a genome-wide library using TraSH. Correlation between the signal intensities for each gene indicates that the procedure is fairly reproducible (r2, 0.825).
|
![]() View larger version (27K): [in a new window] |
FIG. 3. In vitro transcription on purified chromosomal DNA restriction fragments leads to specific and sensitive detection of transposon insertion sites. (A) Scatterplot of the comparison of reactions on the chromosomal DNA of a mutant library consisting of 90 unique marinerT7 mutants and the DNA of the same mutants to which the chromosomal DNA of 9 defined mutants has been added. Straight lines denote a twofold difference in signal intensities between the Cy5 and Cy3 channels. Genes identified by sequencing as added to the library are indicated as black squares. (B to D) Scatterplots of the signal intensities of two parallel reactions on the chromosomal DNA of a genome-wide library digested with TacI (B), DdeI (C), and AluI (D). Correlation between the signal intensities in each plot is >0.96, indicating that the procedure is highly reproducible.
|
Identification of transposon insertion sites using a TOPO adapter.
Genomic DNA was diluted in 20 µl H2O to 0.5 µg/µl and fragmented by sonication with the microtip of a Branson digital sonifier (eight times for 0.5 s, output 20%). To prepare double-stranded, blunt-end DNA, fragmented DNA was successively treated with S1 nuclease, T4 DNA polymerase, and E. coli DNA ligase. To facilitate the ligation of a TOPO-isomerase-activated linker (TOPO-Walker kit; Invitrogen), the blunt-end, double-stranded DNA was incubated with calf intestinal phosphatase for dephosphorylation and with Taq DNA polymerase (Thermoperfect; Integro) to create 3' A-tailed DNA ends. Ligated DNA was used as a template in two consecutive PCRs with Taq DNA polymerase: the first reaction contained primers LinkAmp1 (Table 1) and PBMrTn5 (Table 1), and the second contained nested PCR primers LinkAmp2 (Table 1) and PBMrTn3 (Table 1). PCR cycling conditions for both reactions were as follows: 94°C for 4 min; 20 cycles of 94°C for 30 s, 63°C for 1 min, and 72°C for 1.5 min; and 72°C for 3 min. PCR products were loaded on a 2% agarose gel, and DNA fragments with the preferred size were excised and extracted with a Qiaquick gel purification kit (QIAGEN). In vitro transcription reaction and subsequent DNase I digestion were performed with a T7 MEGAscript kit (Ambion).
Identification of transposon insertion sites using TraSH.
The TraSH method was performed essentially as described previously (33) with a few modifications. Genomic DNA was digested with ApoI or TacI and precipitated and ligated overnight in a 15-µl volume with an approximately 1,000-fold molar excess of the appropriate adapter (Adapter1 for ApoI- and Adapter2 for TacI-digested DNA; Table 1). The ligated products were purified, and 0.3 to 1 µl was used as a template in the following PCR containing primer TmR1 (Table 1): 1 cycle at 95°C for 1 min; and 5 cycles at 95°C for 30 s, 72°C for 30 s, and 72°C for 1 min. Subsequently the appropriate adapter primer (Adapter3 for ApoI- and Adapter4 for TacI-digested DNA; Table 1) was added, and the reaction was continued with the following cycling conditions: 5 cycles at 95°C for 30 s, 72°C for 30 s, and 72°C for 1 min, and 30 cycles at 94°C for 30 s, 65°C for 30 s (0.3°C each cycle), and 72°C for 70 s. PCR products were purified and used as a template for a T7 RNA polymerase reaction using a MEGAscript kit (Ambion).
Identification of transposon insertion sites using the T7 RNA polymerase promoter.
The procedure was performed essentially as described previously (6). Genomic DNA (>10 µg) was digested overnight with TacI, DdeI, or AluI, and the purified DNA was used for an in vitro transcription reaction with a MEGAscript T7 RNA polymerase kit (Ambion) in a total volume of 20 µl.
Selection for genes essential for surviving high-zinc conditions.
Aliquots of a marinerT7 transposon library generated in S. pneumoniae R6 containing approximately 20,000 independent mutants were diluted 1 to 20 in GM17 broth or in GM17 broth containing 0.5 mM ZnSO4. Both cultures were grown for 10 generations as determined by the optical density at 600 nm, after which the bacteria were spun down (9,000 x g, 4 min) and used for the extraction of genomic DNA.
Microarray data accession numbers.
The microarray data have been deposited in the NCBI Gene Expression Omnibus (GEO) database (http://www.ncbi.nlm.nih.gov/geo/ under GEO Series accession numbers GSM158348, GSM158350, GSM158558, GSM158562, GSM158563, GSM158564, GSM158565, GSM158566, GSM158567, GSM158568, GSM158569, and GSM158570.
|
|
|---|
Although we used strains R6 and D39 throughout this study, we wanted our method to be applicable to (in principle) all S. pneumoniae strains, some of which are not as easily transformable. Therefore, we established two methods for the generation of random mutant libraries: in vitro mariner mutagenesis (1), which depends on a high-transformation efficiency of the accepting strain for the generation of large libraries and is not expected to have polar effects on downstream genes (31), and the in vivo pGh:ISS1 system (21), which consists of a plasmid with a temperature-sensitive origin of replication, the ISS1 insertion element, and an erythromycin resistance gene with a weak terminator (7.5 kcal mol1); thus, insertion of pGh:ISS1 could cause some polar effects. Growth of bacteria that contain the plasmid for several generations at 30°C, the permissive temperature, followed by a temperature shift to 37°C, induces the insertion of the plasmid into the chromosome. Thus, a one-time introduction of this plasmid in a strain with low transformation efficiency is sufficient to generate large (>20,000 CFU) libraries, avoiding the need for high-transformation efficiencies in accepting strains. In each transposon, an outward-facing T7 RNA polymerase promoter close to both ends of the insertion element was introduced for amplification purposes. Both transposons readily generated large (>20,000 CFU) libraries in S. pneumoniae strains R6 and D39, as expected, and in strain TIGR4 with the pGh:ISS1 system (34) as well (results not shown). Mutant libraries generated with these transposons were used exclusively throughout this study, and the RNA generated with the aid of the T7 RNA polymerase promoter was used in our standard microarray procedures.
Identification of the transposon insertion sites using MATT.
Microarray tracking of transposon mutants combines two rounds of PCR: first, a reaction with specific and nonspecific primers, followed by a nested PCR to further amplify the material (32). The specific primers hybridize to sequences on the transposon, and the nonspecific primers contain 10 degenerative nucleotides followed by a unique anchor sequence. Several anchor sequences were chosen that are present on average every 400 bp in the genome. MATT was found to be specific and reproducible when used for artificial GAF experiments on a few mutants (comparison of 12 mutants with the same set in which three mutants were removed; results not shown). However, when two parallel reactions on one large (>20,000 CFU) genome-wide library were compared, it was obvious that the method was not reproducible (Fig. 2A), as few genes displayed equal signal intensities in both channels (r2, 0.036). When the products of the second round of PCR were visualized on an agarose gel, several distinct bands of various sizes in each reaction were visible, indicating that some DNA fragments were selected randomly during the PCR steps (results not shown).
Identification of transposon insertion sites using a TOPO-isomerase adapter or the adapter-based TraSH method.
Next, we tested a method that consists of the ligation of a TOPO-isomerase-activated adapter to sheared chromosomal DNA, followed by two PCR steps using nested primers that anneal specifically to the adapter and the transposon. Similar to MATT, this method was specific and reproducible when applied to artificial GAF experiments on a small number of mutants (i.e., comparison of 21 mutants with the same set from which 6 mutants were removed; results not shown) but was clearly nonreproducible when used on a genome-wide library (Fig. 2B). When the PCR products of separate reactions were examined, on an agarose gel, different distinct banding patterns were visible, again indicating that the PCR steps introduced a bias in the whole procedure. Repeated attempts to improve the PCR steps in both the MATT and the TOPO-adapter method by using Taq polymerase variants with increased specificity, touchdown PCR, and even more stringent annealing conditions were unsuccessful (results not shown).
Subsequently, we tested the TraSH method, which consists of the ligation of an adapter to chromosomal DNA digested with TaqI or ApoI, followed by a short linear PCR with a transposon-specific primer and a PCR with transposon-specific and adapter-specific primers (33). This method was fairly specific and reproducible both when used on a small number of mutants (i.e., comparison of 21 mutants with the same set of mutants from which 6 mutants were removed; results not shown) and on a large (>20,000 CFU) genome-wide library (Fig. 2C), as the majority of the genes had comparable signal intensities in both channels on two parallel reactions (r2, 0.83). However, the optimal annealing temperature and template concentration had to be determined for each new sample, as standardized PCR conditions resulted in distinct banding patterns in the PCR products and lower correlations between signal intensities in each channel. As the continuous optimization of the PCR conditions made this method cumbersome, we did not pursue it further.
Identification of transposon insertion sites using in vitro transcription on digested DNA.
To avoid the bias introduced by methods employing PCR, we directly used purified chromosomal fragments digested with TaqI (which on average cuts every 442 bp) as input for an in vitro transcription reaction using the T7 RNA polymerase promoter that we had introduced in both transposons (Fig. 1) (6, 17). Comparison of duplicate reactions on a small set of 90 mutants with this method gave rise to specific and reproducible results (data not shown). Next, we tested the feasibility of this method by performing an artificial GAF experiment. To a small library of 90 mutants, 9 defined mutants (transposon insertion site determined by sequencing) were added (90 + 9 library). Subsequently, transposon insertion sites in both libraries were determined by microarray analysis as described above. As expected, the signal intensities of most genes were found to be similar in both channels (i.e., ratios between 0.5 and 2), indicating that mutants in these genes were present in both populations. Furthermore, the signal intensities of 13 genes in the 90 + 9 library were increased significantly in one channel (Fig. 3A), leading to ratios of >9 (Table 2), indicating that transposon insertions in or near these genes were not present in the 90 mutant library. All 13 of these genes corresponded to ORFs present on the TaqI fragments of 8 defined mutants added to the library (Fig. 3A; Table 2). The corresponding amplicons of the genes on the TaqI restriction fragment in one defined mutant (SP2129 to SP2131) had high signal intensities in both channels (ratio of 1), indicating that a mutant with a transposon insertion in this location was already present in the 90 mutant library. Three additional genes were detected as having been added to the library (Fig. 3A; Table 2). The detection of SP1792 is probably due to the fact that the amplicon has significant homology with, among others, ORF SP1262 (data not shown), and thus the high Cy5 signal is probably due to cross hybridization with the fragments generated by the added mutant A2. SP0001 is adjacent to ORF SP2240 on the genome and is present on the TaqI fragment of added mutant A10, and the intermediate-level signals in the Cy5 channel could be due to the occurrence of some partially digested fragments. It is not clear why gene SP2169 is detected; however, the signals in both channels and the ratio are close to the arbitrarily chosen cutoff value of 2 and differ substantially from the signal intensities of the amplicons that represent the 9 added mutants.
|
View this table: [in a new window] |
TABLE 2. ORFs present on the TaqI fragments that contain the insertion sites of the 9 mutants added to the library and the corresponding normalized signals and ratiosa
|
Comparison of the pGhT7:ISS1 and marinerT7 transposon libraries.
Previous experiments were performed on marinerT7-generated libraries. To test whether the method could also be applied to pGh9T7:ISS1 libraries, and to assess the level of saturation and randomness achieved by each transposon, we compared a large (>20,000 CFU) marinerT7 library with a pGh9T7:ISS1 one (>20,000 CFU), each grown for approximately 20 generations in GM17. After isolation, the DNA was digested with DdeI and AluI and each detection reaction was performed in duplicate. Microarray data were analyzed as before by using the background filter generated with the TIGR4-specific amplicons (the normalized signal data and CyberT output data for these arrays are available at http://molgen.biol.rug.nl/publication/GAF_data). This analysis resulted in the detection of 1,473 genes, and the nondetected genes were similar to those not detected in the analysis of the three marinerT7 digests. The ratio of the majority of the genes was between 0.5 and 2, indicating that they are mutated by both transposons. However, 139 (9%) genes had a ratio lower than 0.5 or higher than 2, indicating that these genes are only, or preferentially, mutagenized by one of the two transposons. These "hot spots" are distributed throughout the genome (Table 3). There were 60 genes preferentially mutated by pGh9T7:ISS1, whereas 79 were preferentially mutated by marinerT7, indicating that it has a slightly higher saturation rate than pGh9T7:ISS1. Anchored PCR using primers located on some of the ORFs that had a ratio close to the cutoff point of 0.5 indicated that these genes were indeed preferentially hit by either transposon and that these ratios were not due to slight differences in, for instance, T7 RNA polymerase efficiency (data not shown).
|
View this table: [in a new window] |
TABLE 3. Summary of genes preferentially mutated by marinerT7 (ratio of <0.5 for the first two columns) or by pGh9T7:ISS1 (ratio of >2 for the last two columns)a
|
|
View this table: [in a new window] |
TABLE 4. Summary of genes essential for surviving high-Zn2+ concentrations identified with GAFa
|
![]() View larger version (16K): [in a new window] |
FIG. 4. czcD is a conditionally essential gene when S. pneumoniae is grown under high-Zn2+ conditions. (A) Growth of the wild type (filled triangles) and a czcD deletion mutant (open triangles) in GM17. (B) Growth of the wild type (filled triangles) and a czcD deletion mutant (open triangles) in GM17 to which 0.5 mM ZnSO4 was added. Results are representative of at least three replicate experiments; the error bars indicate the standard deviations. OD595, optical density at 595 nm. (C) Schematic representation of in silico digestion of the R6 chromosomal region containing czcD and its neighboring genes with TaqI. Arrows indicate the locations of genes, and gray rectangles indicate the genomic regions that are present as amplicons on the microarray.
|
|
|
|---|
GAF has several advantages. Detection of transposon insertion sites in a library is performed in a genome-wide manner using microarrays, bypassing the need to generate mutants with unique tags and/or store all mutants separately; this allows for the use of large mutant libraries that have more than 10x coverage of the genome. It also eliminates the need to sequence each individual mutant to determine the transposon insertion site, which accelerates the whole screening procedure and makes it possible to go from recovering the mutants after selection to identifying all transposon insertion sites in about one week.
Recently, several methods have been developed to identify transposon insertion sites using microarrays (32, 33, 39). For all these approaches it is important to reduce the number of false positives, as the mutants are not stored separately and site-directed mutants have to be generated to confirm that the identified genes are indeed conditionally essential. Therefore, we have looked carefully into the specificity and reproducibility of these methods, an issue that is rarely addressed. We showed that methods that use a PCR step or a combination of PCR and adapters introduce a random selection of certain fragments in the procedure when used on genome-wide libraries, which leads to nonreproducible results. A similar effect was observed by Tong et al., and they showed that this is not dependent on the use of amplicon arrays (39). It seems more likely that this random selection of certain fragments is caused by the presence of more than 500 mutants at once, as sensitive and specific results with PCR-based methods can be obtained when small libraries (<500 CFU) or subpools of a large library are used (39; results not shown).
When the PCR step is omitted and restricted DNA is used directly as a template for an in vitro transcription reaction, the detection of transposon insertion sites becomes reproducible and specific (Fig. 3). We used the signal present on TIGR4-specific amplicons for the generation of a background filter. Application of this filter resulted in the detection of 66% of the ORFs of strain R6 on our array, which is in agreement with the proposed S. pneumoniae core genome consisting of 73% of the TIGR4 ORFs (28). As expected, the set of nondetected genes contained many essential genes or genes adjacent to essential genes. The latter indicates that there are quite a number of polar effects in the studies investigating the essential genes of S. pneumoniae. Other nondetected genes are either hypothetical or expected to be essential as they, for instance, encode parts of the ribosome (e.g., SP0213, SP0214, and SP0232-SP0233-SP0234; see Table S1 in the supplemental material). A few undetected genes are represented by short amplicons, and it could be that insertions in this region, combined with the restriction enzymes used, do not generate fragments that hybridize to these amplicons (this could be the case for, e.g., SP0540 to SP0548, which encode a putative bacteriocin operon, and we do not expect these genes to be essential; see Table S1 in the supplemental material). We also showed that large genome-wide libraries (>20,000 CFU) generated with marinerT7 and pGh9T7:ISS1 have a similar degree of saturation but that each transposon is missing approximately 4% of the total number of genes. These "blind spots" are probably due to differences in the insertion site preferences of each transposon, although the genes missed by marinerT7 do contain its recognition site TA. This also shows that the duplication of pGh9T7:ISS1 in the genome (21) does not interfere with the detection of the transposon insertion sites and that this system is a good alternative for the marinerT7 system for use in strains that do not have a high-transformation efficiency.
In contrast to transcriptome analysis, the severalfold change obtained by GAF analysis bears no direct relation to the functional impairment caused by mutation of a gene. For instance, the ratio of czcD in the Zn2+ stress GAF screen was 0.43, but a deletion mutant was at least ten times more sensitive to high-Zn2+ concentrations than the wild type. This apparent discrepancy has two probable causes. (i) The ratio is dependent on the generated restriction fragments, which prevents control over the length of the probes generated for hybridization. It is conceivable that an essential gene is also represented on a fragment derived from a transposon insertion in an adjacent, nonessential gene, thereby masking the effect of the disappearance of the mutant in the conditionally essential gene. This is probably the case for czcD. (ii) Not all transposon insertions abolish gene function completely; however, these mutants will also generate a signal during the GAF procedure, again leading to a masking of the effect of the complete disruption of gene function. These effects are amplified when large, complex libraries are used, which explains why the ratios in the artificial go + g library GAF experiment are much higher than those obtained when genome-wide libraries are used.
In conclusion, GAF is a versatile, high-throughput method for the identification of conditionally essential genes in S. pneumoniae. GAF should be easily adaptable to other microorganisms, as the only prerequisites for this technique are the availability of random (transposon) mutant libraries and a microarray.
This work was supported by IOP genomic grant IGE3002 of the Dutch Ministry of Economic Affairs.
Published ahead of print on 19 January 2007. ![]()
Supplemental material for this article may be found at http://aem.asm.org/. ![]()
These authors contributed equally to the work described in this paper. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»