Previous Article | Next Article ![]()
Applied and Environmental Microbiology, August 2007, p. 4915-4921, Vol. 73, No. 15
0099-2240/07/$08.00+0 doi:10.1128/AEM.00068-07
Copyright © 2007, American Society for Microbiology. All Rights Reserved.
,
Wataru Kitagawa,2*
Nobutada Kimura,1
Naohiro Noda,1
Kazunori Nakamura,1 and
Yoichi Kamagata1,2
Institute for Biological Resources and Functions, National Institute of Advanced Industrial Science and Technology, 1-1-1 Higashi, Tsukuba, Ibaraki 305-8566, Japan,1 Research Institute of Genome-based Biofactory, National Institute of Advanced Industrial Science and Technology, 2-17-2-1, Tsukisamu-Higashi, Toyohira, Sapporo 062-8517, Japan2
Received 11 January 2007/ Accepted 20 May 2007
|
|
|---|
|
|
|---|
Why are GMMs seldom released into the environment despite the relative simplicity and ease of genetic modification? One reason is the small size of the microbes and their diversity, which make them difficult to trace and make it difficult for their ecology to be studied in the environment (31). Specific tracing is the key technology for studying the effectiveness and predicting the fate of GMMs in the environment. Specific tracing is also related to the proper risk management, regulation, and operation of GMMs for safer use in field application (24, 35). The current DNA-based detection method relies mostly on marker sequences, such as antibiotic resistance genes, the transgenes of other functions, or functional sequences (e.g., promoter or terminator) (3). These marker sequences also occur naturally in microorganisms in soil, and some components (e.g., common promoters, such as Plac) are used in multiple GMMs. The presence of a naturally occurring background as well as the same marker sequence in different GMMs can thus cause false-positive results (1). Because of such uncertainty, additional confirmation tests that employ product-specific PCR methods must be carried out.
In a previous report, Marillonnet et al. proposed a method that included an identity tag in GM plants by introducing a silent DNA sequence (300 bp) that coded alphabetical information based on the customized codon table (22). However, as argued by Pauli and Marillonnet et al. (23, 26), there are still some points to be considered, such as the need for extensive verification of safety, the requirement for additional sequencing, and most importantly, gene-crossing problems caused by horizontal gene transfer (HGT), leading to the separation of tag and transgene. The gene-crossing problems are more serious in GMMs, since gene transfer is suspected to be a common event in microbial communities (10, 21). Therefore, the detection of only the GM host or marker sequence adjacent to the transgene would not be more reliable for tracing GMMs and examining their fate in the environment; a method for directly marking and detecting the transgene is an important technology for accurately detecting GMMs.
Here we propose a simple and effective method for directly marking functional transgenes so that they can be specifically traced or detected in the environment without causing the gene functions to deteriorate. For that purpose, we introduced nucleotide substitutions (NS) based on the degeneracy of codons into the transgene as markers (NS markers) through a PCR-based method. The NS-introduced transgene is different from its family genes in the NS region; thus, it can be specifically detected in the environment. The PCR-based introduction and detection of the marker is a method that can be applied universally to any of the genes that have to be detected specifically in the environment, even in the presence of their parental or relative genes.
|
|
|---|
Introduction and detection of markers in functional genes.
The primers used in this study are shown in Table SA2 in the supplemental material. NS markers were introduced into the functional genes by gene splicing with overlap extension (14) (or linking PCR [15]) (see Fig. 2) with some modifications using Pyrobest DNA polymerase according to the manufacturer's instructions. The cycling conditions for linking PCR were as follows: preincubation at 95°C for 3 min; and a total of 30 cycles at 95°C for 30 s, 60°C for 30 s, and 72°C for 1 min/kb of the targeted region. Two or more sets of primers were designed for each marker introduction. Each set of primers contained the marker sequence as an overlapping region and 20 nucleotides (nt) of the priming region complementary to the original gene (see Fig. 2a). In the first step of the PCR, two (or more) fragments that contained a part of the functional gene were amplified so that they had the marker sequence at one end of the products. The resulting PCR fragments were mixed and linked together by a second PCR (linking PCR) using both ends of the primers. The resulting PCR product was digested by an appropriate restriction enzyme and cloned into the plasmids. For the detection of each marker, a common forward primer and marker-specific reverse primers were designed. PCR detection was conducted using AmpliTaq Gold DNA polymerase under the following conditions: preincubation at 95°C for 9 min; and a total of 35 cycles at 95°C for 30 s, 60°C for 30 s, and 72°C for 1 min/kb of targeted region.
![]() View larger version (19K): [in a new window] |
FIG. 2. Positions of the marker-introduced site in BphC. (a) Organization of marker-embedding primer. To our knowledge, at least 25 nt of overlapping region, including 10 nt of additional overlapping regions (AORs), were needed for a successful linkage reaction. (b) Marker introduction procedure for NS marker. The marking was done as follows: [1] two (or more) sets of PCR primers containing marker sequences and an overlapping region for the amplification of gene fragments were designed; [2] partial gene fragments containing marker sequences were generated by PCR; [3] the PCR products were mixed, and denaturation, reannealing, and extension were done to generate hybrid molecules; and [4] PCR amplification with edge primers to obtain the full length of the marker-introduced gene.
|
Quantitative PCR.
We used a real-time PCR method that uses an internal quenching probe (18, 20) for specific quantification of the marker-introduced gene. The quenching probe was designed to be specific for bphC (5'-GGCCACGATTGAAAGTAATCTGTATCCTC-3') and labeled with BODIPY FL at its 3' end. The forward and reverse primers for quantitative detection were designed to be specific for NS1 (5'-CGCAGTCGGATGAGAGT-3') and NS2 (5'-GATTCTCTCCATGACATCACC-3'), respectively. All oligonucleotides were obtained from Tsukuba Oligo Service Corp. (Tsukuba, Japan) and used without further purification. PCR with fluorescence monitoring was performed in 20-µl volumes per reaction in a real-time PCR instrument (LightCycler, Roche Diagnostics, Mannheim, Germany), basically following the method reported previously (18). In brief, each reaction mixture contained 1 µM of forward primer, 0.3 µM of reverse primer, 1x TITANIUM Taq PCR buffer, 50 nM of BODIPY FL-labeled internal probe, 200 µM of each deoxynucleoside triphosphate, 0.25 mg/ml bovine serum albumin, and 1 x TITANIUM Taq DNA polymerase (TAKARA BIO INC., Ohtsu, Japan). The standard curve for the quantification was obtained by using the duplicate of the seven-point standards with a known number of target genes (40 to 4.0 x 107). The total community DNAs were extracted from activated sludge (municipal wastewater plant, Kasumigaura, Japan) and lake sediment (Lake Kasumigaura, Japan) using the FastDNA SPIN kit for soil (Qbiogene, CA).
Enzyme assays for BphC and TOM.
E. coli JM109 harboring the recombinant plasmids was grown in LB. The cells were harvested by centrifugation, washed with 50 mM Tris-HCl buffer (pH 7.5), and resuspended in the same buffer. To measure the BphC activity, the cells were disrupted by sonication, and cell debris was removed by centrifugation at 15,000 rpm for 10 min at 4°C. The supernatant was then used as a crude extract. The BphC activity of crude extracts for 2,3-dihydroxybiphenyl (23DHBP) was spectrophotometrically determined by using a DU800 spectrophotometer (Beckman Coulter). Reactions were carried out in 2 ml of 50 mM Tris-HCl buffer (pH 7.5) containing 200 µM 23DHBP at 25°C. The molecular extinction coefficient of 23DHBP under the assay conditions (pH 7.5) was as follows:
432 = 13,200 M–l cm–1 (2). Protein concentrations were determined with a Bio-Rad Laboratories protein assay kit with bovine serum albumin as the standard. Toluene-ortho-monooxygenase (TOM) activity was measured by naphthol formation from naphthalene by modifying the assay of Canada et al. (4). Two milliliters of washed cells (optical density of 5.0) were contacted with 0.1 mM of naphthalene (dissolved in dimethyl formamide) at 25°C for 30 min. The naphthols synthesized were detected by reacting 800 µl of cell supernatant with 40 µl of 1% (wt/vol) tetrazotized o-dianisidine and 160 µl of glacial acetic acid. The purple azo dye product was detected at an optical density of 528 nm with a DU800 spectrophotometer (Beckman Coulter). The molar amount of naphthols synthesized was calculated by comparison to a 1-naphthol standard curve (molar extinction coefficient of 8,500 M–1 cm–1 (4). Measurements were done at least three times for each strain tested.
|
|
|---|
![]() View larger version (35K): [in a new window] |
FIG. 1. Marker description and introduction procedure. (a) NS marker that is based on the degeneracy of codons. In Ser, Arg, and Leu, two or three bases can be substituted, and at least 1 base can be substituted in all codons except for Met and Trp. (b and c) The introductory positions of NS markers in the amino acid sequences of BphC and TomA3, respectively. Red letters indicate the essential residues for enzymatic activity or the assembly of subunits derived from information on three-dimensional structures or alignments with known genes for bphC and tomA3, respectively (19, 29). (d) Sequence alignment of NS1 and NS2 markers with wild-type bphC and relative genes. The relative genes were as follows: bphCKKS102 (wild-type gene in Achromobacter georgiopolitanum KKS102, accession no. M26433); bphCTK102 (Comamonas testosteroni TK102, accession no. AB086835); phdFPYR1 (Mycobacterium vanbaalenii strain PYR-1, accession no. AY365117); bphC5TA421 (Rhodococcus erythropolis TA421, accession no. D88017); etbCRHA1 (Rhodococcus sp. strain RHA1, accession no. AB120955); nahCNAH7 (plasmid NAH7 from P. putida, accession no. J04994); bphCB356 (Comamonas testosteroni B-356, accession no. U91936); bphCLB400 (Burkholderia sp. strain LB400, accession no. X66122); bphCRHA1 (Rhodococcus sp. strain RHA1, accession no. D32142); bphC3TA421 (Rhodococcus erythropolis TA421, accession no. D88015); todEF1 (Pseudomonas putida F1, accession no. J04996); bphC4TA421 (Rhodococcus erythropolis TA421, accession no. D88016); and bphC5RHA1 (Rhodococcus sp. strain RHA1, accession no. AB030672). Red letters indicate the bases that differ from the sequences of the NS markers.
|
Introduction of NS markers and enzymatic activities of the translated proteins.
The procedure for the introduction of the NS marker was the PCR-based simple method that is similar to the method known as "gene splicing by overlap extension" (14) or "linking PCR" (15). In brief, two sets of PCR primers were designed to have overlapping regions (Fig. 2a) other than the usual priming sites. The substitutions of nucleotides (NS markers) were included in the overlapping region. The first PCR was used to generate two fragments (products AB and CD in Fig. 2b[1] and [2]) that have NS markers at their ends. These two fragments were mixed and used as templates for a second PCR, and the two products were linked to generate an intact marked gene (Fig. 2b[3] and [4]). To our knowledge, at least 25 nt of overlapping region (including 10 nt of additional overlapping regions) was needed for a successful linkage reaction (data not shown).
We obtained the marked genes, namely, bphCNS12 and tomA3NS3t8. The gene bphCNS12 has NS1 and NS2 markers, and tomA3NS3t8 has NS3, NS4, NS5, NS6, NS7, and NS8 (a total of six) markers. The enzymatic activities corresponding to bphCNS12 and tomA3NS3t8 were measured and compared with those for wild-type genes. In both of the translated proteins, the activity was almost the same as that of its wild type (104.5 and 100.6% of its wild type activity for BphC and TOM, respectively), showing that NS markers did not cause any deterioration of enzymatic activity (see Table SA3 in the supplemental material). In principle, the NS-marking method introduces a synonymous mutation that does not change the amino acid sequence; thus, the translated proteins are kept intact. In terms of the translation efficiency, the marked genes can be affected by the existence of rare codons in the expression host E. coli (16). In tomA3NS3t8, the introduction of six NS markers resulted in a net increase of 10 rare codons aside from the interchanges between 2 rare codons. However, the negligible change in enzymatic activity of the whole cell indicated that the introduction of the NS markers had a minor effect on translation efficiency in this case.
Specific detection (quantification) by PCR and identification of variations.
The NS markers in bphCNS12 and tomA3NS3t8 (two and six, respectively) were detected by PCR (Fig. 3a). In the series of PCRs, eight pairs of the primers were used to detect the markers (Fig. 3a, lanes 1 to 8). Among these reactions, the amplified product was observed only in marked bphCNS12 and tomA3NS3t8 templates. None of the amplified product was detected in the wild-type bphC or tomA3.
![]() View larger version (31K): [in a new window] |
FIG. 3. Qualitative and quantitative detection of the marker sequence by PCR. (a) PCR detection of each of the NS markers with the primers specific for the marker sequence by using each of the marked and wild-type genes as a template. bphCNS12, bphC gene with markers NS1 and NS2; tomA3NS3t8, tomA3 gene with markers NS3 to NS8. The numbers on the picture show the primer pair that was used for the detection of each NS marker (with the numbers 1 to 8 corresponding to NS1 to NS8). (b) Identification of the tomA3 variations harboring NS markers in different positions. The PCR mixture contained one forward primer and six reverse primers complementary to each of the NS3 to NS8 marker sequences. Lanes 1 to 6, single template with single markers (single detection); lanes 7 to 13, multiple templates with single markers (multiplexing); lanes 14 to 17, single template with multiple markers (patterning). The numbers below the picture show the marker number contained in the respective tomA3 variations.
|
We also tried to quantify bphCNS12 in the range of 40 to 4 x 106 copies/tube by quantitative PCR in the presence of 4 x 107 copies/tube of wild-type bphC (Table 1). The target sequence (bphCNS12) was successfully quantified in a target/background (wild type) ratio of 0.000001 to 0.1. Moreover, we could quantify a similar range of bphCNS12 (400 to 4 x 107 copies/tube) even in the presence of 500 ng of environmental DNA extracted from activated sludge or lake sediment samples. Assuming the microbes to have an average genome size of 4 Mbp, 500 ng DNA corresponds to the genome of 1.1 x 108 microbial cells, and thus, the lower limit for quantification can be calculated as 3 to 4 cells/ml for the case of activated sludge with 109 microbial cells/ml (with an assumption that bphCNS12 is on a high-copy-number plasmid [about 100 copies per cell]). In the community experiment, we successfully detected and quantified 105 to 1010 copies/ml of bphCNS12 on the pBBR122 derivative plasmid in Pseudomonas putida KT2440 for more than 30 days in a sequential batch reactor seeded with activated sludge (data not shown).
|
View this table: [in a new window] |
TABLE 1. Quantitative detection of bphCNS12 by PCR
|
|
|
|---|
The NS markers were introduced into the transgenes by the PCR-based method. Many transgenes usually have a reported set of primers that are designed for detection, modification, or other purposes. In our method, two more primers having a marker sequence at each 5' end as the overlapping region are needed for the marker introduction. The marking method itself has no preference in terms of the GM host organism. Since 18 out of 20 amino acids are coded by multiple codons, degeneracy-based codon substitution can thus provide a direct, easy, and flexible marking method for the transgenes. Principally, since the codon substitution is based on synonymous mutations that are thought to be evolutionarily neutral, orthologous genes may have sequences similar to those of the NS markers. However, as shown in Fig. 1d (see also Fig. SA1 in the supplemental material), NS markers showed sufficient specificity among the relative genes.
One of the most attractive points of this marking method is that the NS markers can be introduced while the original amino acid sequence of the transgenes is kept intact. Thus, the introduction of the marker itself can be regarded as self-cloning, so that the marking and detection of particular microbes are possible without the microbes becoming GMMs. The introduction of the NS marker into the transgene causes no change in the activity of the translated protein as long as the translation efficiency is unchanged. As reported by some researchers, rare codons can cause translational problems (6, 12, 16). However, the problem is usually obvious when the same rare codon (e.g., AGA) exists in clusters or in excess numbers in the gene (16). In the tomA3NS3t8 gene in this study, the introduction of the 6 NS markers resulted in an increase of 10 rare codons without introducing a cluster of rare codons. The fact that no decrease in enzymatic activity was observed for both BphCNS12 and TomA3NS3t8 showed that the increase of several rare codons caused no deterioration in gene function. However, the introduction of the NS markers should be considered from the point of view of codon usage in order to prevent any error of translation, especially in the case of highly expressing proteins.
Another advantage of this method is the capability of creating several variations of particular transgenes that share an identical amino acid sequence. We demonstrated two different uses of such variations, namely, "multiplexing" and "patterning." In multiplexing, six variations of tomA3 harboring single NS markers in different positions were constructed. These six variations could be detected separately by PCR and through multiplex detection of mixtures of up to three variations. Each NS marker was identified by the size of the PCR product. If there are two or more similar GMMs in the environmental samples, the selection of one particular kind of GMM is impractical even just in the mixture of pure cultures. This multiplexed detection system can be a powerful tool for the specific detection and simultaneous identification of the actual producer when there are several different GMMs that share the same transgenes with different NS markers (Fig. 4a). On the other hand, our "patterning" technique can be used for more-precise identification of GM products. We succeeded in creating several transgenes that differed in terms of their patterns of NS markers while keeping the genetic function intact. To detect the "pattern" of a transgene, only the PCR amplification and identification of the product patterns are needed (Fig. 4b). We succeeded in the PCR detection of up to three different NS markers in one reaction tube. Using combinations of 3 NS markers out of 6 positions, 20 variations can be distinguished. The large variety of marker patterns enables the application to a product-specific tag for the transgenes that are widely used in multiple GMMs. The application fields for GM organisms are variable (contained reactor, field release, etc.), and the use of the NS markers can be customized depending on where and how they are used and on the target organism.
![]() View larger version (37K): [in a new window] |
FIG. 4. Schematic diagram of the identification of the GMMs that harbor the same transgene. (a) In multiplex identification, several GMMs with the transgene harboring the NS marker in different positions can be simultaneously detected. (b) In gene patterning, one transgene must be the template DNA. However, 20 patterns of transgenes can be identified only by the patterns of the PCR products.
|
We thank H. Futamata for providing the DNA fragment extracted from Burkholderia cepacia G4 and M. Fukuda for providing the plasmid pHNA2. We also thank O. Yagi and F. Kurisu for helpful advice.
Published ahead of print on 25 May 2007. ![]()
Supplemental material for this article may be found at http://aem.asm.org/. ![]()
Present address: Kochi Institute for Core Sample Research, Japan Agency for Marine-Earth Science and Technology (JAMSTEC), Monobe B200, Nankoku, Kochi 783-8502, Japan. ![]()
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»