Kevin N. Turner,
and
David A. Mills*
University of California at Davis, Department of Viticulture and Enology, One Shields Ave., Davis, California 95616
Received 19 December 2005/ Accepted 9 July 2006
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
An alternative mechanism for integrating heterologous genes into bacterial chromosomes is via group II introns. Some group II introns are retroelements that mobilize at high efficiency into specific intronless alleles in a process called homing (14). Homing initiates with intron splicing, resulting in a ligated exon and a ribonucleoparticle comprised of the intron RNA and an intron-encoded protein (IEP). Both intron RNA and IEP components recognize specific DNA sites in the target region and promote reverse splicing of the intron RNA directly into the sense strand of the target DNA (14). The IEP cleaves the antisense strand of the target site and initiates reverse transcription of the intron RNA. The resulting cDNA is then incorporated into genomic DNA via repair mechanisms.
Specificity of the homing reaction is achieved through a combination of both IEP and intron RNA binding to the specific DNA target site. The IEP facilitates unwinding of the double stranded DNA and interacts with specific sites in the target region that flank the insertion site. Sequences within the intron (exon binding sequence 1 [EBS1], EBS2, and
) anneal with the corresponding target sequences (intron binding sequence 1 [IBS1], IBS2, and
') to facilitate reverse splicing of intron RNA into the target site (24, 30).
By rescripting intron EBS1, EBS2, and
regions to anneal to alternative targets, it is possible to retarget the intron to invade a different site (9, 11), thus providing a novel means for gene invasion. Since integration is independent of host recombination functions (22), intron-mediated inactivation is considered an attractive alternative for bacteria that are recalcitrant to more-standard methods of genomic manipulation.
The most employed group II intron for this purpose is the lactococcal intron Ll.ltrB. Retargeted Ll.ltrB introns have successfully invaded chromosomal loci in a range of hosts including Escherichia coli (11), Shigella (11), Salmonella (11), lactococci (8), and clostridia (5). We recently demonstrated that Ll.ltrB can mediate delivery of heterologous genes, engineered inside the intron, into the lactococcal mleS gene at high frequency, obviating the need for selectable markers (8). Given the absence of selectable markers, the copy number of any delivered gene(s) could then be increased by successive rounds of intron integration. One way to reduce the need for successive rounds of intron invasion is to target repeated genes, such as insertion sequence (IS) elements, within the host genome. IS elements are typically arranged with terminal inverted repeats of 10 to 40 bp flanking a single transposase gene (tra) (20). These elements have been found in multiple copies in numerous LAB (20, 26). L. lactis IL1403-UCD contains six different IS elements, which range in copy number from 1 to 14 copies (3). In this work, Ll.ltrB was retargeted to invade three different IS element transposase genes, tra904, tra981, and tra983, within the IL1403-UCD genome. tra-targeted introns integrated into multiple tra alleles in a single delivery experiment without selection. By cloning a green fluorescent protein (GFP) marker into the tra904-targeted intron, segregants were obtained with different numbers of Ll.ltrB::GFP copies in different sets of tra904 alleles, resulting in a range of GFP expression levels. Thus, this work reveals a strategy for targeting multicopy alleles for intron-mediated delivery of heterologous genes in order to achieve a desired level of integration and expression in bacteria.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Molecular biology techniques.
Standard molecular techniques were carried out as described elsewhere (1). For E. coli transformations, One Shot competent cells (Invitrogen, Carlsbad, CA) were used according to the manufacturer's instructions. Transformation of L. lactis was performed as previously described (10). Plasmid DNA was purified from E. coli by use of QIAquick spin columns (QIAGEN, Valencia, CA) following the manufacturer's directions. Chromosomal DNA was prepared from L. lactis as described elsewhere (19). Plasmid DNA was cured from L. lactis by use of ascorbic acid (27). All PCRs were performed using a PTC-200 thermal cycler (MJ Research, San Francisco, CA). Primers and PCR conditions used in this study are listed in the supplemental material (see Tables S2A and S2B, respectively, in the supplemental material). For standard PCRs, including intron rescripting and colony screening, Amplitaq Gold (Applied Biosystems, Foster City, CA) was used. Colony PCR of L. lactis was performed using GeneReleaser (Bioventures Incorporated, Murfreesboro, TN) as previously described (8). PfuTurbo (Stratagene, La Jolla, CA) was used for all inverse PCR and gene amplifications for cloning purposes. Inverse and megaprimer PCRs were purified using Montage PCR filter units (Millipore, Billerica, MA) following the manufacturer's instructions. Restriction enzymes were supplied by New England Biolabs (Beverly, MA). DNA ligations were performed using a Fast-Link DNA ligation kit (Epicentre, Madison, WI) following the manufacturer's instructions. DNA purification from all other PCRs and agarose gels was done using a Geneclean spin kit (Q-Biogene, Irvine, CA) according to the manufacturer's directions. DNA sequencing was performed by the DBS DNA sequencing facility at UC Davis.
Construction of intron donor vector and intron rescripting.
The Ll.ltrB intron was amplified from pLE12 (23) by use of primers IntronUP-F and IntronDN-R, which introduced flanking XbaI and XhoI sites, respectively. The intron fragment was then cloned into the XbaI and XhoI sites of pMSP3535 (4), resulting in pKT0180. Plasmid pKT0180 was then rescripted so that the IBS and EBS1 regions contained MluI and NcoI restriction sites, respectively. This and all subsequent intron rescripting was performed using the two-step PCR method followed by a megaprimer PCR in a fashion analogous to that used previously for Ll.ltrB retargeting (11). The first step in the two-step PCR was performed using primer pairs IBSMluI-EBSR2 and EBS1NcoI-EBS2NruI followed by IBSMluI-EBS2NruI. For the megaprimer PCR, the vector backbone was generated by inverse PCR using pKT0180 as the template and primers pHR057F and pHR057R. The PCR products were purified, and 1 µg of the vector backbone and the two-step PCR product were used in a megaprimer PCR for 10 cycles with an extension of 12 min and an annealing temperature of 55°C. The resulting plasmid DNA was cleaned and transformed into E. coli TOP10 cells (Invitrogen), and clones were screened by sequence analysis to confirm that the correct bases had been incorporated. The resulting plasmid, pKT0185, was used in all subsequent rescripting reactions. Intron rescripting of pKT0185 was performed using two-step PCR primers (Tra904473sIBS, Tra904473sEBS1, Tra904473sEBS2, Tra981555sIBS, Tra981555sEBS1, Tra981555sEBS2, Tra983378sIBS, Tra983378sEBS1, Tra983378sEBS2, EBS2R) to introduce base changes into the Ll.ltrB IBS, EBS1, and EBS2 regions of the intron. Before transformation into E. coli TOP10 cells, the rescripted intron vectors were digested with MluI and NcoI to remove any unrescripted pKT0185 background. The vectors were screened again by sequence analysis to confirm that the correct bases had been incorporated.
Intron invasion.
Strains of L. lactis containing the tra-targeted intron donor vectors were grown overnight. A 1/100 dilution was made with media containing 25 ng/ml nisin (Sigma-Aldrich, St. Louis, MO), and the mixture was left overnight. One milliliter of overnight culture was washed in GM17 to remove residual nisin to limit further intron homing. A serial dilution was performed and plated out onto selective media. Colonies were initially screened for the presence of any intron-tra junctions by use of a consensus forward primer designed to the 5' region of tra904, tra981, or tra983, respectively, combined with an intron-based reverse primer, EBS2R (see Table S2 in the supplemental material). This allowed an estimate of the invasion frequency for tra904, tra981, or tra983, respectively. Between 48 and 192 colonies were screened for each intron in this step. To score which specific tra alleles had been invaded, chromosomal DNA was prepared from colonies which produced amplicons by use of the consensus tra-Ll.ltrB junction PCR and examined by PCR using primers specific to each individual tra allele. To accomplish this, the allele-specific forward primers (see Table S2 in the supplemental material) were paired with reverse primer EBS2R.
Construction of pHR082.
GFP was amplified from plasmid pMSP::GFP3bMut (a gift from G. Dunny) by use of primers pGFPXbaF1 and pGFPXhoRI. The PCR product was digested with XbaI and XhoI and cloned into pMSP3535 digested with the same enzymes. The plasmid, numbered pHR082, contains GFP under control of the nisin promoter.
Construction of Ll.ltrB::GFP.
To enable the cloning of genes into Ll.ltrB, a SacII site was introduced immediately downstream of the stop codon of ltrA in pKT0185. An inverse PCR was performed with pKT0185 as the template by use of primers pKT0185SacIIF and pKT0185SacIIR2. The PCR product was digested with SacII and self-ligated. The resulting plasmid, pHR116, contains an enzyme site for cloning genes into the intron. This plasmid was then rescripted to the tra904 allele as described above, creating pHR120. The nisA promoter and GFP gene were amplified from plasmid pHR082 with primers nisGFPSacIIF5 and nisGFPSacIIR5 and cloned into pHR120, creating pHR131.
Measurement of GFP expression.
Strains were grown overnight and diluted 1/100 into GM17. Growth was monitored until an absorbance at 600 nm of 0.5 was reached. Nisin was added to a final concentration of 25 ng/ml. Samples were then removed hourly to measure GFP levels. One milliliter of cells was removed, washed, and resuspended in phosphate-buffered saline, pH 6.0, to achieve an absorbance at 600 nm of 0.5. One hundred microliters of the washed cells was placed into a 96-well optical plate and covered with an optical cap (Applied Biosystems). The GFP fluorescence was measured using the plate-only read function of an ABI 7700 sequence detection system using an excitation wavelength of 488 nm and an emission wavelength of 520 nm.
| RESULTS |
|---|
|
|
|---|
Construction of tra-targeted introns.
To determine if it was possible to deliver the intron into multiple target sites in a single delivery experiment, introns were retargeted to invade tra904, tra981, and tra983, which are present in the IL1403-UCD genome in 9, 10, and 14 copies, respectively (3). The tra904 allele was targeted using the Targetron software to identify candidate target sites (InGex LLC, St. Louis, MO). Of eight sites analyzed, only the tra904502s intron successfully invaded (data not shown). Target sites for tra981 and tra983 were chosen on the basis of consensus with previously successful intron invasion sites (Fig. 1) (8, 9, 38). The plasmid pKT0185 was rescripted to generate plasmids pHR109, pHR123, and pHR117, containing introns targeted for tra904, tra981, and tra983, respectively.
|
As seen in Tables 1 2 and 3, the rescripted Ll.ltrB intron was able to invade more than one tra allele in a single invasion event. For all three tra targets, segregants containing different numbers of invaded alleles were obtained, with a maximum of nine alleles invaded for any single tra904, tra981, or tra983 segregant (Tables 1, 2 and 3, respectively). The tra904 invasion showed the most variability, with segregants containing one, two, three, five, and nine copies of the intron in different tra904 alleles. The tra904A, tra904D, and tra904F alleles were invaded at a higher frequency than the other alleles (invaded in 60 to 70% of segregants). All nine of the tra981 alleles present in the IL1403-UCD strain (tra981B is absent) were invaded in two individual segregants, while a third segregant had six alleles harboring an intron. Invasion of tra983 alleles showed some bias. tra983A, tra983B, tra983C, tra983D, tra983E, tra983F, tra983H, and tra983L all were invaded in most of the segregants (>75%). Conversely, tra983G, tra983I, tra983J, and tra983N were invaded less frequently (<25%), and tra983M was not invaded at all. Sequence analysis of the invasion site within tra983M showed that it possessed the same target region sequence as the other tra983 alleles (Fig. 1), indicating that differences in the tra983M target site region of our strain was not the reason for the lack of invasion.
|
|
|
|
|
| DISCUSSION |
|---|
|
|
|---|
When L. lactis is engineered for this purpose, a common strategy is to incorporate the expression cassette into the genome of the target strain (28). This approach is taken for a variety of reasons, including the need to eliminate all vector-associated antibiotic resistance genes as well as the potential stability and containment issues inherent with plasmid-borne expression. Chromosomal incorporation typically involves selection of crossover events between cloned genomic DNAs that flank the heterologous gene of choice and cognate chromosomal loci.
We previously demonstrated an alternative means for delivery of genes into the genome of L. lactis IL1403 by use of the Ll.ltrB intron (8). Intron-mediated delivery occurred at high frequencies, obviating the need for antibiotic selection to identify the appropriate integrant (8). In the present work, we examined intron delivery targeted toward repeated elements in L. lactis as a means to increase the copy number of the delivered gene. Introns were retargeted to invade tra904, tra981, and tra983 alleles present in the IL1403 genome. Rescripted introns invaded multiple alleles in a single induction event, although invasion frequencies varied between 1.5 and 79% for various tra genes. This variation is likely due to the differences in the specific target sites and their respective capacities to serve as a viable partner for the intron ribonucleoparticle-based invasion (Fig. 1). Target nucleotide requirements for Ll.ltrB homing have been extensively examined for E. coli, and 11 specific bases were shown to be favored (>50% predominance) in Ll.ltrB target sites (36). The rescripted Ll.ltrB introns utilized in this study possessed a maximum of five bases in common with those selected for in E. coli (Fig. 1). Of the three sites targeted, the similarity of the tra gene target sequence to the wild-type ltrB target sequence appeared related to invasion efficiency, with targets of greater similarity to the wild type being invaded more efficiently. tra904, which has 17 nucleotides in common with ltrB, homed at 79%, while tra983 and tra981, with 12 and 6 nucleotides shared with ltrB, homed at 8 and 1.5%, respectively. A more comprehensive survey of intron target site preferences in lactococci will enable better prediction of introns that invade at high efficiency.
After screening a wide range of segregants from tra904, tra981, and tra983 intron invasions, we noticed that a maximum of nine tra allele integrations could be accomplished in a single step. It is unclear why no more than nine introns could be integrated in one induction event. Perhaps a screening of more segregants would reveal additional insertions. Alternatively, polar expression from the intron-encoded ltrA promoter (39) into genes downstream of a specific tra allele might have made integration into a larger complement of target alleles problematic.
By cloning into the tra904-targeted intron, as many as eight copies of Ll.ltrB::GFP were delivered into the chromosome in a single step without the use of selection. The general invasion frequency of Ll.ltrB::GFP into the tra904 alleles (42%) was reduced in comparison to that of Ll.ltrB invasion alone (79%), suggesting that the addition of heterologous DNA inside the intron impedes invasion. As expected, the copy number of tra904::GFP directly correlated with the level of GFP fluorescence. The level of fluorescence observed for a strain containing six copies of tra904::GFP (strain HR308.1) was similar to that observed for a GFP-containing plasmid maintained at approximately the same copy number (six to nine copies per cell). Growth was inhibited upon nisin induction in strains HR313.1 and HR314.1, which carried seven and eight copies of GFP, respectively. The cause for this growth inhibition remains unknown. Perhaps the expression of GFP from the chromosome at a copy number of seven or more produces a significant metabolic burden on the cells. This is unlikely, given the lack of previous reports of diminished growth of L. lactis due to expression of heterologous proteins from other multicopy vectors (18, 25). An alternative explanation is that the expression of GFP has a polar effect on genes downstream of the intron insertion.
Multicopy intron-mediated delivery is unlike other methods used to obtain multiple chromosomal copies of heterologous genes in other bacteria. Others have achieved multicopy integrants through forcing multiple, tandem integrations of suicide or conditionally replicating plasmids by increasing selection for a plasmid-encoded antibiotic resistance gene (37). This method utilizes Campbell-type integration to amplify the integrated plasmid carrying a gene of interest and has been previously performed with L. lactis (6, 16, 17). Leenhouts et al. (15) modified this method to enable food-grade delivery of multiple gene copies by using sucrose as the selection system. Using this technique, these authors were able to insert up to 20 copies of the plasmid into the chromosome. However, the integrated plasmids were lost at a rate of 7.5 x 102 to 15 x 102 copies per generation (15). Intron-mediated delivery does not rely on homologous recombination to achieve multicopy integration, and unlike tandem integration of plasmids, the delivered genes were stable over 120 generations, as was previously observed for singly integrated introns (8).
There are relatively few reports that show the impact of the copy number of chromosomal-borne heterologous genes on the overall expression achieved from those strains. Srinivasan et al. (31) delivered up to three copies of organophosphohydrolyase into the chromosome of Ralstonia eutropha and observed a linear increase in protein expression as the copy number increased. Wang et al. (35) optimized expression of a heterologous keratinase gene (kerA) through multiple, random integrations into the Bacillus licheniformis genome by use of a suicide plasmid. Interestingly, those authors found that the highest level of secreted keratinase was achieved with strains containing between three and five integrated kerA gene copies, while strains possessing higher numbers exhibited lower levels of keratinase production. We observed an analogous result in this work, in which reduced GFP expression was demonstrated for strains containing more than six tra904 alleles possessing the intron-GFP cassette.
Given the increased use of lactococcal strains as oral delivery vehicles for prophylactic or therapeutic proteins, versatile tools for both chromosomal stabilization and optimal protein expression are needed. The lactococcal group II intron has already been shown to facilitate stable chromosomal gene delivery without selection. By targeting repeated elements in lactococci, this work demonstrates how these introns can be used to achieve a desired copy number, thereby facilitating protein overexpression aims. Given that group II intron invasion is operable in various bacterial and even mammalian environments (9), the general strategy of targeting repeated elements should be a viable method to construct production strains that harbor a specified number of chromosomally borne expression cassettes in other organisms as well.
| ACKNOWLEDGMENTS |
|---|
D.A.M. thanks Sharon Clark for technical assistance. David Sela is acknowledged for his assistance with the manuscript preparation.
| FOOTNOTES |
|---|
Supplemental material for this article may be found at http://aem.asm.org/. ![]()
Present address: Department of Food Science, Box 7624, North Carolina State University, Raleigh, NC 27695-7624. ![]()
Presnet address: University of California at Irvine, Irvine, CA 92697. ![]()
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| J. Bacteriol. | Microbiol. Mol. Biol. Rev. | Eukaryot. Cell | All ASM Journals |
|---|