Previous Article | Next Article ![]()
Applied and Environmental Microbiology, April 2009, p. 2284-2293, Vol. 75, No. 8
0099-2240/09/$08.00+0 doi:10.1128/AEM.02621-08
Copyright © 2009, American Society for Microbiology. All Rights Reserved.
,
T. Sabari Sankar, and
Karin Schnetz*
Institute for Genetics, University of Cologne, Zülpicher Strasse 47, 50674 Cologne, Germany
Received 17 November 2008/ Accepted 10 February 2009
|
|
|---|
|
|
|---|
In E. coli, several loci for the utilization of β-glucosides have been characterized. These include the "cryptic" bgl, asc, and arbT loci, as well as the constitutively expressed bglA gene. Among these, the bgl operon is the best characterized (29, 36). The bgl operon is repressed by the nucleoid-associated protein H-NS (10, 19, 24, 34). In the laboratory strain K-12, silencing of bgl by H-NS can be relieved by various mutations, which arise quickly on indicator plates. Consequently, expression of bgl becomes inducible by substrate (20, 24). The bgl operon is present in the majority of E. coli strains and functional in most of them. Interestingly, silencing of the bgl operon is less strict in uropathogenic E. coli and related strains when they are grown at 37°C (31). In contrast, Asc (arbutin, salicin, cellobiose)-positive mutants arise only after prolonged incubation for 4 to 5 weeks (13, 25). In the asc mutants, the ascG gene is disrupted by an IS186 insertion. AscG is a repressor of the divergent ascFB operon, which encodes a phosphoenolpyruvate-dependent phosphotransfer system (PTS) enzyme II permease for arbutin, salicin, and cellobiose and a phospho-β-glucosidase, respectively (13). The constitutively expressed bglA gene encodes an arbutin-specific phospho-β-D-glucosidase (27), and the cryptic arbT locus encodes an arbutin-specific enzyme II permease of the PTS (18). The arbT locus has not been mapped but may be identical to glvCBG, which putatively encodes β-D-glucoside-specific enzyme IIB and IIC subunits and a phospho-β-D-glucosidase (22). Furthermore, multiple mutations in the chb operon, which is an inducible N,N'-diacetylchitobiose system, can convert chb to a cellobiose-specific system (14a, 15).
In this report, we describe an additional β-glucoside operon (named bgc for aryl-β-D-glucosides and cellobiose) which is prevalent in E. coli. The bgc locus was identified when we observed that an E. coli septicemia strain (i484) in which the bgl operon was deleted yielded β-glucoside-positive mutants after a few days of incubation on salicin plates at low temperature. The locus was mapped, and the mutations causing activation were characterized. In addition, evolution of the locus was addressed by phylogenetic studies and typing of a representative collection of pathogenic and commensal E. coli strains.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Strains and plasmids used in this study
|
β-Galactosidase assay.
For enzyme assays, cells were grown in M9 medium containing 0.4% (wt/vol) glycerol or glucose, 0.66% (wt/vol) Casamino Acids (Difco), and 1 µg/ml vitamin B1 or in LB medium (Difco), as indicated. Cultures were inoculated to an optical density at 600 nm (OD600) of 0.1 to 0.15 from fresh overnight cultures and grown in the same medium to an OD600 of 0.5 at 37 or 28°C, as indicated. Salicin was added at a 0.2% final concentration to the overnight and exponential cultures where indicated. β-Galactosidase assays were performed as described previously (9, 23). Enzyme activities were determined at least three times from at least two independent transformants or integration derivatives. Standard deviations were less than 10%.
Bioinformatic and phylogenetic analyses.
For the analysis of the prevalence of bgc in E. coli (including Shigella strains) the nucleotide sequence of the entire bgc locus, including the flanking core genes marB and ydeD, of strain CFT073 was used as a query to search the NCBI nonredundant nucleotide database with BLASTN. The sequences of the marB-bgc-ydeD and marB-ydeD loci were then extracted from the published E. coli and Shigella genome sequences, as well as from E. albertii, and used for structural comparison, as well as for phylogenetic analysis. For the latter, the sequences were aligned and a neighbor-joining (NJ) tree was constructed with MEGA4 (37). In strains in which the bgc locus is disrupted by insertion elements, these sequences were manually removed. To correlate the phylogeny of the bgc locus with the species phylogeny, the sequences of the seven multilocus sequence typing (MLST) loci (41) were also extracted from the genome sequences, concatenated, and used to construct an NJ tree with MEGA4. To identify homologues of the genes encoded by the bgc locus, the deduced protein sequences from E. coli CFT073 were used as queries to search the NCBI microbial genome database, as well as the NCBI nonredundant nucleotide database, with TBLASTN. In parallel, the UniProt database (38) was searched for homologues with BLASTP. Similar results were obtained in these searches.
Typing of the bgc locus.
Typing for the presence of bgc was performed by PCR with oligonucleotides S693 (AACGTGACAACGTCACTGAGGCAAT; specific for the flanking gene marB), S694 (AACGGTCAGCATGTGGCGATG; specific for ydeD), and S695 (TGAAATCGCCAGTATTTTACGGATCAG; specific for bgcR). The PCR fragments were sequenced to confirm specific amplification and to analyze for sequence variations. Strains which yielded a PCR fragment of 770 bp with oligonucleotides S694 and S695 carry the bgcR gene and were assigned to bgc type I. Strains from which a specific 690-bp PCR fragment could be amplified with primers S693 and S694 lack bgc and were assigned to type II. These types were color coded and mapped onto a minimal spanning tree representing the population structure of a representative E. coli collection based on MLST as described previously (31, 41).
|
|
|---|
bgl strain (KEC93) was salicin and arbutin negative at 37 and 28°C, as expected. Surprisingly, though, i484
bgl yielded salicin-positive papillae on BTB salicin indicator plates after 5 days of incubation at 28°C but not at 37°C (Fig. 1). Four of the salicin-positive papillae were restreaked, and these four mutants (KEC131, KEC132, KEC133, and KEC134) were salicin positive at 28°C but negative at 37°C (Fig. 1; see Table 3), indicating the existence of an additional system for β-glucoside utilization whose expression can be activated by mutations.
![]() View larger version (18K): [in a new window] |
FIG. 1. Identification of the cryptic bgc (β-glucoside and cellobiose) operon in E. coli i484. (A) E. coli strain i484 bgl (KEC93), which is a derivative of septicemia strain i484 (16), yields salicin-positive (Sal+) mutants at 28°C. Mini-Tn10-Catr transposon mutagenesis of four spontaneous salicin-positive mutants (KEC131 to KEC134) was performed to screen for salicin-negative mutants. All seven independent salicin-negative transposon mutants which were characterized carried an insertion in the c1955-to-c1959 (bgc) locus. (B) Structure of the bgc (CFT073_c1955 to c1960) locus, which comprises of the bgcEFIHA operon and the divergent bgcR regulator gene. The bgc-encoded proteins are homologous to proteins specific for aryl-β-D-glucoside and cellobiose transport and metabolism, as indicated. The positions of the mini-Tn10-Catr insertions are shown by vertical lines labeled with strain numbers; the arrowheads indicate the orientation of the cat gene carried by the transposon.
|
|
View this table: [in a new window] |
TABLE 3. β-D-Glucoside phenotypes of the strains used in this study
|
|
View this table: [in a new window] |
TABLE 2. E. coli bgc-encoded proteins and homologues in other species
|
bgc encodes proteins for the utilization of aryl-β-D-glucosides and cellobiose.
The function of the bgc locus was further analyzed by comparing the phenotypes of the wild-type and mutant bgc strains on β-glucoside indicator plates at 28 and 37°C (Table 3). Strain KEC93, carrying the wild-type bgc locus, had a negative phenotype on all of the β-glucosides tested, including salicin, arbutin, and cellobiose (Table 3). The phenotype of the four mutants (KEC131 to KEC134) was positive on all of the β-glucosides tested at 28°C, while at 37°C the bgc* mutants were salicin and cellobiose negative but remained weakly arbutin positive (Table 3). The bgc::mini-Tn10-Catr insertion mutations mapping in genes bgcE, bgcF, and bgcI, which encode EII permease subunits, were β-glucoside negative on all three sugars (Table 3). Interestingly, bgcH::mini-Tn10-Catr mutants KEC143 and KEC144 remained weakly arbutin positive at 28°C and at 37°C (Table 3). The salicin- and cellobiose-negative phenotype suggests that this mutation has a polar effect on the expression of bgcA. Therefore, the arbutin-positive phenotype is likely due to the constitutively expressed bglA gene, which encodes an arbutin-specific phospho-β-D-glucosidase and maps elsewhere in the genome (27, 39). In comparison to the phenotype conferred by the "activated" bgc locus, isogenic strain KEC2, carrying an activated bgl operon, was strongly salicin and arbutin positive after 1 day of incubation at 28°C and at 37°C. However, strain KEC2 was cellobiose negative, as expected, since the bgl operon does not encode cellobiose-specific enzymes (Table 3). Taken together, the data suggest that the bgc locus encodes proteins for the uptake and hydrolysis of aryl-β-D-glucosides and cellobiose at low temperature. In comparison to the phenotype conferred by the bgl operon, that conferred by the bgc locus is weaker. To further address the role of bgc, we analyzed whether activation of bgc allows growth on minimal plates containing arbutin, salicin, or cellobiose as the sole carbon source. The bgc* mutants (KEC131 to KEC134), but not strain KEC93 carrying the wild-type bgc locus, grew moderately well on arbutin minimal plates at 28°C. On salicin and cellobiose plates, the growth of the bgc* mutants was poor but distinguishable from that of the wild-type bgc strain, supporting the conclusion that the bgc operon encodes proteins for the utilization of aryl-β-D-glucosides and cellobiose.
A point mutation in the promoter region activates the bgc operon.
To characterize the mutation causing spontaneous activation of the bgc operon, the putative regulatory region between bgcE (c1959) and bgcR (c1960) and the bgcR gene were sequenced from parent strain KEC93 and from the four spontaneous β-glucoside-positive mutants (KEC131 to KEC134). The sequence of wild-type bgc strain i484
bgl was identical to the published sequence of CFT073. Furthermore, all four independent spontaneous mutants carry the identical single point mutation of G/C to A/T at nucleotide position –67 relative to the AUG of bgcE (Fig. 2). This mutation maps within a sequence that matches the consensus binding sequence of the catabolite regulator protein (CRP), where it affects a less conserved position (12) (Fig. 2). To exclude the possibility that the bgc* mutants (KEC131 to KEC134) acquired mutations in addition to the single point mutation within the promoter region, the sequence of the whole bgc operon was determined for parent strain i484 and bgc* mutant KEC133. The sequences of the bgc operon of i484 and its bgc* mutant were identical. In comparison to the sequence of CFT073, one synonymous single nucleotide exchange was detected which changes the 58th valine codon of bgcH from GTG to GTA. This confirms that the single point mutation in the promoter region activates the bgc operon.
![]() View larger version (21K): [in a new window] |
FIG. 2. Activities of the wild-type (wt) bgc and mutant bgc* promoters in E. coli K-12. (A) A single-nucleotide C/G-to-T/A exchange at position –67 relative to the putative translation start of bgcE causes activation of bgc in bgc* mutants KEC131 to KEC134. The mutation affects a nonconserved base within a putative CRP-binding site (framed, matches to the CRP consensus sequence are shown in bold or underlined). A putative –10 box is underlined. The distances of the putative –10 box and the CRP-binding site suggest that this is a class II CRP-dependent promoter, with the CRP-binding site centered at position –41.5 relative to the transcription start. (B and C) The bgcEFIHA promoter Pbgc of the wild-type strain (pKEGN46 and pKEGN51) and Pbgc* of one of the mutants (pKEGN48 and pKEGN52) were fused to lacZ without or together with the divergent bgcR gene. The β-galactosidase activity of transformants of E. coli K-12 strain S541 grown in M9 medium with either glycerol or glucose as a carbon source or in LB was determined. The transformants of crp (S996) and hns (S614) mutants, respectively, were grown in LB.
|
bgl
lacZ), and the β-galactosidase activities directed by these plasmids were determined in cultures grown to the mid-exponential phase (OD600 = 0.5) at 37°C. Transformants of the plasmid carrying the wild-type bgc regulatory region fused to lacZ, pKEGN46, expressed 2,995 U of β-galactosidase activity when grown in minimal M9 medium with glycerol as a carbon source (Fig. 2B). The β-galactosidase activity decreased about threefold (1,020 U) when cells were grown in minimal M9 medium with glucose (Fig. 2B), indicating catabolite regulation of bgc. In an isogenic crp mutant (S996) grown in LB medium, the β-galactosidase activity was sixfold lower (380 U) than that of the wild type (2,465 U) (Fig. 2B), which confirms catabolite control of the bgc promoter. Similar results were obtained with plasmid pKEGN51, which carries the wild-type bgc regulatory region and, in addition, the divergent bgcR gene (Fig. 2B, pKEGN51). Transformants of E. coli K-12 with the bgc*-lacZ fusion (pKEGN48) carrying the single point mutation expressed eightfold higher levels of β-galactosidase than the wild-type bgc-lacZ fusions in all of the media tested (Fig. 2C). Interestingly, in the crp mutant, expression decreased to 318 U, i.e., to levels similar to those directed by the wild-type bgc-lacZ fusion, suggesting that the expression of the mutant remains CRP dependent. The presence of the divergent bgcR gene again had no effect on the expression level (Fig. 2C). These data show that the single point mutation enhances the activity of the bgc promoter and that the bgc promoter is CRP dependent.
Furthermore, we tested whether the histone-like nucleoid structuring protein H-NS, which is crucial for silencing of the bgl operon (7, 10), has a role in the regulation of bgc. To this end, the β-galactosidase expression directed by the bgc and bgc* reporter plasmids was also determined in an hns::Amp null mutant (S614) isogenic to wild-type K-12 strain S541 (
bgl
lacZ) (Fig. 2). However, a mere 1.5-fold difference in activity compared to that of the wild-type strain was observed, suggesting that H-NS has no significant role in the regulation of bgc.
To further address the role of BgcR, the expression level directed by the bgc-lacZ fusion was determined in E. coli strain KEC93 and its bgc* mutant derivative KEC132 (Fig. 3). To this end, cells were grown to mid-exponential phase at 28°C in minimal M9 glycerol medium and salicin and cellobiose were added where indicated (Fig. 3). The wild-type bgc promoter-lacZ fusion directed the expression of 885 U of β-galactosidase activity, and the addition of salicin or cellobiose had no effect (Fig. 3A). The activity was similar when the plasmids also carried the bgcR gene (Fig. 3B). With bgc* mutant KEC132, similar results were obtained when no sugar was added. Interestingly, in the presence of salicin, the expression of β-galactosidase activity increased from 935 to 8,220 U when the bgcR gene was also encoded on the reporter (Fig. 3B), while the addition of cellobiose had no effect (Fig. 3B). The bgc*-lacZ fusions again directed about eightfold higher levels of β-galactosidase activity (8,310 U) than the wild-type bgc-lacZ fusions (Fig. 3C). However, the expression level directed by the bgc*-lacZ fusion also containing the bgcR gene increased only 1.5-fold upon the addition of salicin in the bgc* background (Fig. 3D). These data suggest that BgcR is a positive regulator of the bgc promoter, which can be induced by salicin in a bgc* background, and that the point mutation enhances the promoter activity so that it is almost independent of activation by BgcR. Further, an expression analysis of cells grown at 37°C was also performed and the n-fold differences in expression level were similar, although the values per se were increased approximately threefold (not shown). This demonstrates that the BgcR-dependent induction by salicin in the bgc* background is independent of the temperature and suggests that expression of the operon may be temperature regulated.
![]() View larger version (16K): [in a new window] |
FIG. 3. Induction of the bgc promoter by salicin in a bgc* background. Wild-type bgc strain i484 bgl and its bgc* (Bgc-positive) derivative KEC132, as well as bgl+ strain KEC2, were transformed with plasmids carrying Pbgc and Pbgc* promoter-lacZ fusions. β-Galactosidase activities of cultures grown in M9 glycerol medium without (–) or with the β-D-glucoside salicin (Sal) or cellobiose (Cel) at 0.2% at 28°C were determined.
|
Phylogeny of bgc and prevalence in E. coli.
As bgc is present in E. coli CFT073 and i484 but not in E. coli K-12, the locus belongs to the variable gene pool of E. coli. To characterize the prevalence of bgc in E. coli, including Shigella species, we searched the NCBI sequence database and typed a representative collection of E. coli strains (31) for the presence of bgc.
The NCBI nonredundant nucleotide sequence database was searched by BLASTN for bgc by using the nucleotide sequence of the whole bgc locus and its flanking core genome genes from CFT073. This search yielded highly significant hits in E. coli and Shigella genomes. The bgc locus was found in 16 of 23 E. coli and Shigella genome sequences tested which were analyzed in further detail (Fig. 4). In seven genomes, all of the genes of the bgc locus are present; in two genomes, frameshift mutations disrupt a gene of the bgc locus; and in seven genomes (including those of four Shigella strains), the bgc locus is disrupted by insertions and/or deletions (Fig. 4). No bgc homologues were found in the remaining seven genomes, including those of E. coli K-12 strain MG1655, 101-1, 53638, and O157:H7; Shigella flexneri 5 and 301; and Shigella dysenteriae Sd197. Furthermore, the complete bgc locus is also present in the closely related species E. albertii (Fig. 4).
![]() View larger version (13K): [in a new window] |
FIG. 4. Structure of the bgc locus in E. coli and E. albertii. Schematically shown are the structures of the chromosomal marRAB-bgc-ydeDF region and the marRAB-ydeDF region, respectively, as extracted from genome sequences of E. coli strains (including Shigella), as well as E. albertii. Genes of the bgc locus which are disrupted by mutations are shown in gray. Deletions and insertion sites of IS elements are indicated. In strains B7A and E110019, bgcR is annotated as a pseudogene; however, the gene is intact compared to the other strains. Not indicated in the scheme is that strain 53638 carries a frameshift in ydeD. The sequences of the schematically shown loci were extracted from NCBI RefSeq entries NC_000913.2, NZ_AAMK00000000, NC_002695.1, NZ_AAKB00000000, NC_004337.1, NC_007606.1, NC_004431.1, NC_007946.1, NC_008563.1, NC_010498.1, NC_009801.1, NZ_AAJT00000000, NZ_AAJW00000000, NZ_AAJV00000000, NC_009800.1, NZ_AAJX00000000, NC_008253.1, NZ_AAJU00000000, NZ_AAMJ00000000, NC_007613.1, NC_010658.1, NC_007384.1, and NZ_ABKX00000000.
|
![]() View larger version (9K): [in a new window] |
FIG. 5. Comparison of the phylogeny of the bgc operon with the phylogeny of E. coli. The sequence of the bgc operon was extracted from the genome sequences of the strains indicated, aligned, and analyzed by an NJ tree. For comparison, the sequences of the seven MLST loci were also extracted from the genome sequences, concatenated, and similarly used for NJ tree construction. The types of the bgc operon are indicated by + for present, – for absent, and (+) for a bgc locus disrupted by mutation.
|
![]() View larger version (20K): [in a new window] |
FIG. 6. Distribution of the bgc locus within a representative collection of E. coli strains represented by a minimal spanning tree. A representative collection of E. coli strains was typed for the presence or absence of bgc by PCR and sequencing. Presence and absence were color coded and mapped onto a minimal spanning tree based on MLST analysis as described previously (41). Strains which carry bgc (in intact or mutated form) are in light gray, while strains which lack the locus are in dark gray. Strains representing presumably a second E. coli population which diverged early in evolution are encircled. The STs are indicated, and the size of the circle correlates with the number of strains by which an ST is represented. The phylogenetic groups to which strains of specific STs belong are designated A (next to ST10 and other STs), B1 (next to ST23, ST88, and ST56), D (next to ST38, ST69, and ST405), and B2 (next to ST95, ST73, and related STs).
|
|
|
|---|
In addition to bgc, other apparently cryptic β-glucoside systems have been characterized in E. coli. These include the well-studied bgl operon, as well as the asc and arbT/glvCBG loci (see introduction). The bgl operon also belongs to the variable gene pool of E. coli, and bgl genes were presumably acquired by horizontal transfer from gram-positive bacteria during the evolution of the family Enterobacteriaceae (31). However, the regulation of the bgc operon and that of the bgl operon are strikingly different. Regulation of bgl includes silencing by H-NS and substrate-specific control by transcriptional antitermination. The bgc operon is presumably not regulated by H-NS, and no sequence motifs important for regulation by antitermination are present within the operon. Interestingly, the bgl operon is likewise most conserved in extraintestinal pathogens and silencing of bgl is less strict in approximately half of the strains belonging to this group (31). This suggests that utilization of β-glucosides may provide an advantage in a habitat occupied by these bacteria.
The bgcEFIHA operon encodes five proteins, including enzyme IIB, IIC, and IIA subunits of a β-glucoside-specific enzyme II PTS permease, a putative β-glucoside-specific porin of the LamB family (BgcH), and phospho-β-D-glucosidase BgcA. BgcA or expression of bgcA is presumably temperature sensitive. Mutants that carry an activated bgc* locus are salicin and cellobiose positive at 28°C, but not at 37°C. However, they are (weakly) arbutin positive at 28°C and 37°C, presumably due to the activity of the constitutively expressed bglA gene, which encodes an arbutin-specific phospho-β-D-glucosidase (27, 39).
BgcR, which is encoded upstream of the bgcEFIHA operon, is likely to be a positive regulator, since the expression of a bgc promoter-lacZ fusion containing bgcR is induced by salicin in a strain background which carries an activated bgc* locus. However, it remains unclear why induction requires bgcR to be encoded in cis to the bgc promoter and how addition of salicin induces expression. The presence of intracellular phospho-salicin by transport of the bgl-encoded enzyme II permease in strain i484 (KEC2) does not lead to induction of the bgc promoter. However, it is possible that a different substrate induces the expression of bgc. Similarly, it is surprising that the four independent bgc* mutants (which were picked as papillae from different colonies) carry the identical point mutation. If this mutation indeed renders the bgc promoter independent of activation by BgcR, other mutations should also activate bgc. It is possible, though, that this single point mutation has the strongest effect and was thus picked four times independently.
The reason for the presence of cryptic β-glucoside systems in E. coli remains unclear. It has been speculated that silencing of bgl is to prevent the utilization of toxic aryl-β-D-glucoside substrates (29). However, β-glucoside systems in other members of the family Enterobacteriaceae such as Erwinia and Klebsiella species, as well as in gram-positive bacteria, are not cryptic, which may indicate that silencing or the cryptic state of β-glucoside systems is specific for the intestinal habitat. It was demonstrated that the bgl operon is maintained in a functional state in strains of phylogenetic group B2 of E. coli, which represents extraintestinal pathogens, while it is absent or subject to erosion in other groups of E. coli strains (31). Interestingly, in strain i484, which belongs to phylogenetic group B2, expression of bgl is induced in vivo (16). This indicates a functional role for β-glucoside utilization in extraintestinal E. coli. Similarly, the bgc operon is most preserved among the strains belonging to phylogenetic group B2. This may indicate that the presence of a locus for the utilization of β-glucosides at low temperature provides an advantage for strains belonging to this group.
We thank Vartul Sangal and Mark Achtman for help in the analysis of the clonal distribution of bgc and for discussions.
Published ahead of print on 20 February 2009. ![]()
Supplemental material for this article may be found at http://aem.asm.org/. ![]()
Present address: Section of Infectious Diseases, Department of Internal Medicine, Yale University School of Medicine, 300 Cedar Street, New Haven, CT 06520. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»