Applied and Environmental Microbiology, November 2008, p. 7094-7097, Vol. 74, No. 22
0099-2240/08/$08.00+0 doi:10.1128/AEM.01378-08
Copyright © 2008, American Society for Microbiology. All Rights Reserved.
Department of Medical Microbiology, University Medical Center Utrecht, Utrecht, The Netherlands
Received 19 June 2008/ Accepted 21 September 2008
An additional method to identify CC17 E. faecium-acquired genes or gene clusters (GeCs) in a genome is base composition analysis. At the time of transfer, horizontally acquired genes often differ from the recipient genome in codon usage, GC percentage, and dinucleotide frequencies, since they share these characteristics with the DNA of the bacterium from which they originated (8, 9). A recently described Web-based tool for the detection of horizontally transferred genes and GeCs is the δρ-Web model (16). The δρ-Web model allows whole-genome composition analysis to visualize anomalous DNA in a prokaryotic genome based on differences in both GC percentages and dinucleotide frequencies. Horizontally acquired genes or GeCs, such as genomic islands (GIs), often encode accessory functions, such as additional metabolic activities and antibiotic resistance, or functions involved in microbial fitness, symbiosis, or pathogenesis (1, 4). In this study, we used the δρ-Web model as an initial, quick screen of the genome of E. faecium DO, a CC17 E. faecium strain, to identify anomalous GeCs that may have contributed to increased fitness and enhanced survival of CC17 E. faecium. In addition, PCR and dot blotting were performed on a large set of E. faecium isolates to determine whether these anomalous GeCs were CC17 specific.
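The kind of composition screen described above can be illustrated with a minimal Python sketch. It computes the GC percentage of a sequence and a strand-symmetrized dinucleotide relative abundance difference in the sense of Karlin (written δ* here), then scans a sequence in nonoverlapping windows. This is not the δρ-Web implementation; the function names, the window size, the handling of absent bases, and the toy sequence are all assumptions made for illustration.

```python
from collections import Counter
from itertools import product

BASES = "ACGT"
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def gc_percent(seq: str) -> float:
    """GC percentage over unambiguous bases (A, C, G, T) only."""
    seq = seq.upper()
    total = sum(seq.count(b) for b in BASES)
    if total == 0:
        return 0.0
    return 100.0 * (seq.count("G") + seq.count("C")) / total


def rho_star(seq: str) -> dict:
    """Dinucleotide relative abundances f_xy / (f_x * f_y), counted on
    the sequence plus its reverse complement (strand symmetrized)."""
    seq = seq.upper()
    rev_comp = seq.translate(COMPLEMENT)[::-1]
    mono, di = Counter(), Counter()
    for strand in (seq, rev_comp):
        for base in strand:
            if base in BASES:
                mono[base] += 1
        for i in range(len(strand) - 1):
            x, y = strand[i], strand[i + 1]
            if x in BASES and y in BASES:
                di[x + y] += 1
    n_mono, n_di = sum(mono.values()), sum(di.values())
    rho = {}
    for x, y in product(BASES, repeat=2):
        fx = mono[x] / n_mono if n_mono else 0.0
        fy = mono[y] / n_mono if n_mono else 0.0
        fxy = di[x + y] / n_di if n_di else 0.0
        # Assumption: a pair involving an absent base gets abundance 0.
        rho[x + y] = fxy / (fx * fy) if fx and fy else 0.0
    return rho


def delta_star(seq_a: str, seq_b: str) -> float:
    """Mean absolute difference of the 16 relative abundances."""
    ra, rb = rho_star(seq_a), rho_star(seq_b)
    return sum(abs(ra[k] - rb[k]) for k in ra) / 16


def scan(genome: str, window: int) -> list:
    """(start, GC %, delta* versus the whole sequence) for each
    nonoverlapping full-length window."""
    rows = []
    for start in range(0, len(genome) - window + 1, window):
        frag = genome[start:start + window]
        rows.append((start, gc_percent(frag), delta_star(frag, genome)))
    return rows


# Toy sequence: an AT-rich background with a GC-only insert at 800-1000.
toy = "AATTCAGT" * 100 + "GGCCGCGC" * 25 + "AATTCAGT" * 100
for start, gc, delta in scan(toy, 200):
    print(start, round(gc, 1), round(delta, 3))
```

In the toy sequence the inserted segment stands out on both measures at once, which is the pattern such a screen looks for; real genomes give far noisier profiles, so fragments flagged this way are candidates that need experimental confirmation.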
For analysis with the δρ-Web model, an in silico concatenated genome sequence was created by linking all contigs larger than 2,000 bp (n = 41), encompassing 64% of the whole genome, in order from large to small. After submission, the concatenated genome sequence was divided into nonoverlapping fragments of 10,000 bp, as recommended by the user guidelines supplied at http://deltarho.amc.uva.nl. The difference in dinucleotide frequency (δ* value) between each fragment and the complete sequence and the GC percentage of each fragment were calculated. The model identified five fragments with both a high δ* value and an aberrant GC percentage compared to the average genome values for the concatenated E. faecium DO contigs. These fragments represented sequences located in contigs 608, 624, 638, 654, and 656. Contig 656 was previously identified as a contig harboring many CC17-specific genes (13). Therefore, we chose to focus on the fragments located in the other four contigs. For each fragment, one gene (orf877, orf1155, orf1482, and orf2303 of contigs 608, 624, 638, and 654, respectively) was chosen, and the presence of this gene was assessed by PCR and dot blotting on chromosomal DNA from 134 E. faecium isolates (41 CC17 and 93 non-CC17 isolates; see Table S1 in the supplemental material). The preparation of chromosomal DNA and dot blotting were performed as described previously (7). The primers used for PCR and for the generation of DNA probes are listed in Table S2 in the supplemental material. E. faecium DO was used as a positive control and E. faecalis V583 as a negative control (15). PCR (data not shown) and dot blotting revealed that one of the four genes, orf1482 on contig 638, was specific to CC17 E. faecium (Table 1). This gene was detected in 97.56% (40/41) of the CC17 E. faecium isolates and in only 4.30% (4/93) of the non-CC17 E. faecium isolates (P < 0.0001; Fisher's exact test). orf1482 encodes a putative transcriptional regulator belonging to the AraC family. Transcriptional regulators of the AraC family are widespread among bacteria and regulate genes with diverse functions, ranging from carbon metabolism and stress response to pathogenesis (3, 14). Since AraC-type transcriptional regulators are often found close to or on GIs (5), the presence or absence of genes located upstream and downstream of orf1482 was determined.
This revealed that all isolates that contained orf1482 also contained a set of seven genes just upstream of orf1482, while isolates lacking orf1482 also lacked this set of genes, indicating that the araC-like gene is located in an 8.5-kb GeC, which is specifically enriched in CC17 E. faecium (Table 1). This means that the genes of the GeC may also serve as a marker to distinguish CC17 E. faecium strains from other E. faecium strains. orf1474 and orf1483, flanking this 8.5-kb GeC, belong to the E. faecium core genome. The four non-CC17 E. faecium isolates that harbor this GeC represent two hospital outbreak isolates (E300 and E1679) and two clinical isolates (E1172 and E1721). The single CC17 E. faecium isolate that does not harbor this GeC represents a clinical isolate (E1263).
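The reported prevalence comparison for orf1482 (40 of 41 CC17 isolates versus 4 of 93 non-CC17 isolates) can be rechecked with a short, dependency-free sketch of a two-sided Fisher's exact test. The function name and the two-sided convention used here (summing all tables no more probable than the observed one) are assumptions; the software and settings actually used in the study are not stated.

```python
from math import comb


def fisher_exact_two_sided(a: int, b: int, c: int, d: int) -> float:
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of every table with the same
    margins that is no more probable than the observed table.
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x: int) -> float:
        # Probability that the top-left cell equals x, margins fixed.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    lo, hi = max(0, col1 - row2), min(row1, col1)
    p_obs = prob(a)
    # Small relative tolerance guards against floating-point ties.
    total = sum(prob(x) for x in range(lo, hi + 1)
                if prob(x) <= p_obs * (1 + 1e-9))
    return min(1.0, total)


# Counts taken from the text: rows are CC17 / non-CC17 isolates,
# columns are orf1482 present / absent.
cc17_pos, cc17_neg = 40, 41 - 40
non_pos, non_neg = 4, 93 - 4

print(round(100 * cc17_pos / 41, 2))  # percentage of CC17 isolates
print(round(100 * non_pos / 93, 2))   # percentage of non-CC17 isolates
p_value = fisher_exact_two_sided(cc17_pos, cc17_neg, non_pos, non_neg)
print(p_value < 0.0001)
```

Exact integer binomial coefficients keep the calculation stable for a table of this size; for much larger counts a statistics library would be the more practical choice.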
TABLE 1. Prevalence of genes in CC17 and non-CC17 E. faecium isolates as determined by dot blotting
FIG. 1. Genomic organization of the 8.5-kb GI (E. faecium DO contig 638) specifically enriched in CC17 E. faecium. The direction of transcription is indicated by arrows. The gray arrows represent the genes which belong to the GI, and the black arrows represent the flanking genes. The numbers below the arrows indicate gene sizes. Direct (dashed boxes) and inverted (black boxes) repeats were found at positions 7184 and 16391 and positions 7061 and 16803, respectively. Open reading frame numbers are indicated in italics. a, nucleotide reference position relative to that of the E. faecium DO contig 638 sequence (GenBank accession no. AAAK03000019).
TABLE 2. Identities of the predicted proteins encoded by the GI specifically enriched in CC17 E. faecium as determined by BLAST
ΔG values of –8.11 kcal/mol and –14.01 kcal/mol, respectively. This suggests the presence of transcriptional terminators at these sites and that orf1475 to orf1481 are part of a single operon. To confirm this, reverse transcription-PCR was performed with cDNA by using gene-specific primer pairs (see Table S2 in the supplemental material) designed to span the entire region, resulting in overlapping amplification products. Products of the expected size were observed with primer pairs covering orf1475 to orf1481, showing that these genes are cotranscribed in a single operon and that orf1474 and orf1482 are not part of the operon (Fig. 2A). In addition, promoter mapping of orf1481 and orf1482 was performed using 5' rapid amplification of cDNA ends (Invitrogen Corp.) according to the manufacturer's instructions. Total RNA from E. faecium DO was isolated and reverse transcribed using primers 1481R and 1482R (see Table S2 in the supplemental material). The subsequent PCR was performed using primers 1481R2 and 1482R2 and the abridged anchor primer provided with the system. Sequencing of the PCR products revealed that two transcriptional start sites were located in the orf1481-orf1482 intergenic region (Fig. 2B). Direct repeats were found between the two promoters (P1 and P2), which may represent a putative binding site of a transcriptional regulator protein (2).
FIG. 2. Transcriptional analysis of the GI specifically enriched in CC17 E. faecium. (A) Cotranscription of orf1475 to orf1481, demonstrated by using primer pairs designed to span the entire region, resulting in overlapping amplification products. The molecular size marker is the 1-kb ladder (Invitrogen Corp.). (B) Intergenic region of orf1481-orf1482, with the start codons and orientations of orf1481 and orf1482 in bold and indicated by arrows below the sequence. Transcriptional start sites and directions are in bold, underlined and indicated by arrows above the sequence. Putative –35 and –10 boxes are underlined and in italics. Direct repeats, representing a putative binding site of a transcriptional regulator protein, are between the two promoters (P1 and P2) and are underlined.
FIG. 3. Analysis of the insertion site of the GI specifically enriched in CC17 E. faecium. (A) Schematic representation of insertion of the GI in the intergenic region from orf1474 to orf1483, resulting in deletion of a 108-bp fragment (dashed line). The primers used to analyze the insertion site are indicated by small gray arrows. Numbers indicate the start and stop positions of orf1474 and orf1483 and the start position of the primers. The sizes of the two amplicons, with and without GI insertion, the GI, and the deleted fragment are indicated. Open reading frame numbers are indicated in italics. (B) PCR was performed with one CC17 E. faecium isolate (E1162), four non-CC17 E. faecium isolates that harbor the gene cluster (E300, E1172, E1679, and E1721), one non-CC17 E. faecium isolate (E980), and the single CC17 E. faecium isolate that does not harbor the GI (E1263). The molecular size marker is the 1-kb ladder (Invitrogen Corp.).
Using the δρ-Web model, PCR, and dot blotting, we identified a GI tentatively encoding a novel metabolic pathway involved in carbohydrate transport and metabolism. Our finding that all CC17 E. faecium isolates but one harbor this island and that none of the non-CC17 E. faecium human surveillance, environmental, and animal isolates harbors it indicates that this GI was acquired by CC17 E. faecium via horizontal transfer. We hypothesize that this GI may provide CC17 E. faecium with a competitive advantage over the indigenous commensal E. faecium flora by enabling CC17 E. faecium to effectively colonize the gastrointestinal tracts of hospitalized patients.
Published ahead of print on 3 October 2008.
Supplemental material for this article may be found at http://aem.asm.org/.
δρ-Web, an online tool to assess composition similarity of individual nucleic acid sequences. Bioinformatics 21:3053-3055.