Previous Article | Next Article ![]()
Applied and Environmental Microbiology, July 2007, p. 4226-4233, Vol. 73, No. 13
0099-2240/07/$08.00+0 doi:10.1128/AEM.00455-07
Copyright © 2007, American Society for Microbiology. All Rights Reserved.

Department of Microbiology, University of Illinois, Urbana, Illinois 61801
Received 27 February 2007/ Accepted 1 May 2007
|
|
|---|
|
|
|---|
Gupta et al. (7) used pulsed-field gel electrophoresis to show that the B. uniformis element carrying ermB was integrated into the chromosome and was thus probably a conjugative transposon (CTn). The CTn was named CTnBST. CTnBST was estimated to be about 100 kbp in size. A 13-kbp region containing ermB was sequenced and compared to sequences then in the databases. The DNA sequence of the ermB gene was identical to genes found in Clostridium difficile. Outside the ermB gene, however, only a few hundred base pairs downstream of ermB had high sequence identity to DNA from gram-positive bacteria. This did not rule out a gram-positive bacterium origin for CTnBST, however, because at that time, relatively few sequences from gram-positive bacterium transmissible elements were available in the databases.
Bacteroides species are gram-negative bacteria that are normal residents of the human colon. They comprise approximately 25 to 30% of the indigenous microflora (5, 14). The other numerically predominant group of bacteria in the colon consists of gram-positive anaerobes, about which little is known (1, 6, 23). These bacteria comprise at least 65% of the human colonic microflora. Bacteroides strains could have picked up an ermB-containing element from these gram-positive colonic bacteria or from oral gram-positive bacteria that are swallowed on a daily basis. Recently, we succeeded in cloning portions of DNA that covered the entire length of CTnBST and used them to locate and sequence the ends and integrase gene of CTnBST (21). The availability of these clones provided an opportunity to ask whether CTnBST had other regions of sequence similarity to DNA from gram-positive bacteria. In the several years since the study by Gupta et al. (7), many more sequences from gram-positive bacteria have been deposited in the databases, so we considered that expansion and further analysis of the CTnBST sequence could well yield more evidence for the origin of CTnBST. Specifically we looked at the % G+C content of CTnBST in conjunction with BLAST analysis to determine if all, or part of, CTnBST has a gram-positive bacterium origin. In this report we describe an analysis of the entire CTnBST sequence.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. List of strains and plasmids used in this study
|
Sequencing of CTnBST.
The four fosmid clones containing clones of CTnBST DNA (21) were digested with EcoRI and HindIII, and the resulting fragments (EcoRI/EcoRI, EcoRI/HindIII, and HindIII/HindIII) were subcloned into pUC19 (24) and sequenced from either end of the multiple cloning site using the BigDye Terminator v3.1 cycle sequencing kit from Applied Biosystems. Sequencing was performed by the Core DNA Sequencing Facility of the W. M. Keck Center for Comparative and Functional Genomics at the University of Illinois at Urbana-Champaign. The primer walking method was used to sequence clones that were too large to sequence with a single read from either end of the clone. The sequence fragments were assembled using the program Sequencher 4.5 (Gene Codes Corporation). Tentative open reading frames (ORFs) were identified by using two Web-based ORF finding programs, GeneMark (2) and FGENESB: Bacterial Operon and Gene Prediction (Softberry Inc.). Identified ORFs were assigned putative functions based on BLAST searches.
Localization of the excision region.
A possible excisionase gene was identified on the basis of the properties of its product (small basic protein) and its proximity to the integrase gene. A small internal region of the putative excisionase gene (
150 bp) of CTnBST was PCR amplified and was cloned with the pGEM-T Easy Vector System I kit from Promega (catalog no. A1360). Unique SacI and SphI restriction sites were engineered into the primers for the PCR. The fragment was subsequently subcloned into the suicide vector pLYL001a (15) between the SacI and SphI restriction sites. The resulting construct was used to transform chemically competent E. coli S17-1 cells (19) to ampicillin resistance. The resulting transformant was used as a donor in a filter mating with a Bacteroides thetaiotaomicron 5482 strain containing CTnBST, BT4021, as the recipient (7). CTnBST can sometimes integrate into multiple locations in the same strain (21). Southern blot analysis had indicated that the strain which was used for the disruption contained only one copy of CTnBST (data not shown). The CTn from the resulting mutant strain, BT4100
xisBST, was unable to excise itself from the chromosome, as tested by PCR to detect the joined ends of the CTnBST element.
Portions of the excisionase region from CTnBST were PCR amplified and were cloned with the pGEM-T Easy Vector System I kit from Promega. The primers used for the amplification are shown in Table 2. The gene was then subcloned into the E. coli/Bacteroides shuttle vector pLYL05 (11). The resulting construct, pLYL05xisBST, was used to transform chemically competent E. coli S17-1 cells (19) to ampicillin resistance. The resulting transformant was used as the donor in a filter mating with strain BT4100
xisBST as the recipient. Transconjugants were then screened for their ability to excise from the chromosome (by PCR). In addition, pLYL05xisBST was transferred to a strain containing the CTnBST minielement, which cannot excise (21), to see if the cloned DNA causes minielement excision.
|
View this table: [in a new window] |
TABLE 2. List of primers used in this study
|
PCR.
Primers used for PCR in this study are shown in Table 2. For the excision assay, a set of primers was designed for PCR amplifying approximately 800 bp of the joined ends of the excised form of CTnBST. PCRs were done using the specifications provided by the manufacturer of Taq DNA polymerase (Invitrogen). Thermocycler parameters were as follows: 95°C for 5 min; 95°C for 30 seconds, 53°C for 30 seconds, and 72°C for 30 seconds for 30 cycles; and 72°C for 5 min. For PCR products longer than 1 kb the extension time (at 72°C) was increased accordingly (30 seconds per 2 kb).
Nucleotide sequence accession number.
The complete sequence of CTnBST has been submitted to the GenBank nucleotide sequence database under accession number AY345595. Approximately 13 kb of CTnBST (bp 46854 to 59763) was previously published under this accession number (7), which has now been corrected and updated.
|
|
|---|
When the deduced amino acid sequences encoded by the 76 ORFs of CTnBST were compared to the corresponding proteins encoded by CTn3-Bf ORFs, an interesting pattern emerged (Fig. 1A). A 17-kb region of CTnBST encoded proteins with no amino acid similarity to any proteins associated with CTn3-Bf. This 17-kb region was localized around the ermB gene. An additional 23-kb region upstream of the 17-kb ermB region had intermittent sequence similarity to CTn3-Bf, but there were several large gaps between the regions of similarity. By contrast, the remainder of CTnBST, comprising the left and right ends of the element, encoded proteins with high levels of amino acid sequence similarity (30 to 95%) to those encoded by the corresponding regions in CTn3-Bf.
![]() View larger version (35K): [in a new window] |
FIG. 1. (A) The percent amino acid identity between the ORFs of CTnBST and CTn3-Bf defines two distinctly gram-positive organism-like regions within CTnBST. The ermB region, as depicted in the graph, is 100% identical on the nucleotide level to the same sequence in Arcanobacterium pyogenes OX-7. The region upstream of the ermB region, labeled as a mixed region, contains a mixture of Bacteroides-like and gram-positive organism-like ORFs. (B) The % G+C content of each ORF in CTnBST is shown. This graph helps to define both functional regions (transfer region) and putative insertions of gram-positive organism-like DNA (in both the mixed region and the ermB region). It should be noted that the average % G+C content of the Bacteroides chromosome is around 42%.
|
|
View this table: [in a new window] |
TABLE 3. List of ORFs from CTnBST that have identical homologs in Arcanobacterium pyogenes OX-7
|
The average % G+C content of CTnBST in the region where it was most similar to CTn3-bf was 48%, which is higher than the average % G+C content of Bacteroides species (42 to 43% G+C). In addition, as the % G+C content of CTnBST was compared along the entire sequence, a significant change occurred within and around the region of A. pyogenes homology. This region had an average of around 36% G+C, much lower than the rest of CTnBST.
Further analysis of the ermB region.
As mentioned above, the ermB region is the region that had 100% DNA sequence identity to a tylosin resistance determinant that carries the ermB gene found in the high-% G+C gram-positive species A. pyogenes (Fig. 2) (9). The identity begins 144 bp into the published A. pyogenes sequence with BST058 in CTnBST (orf181 in the A. pyogenes sequence) and extends all the way through the published A. pyogenes sequence ending with BST070 (Table 3), whose product is most closely related to an aspartate racemase found in Thermosinus carboxydivorans. A detailed description of this intervening region is given by Jost et al. (9).
![]() View larger version (17K): [in a new window] |
FIG. 2. The ermB region of CTnBST and the corresponding region in Arcanobacterium pyogenes OX-7 share 100% identity on the nucleotide level. This identity continues past the published sequence for Arcanobacterium pyogenes OX-7 (shown here) to include the aadE homolog and the flanking partial ORFs (in gray).
|
Comparison with other ermB-containing strains.
Previously, we had screened approximately 300 Bacteroides isolates for ermB by hybridization and, in a subset of cases, PCR. An ermB gene that was identical to that on CTnBST was found in six of these isolates (18). Further hybridization and sequence analysis showed that all six of these isolates carry the same 7-kb ermB region as that found in A. pyogenes (Table 4). However, only the CTnBST-carrying strain (WH207) had an IS between BST059 and BST061. The other five isolates do not contain this IS. Three of the six isolates (WH202, WH714, and DH3760) appear to have had the aadE gene deleted. The three strains that carried the aadE gene (WH207, Bov7991, and BF8371) appeared to be almost 100% identical on the DNA sequence level. A partial sequence analysis of the 16S rRNA genes of these strains revealed that these six isolates represent several Bacteroides species (Table 4).
|
View this table: [in a new window] |
TABLE 4. Overview of CTnBST and ermB strainsa
|
The remaining isolates (WH202, WH714, DH3760, and BF8371) did not contain any CTnBST sequences outside the ermB region, as detected by PCR and DNA-DNA hybridization. It could be that the ermB region is itself on a transmissible element and has integrated into the chromosomes of Bacteroides spp. several times, only once inserting into an active CTn3-Bf like element.
We also found two other Bacteroides isolates (WH301 and WAL9063) that appeared to contain a close homolog of CTnBST, with more homology to CTnBST than CTn3-Bf and without any trace of the ermB region, as both PCR and DNA-DNA hybridization failed to identify any sequences in this region. Again, this may imply that the ermB region has come from another, presumably gram-positive, bacterial species via several independent transfer events.
Assignment of possible functions to CTnBST genes.
The first ORF in CTnBST, as depicted in Fig. 3, encodes a tyrosine type site-specific recombinase, IntBST, that has been shown to be the CTnBST integrase (21). This ORF, when cloned into a suicide plasmid that also contained the joined ends of CTnBST, was sufficient for integration but not for wild-type levels of excision. Approximately 150 bp downstream of intBST was an ORF (BST002) encoding a hypothetical protein which our sequence analysis shows contains a possible frameshift mutation. The next ORF, 200 bp downstream of BST002, was a small ORF (395 bp) that encoded a basic protein. This gene might encode an excisionase protein for CTnBST. If so, it appeared to be in an operon with a downstream ORF (BST004) which overlapped the putative excisionase by several base pairs.
![]() View larger version (14K): [in a new window] |
FIG. 3. The region that includes the genes necessary for excision of CTnBST. The numbers above the arrows indicate the base pair positions within CTnBST sequence. The "+" and "" symbols correspond to whether or not complementation of the excisionless phenotype was achieved with the given construct. The results of this experiment lead us to conclude that the promoter for xisBST lies within and/or upstream of BST002.
|
Identification of the excision region.
The excision region of the CTn CTnDOT is quite complex, involving three excision genes that work together with the integrase (4). By contrast, the gram-positive CTn Tn916 has only a single excisionase gene (13). We wanted to identify the excisionase gene(s) of CTnBST to determine if its excision mechanism was more like that of CTnDOT or the much simpler xis of Tn916. The putative excisionase gene, tentatively designated xisBST, was disrupted by insertion of a suicide plasmid. A PCR assay was used to determine if the element was able to excise itself from the chromosome, and the result was that CTnBST could no longer excise itself from the chromosome (Fig. 4).
![]() View larger version (38K): [in a new window] |
FIG. 4. The joined ends of the excised circular form of CTnBST were PCR amplified from each of the listed strains carrying each of the listed plasmids shown in Fig. 3 and described in Table 1. BT4020, a strain that carries a wild-type copy of CTnBST, was used as a positive control. BT4001 does not contain CTnBST and was used as a negative control in this experiment. As a positive control for the amplification reaction the gene that encodes sigma 70 in Bacteroides was successfully amplified from each strain but is not shown here.
|
xisBST, and excision of CTnBST was detected using the PCR excision assay that detects the joined ends of the CTnBST circular form. We were able to show that pBSTxis1 complements the excisionase phenotype of strain BT4100
xisBST (Fig. 3 and 4). It is interesting that only one of several xisBST clones was able to complement the excisionless phenotype of strain BT4100
xisBST. We started with a construct, pBSTxis1, that contained the xisBST gene plus the upstream (BST002) and downstream (BST004) ORFs. This construct fully complemented strain BT4100
xisBST; however, when we tried to further localize the excisionase gene, cloning only xisBST plus a few hundred base pairs of upstream sequence, pBSTxis2, we lost complementation. Complementation was restored only when we included the entire upstream ORF (BST002) with construct pBSTxis4. This gene, BST002, carries a potential frameshift but is still present in BT4100
xisBST, implying that the promoter for xisBST may be upstream of the coding sequence of BST002. In addition, pBSTsxis1 and pBSTxis4 were also able to complement the excision-deficient phenotype of the CTnBST minielement, miniBST (Fig. 4). This minielement contains all of the DNA sequences that are needed for the integration of CTnBST into the chromosome, including the integrase gene intBST plus the joined ends of the excised circular form of CTnBST (21). Together these data imply that the excisionase gene, including the promoter sequence, is the only thing that is required for excision of CTnBST. This is in stark contrast to the complex excision mechanism of CTnDOT, which is tightly regulated and requires multiple proteins (4).
|
|
|---|
Initially, we had hoped to identify the origin of CTnBST by finding the entire CTnBST sequence in genome sequences from other bacteria. It is clear from our results that this hypothesis was too simplistic. Instead, CTnBST appears to be an amalgam of sequences having different origins, some of which may already have been in Bacteroides spp. as a cryptic CTn before acquiring sequences from gram-positive bacteria. It is also possible, however, that cryptic forms of CTnBST such as CTn3-Bf may themselves have had their origin outside of Bacteroides spp., because the % G+C content of this DNA differs from that of the BT4001 chromosome (46 to 50% versus 42 to 43%). In addition to portions of an element like CTn3-Bf, CTnBST seems also to have acquired several regions that could have come from a gram-positive bacterium. Moreover, even this region has itself undergone an IS insertion in one instance and a deletion in the aad gene in three others. Even the region that has sequence similarity to CTn3-Bf has also undergone some alterations. That is, it is not identical in sequence to CTn3-Bf but has experienced a number of changes, some of which are fairly large (Fig. 1A). Our results show how much variation can occur in transmissible elements as they move from one host to another.
The high sequence identity of the ermB region of CTnBST to a similar region in Arcanobacterium pyogenes OX7 strongly suggests a direct genetic conduit between gram-positive species and Bacteroides species. The fact that CTnBST carries an IS in one of the ORFs from this region, whereas similar ermB regions in other Bacteroides isolates do not, implies a direction of transfer. That is, the IS insertion may have occurred after the ermB region arrived in Bacteroides spp. An alternate explanation is that the IS was lost by precise excision in the other Bacteroides spp., but this possibility appears to be less likely than the acquisition of the IS sequence in Bacteroides spp. It is important to note that, even if the argument is accepted that the ermB region moved from a gram-positive bacterium to Bacteroides spp., an intermediate host cannot be ruled out. The fact that this region first appeared in Arcanobacterium pyogenes OX-7 before Bacteroides is also supported by the fact that several isolates of Bacteroides that carry the ermB region have a deletion of the aadE gene. Bacteroides species are naturally resistant to streptomycin, so the fact that this gene was deleted in several isolates is consistent with the idea that there was no selection to maintain the gene in Bacteroides spp.
Approximately 23 kbp of sequence upstream of the ermB region appear to be a mixture of Bacteroides-like sequences and gram-positive bacterium-like sequences. Part of this region is also found in CTn3-Bf, suggesting that the acquisition of some of the gram-positive bacterium-like sequences occurred in a common progenitor of the two CTns. Jost et al. (9) proposed that, in Arcanobacterium pyogenes, a mobile element carrying ermB had inserted into an ORF, designated orfY (homologous to BST059, BST061, and BST070 in CTnBST), causing a duplication of orfY. Subsequent to that event, an internal portion of the trsK gene inserted itself immediately upstream of ermB, forming a putative promoter to drive expression of ermB in Arcanobacterium pyogenes. The evidence they provide is that they have other isolates of Arcanobacterium pyogenes that do not contain ermB but carry an intact orfY and lack any trace of trsK. As noted above, the homology between the A. pyogenes element and CTnBST begins with an ORF immediately upstream of the first orfY. Taken together, these findings suggest that this element came together in Arcanobacterium pyogenes and subsequently, through an as yet unknown mechanism, transferred itself to Bacteroides spp., either directly or indirectly.
Identifying possible CTns in genome sequences has proven to be a challenge because of the very limited number of CTn sequences that are currently available. One way to locate a foreign integrated element has been to calculate the % G+C content, looking for regions that differ from the rest of the chromosome. This is the approach that has been used to identify possible pathogenicity islands (12). Two problems with this approach are illustrated by the CTnBST sequence. First, the % G+C content of most of CTnBST is higher than, but not radically different from, that of the chromosomes of sequenced Bacteroides strains. However, this difference may be significant enough to imply that this part of CTnBST is not native to Bacteroides species. Second, the abrupt drop in % G+C in the middle of CTnBST, which is clearly different in % G+C content, could indicate the presence of an integrated region of some other origin or it could indicate the presence of two CTns, one on either side of the low-% G+C region. It is not clear simply from the % G+C content whether these disparate segments transfer as a unit or as separate units. Either way the drastic change in G+C content (from 48% to 36% G+C) implies that this particular segment of CTnBST is of a different origin than the rest of CTnBST. It is interesting that gram-positive organisms such as Clostridium spp., Enterococcus spp., and Streptococcus spp. have a % G+C content between 28 and 39%. And, as stated above, most of the ORFs in this region show a high level of identity to genes from these organisms. This would imply that this 7-kb ermB region, while it may have entered Bacteroides species via A. pyogenes, did not originate in A. pyogenes, which is a high-G+C gram-positive organism (9). The fact that the same regions from both organisms are 100% identical provides further evidence that CTnBST has picked up an element(s) from a gram-positive organism during its evolution. It also seems likely that the region upstream of the 7-kb ermB region may itself have come in on another gram-positive element. The % G+C content in this area is similar to that of the rest of CTnBST, but many of the ORFs had a high level of identity to gram-positive organism genes. In addition, this area contains a mixture of Bacteroides homologs (a few from CTn3-Bf) and gram-positive organism homologs. Thus, this region may have been present in CTnBST long enough to reshuffle genes and/or to acquire a Bacteroides type codon usage.
A second approach for identifying CTns in a genome sequence is to look for genes known to be associated with known CTns, like transfer (tra) genes. Indeed, this was how CTn3-Bf was identified (10). However, it should be noted that, while the annotators of the YCH46 genome sequence identified CTn3-Bf as a CTn on the basis that some of the sequences encoded by this region had amino acid sequence similarity to known CTn transfer proteins, no evidence was presented to show that this is an active CTn.
CTnBST illustrates a theme that is beginning to emerge among the CTs: their chimeric structure. For example, the region carrying the ermB gene in CTnBST, which extends for at least 7 kb, appears to have come from A. pyogenes or some close relative. Something like this has been seen in CTnDOT, where a 13-kbp region carrying the ermF gene seems to have entered the larger CTn because a very closely related CTn, CTnERL, is virtually identical to CTnDOT except for the addition of the 13-kbp segment in CTnDOT (22). Apparently, the structures of CTns can be very fluid.
CTnBST is only the second type of CTn from the genus Bacteroides that has been sequenced. The other type of sequenced Bacteroides CTn is represented by CTnDOT and CTn341, two CTns with very similar sequences. Other sequenced CTns are Tn916, the enterococcal CTn, and the STX element of Vibrio cholerae, if indeed the STX element is a CTn. As seen from this report, some of the regions of genome sequences tentatively identified as CTns but not shown to be transmissible may prove to be active CTns. If so, the number of CTn sequences will multiply considerably because multiple possible cryptic CTns have been identified in the genome sequences of all three of the sequenced Bacteroides spp. and in the emerging genome sequences of Porphyromonas and Prevotella spp.
It is interesting that the genes on CTnBST that are most like those on CTnDOT are the transfer genes, especially a gene called traG. The fact that the CTnBST sequence is otherwise so different from the sequence of CTnDOT reinforces the use of tra genes to identify a possible CTn, at least in the Bacteroides group of genera. CTnBST turned out to be closely related to a tentatively identified CTn in B. fragilis YCH46, CTn3-Bf. There was no evidence that this putative B. fragilis CTn is transmissible, but the similarity between it and CTnBST, which does excise, suggests that it may be or may have been transmissible. As more CTn sequences become available, it will be easier to identify CTns more precisely in the emerging genome sequences.
This work was supported by a grant from the Ellison Foundation (IDSS-042703) and by a grant from the National Institutes of Health (AI/G M22383).
Published ahead of print on 4 May 2007. ![]()
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»