Previous Article | Next Article ![]()
Applied and Environmental Microbiology, August 2003, p. 4837-4845, Vol. 69, No. 8
0099-2240/03/$08.00+0 DOI: 10.1128/AEM.69.8.4837-4845.2003
Copyright © 2003, American Society for Microbiology. All Rights Reserved.
Sébastien Monchy,1,3 M. Abderrafi Benotmane,3 Raphaël Leplae,1 Max Mergeay,3,4 and Dirk Springael4,5
SCMBB, Université Libre de Bruxelles, Brussels,1 Laboratory for Microbiology and Radiobiology, Belgian Nuclear Research Centre, SCK-CEN,3 Department of Environmental Technology, Flemish Institute for Technological Research (Vito), Mol,4 Laboratory for Soil and Water Management, Catholic University Leuven, Leuven, Belgium,5 Institute of Cell and Molecular Biology, Edinburgh University, Edinburgh, United Kingdom2
Received 7 February 2003/ Accepted 14 May 2003
|
|
|---|
|
|
|---|
Tn4371 transposition most likely involves a site-specific excision/integration process since the ends of the element can be detected covalently bound (20). In the CH34 chromosome and on the pMOL30 plasmid of that strain, transposition is targeted to a low number of sites, as it is the case on the RP4 plasmid where 2 sites were identified so far. The main target site in RP4 consists of a 5'-TTTTTCAT-3' sequence, which is also present between the covalently joined ends of the transposon (20).
We now report the complete nucleotide sequence of Tn4371, which was found to contain several plasmid-related genes, in addition to a complete type IV secretion gene cluster and an orthologue of the TraG motor protein responsible for DNA transfer during conjugation (see reference 9 for a short review). Comparison of the whole Tn4371 sequence with both complete bacterial genomic sequences and bacterial genomic sequences still being determined revealed the presence in Ralstonia solanacearum (27), R. metallidurans (http://www.jgi.doe.gov/JGI_microbial/html/ index.html), Azotobacter vinelandii (NZ_AAAD01RO000088), and Erwinia chrysanthemi (http://tigrblast.tigr.org/ufmg/index.cgi?database =e_chrysanthemi|seq) of chromosomal regions closely related in their sequence and organization to several segments of Tn4371, including the left and right regions flanking the bph genes, but in which the bph gene cluster was in all cases replaced by a different set of open reading frames (ORFs).
|
|
|---|
Sequencing strategy.
DNA for sequencing was isolated by Qiagen plasmid Midi kit (Qiagen). PstI fragments from characterized members of a Tn4371 library in pLAFR3 (19) were sequenced by primer walking from both ends after a cloning step in vector pBluescript II SK(+). Junctions between the fragments present on pECG319 and pECG316 and between fragments present on pECG316 and pECG328 were sequenced by primer walking on cosmid clones pECG212, pECG293, and pECG236, starting from primer sequences chosen on the basis of the sequences obtained for the pBluescript clones. Two remaining gaps between fragments present on pECG317 and pECG345 and between fragments present on pECG345 and pECG322 were sequenced after the corresponding region had been amplified by PCR by using the following primers: pECG317fw (5'-GCAATCAGATGTACCTCGATGC-3') and pECG345rv (5'-TGGTCAGCTTGAACTCGATCAG-3') for the pECG317-pECG345 gap and pEG345fw (5'-CTTGTCATCACCACAAGCCG-3') and pEG322rv (5'-AACTGGATGTAGACCTTCTGGCC-3') for the pECG345-pECG322 gap. Amplification was performed after a short denaturation cycle of 3 min at 95°C by using 35 cycles as follows: 95°C for 20 s, 58°C for 20 s, and 72°C for 30 s for the 0.3-kb fragment and 2 min for the 2-kb fragment, with a final elongation cycle at 72°C for 10 min. The PCR products were then cloned into the pDrive cloning vector from the Qiagen PCR cloning kit. The generated pDrive2kbTn4371 and pDrive0.3kbTn4371 were used for sequencing by primer walking. Sequencing reactions were performed with the BigDye terminator sequencing master mix (Applied Biosystems). Sequences available under accession numbers AJ012075, Y10831, X97984, X98271, and Y10832 were included where appropriate. Nucleotide sequencing was performed either on a Pharmacia ALF or an ABI 310 genetic analyzer from Applied Biosystems or by GenomExpress (Grenoble, France). The complete Tn4371 sequence was annotated by using the iAnt environment (27) and is available under accession no. AJ536756 and at http://graton.ulb.ac.be/Tn4371/. Preliminary sequence data for E. chrysanthemi was obtained from The Institute for Genomic Research (http://www.tigr.org) and from a collaborative annotation effort (8). Sequence data for R. metallidurans CH34 are available online (http://www.jgi.doe.gov/JGI_microbial/html/ralstonia /ralston_homepage.html/). Comparisons between Tn4371 ORF products and these unfinished sequences were performed by using local TBLASTN access (2).
|
|
|---|
![]() View larger version (31K): [in a new window] |
FIG. 1. Tn4371 Characteristics. (A) Conceptual restriction map based on the complete nucleotide sequence of Tn4371. (B) %GC calculated from a 500-bp window moved along the sequence by 10-nucleotide steps. (C) Predicted genetic organization of Tn4371. Gray arrows represent the bph gene cluster, black arrows indicate ORFs sharing homology with known genes, and white arrows are used for the other genes.
|
Comparison with the sequences in the databases further demonstrated, to the right of the previously sequenced left end of Tn4371 (defined as 5'-TTTTTCAT-3' and the integrase-tyrosine recombinase gene int) the presence of plasmid related genes (orfRO00013 and -14; repA, parAB, and traF; and the traG-trb gene cluster) organized in three conserved blocks (orfRO00013 to -18, orfRO00055 and -33 to -41, orfRO00054, and all of the ORFs to its right). The first set included two conceptual proteins similar to proteins of unknown functions encoded by genes located near the transfer origin of E. coli plasmid F (Q9WTE4 and Q9S4W2). The genes located between int and orfRO00013 could not be assigned with any function, except for an insertion sequence (IS) element transposase (orfRO0005) closely related to R. metallidurans IS1090 (GenBank accession no. AJ010060) and R. solanacearum ISRso7 of the IS256 family (see the IS database at http://www-is.biotoul.fr/). Alignments with the related IS suggested that this one was truncated. The second set of Tn4371 plasmid-related genes contained ORFs whose translated products were, respectively, related to (i) the RepA protein of Erwinia stewartii plasmid pSW500 (GenBank accession no. S65577), Pseudomonas aeruginosa plasmid pVS1 (GenBank accession no. BAA96327), and plasmid pEMT8 (GenBank accession no. CAC94910) isolated from a polluted environment; (ii) a ParA partition protein of the type Ib family (7) and its associated ParB protein, whose AUG start codon overlapped the ParA UGA stop codon by one base; and (iii) the conjugation protein TraF (one of the pilus assembly proteins) of IncP plasmids. The third and largest cluster of plasmid related genes mapped to the right of the bph gene cluster. Their translated products were very similar to the so called T4CP, i.e., type IV coupling proteins TrwB/TraG/VirD4 (4a) and to proteins of the mating-pair formation (mpf) apparatus, related to the type IV secretion system, from plasmids RP4, R388, and Ti. The mpf genes were named trbB-I according to their orthologues in RP4 despite their different organization [BC(D)EFG(H)IJ(K)L in RP4 and BCEJLFGI in Tn4371].
As shown in Fig. 1B and C, the successive platforms in GC content aligned with the different functional blocks, supporting the hypothesis that these originated from various sources and were brought together through successive rounds of recombination events, most likely via horizontal transfer.
Related gene clusters in other bacterial chromosomes.
Sequence comparison with the genome sequence of the phytopathogenic bacterium R. solanacearum revealed, on its chromosome, four blocks of genes with a significant level of similarity and the same organization as in Tn4371 (27) (see also Fig. 2). Comparison with available bacterial genome sequences, including some currently being determined, uncovered further related chromosomal segments. Some were found in R. metallidurans CH34, split over two or more as-yet-undetermined contigs (contigs 696, 668, 600, 620, 573, 185, and 373 [http://www.jgi.doe.gov/JGI_microbial/html/ralstonia/ralston _homepage.html]). One single segment in the E. chrysanthemi 3937 and one in the A. vinelandii chromosome (GenBank accession no. NZ_AAAD01000088, ORFs avin3078 to avin3126), contained an int-related gene at one end and the same three sets of conserved plasmid-related genes with the same relative organization (Fig. 2). The int gene was in all cases followed by nonconserved ORFs. The E. chrysanthemi segment missed the parAB related ORFs. It carried orthologues (Ech00008 and -9) of plasmid F genes coding for the poison antidote proteins CcdA and CcdB (18) at about the same position as those of orfRO00002 and orfRO00003 in Tn4371. In view of their size and organization, these orthologues could encode the products of another poison antidote system. Possible orthologues of RadC lay in more or less the same position in the R. solanacearum, E. chrysanthemi, A. vinelandii, and R. metallidurans contig C668, although in the latter case it was split by several int-related ORFs. In all of the DNA segments that carried a trb gene cluster, it was flanked by two additional conserved ORFs. The one on the left was an orthologue of bphR in Tn4371, despite the absence of the other bph genes in the other sequences.
![]() View larger version (42K): [in a new window] |
FIG. 2. Genetic organization and alignment of Tn4371 and related chromosomal islands. The GenBank accession numbers and references for the sequences used to build the figure are in the text. For R. metallidurans only unassembled contigs were available for comparison. The presence (or absence) of a type IV secretion gene cluster in the vicinity of contig 668 and 696 remains to be tested experimentally. Contigs 636, 620, 573, and 373 are represented in continuity, separated by spacers because PCR experiments have confirmed their linkage, although the gaps remain to be sequenced (Monchy and Mergeay, unpublished). Orthologues are represented in the same color. White arrows represent ORFs, which have no orthologue in the other islands. Space between genes does not reflect any discontinuity unless stated otherwise. In a few cases, one orthologue appeared split in two: traR in E. chrysanthemi (Ech0023 and Ech00024), radC in R. metallidurans contig 668, and trbI in A. vinelandii (Avin3123 and Avin3124). This might result from sequencing errors since these sequencing projects are in a preliminary phase. Sequence comparison between orthologous regions was performed by using NCBI BLAST (http://www.ncbi.nlm.nih.gov:80/BLAST/) or the sequence comparison tools at the Microbial Genome Database website (http: //mbgd.genome.ad.jp/).
|
Pairwise comparisons between Tn4371 gene products and their orthologues in all of the bacterial and plasmid sequences mentioned thus far showed that conservation was always stronger between Tn4371, R. metallidurans, R. solanacearum, A. vinelandii, and E. chrysanthemi than with the next-closest relative in the databases. Clearly, the levels of sequence similarity do not match the taxonomic proximities. In phylogenetic trees built from multiple alignments of these amino acid sequences, each family of orthologues showed very similar clustering of the set of genes from Tn4371, R. metallidurans CH34, R. solanacearum, E. chrysanthemi, and A. vinelandii. The M. loti pMLb-encoded proteins again appeared to be less related based on these criteria (see Fig. 4 for a few examples). This did not hold true for the E. chrysanthemi integrases, which were, respectively, ca. 12 and 15% identical to their orthologues.
![]() ![]() ![]() View larger version (106K): [in a new window] |
FIG. 4. Phylogenetic trees of Tn4371 RepA, TraF, TraG, TrbE, TraR, and their orthologues. The phylogenetic trees were built by using the PHYLIP package. Proteins similar to individual Tn4371 ORF products were identified by searching the NCBI protein database by using BLASTP. Multiple alignments in the PHYLIP format were performed by using CLUSTALW software (http://www.infobiogen.fr/services/analyseq/cgi-bin /clustalw_in.pl). The alignments were analyzed with Protdist (PHYLIP package), which uses protein sequences to compute a distance matrix. The distance for each pair of species estimates the total branch length between the two species and was used in the distance matrix programs NEIGHBOR with the Jones et al. (13) model of amino acid change. The Protdist output was then analyzed with the NEIGHBOR program (PHYLIP package), which implements the neighbor-joining method of Saitou and Nei (26) (26) and the UPGMA (unweighted pair-group method with arithmetic averages) method of clustering. NEIGHBOR constructs a tree by successive clustering of lineages, setting branch lengths as the lineages join. The resulting tree was displayed by using Treeview in a phylogram format. Avin, Rme, and Rso stand for A. vinelandii, R. metallidurans, and R. solanacearum, respectively. pMOL28 is one of the two endogenous plasmids of R. metallidurans CH34. Identity levels between Tn4371-encoded proteins and their orthologues from A. vinelandii, R. metallidurans, and R. solanacearum ranged from 39 to 49% identity for the less-conserved TrbJ up to 97% for the most-conserved protein, ParA. It was ca. 80% for most of the proteins. E. chrysanthemi proteins were less conserved. Identity levels were about 60% for the majority of the proteins and ranged from 35 to >80% for the less- and most-conserved ones, TrbJ and RepA, respectively. BphR (BAA07613) belongs to the Achromobacter georgiopolitanum KKS102 bph gene cluster, which is very closely related to that of Tn4371 (19, 22).
|
![]() View larger version (48K): [in a new window] |
FIG. 3. Alignment of the left ends of Tn4371 and related chromosomal islands. A stretch of 200 bp from the Tn4371 left end was compared to individual genome sequences of E. chrysanthemi 3937 and R. metallidurans CH34 (contigs 668 and 696 [C668 and C696], resepctively), A. vinelandii OP (Avin), and R. solanacearum GMI1000 (TnRso) by using BLASTN. All of the aligned regions are located just to the left of the int orthologues in the islands shown in Fig. 2. E. chrysanthemi 3937 and CH34 contig 636 did not contain any sequence significantly similar to the other islands' left ends. The coordinates indicated are those of the Tn4371 sequence.
|
|
|
|---|
Another puzzling observation is that no ORF encoding for a TraI relaxase and TraK orthologue could be identified on Tn4371. Thus far, these proteins, which organize and nick the DNA at oriT as the first step in conjugative transfer, were always found to be associated with the TraG-TypeIV secretion conjugation apparatus (for a review, see reference 36). Some plasmids use tyrosine recombinases to resolve dimers (1). This could also be the case for Tn4371, which, overall, would then rather appear as a plasmid. In CH34 at least, Tn4371 could not be detected as an autonomously replicating circular entity (20; unpublished results). All of the conclusions drawn thus far for Tn4371 also apply to the other DNA segments analyzed above. In R. solanacearum GMI1000 the island is on the chromosome (27). E. chrysanthemi 3937 and A. vinelandii strain OP (ATCC 13705), a derivative of which was used for the sequencing project, do not host any plasmid (15, 25), and the R. metallidurans CH34 contigs concerned do not belong to either the pMOL28 (J. Dunn, unpublished data) or the pMOL30 (M. Mergeay, unpublished data) plasmid endogenous to that strain. Several hypotheses appear to be reasonable to account for this apparent contradiction. All of the DNA islands discussed above could be plasmids that cannot assume their plasmid mode of life in their present host. However, the high conservation of such a large set of genes with the same organization would appear to be unlikely for a defective genetic element that did not recently undergo selective pressure for its maintenance functions. Alternatively, the plasmid maintenance functions could in these cases rather assume a function related to conjugative transfer, despite the absence of similarity with TraI and TraK. Their conceptual RepA proteins include several Y residues, but none of them is in a context that fits the TraI relaxase consensus sequence (36). In addition to their nicking activity, TraI relaxases have a helicase activity that is also essential for DNA transfer. This helicase activity could here depend on a second protein, for instance, the product encoded by orfRO00008 from Tn4371, which carries domains typical of DNA helicases (PRODOM domains PD460075, PD001658, and PD002094). RepA could conceivably here nick the DNA for transfer initiation and, in conjunction with the helicase, couple it to the TraG protein for DNA transfer through a conjugation pore consisting of Trb proteins. Clearly, the existence of a potential oriT and its location, as well as the elucidation of the mechanism of conjugative transfer of these DNA segments, requires further experimental analysis.
As mentioned earlier, the Tn4371 bphR gene and its orthologue in the very closely related bph gene cluster in Achromobacter georgiopolitanum KKS102 were shown not to be involved in the regulation of the bph operon (21, 22). The presence of orthologues of this gene next to the traG-trb region in all of the DNA segments analyzed here suggests that these orthologues actually regulate the expression of the transfer genes. On plasmids Ti and RK2/RP4, the conjugal transfer genes are regulated in completely different ways: by quorum sensing for Ti (23) and cooperative interactions between TrbA and KorB for RK2/RP4 (L. Bingle, M. Zatyka, S. E. Manzoor, and C. M. Thomas, unpublished data). This most likely correlates with the different life-styles of the bacteria carrying either of those two plasmids. Regulation of the TraG-Trb transfer machinery genes by a LysR regulator would thus represent a third mode of regulation of very similar gene clusters, illustrating, once more, the fact that regulatory genes and the genes they regulate do not necessarily evolve together (5).
A clear modular picture emerges from the comparisons shown in Fig. 2. Very similar traG-trb operons and their potential LysR family regulator are separated from the conserved segment, including the repA and traF genes, by a set of completely unrelated ORFs. Preliminary results (S. Monchy and M. Mergeay, unpublished data) indicate that R. metallidurans CH34 contigs C636, C620, C573, and C373 are contiguous on the chromosome and thus likely to form one island, where the set of nonconserved genes is replaced by an IS1071 insertion sequence. On the other islands that contain C696 and C668 more sequencing is needed in order to determine whether a stretch of 10 to 15 kb of unpredictable nature indeed separates the repA-traF from the traG-trb region.
Other places in the islands seem to be prone to shorter insertions. Two of these, which flank the conserved region that covers Tn4371 orf RO00013 and orfRO00014, include truncated IS elements and serine recombinase encoding genes. These might be the footprints of rearrangements that led to the present structure.
Despite the fact that the mobility of the sequences discussed here has not yet been demonstrated experimentally (except for that of Tn4371), these sequences reside in a wide range of species in the ß-proteobacteria (Ralstonia) and
-proteobacteria (A. vinelandii and E. chrysanthemi), including soil and plant pathogenic strains, suggesting that the conjugation transfer machinery might confer a broad host range, as for the similar machinery of the IncP plasmids.
Tn4371-related sequences have been identified in other biphenyl- and chlorobiphenyl-degrading isolates by using DNA-DNA hybridization with appropriate Tn4371-borne gene probes (31a). A group of ß-proteobacteria contained bph genes that are highly similar in organization to those of Tn4371 and KKS102. Three of these bacterial strains were able to transfer their bph genes to R. metallidurans in a mode similar to that of Tn4371; a 50-kb chromosomal fragment carrying the bph genes was, after insertion in RP4, transferred to CH34. The transferred DNA segment of two strains (1C3 and 4A4) showed an extended similarity with Tn4371, to the right of the bph catabolic genes and up to trbI. However, no homology was found with a Tn4371 int probe, which might be related to the fact that Int orthologues tend to be less conserved than Tra and Trb orthologues (except for TrbJ, see the legend to Fig. 4). Strains 1C3 and 4A4, although derived from the same geographical locations as strain A5, might carry Tn4371-related elements with a different int gene but the same bph gene cluster, again pointing toward a reassortment of building blocks.
|
|
|---|
This work was supported by the Belgian Fonds de la Recherche Fondamentale Collective, the Fonds de la Recherche Scientifique Médicale, a TOURNESOL exchange grant from the Belgian CGRI and the French CNRS, and a grant from the Convention between the Université Libre de Bruxelles and SCK/CEN. A.T. is Directeur de Recherche from the Fonds National de la Recherche Scientifique (Belgium).
Present address: LEMiR, UMR 163 CNRS-CEA, CEA Cadarache, 13108 St. Paul Lez Durance, France. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»