Previous Article | Next Article ![]()
Applied and Environmental Microbiology, September 2005, p. 5494-5500, Vol. 71, No. 9
0099-2240/05/$08.00+0 doi:10.1128/AEM.71.9.5494-5500.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Richard T. Okinaka,1
Paul Keim,2 and
Cheryl R. Kuske1*
Biosciences Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545,1 Department of Biological Sciences, Northern Arizona University, Flagstaff, Arizona 860112
Received 7 January 2005/ Accepted 4 April 2005
|
|
|---|
|
|
|---|
An average of 124 cases of tularemia were recorded annually in the United States in the last decade (6), and hundreds of cases per year are reported from tularemia-endemic areas of Europe (3, 35). F. philomiragia infections affect mainly near-drowning victims and immunocompromised patients but nonetheless can cause severe disease (22). Comprising only these two validly published named species, the currently described diversity of the genus Francisella is rather limited (34). Several closely related endosymbiots of ticks, including the named species Wolbachia persica (18) as well as species identified only by analysis of DNA sequence data (29), also affiliate with the Francisellaceae based on analysis of 16S rRNA sequences. Recent genetic analyses, however, have suggested that considerable diversity within the genus remains to be discovered (23).
Francisella species are known to infect
150 species of vertebrate animals and may also be associated with protozoa in the environment (1). This broad host distribution has impeded understanding of Francisella ecology and epidemiology (27). Francisella strains are extremely difficult to culture from environmental sources (27), and few studies of its natural distribution and diversity have been undertaken (16), usually after human disease outbreaks have been reported (3, 13, 19). The organism can persist in water or mud for at least a year (5), supporting the possibility that environmental matrices may be important reservoirs for this pathogen. Recent and historical outbreaks indicate that environmental exposure to the organism is a significant source of morbidity (3, 14, 21).
PCR-based analyses have been developed and used to detect Francisella species in water (16) and tissue samples in a few cases (11, 17, 19, 30, 36), but such analyses have not yet been reported for soil or other environmental samples. To better understand the natural diversity and ecology of this pathogen and its closely related species, it is important to further explore its distribution and diversity in the environment. This information is also critical to understanding the natural background of the pathogen in environmental samples that may be collected for detection and attribution of biological threat agents.
We analyzed soil and water samples for the presence of Francisella-like DNA sequences to follow up on the detection of putative F. tularensis in Houston, Texas, by BioWatch aerosol monitors in October 2003 (4). We employed a nested approach, using PCR primer sets of increasing specificity to detect and identify Francisella sequences present in DNA extracts from the samples. PCR products were then cloned and sequenced to provide additional information on the diversity of species present.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Francisella reference isolates sequenced in this study
|
To screen for the presence of 16S rRNA gene sequences related to Francisella, each soil and water sample was amplified with primers Fr153F0.1 (5'-GCCCATTTGAGGGGGATACC-3') and Fr1281R0.1 (5'-GGACTAAGAGTACCTTTTTGAGT-3'), modified from the F11 and F5 primers of Forsman et al. (18) to increase sensitivity and specificity, at an annealing temperature of 60°C. These primers were designed to detect F. tularensis and F. philomiragia as well as the Francisella-like tick endosymbionts related to Wolbachia persica (29).
Soil samples that gave positive results with the Francisella 16S rRNA gene screen were tested further by amplification of extracted DNAs with a series of primers designed to be more specific for F. tularensis (ISFTu2F/R, targeting an insertion element-like sequence; 23kDaF/R, targeting the 23kDa gene, which is expressed upon macrophage infection; and the Tul4F/R and FopAF/R pairs, targeting genes encoding outer membrane proteins) (36). To differentiate among F. tularensis subgroups, primers SdhF (5'-AAGATATATCAACGAGCKTTT-3') and SdhR (5'-AAAGCAAGACCCATACCATC-3'), targeting a putative succinate dehydrogenase locus (sdhA), were designed and used in PCR and sequencing analyses to identify differences among 48 isolates representing F. tularensis subspecies and F. philomiragia. Primers SdhF/R were used at an annealing temperature of 56°C, and the ISFTu2F/R, 23kDaF/R, Tul4F/R and FopAF/R pairs at used at an annealing temperature of 60°C.
Positive control reactions using DNA from F. tularensis LVS (ATCC 29684) (1 and 0.1 pg), as well as negative-control reactions without DNA, were included in each experiment. F. tularensis DNA was added to control reactions only after all experimental reaction tubes were sealed to prevent false positives due to contamination. Five microliters of each reaction mixture was analyzed on a 1, 2, or 3% (depending on expected product size) agarose gel. DNA was visualized by ethidium bromide staining and UV transillumination. Putatively positive reactions were repeated to confirm results and obtain products for cloning. PCR amplification of extracts from Francisella laboratory isolates was performed as above, using the 27F/1492R primer pair for the 16S rRNA gene (24) and SdhF/R for the sdhA gene (Table 1).
Cloning and sequencing of PCR products.
PCR products from all of the Francisella primer sets were purified by electrophoresis on SeaKem agarose gels, the bands were excised and purified using a Qiaex DNA purification kit (QIAGEN, Inc., Chatsworth, CA), using the manufacturer's protocol. Products were cloned into the pCR4 vector, using the TOPO-TA cloning kit and the manufacturer's protocols (Invitrogen, San Diego, CA). For most soil and water PCR products, 48 to 96 clones were picked and stored in glycerol medium for sequencing. For 16S rRNA and sdhA gene sequences from Francisella reference isolates, two clones were picked and sequenced on both strands for each isolate and the sequences were compared, in an effort to reduce Taq polymerase-induced errors.
Plasmid DNA was isolated from overnight cultures using a solid-phase reverse immobilization procedure (10), and inserts were sequenced using the M13 forward primer or a PCR primer, with the BigDye terminator cycle sequencing reagents (v3.0, Applied Biosystems, Foster City, CA). Sequencing reactions were analyzed on ABI 3700 and 3730 automated sequencers (Applied Biosystems). Preliminary analysis of partial 16S rRNA gene sequences from environmental clone libraries was used to select representatives for full sequencing. Full-length, double-stranded sequence was obtained from these selected environmental and isolate clones using the M13F and M13R primers and additional primers internal to the 16S rRNA gene, 533Fb (5'-GCCAGCAGCNGCGGTAA-3'), 940Fb.Ft (5'-CGGGGACCCGCACAAGC-3'), 910Rb.Ft (5'-GTCCCCGTCAATTCCTTTGAG-3'), and 517Rb0.1 (5'-ATTACCGCIGCTGCTGGC-3') (modified from reference 24 for use with Francisella spp. sequences).
Analysis of sequence data.
Raw data were analyzed using Sequencher (Gene Codes, Inc.) software. The Check_Chimera program (8) was used to screen for chimeric 16S rRNA sequences, which were removed from subsequent analyses. Comparisons were made to database sequences using the RDP Sequence Match program (7) and NCBI BLAST 2.0 (01/05) (2). For phylogenetic analyses of 16S rRNA gene data, sequences were obtained from databases and aligned using Clustal X (33), with final alignment accomplished manually using the GDE multiple sequence editor (8). Sequence analyses were performed on approximately 1,125 aligned nucleotides from each rRNA gene sequence, corresponding to the length of clones obtained from the environmental PCRs.
Phylogenetic trees were inferred using maximum-likelihood analysis (fastDNAml version 1.1, distributed by RDP) (26). Additional distance (minimum evolution) and parsimony analyses (not shown) used to assess support for placement of new sequences were performed using PAUP* (v. 4.0b10 for Macintosh) (31). This program was also used to calculate 16S rRNA gene sequence similarities and infer trees from sdhA sequence data. For sequences from amplification reactions using the primers of Versage et al. (36), sequence comparisons and similarity calculations were performed using BLAST analyses (2).
Nucleotide sequence accession numbers.
Sequences representative of each new sequence type obtained in this study, as well as from the reference isolates (Table 1), have been deposited in GenBank under accession no. AY968223 to AY968239 (16S rRNA gene sequences from reference isolates); AY968283 to AY968305 (16S rRNA gene sequences from Houston soil clones); AY968240 to AY968282 (sdhA sequences from reference isolates); AY968306 to AY968310 (representative sdhA sequences from Houston soil clones); and AY973868 to AY973879 (representative sequences from Houston soil samples generated with the primers of Versage et al.).
|
|
|---|
A clone library was constructed from the PCR products of each sample, and a total of 311 good-quality 16S rRNA gene sequences were obtained and analyzed. All sequences obtained affiliated most closely with sequences from the Francisella genus by BLAST analysis, confirming the specificity of the primers. Representative clones for each sequence type obtained were sequenced in their entirety (approximately 1,170 bp), and cloned rRNA gene sequences of an additional 17 Francisella isolates were determined for reference.
Phylogenetic analyses revealed a surprising variety of sequence types present in the samples (Fig. 1). None of the sequences obtained from the soil samples were identical to any 16S rRNA sequences available in GenBank or in our reference collection, and the majority of the sequences from the samples did not group closely with previously reported sequences from any Francisella isolates. These sequences instead fell into three phylogenetically distinct clusters (Fig. 1; groups II, III, and V) containing only novel sequences from this study. These new clusters showed high sequence similarity (
99.1%) within each cluster, but they were only more distantly related to sequences from isolates (
98.5% sequence identity to closest outlying group). Phylogenetic analysis by several methods gave strong support (97 to 100% support in bootstrap analysis) to the coherence of these clusters and their separation from other, previously known sequence types.
![]() View larger version (29K): [in a new window] |
FIG. 1. Phylogenetic tree showing relationships of small-subunit rRNA gene sequences obtained from environmental samples to those of Francisella and related species. The tree was rooted using sequences of Escherichia coli, Thiothrix ramosa, Caedibacter taenospiralis, Piscirikettsia salmonis, and Thiomicrospira thyasirae (not shown). The percentage of 100 bootstrap resamplings that support each topological element in maximum-likelihood analysis is indicated, for values of >70%. Phylogenetic groups of sequences obtained in this study are labeled I to V. Within these groups, sequences representative of the types obtained from Houston soil clone libraries (in red) are labeled with the sample name (soil samples 005 to 045 and one water sample) and by distinct sequence types within each sample (a to d). Numbers in parentheses indicate the percentage of clones with identical sequences obtained from that sample. Sequences in blue text are reference sequences obtained in this study. Sequences in black were obtained from GenBank, with accession numbers shown in parentheses. The scale bar corresponds to 0.05 substitution per nucleotide position.
|
93.7% identity) to any other sequences in the database, but were more similar to those of Francisella spp. than to any non-Francisella sequences (87.8% identity to the sequence of the closest non-Francisella relative, Caedibacter tenospiralis). Based on this level of sequence identity, our analysis suggests that the organisms from which sequences in group V derived may constitute a new genus of Francisella-related species. Sequences were also obtained which affiliated closely with those of F. philomiragia strains isolated from water. Although the sequences from the one positive water sample from Houston were all identical to that of F. philomiragia ATCC 25015, none of the soil-derived sequences were an exact match to any F. philomiragia sequence in the database (Fig. 1, group IV). In addition, bootstrap support was high for grouping these sequences to the exclusion of other F. philomiragia sequences, and these may represent new strains or subspecies of F. philomiragia organisms in these samples.
The majority of sequences from sample 039 and a few from 034 differed by only 1 to 2 nucleotides from sequences of F. tularensis isolates (Fig. 1, group I). This level of nucleotide identity is similar to that reported for different subspecies of F. tularensis (18), and bootstrap analysis also supported inclusion of the soil sequences within the F. tularensis group. These data indicate that the environmental organisms may constitute new subspecies of F. tularensis.
sdhA sequence analysis.
To further investigate the genetic relationship between the soil organisms and known isolates, we used a PCR primer set capable of discriminating among the subspecies of F. tularensis. From available Francisella species genome sequence data, the sdhA gene, which encodes a putative succinate dehydrogenase, was found to differentiate strains at the subspecies level based on specific single-nucleotide polymorphism (SNP) signatures. The SdhA primer set was tested in PCR against DNAs from a diverse group of 48 F. tularensis isolates (including representatives of F. tularensis subspp. tularensis, holarctica, mediaasiatica, and novicida) and four isolates of F. philomiragia. Maximum-parsimony analysis of the sdhA sequences from the 48 known isolates revealed 43 SNPs common to all F. tularensis isolates that distinguish them from F. philomiragia strains (Fig. 2). In contrast, strains of F. tularensis clustered much more closely in this analysis and were generally well resolved into subspecies clusters by the sequence of their sdhA genes. Three of the subspecies of F. tularensis form closely related but distinct clades with single-nucleotide polymorphisms separating F. tularensis subsp. tularensis, F. tularensis subsp. holarctica, and F. tularensis subsp. mediaasiatica. F. tularensis subsp. novicida isolates form a more distant clade, with two and three SNP resolution between the F. tularensis subsp. novicida and F. tularensis subsp. novicida-like isolates and the other biovars. The separation of the individual subspecies of F. tularensis by only a limited number of SNP differences is consistent with multilocus variable-number tandem repeat studies indicating the clonal evolution of this species (12, 23).
![]() View larger version (27K): [in a new window] |
FIG. 2. Maximum-parsimony phylogenetic tree of sdhA sequences from Houston soil organisms and Francisella reference strains. Sequences representative of the types obtained from Houston soil clone libraries (in red) are labeled with the sample name (soil samples 015 to 045) and sequence type (0.1, 0.2, and 0.3). Numbers in parentheses indicate the percentage of clones with identical sequences obtained from that sample. Sequences in black text are reference sequences obtained in this study. F.t., F. tularensis. Numbers above branches indicate the number of single-nucleotide sequence polymorphisms that distinguish each group.
|
|
View this table: [in a new window] |
TABLE 2. Clone sequencing results for primer sets used in this study
|
Although a few clones in this library matched the sequence of F. tularensis tul4 or differed at one position, most clones in this library differed at seven positions from the database sequence. These are predicted to be silent mutations, indicating that these sequences derived from tul4 homologues in this sample. In addition, the FopAF/R primer set gave positive PCR results with sample 027, suggesting lower specificity, since in this sample only species group II was detected (Fig. 1). Cloned sequences from these products were all identical and differed by two bases from the F. tularensis sequence available in the database, both predicted to be silent mutations.
|
|
|---|
F. tularensis has been divided into several subspecies based on geographic distribution and disease potential. F. tularensis subsp. tularensis (also known as biovar A) is considered the most virulent type, and while human cases have been reported only in North America, the pathogen has also been found in Europe (20). F. tularensis subsp. holarctica (biovar B) causes most cases of tularemia in Europe and also occurs in North America and Japan. Although infection with this subspecies is rarely fatal, it is nonetheless highly infectious and causes significant morbidity in Europe. The other two described subspecies, F. tularensis subsp. mediasiatica and F. tularensis subsp. novicida, are most commonly isolated from areas of Central Asia and in North America and Australia, respectively, and are less commonly associated with human disease (36). Our results suggest the existence of additional F. tularensis subspecies in soil samples, the pathogenicity of which is unknown. Two clinically significant isolates (Fx1 and Fx2) recovered from immunocompromised patients in Galveston, Texas, and Liberty County, Texas, were characterized as F. tularensis subsp. novicida-like (Fig. 2) based on growth characteristics, extragenic palindromic sequences, and specific biochemical tests (7). Although these Texas isolates are distinct in sequence from those recovered from the Houston soil samples, they emphasize the existence of other unusual Francisella strains that can cause disease in humans.
Francisella species are able to enter a viable but nonculturable state (15) and are particularly refractive to cultivation from environmental samples (27). This has likely contributed to our limited knowledge of their diversity, distribution, and ecology. Using a DNA-based survey in place of cultivation, we have identified genes suggesting the presence of several new Francisella species. Our results support the observation made with cultured isolates that Francisella species are very diverse (12), but the natural reservoirs for this diverse group of species remain largely unknown (15). This is the first report of the recovery of Francisella DNA sequences from soil samples. In a parallel analysis of soil samples from Denver, Colo. (not shown), we have obtained additional sequences that cluster with the novel group II (Fig. 1), suggesting that some of these new groups may indeed be widespread in the environment. Continued DNA-based survey of soil and other environmental samples, coupled with continued culture attempts from these natural sources, will help define the distribution, ecology, transmission, and pathogenicity of the new groups of Francisella described here.
As a result of increased concern over terrorist use of agents such as F. tularensis, sensitive PCR-based monitoring systems have been developed and deployed to detect the presence of pathogens in environmental samples (25). In addition, high-resolution DNA-based strain typing systems are being developed to provide platforms for epidemiologic and bioforensic analyses (5, 12, 23, 32). The ability of these approaches to differentiate an agent introduced in a bioterrorist attack from naturally occurring strains requires an extensive understanding of the diversity and distribution of the organism and its related species that share genetic traits. This is especially true for the Francisella group. Francisella species inhabit a wide variety of ecological niches. Species sharing considerable genetic similarity to human and animal pathogens are free-living or are symbionts or pathogens of invertebrates such as insects and amoebae (19, 20) that may not be associated with human disease. Even within the F. tularensis group, isolates representing the different subspecies are very similar in genomic characteristics but display a wide range of pathogenicity characteristics.
Currently, little is known about the mechanisms of Francisella virulence (34), but comparison with nonpathogenic environmental isolates may shed light on this area. Further DNA-based studies of environmental samples using the methods described here should enhance our ability to identify and characterize new strains and species of Francisella. Isolation and characterization of environmental isolates will contribute significantly to the development of more specific and informative assays for pathogen detection and forensics, as well as for monitoring epidemiology and environmental sources of natural outbreaks of tularemia.
This work was supported by grants to C.R.K., P.K., and R.O. from the DOE Chemical & Biological Nonproliferation program and the DHS.
This is publication number LA-UR-04-8972. ![]()
Present address: Section of Microbiology, University of California, Davis, California. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»