Previous Article | Next Article ![]()
Applied and Environmental Microbiology, July 2005, p. 3832-3839, Vol. 71, No. 7
0099-2240/05/$08.00+0 doi:10.1128/AEM.71.7.3832-3839.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Department of Biology,1 Department of Entomology, University of California, Riverside, California 925212
Received 13 October 2004/ Accepted 4 February 2005
|
|
|---|
3%), these three clades probably originated more than 15,000 years ago, long before the introduction of the nonnative plants that characterize most infections. The sister clades of X. fastidiosa subsp. sandyi and X. fastidiosa subsp. piercei have synonymous-site evolutionary rates 2.9 times faster than X. fastidiosa subsp. multiplex, possibly due to generation time differences. Within X. fastidiosa subsp. multiplex, a low level (
0.1%) of genetic differentiation indicates the recent divergence of ALS isolates from the PP, PLS, and OAK isolates due to host plant adaptation and/or allopatry. The low level of variation within the X. fastidiosa subsp. piercei and X. fastidiosa subsp. sandyi clades, despite their antiquity, suggests strong selection, possibly driven by host plant adaptation. |
|
|---|
Many species exhibit a high degree of genetic diversity among strains that infect the same host, and this pattern is typical of the gamma subdivision proteobacteria, e.g., Escherichia coli (17, 27), Vibrio cholerae (5), Pseudomonas stutzeri (36), Xanthomonas axonopodis pv. manihotis (30), and Xanthomonas fragariae (31). Xylella fastidiosa, another gamma subdivision proteobacterium, offers an opportunity to study genetic variability both between and within plant-host strains (pathovars) of a single species that infects multiple hosts. Currently two complete genomes (41, 37) and two draft genomes (7) of strains from four different plant hosts have been sequenced.
X. fastidiosa infects the xylem of a range of plant species and is typically transmitted by xylem-feeding leaf hoppers (28). Xylem blockage by X. fastidiosa causes the plant scorch diseases of several economically important plants, such as grapevine, almond, and citrus (28). It also infects ornamentals, such as oleander (29). The known host range continues to expand, with over 100 host species identified (35) and with the most symptomatic hosts being plants nonnative to North or South America (23). Because of its potentially serious economic effect, X. fastidiosa has been the subject of intensive study, and several host plant strains have been identified, including three different pathovars causing Pierce's disease of grapevine (PD), almond leaf scorch (ALS), and oleander leaf scorch (OLS) in California (1, 4, 8-10, 12, 14, 20, 25, 26, 32).
Early DNA-DNA hybridization studies of several X. fastidiosa isolates indicated at least 70% similarity (24) which, combined with studies of nutritional fastidiousness and reciprocal transmission tests, placed the different host strains into groups (23). The greater than 70% similarity indicated that X. fastidiosa was a single species (11); however, no meaningful phylogenetic relationships could be defined. Randomly amplified polymorphic DNA analysis (RAPD) led to a more comprehensive phylogenetic classification of a larger variety of X. fastidiosa plant-host pathovars distributed across its range, such as grapevine (western United States), mulberry and oak (eastern United States), and coffee (South America) (1, 4, 10, 14, 20, 26). Similarity indices of the RAPDs indicated that bacteria isolated from the same plant-host species were more similar than bacteria from different hosts (20). The amount of diversity within these plant-host strains, however, was contradictory, with one study indicating limited diversity (12) and another indicating moderate amounts (20). Several single-gene trees and noncoding regions have also been analyzed, including 16S rRNA (25, 32), 16S-23S intergenic spacer sequences (20, 25, 35), and gyrB (32). The resulting trees had conflicting topologies, due to a lack of informative sites and phylogenetic signal.
A recent DNA-DNA hybridization study (35) again supported that the different pathovars of X. fastidiosa formed a single species. The North American strains were divided into two subspecies, X. fastidiosa subsp. piercei (PRC) and X. fastidiosa subsp. multiplex (MULT), each subspecies showing at least 84% internal similarity (35). In this study, our aim was to reexamine the relationship among the three major plant-host pathovars found in California, PD, ALS, and OLS, and examine the genetic and evolutionary relationships between these California groups and strains from the eastern United States isolated from oak, peach, and plum. For this purpose we used a high-resolution approach using DNA sequence data from genes distributed throughout the genome. We sequenced 10 chromosomal genes, including genes involved in such varied functions as aerobic respiration (nuoN and nuoL) and cell surface structures (pilU). We used these sequence data to establish a robust phylogeny, which is an important prerequisite for understanding how this bacterial species evolves and how genetic change has allowed it to adapt to new host plants across the United States. We used phylogeny as the basis for estimating the divergence times of the North American clades and to determine if these phylogenetic groupings showed geographical differentiation.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. X. fastidiosa isolates used in this study
|
Amplification and sequence determination of 10 genes.
The genes sequenced were chosen from among the genes that occur in all four genomes, ALS strain Dixon (NC_002723), OLS strain Ann-1 (NC_002722), PD strain Temecula (AE009442), and CVC strain 9a5c (AE003849). The 10 genes chosen are distributed throughout the CVC genome, represent a variety of biochemical functions, and have KA/KS values of <1, where KA and KS are, respectively, the rate of the nonsynonymous and synonymous base substitutions (Table 2).
|
View this table: [in a new window] |
TABLE 2. Position, name, function, PCR primers, and length sequenced for the 10 X. fastidiosa genes
|
Sequence analysis.
The sequences were aligned with BioEdit v5.0.9 (19) and ClustalX (40). A maximum likelihood (ML) tree of the 10-gene concatenated data set for 25 of the isolates was constructed with PAUP* v4 (39), using the following options: the general time-reversible model of molecular evolution, which includes six separate rates for each nucleotide substitution type, and a gamma distribution and an invariant sites parameter to account for evolutionary rate variation among sites. A second ML tree with only seven genes and all 26 isolates was similarly constructed to determine the position of the PLS isolate. Bootstrap (n = 200 replicates) and Bayesian analyses were only performed on the 10-gene tree. Bayesian analyses were performed with MrBayes 2.01 (22) with a general time-reversible model plus gamma distribution plus invariant sites model of molecular evolution, 1,000,000 generations with burn-in after 200,000, four simultaneous Markov chains (three heated, one cold), and random starting trees.
To estimate clonal diversity, the genetic differentiation between populations relative to within-population variation, or KST (21), was calculated using DNAsp v. 4.00 (33) and ProSeq (18). The isolates were divided to test if the variation was different in five comparisons: (i) northern California PRC (isolates PD4 and -6) versus southern California PRC (isolates PD1, -7, -10, and -14 and ALS5 and -11); (ii) northern California ALS isolates (ALS3, -15, and -22) versus southern California ALS isolates (ALS12 and -13); (iii) PP isolates (PP27 and -28) versus OAK isolates (OAK17, -23, and -24); (iv) ALS isolates (ALS3, -13, and -15) versus eastern United States isolates (OAK17, -23, and -24 and PP27 and -28); (v) PRC isolates (all PD isolates and ALS5 and -11) versus OLS isolates (all OLS); and (vi) PRC/OLS (all PD, ALS5 and -11, and all OLS) versus MULT isolates (all OAK, all PP, and ALS3, -12, -13, -15, and -22). In tests ii and iv, ALS5 and -11 were excluded since, unlike the other ALS isolates, they are subspecies of PRC. In tests iv and vi, ALS12 and ALS22 were excluded since they carry several recombinant genes (M. Scally et al., submitted for publication).
To determine the divergence times, the concatenated data set of PD1 (southern California), PD6 (northern California), PD16 (Florida), OLS8 (Texas), OLS9 (California), ALS3 (northern California), ALS13 (southern California), OAK17 (Georgia), OAK 23 (Florida), PP27 (Georgia), and PP28 (Georgia) were used with CVC18 as the outgroup. These isolates represented the available geographic range of each plant host and showed no evidence of recombination. The codeml application of PAML (44) with the Fcodon substitution model was used to estimate the number of synonymous and nonsynonymous substitutions along each branch (without imposing a clock). Using these estimates of synonymous substitution (K), plus an estimated mutation rate (µ) of 5.4 x 1010 changes per site per generation (16) and a division rate of 1,000 generations per year (G), the time since divergence can be calculated assuming selective neutrality at synonymous sites with the following equation:
![]() | (1) |
Nucleotide sequence accession numbers.
The gene sequences are available under GenBank accession numbers AY876648 to AY876897 and AY887139 to AY887145.
|
|
|---|
![]() View larger version (14K): [in a new window] |
FIG. 1. Maximum-likelihood phylogeny of 9,288 bp including 25 isolates across five plant host strains of Xylella fastidiosa using a general time-reversible model with a gamma distribution and invariant sites. Bootstrap values with 200 replicates for the major division are reported above each branch, and Bayesian posterior probabilities are reported below. The PLS isolate was not included in the full maximum-likelihood analysis since only 7 of the 10 genes were sequenced; however, its position in the phylogeny based on the maximum-likelihood analysis using those 7 genes is marked (*).
|
The only isolates failing to group within these major groupings were ALS12 and ALS22. However, examination of all single-gene trees (data not shown) showed that these isolates group within the MULT strain for 7 of 10 trees. Visual inspection and multilocus sequence typing analysis (Scally et al., submitted) of the three remaining genes (holC, cysG, and pilU) indicated that the alleles carried by ALS12 and ALS22 were the result of recombination with other pathovars. This same interpretation applies to the cysG gene in the PD14 isolate. No other signs of recombination were apparent among the sequences, and their single-gene trees showed the same broad patterns as the concatenated tree (Fig. 1).
Geographic variation within plant-host pathovars was not apparent (Table 3). Within the PD clade, the comparison of subpopulations of isolates from northern and central/southern California showed no indication of genetic differentiation (Table 3, test i). There was also no differentiation between northern and southern California subpopulations among the non-PRC ALS isolates (Table 3, test ii). Within the MULT clade subspecies, the ALS clade isolates (isolates 3, 13, and 15) were significantly different (P < 0.05) from the eastern U.S. isolates from peach and oak (Table 3, test iv); however, this effect confounds geography and plant host. Within the East Coast group, there was no significant plant-host differentiation found between the OAK and PP isolates (Table 3, test iii). Finally, this KST approach confirms the highly significant genetic differentiation of the three major clades (Table 3, tests v and vi).
|
View this table: [in a new window] |
TABLE 3. Geographic differentiation within and between strains
|
The average estimate for the divergence time of the MULT clade from the PRC/OLS clade is 30,000 years, with the PRC and OLS clades separating about 23,000 years ago (Fig. 2). The rate of evolution at synonymous sites in the OLS and PRC clades has been very similar, based on the two estimates of their divergence time (22,400 versus 24,000 years); however, the combined PRC/OLS clade has been evolving at almost three times the rate of the MULT clade based on the estimated divergence times of 43,700 and 16,900 years ago. The estimated split of the ALS (1,800 years), OAK (1,400 years), and PP (700 years) plant-host strains within the MULT clade averages 1,300 years.
![]() View larger version (12K): [in a new window] |
FIG. 2. Maximum-likelihood phylogeny of 12 isolates from six plant hosts of Xylella fastidiosa. The branch lengths are proportional to their estimated substitution rates (nonsynonymous plus synonymous). The divergence times were estimated from the synonymous component of these rates of evolution, so that time is only weakly correlated to branch length. The divergence times reported are the average values for that branch from Tukey's jackknife analysis. The standard errors are reported after the divergence time estimates for those branches that were directly estimated. The estimates shown at the nodes of the two major divisions are averages from the component branches. CVC18 was the South American outgroup.
|
|
View this table: [in a new window] |
TABLE 4. Nonsynonymous and synonymous DNA sequence differences between the five plant host strainsa
|
|
|
|---|
We identified three genetically divergent phylogenetic clades, PRC, MULT, and OLS, that differ by 2.6 to 3.3% at synonymous sites (Table 4). We estimate that these differences have taken more than 15,000 years to accumulate. The PRC and MULT clades correspond to the two named North American subspecies, X. fastidiosa subsp. piercei and X. fastidiosa subsp. multiplex, respectively (35). The OLS clade is most closely related to PRC (Fig. 1), but it has a high degree of synonymous-site differentiation from PRC (2.6%) and has very little internal variability (0.05%). In addition, the OLS and PRC strains cannot infect each other's major host plant, oleander and grapevine, respectively (29). For these reasons, placing the old and genetically distinct OLS clade within the X. fastidiosa subsp. piercii subspecies cannot be justified. We suggest that the OLS isolates should be recognized as a new subspecies with the name X. fastidiosa subsp. sandyi; the type specimen Ann-1 (Table 1) is deposited as ATCC 700598 (29), and the genome of this strain has been sequenced (7).
The relatively close relationship of PD and OLS pathovars was first suggested by Chen et al. (10), whose RAPD analysis placed them as genetically similar relative to plum and periwinkle pathovars. However, Hendson et al. (20), also using RAPDs, placed OLS as the outgroup to PD, ALS, PP, and OAK isolates and single-gene trees of 16S rRNA and gyrB (32) recovered different tree topologies for the PD, ALS, and OLS pathovars, but in both cases, ALS was closer to PD. With our larger data set and an appropriate outgroup (CVC), the PRC subspecies, which included all of the PD isolates, was shown to be most closely related to the sandyi clade (SAN), which includes all of the OLS isolates, a relationship with 100% bootstrap support and 1.00 posterior probability on the branch between the two clades. The MULT isolates formed a clade with 100% support outside the PRC/SAN clade.
The PD isolates from grapevines together with a subset of ALS isolates from almonds had previously been placed together based on RAPDs (20, 26), 23S-16S intergenic spacer sequence region (20, 35), and DNA-DNA hybridization (35). Our results agree with these previous findings and support this division into X. fastidiosa subsp. piercei. Initially, it was suggested that all ALS-causing isolates were identical to the PD-causing bacteria, but that these were different from the pathovars causing disease in oaks and peaches (23). PD and the Dixon-like ALS (i.e., like ALS3) have since been distinguished as separate groups based on RAPDs (20), but evidence defining the relationship between these ALS isolates and OAK, PP, and PLS was conflicting. RAPDs supports a division between ALS and PP/PLS (20), while DNA-DNA hybridization supports a single group (35). We confirmed the association between ALS and PP/PLS within the X. fastidiosa subsp. multiplex isolates and added OAK to this group.
Our finding that the major North American X. fastidiosa clades diverged about 16,000 years ago (Fig. 2) means that they predate the introduction of nonnative plants by European colonists. For example, X. fastidiosa subsp. sandyi has only been isolated from oleander, but oleander is a relatively recently introduced plant; yet, the SAN clade has a history of many thousands of years as separated from the sister taxon X. fastidiosa subsp. piercei (PRC clade). There is no evidence that X. fastidiosa occurs in Europe, apart from a single isolation from an infected grapevine in Croatia, which probably represented an invasion from the New World (6). Thus, it appears that the SAN subspecies must have evolved in North America within native plant hosts that have yet to be identified.
Our divergence time estimates raise two important issues. First, how reliable are such estimates. Two of the parameters used to estimate our divergence times, mutation rate and generation time, could be over- or underestimated. If mutation rate is overestimated, this could lead to our estimates being an order of magnitude or more too high. Underestimation of generation time would also lead to our divergence time being too large. The second issue raised is what factors might lead to the almost-threefold difference in the estimated divergence time of the MULT lineage and the PRC/SAN lineage.
Our calculation of divergence times using equation 1 required estimates of generation time and mutation rate for X. fastidiosa. We assumed a generation time of 9 h, i.e., 1,000 generations per year, and a mutation rate of 5.4 x 1010. Generation times for X. fastidiosa have been estimated at between 9 and 60 h (43), i.e., a range between about 1,000 and 150 generations per year. Our use of the top of this range provides a minimum estimate of divergence times. The mutation rate is based on an estimate from Escherichia coli (16), which has a genome that is twice as large as X. fastidiosa (37). The mutation rate, in general, is inversely proportional to genome size (16), suggesting that our estimate may be low, which would overestimate the divergence times. If the mutation rate of E. coli is extrapolated to X. fastidiosa on the basis of this proportionality, the estimated mutation rate is 1.0 x 109, which roughly halves the estimates shown in Fig. 2. However, the three major groups are still estimated to have been distinct for more than 12,000 years, placing the split between MULT and PRC/SAN at 16,000 years ago. Given that the long-term average generation time is very unlikely to be as short as 9 hours, these divergence time estimates are likely to be significantly less than the actual values.
The divergence times and percent differences also indicate that the PRC and SAN clades have had an overall evolutionary rate that is 2.9 times faster than MULT. The effect is long term, since the rate difference is seen both before and after the PRC/SAN split (Fig. 2). This is indicative of a historically shorter generation time in PRC and SAN bacteria than MULT. Alternatively, an increased effective population size in the ancestral MULT strain could have increased the efficiency of selection acting on codon usage, thus slowing the rate of synonymous substitutions (3); however, this effective size difference would have to have been maintained for many thousands of years to account for the observed pattern.
An important characteristic of the X. fastidiosa phylogeny (Fig. 1) is the long basal branches leading to very short terminal branches. Thus, the three major groups show highly significant genetic differentiation (Table 3, tests v and vi), but there is little variation within subspecies. MULT shows the most variability, but much of this variation is explained by host plant and/or geographical differentiation (test iv), so that across the three major clades the plant-host isolates show minimal genetic variation, with synonymous-site variation of less than 0.12%. This lack of variation, despite plenty of time for its accumulation, suggests that the strains experience strong selective pressures from their host plants, eliminating all but the best-adapted clones. The lack of geographical differentiation among the PD strains and among the OLS strains is consistent with this hypothesis. In both cases, strains were sampled from five widely separated sites, and yet the genetic differences among the pathovars was minimal. This sampling strategy is adequate to assess levels of genetic diversity, provided that the culturing technique does not itself select for specific genotypes. We are planning to sample some fresh isolates directly (i.e., without culturing) to eliminate this possibility. The adaptive hypothesis has been further supported by a direct test of plant-host adaptation using isolates that are symptomatic in almonds (2). ALS5 and -11, which are PRC bacteria isolated from almonds, exhibited a slower growth rate and lower survivorship compared to the MULT isolates ALS3 and -22 when tested in almonds.
An alternative explanation for the loss of variability within the plant host strains would be recurrent population bottlenecks. At each bottleneck, much of each clade's genetic variation would be lost. However, given the wide geographical range and the wide range of host plants of this species, this demographic pattern appears unlikely.
Within the MULT clade, there are three groupings. The ALS isolates from California form a separate clade from the eastern U.S. isolates, which form two (nonsignificant) groupings of the OAK isolates and the PP/PLS isolates. However, it is not known if these isolates of MULT from plum, peach, oak, and almond are host specific.
The isolates ALS12 and ALS22 cluster outside of the MULT clade, apparently due to recombination (Scally et al., submitted). It is notable that ALS22 has been found to have different nutritional requirements and differentiation of RAPDs from ALS3 (2, 20). Since ALS22 is found in northern California and ALS12 in southern, this may indicate that the recombinant clade of ALS-causing bacteria is becoming well established as a new strain.
We found little evidence of geographical substructure within the major clades. Within PRC, we found no evidence of geographic differentiation between northern and southern California (Table 4). These results contrast with previous work using RAPDs, where geographic variation was found between PD strains from northern California and central California (2, 20). Within SAN, the single non-Californian isolate (OLS8 from Texas) showed no sign of genetic differentiation. In the non-PRC ALS isolates, we again found that isolates from northern and southern California showed no signs of geographical differentiation (Table 4). There was, however, differentiation within MULT between the western ALS isolates and the eastern OAK and PP isolates, but identifying causation confounds geography and plant hosts. However, even this genetic differentiation is minor (0.14% at synonymous sites) (Table 4) compared to the strain differences of about 3%. ALS is not known to occur in the east and OAK and PP are not known to occur in California, suggesting the biogeography of these closely related forms may involve several intermediates across the United States.
The biogeographic history of the North American strains of X. fastidiosa remains poorly understood. The origins of the PRC, SAN, and MULT clade subspecies predate the human introduction of many of their host plants. Since there has been only one isolation of X. fastidiosa from the Old World (6), these subspecies must have differentiated in hosts native to the New World, hosts that may be largely asymptomatic and that are, at present, unknown.
We gratefully acknowledge Donald Cooksey, Rufina Hernandez-Martinez, Alexander Purcell, and Christina Wistrom for providing the strains used in this study.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»