Previous Article | Next Article ![]()
Applied and Environmental Microbiology, December 2004, p. 6977-6983, Vol. 70, No. 12
0099-2240/04/$08.00+0 DOI: 10.1128/AEM.70.12.6977-6983.2004
Copyright © 2004, American Society for Microbiology. All Rights Reserved.
Department of Marine Sciences, University of Georgia, Athens, Georgia,1 Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, Connecticut2
Received 8 May 2004/ Accepted 20 July 2004
|
|
|---|
900-bp fragment of family 18, group I chitinase genes and used it to retrieve these gene fragments from environmental samples. Clone libraries of presumptive chitinase genes were created for nine water and six sediment samples from 10 aquatic environments including freshwater and saline lakes, estuarine water and sediments, and the central Arctic Ocean. Putative chitinase sequences were also retrieved from the Sargasso Sea metagenome sequence database. We were unable to obtain PCR product with these primers from an alkaline, hypersaline lake (Mono Lake, California). In total, 108 partial chitinase gene sequences were analyzed, with a minimum of 5 and a maximum of 13 chitinase sequences obtained from each library. All chitinase sequences were novel compared to previously identified sequences. Intralibrary sequence diversity was low, while we found significant differences between libraries from different water column samples and between water column and sediment samples. However, identical sequences were retrieved from samples collected at widely distributed locations that did not necessarily represent similar environments, suggesting homogeneity of chitinoclastic communities between some environments. |
|
|---|
Chitinases are classified as either family 18 or 19 glycosyl hydrolases based on amino acid sequence similarity (19). These two families are truly distinct; they share no similarity at the amino acid level and have different three-dimensional structures (10) and mechanisms of action (22). The vast majority of bacterial chitinases fall within family 18 and can be further organized into five different groups (I to V) based on conservation of amino acid residues within the catalytic domain (38). Group I chitinases are widely distributed among members of diverse proteobacterial lineages (7). Groups II to IV contain chitinases from more-narrowly restricted lineages. Group V is a collection of chitinases that do not fall into one of the other four groups (38).
Bacterial chitinase genes have been retrieved from diverse terrestrial environments, including alkaline soils (40), sandy soils (46), and pastures (25, 27). However, equivalent studies of chitinases in aquatic systems are relatively rare (8, 24, 33). Furthermore, no studies have compared chitinases across a broad range of distinct environments. Comparison of chitinase genes retrieved from similar, but geographically isolated, environments could yield insight into the biogeography of functional genes. In addition, comparisons of gene sequences retrieved from environments with distinct chemical and physical characteristics (water column versus sediments, estuaries, freshwater and saline lakes, temperate coastal waters, the Sargasso Sea, and the Arctic Ocean) may yield insights into how environmental conditions select for enzymes with novel properties.
In this study, we used a degenerate primer set to retrieve putative chitinase genes from 10 aquatic systems with distinct environmental characteristics. The results suggest that similar environments yield similar chitinase gene sequences. Furthermore, unique signature sequences were retrieved from one set of samples that may translate into fundamental differences in enzyme properties.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Location, summary characteristics, and references for further descriptions of the environments where samples used in this study were collected
|
Community DNA was extracted from sediment samples and purified using an Ultraclean Soil DNA Kit (MoBio Laboratories Inc., Solana Beach, Calif.) following the manufacturer's instructions. Extraction and purification of DNA from cartridge filters were essentially as described by Ferrari and Hollibaugh (13). Briefly, 40 µl of lysozyme (50 mg ml1) was added to each cartridge, and the cartridges were incubated for 60 min at 37°C. Fifty microliters of proteinase K (20 mg ml1) and 100 µl of a 20% (wt/vol) solution of sodium dodecyl sulfate were added to each cartridge, and the cartridges were incubated at 55°C for 2 h. DNA was purified from 800 µl of the lysate by sequential extraction with 800 µl of phenol-chloroform-isoamyl alcohol (25:24:1), chloroform-isoamyl alcohol (24:1), and finally n-butanol. The aqueous phase was removed, placed in a Centricon-100 concentrator (Amicon, Bedford, Mass.), mixed with 500 µl of TE buffer (10 mM Tris and 1 mM EDTA, pH 8.0), and centrifuged at 1,000 x g for 10 min. Next, 500 µl of TE buffer was added to the Centricon-100 concentrator, and the mixture was centrifuged for another 10 min. Successful extraction of high-molecular-weight DNA was verified for all samples by electrophoresis on 1% agarose gels.
Primer design.
The degenerate primer chiAfor.ext was based on conserved residues identified in chitinases from diverse proteobacteria (Fig. 1). Protein sequences were aligned using the PILEUP tool of the Wisconsin package, version 10.2 (Accelrys, San Diego, Calif.). chiAfor.ext was used in conjunction with chiA.rev, a primer developed by Cottrell et al. (9). This primer set successfully amplified the chitinase gene from Vibrio harveyi.
![]() View larger version (32K): [in a new window] |
FIG. 1. Design of degenerate primers for family 18, group I chitinase genes. Alignments of chitinase amino acid sequences from organisms representing diverse phylogenetic lineages were used to design the degenerate primers. Symbols represent bacterial taxonomic groups: , -proteobacteria; ß, ß-proteobacteria; , -proteobacteria; and +, gram-positive bacteria. GenBank accession numbers are provided in parentheses. Position designations relative to the Serratia marcescens chitinase sequence (P07254) are shown above the alignment. Conserved residues are shown in black, similar residues in grey. I, inosine base; Y, C or T; W, A or T; S, G or C; and R, A or G. The degeneracy for both primers in this study is 16-fold. The references for chiAfor.ext and chiA.rev are this study and reference 9, respectively.
|
Products of the appropriate size (
900 bp) were recovered from a 1.5% agarose gel using the QiaQuik gel extraction kit (QIAGEN, Valencia, Calif.) and cloned into the pCR 2.1 vector (Invitrogen Corp., Carlsbad, Calif.) following the manufacturer's protocols. Clone libraries were generated for all samples that yielded a PCR product of the expected size. Colonies were selected randomly, and then plasmids were isolated from Escherichia coli host cells with a Qiaprep Spin Miniprep kit (QIAGEN). Insert size was verified by digestion with EcoRI, and then inserts of the correct size were sequenced using an ABI PRISM 310 genetic analyzer and a BigDye terminator cycle sequencing kit (Applied Biosystems, Foster City, Calif.) using primers that recognized the cloning vector (M13 forward and reverse). Reads of approximately 550 bp of nucleotide sequence were obtained in each direction. Sequences were edited and assembled using the AssemblyLign program (Oxford Molecular, 1998). The forward and reverse reactions resulted in a complete sequence for the amplified region of the chitinase gene with
200 bp of overlap. Regions corresponding to the primer binding sites were removed from the sequences prior to analysis.
Phylogenetic analyses.
Sequences were analyzed using the Wisconsin Package v. 10.2 (Accelrys) and homology searches (BLASTX) were carried out at the network server of the National Center for Biotechnology Information. Phylogenetic trees were constructed with the PHYLIP package using evolutionary distances (Jukes-Cantor or Kimura) and the neighbor-joining method (12). A maximum-likelihood tree was also constructed using the phylogenetic analysis program PAUP (39) to verify the results from the Jukes-Cantor algorithm.
Database sequences.
Putative chitinase sequences were retrieved from the Sargasso Sea metagenome database (SSMD) (http://www.ncbi.nih.gov/BLAST/Genome/EnvirSamplesBlast.html) (43) by interrogation (BLASTX) using one sequence from each of the five clusters of our tree (refer to Fig. 2; WLS-07 [accession no. AY674163], TLS-08 [AY674150], BBW-04 [AY674077], AOW55-10 [AY674066], and SLW21-07 [AY674140]). Homology searches were then carried out against the entire GenBank database using each of the SSMD potential chitinase sequences. Criteria for inclusion in our phylogenetic analysis were as follows: (i) the sequenced portion of the gene had to contain the entire region of the gene analyzed in this study and (ii) the putative genes had to be capable of being aligned to our existing library of chitinases using the PILEUP tool of the Wisconsin package. Accession numbers of all potential chitinases from the SSMD have been recorded in a spreadsheet that can be accessed at the Mono Lake Microbial Observatory web site (http://www.monolake.uga.edu/research.htm; "Ancillary Data" section, "Sargasso_Sea_Chitinases.xls" file).
![]() View larger version (32K): [in a new window] |
FIG. 2. Neighbor-joining tree (partial sequence, 800 bp) showing phylogenetic relationships between family 18, group I chitinase nucleotide sequences. Clone designations are as follows: AOW55, Arctic Ocean, 55-m depth; AOW131, Arctic Ocean, 131-m depth; BBW, Bodega Bay water column; SIS, Sapelo Island sediments; SIW, Sapelo Island water column; SFBS, San Francisco Bay sediments; SFBW, San Francisco Bay water column; SJRW, San Joaquin River water column; SLW21, Soap Lake, 21-m depth; SLW23, Soap Lake, 23-m depth; TBS, Tomales Bay sediments; TLS, Topaz Lake sediments; and WLS, Walker Lake sediments. Water column samples for which no depth is given were collected at the surface (nominal depth, 0.1 m). Each sequence from a given library is also provided with a numerical designation. Branches containing identical sequences are indicated with a filled circle. The scale bar indicates Jukes-Cantor distance. Bootstrap values of >50% (for 100 iterations) are shown at branch nodes. The tree is unrooted with the chitinase gene from Bacillus circulans (AF154827) as the outgroup. GenBank accession numbers for reference sequences are provided in parentheses.
|
|
|
|---|
A total of 160 inserts was sequenced from 15 clone libraries with inserts from at least 10 randomly selected clones sequenced from each library. Homology searches suggested that the inserts in 52 of the clones were not chitinase genes (Table 1). Nontarget sequences were retrieved from all environments examined in this study (including 20 from Mono Lake). These typically lacked significant similarity to any database sequence and were not analyzed further. We checked a subset of our remaining sequences (13 total; all of the deeply branching, unique sequences in Fig. 2, for example SIS-10, SFBW-13, and SFBW-11) for possible chimera formation by using BLAST on 200 bp from each end of the sequence against the database to ensure that they returned the same top hits. None of the sequences we examined failed this test; however, some of the 52 discarded sequences may have been chimeras. All 108 putative chitinase genes retrieved were unique when compared to sequences presently in the GenBank database. At the nucleotide level, the sequences were between 57 to 94% identical to previously identified chitinase genes. At the amino acid level, the sequences were 44 to 98% identical and 52 to 98% similar to current (July 2004) GenBank entries.
Phylogenetic analysis (Jukes-Cantor) placed the chitinase nucleotide sequences into five major clusters, designated clusters A to E (Fig. 2). A maximum-likelihood tree of the nucleotide sequences (data not shown) was essentially identical to this tree. A phylogenetic tree (Kimura) was also constructed using deduced amino acid sequences (data not shown). The topologies of the nucleotide and amino acid trees were similar, with the composition of the clusters being the same for all trees. Cluster A contained sequences from the sediments collected at Sapelo Island, Georgia; San Francisco Bay, California; Tomales Bay, California; Topaz Lake, Nevada; and Walker Lake, Nevada. Cluster B contained sequences retrieved from sediments collected at Sapelo Island, Georgia; San Francisco Bay, California; Tomales Bay, California; and Topaz Lake, Nevada. In addition, sequences retrieved from the San Francisco Bay and San Joaquin River water samples formed a distinct subcluster within cluster B. Cluster C contained sequences retrieved from the Sapelo Island, San Joaquin River, and Bodega Bay water column samples. Cluster D consisted of sequences retrieved from Arctic Ocean water samples. These sequences segregated into subclusters that typically corresponded to sample depth. Cluster E contained sequences retrieved exclusively from the two Soap Lake, Washington, water column samples.
We identified 43 potential family 18, group I chitinase sequences (maximum E value of 8e-4) in the SSMD. The region possessing the signature motif, [DG]-G-[LIV]-[DG]-[IV]-[DH]-W-[EG], of the family 18, group I chitinase sequences (38), was present in 13 (30%) of these sequences. These putative chitinases appear to be diverse in origin, as the most similar sequences in GenBank were obtained from
-proteobacteria (23%), gram-positive bacteria (51%), Bacteroides (2%) bacteria, arthropods (9%), mammals (5%), fungi (5%), and Caenorhabditis elegans (5%). The majority of the SSMD putative chitinases either did not have any overlap with the region of the gene analyzed in this study (58%) or contained only a portion of the region (30%). The remaining five (12%) SSMD sequences contained the entire region of the chitinase gene delimited by the primers we used; however, only three of these sequences were similar enough to be included in the tree (Fig. 2). The three SSMD sequences included in the tree fell outside of the clusters (A to E) defined by sequences retrieved from our samples. Two SSMD sequences (EAJ50883 and EAH89100) clustered with a family 18, group I chitinase reference sequence from Shewanella baltica and were most closely related to our cluster A (Fig. 2). The third SSMD sequence (EAI65414) grouped with one Enterobacter and two Serratia chitinase sequences. Given the overall dominance of Shewanella-like sequences in the Sargasso Sea metagenome library (42), it is not surprising that we retrieved Shewanella-like chitinase sequences from it. We were surprised that we did not find sequences similar to those from our Arctic Ocean samples, since the 16S rRNA gene libraries from these samples contained sequences similar to those retrieved from Sargasso Sea samples (1).
Some of the chitinase sequences retrieved from different samples were identical (Fig. 2). For example, a sequence from the Sapelo Island library (SIS-01) was identical to three San Francisco Bay sequences (SFBS16-02, SFBS17-05, and SFBS29-01). Both of these samples are intertidal sediments from salt marshes dominated by Spartina alterniflora (Sapelo Island) or Salicornia virginica (San Francisco Bay). The estuaries have similar temperatures and salinity ranges, which would lead to the expectation that they harbor similar microflora, but they are geographically isolated. We are unaware of other reports of identical functional gene sequences having been retrieved from isolated environments; however, this may simply be due to the smaller database for functional genes, as closely related (16) or identical (1, 2) 16S rRNA genes have been retrieved from distant locations.
Interestingly, some sequences retrieved from sediments collected in freshwater Topaz Lake were identical to sequences retrieved from estuarine sediments of San Francisco Bay and from sediments of alkaline, saline Walker Lake (i.e., TLS-05, SFBS28-06, and WLS-02). Furthermore, another San Francisco Bay sequence (SFBS17-06) was identical to a clone from Topaz Lake (TLS-06) and from Walker Lake (WLS-08). This was a surprising finding, as these environments range in salinity from <1 (Topaz Lake) up to
30 ppt (San Francisco Bay) and in pH from
7 (Topaz Lake) to 9.8 (Walker Lake) (Table 1).
One factor that these sequences have in common is that they were all retrieved from sediment samples. This suggests that physicochemical properties common to sediments (surfaces, hypoxia and anoxia, elevated organic carbon concentrations, and likely elevated chitin concentrations since shed arthropod exoskeletons sink) override other environmental factors (temperature, salinity, and pH) in determining the distribution of functional gene sequences. Clearly there is a limit to this generalization because Mono Lake chitinases (water column, sediment, and isolates) were not amplified by the primer set used in this study, even though enzyme assays demonstrated chitinase activity (G. R. LeCleir, unpublished data). DNA extracted from Soap Lake sediment also failed to yield PCR product with our primer set. In contrast to chitinase sequences retrieved from sediment communities, sequences retrieved from water column samples collected at different locations segregated into separate clusters (Fig. 2). Furthermore, within cluster D, sequences retrieved from mixed-layer (55 m) and halocline (131 m) samples collected at the same station tended to fall into separate subclusters. The bacterial assemblages associated with these water masses have been characterized previously and were found to be distinct from one another (1, 2) and from those of temperate coastal water assemblages (1). Because the composition of Soap Lake water differs significantly from either seawater or freshwater, the bacterial assemblages from the lake might also be expected to be phylogenetically distinct. Biodiversity studies of other saline, alkaline lakes have verified that the composition of bacterial assemblages differs from those in other aquatic environments and also that the same suites of organisms are found in lakes from widely separated locations (11, 21, 36).
Alignment of family 18 glycosyl hydrolases shows that a number of residues essential for catalytic activity are conserved (29). The majority of chitinase sequences identified in this study (94%) contain a conserved motif encompassing the catalytic site, [DG]-G-[LIV]-[DG]-[IV]-[DH]-W-[EG], corresponding to positions 308 to 315 of the Serratia marcescens ChiA protein (29) (Fig. 3). Two additional residues, a tyrosine and an aspartate at positions 390 and 391, respectively, are also conserved in most of our sequences. However, seven of the sequences we obtained contained substitutions at one of these conserved positions. All of these substitutions result from single-base-pair changes: six A
G transitions and one G
C transversion. Both SLW23-03 and AOW131-04 contain a glycine instead of an aspartate at position 308. WLS-07 contains histidine rather than aspartate at position 313. Interestingly, this same substitution is found in narbonin, a protein found in plants with high similarity to chitinase but with no known enzymatic function (42). WLS-08, TLS-06, and SFBS17-6 have a glycine instead of glutamate at position 315. This glutamate residue has been shown to be the essential catalytic proton donor in structurally characterized bacterial chitinases (45). Finally, clone SFBS16-01 contains a cysteine rather than the completely conserved tyrosine at position 390. Collectively, these seven sequences may represent pseudogenes. Alternatively, they may correspond to genes that encode enzymes with unique properties, including different activities and mechanisms of action, or they may encode proteins with no known enzymatic function that share sequence similarity with chitinase (i.e., narbonin). They may also simply be the result of PCR (44), cloning (28, 35), or sequencing errors, although the sequence reads were unambiguous at these positions. In the absence of biochemical data for the expressed protein, it is difficult to evaluate the significance of these substitutions.
|
View larger version (15K): [in a new window] |
FIG. 3. Conserved residues including and surrounding the catalytic domain of proteobacterial chitinases. Residues are coded according to degree of conservation as follows: black, >75%; gray, 50 to 75%; and no color, <50%. Positions that are altered in chitinase sequences retrieved in this study are indicated by symbols. Stars indicate residues found in a limited number of sequences and that are described in the text. Residues found exclusively in sequences retrieved from Soap Lake samples are circled. Numbers and dots indicate residue positions relative to the Serratia marcescens gene chiA (P07254).
|
Chitinases from polar microorganisms appear to have adaptations required to function well in cold environments, as recently demonstrated for two chitinase alleles, ChiA (CAB62382) and ChiB (CAB62499), from an Arthrobacter strain isolated from Antarctic sediment (26). The increased heat lability of these chitinases is believed to be a consequence of structural changes that give the enzymes greater flexibility at lower temperatures, permitting conformational changes necessary for catalysis (14). Similar sequence modifications might be expected in genes from other cold-adapted microbes, regardless of their phylogenetic affiliation, leading to unique sequences for Arctic Ocean genes, as we have found (Fig. 2).
The form and source of chitin found in the environment may also select for specific genes in different environments. There are three major types of chitin, designated
, ß, and
(32). Each has unique physical attributes and chemical properties. Chitin can also vary by the degree of acetylation and the presence of cross-linked structural components (37). The composition of the chitin matrix and its associated molecules is typically organism dependent (15). Other molecules associated with the chitin matrix often select for specific enzymes and control rates of chitin hydrolysis (32, 37). Therefore, predominance of different structural variants of chitin in the environments we examined may dictate elaboration of what appear to be environmental-specific proteins that are in reality required for efficient hydrolysis of the predominant form of chitin.
This work was supported by the Mono Lake Microbial Observatory grant NSF MCB 9977886.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»