Previous Article | Next Article ![]()
Applied and Environmental Microbiology, November 2004, p. 6865-6870, Vol. 70, No. 11
0099-2240/04/$08.00+0 DOI: 10.1128/AEM.70.11.6865-6870.2004
Copyright © 2004, American Society for Microbiology. All Rights Reserved.
and
Jon Clardy*
Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Boston, Massachusetts
Received 23 February 2004/ Accepted 4 July 2004
|
|
|---|
|
|
|---|
In our approach, which is designed to find small-molecule antibiotics, eDNA is used to prepare cosmid libraries in Escherichia coli, and antibacterially active eDNA clones are found by using a double-antibiotic selection screen that was developed to identify and recover antibacterially active clones directly from the original library selection plates. Two active clones, CSL12 and CSLC2, were initially characterized, and both were found to produce long-chain N-acyltyrosine antibiotics (3, 5). The long-chain N-acyl amino acid synthase (NAS) from CSLC2 is part of a biosynthetic gene cluster of 13 open reading frames (ORFs) that codes for the production of long-chain N-acyltyrosines as well as two additional families of long-chain acyl phenols that are related to N-acyltyrosines but were not found to be antibacterially active (3, 6).
The presence of multiple different N-acyltyrosine-producing clones in the initial two eDNA libraries that were screened suggested that NAS genes might occur at a high frequency in eDNA. However, the rapid identification of additional NAS sequences from eDNA by hybridization or PCR-based strategies was not feasible because the eDNA NASs showed very little overall sequence similarity. The use of an expression-based screening strategy was the only feasible approach to identify additional NASs in eDNA samples. Clones from seven independent soil samples were screened for the production of antibacterial activities, and cultures of these clones were assayed for the presence of long-chain N-acyltyrosines. Environmental DNA clones that produce long-chain N-acyltyrosine antibiotics were found in all seven eDNA libraries that were screened. The routine identification of long-chain N-acyltyrosine-producing clones in eDNA libraries constructed from multiple geographically distinct environmental samples suggests that these compounds may play an important, if undefined, role for soil microbes. To explore the genetic diversity underlying the production of these apparently common natural products, nine additional long-chain N-acyltyrosine-producing clones were characterized.
|
|
|---|
Transposon mutagenesis and sequence analysis.
The cosmids isolated from cultures of active clones were randomly transposon mutagenized with donor plasmid pGPS2.1 by using the genome priming system (NEB), or alternatively, an antibacterially active NotI subclone in pBC (Stratagene) was randomly transposon mutagenized with donor plasmid pGPS1.1 (NEB). E. coli transformed with the transposon-mutagenized cosmids was selected on Luria-Bertani (LB) agar plates containing kanamycin (30 µg/ml), ampicillin (50 µg/ml; optional), and chloramphenicol (20 µg/ml). The triply resistant transformants were then patched in duplicate onto LB agar plates containing both kanamycin and chloramphenicol and incubated at 30°C for 2 to 3 days. One set of the duplicate plates was overlayed with top agar containing Bacillus subtilis (BR151/pPL608) (22) to identify the mutagenized cosmid clones that no longer produced antibacterial activity. Cosmids were isolated from inactive colonies that were picked from the undisturbed plate, and the eDNA sequence surrounding the site of the transposon insertion was determined with primers S and N (NEB) that are specific for the genome priming system transposon. The sequence derived from transposon insertions that disrupted the production of antibacterial activity was used to construct the eDNA sequence responsible for the observed antibacterial activity. Primer walking was used to complete each sequence and confirm the sequence obtained from the transposon mutagenesis experiments. Sequencing was performed at the Cornell University BioResource Center. The cladogram of NAS sequences was constructed with MacVector (version 7.1; Oxford Molecular Ltd.) by using ClustalW (16).
GST fusion proteins.
The predicted NASs from Nitrosomonas europaea and Desulfovibrio vulgaris were amplified from genomic DNA (American Type Culture Collection) by using the FailSafe PCR system from Epicentre and primer pairs 5'-GCGCTGATCATGCAAATTGCGAATCATCTTCAATC-3' and 5'-GCGCGAATTCCGGTACGATGAAATTGGTTCCGA-3' (BclI and EcoRI) for N. europaea and 5'-GCGAGGGATCCATGCTCGCCATCGCCGACAAGCGCCGT-3' and 5'-GTCACGAAGCTTCTCGCCCTCACTCGGCAGGCAGCAGGA-3' (BamHI and HindIII) for D. vulgaris. Each PCR product was gel purified (QIAquick; QIAGEN), digested with the appropriate restriction endonucleases (as indicated), and ligated into gel-purified pGEX-3X (Pharmacia Biotech) or pET28a (Novagen) to give pNeGST and pDvulHIS, respectively. The genes for N-acyltyrosine synthase 2 (NasY2) and NasY9 were amplified from pCSL132 (5'CCGGATCCCCATGATGTCTCTACCTGCTTACCACT-3' and 5'-GGAATTCTGCATTTCTGGCGTTTGATCTGCTT-3' [BamHI and EcoRI]) and pCSLG10 (5'-CGCGGGATCcATGAAGCTTATACCAGGTTCT-3' and 5'-GCGAATTCCGAACAATATACCGGCTGACCG-3' [BamHI and EcoRI]), respectively. Each PCR product was gel purified (QIAquick; QIAGEN), digested with the appropriate restriction endonucleases (as indicated), and ligated into gel-purified pGEX-3X or pETMAL to give pNasY2GST and pNasY9MAL, respectively.
Isolation of N-acyl amino acids.
The original eDNA cosmid clones were grown in LB agar (30 µg of kanamycin/ml) for 3 days at room temperature or 30°C. E. coli BL21(DE3) transformed with pNeGST, pDvulHIS, pNasY2GST, or pNasY9MAL was grown at 37°C to an optical density at 600 nm of 0.7, at which point the temperature was reduced to 20°C and the cultures were induced with 10 µM isopropyl-ß-D-thiogalactopyranoside for an additional 18 h at 20°C. The mature cultures were neutralized with 2N HCl and extracted twice with an equal volume of ethyl acetate. The dried organic extracts were then concentrated in vacuo and analyzed directly by fast atom bombardment mass spectrometry (FABMS), or alternatively, the resulting extracts were either analyzed directly or partitioned by silica gel flash chromatography (step gradient CHCl3-methanol modified with 0.1% HOAc), and the active material that eluted from the column was analyzed by mass spectrometry. The FABMS analysis was performed by the University of Illinois (Urbana) Mass Spectrometry Facility.
Bioautography.
Crude ethyl acetate extracts and column fractions were chromatographed on silica gel thin-layer chromatography (TLC) plates (90:10 CHCl3-methanol), and then the dried TLC plate was overlayed onto an LB agar plate containing B. subtilis. After 1 to 2 h, the TLC plate was removed from the agar surface, and the agar plate containing the transferred extract was incubated at 30°C overnight. The pattern of growth inhibition in the bacterial lawn was then recorded the following day.
Nucleotide sequence accession numbers.
The eDNA NASs have been deposited in GenBank under to the following accession numbers: AF324335 (CSL12, NasY1) (5), AY214921 (CSL132, NasY2), AY214922 (CSL144, NasY3), AY214923 (CSL147, NasY1), AY128669 (CSLC2, FeeM) (3), AY214924 (CSLC3, NasY5), AY214925 (CSLD10, NasY6), AY214926 (CSLF42, NasY7), AY214927 (CSLF43, NasY8), AY653031 (CSLG7, NasY9), and AY653030 (CSLG10, NasY10).
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Plasmids and eDNA clones
|
![]() View larger version (24K): [in a new window] |
FIG. 1. Identity matrix (a) and cladogram (b) derived from ClustalW sequence alignments of the conceptual translation for each predicted NAS characterized from eDNA. The eDNA NASs cluster into at least two groups (group I and group II) and possibly a third (NasY3).
|
|
View this table: [in a new window] |
TABLE 2. Major molecular ions (m/z) observed in the ethyl acetate extract from E. coli cultures transformed with eDNA NAS and the predicted long-chain N-acyltyrosines that correspond to each of the observed masses
|
Sequence analysis.
No absolutely conserved residues were detected in a ClustalW (16) alignment using all of the eDNA-derived NAS sequences. The ClustalW-derived phylogenetic tree produced from this alignment suggests that the conceptually translated proteins from 9 out of the 10 unique eDNA NASs fall into two distinct groups, and the remaining NAS, NasY3, is not significantly related by sequence identity to either group (Fig. 1). Group I contains the original two NASs known to produce long-chain N-acyltyrosines, NasY1 from CSL12 and FeeM from CSLC2, as well as three new NASs, NasY2, NasY5, and NasY7. There are three highly conserved regions in this family of NASs: (G/A)(T/S)(I/L/V)(S/T)(I/L/V), (V/I)NP(R/K)H, and NAPA(V/I)(A/L) (Fig. 2). Although FeeM and NasY7 show only limited overall sequence identity to the other NASs in this large family, they contain the conserved residues seen in the other sequences (Fig. 1 and 2). The low overall sequence similarity observed between FeeM and the other sequences in this group is at least in part due to FeeM being between 58 and 78 residues shorter than each of the other sequences in the group (Fig. 2).
![]() View larger version (33K): [in a new window] |
FIG. 2. ClustalW-derived multiple-sequence alignment of the group I long-chain NASs. Identical residues appear in black boxes, and similar residues appear in open boxes. The three conserved regions in the group I NASs are underlined.
|
40%) and have a large number of absolutely conserved residues. The inclusion of NasY3 in the multiple-sequence alignment of group II results in only 11 absolutely conserved residues, and if NasY3 is included in the group I alignment, only five of the absolutely conserved residues shown in Fig. 2 are still conserved in the new alignment. NasY3 could be distantly related to either group I or group II and would therefore represent a link between groups I and II. It is also possible that NasY3 represents a third group of NASs that is distinct from both groups. Additional NasY sequences are needed to determine the precise relationships within this group of enzymes.
Homserine lactones and AISs.
NasY6 and NasY10 from group II show low-level sequence identity (15% identity) to a putative autoinducer synthase (AIS) (GenBank accession number ZP 00004916) from the cultured bacterium Rhodobacter sphaeroides (8, 12). AISs condense S-adenosylmethionine with a variety of acyl carrier protein (ACP)-linked fatty acids to produce medium- and long-chain N-acyl homoserine lactones that are used by gram-negative bacteria as quorum sensors (Fig. 3b) (13). Multiple-sequence alignments with AISs contain only a few highly conserved residues, and no obvious catalytic residues are conserved within this large family of sequences (12, 21). Many of the conserved residues are charged and thought to hold the S-adenosylmethionine in the proper orientation for a reaction with the phosphopantetheinyl-linked fatty acid of the ACP. The low overall sequence conservation within AISs suggests that properly orienting the S-adenosylmethionine with respect to the highly activated incoming ACP-linked fatty acid may be the sole requirement for catalysis by this family of enzymes. Although AISs show only limited sequence identity to N-acyl amino acid biosynthesis genes, both families of enzymes produce long-chain N-acyl amino acid derivatives, and this limited sequence similarity may indicate a shared biosynthetic strategy. The long-chain NASs may therefore catalyze the transfer of a fatty acid directly from the endogenous E. coli ACP-linked fatty acid onto an amino acid (Fig. 3a).
![]() View larger version (15K): [in a new window] |
FIG. 3. Hypothetical scheme for the biosynthesis of long-chain N-acyltyrosines by NASs expressed in E. coli. The proposed use of an ACP-linked fatty acid in the biosynthesis of long-chain N-acyltyrosines (a) is similar to the biosynthesis of acylhomoserine lactones by AISs for cultured bacteria (b).
|
NASs are sufficient to biosynthesize long-chain N-acyl amino acids in E. coli.
At least one representative enzyme from each of the two major groups of NASs (NasY2 and FeeM from group I and NasY9 from group II) was subcloned as a glutathione S-transferase (GST) fusion protein and assayed for the ability to produce N-acyl amino acids in E. coli. FABMS analysis of ethyl acetate extracts from cultures of E. coli transformed with each GST fusion construct confirmed the production of long-chain N-acyltyrosines by the NAS fusion proteins (Table 2). Although variations in the array of fatty acids conjugated onto the amino acid head group were observed in extracts from different fermentation conditions and from cultures containing the GST fusion proteins, long-chain N-acyltyrosines were consistently produced by cultures transformed with eDNA NASs.
N. europaea.
Although long-chain N-acyltyrosines have never been reported from the extensive screening of extracts from cultured organisms for antibacterial activity, three potential NASs were identified in BLAST searches using group I eDNA NAS sequences. NasY3 and group II NASs show no significant sequence identity (<20%) to sequences derived from cultured bacteria. The group I NASs are similar to a predicted protein of unknown function from N. europaea (GenBank accession number ZP 00003821) and to the N-terminal
250 amino acids of two predicted proteins of unknown function from Desulfovibrio vulgaris (989 amino acids) and Desulfovibrio desulfuricans (1,006 amino acids) (accession number ZP 00130011) that are 51% identical. The central region of the two large Desulfovibrio proteins is similar to the ThiF/MoeB (MeoW)/HesA (1, 15) family of nucleotide-binding proteins that are thought to activate carboxylic acids, while the C-terminal ends of these predicted proteins show no significant similarity to any deposited sequences of known function.
To assess the ability of these ORFs to confer the production of long-chain N-acyltyrosines to E. coli, the proposed NAS from D. vulgaris, a representative of the two Desulfovibrio ORFs, and the proposed NAS from N. europaea were subcloned into E. coli as fusion proteins. No clone-specific compounds were seen in ethyl acetate extracts from E. coli cultures containing a His6 fusion of the proposed NAS from D. vulgaris (pDvulHIS). Although the overexpression of this large ORF as a His6 fusion protein did not lead to the production of detectable levels of N-acyl amino acids under the conditions tested, this does not rule out the possibility that it could produce N-acyl amino acids in E. coli or D. vulgaris under alternative expression conditions. The proposed NAS domain in conjunction with the two additional domains in the large Desulfovibrio ORFs could also be used in the biosynthesis of an as-yet-undefined N-acyl amino acid-based natural product. Ethyl acetate extracts from cultures of E. coli containing the N. europaea GST (pNeGST) fusion protein contained long-chain N-acyltyrosines (m/z of 390, 392) as seen with the eDNA NASs. This N. europaea ORF is the first NAS from a cultured bacterium that has been functionally characterized. Understanding the role that long-chain N-acyltyrosines play in soil microbial communities should now be feasible with the identification of a cultured organism that has the genetic capacity to produce these compounds.
Conclusions.
Environmental DNA clones that produce long-chain N-acyltyrosine antibiotics were found in all seven eDNA libraries that were screened, five constructed from soil collected in New York, N.Y., one from soil collected in Boston, Mass., and one from sediment collected in Costa Rica. Of the 11 eDNA NASs that have been characterized, 10 are unique sequences and none have been previously described from cultured organisms. NASs derived from eDNA show low-level (<20%) sequence identity to AISs, the biosynthetic enzymes for homoserine lactones. Long-chain N-acyltyrosines may therefore be biosynthesized from the condensation of an ACP-linked fatty acid with the primary amine of an amino acid in a mechanism similar to that used in the biosynthesis of homoserine lactones. The identification of NAS-like sequences in three recently determined bacterial genomes led to the functional annotation of an ORF of previously unknown function from N. europaea as an NAS. Understanding the role that long-chain N-acyl amino acids play in soil microbial communities should now be feasible with the identification of a cultured organism that has the genetic capacity to produce these compounds.
In the search for bioactive natural products from easily cultured bacteria, even common microbial metabolites may have been overlooked because they are not produced by the small fraction of bacteria that grow easily in the laboratory. The heterologous expression of eDNA in easily cultured hosts should provide access to many of these previously inaccessible natural products.
Present address: Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA 02138. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»