Previous Article | Next Article ![]()
Applied and Environmental Microbiology, June 2006, p. 4382-4387, Vol. 72, No. 6
0099-2240/06/$08.00+0 doi:10.1128/AEM.00380-06
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Scripps Institution of Oceanography, University of CaliforniaSan Diego, La Jolla, California,1 Department of Environmental and Biomolecular Systems, OGI School of Science and Engineering, Oregon Health and Science University, Beaverton, Oregon,2 Department of Pharmacognosy, Faculty of Pharmacy, Suez Canal University, Ismailia, Egypt,3 Department of Medicinal Chemistry, University of Utah, Salt Lake City, Utah4
Received 15 February 2006/ Accepted 6 April 2006
|
|
|---|
|
|
|---|
While most small peptides found in cyanobacteria are biosynthesized by nonribosomal peptide synthetases (17), we recently reported a microcin-like pathway for the biosynthesis of a family of cyclic peptides, the patellamides, from Prochloron didemni, a cyanobacterial symbiont of tropical ascidians (GenBank accession number AY986476) (23). The patellamides are moderately cytotoxic and are composed of a pseudosymmetrical, cyclic dimer, with each substructure having the sequence thiazole-nonpolar amino acid-oxazoline-nonpolar amino acid. Despite these unusual features, patellamide biosynthesis is ribosomal. The pat gene cluster consists of a precursor peptide gene, which codes for the patellamide amino acid sequence, and a number of genes with protease- and other peptide-modifying homologies (23). BLAST searches of GenBank with the pat genes revealed homologs in T. erythraeum IMS101. This led us to investigate the presence of a potential patellamide-like biosynthesis cluster as well as its product, a small cyclic peptide dubbed trichamide, in T. erythraeum.
|
|
|---|
Culturing.
T. erythraeum IMS101 (20) was obtained from John Waterbury, Woods Hole Oceanographic Institution, via Katherine Barbeau at the Scripps Institution of Oceanography (SIO). The culture is nonaxenic; i.e., it does contain other heterotrophic bacteria (K. Barbeau, personal communication). Cultures were grown in R medium (modified from John Waterbury's recipe) (25% double-distilled water and 75% natural seawater from the Scripps pier are mixed and amended with 8 µM KH2PO4, 2.5 µM EDTA, 0.1 µM ferric citrate, 0.1 µM MnCl2, 10 nM Na2MoO4, 10 nM ZnSO4, 0.1 nM CoCl2, 0.1 nM NiCl2, and 0.1 nM Na2SeO4) at 25°C under a 12-h light-dark photocycle with slow stirring as well as daily inversion of the culture flasks. All components were filter (0.2 µm) sterilized. T. erythraeum requires a 10% inoculum to start cultures; accordingly, 800 ml of culture was used in 8 liters of R medium. After 12 to 14 days, the culture was vacuum filtered through a 5-µm polycarbonate filter to retain T. erythraeum colonies and remove free-living bacteria. The cell material was rinsed off the filters into a 50-ml Falcon tube with 25% double-distilled water, immediately frozen at 80°C, and later lyophilized. The average yield was
10 mg dried cells per liter culture volume.
Extraction and purification.
Lyophilized cyanobacterial pellets were extracted three times with a
100-fold excess of methanol. The methanolic extract was dried, yielding a crude extract that was used for initial electron spray ionization (ESI) mass spectrometry (MS). For Fourier transform (FT) MS, the crude extract was purified with a C18 ZipTip (Millipore). A peak at m/z 1,099 corresponded to a new compound, trichamide.
A portion of the crude methanolic extract (23 mg) was further purified by partitioning between ethyl acetate and water. The aqueous part was fractionated over a HP20SS column with 25, 50, 75, and 100% acetone. As determined by ESI-MS, the 25 and 50% acetone (aqueous) fractions contained the peak at m/z 1,099 and were combined. This combined fraction was run on a high-performance liquid chromatography (HPLC) Phenomenex C18 analytical column with the following protocol (all solvents contained 0.01% trifluoroacetic acid): 5 min of water, a 5- to 35-min gradient from 0 to 100% acetonitrile, and 10 min of 100% acetonitrile. Fractions were collected at 1-min intervals. Only fractions eluting at 16 to 17 and 17 to 18 min contained a peak at m/z 1,099 as determined by ESI-MS. These fractions did not contain a single compound, since additional peaks beside the peak at m/z 1,099 were present in the MS analysis. The amount of material in the two HPLC fractions was too low to measure.
In an improved procedure, a methanolic extract (57 mg) was partially purified by a step gradient on a column containing 7 g C18 using solvents containing 0.01% trifluoroacetic acid. Fractions were eluted with water followed by 25%, 50%, and 100% acetonitrile (aqueous). The 100% elution fraction was further purified on a Phenomenex C18 column as described above. A single peak with the correct diode array profile cleanly eluted at 16.6 min. By ESI-MS analysis, this HPLC peak contained the m/z 1,099 ion. The concentration of trichamide was below a measurable limit and was thus estimated by comparison of the diode array absorbance at 240 nm with those for standards of ulithiacyclamide at various concentrations. This intensity should depend mainly upon the concentration of thiazole, since both ulithiacyclamide and trichamide have no other chromophores at this wavelength. By this method, the total amount of trichamide isolated was estimated to be 25 to 50 µg.
Mass spectrometry.
Crude extracts and partially purified fractions were monitored by ESI-MS and by FT-MS on a ThermoFinnigan LTQ-FT at a 100,000 resolution (i.e., mass of 400). FT tandem MS (MS/MS) experiments were run with collision-induced dissociation (CID) and infrared multiphoton dissociation (IRMPD) techniques. Predicted masses were calculated using the following values: C, 12; H, 1.007825; N, 14.003074; O, 15.994914; S, 31.97207.
|
|
|---|
The tri cluster contains 11 ORFs designated triA-K (Fig. 1 and Table 1). Four of these ORFs (triBCEF) are short and have sequence identity only to conserved hypothetical proteins, while triI is only hypothetical, with no significant sequence identities. Some of these ORFs may not be actively transcribed. This paper will focus on the remaining six genes, for which function may be inferred.
|
View larger version (5K): [in a new window] |
FIG. 1. The tri gene cluster. Arrows denote ORFs and their direction. Black ORFs are tRNA synthetases, white ORFs are conserved hypothetical genes without a homolog in the pat cluster, green ORFs are pat homologs, and the gene encoding the precursor peptide is orange.
|
|
View this table: [in a new window] |
TABLE 1. The tri cluster proteins and their homologs
|
|
View larger version (8K): [in a new window] |
FIG. 2. Alignment of the precursor peptides PatE and TriG. The sequences encoding patellamide C, patellamide A, and trichamide (top to bottom) are underlined, and the proposed cyclization signals are in boldface type.
|
TriD has high similarity to the N-terminal part of PatG and to oxidases. Previously, we predicted that this part of PatG would oxidize the intermediate thiazoline rings into thiazoles (23).
The BLASTP analysis of TriH and TriK gives homology to subtilisin-like proteases. They have high similarity to PatA and the C-terminal part of PatG (23). We predicted that these proteases would be involved in the maturation of PatE by cleaving the product from leader and trailer sequences and assumed the same function in trichamide biosynthesis. It is interesting that TriH and TriK have 48% identity to each other.
TriJ has 50% similarity and 72% identity to PatB. There is no other homolog of either of the two proteins in GenBank. PatB is not required for biosynthesis but seems to improve patellamide yield in heterologous expression experiments with the pat cluster (E. W. Schmidt, unpublished results). The high identity between TriJ and PatB over their entire length and their presence in both clusters suggest that they serve a role in peptide biosynthesis.
There are a few differences between the pat and tri clusters: PatG has two domains, one for oxidation and one for proteolytic cleavage. In T. erythraeum, these functionalities are separated into two proteins, TriD and TriH, respectively. The only pat gene without a homolog in the tri cluster (excluding very short putative ORFs) is patF, which has no significant homologies in GenBank. Overall, the pat and tri clusters have striking similarities. The biosynthetic genes have between 45 and 60% identity, and both gene clusters consist of a heterocyclization enzyme, an oxidase, two proteases, and patB/triJ, a gene of unknown functionality. Also, while there is variability in the length of the precursor protein, in terms of both the leader sequence and product sequence (8 amino acids for patellamide and 11 amino acids for trichamide), the 5-amino-acid cyclization signals before and after the peptide are highly conserved.
Based upon these similarities in biosynthesis genes, we predicted the presence of a patellamide-like compound, trichamide, a cyclic, thiazole-containing peptide, in T. erythraeum. Depending on the pattern of cyclization of the peptide and/or the heterocyclization of serine and cysteine moieties, the possible molecular weight of the compound would be between 1,079 and 1,157.
Mass spectrometry.
Initial screening of a crude extract of T. erythraeum with ESI-MS revealed the presence of a major peak at m/z 1,099. A molecule with this mass can be constructed from the precursor peptide sequence GDGLHPRLCSC by heterocyclization and oxidation of two of the three possible amino acids, cysteine, serine, and cysteine, to thiazoles or oxazoles and cyclization of the entire peptide. Alternatively, this mass is also consistent with heterocyclization of the remaining amino acid to a thiazoline or oxazoline moiety in a linear peptide.
A high-resolution experiment using a Fourier transform MS/MS system gave a molecular ion at (MH2)2+ 550.23166, only 0.022 ppm different from the theoretical value of (MH2)2+ 550.231648 for the predicted structure, validating the presence of a molecule containing the trichamide molecular formula C46H66N16O12S2 (see the supplemental material).
Further MS/MS fragmentation experiments with a mass of 550.2 using CID and IRMPD techniques revealed fragmentation patterns in congruence only with a cyclic peptide (Table 2; see the supplemental material). With the exception of ion A, all masses are within
3 ppm of their predicted values. This leaves three possible heterocyclization patterns that have identical masses: (i) thiazole-serine-thiazole, (ii) thiazole-oxazole-cysteine, and (iii) cysteine-oxazole-thiazole. The data are consistent with the first pattern on the basis of three arguments. First, heterocyclization of adjacent amino acids has no precedent in the literature on patellamide structure; in fact, when two cysteine residues are adjacent in the patellamide family as in the ulithiacyclamides (4), only one is cyclized. Second, the patellamide class of compounds does not contain oxazoles but contains only oxazolines. Third, it is highly unlikely that an enzyme would specifically modify one cysteine but not the other.
|
View this table: [in a new window] |
TABLE 2. Mass spectrometrya
|
![]() View larger version (16K): [in a new window] |
FIG. 3. (A) Structure of trichamide. Stereochemistry is inferred and not determined experimentally, as described in the text. (B) Assignment of CID-MS fragments from Table 2 to the trichamide structure. (C) Assignment of IRMPD-MS fragments.
|
![]() View larger version (16K): [in a new window] |
FIG. 4. Proposed biosynthetic pathway to trichamide.
|
Patellamide and trichamide biosynthesis may be examples of a more common pathway to small peptides. Besides the above-mentioned goadsporin from Streptomyces sp. strain TP-A0584, at this time, clustered ORFs with 35 to 40% identity to TriA and TriD are present in the genomes of phylogenetically distant bacteria: plut_0880 and plut_0878 in Pelodictyon luteolum, chlorobia (GenBank accession number CP000096), swolDRAFT_1502 and swolDRAFT_1501 in Syntrophomonas wolfei, clostridia (GenBank accession number NZ_AAJG01000002), and blr4538 and blr4539 in Bradyrhizobium japonicum (Rhizobiales) (11) (GenBank accession number BA000040).
Trichamide function.
Trichamide is hydrophilic, partitioning to the aqueous fraction relative to ethyl acetate. In addition, it is found only in the cells and is not excreted in significant quantities in the growth medium (data not shown). These properties suggest an antipredation defense function rather than anticompetitor or communication functions. To test biological activities,T. erythraeum crude methanolic extracts were tested for general cytotoxicity (HCT-116 at 10 µg/ml and CEM-TART at 5 and 50 µg/ml) and anti-human immunodeficiency virus (1 and 10 µg/ml), antifungal (Candida albicans at 10 µg/ml), or antimicrobial (Staphylococcus aureus and Enterococcus faecium at 10 µg/ml) effects. No significant activity was found in these assays (data not shown). A number of algal blooms have neurotoxic effects (2), and neurotoxicity of environmental Trichodesmium spp. in mice has previously been reported (9). The crude methanolic extract of T. erythraeum IMS101 also exhibited neurotoxicity in a mouse assay, but purified trichamide was not the active component (data not shown). Guo and Tester previously found that healthy Trichodesmium sp. cells do not affect the copepod Acartia tonsa, while aged or lysed Trichodesmium cells are toxic (8). This result is consistent with the properties of trichamide, which suggest that the compound is maintained inside healthy cells but would be released from lysed cells into seawater. Testing of the effect of trichamide on health and feeding behavior of copepods and other grazers of Trichodesmium might reveal the ecological function of trichamide. The presence of trichamide should be examined in Trichodesmium bloom waters. It would also be interesting to determine if T. thiebautii, the other major Trichodesmium species, which was proven to be more toxic than T. erythraeum in one report (10), contains trichamide (or a related compound) and the necessary biosynthetic capabilities.
Conclusion.
The ongoing exponential growth of DNA sequence data will lead to the discovery of many natural-product biosynthesis pathways for which no actual product has been characterized. A careful study of these pathways can lead to the discovery of novel products. Lautru et al. previously identified a gene cluster encoding nonribosomal peptide synthetase in the genome of a Streptomyces strain and discovered a novel compound, coelichelin (13). Nonribosomal peptide synthetase pathways are well-known routes to natural products, and the genus Streptomyces is a prolific source. The genus Trichodesmium was not previously known to produce natural products, and trichamide is only the second example, after the pat cluster, of a cyclic peptide biosynthesized in this way. Depending on the type of pathway, genomic mining should encompass careful curation. While the autoannotation of the T. erythraeum genome identified most of the tri genes as ORFs of unknown function, the most essential part of the cluster, the precursor peptide gene, was discovered manually.
This study is also an example of the power of general and timely access to genomic data. Even though the T. erythraeum IMS101 genome has not been completely sequenced or published at the time of this writing, the public availability of the draft sequence data allowed an assignment of function to the tri gene cluster and the discovery of a novel cyclic peptide.
We are grateful to Kelly Roe and Kathy Barbeau (SIO) for the Trichodesmium starter culture as well as helpful advice for maintaining it, to Chad Nelson (UU) for FT-MS analysis, and to Louis Barrows (UU), Baldomero Olivera (UU), William Gerwick (SIO), and William Fenical (SIO) for bioassays.
Supplemental material for this article may be found at http://aem.asm.org/. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»