Applied and Environmental Microbiology, November 2004, p. 6353-6362, Vol. 70, No. 11
0099-2240/04/$08.00+0 DOI: 10.1128/AEM.70.11.6353-6362.2004
Copyright © 2004, American Society for Microbiology. All Rights Reserved.
School of Microbiology and Immunology, The University of New South Wales, Sydney, New South Wales, Australia
Received 14 January 2004/ Accepted 22 June 2004
|
|
|---|
|
|
|---|
The consumption of water containing nodularin-producing N. spumigena blooms has led to the death of domestic and native animals by massive liver hemorrhage in Australia, the Baltic Sea, and New Zealand (4, 5, 13, 29). In subacute doses, nodularin is thought to act as a liver tumor initiator and promoter (34). The hepatotoxicity and carcinogenicity of nodularin are associated with the inhibition of eukaryotic protein phosphatase catalytic subunit types 1 and 2A (18).
Nodularin is a cyclic pentapeptide consisting of 3-amino-9-methoxy-2,6,8-trimethyl-10-phenyl-4,6-decadienoic acid (Adda), D-glutamic acid (D-Glu), N-methyldehydrobutyrine (MeDhb), D-erythro-ß-methylaspartic acid (D-MeAsp), and L-arginine (L-Arg) (Fig. 1) (36). The possible pathway involved in its biosynthesis has been determined previously by labeled-precursor studies (27, 37). The structure of nodularin suggested that the mechanism of biosynthesis is via a nonribosomal mechanism (22). Nonribosomal peptides and polyketides are synthesized by large multienzyme complexes. These large modular proteins catalyze the activation, modification, and condensation of specific amino acid or small chain carboxylic acid substrates. Nonribosomal peptide synthetase (NRPS) and polyketide synthase (PKS) genes have been associated with the production of nodularin in strains of Nodularia (26).
![]() View larger version (13K): [in a new window] |
FIG. 1. Structure of the cyanobacterial hepatotoxins, nodularin and microcystin. General numbering of residues is indicated. Nodularin is a cyclic pentapeptide (molecular weight, 619). The L-Arg residue of nodularin may be replaced with a homoarginine residue (nodularin-Har) or a valine residue (motuporin). Microcystin is a cyclic heptapeptide with two variable amino acids, indicated by X and Y. The most common isoform is microcystin-LR, where X is L-Leu and Y is L-Arg (molecular weight, 994).
|
![]() View larger version (24K): [in a new window] |
FIG. 2. Structures of the gene clusters, nda and mcy, which encode polyketide synthase (hatched), nonribosomal peptide synthetase (black), and tailoring (white) enzymes involved in the biosynthesis of nodularin and microcystin, respectively. A) Structure of the gene cluster, nda, isolated from the nodularin-producing strain N. spumigena NSOR10 (this study). B) Structure of the gene cluster, mcy, isolated from two microcystin-producing strains of Microcystis (32, 33, 50). C) Structure of the mcy gene cluster isolated from the microcystin-producing strain P. agardhii CYA126/8 (10).
|
The aim of this study was to completely sequence and characterize the gene cluster required for nodularin synthesis and its flanking regions in the toxic strain N. spumigena NSOR10. The previously identified N. spumigena NRPS and PKS genes were used as a basis for this work (26). Following this, the proteins encoded within the putative nodularin synthetase (nda) gene cluster were characterized by comparative analysis with other biosynthetic enzymes, including microcystin synthetase. By using these data along with results of previous labeling studies by Moore et al. (27) and Rinehart et al. (37), a nodularin biosynthetic pathway was proposed. The second aim of this study was to compare and contrast the structures of the nda and mcy gene clusters in order to suggest a mechanism of acquisition and evolution via the deletion or insertion of two NRPS modules. This may have occurred through a natural combinatorial engineering event. A complete understanding of how this event has occurred in evolutionary history can provide invaluable information for the future combinatorial biosynthesis of NRPS and PKS systems. In addition, knowledge of the evolution of these biosynthetic pathways in cyanobacteria will have important implications in understanding the ecological roles of microcystin and nodularin, as well as their association with cyanobacterial bloom formation.
|
|
|---|
Amplification of nodularin-associated NRPS and PKS sequences was performed with NRPS-specific primers NPF (TAT TTT GTG GTG GAG AAG CAC TA) and NPR (GGA ACT ATC TGA TAA TTA GAC) and PKS-specific primers MNKF (GTT CYT CYT CAY TRG TRG CG) and MNKR (CCY AAG AAC AAC WAY TCC ACA) as described previously (26). Nodularia strains were screened for the presence of ORFs flanking the putative nda gene cluster by PCR with primers NPSF3 (CTT ATC GAG GAG GTC GTG AAG) and HLIPR (CAG AAA GTC AGT ATT AGG). Amplification was performed in a 30-µl reaction mixture containing 1x PCR buffer (Fischer Biotech, Perth, Australia), 2.5 mM MgCl2, 130 µM deoxynucleoside triphosphate mix (Fischer Biotech), and 0.5 U of Taq F1 DNA polymerase (Fischer Biotech). PCR was performed with 1 µl of template DNA at a concentration of approximately 100 ng · µl1. Specific forward and reverse oligonucleotide primers (synthesized by Sigma or GenSet Oligos, Lismore, Australia) were added to a final concentration of 0.3 µM. Thermal cycling was performed in a PCR Sprint temperature cycling system machine (Hybaid Limited, Middlesex, United Kingdom) or a GeneAmp PCR system 2400 thermocycler (Perkin-Elmer Corporation). The initial denaturation step at 94°C for 2 min was followed by 30 cycles of DNA denaturation at 94°C for 5 s, primer annealing for 10 s at the corresponding annealing temperature, and DNA strand extension at 72°C for the appropriate extension time and then by a final extension step at 72°C for 7 min.
Amplification of unknown fragments.
The amplification of the nda gene cluster was performed by using adaptor-mediated, inverse, and "hemidegenerate" PCR methods. Overlapping DNA fragments of up to 5 kb in length were generated by using these approaches, and both DNA strands were sequenced by using primers designed for every 400 to 500 bp.
The adaptor-mediated PCR method was modified from that described by Siebert et al. (43). Short adaptor DNA was ligated to digested genomic DNA, and a specific genomic outward-facing primer was then used with an adaptor primer to amplify a region of the genome. Twenty picomoles of T7 adaptor (Fig. 3) was added to each reaction mixture, containing 1 µg of genomic DNA, 10 U of blunt-ended restriction enzyme, and 5 U of T4 ligase (Promega) in 1x One Phor All buffer (Amersham/Pharmacia), and the one-step digestion-and-ligation reaction mixture was incubated at room temperature overnight.
|
View larger version (5K): [in a new window] |
FIG. 3. Sequence of the T7 adaptors, utilized for adaptor-mediated PCR. Positions of the primers used for PCR (T7Pr1) and sequencing (T7Pr2) are indicated.
|
A PCR mixture containing 1 to 2 µl of adaptor-ligated DNA, 10 pmol of adaptor primer (Fig. 3), and 10 pmol of genome-specific oligonucleotide primers was set up. PCR cycling was performed as described previously, with DNA strand extension at 72°C for 5 min. The primer annealing temperature was decreased by 1°C at each cycle, from 65 to 55°C, followed by primer annealing at 55°C for a further 25 cycles.
Flanking regions were also amplified by using the inverse PCR approach, requiring two specific outward-facing primers as described by Pang and Knecht (35). Genomic DNA was partially digested with between 0.5 and 2 U of the 4-bp cutter Sau3A I, and individual DNA fragments were self-religated at a concentration of 5 ng · µl1. The ligated DNA was used as a template in PCR with Nodularia-specific primers facing outwards from the known gene sequences.
When possible, a rational hemidegenerate PCR approach was used to take advantage of the modular structure of NRPS and PKS gene clusters. In this case a specific PCR primer was used with a degenerate PCR primer designed to PKS ketosynthase domains DKF (GTG CCG GTN CCR TGN GYY TC) and DKR (GCG ATG GAY CCN CAR CAR MG) or NRPS adenylation domains (A domains) MTF2 (GCN GGY GGY GCN TAY GTN CC) and MTR2 (CCN CGD ATY TTN ACY TG) (26, 30).
Sequencing, analysis, and alignments.
Automated sequencing was performed with the PRISM Big Dye cycle sequencing system and a model 373 sequencer (Applied Biosystems Inc., Foster City, Calif.). Sequence data were analyzed by using the Applied Biosystems Auto-Assembler computer program.
ORFs were identified and translated and homology was inferred by using programs accessed through BioManager at the Australian National Genome Information Service, Sydney, Australia, and through the Basic Local Alignment Search Tool at the National Center for Biotechnology Information (1).
Protein sequences were aligned by using the program Pileup from the Genetics Computer Group (GCG) (12), accessed through BioManager at the Australian National Genome Information Service, and the ClustalX multiple-sequence alignment tool (version 1.8) (48). Divergence between amino acid sequences was calculated by using a PAM-Dayhoff matrix (42). Gene sequences used in this study were aligned by using the programs Pileup from GCG and the Clustal W multiple-sequence alignment tool (49). Manual alignment of the output was also performed. Evolutionary relationships and genetic distances were determined by the alignment of 1,236 bp from within the 16S rRNA gene sequence and 1,623 bp from within the ABC transporter gene sequence. Genetic distances (D) between strains were calculated by using the following formula, described by Jukes and Cantor (20): D = 3/4[ln(1 4/3d)], where d is the level of sequence dissimilarity. Phylogenetic trees were constructed with the ClustalX program, by the neighbor-joining method of Saito and Nei (40). Statistical confidence values were determined by performing 1,000 bootstrap trials (11).
Nucleotide sequence accession number.
The nucleotide sequence of the nodularin biosynthetic gene cluster described in this paper has been submitted to GenBank under accession number AY210783.
|
|
|---|
Sequence analysis indicates that nda includes two large operons, upstream and downstream of a bidirectional promoter (Fig. 2). Downstream of the promoter, three NRPS modules are encoded within the two ORFs ndaA and ndaB. Transcribed in the opposite direction, upstream of the promoter, are three ORFs, ndaCDF, which encode NRPS and PKS modules, along with four other ORFs, ndaE and ndaGHI, which encode tailoring enzymes.
A 4.5-kb region downstream of the nda gene cluster was sequenced. Five ORFs were identified; the first three, ORF1 to -3, were transcribed in the same direction as ndaAB, while ORF4 and -5 were transcribed on the opposing strand. Upstream of ndaI, a 2.5-kb region was sequenced and characterized. The three ORFs, ORF6 to -8 were found to be transcribed in the opposite direction from ndaI.
Putative functions of enzymes encoded by the nda gene cluster.
The protein sequences predicted from analysis of the nda gene cluster were compared to those of homologous proteins in order to propose functions and catalytic activities (Table 1). The ORFs ndaCDEF encode enzymes that are responsible for the biosynthesis of Adda, based on protein homology to microcystin biosynthetic enzymes, while ndaFGHAB encode the enzymes responsible for peptide synthesis, cyclization, and transport.
|
View this table: [in a new window] |
TABLE 1. Deduced functions of encoded proteins in the nda gene cluster of N. spumigena NSOR10
|
|
View this table: [in a new window] |
TABLE 2. Analysis of the residues of the A domain-binding pockets that confer substrate specificity in nodularin synthetase and related A domains
|
The NdaF NRPS module contains the same catalytic domains as the NRPS module of McyE. The specificity-conferring recognition sequences of the A domains in NdaF and McyE from M. aeruginosa PCC7806 and P. agardhii CYA126/8 are all identical and are probably responsible for the activation of D-Glu. Epimerization of L-Glu may be catalyzed by NdaG, which is encoded by ndaG, directly upstream of ndaF. This 26,538-Da protein shows 77% identity to McyF, which may be a Glu racemase (32), although the structures of the active sites of NdaG and McyF appear to be distinct from those of other bacterial L-Glu racemases (14). More recent studies have indicated that McyF may also be an L-Asp epimerase (44).
NdaH is 71 and 44% identical to McyI and a putative D-3-PGDH identified in the hyperthermophilic archaeon Methanopyrus kandleri, respectively (46). In Escherichia coli, D-3-PGDH enzymes are responsible for the first step in the pathway for Ser biosynthesis, via the oxidation of 3-phosphoglycerate to phosphohydroxypyruvate (15). The role of the D-3-PGDH homologs in microcystin biosynthesis is unknown; however, it has been proposed that McyI catalyzes the formation of MeDha from activated MeSer or the biosynthesis of Ser (50). NdaH is likely to catalyze the conversion of MeThr to MeDhb, requiring NADH as a cofactor. Clear differences in the specificities of McyI for MeSer and NdaH for MeThr could not be detected through their alignment; however, a region C terminal of the putative NADH binding site was distinct between the two proteins, indicating that this may be a region of substrate specificity (data not shown).
Located upstream of ndaH and transcribed in the same direction, ndaI encodes a 69,010-Da protein which is 73% identical to McyH, a putative ABC transporter, and 69% identical to NosG of nostopeptolide synthetase from Nostoc sp. strain GSV224, also a putative ABC transporter associated with a nonribosomal biosynthetic gene cluster (17).
The first ORF downstream of the promoter region ndaA encodes two NRPS modules. The first NRPS module (NdaA-M1) is 50% identical to the first NRPS module of McyA. The specificity-conferring recognition sequence of the first A domain of NdaA (NdaA-A1) was identical to the Thr-specific consensus sequence (Table 2) (47). The NdaA-A1 domain is likely to activate Thr as the precursor of MeDhb. The second NRPS module is 67% identical to the second NRPS module of McyB of M. aeruginosa PCC7806. The specificity-conferring region of the second A domain of NdaA (NdaA-A2) is most similar to the D-MeAsp activation domain of McyB-A2, although it shows little homology to other A domain active sites. Downstream of ndaA, ndaB encodes a protein with 71% similarity to the NRPS module McyC, which catalyzes the final peptide extension reaction, cyclization, and release of the peptide. Analysis of the recognition sequence of NdaB revealed some similarity to the McyC A domain of P. agardhii; however, there was little homology to any other A domains. The structure of nodularin suggests that this A domain activates L-Arg. A recently characterized isoform of nodularin contains a Har residue at position 5 of nodularin and is produced by N. harveyana PCC7804 (2, 39). Comparison of the specificity-conferring regions of these strains identified that the sequence reflected the change in specificity, with the Asn of the NSOR10 active site replaced with a Ser residue. The longer Har structure more appropriately fits within the structure of the binding pocket due to the replacement of Asn with Ser.
ORFs flanking nodularin synthetase (nda) genes.
The ORFs flanking the nda gene cluster were sequenced and analyzed. Downstream of ndaB, ORF1 encodes a small, 19,729-Da protein with similarity to a group of transposases. The ORF1 protein lacks 160 amino acids of the N-terminal region that are present in other homologous transposases, and it is unknown whether the ORF1 transposase would be functional in this form. Interestingly, the mcy gene cluster also has an ORF, uma4, downstream of the NRPS gene mcyC in M. aeruginosa PCC7806 and downstream of the OM gene mcyJ in P. agardhii CYA126/8, both of which have homology to transposases. While they are associated with similar gene clusters, the transposases identified in Microcystis, Planktothrix, and Nodularia are members of three distinct families of transposases (10, 50).
ORF2 encodes a 63-amino-acid protein which is homologous to a family of high-light-inducible chlorophyll-binding proteins, Hli, identified in cyanobacteria. ORF3 to -5 encode a ß-carotene ketolase homolog, a hypothetical protein, and a heat shock transcriptional repressor protein homolog, respectively. Upstream of ndaI, ORF6 encodes a protein with similarity to 3-oxoacyl-[acyl-carrier-protein] synthase proteins, which are involved in bacterial fatty acid biosynthesis. ORF7 and ORF8 encode two hypothetical proteins.
In order to determine whether ORF1 and ORF2 were associated with nodularin production, the toxic N. spumigena strains BY1, NSGL02A10, HEM, and L575 and the toxic N. harveyana strain PCC7804 were screened by specific PCR for the distribution of the ORFs downstream of ndaAB. This specific PCR amplified the expected 1.2-kb fragment from ndaB to ORF2 in the toxigenic N. spumigena strains NSOR10, BY1, NSGL02A10, HEM, and L575 (Fig. 4). ORF1 and ORF2 were not identified in nontoxic Nodularia strains (data not shown). An additional 1-kb band was amplified from the N. spumigena strains HEM and L575. N. harveyana strain PCC7804 did not produce the expected 1.2-kb band; however, it did yield a smaller, 1-kb fragment. Sequencing of this region found that ORF1 was absent from this strain.
![]() View larger version (67K): [in a new window] |
FIG. 4. Amplification of the open reading frames downstream of the nodularin synthetase (nda) gene cluster from toxic Nodularia strains. PCR with primers NPSF3 and HLIPR generated 1.2- and/or 1.0-kb fragments. Lanes 1 to 6 correspond to PCR-amplified genes from the toxic strains N. spumigena NSOR10 (lane 1), N. spumigena BY1 (lane 2), N. spumigena NSGL02A10 (lane 3), N. spumigena HEM (lane 4), N. spumigena L575 (lane 5), and N. harveyana PCC7804 (lane 6). , no-DNA control. Lane M contains 150 ng of 1-kb DNA ladder marker (Invitrogen).
|
![]() View larger version (20K): [in a new window] |
FIG. 5. Characterization of the NdaA condensation domain. A) Phylogenetic analysis of the amino acid sequences of the condensation (C) domains from nodularin synthetase of N. spumigena NSOR10, microcystin synthetases of M. aeruginosa PCC7806 and P. agardhii CYA126/8, and gramicidin S synthetase (GrsA) of Bacillus brevis ATCC 9999. The sequence alignment and phylogenetic tree were created by using Pileup and ClustalX from GCG (40, 48). The scale bar represents one substitution per 10 amino acids. Significant bootstrap values calculated from 1,000 bootstrap trees are indicated at the nodes (11). B) Diagram of the proposed deletion of the second and first NRPS modules of McyA and McyB, respectively. A, adenylation; C, condensation; E, epimerization; M, N-methyltransferase. PCP domains are shown in black; the NdaA condensation domain linker region is shown in white.
|
Phylogenetic comparisons between nodularin and microcystin synthetases.
In order to further infer genetic relationships between the mcy and nda gene clusters, phylogenetic analysis of the genes that encode the putative ABC transporters, mcyH and ndaI, was performed, with the nosG ABC transporter from Nostoc sp. strain GSV224 used as an outgroup. The ABC transporter gene was chosen to analyze the relationship between these clusters because it is associated with each of the nda and mcy clusters characterized to date. The results of the phylogenetic analysis indicated that the mcyH genes of M. aeruginosa and P. agardhii are more similar to each other than to the ndaI sequence of N. spumigena (Fig. 6). These species represent distinct orders within the cyanobacteria. The evolutionary distances between these genera were also determined by 16S ribosomal DNA (rDNA) nucleotide sequence analysis (Fig. 6). Phylogenetic analysis of the 16S rRNA gene indicated that the evolutionary distances between the species N. spumigena, P. agardhii, and M. aeruginosa were similar (Fig. 6).
![]() View larger version (19K): [in a new window] |
FIG. 6. Evolution of the hepatotoxicity in cyanobacterial genera by using phylogenetic characterization of the ABC transporter and 16S rRNA genes. All sequences were obtained from GenBank, and the accession numbers are given in parentheses. The sequence alignment and phylogenetic tree were created by using Pileup and ClustalX from GCG (40, 48). The scale bar represents two substitutions per 100 nucleotides. Significant bootstrap values calculated from 1,000 bootstrap trees are indicated at the nodes (11). A) Phylogenetic analysis of the putative ABC transporter gene. The ndaI gene from N. spumigena NSOR10 was aligned with the mcyH sequences from M. aeruginosa strains PCC7806 and K-139 and from P. agardhii CYA126/8 and with the putative ABC transporter nosG sequence from Nostoc sp. strain GSV224. B) Phylogenetic analysis of the 16S rRNA gene.
|
|
|
|---|
![]() View larger version (27K): [in a new window] |
FIG. 7. Model of the nodularin biosynthetic pathway catalyzed by proteins encoded by ndaC to -F. The squares represent the NRPS adenylation (A), peptidyl carrier protein (PCP), and condensation (C) domains. The circles represent the PKS, ketosynthase (KS), acyltransferase (AT), C-methyltransferase (CM), dehydratase (DH), ketoreductase (KR), and acyl carrier protein (ACP) domains. Condensation reactions are indicated in red. The O-methyltransferase (OM) NdaE catalyzes the transfer of the methyl group to the C9 hydroxyl. The racemase NdaG catalyzes the epimerization of the Glu residue. The aminotransferase (AMT) domain catalyzes the transfer of the amine group to Adda. Peptide condensation reactions are catalyzed by NdaA and NdaB. The putative dehydrogenase, NdaH, catalyzes the formation of the MeDhb residue following peptide bond formation. The thioesterase (TE) domain catalyzes the cyclization and release of the peptide.
|
The structure of the cyanobacterial toxin nodularin is similar to that of the toxin microcystin (Fig. 1). It is therefore interesting to note the differences in the structures of the nda gene cluster sequenced in this study and the mcy gene clusters sequenced from M. aeruginosa strains and P. agardhii CYA126/8 (10, 33, 50). The nda genes are located upstream and downstream of a bidirectional promoter, similar to the case for the mcy cluster of M. aeruginosa; however, the order of proteins encoded by the nda gene cluster is more colinear, generally representing the order of each catalytic step required for the biosynthesis of nodularin (Fig. 2). The mcy gene cluster encodes an additional two NRPS modules, responsible for the activation of the amino acids D-Ala and L-Leu, that are not present in the structure of nodularin. Four NRPS modules of nodularin synthetase were compared to the modules of microcystin synthetase. The active sites of the D-Glu and D-MeAsp A domains are identical in both nodularin and microcystin synthetases. The active site of NdaA clearly activates and methylates L-Thr as a precursor of MeDhb and has little homology to the active site of McyA, which activates and methylates L-Ser. Conversion of MeThr to MeDhb probably then occurs via the NdaH enzyme via a mechanism similar to the conversion of MeSer to MeDha.
Interestingly, the substrate-binding pockets of the terminal activation domains of nodularin and microcystin synthetases show only 50% identity. This A domain in nodularin is likely to be specific for the activation of L-Arg. Many isoforms of microcystin have been shown to have variations at this position (6, 37, 45). Perhaps the relaxed specificity of this A domain in microcystin synthetase is reflected in the sequence of the active site of the A domain. P. agardhii CYA126/8 also produces the L-Arg isoform; however, the substrate-binding pocket is distinct from those of both NdaB and McyC of M. aeruginosa (Table 2). Further expression and activation studies are required to determine the degree of relaxed specificity of the McyC and NdaB A domains. The A domain of NdaB of N. harveyana PCC7804, which produces the nodularin-Har isoform (2, 39), was analyzed, and the active site was compared with that of NdaB of N. spumigena NSOR10. Indeed, the sequence reflects the change in specificity with the replacement of Asn with Ser in the binding pocket of N. harveyana PCC7804, clearly allowing a more compatible fit for the longer Har residue.
Evolution of hepatotoxin biosynthesis in cyanobacteria.
While nodularin production is restricted to Nodularia, the production of microcystin is not consistently present in distinct evolutionary clades of cyanobacterial species, in particular M. aeruginosa (51). It has recently been proposed that the transfer of large NRPS genes clusters is responsible for the presence of the highly homologous gene clusters in different strains of Bacillus subtilis (52). This indicates that the transfer of large fragments of DNA is more common than previously thought. The transposases associated with the mcy gene clusters in M. aeruginosa and P. agardhii and the nda gene cluster in N. spumigena may have mediated transfer of these gene clusters between cyanobacterial genera. Previous studies of mcy in Microcystis and Planktothrix proposed a recent insertion of the second and first NRPS modules of mcyA and mcyB, respectively, into the nda gene cluster (10, 24). This indicated that mcy evolved from the nda gene cluster. Characterization of the nda gene cluster from N. spumigena NSOR10 in this study has provided new data which may suggest an alternative hypothesis for hepatotoxin evolution in cyanobacteria.
A duplicated region, separated by a linker region with little homology to other proteins, was identified within the NdaA C domain. We hypothesized that the duplication within the NdaA C domain and the putative linker region are remnants of a deletion event which had occurred during evolution (Fig. 5). To confirm this hypothesis, sequence alignments and phylogenetic analysis found that NdaA-C1 was most similar to the C domain of McyA (McyA-C1), while the NdaA-C1b region was most closely related to McyB-C1 of both M. aeruginosa and P. agardhii. Thus, we propose that the structure of nodularin synthetase has evolved from microcystin synthetase via the deletion of the second and first NRPS modules of McyA and McyB, respectively. This differs from previous hypotheses that microcystin synthetase evolved from nodularin synthetase (10, 24).
To further characterize evolution of hepatotoxin biosynthesis in cyanobacteria, phylogenetic analysis of ABC transporter genes, ndaI and mcyH, was performed, and the results were compared with 16S rDNA phylogeny. Comparative analysis of the branch lengths observed in the 16S rDNA and ABC transporter gene phylogenies may suggest that the nda lineage diverged from the mcy lineage well before the transfer of the mcy gene cluster between P. agardhii CYA126/8 and M. aeruginosa PCC7806. This result supports previous hypotheses that the mcy gene cluster was transferred between strains via a transposase (24, 51); however, additional phylogenetic analysis using a larger group of sequences is required to strengthen this theory.
From these results, we propose that hepatotoxins in cyanobacteria most likely originated as microcystin-LR, the most common isoform of microcystin. The original biosynthetic gene cluster may have been similar in structure to the nda gene cluster from N. spumigena NSOR10. An MeDhb microcystin-LR isoform, the most likely direct nda ancestor, may have been inserted within the genome of Nodularia via transposition. During or following insertion of this gene cluster into the genome of Nodularia, a deletion event may have occurred, resulting in the loss of the two NRPS modules. We propose that the wide distribution of this gene cluster across the cyanobacteria and the 60 isoforms of microcystin that have been reported to date (9, 37, 45) may be attributed to transposition of the gene cluster followed by further genetic shuffling, mutagenesis, deletion, and recombination events. This is exemplified by the distinct differences present in the structures of the mcy gene clusters of M. aeruginosa and P. agardhii. Further characterization of the gene clusters, their associated transposases, and the extent of recombination and shuffling in these strains is required to more accurately describe the evolution of hepatotoxigenicity in cyanobacterial genera.
Unlike microcystin biosynthesis, nodularin biosynthesis is restricted to strains of N. spumigena and one strain of N. harveyana. Since nodularin biosynthesis has been identified in all strains of N. spumigena, it is likely that the transposition event occurred early in the delineation of the species N. spumigena. The toxic strain N. harveyana PCC7804 is phylogenetically distinct from the toxic species N. spumigena, and the toxigenicity of this strain cannot be easily explained (25). Nodularin biosynthesis in this strain may have resulted from a separate transposition-acquisition event, leading to convergent phenotypes. A putative transposase gene could not be identified downstream of ndaB in N. harveyana PCC7804; thus, the mechanism of transfer of this gene cluster is unclear.
This study reports for the first time the complete sequence and characterization of the nda gene cluster in the bloom-forming cyanobacterium N. spumigena. The sequence of the nda gene cluster will be useful for future genetic and biochemical studies into the biosynthesis of nodularin and its regulation, which will provide a strong foundation for understanding biosynthesis of toxins in blooms of N. spumigena and related cyanobacterial species. A new hypothesis describing the evolution of nodularin synthetase from microcystin synthetase has been proposed from analyses reported in this study. These results may provide important insight into mechanisms of natural evolution of nonribosomal biosynthetic clusters in cyanobacteria and how these mechanisms may be utilized for the directed evolution of such clusters for the rational design of novel metabolites.
We thank Elke Dittman for invaluable advice throughout this project and Bradley S. Moore for helpful discussions regarding the manuscript.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»