Previous Article | Next Article ![]()
Applied and Environmental Microbiology, April 2009, p. 2506-2516, Vol. 75, No. 8
0099-2240/09/$08.00+0 doi:10.1128/AEM.02136-08
Copyright © 2009, American Society for Microbiology. All Rights Reserved.
,
Abteilung Genomische und Angewandte Mikrobiologie, Institut für Mikrobiologie und Genetik der Georg-August-Universität, Grisebachstr. 8, 37077 Göttingen, Germany
Received 15 September 2008/ Accepted 3 February 2009
|
|
|---|
|
|
|---|
Proteases can be divided into various groups with respect to the functional group present at the active site and the pH profile of the activity. Microbial alkaline proteases, which are defined to be active in a neutral to alkaline pH range (13), possess either a serine center (serine proteases) or are of metal-type (metalloproteases). Extracellular serine proteases are one of the most important groups of industrial enzymes (13, 23). In addition, many extracellular metalloproteases play an important role in pathogenesis (36). Both types of proteases have been cloned and sequenced from many different individual microorganisms, i.e., Bacillus subtilis (56), Enterobacter sakazakii (22), Listeria monocytogenes (4), and Alteromonas sp. (33, 34).
In the present study, we sought to isolate proteases by a metagenomic approach. Functional metagenomics comprising the isolation of DNA from environmental samples without prior enrichment of individual microorganisms, the construction of libraries from the recovered DNA, and function-driven screening of the generated libraries has led to the identification and characterization of a variety of novel enzymes (7, 14, 43), i.e., amylases (44, 58), oxidoreductases (21), lipolytic enzymes (9, 16, 44), monooxygenases (53), and proteins conferring nickel resistance (32). Despite the wealth of different biotechnologically important enzyme activities derived from metagenomes, little information on the characteristics of metagenome-derived proteases is available. One metagenome-derived fibrinolytic metalloprotease has been recovered (26). In addition, one serine protease has been cited in reviews (13, 29). In other metagenome projects, activity-based screening was unsuccessful (19, 44), or the thereby recovered proteolytic clones were not characterized (47).
We report here on the identification and characterization of two metagenome-derived metalloproteases. We constructed metagenomic DNA libraries from environmental samples by a T4 DNA ligase-based and a topoisomerase-based cloning method. The suitability of both approaches for the construction of small-insert metagenomic libraries was compared The resulting library-containing Escherichia coli clones were screened for proteolytic activity. The plasmids were recovered from the positive E. coli strains and the insert sequences of two plasmids (pTW2 and pTW3), which conferred strong proteolytic activity, were sequenced. The targeted protease-encoding genes and the corresponding gene products were characterized.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Plasmids examined in this study
|
For the production of the metalloproteases and derivatives of these proteins, recombinant E. coli strains were grown under aerobic conditions at 37°C in basal medium containing the following per liter: Na2HPO4, 6.0 g; KH2PO4, 3.0 g; NH4Cl, 3.0 g; MgSO4·7H2O, 0.05 g; CaCl2·12H2O, 0.02 g; yeast extract, 0.2 g; and trace element solution SL4 (37), 1 ml (pH 7.4). The medium was supplemented with 20 mM glucose and, to increase the yield of secreted recombinant proteins, 0.3 M L-arginine. For the induction of heterologous gene expression, 0.5 mM (final concentration) IPTG (isopropyl-β-D-thiogalactopyranoside) was added to cultures with an A600 of
0.8. Subsequently, the cultures were incubated for 15 h and then harvested. All growth media for E. coli strains harboring plasmids contained 100 µg of ampicillin/ml.
General molecular procedures.
All manipulations of DNA, PCR, and transformation of plasmids into E. coli were done according to routine procedures (2) unless otherwise specified. The Göttingen Genomics Laboratory (Göttingen, Germany) determined the DNA sequences.
Isolation of environmental DNA and library construction.
The genomic DNA was isolated from the samples by direct lysis of the microorganisms present without prior removal of environmental compounds as described previously (17). The purified DNA was partially digested with Bsp142I and, in order to avoid cloning of very small DNA fragments, size fractionated by sucrose density centrifugation (10 to 40% [wt/vol]). To construct small-insert libraries, fractions containing DNA fragments of >2 kb were pooled and used for library construction. One library was constructed in pCR2.1-TOPO and one was constructed in pBluescript SKII+ (pSKII+) from the size-fractionated DNA derived from each of the samples described below (Table 2). The quality of the different environmental libraries produced was controlled by determination of the average insert size and the number of insert-containing clones.
|
View this table: [in a new window] |
TABLE 2. Characterization of constructed metagenomic libraries and screening of these libraries for genes conferring proteolytic activity on E. colia
|
To clone the metagenomic DNA into pCR2.1-TOPO, a TOPO TA cloning method (Invitrogen) was used. For this purpose, 5 µg of the size-fractionated metagenomic DNA was subjected to blunt-end polishing by using T4 DNA polymerase (MBI Fermentas) as suggested by the manufacturer. The reaction was performed in a total volume of 50 µl at 25°C for 1 h. Subsequently, the DNA was purified by using QIAquick PCR purification kit (Qiagen, Hilden, Germany) as recommended by the manufacturer. The volume of the purified DNA solution was 50 µl. The next step comprised addition of deoxyadenosine to the 3' termini of the DNA, which is required to facilitate cloning by the TA method. For this purpose, 6 µl of dATP solution (2 mM), 8 µl of 10-fold MgCl2-containing Taq DNA polymerase buffer (MBI Fermentas), 2 µl of Taq DNA polymerase (2 U), and 14 µl of H2O were mixed with the DNA solution, followed by incubation at 72°C for 20 min, and then purified by using a QIAquick PCR purification kit. The resulting DNA solution (50 µl) was dephosphorylated by using 2 U of calf intestine alkaline phosphatase (MBI Fermentas) as recommended by the manufacturer. Subsequently, the DNA was purified by using a QIAquick PCR purification kit. Finally, the recovered DNA solution (30 µl) was inserted into pCR2.1-TOPO by using the TOPO TA cloning kit (Invitrogen) according to the manufacturer's instructions. All constructed libraries were used to transform E. coli TOP10 (Invitrogen) as recommended by the manufacturer.
Four different samples were used to construct metagenomic libraries. The characteristics and the designations of the constructed libraries are given in Table 2. Two libraries (SK3 and CR3) were constructed from a soil sample derived from the mining shaft Fortuna (Harz Mountains, Germany), and two libraries (SK4 and CR4) were constructed from compost soil. The other four libraries were generated from DNA derived from two different mixed samples, designated mixed sample A and mixed sample B. Mixed samples were prepared by merging 0.5 g of each of the different environmental samples. Mixed sample A (libraries SK1 and CR1) contained sediment from a digestion tower of a sewage plant (Göttingen, Germany), sediment from the Gulf of Eilat (Israel), and sediment from the Leine River (Göttingen, Germany). Mixed sample B (libraries SK2 and CR2) harbored garden soil (Varmissen, Germany) and sediment from the Wadden Sea (Germany), sediment from the Leine River, and sediment from the Solar Lake (Egypt).
Cloning of the protease-encoding genes and derivatives of these genes into expression vector pET101/D.
In order to construct derivatives of protease-encoding genes mprA and mprB and to achieve high-level expression of the mprA and mprB gene products and derivatives of these gene products, the plasmids pMPRA, pMPRA.1, pMPRB, and pMPRB.1 to pMPRB.4 were constructed by a PCR-based approach (Table 1). For this purpose, the coding regions of mprA and mprB or parts of these regions were amplified from recombinant plasmids pTW2 and pTW3, respectively, by PCR using the following sets of primers with synthetic sites (underlined) that allowed a directional cloning into pET101/D and translation of the gene regions by using the pET101/D directional TOPO expression kit (Invitrogen): pMPRA, 5'-CACCATGCATTCCAGCAGTCG-3' (primer mprA-for) and 5'-GAAGGTGACGCTCCAGCTG-3' (primer mprA/mprB-rev); pMPRA.1, primer mprA-for and 5'-CGATGCCGACTGGTTGGTC-3' (primer mprA.1-rev); pMPRB, 5'-CACCATGCAGTCCAGCAGTCGT-3' (primer mprB-for) and primer mprA/mprB-rev; pMPRB.1, primer mprB-for and 5'-GCCGGTGCTGGCGGAGATG-3' (primer mprB.1-rev); pMPRB.2, 5'-CACCATGGCCGATATCGGCACC-3' (primer mprB.2-for) and primer mprA/mprB-rev; pMPRB.3, primer mprB.2-for and primer mprB.1-rev; and pMPRB.4, 5'-CACCATGGCCACCCGGGTCGACCTG-3' (primer mprB.4-for) and primer mprA/mprB-rev Each PCR contained in a total volume of 50 µl of 1-fold Mg-free polymerase buffer (MBI Fermentas), 200 µM concentrations of each of the four deoxynucleoside triphosphates, 1.5 mM MgCl2, 2 µM concentrations of each of the primers, 1 U of Pfu DNA polymerase (MBI Fermentas), and 0.1 µg of pTW2 or pTW3 as a template. The reactions were initiated at 95°C (2 min), followed by 30 cycles of 95°C (1 min), a temperature gradient ranging from 37 to 57°C (1 min), 72°C (1 min per 500 bp), and ended with incubation at 72°C for at least 10 min. Finally, all obtained PCR products were purified and cloned into pET101/D as recommended by the manufacturer (Invitrogen). All coding regions were placed under the control of the IPTG-inducible T7 promoter by cloning in the expression vector pET101/D. In addition, sequences encoding a His6 tag and a V5 epitope provided by the vector were added to the 3' end of the coding regions.
To detect V5 epitope-tagged proteins by Western blotting hybridization analysis samples of cell extracts were separated by analytical sodium dodecyl sulfate polyacrylamide gel electrophoresis using the procedure of Laemmli (24), and then transferred to Hybond-P membranes (GE Healthcare, Freiburg, Germany) as recommended by the manufacturer. Incubation of membranes with alkaline phosphatase-conjugated anti-V5 antibody (Invitrogen) and development of signals by using BCIP (5-bromo-4-chloro-3-indolylphosphate)/nitroblue tetrazolium color development reagent (Promega) were performed according to the manufacturer's instructions.
Sequence analysis.
The initial prediction of open reading frames (ORFs) located on the inserts of pTW2 and pTWP3 was accomplished by using the Artemis program (http://www.sanger.ac.uk/Software/Artemis) (46). The results were verified and improved manually by using criteria such as the presence of a ribosome-binding site, GC frame plot analysis, and similarity to known protein-encoding sequences. Initial annotation of the deduced proteins was performed by searching the amino acid sequences against the public GenBank database by using the BLAST program (1). All predictions were verified and modified manually by comparing the protein sequences with the Swiss-Prot, ProDom, COG, and Prosite public databases. All coding sequences were searched for similarities to protein families and domains using searches against the Pfam (10) and CDD databases (31). Signal peptides of proteins were predicted by using the SignalP 3.0 server (http://www.cbs.dtu.dk/services/SignalP/) (3). Sequence analysis and classification of the identified proteases were performed by comparing the sequences to the MEROPS peptidase database (http://merops.sanger.ac.uk) (42).
Preparation of cell-free culture supernatants and cell extracts.
The cells and the culture supernatants from 500-ml cultures were separated by centrifugation at 16,300 x g and 4°C for 15 min. To prepare cell extracts, the resulting cell pellets were washed once with 100 mM potassium phosphate buffer (pH 8.0) and resuspended in 2 to 3 ml of the same buffer. The cells were disrupted by using a French press (1.38 x 108 Pa). Subsequently, the extract was cleared by centrifugation at 32,000 x g and 4°C for 30 min.
The culture supernatant was concentrated 20-fold by ultrafiltration using a Vivaflow 200 module (exclusion limit 10,000 Da; Sartorius AG, Göttingen, Germany) as recommended by the manufacturer. The module was also used to change buffer systems. The concentrated culture supernatants were used for protein purification as described below. To characterize the protease activity directly in the culture supernatant, it was further concentrated by using a Vivaspin 6 concentrator (Sartorious AG) as recommended by the manufacturer. The culture supernatant was concentrated 100-fold by using both ultrafiltration steps.
Enzyme assays.
Protease activity was determined by assaying the amount of liberated amino acids using casein as a substrate. To 0.5 ml of 0.65% (wt/vol) casein solution in 50 mM potassium phosphate buffer (pH 7.5) 0.1 ml of enzyme solution was added. After incubation for 1 h at 55°C, the reaction was terminated by the addition of 0.5 ml of 0.11 M trichloroacetic acid solution. For the preparation of blanks, the enzyme solutions were added after termination of the reaction by trichloroacetic acid. Subsequently, the terminated reaction mixtures were incubated at 55°C for 30 min, and the precipitate was removed by centrifugation at 14,000 x g for 5 min. An aliquot (0.4 ml) of the remaining supernatant was mixed with 1 ml of 0.5 M Na2CO3 solution and 0.2 ml of Folin-Ciocalteu reagent (Sigma Chemie, Deisenhofen, Germany), which was diluted fivefold prior to use. After incubation at 37°C for 30 min, the solution was cleared by centrifugation, and the absorbance was measured at 660 nm. A standard curve was generated using solutions of 0 to 80 mg of tyrosine/liter. One unit of protease activity was defined as the amount of enzyme that liberated 1 µmol of tyrosine in 1 min. All measured values of absorbency were corrected for the blank absorbencies. Protein concentrations were determined by the method of Bradford (5) with bovine serum albumin as the standard.
The optimum temperature for enzyme activity was measured in the range of 15 to 85°C. The optimum pH was determined by incubating the enzyme with casein as the substrate in 50 mM potassium phosphate buffer (pH 6 to 9) or 50 mM sodium carbonate buffer (pH 9 to 11.5).
Effect of inhibitors.
The effects of inhibitors on proteolytic activity were determined by incubation of the proteases with 10 mM ortho-phenanthroline, 10 mM phenylmethylsulfonyl fluoride, or 10 mM EDTA for 1 h at 25°C before the enzyme activity was measured.
Purification of proteases.
The proteases were purified from the concentrated culture supernatants by using hydrophobic interaction chromatography using a computer-controlled ÄKTA FPLC system (GE Healthcare) as recommended by the manufacturer. Protease solutions (13.7 to 16.5 mg of protein) containing MprA, MprB, MprA.1, or MprB.1 were applied to a prepacked Phenyl-Sepharose HP XK 16/10 column (GE Healthcare) that has been equilibrated with 1 M (NH4)2SO4 in 50 mM Tris buffer (pH 8). Subsequently, the column was washed with a negatively sloped gradient of ammonium sulfate [200 ml of 1 to 0 M (NH4)2SO4 in 50 mM Tris buffer; pH 8]. The proteases desorbed from the column, when the ammonium sulfate concentration dropped below 100 mM. The flow rate was 1.5 ml/min, and 6-ml fractions were collected. Subsequently, active fractions were pooled, and the pools of the different proteases (0.7 to 2.4 mg of protein) were used for characterization of the enzymes.
Nucleotide sequence accession numbers.
The nucleotide sequences of the inserts of pTW2 and pTW3 have been deposited in the GenBank database under accession numbers EU333168 and EU333169, respectively.
|
|
|---|
Two different types of small-insert metagenomic libraries were constructed from each of the four environmental samples. One type was generated by using pSKII+ as the vector and a standard ligation reaction with T4 DNA ligase to clone the metagenomic DNA. The other type was constructed by using pCR2.1-TOPO as the vector and a TOPO TA cloning approach. This approach is based on the ligase activity of topoisomerase I, which is bound to the linearized vector. In order to compare both approaches, 5-µg portions of the isolated and size-fractionated DNA from each of the four environmental samples were used as starting material for library construction. The mixed samples were used to study the influence of different environmental matrix substances on library construction.
The four libraries in the high-copy vector pSKII+ (libraries SK1 to SK4) and the four libraries in vector pCR2.1-TOPO (libraries CR1 to CR4) contained 389,000 insert-bearing E. coli clones, which harbored 1.94 Gb of cloned environmental DNA (Table 2). The number of clones per µg of DNA used for library construction and the average insert sizes ranged from 2,000 to 15,600 and from 2.4 to 5.3 kb, respectively. Direct comparison of both different types of libraries generated from each of the four samples revealed that use of the TOPO TA-based cloning method results in more insert-containing clones per µg of DNA and in larger average insert sizes than the application of the standard cloning method involving T4 DNA ligase (Table 2).
Activity-based screening of the constructed libraries.
The screen for genes conferring proteolytic activity was based on the ability of the library-containing E. coli clones to form halos when grown on indicator agar medium containing skim milk. Halo formation is caused by hydrolysis of the milk proteins. The screening was initiated by the transfer of approximately 10,000 clones of each library to indicator agar medium. Thus,
318 Mb of environmental DNA were screened for the presence of genes encoding proteolytic activity. After incubation for 48 h at 37°C, positive E. coli clones were collected. In order to confirm that the proteolytic activity of the positive clones was plasmid encoded, the recombinant plasmids were isolated and used to transform E. coli. The resulting E. coli strains were screened again on indicator agar. Four different recombinant plasmids designated pTW1 to pTW4 conferred a stable proteolytic phenotype. Two of these plasmids were derived from library SK2: one from library CR1 and one from library CR3 (Table 2). All four plasmids conferred also a proteolytic phenotype on indicator agar containing other protease substrates such as azoalbumin and azocasein. E. coli strains harboring pTW2 or pTW3 showed the largest halos and the fastest appearance of the proteolytic phenotype of all positive clones on indicator agar. In addition, prolonged incubation of these recombinant E. coli strains resulted in complete clearing of the entire skim milk-containing indicator agar medium (25 ml) placed in a standard petri dish (data not shown). Since these results indicated a strong proteolytic activity, the plasmids pTW2 and pTW3 and the corresponding E. coli clones (E. coli TOP10/pTW2 and E. coli TOP10/pTW3) were studied further.
Sequence analysis of pTW2 and pTW3.
The inserts of pTW2 (28,113 bp) and pTW3 (19,956 bp) were sequenced, analyzed, and compared to the sequences in the National Center for Biotechnology Information databases (see Tables S1 and S2, respectively, in the supplemental material). The apparent gene organizations are shown in Fig. 1A and B, respectively. The similar G+C content of the sequenced inserts of pTW2 (68.3%) and pTW3 (69.5%) suggested a similar phylogenetic affiliation of the cloned DNA fragments. Each of the inserts contained one ORF that encoded a putative protease (mprA [pTW2] and mprB [pTW3]; see below).
![]() View larger version (40K): [in a new window] |
FIG. 1. Comparison of the inserts of pTW2 (A) and pTW3 (B) with correlating genomic regions of X. campestris pv. vesicatoria strain 85-10 (C), X. campestris pv. campestris strain ATCC 33913 (D), X. campestris pv. campestris strain 8004 (E), X. axonopodis pv. citri strain 306 (F), X. oryzae pv. oryzae KACC 10331 (G), and X. oryzae pv. oryzae MAFF 311018 (H). Arrows and arrowheads indicate the lengths, location, and orientations of potential genes. Genes sharing the same predicted functions are marked with identical colors. Gray arrows and open arrows indicate conserved hypothetical proteins and hypothetical proteins, respectively. Genes encoding putative proteases are boxed in yellow. ORFs that are found in pTW2 and pTW3 and not in one of the six Xanthomonas strains are marked by the red triangles. Genes of the Xanthomonas strains that are not present in the insert sequences of pTW2 and pTW3 are indicated by the black rectangles. Numbers below the arrows indicate the ORF position. The predicted functions of the gene products from the six Xanthomonas strains and detailed comparisons with pTW2 and pTW3 are given in Tables S4 to S9 in the supplemental material. References were as follows: pTW2, EU333168 (the present study); pTW3, EU333169 (the present study); X. campestris pv. vesicatoria strain 85-10 (base 1118626 to base 1074579), AM039952; X. campestris pv. campestris strain ATCC 33913 (base 1040326 to base 995188), AE008922; X. campestris pv. campestris strain 8004 (base 4000216 to base 4045359), CP000050; X. axonopodis pv. citri strain 306 (base 1119791 to base 1071729), AE008923; X. oryzae pv. oryzae KACC 10331 (base 3859493 to base 3902413), AE013598; and X. oryzae pv. oryzae MAFF 311018, (base 3858820 to base 3899102), AP008229.
|
The insert of pTW3 contained 22 predicted protein-encoding ORFs. Two ORFs (ORFs 1 and 9) were unrelated to any known genes, and 20 ORFs possessed similarities to known bacterial genes. Seventeen of the proteins deduced from the latter ORFs revealed high amino acid sequence identities (33 to 89%) to database entries of the above-mentioned members of the Xanthomonadaceae. The gene products deduced from the remaining three ORFs (ORFs 2, 8, and 22), including mprB, were most similar to gene products from organisms belonging to other families (see Table S2 in the supplemental material).
Comparative sequence analysis of pTW2 and pTW3 revealed that the inserts contained similar but not identical overlapping gene regions. (Fig. 1A and B). This includes ORFs 18 to 32 of pTW2 and ORFs 1 to 12 of pTW3. The amino acid identities of correlating proteins deduced from pTW2 and pTW3 in this region ranged from 68 to 96% (see Table S3 in the supplemental material). These high similarities, together with the similar gene organization of the overlapping regions and the similar G+C content of the inserts, suggested that the cloned DNA fragments of pTW2 and pTW3 were derived from closely related but different organisms.
The high similarities of the majority of the gene products deduced from the inserts of pTW2 and pTW3 to gene products from members of the Xanthomonadaceae family suggested a phylogenetic affiliation of the organisms from which the inserts of pTW2 or pTW3 were derived to the Xanthomonadaceae. To further analyze this, the sequences and the gene organization of the inserts of pTW2 and pTW3 were compared to complete genome sequences of members of the Xanthomonadaceae. The gene organization of both inserts revealed striking similarities to the organization of genomic regions of members of the genus Xanthomonas, such as X. axonopodis pv. citri strain 306, X. campestris pv. campestris strain ATCC 33913, X. campestris pv. campestris strain 8004, X. campestris pv. vesicatoria strain 85-10, X. oryzae pv. oryzae strain KACC 10331, and X. oryzae pv. oryzae strain MAFF 311018 (Fig. 1). In each case, at least 68% of the deduced gene products of pTW2 or pTW3 shared high amino acid sequence identity with the correlating gene products of the different Xanthomonas species (for detailed comparisons, see Tables S4 to S9 in the supplemental material). In addition, the G+C contents of pTW2 and pTW3 and of the Xanthomonas species (63 to 67%) (25) are in the same range.
Most of the differences between the insert sequences of pTW2 and pTW3 and the genomic regions of the Xanthomonas species were located in the neighborhood of the putative protease-encoding genes. The six Xanthomonas strains possessed one or three putative serine protease-encoding genes in the corresponding region but none of the protein sequences deduced from these genes showed high similarity to MprA and MprB. The amino acid identities ranged from 6 to 22% (see Tables S4 to S9 in the supplemental material).
Sequence analysis of the protease-encoding genes.
The presumptive genes encoding the proteases MprA (pTW2) and MprB (pTW3) were of equal size (2589 bp) and exhibited 90% nucleotide sequence identity and 92% amino acid sequence identity (see Fig. S1 in the supplemental material). A potential signal peptide of 27 amino acids was predicted at the N terminus of MprA and MprB by using the SignalP program (3). The amino acid sequences of both putative signal peptides showed the typical orientation of signal peptides with three distinct parts (N, H, and C domains (38). The protein sequences of MprA and MprB contained the highly conserved zinc-binding motif HEXXH (residues H362 to H366) and the third zinc ligand motif GXXNEXXSD (residues G382 to D390) (Fig. 2). The presence of these conserved motifs is typical for metalloproteases belonging to the M4 family of metallopeptidases, which is also known as the thermolysin-like family (40). Most of the members are secreted bacterial enzymes that degrade extracellular proteins. This is in accordance with the presence of putative signal peptides in the N-terminal regions of MprA and MprB.
|
View larger version (23K): [in a new window] |
FIG. 2. Alignment of the of the active-site regions of MprA and MprB with those of other metalloproteases belonging to the M4 family. The conserved HEXXH and GXXNEXXSD motif are shaded. Identical amino acid residues are indicated by bold letters. References were as follows: MprA and MprB (the present study); MprI from Pseudoalteromonas piscicida (34); EmpI from Pseudoalteromonas sp. strain A28 (27); vibriolysin from Vibrio vulnificus (6); and thermolysin from Bacillus thermoproteolyticus (CAA01492).
|
![]() View larger version (22K): [in a new window] |
FIG. 3. Domain structure of MprA (A) and MprB (B) and localization of the constructed derivatives of both proteases. For amino acid positions and similarities of domains and motifs, see Table 3. Abbreviations: SP, signal peptide; FTP, fungalysin/thermolysin propeptide motif; M4, peptidase M4 catalytic domain; M4_C, peptidase M4 alpha-helical domain; P_propr, proprotein convertase P domain.
|
|
View this table: [in a new window] |
TABLE 3. Localization and similarities of domains and motifs in the amino acid sequences of MprA and MprBa
|
Subcloning, expression, and characterization of mprA and mprB and of derivatives.
To show that mprA and mprB encode extracellular proteases and to unravel the importance of the different domains for enzyme activity the coding regions and derivatives of these regions that lack one or more of the putative domains (Fig. 3) were amplified by PCR. Since mprA and mprB are highly similar, most of the derivatives were constructed from the coding region of one gene (mprB). The PCR products were cloned into the expression vector pET101/D, thereby placing the genes under the control of the IPTG-inducible T7 promoter and adding sequences encoding a His6 tag and a V5 epitope. In addition, to facilitate protein production of derivatives that lack the N-terminal region, a start codon (ATG) was added to the 5' end of the coding region. The fidelity of the PCR products and the cloning steps were confirmed by sequencing of the resulting constructs pMPRA, pMPRA.1, pMPRB, and pMPRB.1 to pMPRB.4 (Table 1 and Fig. 3). The plasmids were transformed into E. coli BL21. Subsequently, the correlating E. coli strains BL21/pMPRA, BL21/pMPRA.1, BL21/pMPRB, and BL21/pMPRB.1 to BL21/pMPRB.4 were grown on indicator agar medium containing skim milk and trace amounts of IPTG (Fig. 4). Proteolytic activity was detectable for the E. coli strains BL21/pMPRA, BL21/pMPRA.1, BL21/pMPRB, and BL21/pMPRB.1. These strains harbored full-length protease-encoding genes or derivatives encoding the signal peptide, the proregion, and the protease region. A proteolytic phenotype was also recorded for the positive controls (E. coli TOP10/pTW2 and E. coli TOP10/pTW3). The other recombinant E. coli strains, including the strains containing pET101/D or pSKII+ (negative controls), showed no proteolytic activity (Fig. 4). All of the active E. coli clones mentioned above revealed also a proteolytic phenotype on indicator agar containing azoalbumin or azocasein as protease substrate.
![]() View larger version (56K): [in a new window] |
FIG. 4. Proteolytic activity of recombinant E. coli strains containing pTW2, pTW3, pMPRA, pMPRA.1, pMPRB, and pMPRB.1 to pMPRB.4 on skim milk-containing agar plates. The host for the plasmids pTW2 and pTW3 and the negative control pSKII+ was E. coli TOP10. The plasmids pMPRA, pMPRA.1, pMPRB, pMPRB.1 to pMPRB.4 and the negative control pET101/D were maintained in E. coli BL21. The recombinant E. coli strains were cultured on LB agar containing 2% skim. Halo formation indicated proteolytic activity. The gene regions cloned in pMPRA, pMPRA.1, pMPRB, and pMPRB.1 to pMPRB.4 are shown in Fig. 3.
|
|
View this table: [in a new window] |
TABLE 4. Proteolytic activity in culture supernatants of recombinant E. coli BL21 strains containing pMPRA, pMPRA.1, pMPRB, pMPRB.1, pMPRB.2, pMPRB.3, or pMPRB.4a
|
|
|
|---|
In the present study, the isolation of metagenomic DNA was achieved by a direct lysis approach. This approach often results in coextraction of environmental matrix substances, which remain in the purified environmental DNA and interfere with the restriction digestion or the ligation reaction (7). The better performance of the TOPO TA approach for cloning of environmental DNA may be due to a lack of susceptibility of topoisomerase 1 toward remaining matrix substances. In addition, the TOPO TA method involves blunt-end polishing using T4 DNA polymerase, which exhibits proofreading activity. This activity may have contributed to an improvement of the overall quality of the environmental DNA used for cloning. In conclusion, the TOPO TA-based method is advisable for the generation of small-insert libraries from environmental DNA that has been isolated by direct lysis approaches.
The function-driven screening and the identification of clones exhibiting proteolytic activity was based on a well-established screen on skim milk-containing agar. This screen has been used to identify protease activity of individual microorganisms (22, 55) and recombinant E. coli strains that harbor gene libraries from single microorganisms (27) or metagenomic libraries (13, 26, 47). The partial screening of the constructed metagenomic libraries for genes encoding proteolytic enzymes and characterization of the recombinant plasmids recovered from positive clones resulted in the identification of mprA and mprB. Besides MprA and MprB, only one other active metalloprotease has been obtained from a metagenome project (26). In addition, a serine protease has been mentioned in reviews, but the original work was not published (13, 29). In other cases, screening of metagenomic libraries for proteolytic activity was unsuccessful (19, 44). Jones et al. (19) found glycoside hydrolases instead of proteases during activity-based screening of the human gut microbiome on skim milk-containing LB medium. These authors concluded that this screen is not entirely suitable for detection of proteolytic enzymes. Thus, the use of this screen for the identification of proteolytic clones might be one explanation for the current deficit of metagenome-derived proteases. Since the above-mentioned activity-based screen is the standard screen for identification of proteolytic enzymes, this cannot be the only explanation. Proteases degrade or modify enzymes (13, 39). Thus, the detection and recovery of proteases might have been hindered by deleterious effects of foreign protease activity on the host cells. Other possible explanations are associated with the general drawbacks of activity-based screening of metagenomic libraries. Although function-driven screens result in the identification of functional gene products, one limitation of this approach is its dependence on the expression of the cloned genes and synthesis of the corresponding gene products in a foreign host (7). In the case of extracellular proteases, secretion and processing of the foreign protein by the host strain is also required. Therefore, the inability to recover active proteases encoded by metagenomic DNA during function-driven screening might be due to the fact that many genes and gene products are not expressed, not translated, or not active in the host strain. In the present study, the E. coli host was able to synthesize and secrete active proteases derived from a foreign organism.
We were able to deduce the origin of the metagenomic DNA fragments that harbor mprA and mprB. The high sequence similarities of most of the gene products encoded by the inserts of pTW2 and pTW3 to gene products from different members of the Xanthomonadaceae (see Tables S1 and S2 in the supplemental material), and the similar organization of the cloned metagenomic gene regions and genomic regions of different Xanthomonas strains (Fig. 1) suggested that the inserts of pTW2 and pTW3 were derived from members of the genus Xanthomonas. Xanthomonas and E. coli are both gram negative and belong to the Gammaproteobacteria. This phylogenetic relationship and the similar type of cell wall might have contributed to the ability of E. coli to produce a foreign extracellular protease in an active form. Interestingly, the Xanthomonas strains contained one or more serine protease-encoding genes in a position similar to that of mprA and mprB (Fig. 1). Therefore, the localization of a protease-encoding gene in this genomic region appears to be important. The six Xanthomonas strains depicted in Fig. 1 are phytopathogenic organisms, and protease activity plays an important role in virulence (15, 28). However, an involvement of the proteases located in this region in pathogenesis has not been shown. In addition, MprA and MprB showed no significant sequence similarity to the serine proteases derived from the Xanthomonas species. Nevertheless, MprA and MprB belong to the zinc-containing metalloproteases of which many are involved in pathogenesis (4, 6, 15, 20, 35). Furthermore, the amino acid sequences of MprA and MprB showed the highest similarities (ca. 50% identity) to the zinc-containing metalloprotease EmpI from Pseudoalteromonas sp. strain A28, which exhibits potent algicidal activity (27). Thus, it cannot be excluded that MprA and MprB have a function in pathogenesis in the organisms from which these enzymes originate.
Analysis of MprA and MprB revealed that both enzymes are members of the M4 family of metallopeptidases. Both metalloproteases exhibited a unique modular structure and consisted of four parts (Fig. 3). The protease region showed high similarity to that of various zinc-containing metalloproteases belonging to the M4 family. It has been shown for thermolysin the prototype member of the M4 family that the two histidines residues of the highly conserved zinc-binding motif HEXXH (Fig. 2) are zinc ligands and the glutamate residue is an active site residue (41). The glutamic acid residue of the other conserved motif GXXNEXXSD is the third zinc ligand. This residue is located 20 residues downstream of the zinc-binding motif of MprA and MprB (Fig. 2), as is typical for M4 family members (18). All members of this family bind a single zinc ion that is coordinated by the above-mentioned residues and an activated water molecule. Thermolysin-like metalloproteases are synthesized as inactive preproenzymes, which are converted to the mature protein by proteolytic cleavage (4, 41, 50). SignalP analysis revealed that the first 27 amino acids of MprA and MprB are typical signal peptides. The signal peptides may be removed during the passage through the inner membrane by endogenous signal peptidases provided by the host, as observed for other zinc-metalloproteases (8, 20). Since the entire protease activity was found in the cell-free culture supernatant and the presence of the sequences encoding the signal peptide in the gene regions was required to obtain protease activity (Fig. 4), the functionality of the signal peptides of MprA and MprB is indicated. In addition to the signal peptide and the protease region, the presence of the N-terminal proregion was also essential for the activity of MprA and MprB. Processing of the proregion of thermolysin-like proteases is required to yield the active protein. The proregion serves as a chaperone and as an inhibitor of catalysis, which prevents premature activation of the enzyme (50, 51). The C-terminal extension of MprA and MprB containing two PPC domains and one P domain is unique for M4 metalloproteases. Usually, the C-terminal extension is not essential for enzyme activity and is not present in the active proteases (57). Some metalloproteases, such as thermolysin and elastase, have no C-terminal extension. It has been suggested that C-terminal extensions containing PPC domains are involved in adhesion to protein substrates (35) or in protein secretion through the outer membrane of gram-negative bacteria (11). The C-terminal extension of MprA and MprB showed with respect to the presence of the two PPC domains similarity to other M4 metalloproteases such as EmpI (27) and MprI (34), but the latter enzymes lack an additional P domain. P domains are primarily associated with subtilisin-like proprotein convertases of eukaryotes, which are serine endopeptidases. The P domain of these enzymes is located immediately downstream of the catalytic domain. It has been reported that the P domain is required for the folding and regulation of pH dependence of the catalytic domain (49, 59). In addition, P domains contain the amino acid sequence RGD. The integrity of this sequence is necessary for zymogen and C-terminal processing and cellular trafficking of proprotein convertase (45). The amino acid sequences of MprA and MprB contained an RGD sequence (R794 to D796). Our studies with the derivatives lacking the PPC domains and the P domain (MprA.1 and MprB.1) revealed that the domains of the C-terminal extension are not crucial for enzyme activity, localization of the enzymes, and the pH optimum of enzyme activity. Taking into account that mprA and mprB were expressed in a foreign host, this could be different in the organisms from which mprA and mprB originated.
In conclusion, the applied function-driven metagenomic approach resulted in identification of two extracellular metalloproteases with a novel domain structure. The function of the unique C-terminal extension, especially the role of the P domain, will be examined in the future. Since the recovered protease genes-containing DNA fragments were probably derived from members of the genus Xanthomonas, we will also use genetically accessible Xanthomonas strains as hosts for mprA and mprB.
Published ahead of print on 13 February 2009. ![]()
Supplemental material for this article may be found at http://aem.asm.org/. ![]()
|
|
|---|
5β1. J. Biol. Chem. 274:12461-12467.
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»