Previous Article | Next Article ![]()
Applied and Environmental Microbiology, October 2005, p. 5787-5793, Vol. 71, No. 10
0099-2240/05/$08.00+0 doi:10.1128/AEM.71.10.5787-5793.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
United Graduate School of Agricultural Science, Tokyo University of Agriculture and Technology, Fuchuu-city, Tokyo 183-8509, Japan,1 College of Agriculture, Ibaraki University, Ami, Ibaraki 300-0393, Japan2
Received 17 February 2005/ Accepted 12 May 2005
|
|
|---|
|
|
|---|
The anaerobic cellulolytic bacterium Eubacterium cellulosolvens is sporadically dominant in the rumen (18). It is known that E. cellulosolvens 5 adheres tightly to cellulose, so studies of this adhesion have been performed (13, 14, 23-26). Some cellulose-binding proteins (CBPs) have been found in culture supernatant and cell lysate of the organism (14, 26). A gene encoding cellulose-binding protein A (CBPA), which is one of these CBPs, has been cloned and characterized (23-25). Additionally, the presence of some proteins exhibiting carboxymethyl cellulase (CMCase) activity in culture supernatant and cell lysate of E. cellulosolvens 5 was revealed by zymogram analysis (26). In order to advance research on the mechanism of cellulose degradation by this bacterium, we tried to isolate a gene encoding CMCase from the genomic DNA library of E. cellulosolvens 5.
In this report, we describe cloning and nucleotide sequencing of the E. cellulosolvens 5 endoglucanase Cel5A gene (cel5A), the primary structure of Cel5A, and the enzymatic properties of Cel5A and derivatives of this protein expressed in Escherichia coli. We also describe the importance of novel carbohydrate-binding modules (CBMs) in cellulose hydrolysis by Cel5A.
|
|
|---|
Construction of genomic DNA library, screening of CMCase-producing clones, and sequencing.
Ligation, transformation, and restriction enzyme analysis were performed by standard procedures (20). A genomic DNA library of E. cellulosolvens 5 was constructed in the ZAP Express vector as described previously (23). The genomic library was screened for CMCase-producing clones by using an overlay of 0.7% (wt/vol) top agar containing 0.2% (wt/vol) carboxymethyl cellulose (CMC). Plaques having CMCase activity were recognized by the formation of clear haloes on a red background after staining with 0.1% (wt/vol) Congo red and destaining with 1 M NaCl (27). Positive plaques were reisolated three times to ensure purity. The sequences of both strands were determined as described previously (23).
Cloning of a DNA fragment encoding the N-terminal end of Cel5A by targeted gene walking PCR.
Targeted gene walking PCR was performed as described previously (23). The first PCR was performed with a targeted sequence-specific primer, K-3 (Table 1), containing a known sequence in the cel5A gene, and a walking BK-reverse primer corresponding to the sequence present in the multiple-cloning site of the
ZAP Express vector. After the first PCR, the PCR product was subjected to a second PCR with an internal detection primer, K-4 (Table 1), which contained a known sequence upstream from the K-3 sequence, and a walking T3 primer corresponding to the T3 promoter sequence that is located downstream from the BK-reverse sequence present in the multiple-cloning site of the
ZAP Express vector. The first PCR amplification was performed for 32 cycles consisting of 94°C for 30 s, 52°C for 45 s, and 72°C for 2 min, using ExTaq (Takara). The second PCR was performed under the same conditions as the first PCR except that the annealing temperature was 56°C.
|
View this table: [in a new window] |
TABLE 1. PCR primers used for screening of the N terminus of Cel5A and construction of rCel5A and its derivatives
|
![]() View larger version (28K): [in a new window] |
FIG. 1. Restriction map of pKY1 (A) and molecular architecture of Cel5A and its derivatives used in this study (B). The thin lines indicate vector plasmid DNA. The arrows indicate the primers used for targeted gene walking PCR. The different modules of Cel5A are indicated.
|
Enzyme assays.
The CMCase activities of the derivatives of Cel5A were detected by spotting the proteins on an agar plate containing 0.2% (wt/vol) CMC in 50 mM sodium citrate buffer (pH 5.5). The plate was incubated for 15 min at 37°C. The CMCase activities of the proteins were detected by Congo red staining as described above. The hydrolytic activities of the recombinant proteins were measured after 5 or 60 min of incubation at 37°C in 50 mM sodium citrate buffer (pH 5.0) in the presence of 1% (wt/vol) polysaccharides. The polysaccharides tested were CMC (Wako), acid-swollen cellulose (ASC) (28), Avicel (PH-101; Asahi Chemical Industry), oat spelt xylan (Sigma), lichenan (Sigma), laminarin (Sigma), chitin (Wako), and mannan (Sigma). The reducing sugars released from the substrates were determined with the 3,5-dinitrosalicylic acid reagent as described by Miller (12). One unit of activity was defined as the amount of enzyme that released 1 µmol of glucose equivalents per min from the substrate. The viscometric assay with recombinant Cel5A was performed using CMC as described previously (25).
SDS-PAGE and zymogram analysis.
Sodium dodecyl sulfate (SDS)-polyacrylamide gel electrophoresis (PAGE) was performed using standard procedures, as described previously (26). Zymogram analysis was performed by the method of Toyoda et al. (26), with some modifications.
Analysis of hydrolysis products.
Cellooligosaccharides (cellobiose to cellohexaose; 1 mg each) were incubated with 0.1 U of the purified enzyme in 0.2 ml of 50 mM sodium citrate buffer (pH 5.5) at 37°C for 6 h. Thin-layer chromatography (TLC) of the hydrolysis products was performed on a silica gel plate (Sigma) developed with a solvent consisting of 1-propanol, acetic acid, and water (2:1:1, vol/vol/vol). Cellooligosaccharides were visualized by spraying the plate with an aniline-diphenylamine-phosphoric acid reagent and heating it at 120°C (30).
Polysaccharide binding assay.
Binding of Cel5A and its derivatives to the insoluble polysaccharides was determined as follows. The proteins were mixed with insoluble polysaccharides (5 mg) in 0.2 ml of 50 mM sodium phosphate buffer (pH 6.5) and were incubated on ice for 1 h with occasional stirring. After centrifugation, the pellets were washed four times with 50 mM sodium phosphate buffer (pH 6.5). Then the polysaccharides with bound proteins were eluted with 100 µl of 5% SDS for 30 min at 37°C. The eluted proteins were collected by centrifugation and subjected to SDS-PAGE. The polysaccharides tested were ASC, Avicel, oat spelt xylan, lichenan, chitin, agarose (Agarose S; Nippon Gene), Sephadex G-25 (Amersham Biosciences), and corn starch (Wako).
The affinities of the proteins for soluble polysaccharides, including methylcellulose (Wako), birchwood xylan (Sigma), laminarin, and soluble starch (Wako), were examined by native affinity gel electrophoresis as described by Arai et al. (4). The separating gel contained 7.5% acrylamide. The polysaccharides were incorporated into the gel at a concentration of 0.1% prior to polymerization. A control gel without polysaccharides was prepared and run simultaneously.
Nucleotide sequence accession number.
The nucleotide sequence data reported in this paper have been deposited in the DDBJ, EMBL, and GenBank nucleotide sequence databases under accession number AB179780.
|
|
|---|
Nucleotide sequence of the cel5A gene.
The nucleotide sequence of the gene encoding Cel5A was determined from both the sequence of the inserted DNA in pKY1 and the sequence of the 4.8-kb PCR product obtained by the gene walking PCR. The open reading frame of the cel5A gene consisted of 3,444 nucleotides encoding a 1,148-amino-acid protein with a deduced molecular mass of 127,047 Da. The putative ATG translational start codon was preceded at a spacing of 6 bp by a potential ribosome-binding site (GGGGA). Two possible promoter sequences, TTGCGT and TTGCAG for the 35 region and TATAAT and TAAAAA for the 10 region, with 17-bp spacing, were located upstream of the open reading frame. A presumptive transcription terminator that consisted of a 12-bp palindrome was found downstream of the TAA termination codon.
Structural features of Cel5A.
The deduced primary structure of Cel5A is shown in Fig. 1B. The 36-amino-acid sequence of the N terminus of the deduced polypeptide had features similar to features of the signal sequences of prokaryotes. A comparison of the amino acid sequence of Cel5A with the amino acid sequences of other proteins using the BLAST program revealed that the mature Cel5A consisted of four modules, two linker sequences, and a C-terminal sequence with an unknown function. In particular, Cel5A comprised mainly two catalytic modules belonging to glycosyl hydrolase family 5 and two modules with unknown functions. The two modules with unknown functions had carbohydrate-binding ability, as described below. The C-terminal sequence with an unknown function was composed of 17 amino acid residues. The two family 5 catalytic modules of Cel5A were designated GHF5-1 and GHF5-2. The amino acid sequences of GHF5-1 and GHF5-2 were 94% identical. An amino acid sequence alignment of the catalytic modules of Cel5A and family 5 glycosyl hydrolases is shown in Fig. 2. The amino acid sequence of the GHF5-1 module of Cel5A exhibited high degrees of similarity to the amino acid sequences of endoglucanase VII from Ruminococcus albus (43% sequence identity for 355 amino acid residues) (16), CelE from Orpinomyces sp. strain PC-2 (39% sequence identity for 359 amino acid residues) (7), CelB29 from Orpinomyces joyonii (39% sequence identity for 359 amino acid residues) (19), and CelA from Clostridium longisporum (37% sequence identity for 353 amino acid residues) (15). All of these enzymes had a family 5 catalytic module. Two modules with unknown functions (UM-1 and UM-2) were linked to the N termini of GHF5-1 and GHF5-2, respectively (Fig. 1B). The level amino acid sequence identity between UM-1 and UM-2 was 73%. Furthermore, two linker sequences rich in proline, threonine, and serine linked to the C termini of GHF5-1 and GHF5-2.
![]() View larger version (72K): [in a new window] |
FIG. 2. Alignment of the amino acid sequences of the catalytic modules of E. cellulosolvens 5 (e.c.) Cel5A, C. longisporum (c.l.) CelA (GenBank accession no. L02868), O. joyonii (o.p.) CelB29 (GenBank accession no. AF015248), Orpinomyces sp. strain PC-2 (o.sp.) CelE (EMBL accession no. U97153), and R. albus (r.a.) endoglucanase VII (GenBank accession no. AB028321). Amino acids that are conserved in at least four of the six sequences are shaded. Dashes indicate gaps left to improve the alignment. The numbers indicate the amino acid residues at the beginning of each line; all sequences are numbered from Met-1 of the peptide.
|
![]() View larger version (107K): [in a new window] |
FIG. 3. Recombinant Cel5A expressed in E. coli and its CMCase activity. Proteins in gels containing 0.1% CMC were stained with Coomassie brilliant blue (lanes 1 to 3), and CMCase in gels containing 0.1% CMC was detected by Congo red staining (lane 4). Whole-cell lysates of both E. coli BL21(DE3) (control) (lane 1) and E. coli BL21(DE3) harboring plasmid pKY12 (lane 2) were included. The rCel5A expressed was partially purified with ProBond resin (lanes 3 and 4). The arrow indicates the position of the approximately 127-kDa band. The locations of molecular mass markers are indicated on the left.
|
![]() View larger version (48K): [in a new window] |
FIG. 4. Expression of Cel5A and its derivatives in E. coli (A) and binding of the proteins to acid-swollen cellulose (B). Lane 1, Cel5A; lane 2, D1; lane 3, D2; lane 4, D3; lane 5, D4; lane 6, D5; lane 7, D6; lane 8, D7. The locations of molecular mass makers are indicated on the left.
|
The affinities of the CBMs (D1 and D4) for a series of soluble polysaccharides were also qualitatively evaluated by native affinity gel electrophoresis (Fig. 5). The migration of these proteins was significantly retarded by inclusion of methylcellulose and lichenan in gels but was not affected by CMC, laminarin, or soluble starch, whereas the CBMs showed very low affinity for the soluble fraction of oat spelt xylan (data for oat spelt xylan, laminarin, and soluble starch are not shown).
![]() View larger version (72K): [in a new window] |
FIG. 5. Affinity analysis of derivatives D1 (lane 1) and D2 (lane 2) of Cel5A with soluble polysaccharides by native affinity gel electrophoresis. Purified protein samples were separated in nondenaturing polyacrylamide gels containing 0.1% (wt/vol) soluble polysaccharides, including methylcellulose (B), lichenan (C), and CMC (D). A gel without polysaccharide was used as a noninteracting control for migration (A). Lane M contained bovine serum albumin as a control protein.
|
|
View this table: [in a new window] |
TABLE 2. Activities of rCel5A and its derivatives with various substratesa
|
![]() View larger version (71K): [in a new window] |
FIG. 6. TLC analysis of products from hydrolysis of cellooligosaccharides by the derivatives of Cel5A. Each cellooligosaccharide (G2 to G6; 1 µg) was incubated with the D2 derivative (0.1 U) for 6 h, and the hydrolysates were analyzed by TLC. Lane M contained authentic oligosaccharides. G1, glucose; G2, cellobiose; G3, cellotriose; G4, cellotetraose; G5, cellopentaose; G6, cellohexaose.
|
|
|
|---|
cazy/CAZY/index.html). On the other hand, CBMs are classified into 49 families on the basis of amino acid sequence similarities (http://afmb.cors-mrs.fr/
cazy/CAZY/index.html). It is thought that CBMs promote close proximity of enzymes to polysaccharides and assist the more rapid degradation of polysaccharides with their catalytic modules (2, 6). E. cellulosolvens 5 Cel5A is a modular enzyme consisting of an N-terminal signal peptide, two catalytic modules belonging to glycosyl hydrolase family 5, two novel CBMs, two linker sequences, and a C-terminal sequence with an unknown function (Fig. 1B). The two catalytic modules were classified into glycosyl hydrolase family 5 on the basis of amino acid sequence similarity (Fig. 2). It has been reported previously that the amino acid sequence of the CBM of E. cellulosolvens 5 CBPA did not exhibit significant homology with the amino acid sequences of other CBMs (23); the amino acid sequences of CBMs of E. cellulosolvens 5 Cel5A also did not exhibit significant homology to the amino acid sequences of other CBMs as determined by a BLAST search. These findings indicate that CBMs of both Cel5A and CBPA from E. cellulosolvens 5 are members of new types of CBM families.
To compare the enzymatic properties of the two catalytic modules (GHF5-1 and GHF5-2) of E. cellulosolvens 5 Cel5A, truncated derivatives (D2, D3, D4, D5, D5, and D6) of Cel5A were constructed by PCR, and their hydrolytic activities with various substrates and activities with cellooligosaccharides were examined. All of the derivatives that commonly contained one catalytic module of Cel5A hydrolyzed CMC, ASC, oat spelt xylan, and lichenan. rCel5A showed the same substrate specificity with these derivatives (Table 2). In addition, all of the derivatives produced cellobiose and cellotriose and a minor amount of glucose from cellotetraose and cellohexaose and produced only cellobiose and cellotriose from cellopentaose. rCel5A showed the same activity with cellooligosaccharides with these derivatives (Fig. 6). These results indicate that there are no differences in substrate specificity and activity with cellooligosaccharides between GHF5-1 and GHF5-2.
Moreover, to investigate the effect of the repetition of a catalytic module of Cel5A on enzymatic activities, the hydrolytic activities of rCel5A and its derivatives with various substrates were determined. The specific activity of rCel5A with ASC or lichenan was almost identical to the sum of the specific activities of D3 containing GHF5-1 and CBM-1 and D6 containing GHF5-2 and CBM-2. On the other hand, the specific activity of rCel5A with oat spelt xylan was about one-half the sum of the D3 and D6 specific activities (Table 2). This suggests that only one of the two catalytic modules of rCel5A is involved in the hydrolysis of xylan. One possible explanation for this result is that ß-1,4-linked hexoses present a planar three-dimensional structure, with 180° rotation between monomers, whereas xylan has a threefold helical structure, with approximately 120° rotation between every second monomer (1, 11). Therefore, it is possible that the substrate-binding groove of rCel5A has a completely complementary shape with a flat conformation of ß-1,4-linked hexoses but has an incomplete complementary shape with a threefold helix conformation of xylan. Furthermore, the results described above indicate that the duplication of a catalytic module in Cel5A does not result in synergistic activity between the catalytic modules. In general, it is known that aerobic microorganisms produce higher concentrations of the enzymes than anaerobic microorganisms produce (21). Therefore, E. cellulosolvens and other rumen microorganisms may have overcome the disadvantage mentioned above by using a strategy such as reiteration of the catalytic module in an enzyme and insertion of a different kind of catalytic module into an enzyme.
It is known that the removal of CBMs from cellulolytic enzymes reduces their catalytic activities with various substrates (2, 4). In this study, the hydrolytic specific activities of the derivatives containing a catalytic module and a CBM with insoluble polysaccharides (ASC, oat spelt xylan, and lichenan) and soluble polysaccharide (CMC) were evidently higher than those of the derivatives devoid of CBMs (Table 2). These results indicate that for Cel5A and its derivatives the hydrolytic activities with insoluble and soluble polysaccharides are remarkably reduced by removal of CBMs and that the CBMs of Cel5A play an important role in assisting hydrolysis of polysaccharides by the catalytic module. Additionally, CBMs of Cel5A showed no affinity for CMC (Fig. 5), whereas CBMs adjacent to catalytic modules were involved in reinforcing the hydrolytic activity of catalytic modules (Table 2). A likely assumption is that the carboxymethyl side chains of CMC interfere with the association with the substrate-binding sites of Cel5A and that CBMs adjacent to catalytic modules accommodate a flat CMC structure on the substrate-binding surface of Cel5A and facilitate an association between active sites and ligands.
Cel5A from E. cellulosolvens 5 contained two homologous family 5 catalytic modules. Other cellulases comprising multiple family 5 catalytic modules have been found mainly in rumen fungi (3, 8, 29). For example, Neocallimastix patriciarum CelD consists of three family 5 catalytic modules that exhibit a very high degree of amino acid identity (3, 29), and Piromyces equi Cel5A comprises four family 5 catalytic modules that are more than 99% identical to each other on the nucleic acid level (8). Recently, genes encoding cellulases with multiple family 5 catalytic modules have been found in the rumen protozoa Polyplastron multivesiculatum and Epidinium caudatum (22). Additionally, the region consisting of a CBM and a catalytic module is tandemly reiterated in the primary structure of Cel5A. The two catalytic modules (GHF5-1 and GHF5-2) of Cel5A exhibited 94% sequence identity. Based on these findings, it is assumed that Cel5A arose through duplication of an ancestral gene that originally encoded a single region consisting of one CBM and one catalytic module.
In this study, the cel5A gene from E. cellulosolvens 5 was expressed as a His6-tagged Cel5A protein in E. coli, and the properties of the recombinant Cel5A were examined. However, further work is needed to compare the characteristics of the recombinant Cel5A with those of native Cel5A from E. cellulosolvens 5.
This work was supported in part by Grant-in-Aid for Encouragement of Young Scientists 14760171 from the Ministry of Education, Science, and Culture of Japan.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»