| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Geeta Ram,1,
Adesh Kumar Saini,2
Reena Vohra,2
Krishan Kumar,1
Yogendra Singh,2 and
Anand Ranganathan1*
Recombinant Gene Products Group, International Centre for Genetic Engineering and Biotechnology, Aruna Asaf Ali Marg, New Delhi-110067, India,1 Institute of Genomics and Integrative Biology, Mall Road, Delhi-110007, India2
Received 20 October 2006/ Accepted 4 December 2006
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
On the other hand, criticism is sometimes levied on nonrational approaches because they furnish few well-folded and therefore useful proteins. Helpfully, this has fundamentally been addressed by Hecht and coworkers (21, 29). It follows from their binary-patterning concept that de novo proteins that lack substantial binary patterning of polar (
) and nonpolar () amino acids (
or



for beta-sheet and alpha-helix, respectively) would fail to form adequate secondary structure and therefore not be able to fold properly (35). A nonfolded protein is more amenable to in vivo proteolytic degradation and is therefore less useful (9, 34). Thus, de novo protein libraries wherein members display considerable binary patterning would be a good starting point in the search for drug-like candidates. Even so, there are instances where peptides produced from degenerate oligonucleotide-based libraries have been isolated and show strong binding to the target protein (4, 5, 7). However, the obtained entities in most cases are small (approximately 20 amino acids in length) and are tethered to a comparatively large scaffold. It has been hypothesized that these peptides may not display their original structural fold once removed from the scaffold and, as a result, become prone to proteolytic degradation. Furthermore, the binding activity may, in many instances, be partly due to the bulky scaffold itself (3, 53). In comparison, we have earlier shown that "codon shuffling" is a versatile method for generating de novo protein libraries that possess a considerable diversity of amino acid compositions as well as protein lengths (6, 35). We had also postulated that the constituent library members would theoretically possess adequate secondary structure elements, as they are formed from dicodons (DC) that have defined binary patterning (Fig. 1) (35). Protein libraries that are programmable, display good binary patterning, and show diversity in length and composition represent, we believe, a good pool from which potent candidates can be isolated. We now extend this approach towards the construction of programmable codon-shuffled libraries aimed at selecting de novo proteins that strongly bind and inhibit in vivo the functioning of HupB, an essential M. tuberculosis protein.
|
| MATERIALS AND METHODS |
|---|
|
|
|---|
Construction of pBT- and pTRG-derived plasmids pBTnn and pTRGnn.
A 587-bp-long DNA fragment from plasmid pBT was PCR amplified using the oligonucleotides 5'-AAGCGGCCGCGGGATCCTACGTATGAGAATTCTAGACCTCGAGTTAATTAATTAATTAAGATCT-3' and 5'-CACCATGGGCAAATATTATACGCAAGGCGAC-3' as the forward and reverse primers, respectively. The PCR product was cloned in the pBluescript SRF vector, the fragment excised using NcoI and NotI restriction enzymes and cloned in a pBT vector previously cut with the same two enzymes. The resulting plasmid was designated pBTnn. Plasmid pBTnn carries the SnaBI site into which DC fragment libraries can be inserted. The site immediately follows the end of the
cI gene and just precedes a stop codon. Expression of this plasmid would result in libraries of codon-shuffled proteins, all fused with
cI. For the creation of plasmid pTRGnn, a 580-bp BamHI fragment from the pBluescript SRF plasmid that contained the PCR product mentioned above was excised and cloned in the BamHI-cut pTRG. A clone that contained the BamHI fragment in the wrong orientation was specifically selected and digested with XhoI, and the vector was self-ligated to finally yield pTRGnn. Plasmid pTRGnn carries the SnaBI site into which the codon-shuffled libraries can be inserted. The site immediately follows the end of the gene encoding the
subunit of RNA polymerase (RNAP) and just precedes a stop codon. Expression of this plasmid would result in libraries of codon-shuffled proteins, all fused with the
subunit of RNA polymerase.
Construction of plasmid pGEX4T3nn.
The 580-bp BamHI fragment that was used for the construction of pTRGnn was cloned in the BamHI-cut pGEX4T3. A clone that contained the BamHI fragment in the wrong orientation was specifically selected and digested with XhoI, and the vector was self-ligated to finally yield pGEX4T3nn. Plasmid pGEX4T3nn carries the SnaBI site into which DC fragment libraries can be inserted. The site immediately follows the end of the glutathione S-transferase (GST) gene and just precedes a stop codon. Expression of this plasmid would result in libraries of codon-shuffled proteins, all fused with the GST protein.
Construction of plasmid pThiohisann.
A 945-bp XbaI fragment was excised from pBTnn and inserted in the XbaI-cut pThiohisA. A clone that contained the XbaI fragment in the right orientation was specifically selected and digested with BamHI, and the vector was self-ligated to finally yield pThiohisann. Plasmid pThiohisann carries the SnaBI site into which DC fragment libraries can be inserted. The site immediately follows the end of the trx gene and just precedes a stop codon. Expression of this plasmid would result in libraries of codon-shuffled proteins, all fused with the TRX protein.
Construction of plasmid series containing M. tuberculosis HupB and its derivatives. (i) pBTnnHupB and pTRGnnHupB.
The full-length hupB gene (Rv2986c) was PCR amplified from M. tuberculosis H37Rv genomic DNA using the oligonucleotides 5'-CCTACGTAATGAACAAAGCAGAGCTCATTGACGTG-3' and 5'-CCTACGTATTTGCGACCCCGCCGAGCGGTTGCC-3' as the forward and reverse primers, respectively. The 658-bp PCR product was isolated, purified, and subsequently cloned in pBluescript SRF. The resulting plasmid was digested with SnaBI, and the 648-bp-long M. tuberculosis hupB gene was inserted in frame in plasmids pBTnn and pTRGnn, which were earlier cut with SnaBI. Expression of the resulting plasmids, pBTnnHupB and pTRGnnHupB, in a suitable two-hybrid host results in
cI-HupB and RNAP
subunit-HupB fusion proteins.
(ii) HupBpGEX4T3nn.
The SnaBI-cut full-length hupB gene was inserted in plasmid pGEX4T3nn, which was earlier cut with the same restriction enzyme, to yield plasmid hupBpGEX4T3nn. Expression of this plasmid in E. coli resulted in a GST-HupB fusion protein (molecular mass [MW], 49.0 kDa; the MW of GST alone is 26.57 kDa).
(iii) pBTnnHup2 and pTRGnnHup2.
The DNA fragment corresponding to the C-terminal half of the M. tuberculosis HupB protein was PCR amplified using the oligonucleotides 5'-GGATACGTAGCTGTTAAGCGTGGTGTGGG-3' and 5'-CCTACGTATTTGCGACCCCGCCGAGCGGTTGCC-3' as the forward and reverse primers, respectively. The 358-bp-long PCR product was isolated, purified, and subsequently cloned in pBluescript SRF. Using procedures similar to those described for hupB, the SnaBI-flanked hup2 gene was inserted in plasmids pBTnn, pTRGnn, pGEX4T3nn, pThiohisann, and pMALc2x to generate hup2 gene fusions that would result in Hup2 fusion proteins with
cI, the RNAP
subunit, GST, thioredoxin (TRX), and maltose-binding protein (MBP), respectively.
Construction of codon-shuffled libraries.
The protocol for library generation has been described earlier (35) and was faithfully followed, except that in the present sets of experiments, the proportions of dicodons were skewed and not kept equimolar. The degrees of skewing were as follows.
Library-versus-library (LvL) experiments.
For library 1, KL and RT dicodons were added one, three, and five times in molar excess and DI and EL dicodons were removed from the initial mix. For library 2, DI and EL dicodons were added one, three, and five times in molar excess; KL and RT dicodons were removed from the initial mix.
HupB/Hup2 experiments.
DI and EL dicodons were added one and three times in molar excess; KL and RT dicodons were removed from the initial mix.
All libraries preferentially contained DC-fragment sizes larger than 250 bp (this corresponds to DC proteins at least 80 amino acids in length). Purified libraries were stored at 20°C until further use.
Two-hybrid assays (in vivo).
All in vivo protein-protein interaction studies were carried out using the Bacteriomatch two-hybrid system (13). The Bacteriomatch reporter strain, bearing the ampicillin resistance gene and the lacZ reporter gene, was cotransformed with equal amounts of bait (pBTnn-derived plasmids or libraries) and prey (pTRGnn-derived plasmids or libraries) DNA. Protein-protein interaction was detected by observing the level of transcriptional activation of lacZ reporter gene in various cotransformants of the reporter strain, when the latter was plated on X-Gal plates that were supplemented with appropriate antibiotics (chloramphenicol and tetracycline), 25 µM IPTG (isopropyl-ß-D-thiogalactoside), and ß-galactosidase inhibitor. Plasmids pBT-LGF2 (
cI-LGF2 fusion) and pTRG-Gal11p (RNAP
subunit-Gal11p fusion) provided the positive controls, whereas empty pBTnn and pTRGnn plasmids served as negative controls for the interaction studies. Routine protocols performed during the assay were according to the manufacturer's instructions.
Liquid ß-galactosidase assays.
Confirmation of the in vivo interactions detected on X-Gal plates was obtained by the measurement of beta-galactosidase (LacZ) activity in a given cotransformant strain. Colonies were grown at 30°C to mid-log phase in the presence of 25 µM IPTG. Cells were permeabilized with sodium dodecyl sulfate (SDS)-CHCl3 and assayed for ß-galactosidase activity essentially as described previously (27). Assays were performed twice, in duplicate, with different cotransformants on separate occasions, and the values reported are the means of these measurements.
Construction of plasmids LvL7+pthiohisann and LvL7pthiohisann.
The full-length clone 7 gene of the positively charged library (LvL7+) (318 bp) and the full-length clone 7 gene of the negatively charged library (LvL7) (360 bp) were PCR amplified from their pBTnn and pTRGnn clones, respectively, using the 5'-phosphorylated oligonucleotide 5'-AGCGGCCGCCACGTGTTTAAA-3', which served as both a forward and reverse primer. The PCR products were eluted using DEAE membrane and inserted into the SnaBI-cut pThiohisann to yield plasmids LvL7+pthiohisann and LvL7pthiohisann. Expression of these plasmids resulted in the production of TRX-LvL fusion proteins.
Expression and purification of TRX-LvL7+ and TRX-LvL7 proteins.
Liquid cultures of E. coli DH5
strains containing plasmids LvL7+pthiohisann and LvL7pthiohisann were independently grown to mid-log phase (optical density at 600 nm [OD600], 0.4 to 0.6) and then induced with 0.5 mM IPTG. The induction was continued for a further 4 h at 37°C, after which the cells were pelleted, washed, and resuspended in buffer A (10 mM Tris-HCl, pH 8.0, 1 mM EDTA, 0.8% NaCl). Postlysis and removal of cell debris, the supernatant was loaded onto a Ni-nitrilotriacetic acid column that had previously been equilibrated with buffer A. As both proteins are TRX fusions, they are expected to bind the nickel column by virtue of the thioredoxin His patch. Subsequently, the proteins were eluted with buffer A containing 250 mM imidazole. Protein fractions were visualized by 15% SDS-polyacrylamide gel electrophoresis (PAGE) (the MW of TRX-LvL7+ and TRX-LvL7 are 24.33 kDa and 26.03 kDa), and those containing purified proteins were pooled and dialyzed against 10 mM Tris-HCl, pH 8.0. The proteins were concentrated using YM-5 concentrators and utilized for circular dichroism (CD), matrix-assisted laser desorption ionization (MALDI), and in vitro analysis experiments.
In vitro ion-exchange pull-down assay.
The pull-down assay was conducted using two different approaches. In the first approach, the purified proteins TRX-LvL7+ and TRX-LvL7 were taken in equal amounts (10 µg; phosphate buffer, pH 7.0) and mixed for 1 h at 4°C. The protein mixture was added to 100 µl of SP-Sepharose beads and the mix further incubated for 1 h. The beads were washed extensively with 50 to 150 mM NaCl (phosphate buffer, pH 7.0) and the proteins eluted with 500 mM NaCl. In the second approach, 10 µg of TRX-LvL7+ was bound onto 100 µl of SP beads and the mix incubated for 1 h at 4°C. To this was added 10 µg of TRX-LvL7 protein, and the mixture was further incubated with mild shaking for 1 h at 4°C. The protein recovery procedure was identical to the one used in the first approach. Purified TRX protein and unbound SP beads acted as controls in the experiments mentioned above. The proteins were visualized using Coomassie blue on 15% SDS-PAGE gels.
Expression and purification of GST-C4 protein.
The protein GST-C4, as well as GST alone, was found to be preferentially expressed in the soluble form. GST-C4 and GST were routinely purified using glutathione-Sepharose 4B beads. The beads were washed extensively with buffer B (10 mM Tris-HCl, pH 8.0, 14 mM ß-mercaptoethanol, 0.1% Tween 20, 20% [vol/vol] ethanol, 0.5 M NaCl), and the proteins eluted with buffer C (10 mM reduced glutathione, 50 mM Tris-HCl, pH 7.5). Purified fractions of eluted proteins were analyzed by 15% SDS-PAGE and subsequently pooled and dialyzed.
In vitro protein interaction assay.
The in vitro protocol was essentially as described earlier (36), with some variations. Briefly, the purified GST-C4 protein (10 µg) or the GST protein alone (5 µg) was incubated with 45 µg of glutathione-Sepharose 4B beads at 4°C for 1 h, followed by repeated washing and blocking with 1% (wt/vol) polyvinylpyrrolidone at 4°C for 1 h. Subsequently, the beads were incubated with purified recombinant Hup2 protein (10 µg) at 4°C for 4 h. Finally, after extensive washings, all bound protein was eluted by boiling the beads in 50 µl of 2x SDS loading buffer (20% [vol/vol] glycerol, 4% SDS, 0.125 M Tris-HCl, pH 6.8, 4% [vol/vol] ß-mercaptoethanol) and resolved by 15% SDS-PAGE for subsequent visualization by Coomassie staining. Various controls (e.g., beads alone or GST plus Hup2) for the pull-down assay were also subjected to the same treatment.
Construction of plasmid pSD5GFP.
The full-length green fluorescent protein (GFP) gene was PCR amplified from the commercially available vector pEGFPn1 (Amersham) using the oligonucleotides 5'-TCCACCCTGCAGTACTATGGTGAGCAAG-3' and 5'-AGAGTCGCGACGCGTTTACTTGTACAG-3' as the forward and reverse primers, respectively. The 750-bp-long PCR product was isolated, purified and subsequently cloned in pBluescript SRF. From this plasmid, the GFP gene was excised using PstI and MluI and cloned in plasmid pSD5, which was earlier cut with the same two restriction enzymes. The resulting plasmid carries a unique ScaI site just upstream of the GFP initiation codon. The insertion of genes into this site, followed by expression of the resulting plasmids, yields GFP fusion proteins.
Construction of pSD5-C and pSD5GFP-C series plasmids.
Full-length genes corresponding to the "C series" were PCR amplified from their pBTnn or pTRGnn clones, using the 5'-phosphorylated oligonucleotide 5'-AGCGGCCGCCACGTGTTTAAA-3', which served as both a forward and reverse primer. The PCR products were eluted using the DEAE membrane and inserted in ScaI-cut pSD5, as well as ScaI-cut pSD5GFP, to yield plasmids ready for expression in Mycobacterium smegmatis. All inserts reside upstream of the GFP gene in pSD5GFP.
Growth analysis and colony count of mycobacteria.
Mycobacterium smegmatis strain LR222 (28) was used throughout the course of this study. Mycobacterial transformation procedures were as described earlier (19). Five-milliliter starter cultures of M. smegmatis strains with pSD5 as well as pSD5C4 were prepared from frozen stock in Middlebrook 7H9 medium (Difco Laboratories) supplemented with 1% (vol/vol) ADC medium (5% albumin, 2% dextrose) and 0.05% Tween 80. Cultures were incubated at 37°C with aeration for 3 days, or until the stationary phase of growth was achieved. These were used as seed cultures for subsequent investigations. For growth curve analysis, 100-ml secondary cultures were seeded with 1% of primary cultures and incubated at 37°C. Periodically, aliquots were removed over a period of 50 h and their OD600 measured. For CFU analysis, the 100-ml secondary cultures were grown at 37°C until mid-log phase (OD600 of 0.9) was reached. At this point, the growing cultures were shifted to a 10°C water bath shaker (180 rpm). Subsequently, 10-µl aliquots were removed and made up to 100 µl with liquid media. Fifty microliters of this culture was plated onto agar plates containing kanamycin and the plates incubated for 3 days at 37°C. Finally, the number of colonies that appeared for each set (the total number of days was 10) was plotted as a percent increase over the number present on day 0.
RNA extraction.
Fifty-milliliter cultures of wild-type and recombinant strains of M. smegmatis were grown at 37°C. Cell pellets were rinsed with ice-cold diethyl pyrocarbonate-treated sterile double-distilled water and resuspended in 400 µl of TES [N-tris(hydroxymethyl)methyl-2-aminoethanesulfonic acid] buffer. To the tubes was added 400 µl water saturated with phenol-chloroform/isoamylalcohol and the contents mixed vigorously. Glass beads were added for efficient cell disruption. The tubes were incubated at 65°C for 1 h with intermittent mixing, following which the tubes were incubated on ice for 2 min and then centrifuged. Aqueous phase was transferred to fresh tubes and the RNA was precipitated with 3 M sodium acetate (pH 5.2, 0.1 volumes) and chilled ethanol (2 volumes). After incubation at 20°C for 30 min, RNA samples were pelleted (10,000 rpm, 4°C, 15 min), washed with 70% ethanol, dried, and resuspended in 20 µl of diethyl pyrocarbonate-treated sterile water. RNA was checked for quality and quantity spectrophotometrically as well as by denaturing agarose gel electrophoresis.
Blotting.
Total RNA (20 µg) was electrophoresed on 1% denaturing agarose-formaldehyde gel, photographed to check for comparable levels of both samples, and then transferred to nylon membrane (Hybond N+; Amersham), following which the membrane was exposed to UV light for cross-linking. Transfer was checked with methylene blue. Linear DNA probe was generated using M. smegmatis hupB-specific oligonucleotides (5'-CCCTACGTAATGAACAAAGCGGAGCTCATCGAC-3' and 5'-CCCTACGTACCTGCGGCCCTTCTTGGCCGG-3' as forward and reverse primers, respectively) and labeled using a nick translation system (Invitrogen) according to the manufacturer's specifications. After denaturation, this probe was added to the prehybridization buffer A (6x SSC [1x SSC is 0.15 M sodium chloride plus 0.015 M sodium citrate], 50% formamide, 0.5% SDS, 100 µg/ml salmon sperm DNA, 5x Denhardt's reagent) and incubated at 65°C for 3 to 6 h with the membrane. The membrane was subsequently washed first with buffer B (2x SSC, 2% SDS) at room temperature for 15 min and then with buffer C (0.5x SSC, 2% SDS) at 65°C for 45 min. The membrane was finally washed with buffer D (0.2x SSC) to remove excess SDS, salts, and free radioactivity. Intensity was measured by a PhosphorImager (Amersham Biosiciences). The profile seen was reproducible.
Circular dichroism studies.
CD spectra were recorded on a JASCO model J-810 spectropolarimeter. Data were collected at room temperature (25°C) with a pathlength of 1 mm. The concentrations of proteins (suspended in 1x phosphate-buffered saline, pH 8.0) were 12.5 µM for TRX-LvL7+ and 10.0 µM for TRX-LvL7.
TEM studies.
Fresh colonies of M. smegmatis LR222 transformed with expression plasmids pSD5 and pSD5C4 were cultured until mid-log phase. Five milliliters of each culture was centrifuged and the pellet washed once with buffer A (10 mM phosphate-buffered saline, pH 7.4). For cell fixing, the cell pellet was fixed using 2.5% glutaraldehyde and 2% paraformaldehyde on ice for 45 min and subsequently washed with buffer A. Pellets were postfixed with 1% buffered osmium tetroxide (1 h, room temperature), followed by dehydration against a graded series of acetone (30 to 100%). The postfixed samples were infiltrated with resin and toluene and were finally embedded in LR white resin (TAAB, United Kingdom). Polymerization of resin was done at 60°C for 12 h. For transmission electron microscopic (TEM) analysis, ultrathin sections (
70 nm) of each embedded sample were cut and lifted on copper grids followed by staining with 4% uranyl acetate and 0.4% lead acetate for 5 min. The sections were examined and photographed using a FEI-PHILIPS MORGAGMI.268D digital transmission electron microscope.
Protein analysis.
SDS-PAGE analysis was carried out on 15% gels by using a method described previously (24). All protein samples were treated with mercaptoethanol before they were loaded onto SDS-PAGE gels. Multiple amino acid sequence alignment was carried out using the CLUSTALW program (available at http://www.ebi.ac.uk). Theoretical modeling of HupB and C4 proteins was carried out using the 3DPSSM and PHYRE algorithms (available at http://www.sbg.bio.ic.ac.uk/, courtesy of Structural Bioinformatics Group, Imperial College, London, United Kingdom).
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
For the choice of M. tuberculosis protein target, we drew our attention to the recent seminal contributions of Rubin and coworkers in the area of identification of essential M. tuberculosis proteins (42). Using an incisively elegant transposon-based methodology called TRASH, Rubin and coworkers have drawn up lists of proteins that are essential to M. tuberculosis, albeit the term "essential" is used in an assortment of contexts, ranging from general growth of the pathogen to its growth inside a macrophage, etc. (38, 43, 44). Based on an inspection of cutoff ratios (which characterize the degree of essentiality) and isoelectric points (pI) of the essential proteins, we narrowed down our choice to the protein HupB (also called Hlp, or histone-like protein). This protein has a low cutoff ratio (an indication of a high probability of essentiality) and a very high pI of 12.5 (a good bearing for instigation of a charge-based interaction). Moreover, the behavior and properties of mycobacterial HupB have been extensively characterized (26, 33). HupB has been shown to play an important role in cellular adhesion of mycobacteria, an event that is crucial for gaining entry into macrophages and other host cells (48). Additionally, HupB has been shown to be hyperinduced during anaerobically induced dormancy as well as under cold shock stress (25). Under normal mycobacterial growth conditions, HupB is preferentially expressed during the exponential phase of growth (46). Knockout of the hupB gene in mycobacteria results in an inability to withstand cold shock stress, as well as the inability to bind to Schwann cells (25, 48). It is also hypothesized that HupB binds to DNA and plays a crucial role in the regulation of cellular growth (26, 46). The primary sequence characteristics of HupB also merit a brief mention. It is a reasonably sized protein (MW, 22 kDa) of strikingly different N- and C-terminal halves. While the N-terminal half of the protein has a more or less neutral charge and shows strong homology to histone-like proteins of other organisms like E. coli and Streptomyces coelicolor, the C-terminal half is highly positively charged and shows homology only to proteins of only mycobacteria, for which the sequence identity is upwards of 80% (90% with the M. smegmatis histone-like protein) (Fig. 2A to C). There is as yet no structural information available for HupB. Nonetheless, three-dimensional models obtained through threading bring out clearly the degree of dissimilarity between the two halves of HupB (Fig. 2D). Therefore, de novo protein binders that target the C-terminal half of this protein could also bind to similar proteins across the mycobacterial spectrum. Given that our target protein HupB is very positively charged, one could envisage the construction of a very negatively charged library for the two-hybrid experiments. However, selection of binders through a charge-based interaction has not been previously reported. As a proof of principle, we designed an experiment to test this very concept.
|
cI repressor protein and the
subunit of RNA polymerase, respectively, or vice versa. Any worthwhile interaction between the two accelerates transcription of a marker gene, ultimately resulting in the selection of cells wherein the interaction is occurring. The extent of interaction can be scored by a variety of means, most commonly through the appearance of a blue colony when the marker gene is lacZ. First, we altered the expression plasmids pBT and pTRG, which carry the
cI and RNAP
subunit-encoding genes, so that our codon-shuffled libraries could be inserted downstream and in frame to the genes mentioned above. This resulted in
cI-fused and RNAP
subunit-fused protein libraries (see Materials and Methods). We then created two sets of highly charged libraries. In one set, the proportions of DI and EL dicodons were increased by three and five times with respect to the other dicodons, while at the same time, the KL and RT dicodons were completely removed from the pool. Correspondingly, in the other set, the proportions of KL and RT were increased by three and five times with respect to the other dicodons, while DI and EL were completely absent. Using the codon-shuffling method described previously (35), we obtained a total library size of approximately 105 CFU in both cases. The oppositely charged libraries were cotransformed in the two-hybrid reporter strain. We found fairly equal prevalences of blue and white colonies. As mentioned earlier, a blue colony is indicative of an interaction having occurred, while a white colony indicates the absence of interaction. We isolated the pTRG and pBT plasmids from one representative colony (LvL7) that displayed the same intensity of blue as the positive control (Fig. 3A, colony streak 5). The isolated plasmids were used to again cotransform the reporter strain to rule out the possibility of a false positive. DNA sequencing of the plasmids showed incorporation of the codon-shuffled fragments downstream of the
cI and RNAP
subunit-encoding genes, respectively. The corresponding protein sequence, along with primary protein characteristics, is shown in Fig. 4. The proteins, labeled LvL7+ and LvL7, carry opposite charges, as reflected by their extreme pIs. Both proteins could routinely be overexpressed and purified as thioredoxin fusion partners (Fig. 5A and B). MALDI-time-of-flight analysis of the proteins confirmed their molecular masses with a deviation of 0.001% from the expected values while circular-dichroism data on the two proteins indicated good secondary structure formation, with one protein (LvL7) calculated to hold almost 40% beta-sheet content (results not shown). The interaction between LvL7+ and LvL7 was further confirmed by an ion-exchange pull-down assay. Protein LvL7+ bound strongly to the negatively charged SP matrix and was seen to pull down protein LvL7 (Fig. 5C). On the other hand, we also isolated the pBT and pTRG plasmids from a representative white colony. Again, DNA sequencing showed both plasmids to be carrying codon-shuffled fragments; the predicted protein sequences showed the bait and target to be highly charged (one negatively and the other positively) (Fig. 4), yet the two proteins (LvL3+ and LvL3) were not interacting in vivo. This represents a direct confirmation of the hypothesis that while a charge-based interaction allows two proteins to come together, the strength and stability of the interaction depends on electronic-steering, induced-fit, and surface hydrophobic types of interactions between the two proteins. Indeed, X-ray or nuclear magnetic resonance-based structural studies may shed more light on what allows LvL7 and not the LvL3 proteins to interact. Meanwhile, for the present objective of isolating binders against M. tuberculosis HupB, the results of the LvL experiments aptly illustrate the judgment of choosing protein partners on a charge-based selection, given the extremely high charge of the target.
|
|
|
cI and RNAP
subunit-encoding genes in the two-hybrid pBT and pTRG plasmids, respectively. Separately, the hupB gene was cloned as a GST fusion to check for expression of the protein in E. coli (Fig. 6). It was previously mentioned that not only the entire HupB protein but also the C-terminal half, which shows exclusive homology to only mycobacterial proteins, represents an attractive target. In light of this, we also created a truncated version of the hupB gene, coding for just the C-terminal half, and fused it with the
cI and RNAP
subunit-encoding genes of pBT and pTRG. The C-terminal protein, termed Hup2, was also seen to be expressed in E. coli as a GST fusion protein (Fig. 6). The two-hybrid plasmids carrying the two targets, hupB and hup2, were cotransformed with a range of negatively charged libraries wherein the KL and RT dicodons were removed and the proportions of DI and EL were gradually skewed, from one to five times with respect to the other 10 dicodons. In all cases, the library sizes ranged from 104 (when normal competent cells were used) to 106 (when electrocompetent cells were used) CFU. We were able to isolate three separate intensely blue-colored colonies, C4, C5, and C6 (C4 in HupB experiments and the other two in Hup2 experiments). The pBT and pTRG plasmids were isolated from each colony and used to freshly cotransform the reporter strain to rule out false positives. Additionally, the codon-shuffled DNA fragments were PCR amplified from these plasmids and cloned afresh into pBT and pTRG plasmids and experiments repeated. All three clones reproducibly gave blue colonies (Fig. 3A). The amino acid sequence and primary characteristics of the binders are shown in Fig. 4. All proteins show a very high negative charge but nonetheless share little homology with each other. Consensus secondary structure predictions on the three binders mirror their sequence dissimilarity by showing differing compositions of alpha-helical and beta-sheet folds. We then carried out a beta-galactosidase liquid assay to determine the strength of interactions in vivo for the obtained binders (Fig. 3B). As the results indicate, all three show strong interactions compared to the positive control (the positive control proteins Gal11p and LGF-2 display very strong interactions, with a Kd of 107 M) (1). Lastly, confirmation of the binder-target interaction was further confirmed via an in vitro GST pull-down assay, wherein GST-C4 was shown to pull down the recombinant Hup2 protein (Fig. 7). Despite our best efforts, we found that the purified GST-C4 protein underwent some amount of degradation, which resulted in the presence of cleaved GST along with GST-C4. However, because of the design of the in vitro pull-down assay, the presence of cleaved GST did not affect the outcome of the experiment, as it was shown that purified recombinant Hup2 protein does not bind GST but only GST-C4.
|
|
Effect of protein binder C4 on mycobacterial growth and colony count.
To explore the effects of de novo protein binders on mycobacteria, we chose to base our studies on the nonpathogenic Mycobacterium smegmatis. The availability of the complete genome sequences of both M. tuberculosis and M. smegmatis has pointed to a significant homology between the two proteomes (8). For example, M. smegmatis HupB is almost 90% identical to the M. tuberculosis HupB (Fig. 2A). Further, the M. smegmatis HupB has been extensively studied and characterized (25, 46). Consequently, the gene corresponding to one of the protein binders (C4) was cloned in a mycobacterial shuttle vector pSD5 (kind gift of Anil Tyagi). Plasmid pSD5 contains a strong heat shock protein 60 promoter element of mycobacterial origin that helps in the overexpression of foreign and mycobacterial genes in M. smegmatis (11). Figure 8A shows the growth curve of M. smegmatis/pSD5C4, with M. smegmatis/pSD5 and M. smegmatis/pSD5GFP strains serving as controls. As can be seen, the reduction in growth is considerable in the exponential phase with it petering out at the stationary-phase level. Interestingly, this is very consistent with the timing of HupB expression in mycobacteria. Previous reports have indicated that HupB is expressed chiefly in the exponential phase of mycobacterial growth (46). While no concrete reason has been attributed to this phenomenon, it has been speculated that the possible role played by HupB in growth regulation is the cause (25, 46). It is tempting to attribute the observed reduction in the growth of M. smegmatis/pSD5C4 to the in vivo binding of C4 to M. smegmatis HupB, especially as the reduction in growth goes on to result in a reduction in M. smegmatis colony count (Fig. 8B). This last observation unquestionably relates the expression of C4 with the altered behavior of M. smegmatis growth.
|
One of the functions attributed to the HupB protein is that it facilitates the maintenance of mycobacteria under cold shock stress (46). We decided to investigate whether the de novo protein binder, by virtue of its interaction with M. smegmatis HupB, is able to reduce the stress tolerance. Exponentially growing cultures of M. smegmatis/pSD5C4 as well as M. smegmatis/pSD5 were subjected to the cold shock stress (see Materials and Methods) and their growth monitored over a period of time (Fig. 8C). Herein, we altered the design of our experiment in contrast to an earlier study that simply measured CFU at two different temperatures, the favorable temperature of 37°C and the cold shock temperature of 10°C (46). To ascertain the viability of mycobacteria poststress, we periodically removed cells growing at 10°C and plated them at the favorable temperature (of 37°C), as this would demonstrate the growth viability of cells once they had been subjected to a cold shock (said to mirror the anaerobically induced dormancy commonly witnessed for mycobacteria). As can be seen from the figure, we found a noteworthy decrease in the colony counts of the recombinant M. smegmatis strain compared to the wild type, hinting at the possibility that M. smegmatis HupB was being directly targeted by the protein binder.
Lastly, we also carried out transmission electron microscopy of recombinant and wild-type M. smegmatis in order to investigate whether the expression of protein binders results in differences of a physical nature, e.g., in cell morphology. The results indicate noticeable differences in cell morphology between the wild-type and recombinant strains, with the latter displaying unequal distribution of daughter cells across the septum (Fig. 9). This is indicative of an aberration of some cell division functions (2). Interestingly, TEM analysis of many M. tuberculosis and E. coli strains with proteins normally associated with cell division, elongation, and septum formation (e.g., FtsZ, PknF, and PknA) knocked out shows similar cell morphologies, thereby further underlining the role of HupB in the regulation of cell division (12, 54).
|
|
| CONCLUSIONS |
|---|
|
|
|---|
These findings, we believe, have laid the groundwork for attacking other essential M. tuberculosis targets mentioned in the many lists described by Rubin and coworkers. This approach would entail an inspection of target protein properties (pI, MW, and other secondary structure characteristics), followed by the programming of de novo libraries to have properties that are complementary to those of the target protein. Imminently, the 200-odd culture-filtrate proteins of M. tuberculosis appear as an attractive pool for target selection. This is because the generated protein binders would bind to the target extracellularly, as opposed to them having to penetrate the M. tuberculosis cell wall to carry out their action (as would have to happen in the case of HupB, a cytoplasmic protein), although recent developments point to some peptide tags being capable of transporting our protein binders to their eventual intracellular destination (14). On the other hand, one may envisage a scenario where a particular target gene is not picked, rather a library of in-frame M. tuberculosis gene fragments is hit against variably programmed codon-shuffled protein libraries, resulting not only in the isolation of protein binders but also in scores of targets with which the former bind strongly. In other words, the binders as well as the targets would be simultaneously selected; follow-up experiments could then be performed on particular targets from this pool, based on their importance or essentiality. Such experiments are currently under way in our laboratory.
The absence of a reliable vaccine and new frontline drugs, coupled with increasing prevalence of multiple-drug-resistant M. tuberculosis strains, demands urgent attention. Rational, part-rational, or nonrational approaches towards the design of de novo proteins that target M. tuberculosis proteins (one at a time or indiscriminately) can, we believe, earnestly augment the conventional natural/semisynthetic small-molecule-based antituberculosis drug discovery efforts, even though matters of protein delivery, creation of peptidomimetics, and structure-based small-molecule design (all issues that would follow the initial protein/peptide leads) still remain largely at an exploratory stage (30). It is hoped that buildup of a collection of protein binders that show potent in vivo inhibitory activity would encourage further research in the areas mentioned above.
| ACKNOWLEDGMENTS |
|---|
This work was funded by internal grants of ICGEB and IGIB. A. Rao, G. Ram, and K. Kumar thank CSIR, India, for financial support.
| FOOTNOTES |
|---|
Published ahead of print on 22 December 2006. ![]()
Both authors contributed equally to this work. ![]()
| REFERENCES |
|---|
|
|
|---|