Previous Article | Next Article ![]()
Applied and Environmental Microbiology, January 2007, p. 380-389, Vol. 73, No. 2
0099-2240/07/$08.00+0 doi:10.1128/AEM.01785-06
Copyright © 2007, American Society for Microbiology. All Rights Reserved.
,
Department of Civil and Environmental Engineering,1 Center for Microbial Ecology,2 Department of Crop and Soil Sciences,3 National Food Safety and Toxicology Center, Michigan State University, East Lansing, Michigan 48824,4 Department of Chemical Engineering, University of Michigan, Ann Arbor, Michigan 481095
Received 27 July 2006/ Accepted 4 November 2006
|
|
|---|
|
|
|---|
Target preparation and the resulting lengths, sequences,
double- versus single-stranded compositions, positions of fluorescent
labels, and secondary structures of the dangling ends and
surface-proximal tails of targets influence the stability of duplex
formation and the resulting signal intensity
(25,
31,
44). The targets may also
interact with other nontarget DNA sequences present in the background,
previously referred to as "hitchhiking"
(23). This is especially
true for 16S rRNA gene-based diagnostic microarrays, where
dangling ends and surface-proximal tails will generally represent
conserved regions and may have greater potential to interact with the
background DNA or other targets. Because the background will be sample
specific, comparative evaluation of two samples on the same microarray
using target mixtures that interact with each other may pose problems.
Low reproducibility and false-positive and false-negative signals with
16S rRNA gene-based arrays are partly accredited to bias caused by the
characteristics of target molecules
(8,
17), and
target-background interaction may be one of the main reasons for these
phenomena. Poor statistical relationships between experimental and
predicted signal intensities (from Gibbs free energy calculations)
suggest that thermodynamic parameters between target-and-probe duplexes
are not fully understood
(26). Thus, an evaluation
of the influence of dangling ends and surface-proximal tails on signal
intensity and target-background interaction is needed in order to
interpret the signal intensities from 16S rRNA gene-based diagnostic
and microbial community analysis arrays. The interaction among probe,
target, and background can be represented as Pn
Tn
Bn, where P represents probe, T
represents target, B represents background, and n represents the length
of the corresponding DNA sequence. A given background DNA sequence may
also interact with the probe sequence (Pn
Bn). However, this interaction was not
considered significant in this study because signal due to background
DNA when hybridized alone was absent for the targeted probes.
The objective of this study was to systematically examine the impact of dangling ends and surface-proximal tails of targets on hybridization signal intensity and to evaluate target-background interaction by (i) comparing products of unknown length generated by random primer labeling with targets of known length and (ii) examining interactions between well-defined targets and backgrounds. This study underlines the need for careful target preparation and evaluation of signal intensities for diagnostic arrays using 16S rRNA and other gene targets due to the potential for target interaction with a complex background.
|
|
|---|
Preparation of target DNA.
The following fivedifferent types of targets were prepared: (i) a mixture of
Klenow/random primer-labeled (with Cy5) targets prepared from 16S rRNA
genes of 10 pathogens (TK-m); (ii) a mixture of Cy3
end-labeled 45-mer synthetic targets corresponding to 95 probes for the
10 pathogens (T45-m); (iii) 1 individual Cy3 end-labeled
synthetic target of a length of 102 bases (TD) to examine
dangling end length; (iv) 4 additional Cy3 end-labeled synthetic
targets of lengths of 47, 60, 80, and 106 bases (Tn, where n
is the length of the target) to study the effect of increasing the
length of the dangling end; and (v) 11 individual Cy3 end-labeled
synthetic targets of lengths of 89 to 102 bases (TS), which
were used to study the effect of sequence similarity in the
surface-proximal tails of targets. The sequences of dangling ends and
surface-proximal tails for all the synthetic targets (T45-m,
TD, Tn, and TS) matched the 16S rRNA
gene sequences of the corresponding pathogens. The approach used to
prepare these targets is summarized
below.
Klenow/random primer-labeled target mixture of 16S rRNA genes of pathogens (TK-m).
Genomic
DNA from 10 bacterial pathogens (Table
1) was used as the source of 16S rRNA genes. Escherichia coli,
Legionella pneumophila, Pseudomonas aeruginosa,
Salmonella enterica subsp. arizonae, S.
enterica serovar Typhimurium, and Yersinia
enterocolitica type strains were obtained from the American Type
Culture Collection (ATCC, Manassas, VA) and grown according to the
protocol provided. For Helicobacter pylori and Clostridium
perfringens, only genomic DNA was obtained from the ATCC.
Enterococcus faecalis and Campylobacter
jejuni were kindly provided by Joan B. Rose and Vincent Young
(both at Michigan State University), respectively. DNA from pure
cultures and the environmental sample was extracted using a Promega
Wizard DNA extraction kit (Promega, Madison, WI).
|
View this table: [in a new window] |
TABLE 1. Identities
of pathogens, numbers of probes, and Gibbs free energy ranges for the
set of probes
|
The amplified 16S rRNA gene (250 ng) from each pathogen was individually labeled with Cy5 by use of a Bioprime DNA labeling kit (Invitrogen, San Diego, CA). Briefly, the protocol included a 90-min incubation of the amplicon with Klenow polymerase and 5:1 amino-allyl-dUTP:dTTP (Ambion, Austin, TX) followed by Cy5 labeling. All amino-allyl-dUTP-labeled products were cleaned using a QIAGEN PCR clean-up kit with modified phosphate wash buffer (5 mM K2HPO4, pH 8.0, 80% ethyl alcohol) and phosphate elution buffer (4 mM K2HPO4, pH 8.5). Cyanine dye was attached by incubating 3 to 5 µg of amino-allyl-dUTP-labeled DNA for 1 h in a 50:50 mixture of 0.1 M sodium carbonate buffer (pH 9.3) and N-hydroxysuccinimide ester Cy dye (prepared in fresh dimethyl sulfoxide). DNA product from dye coupling was cleaned using a QIAGEN PCR clean-up kit. A mixture of Cy5-labeled 16S rRNA genes was then prepared by mixing 10 pmol (157 to 357 ng DNA) of labeled product from each of the 10 pathogens. Klenow/random primer-labeled targets were estimated to be between 200 to 1,500 bp (as assessed by gel electrophoresis). All Klenow-labeled targets are expected to have multiple Cy5 labels. The specific activity of Cy dye was calculated by dividing the measured pmol of nucleotides by the measured pmol of Cy3/Cy5 dye for each target.
Mixture of 45-mer synthetic targets (T45-m).
A mixture of synthetic
targets was designed and synthesized to complement a set of 95 probes
(18-mer) in the middle region of the 45-mer targets (Fig.
1a). Each 45-mer synthetic target had a 14-nucleotide
surface-proximal tail at the 5' terminus and a 13-nucleotide
dangling end at the 3' terminus. The sequences of both the
overhanging ends matched the 16S rRNA gene sequence of the
corresponding pathogen. The single-stranded mixture of sequences was
synthesized at the University of Michigan by use of an in situ
oligonucleotide synthesis technology
(11,
12), end labeled with Cy3
at the 5' terminus, and harvested from the solid
substrate.
![]() View larger version (23K): [in a new window] |
FIG. 1. Klenow/random
primer-labeled target mixture versus end-labeled 45-mer synthetic
target mixture. (a) Experimental strategy. (b) Signal-to-noise ratio of
synthetic targets T45-m ( ) and Klenow/random
primer-labeled targets TK-m ( ) at 26°C. The
average signals for three replicate experiments are plotted, and the
error bars represent standard deviations. (c) Ratio of the signal from
hybridizations with target alone to the signal from targets spiked in
background DNA for T45-m ( ) and TK-m
( ). The x axis represents the factor by which the S/N
of the target mixture (T45-m or TK-m) is
influenced by the background, B16S, and the y axis
shows the frequencies of probes with a given influence factor. For a
given probe, the influence factor was computed as the S/N in the
absence of B16S divided by the S/N in the presence of
B16S. The signal shown in panel c is the S/N of the area
under the dissociation curve to include variation in both initial
signal intensity and
dissociation.
|
![]() View larger version (28K): [in a new window] |
FIG. 2. Synthetic
targets of increasing dangling end length. (a) Experimental strategy.
(b) The signal-to-noise ratios of BD and TD at
26°C (y axis) are shown for each of nine separate
probes offset by one nucleotide relative to the target, in order to
generate the increasing dangling end overhang of TD
(x axis). Clear bars represent S/N of TD, and dark
bars are S/N of BD. The signal of BD is observed
when the target dangling end overhang has a stretch of continuous
homology greater than 7 bases. The average signals for three replicate
experiments are plotted, and the error bars represent standard
deviations.
|
Synthetic targets (TS) with 10 to 100% sequence similarity at the surface-proximal tail.
The effect of the
sequence similarity of the surface-proximal tail of the target to a
defined background DNA sequence was studied using 11 different targets
(TS) of similar lengths (89 to 102 bases). Each
TS was synthesized and singly end labeled with Cy3 at the
5' terminus by IDT (Coralville, IA). All the targets had 18
bases complementary to 52 probes on the microarray (Fig.
3a), a dangling end of 1 to 16 bases, and a surface-proximal tail of 67 to
76 bases depending on the probe and target. TS were designed
to have 10 to 100% sequence similarity to a single background sequence
(BS [described
below]).
![]() View larger version (55K): [in a new window] |
FIG. 3. Synthetic
targets with surface-proximal tail sequences complementary to
background DNA. (a) Experimental strategy. (b) Mismatches, insertions,
and deletions influencing Gibbs free energy between the BS
and 11 different TS, resulting in a Gibbs free energy for
TS BS duplex formation that ranged
between 9.2 and 76.8 kcal/mol. (c) S/N at
26°C of TS and BS (y axis)
versus Gibbs free energy (kcal/mol) between Ts and BS
(x axis). Clear bars represent S/N of TS, and dark
bars are S/N of BS. Only 7 out of 11 results are displayed
because the BS did not interact with other TS.
The average signals for three replicate experiments are plotted, and
the error bars indicate standard
deviations.
|
|
View this table: [in a new window] |
TABLE 2. Characteristics
of various DNA sequences used to study the effect of dangling ends and
surface-proximal tails on signal
intensity
|
Synthetic background DNA of defined length.
Two 59-mer
synthetic defined background sequences (BD and
BS) were designed to complement the dangling ends of
TD and surface-proximal tails of TS (11 different
TS sequences were used). The extent of sequence similarity
for defined background DNA was carefully controlled to result in either
continuous homologies of variable length at the dangling ends of
TD or various sequence similarities at the surface-proximal
tails of TS. For TD, the sequence of
BD had 4 to 12 bases of continuous homology from position 86
to 98 considering the 5' end as 0 (Table
2 and Fig.
2a). The Gibbs free energy
between TD and BD varied between 3.2 and
14.9 kcal/mol for 4- to 12-base stretches of continuous
sequence homology at the dangling end. The Gibbs free energy between
BD and P18 (complementing the TD) was
between 4.3 and 4.6 kcal/mol.
In comparison with Tn (T47, T60, T80, and T106), the sequence of BD had 0, 13, 33, and 59 bases matching within the dangling ends of Tn and a Gibbs free energy of 2.3 to 74.9 kcal/mol. The Gibbs free energy between BD and P18 (complementing the Tn) was between 3.2 and 5.3 kcal/mol.
The second defined background DNA sequence, BS, was designed to complement the surface-proximal tails of 11 different TS (as shown in Table 2 and depicted in Fig. 3a) with various degrees of sequence similarity. The surface-proximal tails of TS had an 8- to 16-base overhang (duplex dependent) with 0% sequence homology to BS followed by a stretch of 59 bases with variable regions of contiguous perfect matches and sequence similarity with BS. This resulted in a sequence similarity of 0 to 90% on the whole surface-proximal tail and 10 to 100% sequence similarity considering only the 59 bases on the 5' end of TS. The dangling end of TS had 1 to 16 bases depending on the probe to which it was hybridized. For all interactions between BS and TS, the Gibbs free energy was always less negative than 5.7 kcal/mol. The Gibbs free energy between BS and P18 (complementing the TS) was between 3.8 and 8.1 kcal/mol.
Probe design and microarray synthesis.
Probes were
designed using a Perl script developed in-house that screened for all
18-mers of 16S rRNA genes that had at least two mismatches to every
other sequence in the RDP-II
(http://rdp.cme.msu.edu/).
Screened probes for a given pathogen were ranked based on melting
temperature and G+C content by following the parameters
incorporated into OligoArray 2.0 probe design software
(32). The final array
included a set of 146 probes (P18) targeting 21 different
16S rRNA gene sequences and mismatch probes identical to the perfect
match sequence except for 1 or 2 incorrect bases in the middle of the
oligomer to measure the degree of cross-hybridization. Microarrays
containing the above-described set of probes were synthesized by
Xeotron Corporation, Houston, TX (now part of Invitrogen, Carlsbad,
CA), using a proprietary in situ synthesis technology developed by the
University of Michigan
(11,
12).
Experimental approach. (i) Comparison of 45-mer synthetic target mixture to the Klenow/random primer-labeled target mixture.
Two-sample comparative hybridization,
similar to that used in gene expression studies
(5), was performed to
evaluate the differences and similarities between synthetic targets and
Klenow/random primer-labeled targets. A similar experiment was
conducted to study the impact of the complex background 16S rRNA gene
mixture on target signal intensity. A total of three comparative
hybridizations were performed in triplicate (shown by double arrows in
Fig. 1a). These were (i)
T45-m versus TK-m, to compare the hybridization
behaviors of two types of target mixtures; (ii) T45-m versus
B16S, to evaluate the impact of background DNA on the 45-mer
synthetic target mixture; and (iii) TK-m versus
B16S, to evaluate the impact of background DNA on the
Klenow/random primer-labeled target mixture. The complex
background mixture B16S, BD, and BS
was also hybridized alone to ensure that target signals for the probes
to be tested did not cross-hybridize with various backgrounds. For the
first of these hybridizations, 20 pmol (1,069 ng DNA) of
T45-m was mixed with 100 pmol (2,481 ng DNA) of
TK-m. For the second hybridization, 20 pmol (1,069 ng DNA)
of Cy3-end-labeled T45-m was mixed with 200 pmol (6,550 ng
DNA) of Cy5-labeled B16S. For the third hybridization, 100
pmol of Cy5-labeled TK-m was mixed with 200 pmol (6,550 ng
DNA) of Cy3-labeled B16S. The last mixture was obtained by
combining 10 pmol of labeled 16S rRNA gene from each of the 10
pathogens.
(ii) Evaluation of lengths and sequences of dangling ends of synthetic targets.
The effect of dangling end length was
studied by varying the length of the continuous sequence homology
between TD and BD from 4 to 12 bases as shown in
Fig. 2a. The dangling end
had the sequence 5'-(86
bases)-CGACTTGCATGTGTTG-3',
wherein nucleotides in boldface complement the sequence of
BD. In order to generate the increasing dangling end
overhang of TD from 8 to 16 bases, nine separate probes with
increasing offsets relative to the target were constructed.
The sequence of background DNA (BD) included the
complementary sequence for the 12 bases of the dangling end to result
in target-background interaction, which was expected to increase with
an increase in dangling end length. The hybridization included 67 pmol
of Cy3-labeled TD (3,557 ng) and 37 pmol of Cy5-labeled
BD (1,540 ng).
The dangling end interaction due to Gibbs free energy was examined further using four synthetic targets with 100% sequence similarity and of various lengths (T47, T60, T80, and T106) along with a 59-mer BD that was designed to complement the dangling end of Tn by 0, 13, 33, and 59 bp (having Gibbs free energies of 8.8, 14.8, 43.8, and 74.9 kcal/mol, respectively). The BD for this examination was the same as before; however, Tn targeted 10 probes different than those targeted by TD. The hybridization included 52 to 84 pmol of Cy3-labeled Tn (1,522 to 4,277 ng) and 37 pmol of Cy5-labeled BD (1,540 ng).
(iii) Evaluation of sequence similarity in the surface-proximal tail of a synthetic target.
The dependence of
sequence similarity at the surface-proximal tail of a synthetic target
with background DNA was studied by synthesizing 11 different synthetic
targets with TS of similar lengths (Fig.
3a) and a new synthetic
background DNA sequence (BS). The Gibbs free energies of
TS sequences varied between 9.2 and 76.8
kcal/mol, with sequence similarity varying between 10 and 100% (Fig.
3a). Each TS
was end labeled with Cy3 at the 5' end and complemented three
to eight probes. The hybridization included a total of 160 pmol of
Cy3-labeled TS (7,895 ng) and 30 pmol of Cy5-labeled
BS (1,115 ng).
Hybridization and scanning.
Target and
background mixtures were prepared in 100 µl of hybridization
solution containing 35% formamide, 0.4% Triton X-100, and 6x
SSPE. SSPE buffers were made from a stock of 18x SSPE, which is
2.7 M NaCl, 180 mM Na2PO4, 18 mM
Na2EDTA (adjusted to pH 6.6 with HCl). The hybridization
solution was heated at 95°C for 3 min, cooled on ice for 1 min,
and passed through a 0.22-µm filter. All hybridizations were
carried out in triplicate for 16 to 18 h at 20°C
using an M-2 microfluidic station (Xeotron Corporation). A flow rate of
500 µl per min was used for the recirculation of hybridization
solution through the microfluidic array during hybridization
(42). After overnight
hybridization at 20°C, the microarray was washed using wash
buffer 2 (6x SSPE, 0.2% Triton X-100), wash buffer 4
(1x SSPE, 0.2% Triton X-100), and wash buffer 2 with no Triton
X-100 in series for 2.2 min each (500 µl per min,
20°C). A nonequilibrium thermal dissociation approach adapted
to the Xeotron platform was used in all hybridization experiments
(8,
20,
39,
42). The protocol was
based on earlier studies utilizing a dissociation curve approach for
diagnostic arrays (8,
20,
39,
42). The microarrays were
washed with a high-stringency wash buffer (10 mM
Na2HPO4, 5 mM EDTA, pH 6.6; flow rate, 500
µl per min) for 2.2 min at increasing temperatures from 20 to
70°C at 2°C intervals. Experimental conditions have
been examined previously for optimal specificity
(42). Signal intensities
were quantified after each wash by use of a GenePix 4000B non-confocal
laser scanner (Axon Instruments, Inc, Foster City, CA) at a
photomultiplier tube voltage setting of 650 V for Cy5 (635-nm laser)
and 600 V for Cy3 (532-nm laser). For complex background DNA
(B16S), these settings were 750 V (for Cy5) and 500 V (for
Cy3). With these settings, photobleaching caused an average decrease in
signal intensity of 0.85% for Cy5-labeled targets and 0.54% for
Cy3-labeled targets between each of the 20 scans at
20°C.
Data analysis.
Data were analyzed using an XL script
that imported raw GenePix dissociation curve data between 20 and
70°C into Excel and generated a sigmoid curve for each
microarray feature. The signal-to-noise ratio (S/N) at a given
temperature was calculated from three replicate hybridization
experiments. Noise was defined as the average signal of twenty empty
spots (containing linker chemistry and no probes). The average signal
of 20 randomly selected nontarget probes (80.3 ± 10.6 arbitrary
units [a.u.] with a 635-nm laser or 148.3 ±
16.7 a.u. with a 532-nm laser) was similar to the average
signal of 20 empty spots (83.6 ± 6.4 a.u. with a
635-nm laser or 155.9 ± 10.6 a.u. with a 532-nm
laser) when the microarray was hybridized with the complex background.
Compared to results obtained for no background, the average noise
increased slightly when a complex background DNA (B16S) was
hybridized (1.19-fold for Cy3 and 1.54-fold for Cy5). For all
experiments, an average S/N greater than 3.0 was considered positive to
balance the maximum percentages of true-positive calls of
targeted probes and true-negative calls of nontargeted probes. Figure
1c is presented as the S/N
for the area under the dissociation curve to include variation in both
dissociation and signal intensity. For statistical
comparison of probe signal intensities when hybridization took place
with Klenow/random primer fragmented and 45-mer targets, a two-tailed
inference about differences in population means for independent samples
was performed with a 95% confidence interval. The free energy of
hybridization was computed using the two-state hybridization server
(www.bioinfo.rpi.edu/)
developed by Dimitrov and Zuker
(7) with a value of
43°C instead of the actual 20°C. This value was chosen
because the hybridization buffer contained 35% formamide, which is
expected to destabilize duplexes equivalently to increasing the
temperature by 21 to 25°C
(2,
39).
|
|
|---|
While a difference
in signal intensity with various target mixtures is expected, higher
signals with single-stranded targets than with Klenow/random
primer-labeled targets have not been reported previously. More
specifically, probes 1 through 30 had much higher S/N with
T45-m than with TK-m, while the opposite was
observed for probes 60 to 95. For 32 out of 95 probes, the S/N was
significantly higher with T45-m than with TK-m
(P
0.05). Dissimilar results were observed in a
study by Franke-Whittle et al.
(10), in which S/N was
higher for a majority of duplexes targeted with Klenow fragmented
targets than with those targeted with single-stranded targets. Higher
S/N with Klenow fragmented targets is expected because longer dangling
ends in TK-m provide the opportunity for the attachment of
many more Cy5 dye molecules than what was obtained for
T45-m, i.e., only one Cy3 dye molecule. The
average specific activity for TK-m was 72 ± 20
nucleotides per Cy5 dye molecule (calculated using the arithmetic
average of the specific activities of each of 10 targets that were
labeled separately). This implies that the average number of Cy5 dye
molecules was between 2 and 30 (assuming a target length of between 200
and 1,500).
It is evident from Fig. 1b that normalization of S/N for the higher number of dye molecules in TK-m or dye bias would not result in a profile that overlaps completely with the S/N profile of T45-m. This implies that factors in addition to differences in label abundance and dye bias are responsible for the differences in S/N. These factors include (i) the secondary structure of longer dangling ends of TK-m making certain regions inaccessible for hybridization with probes (4, 19, 28); (ii) TK-m and T45-m displacement from probes due to complementing strands of double-stranded TK-m (25), known as a zipper effect (30); (iii) bias in Klenow labeling, as previously suggested by Tiquia et al. (38); (iv) probe sequence variability and linear probe influences on thermodynamic stability (31); (v) physical quenching of Cy dyes (6); and (vi) conserved regions on overhangs of hybridized TK-m interacting with complementary strands from one or more of the 10 organisms also targeted in the mixture (23), creating chains of targets. Interactions occurring in solution between 16S rRNA gene targets from various organisms may influence the spot signal intensity by competing with the targets bound to the probe.
Further experiments were performed to examine and quantify the extent of target-background interaction on the two dissimilar targets by spiking both TK-m and T45-m into background DNA. When the two target mixtures were spiked (separately) into a complex mixture of nontarget 16S rRNA genes (B16S), the S/N of TK-m changed significantly compared to the S/N of T45-m (Fig. 1c). The x axis in Fig. 1c represents the factor by which the S/N of the target mixture (T45-m or TK-m) is influenced by the background, B16S, and the y axis is the frequency of probes with a given influence factor. For a given probe, the influence factor was computed as the S/N in the absence of B16S divided by the S/N in the presence of B16S. For targets not impacted by the background DNA, this influence factor should be close to unity (implying no effect), and the spread of the bell-shaped curve should be minimal. For T45-m, the influence factor was close to 1, while for TK-m, it was approximately 4. Similarly, the spread of the T45-m influence curve was very small compared to that obtained for TK-m. A simple comparison of the areas under each influence factor curve indicated that the interaction of TK-m with the background was 3.7 times higher than that of T45-m (obtained by comparing the areas under the curve of TK-m and T45-m, which were 69.5 and 19 arbitrary units, respectively). A Wilcoxon signed-rank test was used to statistically conclude that the distribution of influence factor was greater for TK-m than for T45-m with a 95% confidence interval. Therefore, background significantly influences the signal intensity of TK-m and not that of T45-m. Since the same probes were targeted with both target mixtures, the influence of background on signal intensity is dependent on the characteristics of the target molecules and independent of the spatial region on the array. The B16S was also hybridized alone to ensure that targeted probes did not produce any signal with the background 16S rRNA gene mixture.
The influence of background on the signal intensity of double-stranded TK-m may be the result of target interactions influenced by complementary and nontarget strands in background DNA. In one study, greater hybridization efficiency was observed with single-stranded targets than with double-stranded targets (13). This is due to competition between complementary strands and probes with double-stranded DNA. Related studies have reported increased competition between the complementing strand and probe as the length of the dangling end increased, while increasing the length of the surface-proximal tail did not influence hybridization (25). Interactions between the conserved regions of target dangling ends and sequences from a second target have also been observed (23). These interactions were minimized by using single-stranded RNA instead of double-stranded DNA. If this observation can be extended to DNA sequences, the use of asymmetric PCR for target preparation may be advantageous in minimizing target-background interaction.
Target-background interaction on the dangling ends of the target may also influence signal intensity by affecting the secondary structure. As studied by Chandler et al. (4), signal intensity was substantially increased as the target secondary structure was relieved using a second probe in solution that annealed to the target dangling end. Interaction between the target dangling end and a background DNA sequence may have a similar effect. Peplies et al. (24) observed similar results for a majority of tested probes. However, the signal of some probes decreased with a second probe in solution, suggesting that the opening of a selected binding site may lead to a reorganization of one or more secondary structures in other target regions.
There are additional reasons for using shorter targets with diagnostic arrays. Shorter targets reduce intermolecular structures that occur more frequently with single-stranded targets than with stiffer double-stranded products (36). Lane et al. (19) used amplicons of various lengths and concluded that shorter fragments reduced false negatives due to the high level of intermolecular secondary structure in longer targets. Fluorophore position on a target DNA sequence can also influence signal intensity (44). Signal intensity decreases as the proximity of the fluorescent molecule from a probe target duplex increases. Thus, target preparation strategies should include amplifying shorter targets with specific asymmetric PCR or fragmenting longer amplicons to produce a range of desired lengths.
Wilson et al. (43) suggested using fragments under 100 bp rather than full-length amplicons to obtain a stronger hybridization signal. A number of fragmentation strategies have been described for obtaining a size range between 35 and 200 bases with pre- and postlabeled targets and asymmetric PCR-amplified products (1, 21, 27, 41). Subsequent studies comparing costs, efficiencies, and influences on signal intensity for some of these methods are currently under way in our laboratory. Target-target and target-background interactions are expected to be less for targets other than 16S rRNA genes, especially if the targets have less-conservedregions.
Synthetic targets of increasing dangling end length.
The
effect of dangling end length was evaluated using a target with an
increasing stretch of continuous homology. A Cy3 end-labeled synthetic
102-mer target (TD) was designed to hybridize to nine
different probes on the array. After hybridization, TD was
expected to have a dangling end of 8 to 16 bases (Fig.
2a). A 59-mer synthetic
background DNA sequence (BD) end labeled with Cy5 was
designed to complement the dangling end sequence from position 86 to 98
with 4- to 12-base stretches of continuous homology. The
signal-to-noise ratio of BD and TD at
26°C is shown for each of the nine separate probes (Fig.
2b). In order to generate
the increasing dangling end overhang on the TD, probes were
offset by one nucleotide relative to the target. Considering an S/N
greater than 3 as a positive signal, BD hybridized to the
dangling end of TD only when the dangling end stretch of
continuous homology was 7 bases or longer. Signal from BD
was not observed when hybridized alone (i.e., without TD),
and mismatch probes did not display signal for BD or
TD, directly implicating that BD did not
cross-hybridize with the probe. As evident by the lack of signal for
probes with target dangling ends of 6 bases or less, interaction of the
surface-proximal end of TD with BD was
insignificant. The S/N of BD for 11- and 12-base overhangs
was higher than the S/N given by TD. This may be due to dye
bias between Cy5 and Cy3 and a higher specific activity of Cy5-labeled
BD. The 59-mer BD may have resulted in a great
percentage of dye-labeled product compared to that due to the 102-mer
TD, as the purity of oligonucleotide synthesis decreases
with increasing length.
The dangling end interaction due to the length of overhang was reexamined using four synthetic targets of various lengths (T47, T60, T80, and T106, together represented by Tn) along with the same 59-mer BD that was designed to complement the dangling end of Tn by 0, 13, 33, and 59 bp. Dangling end interaction occurred between T60, T80, and T106 and BD but not between T47 and BD (data not shown). The zero sequence homology between T47 and BD was the obvious reason for this lack of interaction.
Gibbs free energy calculations incorporate influences such as length, sequence similarity, secondary structures, and hairpin loops caused by near-perfect match sequences with insertions or deletions (16). It was used to examine interaction between TD, Tn and BD. For TD, an overhang with 4 sequential perfect match bases had a Gibbs free energy of 3.2 kcal/mol, while an overhang with 12 sequential perfect match bases had a Gibbs free energy of 14.9 kcal/mol. The S/N of BD became greater than 3 when the stretch of continuous bases on the dangling end was 7 or higher and Gibbs free energy was 7.6 kcal/mol or more negative. With the experimental conditions used in this study, target-background interaction on dangling end can occur with Gibbs free energy that is more negative than 7.6 kcal/mol. However, due to phenomena such as differences in intermolecular structures and competitive influences of complementary strands (28), the interaction observed with this free energy may not be extensible to double-stranded DNA targets. For calculating the Gibbs free energy between BD and TD, only the 3' end of TD was considered to circumvent the influence of probes on dangling end interaction. Neglecting the presence of the probe, the Gibbs free energy of the duplex formed between TD and BD was 15.4 kcal/mol. Because the 5' end of BD preferentially hybridizes to the 3' dangling end of TD, the interaction occurs solely on the dangling end rather than on the surface-proximal end of TD or the probe.
The S/N of BD increased as the Gibbs free energy became more negative and the length of continuous homology on the TD dangling end increased. Gibbs free energies between BD and T47, T60, T80, and T106 were 2.3, 14.2, 43.8, and 74.9 kcal/mol, respectively. Previous studies have also reported successful hybridization to 9-mer probes, with a Gibbs free energy of between 6 and 10 kcal/mol producing weak signals and one between 8 and 12 kcal/mol producing strong signals (29). It should be noted that binding free energy has been shown to be a function of the surface material, the surface charge density, the length of the linker molecule, and other experimental conditions (40).
Synthetic targets with surface-proximal tail sequences complementary to background DNA.
The effect of
the similarity of the sequence of the surface-proximal tail of the
target to the sequence of defined background DNA was studied using 11
different targets (TS) of similar lengths (89 to 102 bases).
All the targets had 18 bases complementary to 52 probes on the
microarray (Fig. 3a),
a dangling end of 1 to 16 bases, and a surface-proximal tail
of 67 to 76 bases, depending on the probe and target. Targets were
designed to have 10 to 100% sequence similarity with BS.
Probes were not offset (as with dangling end experiments); however, 11
targets were used, resulting in a Gibbs free energy of between
9.2 and 76.8 kcal/mol for the TS
BS duplex (Fig.
3b). The Gibbs free energy
for P18
TS varied between 17.5
and 24.1 kcal/mol. Continuous stretches of 38 and 45 perfect
matches at the surface-proximal tails of TS (with free
energies of 64.5 to 65.9 kcal/mol) displayed
target-background interaction (Fig.
3c), while continuous
stretches of 19 or fewer perfectly matching bases (with free energy of
42.1 kcal/mol or more positive) did not result in signals
attributable to BS. Targeted probes did not display signal
when BS was hybridized alone (i.e., without
TS).
These results suggest that the Gibbs free energy requirement for the target-background interaction at the surface-proximal tail is different from that for a similar interaction at the dangling end. While a Gibbs free energy of 7.6 kcal/mol (for a stretch of 7 continuous perfect matches) was sufficient for the dangling end interaction, a Gibbs free energy more positive than 64.5 kcal/mol (including a stretch of 19 continuous perfect matches) on the surface-proximal tail did not display interaction. Further examination of the TS with a Gibbs free energy of 36.8 kcal/mol when hybridized with BS (19-base stretch of continuous homology) showed no secondary structure on the surface-proximal tail. Thus, external influences that are not incorporated into Gibbs free energy calculations may be causing greater destabilization on the surface-proximal end. A study examining the influence of target overhang length on hybridization efficiency showed that the dangling end of a target interacted with its complementary strand more than with the surface-proximal tail (25). It was suggested that this is because the dangling end is exposed to the liquid phase, providing better access to complementary strands in solution. On the other hand, the surface-proximal tail is closer to the surface, thus limiting the availability to other strands in solution and reducing the association with complementary strands. It is obvious that the difference between target-background interactions on surface-proximal tails and those on dangling ends requires further investigation.
Conclusions.
The lengths and sequences of
surface-proximal tails and dangling ends of targets and the resulting
target-background interactions influence signal intensity and decrease
the specificity of oligonucleotide probe-target hybridizations. The
dangling end was found to be significantly more prone to
target-background interaction than the surface-proximal tail. Gibbs
free energy between the target and background was found to be a better
indicator of hybridization signal intensity than the sequence or length
of the dangling end alone. This study underlines the need for careful
target preparation (fragmentation and asymmetric PCR) and evaluation of
signal intensities for diagnostic arrays by use of 16S rRNA and other
gene targets due to the potential for target interaction with a complex
background.
This work was supported by the National Institutes of Health (grant R01 RR018625-01) and Michigan Economic Development Corporation (GR-476 PO 085P3000517).
Published ahead of print on 17 November 2006. ![]()
Supplemental material for this article may be found at http://aem.asm.org/. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»