Previous Article | Next Article ![]()
Applied and Environmental Microbiology, December 2007, p. 7947-7958, Vol. 73, No. 24
0099-2240/07/$08.00+0 doi:10.1128/AEM.00842-07
Copyright © 2007, American Society for Microbiology. All Rights Reserved.
,
Key Laboratory of Systematic Mycology and Lichenology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101,1 College of Plant Protection, Agricultural University of Hebei, Baoding 071001,2 Graduate School of Chinese Academy of Sciences, Beijing 100049, China,3 Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AB, United Kingdom4
Received 13 April 2007/ Accepted 14 October 2007
|
|
|---|
|
|
|---|
In recent years, various computer programs have been used for the selection of these restriction endonucleases. Programs have been designed to search for the recognition sites of endonucleases, to locate the positions of these sites in sequences, and to calculate the sizes of fragments after digestion. Some of the software packages, e.g., Gene Runner version 3.05 (24), Genetyx version 6.1 (26), MapDraw of DNASTAR version 3.14 (16, 32), NEBcutter version 2.0 (35), Webcutter version 2.0 (40), BioEdit (45), DNA Club (47), the Infobiogen restriction program (29), and REBsites (14), have been adopted to assist in the selection of endonucleases in the PCR-RFLP experimental design. However, these programs perform restriction analysis only for individual sequences. Although the SNP Cutter software (51) can analyze the data from multiple sequences, the subsequent comparison of banding profiles of different sequences after digestion is done manually. Therefore, employing these programs is time-consuming, tedious, and often error prone, especially when a large number of taxa with different sequences are involved. There is a need to develop a strategy to select the endonucleases rapidly through the restriction analysis of a large number of sequences.
The purpose of this work was to develop a method of rapid selection of restriction endonucleases for PCR-RFLP identification of fungal species with the aid of a computer program based on the analysis of a considerably large number of DNA sequences. Species of Pleurotus (Fr.) P. Kumm., a fungal genus, are used as examples owing to the reasonable number of species in the genus and the considerable molecular data on internal transcribed spacer (ITS) sequences.
Species of Pleurotus, commercially called oyster mushrooms, are a group of the most important edible fungi, accounting for 25% of the total world production of commercially cultivated mushrooms (17). In addition, Pleurotus species are also used as a tool for environmental control in the biotransformation-biodegradation of industrial effluents owing to their unique ligninolytic system (6). Further, some species of Pleurotus are known as predaceous fungi (39) and have the potential for controlling nematode diseases of agricultural crops. Traditionally, species-level identification of Pleurotus has been based mainly on morphology and compatibility relationships (3, 18, 49). However, many difficulties have been associated with the identification of species of this genus, especially those of commercially cultivated strains, due to their similar morphological characteristics and morphological plasticity in cultivation (19). The long period from the inoculation of spawn to the production of fruiting bodies of these fungi also delays any positive determination (33). Various molecular methods, such as DNA sequencing and RFLP and PCR-RFLP techniques, have been used to identify Pleurotus species. The application of analyses of sequences of ITS regions and large subunits (LSU) of nuclear ribosomal DNA (rDNA) to the determination of Pleurotus species (42, 43, 50) is very powerful at the species level, but these analyses are often used for particular research and have not been used for routine or rapid identification. RFLP and PCR-RFLP methods have also been applied to the study of molecular systematics and the genotyping of Pleurotus (2, 12). In the two latter studies (2, 12), however, the numbers of restriction enzymes screened (four and seven, respectively) and species identified (5 and 10, respectively) were limited because the selection of restriction endonucleases was based on results from extensive experiments. It is necessary to establish a rapid, accurate, and simple molecular method for the identification of Pleurotus species, based on the analysis of a large number of sequences and the selection of restriction enzymes from thousands of candidate endonucleases.
There are about 20 Pleurotus species recognized worldwide (15). Recent molecular phylogenetic analysis of Pleurotus provides a useful framework for understanding species concepts and taxonomy. Relationships between species and species determination have also been studied, and the results of systematic analyses of LSU and ITS sequences in nuclear rDNA (42, 43, 50) and of small-subunit sequences in mitochondrial rDNA (8) show that ITS sequences are an ideal marker for species identification. A large number of ITS sequences from this genus have been accumulated, and a molecular identification method using the PCR-RFLP technique to distinguish Pleurotus species can be developed through the DNA sequence analyses.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Fungal material used in this study
|
PCR products were purified using cleanup plates (Millipore Corporation). Sequencing was performed by the cyclic reaction termination method on an ABI Prism 3100 genetic analyzer (Applera Corporation), and data were collected on a Dell computer with the ABI Prism DNA sequencing analysis software, version 3.7. Each fragment was sequenced in both directions for confirmation, and the sequences of the two strands were assembled with ABI Prism SeqScape software, version 1.1.
Sequence analysis.
The retrieval tool from the National Centre of Biotechnology Information was used to search ITS sequences by using the phrase "Pleurotus internal transcribed spacer 1 internal transcribed spacer 2" as keywords. All available ITS sequences from GenBank were aligned using Clustal X 1.81 (38) and then further manually adjusted using BioEdit 5.0.6 (11) to reduce some obvious mismatching of sequences created by computer alignment. Sequences with regions of consecutive ambiguous bases or with potentially confounding ambiguous bases, i.e., unresolved bases or ambiguous bases in a position that may affect a restriction site, were excluded. Only sequences from material identified to the species level were used, and pairs of identical sequences were represented by one haplotype. A total of 259 ITS sequences were retrieved, and 85 haplotypes representing 151 sequences met all of the above criteria of selection for study. These sequences were used for phylogenetic analysis and for the selection of restriction enzymes (Table 2). Phylogenetic analyses were performed by PAUP 4.0b10 for Macintosh (36) by using ITS sequences from Hohenbuehelia grisea as the outgroup. A parsimony analysis was performed using a heuristic search, with the random addition of sequences with 1,000 replicates, tree bisection-reconnection as the branch-swapping algorithm, one tree held at each step during stepwise addition, and the MULTREES option off. Gaps were treated as missing data. Bootstrap values were calculated from 1,000 replicates.
|
View this table: [in a new window] |
TABLE 2. Representative sequences of ITS haplotypes used for the phylogenetic analyses and the autoscreening of restriction endonucleases for PCR-RFLP
|
![]() View larger version (27K): [in a new window] |
FIG. 1. The PRE Auto work flow includes steps 1 through 4, as shown in the figure. Rectangles show the tasks that the program performs. Parallelograms show the data that the user supplies or that every step produces. The program may produce four files, as shown: *.biosequence in step 1, *.txt in steps 2 and 3, and *.wmf in step 4, where * represents an optional string for writing the file name.
|
The screening for a single enzyme was followed by a further search for a combination of two enzymes, because all 12 species could not be distinguished by any one enzyme. Candidate restriction enzymes were determined based on two parameters, the scoring and the degree of cleavage site coverage, after the autoscreening of restriction enzymes was implemented. The predicted band patterns after digestion by candidate enzymes having a coefficient above 0.1 and a cleavage site coverage of 100% for a single enzyme and 50% for a combination of two enzymes were examined through the electrophoresis map output from the PRE Auto program. The major selection criteria for enzymes were based on the minimum number of enzymes and the steps required to produce discriminating band patterns. In addition, enzymes producing too many small fragments (<50 bp) or showing intraspecific polymorphism were avoided. The availability and the cost of enzymes were also taken into account. Rare enzymes difficult to procure were also avoided, and common and low-cost enzymes were selected when multiple candidates with the same recognition site were available.
Digestion reactions were performed for each of the selected restriction enzymes, i.e., HaeIII, AluI, and HpyCH4IV (all from New England Biolabs, Beverly, MA), in a total volume of 20 µl containing 2 µl of 10x reaction buffer, 0.5 µl of each restriction enzyme (10 U/µl), and 17.5 µl (for a single enzyme) or 17 µl (for a combination of two enzymes) of the PCR products of each sample. Digestions were incubated overnight at the optimal temperature (37°C). Digested products were separated by 3% agarose gel electrophoresis in 10x Tris-borate-EDTA buffer for 3 h at 80 V. The gels were stained with 0.1 g of ethidium bromide/liter and visualized with UV light. Sizes of restriction fragments were determined by comparison with a standard DNA molecular mass marker, a 100-bp DNA ladder (Beijing Yuanchen Bio Company, Beijing, China).
Experimental validation of PCR-RFLP species identification.
A total of 28 samples representing 12 Pleurotus species were selected for PCR-RFLP identification and validation of the method (Table 1). ITS sequences from five of these samples had been sequenced previously and submitted to GenBank (accession no. AY562495, AY540320, AY540321, AY540325, and AY315779). Of the five sequenced samples, two (CGMCC 5.467 and CGMCC 5.494, identified as P. cystidiosus) were proven to be misidentified, as revealed by their ITS sequences (AY540320 and AY540321, deposited in GenBank under the name P. cystidiosus), and were used as negative controls, while the other three (HMAS 63355 [P. calyptratus], CGMCC 5.344 [P. ostreatus], and IE 74 [P. smithii]) were used as positive controls. Additional samples were randomly selected from materials available in our laboratory to make two to three samples for each species. After the PCR products from the selected samples were confirmed by electrophoresis, the PCR-RFLP procedure described above was performed for the molecular identification of the samples through the comparison of the actual band patterns after digestion by the tested restriction endonucleases with the patterns predicted by the PRE Auto program. The species identification of the samples was further confirmed by sequencing the PCR products from ITS sequences from nine samples representing six species of Pleurotus, in addition to the five sequenced samples used as controls (Table 1).
Nucleotide sequence accession numbers.
The sequences determined in this study have been deposited in GenBank under accession numbers EF514242 to EF514250.
|
|
|---|
![]() View larger version (44K): [in a new window] |
FIG. 2. One of the 1,288 most parsimonious trees obtained from the analysis of nucleotide sequences of ITS regions (nuclear rDNA). The upper and lower numbers on each branch denote the number of estimated substitutions and the percentage of bootstrap replicates, respectively. Only bootstrap values higher than 50% are shown. Numbers in the brackets after species names are the numbers of sequences that the haplotypes represented. The length of the tree is 701 steps, with a consistency index of 0.6904 and a retention index of 0.9469. Grouping is as follows: group 1, P. abalonus; group 2, P. cystidiosus; group 3, P. smithii; group 4, P. dryinus; group 5, P. tuber-regium; group 6, P. ostreatus; group 7, P. pulmonarius; group 8, P. eryngii; group 9, P. nebrodensis; group 10, P. cornucopiae; group 11, P. calyptratus; and group 12, P. djamor.
|
The expected enzyme digestion band patterns for HaeIII are shown in Fig. 3, with representative sequences from the 12 species. Among the patterns, six were species-specific profiles, representing P. calyptratus (lanes 1 to 3), P. smithii (lanes 4 to 6), P. cystidiosus (lanes 7 to 9), P. abalonus (lanes 10 to 12), P. dryinus (lane 13), and P. cornucopiae (lanes 14 to 16). There were some differences in band patterns of P. pulmonarius (lanes 17 to 19), P. tuber-regium (lanes 20 to 22), and P. djamor (lanes 23 to 25), but the sizes of the fragments were very similar. The band patterns of the remaining three species, P. eryngii (lanes 26 to 28), P. nebrodensis (lanes 29 to 1), and P. ostreatus (lanes 32 to 34), were identical.
![]() View larger version (14K): [in a new window] |
FIG. 3. Predicted band patterns produced by the PRE Auto program after digestion with HaeIII. Lane 1, P. calyptratus AY562495; lane 2, P. calyptratus AY265814; lane 3, P. calyptratus AY450338; lane 4, P. smithii AY315779; lane 5, P. smithii AY315781; lane 6, P. smithii AY315786; lane 7, P. cystidiosus AY315767; lane 8, P. cystidiosus AY315770; lane 9, P. cystidiosus AY315773; lane 10, P. abalonus AY315794; lane 11, P. abalonus AY315806; lane 12, P. abalonus AY315808; lane 13, P. dryinus AY450343 (one haplotype with four sequences); lane 14, P. cornucopiae AB115037; lane 15, P. cornucopiae AY265817; lane 16, P. citrinopileatus DQ077889; lane 17, P. pulmonarius AB115046; lane 18, P. pulmonarius AY696300; lane 19, P. pulmonarius U60648; lane 20, P. tuber-regium AB115045; lane 21, P. tuber-regium AF109983; lane 22, P. tuber-regium AY450344; lane 23, P. djamor AB115053; lane 24, P. salmoneostramineus AY265843; lane 25, P. salmoneostramineus AY265845; lane 26, P. eryngii AB115042; lane 27, P. eryngii AY450347; lane 28, P. eryngii AY540333; lane 29, P. nebrodensis AY540331; lane 30, P. nebrodensis AY581427; lane 31, P. nebrodensis AY720935; lane 32, P. ostreatus AY368665; lane 33, P. ostreatus AY540332; and lane 34, P. ostreatus AY636055. Lane M, 100 bp DNA ladder.
|
![]() View larger version (17K): [in a new window] |
FIG. 4. Predicted band pattern after digestion with AluI and HpyCH4IV produced by the PRE Auto program. Lane 1, P. pulmonarius AB115046; lane 2, P. pulmonarius AY696300; lane 3, P. pulmonarius U60648; lane 4, P. tuber-regium AB115045; lane 5, P. tuber-regium AF109983; lane 6, P. tuber-regium AY450344; lane 7, P. djamor AB115053; lane 8, P. salmoneostramineus AY265843; lane 9, P. salmoneostramineus AY265845; lane 10, P. eryngii AB115042; lane 11, P. eryngii AY450347; lane 12, P. eryngii AY540333; lane 13, P. nebrodensis AY540331; lane 14, P. nebrodensis AY581427; lane 15, P. nebrodensis AY720935; lane 16, P. ostreatus AY368665; lane 17, P. ostreatus AY540332; and lane 18, P. ostreatus AY636055. Lane M, 100-bp DNA ladder.
|
![]() View larger version (23K): [in a new window] |
FIG. 5. Optimized flow chart for identification of Pleurotus species using three restriction endonucleases.
|
![]() View larger version (35K): [in a new window] |
FIG. 6. PCR amplification products from Pleurotus species. Lane 1, Pleurotus calyptratus HMAS 63355; lane 2, P. calyptratus HMAS 77117; lane 3, P. smithii IE 74; lane 4, P. cystidiosus ATCC 28597; lane 5, P. cystidiosus CGMCC 5.467; lane 6, P. cystidiosus CBS 80391; lane 7, P. abalonus MG 005; lane 8, P. abalonus CGMCC 5.409; lane 9, P. dryinus F 14011; lane 10, P. dryinus CBS 44977; lane 11, P. dryinus HKAS 17450; lane 12, P. cornucopiae ATCC 38547; lane 13, P. cornucopiae MG 504; lane 14, P. cornucopiae HMAS 76520; lane 15, P. pulmonarius MG 502; lane 16, P. pulmonarius HMAS 76672; lane 17, P. pulmonarius HMAS 72869; lane 18, P. tuber-regium MG 506; lane 19, P. tuber-regium HMAS 84647; lane 20, P. djamor CGMCC 5.600; lane 21, P. djamor CGMCC 5.407; lane 22, P. eryngii MG 497; lane 23, P. eryngii HMAS 25978; lane 24, P. nebrodensis MG 500; lane 25, P. nebrodensis HMAS 86357; lane 26, P. ostreatus CGMCC 5.344; lane 27, P. ostreatus CGMCC 5.37; and lane 28, P. ostreatus HMAS 66080. Lane M, 100-bp DNA ladder.
|
![]() View larger version (63K): [in a new window] |
FIG. 7. PCR-RFLP band patterns observed with HaeIII. Lane 1, Pleurotus calyptratus HMAS 63355; lane 2, P. calyptratus HMAS 77117; lane 3, P. smithii IE 74; lane 4, P. cystidiosus ATCC 28597; lane 5, P. cystidiosus CGMCC 5.467; lane 6, P. cystidiosus CBS 80391; lane 7, P. abalonus MG 005; lane 8, P. abalonus CGMCC 5.409; lane 9, P. dryinus F 14011; lane 10, P. dryinus CBS 44977; lane 11, P. dryinus HKAS 17450; lane 12, P. cornucopiae ATCC 38547; lane 13, P. cornucopiae MG 504; lane 14, P. cornucopiae HMAS 76520; lane 15, P. pulmonarius MG 502; lane 16, P. pulmonarius HMAS 76672; lane 17, P. pulmonarius HMAS 72869; lane 18, P. tuber-regium MG 506; lane 19, P. tuber-regium HMAS 84647; lane 20, P. djamor CGMCC 5.600; lane 21, P. djamor CGMCC 5.407; lane 22, P. eryngii MG 497; lane 23, P. eryngii HMAS 25978; lane 24, P. nebrodensis MG 500; lane 25, P. nebrodensis HMAS 86357; lane 26, P. ostreatus CGMCC 5.344; lane 27, P. ostreatus CGMCC 5.37; and lane 28, P. ostreatus HMAS 66080. Lane M, 100-bp DNA ladder.
|
![]() View larger version (105K): [in a new window] |
FIG. 8. PCR-RFLP band patterns observed after digestion with the combination of AluI and HpyCH4IV. Lane 1, P. dryinus HKAS 17450; lane 2, P. pulmonarius MG 502; lane 3, P. pulmonarius HMAS 76672; lane 4, P. pulmonarius HMAS 72869; lane 5, P. tuber-regium MG 506; lane 6, P. tuber-regium HMAS 84647; lane 7, P. djamor CGMCC 5.600; lane 8, P. djamor CGMCC 5.407; lane 9, P. eryngii MG 497; lane 10, P. eryngii HMAS 25978; lane 11, P. nebrodensis MG 500; lane 12, P. nebrodensis HMAS 86357; lane 13, P. ostreatus CGMCC 5.344; lane 14, P. ostreatus CGMCC 5.37; lane 15, P. ostreatus HMAS 66080; lane 16, P. cornucopiae HMAS 76520; and lane M, 100-bp DNA ladder.
|
|
|
|---|
Although the existing design tools for the PCR-RFLP assay, e.g., DNA CLUB (47), NEBcutter version 2.0 (44), Genetyx version 6.1 (26), and SNP Cutter (51), have been used to search for cleavage sites and to calculate the sizes of restriction fragments, the selection of restriction endonucleases by manual comparison of restriction fragment profiles of different sequences with various restriction enzymes is still very time-consuming. For example, using the DNA CLUB and Origin 5.0 computer packages, Wright and Pimm (47) had to manually compare the restriction fragment patterns of 29 sequences with 55 different restriction enzymes and finally selected 10 endonucleases to digest the 16S gene for the molecular identification of methanogens. The PRE Auto software reported here is a much more efficient and informative program than the other computer packages for PCR-RFLP experiment design, especially for the functions of multiple-sequence input, autoscreening of endonucleases, and production of an electrophoresis gel map. After the autocomparison and scoring of restriction fragment patterns of each restriction enzyme for all the input ITS sequences, the selection of restriction endonucleases was very efficient with the guidance of the electrophoresis map. Further, the treatment of grouped sequences is a novel function that makes it possible to consider the sequence variation within a species for the autoscreening of restriction enzymes. Although the haplotype sequences from the Pleurotus species used were considerably variable, e.g., groups 5 and 10 (Fig. 2), or very conservative, e.g., groups 8 and 9 (Fig. 2), within a group, the PRE Auto program could still find the restriction endonucleases to differentiate the groups, which represent different species. This result will avoid the possible restriction sites within an individual sequence but without the capability to separate species. The new method developed in this study for the rapid selection of restriction enzymes can be applied widely in the molecular identification of living organisms using the PCR-RFLP technique when a large number of target sequences are considered. However, it is worth notice that group 6, representing P. ostreatus, is a paraphyletic group in the ITS sequence analysis (Fig. 2), closely related to group 7, P. pulmonarius. Because groups 6 and 7 were input into PRE Auto as separate groups, they were successfully distinguished from each other by the restriction digestion by HaeIII. As a computer program, the PRE Auto program is able to search for the most effective enzymes to separate the sequence groups input by the researcher, but it cannot make the taxonomic decision on the grouping of the sequences.
In addition, all other available computer packages, except the SNP Cutter, take only an individual sequence representative of a species to select restriction endonucleases. It is an advantage of both the PRE Auto program and the SNP Cutter to be able to analyze multiple sequences from a species for searching the cleavage sites of restriction enzymes. However, SNP Cutter is unable to do the autoscreening of restriction endonucleases and to output the predicted electrophoresis gel map after digestion.
The newly developed technique for the rapid selection of restriction enzymes has been validated by experiments using 17 living strains and 11 dried specimens, representing 12 Pleurotus species. This method could unequivocally identify all the 28 Pleurotus samples, including 24 in conformity with the previous species determination based on the morphological method and four in discordance with the original identification. Among the 28 tested samples, the five samples used as controls (three as positive and two as negative controls) based on ITS sequences previously submitted to GenBank were verified by the PCR-RFLP method. ITS sequences from two misidentified samples detected here (HMAS 76520 and HKAS 17450) (Table 1) and from seven correctly named samples (ATCC 28597, CBS 44977, HMAS 25978, HMAS 86357, HMAS 66080, CGMCC 5.37, and HMAS 84647) also confirmed the determination of species using the PCR-RFLP method. The results indicate that the method developed in this study is accurate and reliable.
Many efforts have been devoted to the identification of Pleurotus species, including the application of molecular methods. In the work on RFLP and PCR-RFLP by Bao et al. (2) and Iracabal et al. (12), extensive experiments were conducted to screen the restriction enzymes from a very limited set of enzyme candidates based on a few DNA sequences. The autoscreening method reported here, supported by the PRE Auto program and based on the analysis of a large number of sequences, can select restriction enzymes from thousands of candidate endonucleases, and the 12 Pleurotus species were unambiguously identified with only two steps and three restriction endonucleases. In comparison with previous studies identifying Pleurotus species by the PCR-RFLP technique, this study used a minimal number of restriction endonucleases and required minor costs and a minor amount of time for the selection of enzymes. Further, there is much scope for increase in the number of species for identification and the number of endonucleases for screening in the PRE Auto program.
In conclusion, a new method for the rapid selection of restriction endonucleases with the aid of the PRE Auto computer program was established for the molecular identification of living organisms using the PCR-RFLP technique. The method is based on the sequence analysis of target DNA fragments of species for screening of restriction enzymes from a large number of candidate endonucleases, and the autoscreening is performed by the PRE Auto program. This is a genotypic identification approach that can be applied for the discrimination of a large number of species, confirmed with at least 12 Pleurotus species in this study, and is flexible to take more species and more endonucleases into its database. The computer autoscreening of restriction enzymes for PCR-RFLP identification of species is a very efficient and time- and cost-saving method for the characterization of living organisms at the species level.
Published ahead of print on 26 October 2007. ![]()
Supplemental material for this article may be found at http://aem.asm.org/. ![]()
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»