Previous Article | Next Article ![]()
Applied and Environmental Microbiology, November 2002, p. 5437-5444, Vol. 68, No. 11
0099-2240/02/$04.00+0 DOI: 10.1128/AEM.68.11.5437-5444.2002
Copyright © 2002, American Society for Microbiology. All Rights Reserved.
Marilena Budroni,
Madan Thangavelu, and Francesca Clementi
Dipartimento di Biotecnologie Agrarie ed Ambientali, Università di Ancona, Ancona, Italy
Received 13 May 2002/ Accepted 2 August 2002
|
|
|---|
|
|
|---|
Analysis of the amino acid sequence of Sed1p in S. cerevisiae S288C revealed the presence of repeated amino acid motifs localized within two distinct regions of the polypeptide chain (15). Based on the SED1 nucleotide sequence of S288C, the first region (region 1) contains three repeat units of 66 bp and a truncated one of 42 bp arranged in the following order: 66, 42, 66, 66. The penultimate codon of each 66-bp unit encodes an asparagine residue that is a potential N-linked glycosylation site. The second region (region 2) contains two 153-bp repeats and the information for a total of six cysteine residues. Similar to their role in mucins, the cysteine residues may be involved in one or more functions, such as disulfide-dependent intramolecular interactions required for protein folding or inter-Sed1p interactions on the cell wall, or in interactions with other cell wall proteins, or they may be necessary for higher-order assemblies involving Sed1p.
Our objective in this study was to assess whether the observed tandem repeats could mimic a minisatellite-like behavior and cause length polymorphism in an S. cerevisiae gene coding for a structural protein. Thus, after a preliminary PCR survey of strain collections and previously uncharacterized S. cerevisiae isolates from diverse sources (I. Mannazzu and M. Thangavelu, unpublished data), we surveyed SED1 sequence variation in a collection of previously uncharacterized grape must isolates of S. cerevisiae and observed abundant SED1 polymorphisms in the population analyzed. In this paper, we report the presence of minisatellite-like sequences within the open reading frame (ORF) of the SED1 gene and speculate on the reasons for SED1 length variation and heterozygosis in feral strains of S. cerevisiae.
|
|
|---|
The wild S. cerevisiae strains were isolated, during the 1998 vintage, from a total of nine vats, each containing must of a single grape variety, and sampled in three separate wine cellars located in a traditional wine-producing area of the Italian region of Marche (Table 1). The musts were collected from the fermentation vats immediately after crushing of the grapes and transferred to sterile 250-ml flasks, which were taken to the laboratory within 8 h after collection and incubated statically at 25°C. The yeasts were isolated by streaking triplicate must aliquots on WL nutrient agar (Oxoid) immediately upon sampling (time zero) and at different enrichment stages (early, middle, and late) in order to collect the strains acting in sequence during must fermentation. A total of 286 colonies exhibiting Saccharomyces morphology on WL agar were purified by repeated streaking on YEPD, and 186 isolates, confirmed as belonging to the species S. cerevisiae, were utilized in the present study. They were identified according to the method of Vaughan-Martini and Martini (31), utilizing the dichotomic key proposed by Boulton et al. (4) as described by Guerra et al. (12).
|
View this table: [in a new window] |
TABLE 1. Origin of isolates and distribution of the 13 SED1 PCR profiles within the population analyzed
|
DNA extraction, PCR conditions, and restriction analyses.
Total genomic DNA for PCR analysis was isolated from 24-h cultures as described by Ushinsky et al. (30). The PCR primers SED1FOR (5'-ATGAAATTATCAACTGTCCTATTATCTGCCGG-3'; bases 1 to 32) and SED1REV (5'-TTATAAGAATAACATAGCAACACCAGCCAAACC-3'; bases 1017 to 984) were designed on the S. cerevisiae S288C sequence. PCR amplification reactions were performed on a Perkin-Elmer Gene AMP PCR System 9700 in 25-µl reaction mixtures containing 2 µl of template DNA (approximately 10 ng/µl), 0.5 U of DyNAzyne II DNA polymerase (Finnzymes), 1x reaction buffer (Mg2+ free), 1.5 mM MgCl2, 100 µM each deoxynucleoside triphosphate, and 2 pmol each of SED1FOR and SED1REV primers. The reactions were run for 35 cycles as follows: denaturation at 94°C for 1 min, annealing at 64°C for 1 min, and elongation at 72°C for 2 min. An initial denaturation step at 94°C for 3 min and a final 7-min extension at 72°C were performed. The PCR products were analyzed by electrophoresis on a 1.4% agarose gel in 1x Tris-borate-EDTA buffer. The gel images were visualized by means of a Bio-Rad Gel DOC 1000 and acquired with Multi-Analyst software (Bio-Rad). The restriction fragment length polymorphism (RFLP) analysis of the SED1 amplicons was performed as follows: 25 µl of the PCR product was digested overnight with an excess of HpaII and KpnI in a final volume of 50 µl. The restriction fragments were analyzed as described above.
DNA sequencing and sequence analyses.
PCR products were purified using a Microcon-PCR Centrifugal Filter Devices (Millipore) PCR template purification kit or by means of the QIAquick gel extraction kit protocol (Qiagen) and sequenced directly. Sequencing reactions were prepared by PCR by using SED1FOR and SED1REV and the internal primers S1 (5'-TTACAACCCATCTACTGACTACACC-3') and S2 (5'-GGTGTAGTCAGTAGATGGGTTGTAA-3') with the ABI PRISM Big Dye Terminator Cycle Sequencing Ready Reaction kit (Perkin-Elmer). The products of these reactions were purified on Centri-Sep spin columns (Amicon), dried, denatured by adding Template Suppression Reagent (Perkin-Elmer), and sequenced by using the ABI PRISM 310 Genetic Analyzer (Perkin-Elmer). The primary sequence alignment was performed using the Multiple Alignment Program (http://dot.imgen.bcm.tmc.edu:9331). Minor adjustments of the output alignment were performed by hand.
Tetrad analysis.
Yeasts grown on YEPD were streaked on sporulation medium and incubated at 25°C until asci were observed microscopically. Depending on spore viability, a total of 16 to 48 asci for each isolate were dissected as described by Rose et al. (26). The single-spore progeny from asci with four viable meiotic products were grown on YEPD and replica plated onto GAL, MEL, TRE, MAL, CEL, SUC, RAF, Biggy agar, CUP, and GLY as described by Mortimer et al. (21). Sporulation efficiency was determined as the sum of four- and three-spore asci divided by the number of asci dissected. Spore viability was determined as the number of viable spores divided by the total number of spores dissected (5). Homothallism was assessed by examining microscopically the abilities of the meiotic derivatives from an ascus to sporulate. Segregation of SED1 alleles among single-spore progeny was analyzed by PCR.
Nucleotide sequence accession numbers.
The sequences of the CBS1171 SED1 gene and of Sed1-1 to Sed1-7 were deposited in GenBank under accession numbers AF510219, AF510220, AF510221, AF510222, AF510223, AF510224, AF510225, and AF510226, respectively.
|
|
|---|
Under relatively stringent PCR conditions, the primer pair based on the S. cerevisiae S288C SED1 sequence was utilized to amplify total DNA from the two reference strains, CBS1171 and S288C, and from all the must isolates. Both of the reference strains produced a single amplicon; S288C yielded the expected 1,017-bp product, while CBS1171 yielded a product which was smaller than that of S288C (Fig. 1). The 186 S. cerevisiae must isolates displayed striking differences in the electrophoretic profiles of their PCR products. The major proportion of the isolates (72%) yielded a single amplification product of variable length, suggesting either a single copy of the gene per haploid genome or multiple copies of the same size in a diploid or polyploid genome. The remaining isolates contained two or three SED1 amplicons of different lengths per genome complement. The SED1 amplicons ranged from approximately 950 to 1,300 bp.
![]() View larger version (32K): [in a new window] |
FIG. 1. SED1 gene polymorphisms. PCR primers designed on the SED1 sequence were utilized to amplify the SED1 genes of the two reference strains, CBS1171 and S288C, and of the 186 must isolates of S. cerevisiae. Lane 1, 100-bp ladder (Amersham-Pharmacia); lanes A to O, PCR profiles observed within the population analyzed.
|
RFLP analysis of SED1 amplicons.
Based on the restriction map of the SED1 ORF of S288C (http://genome-www.stanford.edu/Saccharomyces/), it was easy to observe that the HpaII and KpnI restriction enzymes could be used as convenient markers to localize the regions of the gene involved in the observed length variations. In fact, when digested with HpaII, the SED1 ORF of S288C yields two well-resolved fragments. The shorter one (333 bp) contains all of region 1, including the mosaic of 66- and 42-bp repeat units, while the larger fragment (654 bp) contains region 2, which includes the two 153-bp repeats. KpnI cuts the SED1 gene within region 1 and yields a band corresponding to the 42-bp block located within this region (data not shown). Comparison of the HpaII and KpnI restriction products of the S288C and CBS1171 SED1 genes readily localized the observed length difference to region 1. In fact, the S288C region 1 fragment was about 40 bp longer than the corresponding fragment in CBS1171, possibly due to the lack of the 42-bp block in the latter (data not shown). Similarly, the HpaII and KpnI restriction profiles of the SED1 amplification products of the isolates presenting PCR profiles A to O revealed that the observed length polymorphisms were due to length variation localized in region 1 and/or region 2. The restriction analyses of the SED1 amplicons also showed that all the PCR profiles could invariably be considered as derived from combinations of seven SED1 length variants labeled Sed1-1 to Sed1-7, from the shortest to the longest (Fig. 2). Interestingly, the SED1 amplicons of the must isolates never contained the short DNA fragment including the 42-bp block present in S288C and were in that respect similar to that in CBS1171 (Fig. 2).
![]() View larger version (29K): [in a new window] |
FIG. 2. Structures of the seven SED1 alleles with the locations of the repeat units and SNPs. The dotted lines indicate gaps in the alignments. The checkered box represents the 42-bp unit exclusive to S288C. Repeat units within regions 1 and 2 are represented as boxes with the same fill patterns to indicate identical sequences and with different fill patterns to underline the presence of differences in the sequences (see Fig. 4). SNPs are indicated as N/n, where N stands for the nucleotide in CBS1171 and n represents the replacing nucleotide. The wave-patterned boxes represent sequence that is not present in the SED1 alleles of the reference strains and in Sed1-1 to Sed1-3 and differs from the sequence represented by the brick-patterned boxes by five point mutations conserved in Sed1-4 to Sed1-7.
|
Consistent with the restriction analyses, the SED1 sequences of S288C and CBS1171 differed by a 42-nucleotide sequence in region 1. Based on the predicted amino acid sequence, this short fragment is most likely the result of an in-frame deletion of a duplication of the DNA sequence containing the first 66-bp blocks of region 1 (Fig. 3). The comparison of the nucleotide sequences of the S288C and CBS1171 SED1 genes also revealed the existence of six single-nucleotide polymorphisms (SNPs) located in region 2 of the S288C gene (at positions 459, 510, 579, 630, 732, and 891). All these changes were at the third position of the respective codons and resulted in silent point mutations.
![]() View larger version (45K): [in a new window] |
FIG. 3. Multiple alignments of the predicted amino acid sequences of the seven SED1 alleles and the SED1 genes of CBS1171 and S288C. The repeat units within regions 1 and 2 are underlined. The dashes indicate gaps introduced to maximize the alignment, and the dots indicate identical residues. Amino acid coordinates are provided. The beginning and end of each repeat unit is indicated by a left and right "corner," respectively, above the sequence.
|
The Sed1-1 to Sed1-7 ORFs varied in length from 972 to 1,278 bp. Multiple sequence alignment confirmed that the SED1 length polymorphisms were due to the insertion or deletion of perfect duplicates of DNA sequences containing the 66- and 153-bp units in regions 1 and 2 and corresponding to tandem repeats of 22 and 51 amino acids, respectively. Thus, two minisatellite-like DNA sequences that exhibit the property of possessing a variable number of tandem repeats are responsible for SED1 polymorphism in a population of wild yeast (Fig. 4).
![]() View larger version (25K): [in a new window] |
FIG. 4. Alignment between repeat units in regions 1 and 2 of CBS1171 SED1 gene. Nucleotide coordinates are given.
|
The analysis of the changes in the sequences and the numbers of repeat units within regions 1 and/or 2 provided a way of exploring the routes leading to the SED1 polymorphism observed in the must isolates and led us to hypothesize that all the alleles have arisen from a common ancestor as a consequence of tandem duplication events of the repeat units within these two regions. For example, the comparison of both the sequences and structures of Sed1-1, Sed1-4, and Sed1-7 revealed that these three alleles differ in the number of 153-bp repeat units in region 2 (Fig. 2). Moreover, while their first and last 153-bp units are identical to the first and last such units of every other allele sequenced, the two central 153-bp repeat units of Sed1-7 are identical to the central repeat unit of Sed1-4 and differ by five substitutions from the first 153-bp repeat unit of region 2. These observations suggested that Sed1-4 might have originated from Sed1-1 subsequent to a tandem duplication of a 153-bp unit in region 2 followed by the occurrence of the five point mutations in this unit. Similarly, a plausible course of events in the history of Sed1-7 would have involved a tandem duplication of the Sed1-4 central 153-bp unit.
Segregation analysis of the seven SED1 alleles and other genetic markers.
Thirteen isolates representative of each PCR profile were subjected to genetic analyses as previously done by Mortimer et al. (21). As shown in Table 2, the 13 isolates were homothallic and exhibited various sporulation efficiencies and spore viabilities, as expected in wine strains of S. cerevisiae, due to their complex genetic constitution (2, 8). Apart from isolate Sc157, presenting PCR profile A, which was homozygous for all the characteristics considered, all the other isolates presented one or more heterozygosities, as expected in diploid strains (21).
|
View this table: [in a new window] |
TABLE 2. Genetic analyses of 13 isolates representative of SED1 PCR profiles
|
![]() View larger version (14K): [in a new window] |
FIG. 5. Segregation of SED1 length variants in F1 progeny. O and H, PCR profiles of isolates Sc93 and Sc59, respectively. The boxes contain the SED1 PCR profiles of the monosporial cultures deriving from each ascus. N, number of asci.
|
|
|
|---|
The sequence analysis of a representative of each of the observed SED1 length variants unequivocally identified them as seven alleles of the SED1 gene. However, as other alleles of the same size but with meaningful SNPs could have gone undetected by the PCR-RFLP procedure, it is plausible that a higher number of alleles is present within the population analyzed.
Aiming to redefine the structure of the gene and elucidate the sequence of events that may have led to the observed length variations, we first compared the SED1 sequences of the seven alleles with those of the two reference strains. The results obtained showed that the seven alleles analyzed have high structural and nucleotide sequence similarities with the SED1 allele of CBS1171, isolated as brewer's top yeast and considered the type strain of S. cerevisiae. On the other hand, the SED1 allele of S288C, which has both a recent "origin" and a complex pedigree (20), differs from those of CBS1171 and all the isolates in both the presence of a 42-bp block in region 1 and six silent SNPs in region 2, which are not present in either CBS1171 or the seven alleles sequenced. Other authors have already highlighted the existence of variations between feral S. cerevisiae and S288C, or its isogenic strains, concerning the coding sequences of different genes (18, 19). Our data, besides adding another interesting example to this list, underline once again that this laboratory strain may not be considered fully representative of the species S. cerevisiae.
The annealing of the SED1 sequences indicates that length variations are due to changes in the number of 66- and 153-bp tandem repeat units which mimic the characteristics of classic minisatellites. Thus, the observed gene length polymorphism is very likely a consequence of the different molecular mechanisms proposed for minisatellite array expansion and contraction (3, 10, 27). Hoyer et al. (16), Andersen and Nilsson-Tillgren (1), and Ingavale et al. (17) have reported the presence of repeat motifs within the ORFs of Saccharomyces carlsbergensis, Candida albicans, and Schizosaccharomyces pombe genes, respectively. Our results, besides suggesting that minisatellites in yeast are not as rare as initially supposed, represent the first description of polymorphic minisatellite-like sequences within the ORF of an S. cerevisiae gene.
Taking into account that Sed1p provides stress resistance during stationary phase, the phase in which S. cerevisiae carries out at least two-thirds of the alcoholic fermentation (23), one interesting question would be whether the generation and maintenance of different SED1 alleles offer wine yeasts an adaptive advantage in response to the numerous environmental stresses that they undergo during must fermentation. Indeed, the analyses of the predicted amino acid sequences of the alleles sequenced indicate that the observed changes in the number of the 66-bp repeat units in region 1 may affect the number of potential N-glycosylation sites. This could provide a basis for a great diversity in the sugar-mediated interactions with other cell wall components. Similarly, changes in the number of 153-bp repeat units in region 2, by changing the number of cysteine residues, could have functional and biological consequences for the cell wall properties, in case cysteines are involved in molecular disulfide bonds, or for intermolecular interaction with neighboring Sed1p or other cell wall proteins.
At present, the consequences of the observed length variations for the function of the resulting Sed1p have not been explored. However, the fact that all the different SED1 alleles are transcribed in strains homozygous or heterozygous for this gene (I. Mannazzu and P. Marinangeli, unpublished data) suggests that the observed length polymorphisms are tolerated and that any functional consequence for the resulting proteins should be compatible with the range of functional parameters seen in the population.
Evaluation of the distribution of the 13 SED1 PCR profiles within the population under study showed that the vast majority of the isolates are homozygous for the SED1 gene and indicated that PCR profile D predominates in the population analyzed (Table 1). Interestingly, a PCR survey of the SED1 gene in a population of 132 S. cerevisiae isolates from 26 different grapes and musts sampled in six different cellars located in the Puglia region of Italy showed a similar distribution of SED1 PCR profiles (I. Mannazzu, unpublished data), indicating that these data are probably not the consequence of a population sample bias.
The homothallic behavior, the low number of heterozygosities observed, and the 4:0 segregation ratio of the SED1 alleles in the isolates presenting PCR profiles A to E are consistent with the "genome renewal" model, according to which diploid homothallic wine strains evolve by meiotic rearrangements which lead to fitter homozygous progeny (21). Also, isolates presenting PCR profiles F to O, heterozygous for SED1, are homothallic and show a low number of heterozygosities. Thus, based on this evidence, it seems quite unlikely that the coexistence of different SED1 alleles in these isolates derives from recent conjugation events between strains carrying different alleles. On the other hand, a plausible course of events in the generation of SED1 heterozygosity in feral strains of S. cerevisiae could involve the occurrence of interallelic or intra-allelic recombination in isolates originally homozygous for this gene.
This work was supported by Fondo di Ateneo, 1999 (I.M.), and by Regione Marche-REG. CEE 2081/83 ob. 5b.
Present address: Dipartimento di Biologia Cellulare ed Oncologia, Consorzio M. Negri Sud, Chieti, Italy. ![]()
Present address: Dipartimento di Scienze Ambientali Agrarie e Biotecnologie Agro-Alimentari, Università degli Studi di Sassari, Sassari, Italy. ![]()
|
|
|---|
-minus region participate in cellular localization of yeast glycosylphosphatidylinositol-attached proteins. J. Bacteriol. 181:3886-3889.
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»