Previous Article | Next Article 
Applied and Environmental Microbiology, December 2006, p. 7949-7953, Vol. 72, No. 12
0099-2240/06/$08.00+0 doi:10.1128/AEM.01046-06
Sequence Analysis of the rfb Loci, Encoding Proteins Involved in the Biosynthesis of the Salmonella enterica O17 and O18 Antigens: Serogroup-Specific Identification by PCR
Collette Fitzgerald,*
Linda Gheesling,
Marcus Collins, and
Patricia I. Fields
Foodborne and Diarrheal Diseases Branch, Division of Bacterial and Mycotic Diseases, National Center for Infectious Diseases, Centers for Disease Control and Prevention, Atlanta, Georgia
Received 5 May 2006/
Accepted 9 October 2006

ABSTRACT
We report sequencing of the O antigen encoded by the
rfb gene
cluster of
Salmonella enterica serotype Jangwani (O17) and
Salmonella serotype Cerro (O18). We developed serogroup O17- and O18-specific
PCR assays based on
rfb gene targets and found them to be sensitive
and specific for rapid identification of
Salmonella serogroups
O17 and O18.

INTRODUCTION
Serotyping of
Salmonella enterica based on O and H surface antigens
has proven to be extremely useful for understanding the host
range and disease spectrum of this pathogen (
20) and has been
an invaluable typing method for epidemiological investigations
(
2,
3,
19,
20). It does, however, have limitations; production
and quality control of the hundreds of antisera required for
serotyping is difficult and time consuming, and not all
Salmonella strains are typeable. Many of the enzymes involved in O-antigen
biosynthesis are encoded by an
rfb gene cluster which varies
in size for each serogroup depending on the sugar composition
and complexity of the O antigen (
24).
rfb gene clusters have
been characterized from a growing number of gram-negative bacteria
(
23), including 13
Salmonella serogroups (
9,
22,
26,
28). The
use of PCR targeting regions within
rfb gene clusters for O-antigen
identification of various organisms (
6,
7,
11,
27), including
Salmonella (
9,
10,
16), has been reported as an alternative
to O-group detection by traditional serotyping. The
rfb gene
clusters of the more common
Salmonella serogroups have been
the most extensively studied at the molecular level to date.
Little is known about the
rfb gene clusters of other
Salmonella O serogroups. As part of a larger project to develop a DNA-based
approach for serotyping
Salmonella, we report here the sequence
analysis of the
rfb gene cluster of two serogroups, O17 from
Salmonella enterica serotype Jangwani and O18 from
Salmonella serotype Cerro, and the development of serogroup-specific PCR
detection assays based on the
rfb gene targets
wzy and
wzx,
respectively.
Salmonella isolates 00-0073 (Salmonella serotype Jangwani, 17:a:1,5) and 99-0087 (Salmonella serotype Cerro, 18:z4,z23:[1,5]) were selected for rfb sequence analysis. Serotyping was performed according to the Kaufmann-White scheme (2, 3). For evaluation of the serogroup O17- and O18-specific PCR assays, 443 strains representing all of the subspecies/serogroup combinations recognized in the Kaufmann-White scheme were evaluated. The rfb regions of O17 and O18 were amplified by PCR using previously described primers (14). A combination of subcloning and primer walking was used to sequence the rfb regions in both directions on an automated DNA sequencer (PE Applied Biosystems, California). DNA sequence data were assembled and analyzed using Lasergene 99 (DNAStar) software.

O17 rfb gene cluster.
The O17-antigen structure is unknown. Chemical analyses suggest
that the O17 antigen contains a polysaccharide core comprised
of galactose, glucose, glucosamine, heptose, and 3-deoxy-oct-2-ulosonic
acid but has no reported additional sugars in its O17-antigen-specific
side chain (
15). Analysis of the DNA sequence of the
rfb gene
cluster from
Salmonella serotype Jangwani revealed 9.130 kbp
flanked by JUMPstart and
gnd and eight open reading frames (ORFs)
(Fig.
1A). All ORFs were analyzed using BLAST, and the genes
were named in accordance with Bacterial Polysaccharide Gene
Database (BPGD) nomenclature (
21). All genes had the same transcriptional
direction and have low G+C content (Table
1).
We identified the gene
mnaA, encoding a UDP-
N-acetylglucosamine
(GlcNAc)-2-epimerase. When analyzed using Pfam protein analysis,
the predicted protein exhibited significant identity to the
UDP-
N-acetylglucosamine-2-epimerase (PN 00235) protein family.
This enzyme catalyzes the reversible epimerization of UDP-
N-acetylglucosamine
(UDP-GlcNAc) and thereby provides the cell with UDP-
N-acetylmannosamine
(UDP-ManNAc), the activated donor of ManNAc residues. The presence
of this gene in the O17
rfb gene cluster suggests that ManNAc
may be one of the sugars in the
Salmonella O17 antigen.
orf7 showed the highest identity (65%) at the amino acid level with
Glf, a UDP-galactopyranose mutase from
Streptococcus suis. Salmonella serogroup B and D isolates have also been shown to have genes
with identity to
glf (Table
1), but they are not located in
the
rfb gene cluster. These enzymes catalyze the interconversion
of the pyranose (UDP-Gal
p) and furanose (UDP-Gal
f) forms of
galactose (
18). The presence of this gene in the O17
rfb gene
cluster suggests that galactofuranose is one of the sugars in
the O17 antigen. The O17
rfb gene cluster contains four ORFs
predicted to encode putative transferases.
orf3, -5, -6, and
-8 shared various degrees of similarity with glycosyltransferases
from a range of other organisms (Table
1) and have been named
wfbQ,
wfbR,
wfbS, and
wfbU. Three of four putative O17 transferases
(
wfbQ,
wfbR, and
wfbS) are part of the GT4 family of retaining
glycosyltransferases, and
wfbU is a putative inverting GT2 family
glycosyltransferase based on the carbohydrate-active enzyme
database (CAZy) (
http://www.cazy.org) classification (
4,
5).
We identified two ORFs encoding hydrophobic proteins with 10
and 11 predicted transmembrane domains. These are characteristic
features for the O-antigen polymerase and transporter proteins,
and we named these genes
wzy and
wzx. The putative O17 Wzx protein
had similarities to several putative polysaccharide repeat unit
transporters (Table
1).

O18 rfb gene cluster.
The O18-antigen-specific O-subunit structure has been determined
using nuclear magnetic resonance and chemicals methods (
25).
Analysis of the DNA sequence from the
Salmonella enterica O18
rfb gene cluster revealed 9.018 kbp flanked by JUMPstart and
the
gnd gene and contained seven open reading frames (Table
2). Five of the seven genes had a low G+C content ranging from
28.27% to 31.21% (Fig.
1B), a feature characteristic of other
Salmonella rfb gene clusters. The two genes,
manC and
manB,
had G+C contents of 47.46% and 55.29%, respectively. All of
the genes have the same transcriptional direction. Since the
O18 antigen contains mannose, it was anticipated that the
rfb cluster would contain
manB, encoding phosphomannomutase, and
manC, encoding mannose-1-phosphate guanylytransferase, the two
genes responsible for the biosynthesis of GDP-mannose from mannose-6-phosphate
(
12).
orf3 and
orf4 were identified as
manB and
manC, respectively,
based on their high levels of identity to other GDP-mannose
biosynthesis genes (
12). The deduced gene products of
orf1,
orf2, and
orf7 shared identity to glycosyltransferases, and
we have named these genes
wfbV,
wfbW, and
wfbX. wfbV and
wfbW have specific homology to mannosyltransferases and retain glycosyltransferases
from family GT4 (
5). The remaining O:18 transferase, encoded
by
wfbV, could not be assigned to 1 of the 65 different glycosyltransferase
families previously described and was left unclassified (
5).
A Pfam search (
1) suggests it to be related to an acyltransferase
family (PF01757).
orf5 and
orf6 are predicted to encode hydrophobic
integral membrane proteins with 12 and 11 transmembrane domains,
respectively, and have a 45% and 48% amino acid identity to
the Wzy and Wzx proteins of
Salmonella enterica O6,14 (
9). We
have named these genes
wzy and
wzx, respectively.

O17- and O18-specific PCR.
A total of 46 pools of genomic DNA (50 ng/µl per isolate)
were made, corresponding to the 46 O serogroups; each pool contained
DNA from 1 to 12 isolates. The O17- and O18-specific primers
(O17, 5'-GGC TGG GTT GTG GCT TTT T-3' and 5'-CTT CCG AAA TCA
ATA GAA AAA TCA A-3'; and O18, 5'-CTC TAG GAT CAA CTG AAG GTG
GTC-3' and 5'-CAA CCC AGC AAT AAA GCA GAA-3') were derived from
sequences within
wzy and
wzx to generate a 565-bp and a 370-bp
PCR amplicon. PCR amplification was carried out with the following
parameters: 96°C for 2 min, 25 cycles of 94°C for 30
s, 58°C for 30 s, and 72°C for 45 s, and 72°C for
10 min. When a positive PCR result was seen for a particular
genomic DNA pool, individual DNA samples from isolates represented
in that pool were individually retested by PCR. A 565-bp amplicon
was generated only with the pool containing DNA from isolates
from different serotypes belonging to serogroup O17, and a 370-bp
amplicon was generated only with the pool containing DNA from
isolates from different serotypes belonging to serogroup O18.
All other pools, containing DNA from 398 strains representing
the other 44 serogroups, were negative in the PCR assays. DNA
samples from the 12 strains in each of the O17 and O18 pools
were individually tested in the PCR, and all generated a PCR
product of the expected size (Fig.
2). An additional 19 O17
strains and 20 O18 strains were tested, and all generated the
expected 565- and 370-bp PCR products, respectively.

Conclusions.
DNA sequence analysis of the
rfb gene cluster from
Salmonella serogroup O17 had limited identity to previously characterized
rfb gene clusters and appears to have been acquired by lateral
transfer from a low-G+C-content species. Evolution of most of
the O18
rfb gene cluster also occurred within a low-G+C-content
species prior to transfer to
S. enterica, but the cluster appears
to have been assembled from more than one source as the
manC and
manB genes at least appear to have been derived from the
Salmonella cps cluster. The O18
rfb gene cluster is most related
to
Salmonella serogroups O6,14 and C1. The PCR assays described
here form the basis for the development of molecular detection
assays for the identification of
Salmonella serogroups O17 and
O18 based on genes directly involved in the formation of the
O antigens. Together with the serogroup PCR assays that have
previously been described for serogroups A/D, B, C
2, and D (
16)
and, more recently, O6,14 (
9), we intend to identify additional
rfb targets specific for other
Salmonella O antigens associated
with the top 100 serotypes of human
Salmonella infection. Our
overall goal is to combine this serogroup identification system
with DNA targets specific for H antigens (
17) to establish a
comprehensive DNA-based scheme for the identification of the
major
Salmonella serotypes (
8). Currently, we are exploring
the utility of additional technologies, including Luminex, as
a potential platform to combine the O- and H-antigen molecular
assays on a single platform. This will allow a rapid and convenient
alternative for the definitive identification of
Salmonella serotypes attainable by nonspecialized laboratories.

Nucleotide sequence accession numbers.
The DNA sequences have been deposited in GenBank and assigned
accession numbers EF032634 and EF032635.

ACKNOWLEDGMENTS
We thank Bernard Henrissat (University of Marseille, France)
for classifying the glycosyltransferases from
Salmonella serogroups
O17 and O18 using the CAZy scheme and Kate Wilson and Peter
Reeves (University of Sydney, Australia) for assigning BPGD
gene names for the
Salmonella O17 and O18
rfb gene clusters.
The use of trade names is for identification only and does not imply endorsement by the Public Health Service or by the U.S. Department of Health and Human Services.

FOOTNOTES
* Corresponding author. Mailing address: Foodborne and Diarrheal Diseases Branch, Division of Bacterial and Mycotic Diseases, NCID, Centers for Disease Control and Prevention, 1600 Clifton Road NE, Atlanta, GA 30333. Phone: (404) 639-0838. Fax: (404) 639-3333. E-mail:
chf3{at}cdc.gov.

Published ahead of print on 20 October 2006. 

REFERENCES
1 - Bateman, A., L. Coin, R. Durbin, R. D. Finn, V. Hollich, S. Griffiths-Jones, A. Khanna, M. Marshall, S. Moxon, E. L. Sonnhammer, D. J. Studholme, C. Yeats, and S. R. Eddy. 2004. The Pfam protein families database. Nucleic Acids Res. 32:D138-D141.[Abstract/Free Full Text]
2 - Brenner, F. W. 1998. Modified Kaufmann-White scheme. Centers for Diseases Control and Prevention, Atlanta, Ga.
3 - Brenner, F. W., and A. C. McWhorter-Murlin. 1998. Identification and serotyping of Salmonella. Centers for Diseases Control and Prevention, Atlanta, Ga.
4 - Campbell, J. A., G. J. Davies, V. Bulone, and B. Henrissat. 1997. A classification of nucleotide-diphospho-sugar glycosyltransferases based on amino acid sequence similarities. Biochem. J. 326:929-939.
5 - Coutinho, P. M., E. Deleury, G. J. Davies, and B. Henrissat. 2003. An evolving hierarchical family classification for glycosyltransferases. J. Mol. Biol. 328:307-317.[CrossRef][Medline]
6 - Feng, L., S. N. Senchenkova, J. Tao, A. S. Shashkov, B. Liu, S. D. Shevelev, P. R. Reeves, J. Xu, Y. A. Knirel, and L. Wang. 2005. Structural and genetic characterization of enterohemorrhagic Escherichia coli O145 O antigen and development of an O145 serogroup-specific PCR assay. J. Bacteriol. 187:758-764.[Abstract/Free Full Text]
7 - Feng, L., J. Tao, H. Guo, J. Xu, Y. Li, F. Rezwan, P. Reeves, and L. Wang. 2004. Structure of the Shigella dysenteriae 7 O antigen gene cluster and identification of its antigen specific genes. Microb. Pathog. 36:109-115.[CrossRef][Medline]
8 - Fields, P. I., C. Fitzgerald, J. R. McQuistion, M. L. Collins, L. T. Brynant, and L. L. Gheesling. 2006. Multiplex, bead-based suspension arrays for the molecular determination of serotype in Salmonella. Presented at the International Symposium on Salmonella and Salmonellosis, Saint Malo, France.
9 - Fitzgerald, C., R. Sherwood, L. L. Gheesling, F. W. Brenner, and P. I. Fields. 2003. Molecular analysis of the rfb O antigen gene cluster of Salmonella enterica serogroup O:6,14 and development of a serogroup-specific PCR assay. Appl. Environ. Microbiol. 69:6099-6105.[Abstract/Free Full Text]
10 - Hoorfar, J., D. L. Baggesen, and P. H. Porting. 1999. A PCR-based strategy for simple and rapid identification of rough presumptive Salmonella isolates. J. Microbiol. Methods 35:77-84.[CrossRef][Medline]
11 - Hoshino, K., S. Yamasaki, A. K. Mukhopadhyay, S. Chakraborty, A. Basu, S. K. Bhattacharya, G. B. Nair, T. Shimada, and Y. Takeda. 1998. Development and evaluation of a multiplex PCR assay for rapid detection of toxigenic Vibrio cholerae O1 and O139. FEMS Immunol. Med. Microbiol. 20:201-207.[CrossRef][Medline]
12 - Jensen, S. O., and P. R. Reeves. 2001. Molecular evolution of the GDP-mannose pathway genes (manB and manC) in Salmonella enterica. Microbiology 147:599-610.[Abstract/Free Full Text]
13 - Lee, S. J., L. K. Romana, and P. R. Reeves. 1992. Sequence and structural analysis of the rfb (O antigen) gene cluster from a group C1 Salmonella enterica strain. J. Gen. Microbiol. 138:1843-1855.[Abstract/Free Full Text]
14 - Li, Q., and P. R. Reeves. 2000. Genetic variation of dTDP-L-rhamnose pathway genes in Salmonella enterica. Microbiology 146:2291-2307.[Abstract/Free Full Text]
15 - Lüderitz, O., A. M. Staub, and O. Westphal. 1966. Immunochemistry of O and R antigens of Salmonella and related Enterobacteriaceae. Bacteriol. Rev. 30:192-255.[Free Full Text]
16 - Luk, J. M. C., U. Kongmuang, P. R. Reeves, and A. A. Lindberg. 1993. Selective amplification of abequose and paratose synthase genes (rfb) by polymerase chain reaction for identification of Salmonella major serogroups (A, B, C2, and D). J. Clin. Microbiol. 31:2118-2123.[Abstract/Free Full Text]
17 - McQuiston, J. R., R. Parrenas, M. Ortiz-Rivera, L. Gheesling, F. Brenner, and P. I. Fields. 2004. Sequencing and comparative analysis of flagellin genes fliC, fljB, and flpA from Salmonella. J. Clin. Microbiol. 42:1923-1932.[Abstract/Free Full Text]
18 - Nassau, P. M., S. L. Martin, R. E. Brown, A. Weston, D. Monsey, M. R. McNeil, and K. Duncan. 1996. Galactofuranose biosynthesis in Escherichia coli K-12: identification and cloning of UDP-galactopyranose mutase. J. Bacteriol. 178:1047-1052.[Abstract/Free Full Text]
19 - Olsen, S. J., R. Bishop, F. W. Brenner, T. H. Roels, N. Bean, R. V. Tauxe, and L. Slutsker. 2001. The changing epidemiology of Salmonella: trends in serotypes isolated from humans in the United States, 1987-1997. J. Infect. Dis. 183:753-761.[CrossRef][Medline]
20 - Porwollik, S., and M. McClelland. 2003. Lateral gene transfer in Salmonella. Microbes Infect. 5:977-989.[CrossRef][Medline]
21 - Reeves, P. R., M. Hobbs, M. A. Valvano, M. Skurnik, C. Whitfield, D. Coplin, N. Kido, J. Klena, D. Maskell, C. R. Raetz, and P. D. Rick. 1996. Bacterial polysaccharide synthesis and gene nomenclature. Trends Microbiol. 4:495-503.[CrossRef][Medline]
22 - Samuel, G., J. P. Hogbin, L. Wang, and P. R. Reeves. 2004. Relationships of the Escherichia coli O157, O111, and O55 O-antigen gene clusters with those of Salmonella enterica and Citrobacter freundii, which express identical O antigens. J. Bacteriol. 186:6536-6543.[Abstract/Free Full Text]
23 - Samuel, G., and P. Reeves. 2003. Biosynthesis of O-antigens: genes and pathways involved in nucleotide sugar precursor synthesis and O-antigen assembly. Carbohydr. Res. 338:2503-2519.[CrossRef][Medline]
24 - Schnaitman, C. A., and J. D. Klena. 1993. Genetics of lipopolysaccharide biosynthesis in enteric bacteria. Microbiol. Rev. 57:655-682.[Abstract/Free Full Text]
25 - Vinogradov, E., L. Nossova, and J. Radziejewska-Lebrecht. 2004. The structure of the O-specific polysaccharide from Salmonella cerro (serogroup K, O:6,14,18). Carbohydr. Res. 339:2441-2443.[CrossRef][Medline]
26 - Wang, L., and P. R. Reeves. 2000. The Escherichia coli O111 and Salmonella enterica O35 gene clusters: gene clusters encoding the same colitose-containing O antigen are highly conserved. J. Bacteriol. 182:5256-5261.[Abstract/Free Full Text]
27 - Weynants, V., V. Jadot, P. A. Denoel, A. Tibor, and J. J. Letesson. 1996. Detection of Yersinia enterocolitica serogroup O:3 by a PCR method. J. Clin. Microbiol. 34:1224-1227.[Abstract]
28 - Xiang, S. H., A. M. Haase, and P. R. Reeves. 1993. Variation of the rfb gene clusters in Salmonella enterica. J. Bacteriol. 175:4877-4884.[Abstract/Free Full Text]
Applied and Environmental Microbiology, December 2006, p. 7949-7953, Vol. 72, No. 12
0099-2240/06/$08.00+0 doi:10.1128/AEM.01046-06
This article has been cited by other articles:
-
Arrach, N., Porwollik, S., Cheng, P., Cho, A., Long, F., Choi, S.-H., McClelland, M.
(2008). Salmonella Serovar Identification Using PCR-Based Detection of Gene Presence and Absence. J. Clin. Microbiol.
46: 2581-2589
[Abstract]
[Full Text]
-
Mullane, N., O'Gaora, P., Nally, J. E., Iversen, C., Whyte, P., Wall, P. G., Fanning, S.
(2008). Molecular Analysis of the Enterobacter sakazakii O-Antigen Gene Locus. Appl. Environ. Microbiol.
74: 3783-3794
[Abstract]
[Full Text]
-
Fitzgerald, C., Collins, M., van Duyne, S., Mikoleit, M., Brown, T., Fields, P.
(2007). Multiplex, Bead-Based Suspension Array for Molecular Determination of Common Salmonella Serogroups. J. Clin. Microbiol.
45: 3323-3334
[Abstract]
[Full Text]