Table 3

Characteristics of ORFs in the C. sakazakii O4 O-antigen gene clustera

ORFGeneGene position (length [aa])G+C content (%)Pfam conserved domain(s) (Pfam no./e value)Similar protein(s), strain(s) (GenBank accession no.)% Identical (aa overlap)Putative function of protein
4-1rmlB124–1215 (363)45.51NAD-dependent epimerase/dehydratase family (PF01370/1.7e-78)dTDP-glucose 4,6-dehydratase, E. coli O103:H2 (YP_003222430)84 (356)Glucose dehydratase
4-2rmlA1217–2080 (287)42.24Nucleotidyltransferase (PF00483/6.8e-70)d-Glucose-1-phosphate thymidylyltransferase, E. coli O103:H2 strain 12009 (YP_003222429)84 (286)Glucose-1-phosphate thymidylyltransferase
4-3fdtA2082–2483 (133)35.57WxcM-like, C-terminal (PF05523/1.3e-54)dTDP-6-deoxy-3,4-keto-hexulose isomerase, E. coli O103:H2 strain 12009 (YP_003222428)64 (131)Hexose isomerase
4-4fdhC2476–3009 (177)32.95Putative butyryltransferase, E. coli O103:H2 (EDV84013)55 (177)Butyryltransferase
4-5fdtB3014–4120 (368)39.74DegT/DnrJ/EryC1/StrS aminotransferase family (PF01041/4e-118)Putative aminotransferase, E. coli (AAS73167)69 (372)Aminotransferase
4-6wzx4108–5394 (428)35.50Polysaccharide biosynthesis protein (PF01943/7.3e-13)O-antigen flippase Wzx, E. coli O103:H2 (YP_003222425)79 (419)Flippase
4-7wepD5378–6370 (330)34.44Glycosyltransferase family 2 (PF00535/2.1e-32)Putative glycosyltransferase, E. coli (EDV84144)56 (323)Glycosyltransferase
4-8wepE6367–7257 (297)34.34Glycosyltransferase family 2 (PF00535/7.6e-29)Putative glycosyltransferase, E. coli O103:H2 (YP_003222423)71 (296)Glycosyltransferase
4-9wzy7259–8413 (384)33.5Putative O-antigen polymerase, E. coli (ABK27352)51 (378)Polymerase
4-10wepF8410–9507 (365)39.7Glycosyltransferase group 1 (PF00534/3.5e-37)Putative glycosyltransferase, E. coli (AAS73172)67 (361)Glycosyltransferase
4-11wepG9511–10632 (374)39.48Glycosyltransferase group 1 (PF00534/4.5e-31)Putative galactosyltransferase, E. coli O103:H2 (ABK27323)60 (373)Galactosyltransferase
4-12gne10684–11706 (340)42.03NAD-dependent epimerase/dehydratase family (PF01370/1.7e-58)UDP-glucose 4-epimerase, H. parainfluenzae ATCC 33392 (ZP_08147973)67 (337)UDP-glucose 4-epimerase
  • a aa, amino acid(s).