Previous Article | Next Article 
Applied and Environmental Microbiology, December 2005, p. 7724-7736, Vol. 71, No. 12
0099-2240/05/$08.00+0 doi:10.1128/AEM.71.12.7724-7736.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
At Least 1 in 20 16S rRNA Sequence Records Currently Held in Public Repositories Is Estimated To Contain Substantial Anomalies
Kevin E. Ashelford,1*
Nadia A. Chuzhanova,3
John C. Fry,1
Antonia J. Jones,2 and
Andrew J. Weightman1
Cardiff School of Biosciences, Cardiff University, Main Building, Park Place, P.O. Box 915, Cardiff CF10 3TL, United Kingdom,1
Cardiff School of Computer Science, Cardiff University, Queen's Buildings, 5 The Parade, Roath, Cardiff CF24 3AA, United Kingdom,2
Biostatistics and Bioinformatics Unit and Institute of Medical Genetics, Cardiff School of Medicine, Cardiff University, Heath Park, Cardiff CF14 4XN, United Kingdom3
Received 7 April 2005/
Accepted 28 July 2005
A new method for detecting chimeras and other anomalies within 16S rRNA sequence records is presented. Using this method, we screened 1,399 sequences from 19 phyla, as defined by the Ribosomal Database Project, release 9, update 22, and found 5.0% to harbor substantial errors. Of these, 64.3% were obvious chimeras, 14.3% were unidentified sequencing errors, and 21.4% were highly degenerate. In all, 11 phyla contained obvious chimeras, accounting for 0.8 to 11% of the records for these phyla. Many chimeras (43.1%) were formed from parental sequences belonging to different phyla. While most comprised two fragments, 13.7% were composed of at least three fragments, often from three different sources. A separate analysis of the Bacteroidetes phylum (2,739 sequences) also revealed 5.8% records to be anomalous, of which 65.4% were apparently chimeric. Overall, we conclude that, as a conservative estimate, 1 in every 20 public database records is likely to be corrupt. Our results support concerns recently expressed over the quality of the public repositories. With 16S rRNA sequence data increasingly playing a dominant role in bacterial systematics and environmental biodiversity studies, it is vital that steps be taken to improve screening of sequences prior to submission. To this end, we have implemented our method as a program with a simple-to-use graphic user interface that is capable of running on a range of computer platforms. The program is called Pintail, is released under the terms of the GNU General Public License open source license, and is freely available from our website at http://www.cardiff.ac.uk/biosi/research/biosoft/.
* Corresponding author. Mailing address: Cardiff School of Biosciences, Cardiff University, Main Building, Park Place, P.O. Box 915, Cardiff CF10 3TL, United Kingdom. Phone: 44 (0)29 20 876002. Fax: 44 (0)29 20 874305. E-mail: ashelford{at}cardiff.ac.uk.
Applied and Environmental Microbiology, December 2005, p. 7724-7736, Vol. 71, No. 12
0099-2240/05/$08.00+0 doi:10.1128/AEM.71.12.7724-7736.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
This article has been cited by other articles:
-
Klitgaard, K., Boye, M., Capion, N., Jensen, T. K.
(2008). Evidence of Multiple Treponema Phylotypes Involved in Bovine Digital Dermatitis as Shown by 16S rRNA Gene Analysis and Fluorescence In Situ Hybridization. J. Clin. Microbiol.
46: 3012-3020
[Abstract]
[Full Text]
-
Hall, J. R., Mitchell, K. R., Jackson-Weaver, O., Kooser, A. S., Cron, B. R., Crossey, L. J., Takacs-Vesbach, C. D.
(2008). Molecular Characterization of the Diversity and Distribution of a Thermal Spring Microbial Community by Using rRNA and Metabolic Genes. Appl. Environ. Microbiol.
74: 4910-4922
[Abstract]
[Full Text]
-
Mendez, M. O., Neilson, J. W., Maier, R. M.
(2008). Characterization of a Bacterial Community in an Abandoned Semiarid Lead-Zinc Mine Tailing Site. Appl. Environ. Microbiol.
74: 3899-3907
[Abstract]
[Full Text]
-
Cardenas, E., Wu, W.-M., Leigh, M. B., Carley, J., Carroll, S., Gentry, T., Luo, J., Watson, D., Gu, B., Ginder-Vogel, M., Kitanidis, P. K., Jardine, P. M., Zhou, J., Criddle, C. S., Marsh, T. L., Tiedje, J. M.
(2008). Microbial Communities in Contaminated Sediments, Associated with Bioremediation of Uranium to Submicromolar Levels. Appl. Environ. Microbiol.
74: 3718-3729
[Abstract]
[Full Text]
-
Omoregie, E. O., Mastalerz, V., de Lange, G., Straub, K. L., Kappler, A., Roy, H., Stadnitskaia, A., Foucher, J.-P., Boetius, A.
(2008). Biogeochemistry and Community Composition of Iron- and Sulfur-Precipitating Microbial Mats at the Chefren Mud Volcano (Nile Deep Sea Fan, Eastern Mediterranean). Appl. Environ. Microbiol.
74: 3198-3215
[Abstract]
[Full Text]
-
Gieg, L. M., Duncan, K. E., Suflita, J. M.
(2008). Bioenergy Production via Microbial Conversion of Residual Oil to Natural Gas. Appl. Environ. Microbiol.
74: 3022-3029
[Abstract]
[Full Text]
-
Wery, N., Bru-Adan, V., Minervini, C., Delgenes, J.-P., Garrelly, L., Godon, J.-J.
(2008). Dynamics of Legionella spp. and Bacterial Populations during the Proliferation of L. pneumophila in a Cooling Tower Facility. Appl. Environ. Microbiol.
74: 3030-3037
[Abstract]
[Full Text]
-
Klassen, J. L., Foght, J. M.
(2008). Differences in Carotenoid Composition among Hymenobacter and Related Strains Support a Tree-Like Model of Carotenoid Evolution. Appl. Environ. Microbiol.
74: 2016-2022
[Abstract]
[Full Text]
-
Isenbarger, T. A., Finney, M., Rios-Velazquez, C., Handelsman, J., Ruvkun, G.
(2008). Miniprimer PCR, a New Lens for Viewing the Microbial World. Appl. Environ. Microbiol.
74: 840-849
[Abstract]
[Full Text]
-
Bibiloni, R., Tandon, P., Vargas-Voracka, F., Barreto-Zuniga, R., Lupian-Sanchez, A., Rico-Hinojosa, M. A., Guban, J., Fedorak, R., Tannock, G. W.
(2008). Differential clustering of bowel biopsy-associated bacterial profiles of specimens collected in Mexico and Canada: what do these profiles represent?. J Med Microbiol
57: 111-117
[Abstract]
[Full Text]
-
Pruesse, E., Quast, C., Knittel, K., Fuchs, B. M., Ludwig, W., Peplies, J., Glockner, F. O.
(2007). SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB. Nucleic Acids Res
35: 7188-7196
[Abstract]
[Full Text]
-
Newton, R. J., Jones, S. E., Helmus, M. R., McMahon, K. D.
(2007). Phylogenetic Ecology of the Freshwater Actinobacteria acI Lineage. Appl. Environ. Microbiol.
73: 7169-7176
[Abstract]
[Full Text]
-
Janda, J. M., Abbott, S. L.
(2007). 16S rRNA Gene Sequencing for Bacterial Identification in the Diagnostic Laboratory: Pluses, Perils, and Pitfalls. J. Clin. Microbiol.
45: 2761-2764
[Full Text]
-
Besemer, K., Singer, G., Limberger, R., Chlup, A.-K., Hochedlinger, G., Hodl, I., Baranyi, C., Battin, T. J.
(2007). Biophysical Controls on Community Succession in Stream Biofilms. Appl. Environ. Microbiol.
73: 4966-4974
[Abstract]
[Full Text]
-
Schmitt, S., Weisz, J. B., Lindquist, N., Hentschel, U.
(2007). Vertical Transmission of a Phylogenetically Complex Microbial Consortium in the Viviparous Sponge Ircinia felix. Appl. Environ. Microbiol.
73: 2067-2078
[Abstract]
[Full Text]
-
Cole, J. R., Chai, B., Farris, R. J., Wang, Q., Kulam-Syed-Mohideen, A. S., McGarrell, D. M., Bandela, A. M., Cardenas, E., Garrity, G. M., Tiedje, J. M.
(2007). The ribosomal database project (RDP-II): introducing myRDP space and quality controlled public data. Nucleic Acids Res
35: D169-D172
[Abstract]
[Full Text]
-
Kendall, M. M., Wardlaw, G. D., Tang, C. F., Bonin, A. S., Liu, Y., Valentine, D. L.
(2007). Diversity of Archaea in Marine Sediments from Skan Bay, Alaska, Including Cultivated Methanogens, and Description of Methanogenium boonei sp. nov.. Appl. Environ. Microbiol.
73: 407-414
[Abstract]
[Full Text]
-
Simmon, K. E., Croft, A. C., Petti, C. A.
(2006). Application of SmartGene IDNS Software to Partial 16S rRNA Gene Sequences for a Diverse Group of Bacteria in a Clinical Laboratory. J. Clin. Microbiol.
44: 4400-4406
[Abstract]
[Full Text]
-
Oline, D. K.
(2006). Phylogenetic Comparisons of Bacterial Communities from Serpentine and Nonserpentine Soils. Appl. Environ. Microbiol.
72: 6965-6971
[Abstract]
[Full Text]
-
Lloyd, K. G., Lapham, L., Teske, A.
(2006). An Anaerobic Methane-Oxidizing Community of ANME-1b Archaea in Hypersaline Gulf of Mexico Sediments. Appl. Environ. Microbiol.
72: 7218-7230
[Abstract]
[Full Text]
-
Piccini, C., Conde, D., Alonso, C., Sommaruga, R., Pernthaler, J.
(2006). Blooms of single bacterial species in a coastal lagoon of the southwestern atlantic ocean.. Appl. Environ. Microbiol.
72: 6560-6568
[Abstract]
[Full Text]
-
Pang, C. M., Liu, W.-T.
(2006). Biological Filtration Limits Carbon Availability and Affects Downstream Biofilm Formation and Community Structure. Appl. Environ. Microbiol.
72: 5702-5712
[Abstract]
[Full Text]
-
Ashelford, K. E., Chuzhanova, N. A., Fry, J. C., Jones, A. J., Weightman, A. J.
(2006). New Screening Software Shows that Most Recent Large 16S rRNA Gene Clone Libraries Contain Chimeras. Appl. Environ. Microbiol.
72: 5734-5741
[Abstract]
[Full Text]
-
DeSantis, T. Z., Hugenholtz, P., Larsen, N., Rojas, M., Brodie, E. L., Keller, K., Huber, T., Dalevi, D., Hu, P., Andersen, G. L.
(2006). Greengenes, a Chimera-Checked 16S rRNA Gene Database and Workbench Compatible with ARB.. Appl. Environ. Microbiol.
72: 5069-5072
[Abstract]
[Full Text]
Copyright © 2005 by the American Society for Microbiology. All rights reserved.