AEM
Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrowReprints and Permissions
Right arrow Copyright Information
Right arrow Books from ASM Press
Right arrow MicrobeWorld
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Ashelford, K. E.
Right arrow Articles by Weightman, A. J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Ashelford, K. E.
Right arrow Articles by Weightman, A. J.
Agricola
Right arrow Articles by Ashelford, K. E.
Right arrow Articles by Weightman, A. J.

 Previous Article  |  Next Article 

Applied and Environmental Microbiology, December 2005, p. 7724-7736, Vol. 71, No. 12
0099-2240/05/$08.00+0     doi:10.1128/AEM.71.12.7724-7736.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.

At Least 1 in 20 16S rRNA Sequence Records Currently Held in Public Repositories Is Estimated To Contain Substantial Anomalies

Kevin E. Ashelford,1* Nadia A. Chuzhanova,3 John C. Fry,1 Antonia J. Jones,2 and Andrew J. Weightman1

Cardiff School of Biosciences, Cardiff University, Main Building, Park Place, P.O. Box 915, Cardiff CF10 3TL, United Kingdom,1 Cardiff School of Computer Science, Cardiff University, Queen's Buildings, 5 The Parade, Roath, Cardiff CF24 3AA, United Kingdom,2 Biostatistics and Bioinformatics Unit and Institute of Medical Genetics, Cardiff School of Medicine, Cardiff University, Heath Park, Cardiff CF14 4XN, United Kingdom3

Received 7 April 2005/ Accepted 28 July 2005

A new method for detecting chimeras and other anomalies within 16S rRNA sequence records is presented. Using this method, we screened 1,399 sequences from 19 phyla, as defined by the Ribosomal Database Project, release 9, update 22, and found 5.0% to harbor substantial errors. Of these, 64.3% were obvious chimeras, 14.3% were unidentified sequencing errors, and 21.4% were highly degenerate. In all, 11 phyla contained obvious chimeras, accounting for 0.8 to 11% of the records for these phyla. Many chimeras (43.1%) were formed from parental sequences belonging to different phyla. While most comprised two fragments, 13.7% were composed of at least three fragments, often from three different sources. A separate analysis of the Bacteroidetes phylum (2,739 sequences) also revealed 5.8% records to be anomalous, of which 65.4% were apparently chimeric. Overall, we conclude that, as a conservative estimate, 1 in every 20 public database records is likely to be corrupt. Our results support concerns recently expressed over the quality of the public repositories. With 16S rRNA sequence data increasingly playing a dominant role in bacterial systematics and environmental biodiversity studies, it is vital that steps be taken to improve screening of sequences prior to submission. To this end, we have implemented our method as a program with a simple-to-use graphic user interface that is capable of running on a range of computer platforms. The program is called Pintail, is released under the terms of the GNU General Public License open source license, and is freely available from our website at http://www.cardiff.ac.uk/biosi/research/biosoft/.


* Corresponding author. Mailing address: Cardiff School of Biosciences, Cardiff University, Main Building, Park Place, P.O. Box 915, Cardiff CF10 3TL, United Kingdom. Phone: 44 (0)29 20 876002. Fax: 44 (0)29 20 874305. E-mail: ashelford{at}cardiff.ac.uk.


Applied and Environmental Microbiology, December 2005, p. 7724-7736, Vol. 71, No. 12
0099-2240/05/$08.00+0     doi:10.1128/AEM.71.12.7724-7736.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.




This article has been cited by other articles:




Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
J. Bacteriol. Microbiol. Mol. Biol. Rev. Eukaryot. Cell All ASM Journals

Copyright © 2005 by the American Society for Microbiology. All rights reserved.