AEM Accepts, published online ahead of print on 11 January 2008
This Article
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrowReprints and Permissions
Right arrow Copyright Information
Right arrow Books from ASM Press
Right arrow MicrobeWorld
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Wommack, K. E.
Right arrow Articles by Ravel, J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Wommack, K. E.
Right arrow Articles by Ravel, J.
Agricola
Right arrow Articles by Wommack, K. E.
Right arrow Articles by Ravel, J.

 Previous Article  |  Next Article 

Appl. Environ. Microbiol. doi:10.1128/AEM.02181-07
Copyright (c) 2008, American Society for Microbiology and/or the Listed Authors/Institutions. All Rights Reserved.

Metagenomics: Read length matters

K. Eric Wommack, Jaysheel Bhavsar, and Jacques Ravel*

Delaware Biotechnology Institute, University of Delaware, 15 Innovation Way, Newark, DE 19711, USA.; Institute for Genome Sciences, Department of Microbiology & Immunology, University of Maryland School of Medicine, 20 Penn Street, Baltimore, MD 21201, USA

* To whom correspondence should be addressed. Email: jravel{at}som.umaryland.edu.


arrow
Abstract

Obtaining an unbiased view of the phylogenetic composition and functional diversity within a microbial community is one central objective of metagenomic analysis. New technologies, such as 454 pyrosequencing, have dramatically reduced sequencing costs to a level where metagenomic analysis may become a viable alternative to more focused assessments of the phylogenetic (eg. 16S rDNA) and functional diversity of microbial communities. To determine whether the short (~100-200 bp) sequence reads obtained from pyrosequencing are appropriate for phylogenetic and functional characterization of microbial communities the results of BLAST and COG analyses were compared for long (~750 bp) and randomly derived short reads from each of two microbial and one virioplankton metagenome libraries. Overall, BLASTX searches against GenBank nr found far fewer homologs within the short sequence library. This was especially pronounced for a Chesapeake Bay virioplankton metagenome library. Increasing the short read sampling depth or the length of derived short reads (up to 400 bp) did not completely resolve the discrepancy in BLASTX homolog detection. Only in cases where the long read sequence had a close homolog (low BLAST E-score) did the derived short read sequence also find a significant homolog. Thus, more distant homologs of microbial and viral genes are not detected by short read sequences. Among COG hits, derived short reads sampled at a depth of two short reads per long read missed up to 72% of the COGs found using long reads. Noting the current limitation in computational approaches for analysis of short sequences, use of short read length libraries does not appear to be an appropriate tool for metagenomic characterization of microbial communities.




This article has been cited by other articles:

  • Hamady, M., Knight, R. (2009). Microbial community profiling for human microbiome projects: Tools, techniques, and challenges. Genome Res 19: 1141-1152 [Abstract] [Full Text]  
  • Victoria, J. G., Kapoor, A., Li, L., Blinkova, O., Slikas, B., Wang, C., Naeem, A., Zaidi, S., Delwart, E. (2009). Metagenomic Analyses of Viruses in Stool Samples from Children with Acute Flaccid Paralysis. J. Virol. 83: 4642-4651 [Abstract] [Full Text]  
  • Helton, R. R., Wommack, K. E. (2009). Seasonal Dynamics and Metagenomic Characterization of Estuarine Viriobenthos Assemblages by Randomly Amplified Polymorphic DNA PCR. Appl. Environ. Microbiol. 75: 2259-2265 [Abstract] [Full Text]  
  • Lacerda, C. M. R., Reardon, K. F. (2009). Environmental proteomics: applications of proteome profiling in environmental microbiology and biotechnology. Brief Funct Genomic Proteomic 0: elp005v1-elp005 [Abstract] [Full Text]  
  • Kunin, V., Copeland, A., Lapidus, A., Mavromatis, K., Hugenholtz, P. (2008). A Bioinformatician's Guide to Metagenomics. Microbiol. Mol. Biol. Rev. 72: 557-578 [Abstract] [Full Text]  
  • Vega Thurber, R. L., Barott, K. L., Hall, D., Liu, H., Rodriguez-Mueller, B., Desnues, C., Edwards, R. A., Haynes, M., Angly, F. E., Wegley, L., Rohwer, F. L. (2008). Metagenomic analysis indicates that stressors induce production of herpes-like viruses in the coral Porites compressa. Proc. Natl. Acad. Sci. USA 105: 18413-18418 [Abstract] [Full Text]