A Bioinformatician's Guide to Metagenomics

Kunin, Victor; Copeland, Alex; Lapidus, Alla; Mavromatis, Konstantinos; Hugenholtz, Philip

doi:10.1128/MMBR.00009-08

A Bioinformatician's Guide to Metagenomics

Journal Article · Fri Aug 01 00:00:00 EDT 2008 · PLoS

DOI:https://doi.org/10.1128/MMBR.00009-08· OSTI ID:943447

Kunin, Victor; Copeland, Alex; Lapidus, Alla; Mavromatis, Konstantinos; Hugenholtz, Philip

As random shotgun metagenomic projects proliferate and become the dominant source of publicly available sequence data, procedures for best practices in their execution and analysis become increasingly important. Based on our experience at the Joint Genome Institute, we describe step-by-step the chain of decisions accompanying a metagenomic project from the viewpoint of a bioinformatician. We guide the reader through a standard workflow for a metagenomic project beginning with pre-sequencing considerations such as community composition and sequence data type that will greatly influence downstream analyses. We proceed with recommendations for sampling and data generation including sample and metadata collection, community profiling, construction of shotgun libraries and sequencing strategies. We then discuss the application of generic sequence processing steps (read preprocessing, assembly, and gene prediction and annotation) to metagenomic datasets by contrast to genome projects. Different types of data analyses particular to metagenomes are then presented including binning, dominant population analysis and gene-centric analysis. Finally data management systems and issues are presented and discussed. We hope that this review will assist bioinformaticians and biologists in making better-informed decisions on their journey during a metagenomic project.

Research Organization:: Ernest Orlando Lawrence Berkeley National Laboratory, Berkeley, CA (US)

Sponsoring Organization:: Genomics Division

DOE Contract Number:: AC02-05CH11231

OSTI ID:: 943447

Report Number(s):: LBNL-916E

Journal Information:: PLoS, Journal Name: PLoS

Country of Publication:: United States

Language:: English

Similar Records

Related Subjects

59 BASIC BIOLOGICAL SCIENCES
CONSTRUCTION
FORECASTING
GENES
MANAGEMENT
PROCESSING
RECOMMENDATIONS
SAMPLING

A Bioinformatician's Guide to Metagenomics

Citation Formats

Similar Records

Related Subjects