skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Using populations of human and microbial genomes for organism detection in metagenomes

Journal Article · · Genome Research
 [1];  [2];  [3];  [4];  [1];  [4]
  1. Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States). Center for Applied Scientific Computing
  2. Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States). Global Security Computer Applications Division
  3. Univ. of Valencia, Valencia (Spain). Inst. de Fisica Corpuscular
  4. Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States). Global Security Computer Applications Division

Identifying causative disease agents in human patients from shotgun metagenomic sequencing (SMS) presents a powerful tool to apply when other targeted diagnostics fail. Numerous technical challenges remain, however, before SMS can move beyond the role of research tool. Accurately separating the known and unknown organism content remains difficult, particularly when SMS is applied as a last resort. The true amount of human DNA that remains in a sample after screening against the human reference genome and filtering nonbiological components left from library preparation has previously been underreported. In this study, we create the most comprehensive collection of microbial and reference-free human genetic variation available in a database optimized for efficient metagenomic search by extracting sequences from GenBank and the 1000 Genomes Project. The results reveal new human sequences found in individual Human Microbiome Project (HMP) samples. Individual samples contain up to 95% human sequence, and 4% of the individual HMP samples contain 10% or more human reads. In conclusion, left unidentified, human reads can complicate and slow down further analysis and lead to inaccurately labeled microbial taxa and ultimately lead to privacy concerns as more human genome data is collected.

Research Organization:
Lawrence Livermore National Laboratory (LLNL), Livermore, CA (United States)
Sponsoring Organization:
USDOE
Grant/Contract Number:
AC52-07NA27344; 33-ER-2012; 08-ER-2011
OSTI ID:
1263571
Journal Information:
Genome Research, Vol. 25, Issue 7; ISSN 1088-9051
Publisher:
Cold Spring Harbor Laboratory PressCopyright Statement
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 24 works
Citation information provided by
Web of Science

References (40)

Genome sequences of rare, uncultured bacteria obtained by differential coverage binning of multiple metagenomes journal May 2013
DNA signatures for detecting genetic engineering in bacteria journal January 2008
Basic Local Alignment Search Tool journal October 1990
Scalable metagenomic taxonomy classification using a reference genome database journal July 2013
Design and Optimization of a Metagenomics Analysis Workflow for NVRAM conference May 2014
Comparison of Sequencing Utility Programs journal January 2013
GenBank journal November 2012
Rapid phylogenetic and functional classification of short genomic fragments with signature peptides journal January 2012
Clinical PathoScope: rapid alignment and filtration for accurate pathogen identification in clinical samples using unassembled sequencing data journal January 2014
Full Genome Virus Detection in Fecal Samples Using Sensitive Nucleic Acid Preparation, Deep Sequencing, and a Novel Iterative Sequence Classification Algorithm journal April 2014
Geographic population structure analysis of worldwide human populations infers their biogeographical origins journal April 2014
Bacterial genome sequencing in the clinic: bioinformatic challenges and solutions journal November 2013
CD-HIT: accelerated for clustering the next-generation sequencing data journal October 2012
lobSTR: A short tandem repeat profiler for personal genomes journal April 2012
Novel Plasmid and Its Variant Harboring both a bla NDM-1 Gene and Type IV Secretion System in Clinical Isolates of Acinetobacter lwoffii journal April 2012
Short read alignment with populations of genomes journal June 2013
Repbase Update, a database of eukaryotic repetitive elements journal January 2005
A Physicians' Wish List for the Clinical Application of Intestinal Metagenomics journal April 2014
Mycoplasma contamination in the 1000 Genomes Project journal April 2014
Common Contaminants in Next-Generation Sequencing That Hinder Discovery of Low-Abundance Microbes journal May 2014
Accurate and fast estimation of taxonomic profiles from metagenomic shotgun sequences journal January 2011
A fast, lock-free approach for efficient parallel counting of occurrences of k-mers journal January 2011
Optimizing Read Mapping to Reference Genomes to Determine Composition and Species Prevalence in Microbial Communities journal June 2012
SIANN: Strain Identification by Alignment to Near Neighbors posted_content January 2014
A cloud-compatible bioinformatics pipeline for ultrarapid pathogen identification from next-generation sequencing of clinical samples journal June 2014
Direct Metagenomic Detection of Viral Pathogens in Nasal and Fecal Specimens Using an Unbiased High-Throughput Sequencing Approach journal January 2009
Identification and assembly of genomes and genetic elements in complex metagenomic samples without using reference genomes journal July 2014
Derived immune and ancestral pigmentation alleles in a 7,000-year-old Mesolithic European journal January 2014
Reagent and laboratory contamination can critically impact sequence-based microbiome analyses journal November 2014
Fast Identification and Removal of Sequence Contamination from Genomic and Metagenomic Datasets journal March 2011
Metagenomic microbial community profiling using unique clade-specific marker genes journal June 2012
Metagenomic species profiling using universal phylogenetic marker genes journal October 2013
MePIC, Metagenomic Pathogen Identification for Clinical Specimens journal January 2014
Strain/species identification in metagenomes using genome-specific markers journal February 2014
DI-MMAP: A High Performance Memory-Map Runtime for Data-Intensive Applications conference November 2012
The landscape of human STR variation journal August 2014
Kraken: ultrafast metagenomic sequence classification using exact alignments journal January 2014
Virus Identification in Unknown Tropical Febrile Illness Cases Using Deep Sequencing journal February 2012
RAPSearch2: a fast and memory-efficient protein similarity search tool for next-generation sequencing data journal October 2011
Identification of Novel Viruses Using VirusHunter -- an Automated Data Analysis Pipeline journal October 2013

Cited By (9)

Recentrifuge: Robust comparative analysis and contamination removal for metagenomics journal April 2019
Host genetic variation impacts microbiome composition across human body sites journal September 2015
Host‐derived population genomics data provides insights into bacterial and diatom composition of the killer whale skin journal October 2018
A novel variant of torque teno virus 7 identified in patients with Kawasaki disease journal December 2018
Re-analysis of metagenomic sequences from acute flaccid myelitis patients reveals alternatives to enterovirus D68 infection journal July 2015
Embracing the gut microbiota: the new frontier for inflammatory and infectious diseases journal January 2017
Host-derived population genomics data provides insights into bacterial and diatom composition of the killer whale skin posted_content July 2018
Gut microbiota composition and functional changes in inflammatory bowel disease and irritable bowel syndrome journal December 2018
Health and Disease Imprinted in the Time Variability of the Human Microbiome journal March 2017

Similar Records

Recruiting Human Microbiome Shotgun Data to Site-Specific Reference Genomes
Journal Article · Wed Jan 15 00:00:00 EST 2014 · PLoS ONE · OSTI ID:1263571

Review, Evaluation, and Directions for Gene-Targeted Assembly for Ecological Analyses of Metagenomes
Journal Article · Tue Oct 15 00:00:00 EDT 2019 · Frontiers in Genetics · OSTI ID:1263571

Improved Microbial Community Characterization of 16S rRNA via Metagenome Hybridization Capture Enrichment
Journal Article · Tue Apr 27 00:00:00 EDT 2021 · Frontiers in Microbiology · OSTI ID:1263571

Related Subjects