U.S. Department of Energy
Office of Scientific and Technical Information

Accelerating large scale de novo metagenome assembly using GPUs

Conference · Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis

Metagenomic workflows involve studying uncultured microorganisms directly from the environment. These environmental samples, when processed by modern sequencing machines, yield large and complex datasets that exceed the capabilities of metagenomic software. The increasing sizes and complexities of datasets make a strong case for exascale-capable metagenome assemblers. However, the underlying algorithmic motifs are not well suited for GPUs. This poses a challenge, since the majority of next-generation supercomputers will rely primarily on GPUs for computation. In this paper, we present a first-of-its-kind GPU-accelerated implementation of the local assembly approach that is an integral part of a widely used large-scale metagenome assembler, MetaHipMer. Local assembly uses algorithms that induce random memory accesses and non-deterministic workloads, which make GPU offloading a challenging task. Our GPU implementation outperforms the CPU version by about 7x and boosts the performance of MetaHipMer by 42% when running on 64 Summit nodes.
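
To make the abstract's point about random memory accesses and non-deterministic workloads concrete, here is a minimal Python sketch of a generic k-mer walk that extends a contig using a hash table built from reads. It is an illustration under stated assumptions, not MetaHipMer's local assembly code: the function names `build_kmer_table` and `extend_right`, the stopping rules, and the toy reads are all hypothetical.

```python
from collections import defaultdict


def build_kmer_table(reads, k):
    """Map each k-mer to a count of the bases observed right after it.

    Hypothetical helper for illustration only.
    """
    table = defaultdict(lambda: defaultdict(int))
    for read in reads:
        for i in range(len(read) - k):
            table[read[i:i + k]][read[i + k]] += 1
    return table


def extend_right(contig, table, k, max_steps=1000):
    """Extend a contig one base at a time while the next base is unambiguous.

    Each step is a hash-table lookup keyed by the last k bases, so the
    memory location touched depends on the data (random access), and the
    number of steps differs from contig to contig (uneven workload).
    """
    seen = set()
    steps = 0
    while steps < max_steps:
        kmer = contig[-k:]
        if kmer in seen:  # stop on a repeated k-mer to avoid looping forever
            break
        seen.add(kmer)
        followers = table.get(kmer)
        if not followers or len(followers) != 1:
            break  # dead end or fork: no unambiguous extension
        contig += next(iter(followers))
        steps += 1
    return contig, steps


if __name__ == "__main__":
    reads = ["ACGTTG", "GTTGCA"]  # toy data, assumed for the example
    table = build_kmer_table(reads, k=3)
    print(extend_right("ACG", table, k=3))
```

In this sketch, a walk may stop after zero steps or run for many, which is the kind of irregular, data-dependent work that makes an even distribution across GPU threads difficult.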

Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR) (SC-21)
DOE Contract Number:
AC02-05CH11231
OSTI ID:
1896672
Journal Information:
Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
Country of Publication:
United States
Language:
English
