skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: LOGAN: High-Performance GPU-Based X-Drop Long-Read Alignment

Journal Article · · Proceedings - IEEE International Parallel and Distributed Processing Symposium (IPDPS)
 [1];  [2];  [2];  [3];  [1];  [3];  [2];  [3];  [2]
  1. Politecnico di Milano (Italy)
  2. Univ. of California, Berkeley, CA (United States)
  3. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)

Pairwise sequence alignment is one of the most computationally intensive kernels in genomic data analysis, accounting for more than 90% of the runtime for key bioinformatics applications. This method is particularly expensive for third-generation sequences due to the high computational cost of analyzing sequences of length between 1Kb and 1Mb. Given the quadratic overhead of exact pairwise algorithms for long alignments, the community primarily relies on approximate algorithms that search only for high-quality alignments and stop early when one is not found. In this work, we present the first GPU optimization of the popular X-drop alignment algorithm, that we named LOGAN. Results show that our high-performance multi-GPU implementation achieves up to 181.6 GCUPS and speed-ups up to 6.6× and 30.7× using 1 and 6 NVIDIA Tesla V100, respectively, over the state-of-the-art software running on two IBM Power9 processors using 168 CPU threads, with equivalent accuracy. We also demonstrate a 2.3× LOGAN speed-up versus ksw2, a state-of-art vectorized algorithm for sequence alignment implemented in minimap2, a long-read mapping software. Furthermore, to highlight the impact of our work on a real-world application, we couple LOGAN with a many-to-many long-read alignment software called BELLA, and demonstrate that our implementation improves the overall BELLA runtime by up to 10.6×. Finally, we adapt the Roofline model for LOGAN and demonstrate that our implementation is near optimal on the NVIDIA Tesla V100s.

Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Basic Energy Sciences (BES)
Grant/Contract Number:
AC02-05CH11231; AC05-00OR22725
OSTI ID:
1650093
Journal Information:
Proceedings - IEEE International Parallel and Distributed Processing Symposium (IPDPS), Vol. 2020; Conference: 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 18-22 May 2020; ISSN 1530-2075
Publisher:
IEEECopyright Statement
Country of Publication:
United States
Language:
English

References (20)

Striped Smith-Waterman speeds database searches six times over other SIMD implementations journal November 2006
Fast gapped-read alignment with Bowtie 2 journal March 2012
SWPS3 – fast multi-threaded vectorized Smith-Waterman for IBM Cell/B.E. and ×86/SSE2 journal January 2008
CUDASW++ 3.0: accelerating Smith-Waterman protein database search by coupling CPU and GPU SIMD instructions journal April 2013
160-fold acceleration of the Smith-Waterman algorithm using a field programmable gate array (FPGA) journal January 2007
Architectural optimizations for high performance and energy efficient Smith-Waterman implementation on FPGAs using OpenCL
  • Di Tucci, Lorenzo; O'Brien, Kenneth; Blott, Michaela
  • 2017 Design, Automation & Test in Europe Conference & Exhibition (DATE), Design, Automation & Test in Europe Conference & Exhibition (DATE), 2017 https://doi.org/10.23919/DATE.2017.7927082
conference March 2017
Darwin: A Genomics Co-processor Provides up to 15,000X Acceleration on Long Read Assembly journal March 2018
Accelerating Long Read Alignment on Three Processors
  • Feng, Zonghao; Qiu, Shuang; Wang, Lipeng
  • ICPP 2019: 48th International Conference on Parallel Processing, Proceedings of the 48th International Conference on Parallel Processing https://doi.org/10.1145/3337821.3337918
conference August 2019
Minimap2: pairwise alignment for nucleotide sequences journal May 2018
Basic Local Alignment Search Tool journal October 1990
A Greedy Algorithm for Aligning DNA Sequences journal February 2000
Human-Mouse Alignments with BLASTZ journal January 2003
Adaptive seeds tame genomic sequence comparison journal January 2011
Parameters for accurate genome alignment journal February 2010
Identification of common molecular subsequences journal March 1981
A general method applicable to the search for similarities in the amino acid sequence of two proteins journal March 1970
Introducing difference recurrence relations for faster semi-global alignment of long sequences journal February 2018
SeqAn An efficient, generic C++ library for sequence analysis journal January 2008
An Instruction Roofline Model for GPUs conference November 2019
Roofline: an insightful visual performance model for multicore architectures journal April 2009

Similar Records

LOGAN: High-Performance X-Drop Pairwise Alignment on GPU (LOGAN) v1.0
Software · Tue Nov 05 00:00:00 EST 2019 · OSTI ID:1650093

High performance sparse multifrontal solvers on modern GPUs
Journal Article · Sat Feb 05 00:00:00 EST 2022 · Parallel Computing · OSTI ID:1650093

Accelerating the density-functional tight-binding method using graphical processing units
Journal Article · Fri Feb 24 00:00:00 EST 2023 · Journal of Chemical Physics · OSTI ID:1650093

Related Subjects