skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: On the Impact of Widening Vector Registers on Sequence Alignment

Conference ·
DOI:https://doi.org/10.1109/ICPP.2016.65· OSTI ID:1340891

Vector extensions, such as SSE, have been part of the x86 since the 1990s, with applications in graphics, signal processing, and scientific applications. Although many algorithms and applications can naturally benefit from automatic vectorization techniques, there are still many that are difficult to vectorize due to their dependence on irregular data structures, dense branch operations, or data dependencies. Sequence alignment, one of the most widely used operations in bioinformatics workflows, has a computational footprint that features complex data dependencies. In this paper, we demonstrate that the trend of widening vector registers adversely affects the state-of-the-art sequence alignment algorithm based on striped data layouts. We present a practically efficient SIMD implementation of a parallel scan based sequence alignment algorithm that can better exploit wider SIMD units. We conduct comprehensive workload and use case analyses to characterize the relative behavior of the striped and scan approaches and identify the best choice of algorithm based on input length and SIMD width.

Research Organization:
Pacific Northwest National Lab. (PNNL), Richland, WA (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-76RL01830
OSTI ID:
1340891
Report Number(s):
PNNL-SA-118504; KJ0402000
Resource Relation:
Conference: 45th International Conference on Parallel Processing (ICPP 2016), August 15-19, 2016, Philadelphia, Pennsylvania, 506 - 515
Country of Publication:
United States
Language:
English

Similar Records

Pairwise Sequence Alignment Library
Software · Wed May 20 00:00:00 EDT 2015 · OSTI ID:1340891

Parasail: SIMD C library for global, semi-global, and local pairwise sequence alignments
Journal Article · Wed Feb 10 00:00:00 EST 2016 · BMC Bioinformatics · OSTI ID:1340891

ADEPT: a domain independent sequence alignment strategy for gpu architectures
Journal Article · Tue Sep 15 00:00:00 EDT 2020 · BMC Bioinformatics · OSTI ID:1340891