Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Optimizing FPGA-based Accelerator Design for Large-Scale Molecular Similarity Search (Special Session Paper)

Conference ·
 [1];  [2];  [3];  [4];  [2];  [5];  [5];  [1];  [1];  [6];  [2];  [1]
  1. University of Connecticut
  2. Stevens Institute Of Technology
  3. George Mason Universitiy
  4. University of New Mexico
  5. BATTELLE (PACIFIC NW LAB)
  6. University of Notre Dame
Molecular similarity search has been widely used in drug discovery to rapidly identify structurally similar compounds from large molecular databases. With the increasing size of chemical libraries, there is growing interest in the efficient ac- celeration of large-scale similarity search. Existing works mainly focus on CPU and GPU to accelerate the computation of Tatimoto coefficient in measuring the pairwise similarity between different molecular fingerprints. In this paper, we propose and optimize an FPGA-based accelerator design on exhaustive and approximate search algorithms. On exhaustive search using BitBound & fold- ing, we analyze the similarity cutoff and folding level relationship with search speedup and accuracy, and propose a scalable on- the-fly query engine on FPGAs to reduce the resource utilization and pipeline interval. We achieve a 450 million compounds-per- second processing throughput for a single query engine. On approximate search using hierarchical navigable small world (HNSW), a popular algorithm with high recall and query speed, we propose an FPGA-based graph traversal engine to utilize high throughput register array based priority queue and fine- grained distance calculation engine to increase the processing capability. Experimental results show that the proposed FPGA- based HNSW implementation achieves a 35× speedup than existing works on CPU. To the best of our knowledge, our FPGA- based implementation is the first attempt to accelerate molecular similarity search on FPGA and has the highest performance among existing approaches.
Research Organization:
Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-76RL01830
OSTI ID:
1855412
Report Number(s):
PNNL-SA-166147
Country of Publication:
United States
Language:
English

Similar Records

Accelerated Charged Particle Tracking with Graph Neural Networks on FPGAs
Conference · Sun Nov 29 23:00:00 EST 2020 · OSTI ID:1823535

Evaluation of the OpenCL AES Kernel using the Intel FPGA SDK for OpenCL
Technical Report · Thu Apr 20 00:00:00 EDT 2017 · OSTI ID:1357909

Evaluation of CHO Benchmarks on the Arria 10 FPGA using Intel FPGA SDK for OpenCL
Technical Report · Tue May 23 00:00:00 EDT 2017 · OSTI ID:1372106

Related Subjects