Optimizing FPGA-based Accelerator Design for Large-Scale Molecular Similarity Search (Special Session Paper)
- University of Connecticut
- Stevens Institute Of Technology
- George Mason Universitiy
- University of New Mexico
- BATTELLE (PACIFIC NW LAB)
- University of Notre Dame
Molecular similarity search has been widely used in drug discovery to rapidly identify structurally similar compounds from large molecular databases. With the increasing size of chemical libraries, there is growing interest in the efficient ac- celeration of large-scale similarity search. Existing works mainly focus on CPU and GPU to accelerate the computation of Tatimoto coefficient in measuring the pairwise similarity between different molecular fingerprints. In this paper, we propose and optimize an FPGA-based accelerator design on exhaustive and approximate search algorithms. On exhaustive search using BitBound & fold- ing, we analyze the similarity cutoff and folding level relationship with search speedup and accuracy, and propose a scalable on- the-fly query engine on FPGAs to reduce the resource utilization and pipeline interval. We achieve a 450 million compounds-per- second processing throughput for a single query engine. On approximate search using hierarchical navigable small world (HNSW), a popular algorithm with high recall and query speed, we propose an FPGA-based graph traversal engine to utilize high throughput register array based priority queue and fine- grained distance calculation engine to increase the processing capability. Experimental results show that the proposed FPGA- based HNSW implementation achieves a 35× speedup than existing works on CPU. To the best of our knowledge, our FPGA- based implementation is the first attempt to accelerate molecular similarity search on FPGA and has the highest performance among existing approaches.
- Research Organization:
- Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
- Sponsoring Organization:
- USDOE
- DOE Contract Number:
- AC05-76RL01830
- OSTI ID:
- 1855412
- Report Number(s):
- PNNL-SA-166147
- Country of Publication:
- United States
- Language:
- English
Similar Records
Accelerated Charged Particle Tracking with Graph Neural Networks on FPGAs
Evaluation of the OpenCL AES Kernel using the Intel FPGA SDK for OpenCL
Evaluation of CHO Benchmarks on the Arria 10 FPGA using Intel FPGA SDK for OpenCL
Conference
·
Sun Nov 29 23:00:00 EST 2020
·
OSTI ID:1823535
Evaluation of the OpenCL AES Kernel using the Intel FPGA SDK for OpenCL
Technical Report
·
Thu Apr 20 00:00:00 EDT 2017
·
OSTI ID:1357909
Evaluation of CHO Benchmarks on the Arria 10 FPGA using Intel FPGA SDK for OpenCL
Technical Report
·
Tue May 23 00:00:00 EDT 2017
·
OSTI ID:1372106