DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Real-time data analysis for medical diagnosis using FPGA-accelerated neural networks

Abstract

Real-time analysis of patient data during medical procedures can provide vital diagnostic feedback that significantly improves chances of success. With sensors becoming increasingly fast, frameworks such as Deep Neural Networks are required to perform calculations within the strict timing constraints for real-time operation. However, traditional computing platforms responsible for running these algorithms incur a large overhead due to communication protocols, memory accesses, and static (often generic) architectures. In this work, we implement a low-latency Multi-Layer Perceptron (MLP) processor using Field Programmable Gate Arrays (FPGAs). Unlike CPUs and Graphics Processing Units (GPUs), our FPGA-based design can directly interface sensors, storage devices, display devices and even actuators, thus reducing the delays of data movement between ports and compute pipelines. Moreover, the compute pipelines themselves are tailored specifically to the application, improving resource utilization and reducing idle cycles. We demonstrate the effectiveness of our approach using mass-spectrometry data sets for real-time cancer detection. We demonstrate that correct parameter sizing, based on the application, can reduce latency by 20% on average. Furthermore, we show that in an application with tightly coupled data-path and latency constraints, having a large amount of computing resources can actually reduce performance. Using mass-spectrometry benchmarks, we show that our proposedmore » FPGA design outperforms both CPU and GPU implementations, with an average speedup of 144x and 21x, respectively. In our work, we demonstrate the importance of application-specific optimizations in order to minimize latency and maximize resource utilization for MLP inference. By directly interfacing and processing sensor data with ultra-low latency, FPGAs can perform real-time analysis during procedures and provide diagnostic feedback that can be critical to achieving higher percentages of successful patient outcomes.« less

Authors:
 [1];  [1];  [2];  [2];  [1]
  1. Boston Univ., MA (United States)
  2. Argonne National Lab. (ANL), Argonne, IL (United States)
Publication Date:
Research Org.:
Argonne National Laboratory (ANL), Argonne, IL (United States)
Sponsoring Org.:
National Science Foundation (NSF); USDOE
OSTI Identifier:
1493450
Grant/Contract Number:  
AC02-06CH11357
Resource Type:
Accepted Manuscript
Journal Name:
BMC Bioinformatics
Additional Journal Information:
Journal Volume: 19; Journal Issue: S18; Journal ID: ISSN 1471-2105
Publisher:
BioMed Central
Country of Publication:
United States
Language:
English
Subject:
60 APPLIED LIFE SCIENCES

Citation Formats

Sanaullah, Ahmed, Yang, Chen, Alexeev, Yuri, Yoshii, Kazutomo, and Herbordt, Martin C. Real-time data analysis for medical diagnosis using FPGA-accelerated neural networks. United States: N. p., 2018. Web. doi:10.1186/s12859-018-2505-7.
Sanaullah, Ahmed, Yang, Chen, Alexeev, Yuri, Yoshii, Kazutomo, & Herbordt, Martin C. Real-time data analysis for medical diagnosis using FPGA-accelerated neural networks. United States. https://doi.org/10.1186/s12859-018-2505-7
Sanaullah, Ahmed, Yang, Chen, Alexeev, Yuri, Yoshii, Kazutomo, and Herbordt, Martin C. Fri . "Real-time data analysis for medical diagnosis using FPGA-accelerated neural networks". United States. https://doi.org/10.1186/s12859-018-2505-7. https://www.osti.gov/servlets/purl/1493450.
@article{osti_1493450,
title = {Real-time data analysis for medical diagnosis using FPGA-accelerated neural networks},
author = {Sanaullah, Ahmed and Yang, Chen and Alexeev, Yuri and Yoshii, Kazutomo and Herbordt, Martin C.},
abstractNote = {Real-time analysis of patient data during medical procedures can provide vital diagnostic feedback that significantly improves chances of success. With sensors becoming increasingly fast, frameworks such as Deep Neural Networks are required to perform calculations within the strict timing constraints for real-time operation. However, traditional computing platforms responsible for running these algorithms incur a large overhead due to communication protocols, memory accesses, and static (often generic) architectures. In this work, we implement a low-latency Multi-Layer Perceptron (MLP) processor using Field Programmable Gate Arrays (FPGAs). Unlike CPUs and Graphics Processing Units (GPUs), our FPGA-based design can directly interface sensors, storage devices, display devices and even actuators, thus reducing the delays of data movement between ports and compute pipelines. Moreover, the compute pipelines themselves are tailored specifically to the application, improving resource utilization and reducing idle cycles. We demonstrate the effectiveness of our approach using mass-spectrometry data sets for real-time cancer detection. We demonstrate that correct parameter sizing, based on the application, can reduce latency by 20% on average. Furthermore, we show that in an application with tightly coupled data-path and latency constraints, having a large amount of computing resources can actually reduce performance. Using mass-spectrometry benchmarks, we show that our proposed FPGA design outperforms both CPU and GPU implementations, with an average speedup of 144x and 21x, respectively. In our work, we demonstrate the importance of application-specific optimizations in order to minimize latency and maximize resource utilization for MLP inference. By directly interfacing and processing sensor data with ultra-low latency, FPGAs can perform real-time analysis during procedures and provide diagnostic feedback that can be critical to achieving higher percentages of successful patient outcomes.},
doi = {10.1186/s12859-018-2505-7},
journal = {BMC Bioinformatics},
number = S18,
volume = 19,
place = {United States},
year = {Fri Dec 21 00:00:00 EST 2018},
month = {Fri Dec 21 00:00:00 EST 2018}
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record

Citation Metrics:
Cited by: 16 works
Citation information provided by
Web of Science

Save / Share:

Works referenced in this record:

FPGA acceleration of rigid-molecule docking codes
journal, May 2010


FPGA acceleration of rigid-molecule docking codes
journal, May 2010


Machine Learning in Medical Imaging
journal, July 2010

  • Wernick, Miles; Yang, Yongyi; Brankov, Jovan
  • IEEE Signal Processing Magazine, Vol. 27, Issue 4
  • DOI: 10.1109/MSP.2010.936730

Molecular Dynamics Simulations on High-Performance Reconfigurable Computing Systems
journal, November 2010

  • Chiu, Matt; Herbordt, Martin C.
  • ACM Transactions on Reconfigurable Technology and Systems, Vol. 3, Issue 4
  • DOI: 10.1145/1862648.1862653

A reconfigurable fabric for accelerating large-scale datacenter services
journal, October 2014

  • Putnam, Andrew; Jan, Gopal; Michael, Gray
  • ACM SIGARCH Computer Architecture News, Vol. 42, Issue 3
  • DOI: 10.1145/2678373.2665678

Molecular Dynamics Simulations on High-Performance Reconfigurable Computing Systems
journal, November 2010

  • Chiu, Matt; Herbordt, Martin C.
  • ACM Transactions on Reconfigurable Technology and Systems, Vol. 3, Issue 4
  • DOI: 10.1145/1862648.1862653

Machine learning for real-time single-trial EEG-analysis: From brain–computer interfacing to mental state monitoring
journal, January 2008


Machine learning classifiers and fMRI: A tutorial overview
journal, March 2009


Classifier ensemble construction with rotation forest to improve medical diagnosis performance of machine learning algorithms
journal, December 2011


A Framework for Medical Image Retrieval Using Machine Learning and Statistical Similarity Matching Techniques With Relevance Feedback
journal, January 2007

  • Rahman, Md. Mahmudur; Bhattacharya, Prabir; Desai, Bipin C.
  • IEEE Transactions on Information Technology in Biomedicine, Vol. 11, Issue 1
  • DOI: 10.1109/titb.2006.884364

Classifier ensemble construction with rotation forest to improve medical diagnosis performance of machine learning algorithms
journal, December 2011


In-Datacenter Performance Analysis of a Tensor Processing Unit
journal, June 2017

  • Jouppi, Norman P.; Borchers, Al; Boyle, Rick
  • ACM SIGARCH Computer Architecture News, Vol. 45, Issue 2
  • DOI: 10.1145/3140659.3080246

Collective Communication on FPGA Clusters with Static Scheduling
journal, January 2017

  • Sheng, Jiayi; Xiong, Qingqing; Yang, Chen
  • ACM SIGARCH Computer Architecture News, Vol. 44, Issue 4
  • DOI: 10.1145/3039902.3039904

Classification of brain tumor type and grade using MRI texture and shape in a machine learning scheme
journal, December 2009

  • Zacharaki, Evangelia I.; Wang, Sumei; Chawla, Sanjeev
  • Magnetic Resonance in Medicine, Vol. 62, Issue 6
  • DOI: 10.1002/mrm.22147

MLP Neural Network Based Gas Classification System on Zynq SoC
journal, January 2016


Machine learning for medical diagnosis: history, state of the art and perspective
journal, August 2001


A review of content-based image retrieval systems in medical applications—clinical benefits and future directions
journal, February 2004

  • Müller, Henning; Michoux, Nicolas; Bandon, David
  • International Journal of Medical Informatics, Vol. 73, Issue 1
  • DOI: 10.1016/j.ijmedinf.2003.11.024

A review of content-based image retrieval systems in medical applications—clinical benefits and future directions
journal, February 2004

  • Müller, Henning; Michoux, Nicolas; Bandon, David
  • International Journal of Medical Informatics, Vol. 73, Issue 1
  • DOI: 10.1016/j.ijmedinf.2003.11.024

Realizing general MLP networks with minimal FPGA resources
conference, June 2009

  • Latino, Carl; Moreno-Armendariz, Marco A.; Hagan, Martin
  • 2009 International Joint Conference on Neural Networks (IJCNN 2009 - Atlanta)
  • DOI: 10.1109/IJCNN.2009.5178680

NCBI BLASTP on High-Performance Reconfigurable Computing Systems
journal, January 2015

  • Mahram, Atabak; Herbordt, Martin C.
  • ACM Transactions on Reconfigurable Technology and Systems, Vol. 7, Issue 4
  • DOI: 10.1145/2629691

Training neural network classifiers for medical decision making: The effects of imbalanced datasets on classification performance
journal, March 2008


Machine learning classifiers and fMRI: A tutorial overview
journal, March 2009


Efficient FPGA Implementation of Sigmoid and Bipolar Sigmoid Activation Functions for Multilayer Perceptrons
journal, June 2012


Machine learning for real-time single-trial EEG-analysis: From brain–computer interfacing to mental state monitoring
journal, January 2008


Bonded Force Computations on FPGAs
conference, April 2017

  • Xiong, Qingqing; Herbordt, Martin C.
  • 2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)
  • DOI: 10.1109/fccm.2017.49

FPGA-Accelerated Particle-Grid Mapping
conference, May 2016

  • Sanaullah, Ahmed; Khoshparvar, Arash; Herbordt, Martin C.
  • 2016 IEEE 24th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)
  • DOI: 10.1109/fccm.2016.53

Development and Implementation of Parameterized FPGA-Based General Purpose Neural Networks for Online Applications
journal, February 2011

  • Gomperts, Alexander; Ukil, Abhisek; Zurfluh, Franz
  • IEEE Transactions on Industrial Informatics, Vol. 7, Issue 1
  • DOI: 10.1109/TII.2010.2085006

A reconfigurable fabric for accelerating large-scale datacenter services
journal, October 2014

  • Putnam, Andrew; Jan, Gopal; Michael, Gray
  • ACM SIGARCH Computer Architecture News, Vol. 42, Issue 3
  • DOI: 10.1145/2678373.2665678

Collective Communication on FPGA Clusters with Static Scheduling
journal, January 2017

  • Sheng, Jiayi; Xiong, Qingqing; Yang, Chen
  • ACM SIGARCH Computer Architecture News, Vol. 44, Issue 4
  • DOI: 10.1145/3039902.3039904