Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Enabling end-to-end secure federated learning in biomedical research on heterogeneous computing environments with APPFLx

Journal Article · · Computational and Structural Biotechnology Journal
 [1];  [2];  [3];  [4];  [5];  [5];  [6];  [7];  [6];  [6];  [2];  [6]
  1. Univ. of Illinois at Urbana-Champaign, IL (United States)
  2. Univ. of Chicago, IL (United States)
  3. Broad Institute, Cambridge, MA (United States)
  4. Argonne National Laboratory (ANL), Argonne, IL (United States); Georgia Institute of Technology, Atlanta, GA (United States)
  5. Argonne National Laboratory (ANL), Argonne, IL (United States); Univ. of Illinois at Urbana-Champaign, IL (United States)
  6. Argonne National Laboratory (ANL), Argonne, IL (United States)
  7. Arizona State Univ., Tempe, AZ (United States)

Facilitating large-scale, cross-institutional collaboration in biomedical machine learning (ML) projects requires a trustworthy and resilient federated learning (FL) environment to ensure that sensitive information such as protected health information is kept confidential. Specifically designed for this purpose, this work introduces APPFLx - a low-code, easy-to-use FL framework that enables easy setup, configuration, and running of FL experiments. APPFLx removes administrative boundaries of research organizations and healthcare systems while providing secure end-to-end communication, privacy-preserving functionality, and identity management. Furthermore, it is completely agnostic to the underlying computational infrastructure of participating clients, allowing an instantaneous deployment of this framework into existing computing infrastructures. Experimentally, the utility of APPFLx is demonstrated in two case studies: (1) predicting participant age from electrocardiogram (ECG) waveforms, and (2) detecting COVID-19 disease from chest radiographs. Here, ML models were securely trained across heterogeneous computing resources, including a combination of on-premise high-performance computing and cloud computing facilities. By securely unlocking data from multiple sources for training without directly sharing it, these FL models enhance generalizability and performance compared to centralized training models while ensuring data remains protected. In conclusion, APPFLx demonstrated itself as an easy-to-use framework for accelerating biomedical studies across organizations and healthcare systems on large datasets while maintaining the protection of private medical data.

Research Organization:
Argonne National Laboratory (ANL), Argonne, IL (United States)
Sponsoring Organization:
USDOE Office of Science (SC); National Institutes of Health (NIH)
Grant/Contract Number:
AC02-06CH11357
OSTI ID:
2586549
Journal Information:
Computational and Structural Biotechnology Journal, Journal Name: Computational and Structural Biotechnology Journal Vol. 28; ISSN 2001-0370
Publisher:
Elsevier BVCopyright Statement
Country of Publication:
United States
Language:
English

References (24)

Experiences building Globus Genomics: a next-generation sequencing analysis service using Galaxy, Globus, and Amazon Web Services: EXPERIENCES BUILDING GLOBUS GENOMICS
  • Madduri, Ravi K.; Sulakhe, Dinanath; Lacinski, Lukasz
  • Concurrency and Computation: Practice and Experience, Vol. 26, Issue 13 https://doi.org/10.1002/cpe.3274
journal April 2014
Utopia: A load sharing facility for large, heterogeneous distributed computer systems journal December 1993
A Real Time Processing system for big data in astronomy: Applications to HERA journal July 2021
The globus compute dataset: An open function-as-a-service dataset from the edge to the cloud journal April 2024
Deep neural network-estimated electrocardiographic age as a mortality predictor journal August 2021
Truly privacy-preserving federated analytics for precision medicine with multiparty homomorphic encryption journal October 2021
Accelerated, scalable and reproducible AI-driven gravitational wave detection journal July 2021
The UK Biobank resource with deep phenotyping and genomic data journal October 2018
Secure, privacy-preserving and federated machine learning in medical imaging journal June 2020
Common pitfalls and recommendations for using machine learning to detect and prognosticate for COVID-19 using chest radiographs and CT scans journal March 2021
End-to-end privacy preserving deep learning on multi-institutional medical imaging journal May 2021
Advancing COVID-19 diagnosis with privacy-preserving collaboration in artificial intelligence journal December 2021
2′-O methylation of RNA cap in SARS-CoV-2 captured by serial crystallography journal May 2021
OpenFL: the open federated learning library journal October 2022
MIDRC CRP10 AI interface—an integrated tool for exploring, testing and visualization of AI models journal March 2023
Globus Online: Accelerating and Democratizing Science through Cloud-Based Services journal May 2011
FedHealth: A Federated Transfer Learning Framework for Wearable Healthcare journal July 2020
Federated Learning With Differential Privacy: Algorithms and Performance Analysis journal January 2020
Do Gradient Inversion Attacks Make Federated Learning Unsafe? journal January 2023
Role of standard and soft tissue chest radiography images in deep-learning-based early diagnosis of COVID-19 journal September 2021
Software as a service for data scientists journal February 2012
FederatedScope: A Flexible Federated Learning Platform for Heterogeneity journal January 2023
Federated Learning for Edge Computing: A Survey journal September 2022
Predicting “Heart Age” Using Electrocardiography journal March 2014

Similar Records

Emerging Technologies for Privacy Preservation in Energy Systems
Conference · Wed Jun 05 00:00:00 EDT 2024 · OSTI ID:2475808

Adversarial Training for Privacy-Preserving Deep Learning Model Distribution
Conference · Sat Nov 30 23:00:00 EST 2019 · OSTI ID:1606810