Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Federated benchmarking of medical artificial intelligence with MedPerf

Journal Article · · Nature Machine Intelligence
; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; more »; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; « less
Medical artificial intelligence (AI) has tremendous potential to advance healthcare by supporting and contributing to the evidence-based practice of medicine, personalizing patient treatment, reducing costs, and improving both healthcare provider and patient experience. Unlocking this potential requires systematic, quantitative evaluation of the performance of medical AI models on large-scale, heterogeneous data capturing diverse patient populations. Here, to meet this need, we introduce MedPerf, an open platform for benchmarking AI models in the medical domain. MedPerf focuses on enabling federated evaluation of AI models, by securely distributing them to different facilities, such as healthcare organizations. This process of bringing the model to the data empowers each facility to assess and verify the performance of AI models in an efficient and human-supervised process, while prioritizing privacy. We describe the current challenges healthcare and AI communities face, the need for an open platform, the design philosophy of MedPerf, its current implementation status and real-world deployment, our roadmap and, importantly, the use of MedPerf with multiple international institutions within cloud-based technology and on-premises scenarios. Finally, we welcome new contributions by researchers and organizations to further strengthen MedPerf as an open benchmarking platform.
Research Organization:
Lawrence Livermore National Laboratory (LLNL), Livermore, CA (United States)
Sponsoring Organization:
AI Singapore Programme; Career Development Fund; Helmholtz Association; National Institutes of Health (NIH); USDOE National Nuclear Security Administration (NNSA)
Contributing Organization:
AI4SafeChole Consortium; BraTS-2020 Consortium; FeTS Consortium
Grant/Contract Number:
AC52-07NA27344
OSTI ID:
2203350
Report Number(s):
LLNL--JRNL-834413; 1052386
Journal Information:
Nature Machine Intelligence, Journal Name: Nature Machine Intelligence Journal Issue: 7 Vol. 5; ISSN 2522-5839
Publisher:
Springer NatureCopyright Statement
Country of Publication:
United States
Language:
English

References (42)

Geographic Distribution of US Cohorts Used to Train Deep Learning Algorithms journal September 2020
Association Between Surgical Skin Markings in Dermoscopic Images and Diagnostic Performance of a Deep Learning Convolutional Neural Network for Melanoma Recognition journal October 2019
Randomized Clinical Trials of Machine Learning Interventions in Health Care journal September 2022
How to Exploit Weaknesses in Biomedical Challenge Design and Organization book January 2018
TeCNO: Surgical Phase Recognition with Multi-stage Temporal Convolutional Networks book January 2020
A Review of Medical Federated Learning: Applications in Oncology and Cancer Research book January 2022
Mining Adverse Drug Reactions from Unstructured Mediums at Scale book November 2022
The EU General Data Protection Regulation (GDPR): A Practical Guide book January 2017
Implementation and Benefits of a Vendor-Neutral Archive and Enterprise-Imaging Management System in an Integrated Delivery Network journal October 2018
From knowledge to action: the impact of benchmarking on organizational performance journal June 1997
Continual learning in medical devices: FDA's action plan and beyond journal June 2021
Artificial intelligence for clinical oncology journal July 2021
Using HL7 FHIR to achieve interoperability in patient health record journal June 2019
Spark NLP: Natural Language Understanding at Scale journal May 2021
Accurate Clinical and Biomedical Named Entity Recognition at Scale journal August 2022
Why rankings of biomedical image analysis competitions should be interpreted with care journal December 2018
Federated learning enables big data for rare cancer boundary detection journal December 2022
How medical AI devices are evaluated: limitations and recommendations from an analysis of FDA approvals journal April 2021
Federated learning for predicting clinical outcomes in patients with COVID-19 journal September 2021
Multimodal biomedical AI journal September 2022
Federated learning for predicting histological response to neoadjuvant chemotherapy in triple-negative breast cancer journal January 2023
A deep learning algorithm to predict risk of pancreatic cancer from disease trajectories journal May 2023
Federated learning in medicine: facilitating multi-institutional collaborations without sharing patient data journal July 2020
Prognostic factors analysis for oral cavity cancer survival in the Netherlands and Taiwan using a privacy-preserving federated infrastructure journal November 2020
The “inconvenient truth” about AI in healthcare journal August 2019
The future of digital health with federated learning journal September 2020
End-to-end privacy preserving deep learning on multi-institutional medical imaging journal May 2021
GaNDLF: the generally nuanced deep learning framework for scalable end-to-end clinical workflows journal May 2023
HIPAA Regulations — A New Era of Medical-Record Privacy? journal April 2003
OpenFL: the open federated learning library journal October 2022
Patient data ownership: who owns your health? journal August 2021
Nimg-32. the Federated Tumor Segmentation (Fets) Initiative: the First Real-World Large-Scale Data-Private Collaboration Focusing on Neuro-Oncology journal November 2021
MLPerf: An Industry Standard Benchmark Suite for Machine Learning Performance journal March 2020
Dissecting racial bias in an algorithm used to manage the health of populations journal October 2019
Deep Models Under the GAN conference October 2017
Ethics of Using and Sharing Clinical Imaging Data for Artificial Intelligence: A Proposed Framework journal June 2020
Building Tools for Machine Learning and Artificial Intelligence in Cancer Research: Best Practices and a Case Study with the PathML Toolkit for Computational Pathology journal December 2021
Reproducible biomedical benchmarking in the cloud: lessons from crowd-sourced data challenges journal September 2019
Joint Imaging Platform for Federated Clinical Data Analytics journal November 2020
Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: A cross-sectional study journal November 2018
Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements conference January 2022
Twenty Years of Digital Pathology: An Overview of the Road Travelled, What is on the Horizon, and the Emergence of Vendor-Neutral Archives journal January 2018

Similar Records

Intern-Artificial Intelligence Benchmarking
Journal Article · Mon Jan 19 19:00:00 EST 2026 · No journal information · OSTI ID:3014039

Artificial Intelligence Benchmarking
Conference · Wed Aug 06 20:00:00 EDT 2025 · No journal information · OSTI ID:3019384

Artificial Intelligence
Journal Article · Mon Sep 30 20:00:00 EDT 2019 · Geographic Information Science & Technology Body of Knowledge · OSTI ID:1607204