Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

FAIR for AI: An interdisciplinary and international community building perspective

Journal Article · · Scientific Data (Online)
 [1];  [1];  [2];  [3];  [4];  [5];  [4];  [6];  [1];  [7];  [8];  [9];  [10];  [11];  [11];  [4];  [12];  [6];  [11];  [13] more »;  [11];  [14];  [11];  [11] « less
  1. Argonne National Laboratory (ANL), Argonne, IL (United States); Univ. of Chicago, IL (United States)
  2. Duke Univ., Durham, NC (United States)
  3. Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States); University of California, Berkeley, CA (United States)
  4. Univ. of California, San Diego, CA (United States)
  5. Lund Univ. (Sweden); Univ. of Manchester (United Kingdom)
  6. Argonne National Laboratory (ANL), Argonne, IL (United States)
  7. Univ. of Virginia, Charlottesville, VA (United States)
  8. Massachusetts Inst. of Technology (MIT), Cambridge, MA (United States)
  9. Technical Univ. Munich (Germany)
  10. Brookhaven National Laboratory (BNL), Upton, NY (United States); Rutgers Univ., Piscataway, NJ (United States)
  11. Univ. of Illinois at Urbana-Champaign, IL (United States)
  12. Univ. of Helsinki (Finland)
  13. Institute of Applied Biosciences (Greece)
  14. Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
A foundational set of findable, accessible, interoperable, and reusable (FAIR) principles were proposed in 2016 as prerequisites for proper data management and stewardship, with the goal of enabling the reusability of scholarly data. The principles were also meant to apply to other digital assets, at a high level, and over time, the FAIR guiding principles have been re-interpreted or extended to include the software, tools, algorithms, and workflows that produce data. FAIR principles are now being adapted in the context of AI models and datasets. Here, we present the perspectives, vision, and experiences of researchers from different countries, disciplines, and backgrounds who are leading the definition and adoption of FAIR principles in their communities of practice, and discuss outcomes that may result from pursuing and incentivizing FAIR AI research. The material for this report builds on the FAIR for AI Workshop held at Argonne National Laboratory on June 7, 2022.
Research Organization:
Univ. of Virginia, Charlottesville, VA (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
Grant/Contract Number:
SC0023452; AC02-06CH11357; SC0021258; SC0021352; SC0021293
OSTI ID:
3000813
Journal Information:
Scientific Data (Online), Journal Name: Scientific Data (Online) Journal Issue: 1 Vol. 10; ISSN 2052-4463
Publisher:
Nature Publishing GroupCopyright Statement
Country of Publication:
United States
Language:
English

References (52)

The Materials Data Facility: Data Services to Advance Materials Science Research journal July 2016
Globus Nexus: A Platform-as-a-Service provider of research identity, profile, and group management journal March 2016
Deep learning at scale for the construction of galaxy catalogs in the Dark Energy Survey journal August 2019
Physics-inspired deep learning to characterize the signal manifold of quasi-circular, spinning, non-precessing binary black hole mergers journal September 2020
Polymer Nanocomposite Data: Curation, Frameworks, Access, and Potential for Discovery and Design journal July 2020
Now Is the Time to Build a National Data Ecosystem for Materials Science and Chemistry Research Data journal April 2022
Invest 5% of research funds in ensuring data are reusable journal February 2020
Accelerated, scalable and reproducible AI-driven gravitational wave detection journal July 2021
Advancing mathematics by guiding human intuition with AI journal December 2021
DOME: recommendations for supervised machine learning validation in biology journal July 2021
OPTIMADE, an API for exchanging materials data journal August 2021
A FAIR and AI-ready Higgs boson decay dataset journal February 2022
FAIR principles for AI models with a practical application for accelerated high energy diffraction microscopy journal November 2022
Scientific machine learning benchmarks journal April 2022
Dual use of artificial-intelligence-powered drug discovery journal March 2022
Integrating multiple materials science projects in a single neural network journal July 2020
Artificial intelligence for search and discovery of quantum materials journal October 2021
Automated discovery of fundamental variables hidden in experimental data journal July 2022
The FAIR Guiding Principles for scientific data management and stewardship journal March 2016
A design framework and exemplar metrics for FAIRness journal June 2018
Applications of physics informed neural operators journal May 2023
Deep transfer learning for star cluster classification: I. application to the PHANGS– HST survey journal February 2020
Star cluster classification in the PHANGS– HST survey: Comparison between human and machine learning approaches journal July 2021
Xception: Deep Learning with Depthwise Separable Convolutions conference July 2017
Explaining Explanations: An Overview of Interpretability of Machine Learning conference October 2018
DLHub: Model and Data Serving for Science conference May 2019
HPCFAIR: Enabling FAIR AI for HPC Applications conference November 2021
HPC Ontology: Towards a Unified Ontology for Managing Training Datasets and AI Models for High-Performance Computing conference November 2021
Playing catch-up in building an open research commons journal July 2022
ImageNet classification with deep convolutional neural networks journal May 2017
funcX: A Federated Function Serving Fabric for Science
  • Chard, Ryan; Babuji, Yadu; Li, Zhuozhao
  • HPDC '20: The 29th International Symposium on High-Performance Parallel and Distributed Computing, Proceedings of the 29th International Symposium on High-Performance Parallel and Distributed Computing https://doi.org/10.1145/3369583.3392683
conference June 2020
Sharing Begins at Home: How Continuous and Ubiquitous FAIRness Can Enhance Research Productivity and Data Reuse journal July 2022
FAIR Computational Workflows journal January 2020
Credible practice of modeling and simulation in healthcare: ten rules from a multidisciplinary perspective journal September 2020
Convergence of artificial intelligence and high performance computing on NSF-supported cyberinfrastructure journal October 2020
Reproducible big data science: A case study in continuous FAIRness journal April 2019
FAIR Principles for Research Software (FAIR4RS Principles) text January 2021
A data ecosystem to support machine learning in materials science journal October 2019
The materials tetrahedron has a “digital twin” journal February 2022
The Machine Learning landscape of top taggers journal January 2019
Data Science and Machine Learning in Education report July 2022
Interpretability of an Interaction Network for identifying $H \rightarrow b\bar{b}$ jets conference November 2022
Inference-Optimized AI and High Performance Computing for Gravitational Wave Detection at Scale journal February 2022
On the Opportunities and Risks of Foundation Models preprint January 2021
Data and Analysis Preservation, Recasting, and Reinterpretation preprint January 2022
PaLM: Scaling Language Modeling with Pathways preprint January 2022
Explainable AI for High Energy Physics preprint January 2022
A Detailed Study of Interpretability of Deep Neural Network based Top Taggers text January 2022
FAIR AI Models in High Energy Physics text January 2022
Magnetohydrodynamics with Physics Informed Neural Operators text January 2023
GPT-4 Technical Report preprint March 2024
Working Towards Understanding the Role of FAIR for Machine Learning text January 2021

Similar Records

FAIR principles for AI models with a practical application for accelerated high energy diffraction microscopy
Journal Article · Wed Nov 09 23:00:00 EST 2022 · Scientific Data · OSTI ID:1897765

Applying the FAIR Principles to computational workflows
Journal Article · Sun Feb 23 23:00:00 EST 2025 · Scientific Data (Online) · OSTI ID:2530828