U.S. Department of Energy
Office of Scientific and Technical Information

Causal discovery from data assisted by large language models

Journal Article · Applied Physics Letters
DOI: https://doi.org/10.1063/5.0272287 · OSTI ID: 2997644
Knowledge-driven discovery of novel materials necessitates the development of causal models for property emergence. While in the classical physical paradigm, the causal relationships are deduced based on physical principles or via experiment, the rapid accumulation of observational data necessitates learning causal relationships between dissimilar aspects of material structure and functionalities based on observations. For this, it is essential to integrate experimental data with prior domain knowledge. Here, we demonstrate this approach by combining high-resolution scanning transmission electron microscopy data with insights derived from large language models (LLMs). By applying ChatGPT to domain-specific literature, such as arXiv papers on ferroelectrics, and combining the obtained information with data-driven causal discovery, we construct adjacency matrices for directed acyclic graphs that map the causal relationships between structural, chemical, and polarization degrees of freedom in Sm-doped BiFeO3. This approach enables us to hypothesize how synthesis conditions influence material properties and guides experimental validation. Furthermore, the ultimate objective of this work is to develop a unified framework that integrates LLM-driven literature analysis with data-driven discovery, facilitating the precise engineering of ferroelectric materials by establishing clear connections between synthesis conditions and their resulting material properties.
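The combination step described in the abstract, merging literature-derived knowledge with data-driven causal discovery into an adjacency matrix for a directed acyclic graph, can be illustrated with a minimal sketch. This is not the authors' implementation: the variable names, the edge scores, the threshold, and the encoding of the prior (+1 required, -1 forbidden, 0 no opinion) are all assumptions made for illustration, and the numbers are not results from the paper.

```python
from collections import deque

# Hypothetical variable names, loosely following the abstract's
# "structural, chemical, and polarization degrees of freedom".
VARIABLES = ["composition", "lattice_strain", "polarization"]


def combine(data_scores, prior, threshold=0.5):
    """Merge data-driven edge scores with a literature-derived prior.

    prior[i][j] is +1 (edge i -> j required), -1 (forbidden), or 0 (no opinion).
    Returns a binary adjacency matrix where adj[i][j] == 1 means i -> j.
    """
    n = len(data_scores)
    adj = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i == j or prior[i][j] == -1:
                continue  # no self-loops, and forbidden edges are dropped
            if prior[i][j] == 1 or data_scores[i][j] >= threshold:
                adj[i][j] = 1
    return adj


def is_dag(adj):
    """Kahn's algorithm: the graph is acyclic iff every node can be removed."""
    n = len(adj)
    indegree = [sum(adj[i][j] for i in range(n)) for j in range(n)]
    queue = deque(j for j in range(n) if indegree[j] == 0)
    removed = 0
    while queue:
        i = queue.popleft()
        removed += 1
        for j in range(n):
            if adj[i][j]:
                indegree[j] -= 1
                if indegree[j] == 0:
                    queue.append(j)
    return removed == n


# Illustrative edge scores (assumed, not taken from the paper).
data_scores = [
    [0.0, 0.8, 0.3],
    [0.1, 0.0, 0.7],
    [0.6, 0.2, 0.0],
]
# Assumed prior: strain -> polarization is required,
# polarization -> composition is forbidden.
prior = [
    [0, 0, 0],
    [0, 0, 1],
    [-1, 0, 0],
]

adjacency = combine(data_scores, prior)
for name, row in zip(VARIABLES, adjacency):
    print(f"{name:15s} {row}")
print("acyclic:", is_dag(adjacency))
```

In this toy case the data alone would place an edge from polarization back to composition and close a cycle; the assumed prior forbids that edge, so the combined graph stays acyclic. The acyclicity check is kept separate from the merge so that a different merging rule can be swapped in without touching it.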
Research Organization:
The Pennsylvania State University, University Park, PA (United States)
Sponsoring Organization:
National Institute of Standards and Technology; USDOE Office of Science (SC), Basic Energy Sciences (BES)
Grant/Contract Number:
SC0021118
Journal Information:
Journal Name: Applied Physics Letters; Journal Volume: 127; Journal Issue: 12; ISSN 1077-3118; ISSN 0003-6951
Publisher:
AIP Publishing
Country of Publication:
United States
Language:
English
