DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Combining data and theory for derivable scientific discovery with AI-Descartes

Journal Article · · Nature Communications

Abstract Scientists aim to discover meaningful formulae that accurately describe experimental data. Mathematical models of natural phenomena can be manually created from domain knowledge and fitted to data, or, in contrast, created automatically from large datasets with machine-learning algorithms. The problem of incorporating prior knowledge expressed as constraints on the functional form of a learned model has been studied before, while finding models that are consistent with prior knowledge expressed via general logical axioms is an open problem. We develop a method to enable principled derivations of models of natural phenomena from axiomatic knowledge and experimental data by combining logical reasoning with symbolic regression. We demonstrate these concepts for Kepler’s third law of planetary motion, Einstein’s relativistic time-dilation law, and Langmuir’s theory of adsorption. We show we can discover governing laws from few data points when logical reasoning is used to distinguish between candidate formulae having similar error on the data.

Sponsoring Organization:
USDOE
OSTI ID:
1969567
Journal Information:
Nature Communications, Journal Name: Nature Communications Journal Issue: 1 Vol. 14; ISSN 2041-1723
Publisher:
Nature Publishing GroupCopyright Statement
Country of Publication:
United Kingdom
Language:
English

References (18)

Constraint-Based Visual Generation book January 2019
KeYmaera X: An Axiomatic Tactical Theorem Prover for Hybrid Systems book January 2015
Translating math formula images to LaTeX sequences using deep neural networks with sequence-level training journal November 2020
A global MINLP approach to symbolic regression journal May 2018
The Adsorption of Gases on Plane Surfaces of Glass, mica and Platinum. journal September 1918
Adsorption Equilibria of C 1 to C 4 Alkanes, CO 2 , and SF 6 on Silicalite journal February 1998
A simple electromagnetic model for the light clock of special relativity journal October 2011
Numerical methods for experimental design of large-scale linear ill-posed inverse problems journal September 2008
Discovering Physical Concepts with Neural Networks journal January 2020
A Simple Derivation of Time Dilation and Length Contraction in Special Relativity journal October 2014
AI Feynman: A physics-inspired method for symbolic regression journal April 2020
Distilling Free-Form Natural Laws from Experimental Data journal April 2009
Optical Clocks and Relativity journal September 2010
Symbolic regression driven by training data and prior knowledge conference June 2020
Semantic Search in Millions of Equations conference August 2020
LGML: Logic Guided Machine Learning (Student Abstract) journal April 2020
Logic Guided Genetic Algorithms (Student Abstract) journal May 2021
Complexity of Semialgebraic Proofs journal January 2002

Similar Records

Reasoning about knowledge and action
Technical Report · Wed Oct 01 00:00:00 EDT 1980 · OSTI ID:6755363

A spectrum of applications of automated reasoning.
Conference · Thu Jan 31 23:00:00 EST 2002 · OSTI ID:793070

Northeast Artificial Intelligence Consortium annual report for 1987. Volume 7. Part A. Time oriented problem solving. Interim report, December 1986-December 1987
Technical Report · Tue Feb 28 23:00:00 EST 1989 · OSTI ID:5393063

Related Subjects