Accelerating Chemical Discovery with Machine Learning: Simulated Evolution of Spin Crossover Complexes with an Artificial Neural Network
Abstract
Machine learning (ML) has emerged as a powerful complement to simulation for materials discovery by reducing time for evaluation of energies and properties at accuracy competitive with first-principles methods. We use genetic algorithm (GA) optimization to discover unconventional spin-crossover complexes in combination with efficient scoring from an artificial neural network (ANN) that predicts spin-state splitting of inorganic complexes. We explore a compound space of over 5600 candidate materials derived from eight metal/oxidation state combinations and a 32-ligand pool. We introduce a strategy for error-aware ML-driven discovery by limiting how far the GA travels away from the nearest ANN training points while maximizing property (i.e., spin-splitting) fitness, leading to discovery of 80% of the leads from full chemical space enumeration. Over a 51-complex subset, average unsigned errors (4.5 kcal/mol) are close to the ANN’s baseline 3 kcal/mol error. By obtaining leads from the trained ANN within seconds rather than days from a DFT-driven GA, this strategy demonstrates the power of ML for accelerating inorganic material discovery.
- Authors:
-
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
- Publication Date:
- Research Org.:
- Massachusetts Inst. of Technology (MIT), Cambridge, MA (United States)
- Sponsoring Org.:
- USDOE Office of Science (SC), Basic Energy Sciences (BES)
- OSTI Identifier:
- 1421993
- Alternate Identifier(s):
- OSTI ID: 1508752
- Grant/Contract Number:
- SC0018096; N00014-17-1-2956; CBET-1704266; ACI-1548562
- Resource Type:
- Published Article
- Journal Name:
- Journal of Physical Chemistry Letters
- Additional Journal Information:
- Journal Name: Journal of Physical Chemistry Letters Journal Volume: 9 Journal Issue: 5; Journal ID: ISSN 1948-7185
- Publisher:
- American Chemical Society
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 37 INORGANIC, ORGANIC, PHYSICAL, AND ANALYTICAL CHEMISTRY; 36 MATERIALS SCIENCE
Citation Formats
Janet, Jon Paul, Chan, Lydia, and Kulik, Heather J. Accelerating Chemical Discovery with Machine Learning: Simulated Evolution of Spin Crossover Complexes with an Artificial Neural Network. United States: N. p., 2018.
Web. doi:10.1021/acs.jpclett.8b00170.
Janet, Jon Paul, Chan, Lydia, & Kulik, Heather J. Accelerating Chemical Discovery with Machine Learning: Simulated Evolution of Spin Crossover Complexes with an Artificial Neural Network. United States. doi:10.1021/acs.jpclett.8b00170.
Janet, Jon Paul, Chan, Lydia, and Kulik, Heather J. Thu .
"Accelerating Chemical Discovery with Machine Learning: Simulated Evolution of Spin Crossover Complexes with an Artificial Neural Network". United States. doi:10.1021/acs.jpclett.8b00170.
@article{osti_1421993,
title = {Accelerating Chemical Discovery with Machine Learning: Simulated Evolution of Spin Crossover Complexes with an Artificial Neural Network},
author = {Janet, Jon Paul and Chan, Lydia and Kulik, Heather J.},
abstractNote = {Machine learning (ML) has emerged as a powerful complement to simulation for materials discovery by reducing time for evaluation of energies and properties at accuracy competitive with first-principles methods. We use genetic algorithm (GA) optimization to discover unconventional spin-crossover complexes in combination with efficient scoring from an artificial neural network (ANN) that predicts spin-state splitting of inorganic complexes. We explore a compound space of over 5600 candidate materials derived from eight metal/oxidation state combinations and a 32-ligand pool. We introduce a strategy for error-aware ML-driven discovery by limiting how far the GA travels away from the nearest ANN training points while maximizing property (i.e., spin-splitting) fitness, leading to discovery of 80% of the leads from full chemical space enumeration. Over a 51-complex subset, average unsigned errors (4.5 kcal/mol) are close to the ANN’s baseline 3 kcal/mol error. By obtaining leads from the trained ANN within seconds rather than days from a DFT-driven GA, this strategy demonstrates the power of ML for accelerating inorganic material discovery.},
doi = {10.1021/acs.jpclett.8b00170},
journal = {Journal of Physical Chemistry Letters},
number = 5,
volume = 9,
place = {United States},
year = {2018},
month = {2}
}
DOI: 10.1021/acs.jpclett.8b00170
Web of Science
Figures / Tables:

Works referencing / citing this record:
Enumeration of de novo inorganic complexes for chemical discovery and machine learning
journal, January 2020
- Gugler, Stefan; Janet, Jon Paul; Kulik, Heather J.
- Molecular Systems Design & Engineering, Vol. 5, Issue 1
Enumeration of de novo inorganic complexes for chemical discovery and machine learning
journal, January 2020
- Gugler, Stefan; Janet, Jon Paul; Kulik, Heather J.
- Molecular Systems Design & Engineering, Vol. 5, Issue 1
Toward Predicting Efficiency of Organic Solar Cells via Machine Learning and Improved Descriptors
journal, July 2018
- Sahu, Harikrishna; Rao, Weining; Troisi, Alessandro
- Advanced Energy Materials, Vol. 8, Issue 24
Machine learning for heterogeneous catalyst design and discovery
journal, May 2018
- Goldsmith, Bryan R.; Esterhuizen, Jacques; Liu, Jin-Xun
- AIChE Journal, Vol. 64, Issue 7
Data Mining the C−C Cross‐Coupling Genome
journal, May 2019
- Sawatlon, Boodsarin; Wodrich, Matthew D.; Meyer, Benjamin
- ChemCatChem, Vol. 11, Issue 16
Methane Chemisorption on Oxide‐Supported Pt Single Atom
journal, July 2019
- Fung, Victor; Hu, Guoxiang; Tao, Franklin (Feng)
- ChemPhysChem, Vol. 20, Issue 17
Making machine learning a useful tool in the accelerated discovery of transition metal complexes
journal, July 2019
- Kulik, Heather J.
- WIREs Computational Molecular Science, Vol. 10, Issue 1
Accelerated discovery of stable lead-free hybrid organic-inorganic perovskites via machine learning
journal, August 2018
- Lu, Shuaihua; Zhou, Qionghua; Ouyang, Yixin
- Nature Communications, Vol. 9, Issue 1
The basis for reevaluating the reactivity of pyrite surfaces: spin states and crystal field d-orbital splitting energies of bulk, terrace, edge, and corner Fe( ii ) ions
journal, January 2019
- Arumugam, Krishnamoorthy; Renock, Devon; Becker, Udo
- Physical Chemistry Chemical Physics, Vol. 21, Issue 12
Towards operando computational modeling in heterogeneous catalysis
journal, January 2018
- Grajciar, Lukáš; Heard, Christopher J.; Bondarenko, Anton A.
- Chemical Society Reviews, Vol. 47, Issue 22
Can machine learning identify the next high-temperature superconductor? Examining extrapolation performance for materials discovery
journal, January 2018
- Meredig, Bryce; Antono, Erin; Church, Carena
- Molecular Systems Design & Engineering, Vol. 3, Issue 5
Machine learning meets volcano plots: computational discovery of cross-coupling catalysts
journal, January 2018
- Meyer, Benjamin; Sawatlon, Boodsarin; Heinen, Stefan
- Chemical Science, Vol. 9, Issue 35
Mapping binary copolymer property space with neural networks
journal, January 2019
- Wilbraham, Liam; Sprick, Reiner Sebastian; Jelfs, Kim E.
- Chemical Science, Vol. 10, Issue 19
Benchmarking quantum chemistry methods for spin-state energetics of iron complexes against quantitative experimental data
journal, January 2019
- Radoń, Mariusz
- Physical Chemistry Chemical Physics, Vol. 21, Issue 9
Hammett neural networks: prediction of frontier orbital energies of tungsten–benzylidyne photoredox complexes
journal, January 2019
- Chang, Alexander M.; Freeze, Jessica G.; Batista, Victor S.
- Chemical Science, Vol. 10, Issue 28
Machine learning for renewable energy materials
journal, January 2019
- Gu, Geun Ho; Noh, Juhwan; Kim, Inkyung
- Journal of Materials Chemistry A, Vol. 7, Issue 29
Designing promising molecules for organic solar cells via machine learning assisted virtual screening
journal, January 2019
- Sahu, Harikrishna; Yang, Feng; Ye, Xiaobo
- Journal of Materials Chemistry A, Vol. 7, Issue 29
In silico high throughput screening of bimetallic and single atom alloys using machine learning and ab initio microkinetic modelling
journal, January 2020
- Saxena, Shivam; Khan, Tuhin Suvra; Jalid, Fatima
- Journal of Materials Chemistry A, Vol. 8, Issue 1
Calculating spin crossover temperatures by a first-principles LDA+ U scheme with parameter U evaluated from GW
journal, October 2019
- Zhang, Yachao
- The Journal of Chemical Physics, Vol. 151, Issue 13
Accurate and transferable multitask prediction of chemical properties with an atoms-in-molecules neural network
journal, August 2019
- Zubatyuk, Roman; Smith, Justin S.; Leszczynski, Jerzy
- Science Advances, Vol. 5, Issue 8
Figures / Tables found in this record: