Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Deep learning for computational chemistry

Journal Article · · Journal of Computational Chemistry
DOI:https://doi.org/10.1002/jcc.24764· OSTI ID:1406688
 [1];  [1];  [1]
  1. Advanced Computing, Mathematics, and Data Division, Pacific Northwest National Laboratory, 902 Battelle Blvd Richland Washington 99354
The rise and fall of artificial neural networks is well documented in the scientific literature of both the fields of computer science and computational chemistry. Yet almost two decades later, we are now seeing a resurgence of interest in deep learning, a machine learning algorithm based on “deep” neural networks. Within the last few years, we have seen the transformative impact of deep learning the computer science domain, notably in speech recognition and computer vision, to the extent that the majority of practitioners in those field are now regularly eschewing prior established models in favor of deep learning models. In this review, we provide an introductory overview into the theory of deep neural networks and their unique properties as compared to traditional machine learning algorithms used in cheminformatics. By providing an overview of the variety of emerging applications of deep neural networks, we highlight its ubiquity and broad applicability to a wide range of challenges in the field, including QSAR, virtual screening, protein structure modeling, QM calculations, materials synthesis and property prediction. In reviewing the performance of deep neural networks, we observed a consistent outperformance against non neural networks state-of-the-art models across disparate research topics, and deep neural network based models often exceeded the “glass ceiling” expectations of their respective tasks. Coupled with the maturity of GPU-accelerated computing for training deep neural networks and the exponential growth of chemical data on which to train these networks on, we anticipate that deep learning algorithms will be a useful tool and may grow into a pivotal role for various challenges in the computational chemistry field.
Research Organization:
Pacific Northwest National Laboratory (PNNL), Richland, WA (US)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-76RL01830
OSTI ID:
1406688
Report Number(s):
PNNL-SA-121040
Journal Information:
Journal of Computational Chemistry, Journal Name: Journal of Computational Chemistry Journal Issue: 16 Vol. 38; ISSN 0192-8651
Publisher:
Wiley
Country of Publication:
United States
Language:
English

References (97)

XenoSite: Accurately Predicting CYP-Mediated Sites of Metabolism with Neural Networks journal November 2013
Principles of QSAR models validation: internal and external journal May 2007
A decade of CASP: progress, bottlenecks and prognosis in protein structure prediction journal June 2005
Exploring Chemical Space for Drug Discovery Using the Chemical Universe Database journal May 2012
3D-QSAR in Drug Design - A Review journal January 2010
Learning representations by back-propagating errors journal October 1986
CHARMM: The biomolecular simulation program journal July 2009
Mastering the game of Go with deep neural networks and tree search journal January 2016
Quantitative Nanostructure−Activity Relationship Modeling journal September 2010
THEORY OF PROTEIN FOLDING: The Energy Landscape Perspective journal October 1997
Advances in methods and algorithms in a modern quantum chemistry program package journal January 2006
Best Practices for QSAR Model Development, Validation, and Exploitation journal July 2010
Deep Machine Learning - A New Frontier in Artificial Intelligence Research [Research Frontier] journal November 2010
Extended-Connectivity Fingerprints journal April 2010
SPINE X: Improving protein secondary structure prediction by multistep learning coupled with prediction of solvent accessible surface area and backbone torsion angles journal November 2011
Stalking the Materials Genome: A Data-Driven Approach to the Virtual Design of Nanostructured Polymers journal June 2013
Quantitative Structure-Fluorescence Property Relationship Analysis of a Large BODIPY Library journal October 2010
Deep Neural Nets as a Method for Quantitative Structure–Activity Relationships journal February 2015
Progress and challenges in protein structure prediction journal June 2008
DeepTox: Toxicity Prediction using Deep Learning journal February 2016
Modeling Reactivity to Biological Macromolecules with a Deep Multitask Network journal July 2016
Big–deep–smart data in imaging for guiding materials design journal September 2015
Scalable molecular dynamics with NAMD journal January 2005
Searching for exotic particles in high-energy physics with deep learning journal July 2014
Evaluation of methods for modeling transcription factor sequence specificity journal January 2013
Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning journal July 2015
A Fast Learning Algorithm for Deep Belief Nets journal July 2006
Applications of Deep Learning in Biomedicine journal March 2016
NWChem: A comprehensive and scalable open-source solution for large scale molecular simulations journal September 2010
QSPR: the correlation and quantitative prediction of chemical and physical properties from structure journal January 1995
Fourier series of atomic radial distribution functions: A molecular fingerprint for machine learning models of quantum chemical properties
  • von Lilienfeld, O. Anatole; Ramakrishnan, Raghunathan; Rupp, Matthias
  • International Journal of Quantum Chemistry, Vol. 115, Issue 16 https://doi.org/10.1002/qua.24912
journal April 2015
Robust QSAR Models Using Bayesian Regularized Neural Networks journal July 1999
Are Protein Force Fields Getting Better? A Systematic Benchmark on 524 Diverse NMR Measurements journal March 2012
Commentary: The Materials Project: A materials genome approach to accelerating materials innovation journal July 2013
Lessons Learned in Empirical Scoring with smina from the CSAR 2011 Benchmarking Exercise journal February 2013
Directory of Useful Decoys, Enhanced (DUD-E): Better Ligands and Decoys for Better Benchmarking journal July 2012
CHARMM36 all-atom additive protein force field: Validation based on comparison to NMR data journal July 2013
Deep Blue journal January 2002
The Amber biomolecular simulation programs journal January 2005
How Fast-Folding Proteins Fold journal October 2011
The art and practice of structure-based drug design: A molecular modeling perspective journal January 1996
Profiling of the Tox21 10K compound library for agonists and antagonists of the estrogen receptor alpha signaling pathway journal July 2014
Consensus Modeling for HTS Assays Using In silico Descriptors Calculates the Best Balanced Accuracy in Tox21 Challenge journal February 2016
Random Forest:  A Classification and Regression Tool for Compound Classification and QSAR Modeling journal November 2003
Maximum Unbiased Validation (MUV) Data Sets for Virtual Screening Based on PubChem Bioactivity Data journal January 2009
Deep Architectures and Deep Learning in Chemoinformatics: The Prediction of Aqueous Solubility for Drug-Like Molecules journal July 2013
Deep learning journal May 2015
Accelerating molecular modeling applications with graphics processors journal January 2007
Identifying Biological Pathway Interrupting Toxins Using Multi-Tree Ensembles journal August 2016
Improved Prediction of CYP-Mediated Metabolism with Chemical Fingerprints journal May 2015
Fast and Accurate Modeling of Molecular Atomization Energies with Machine Learning journal January 2012
Predicting physical properties of ionic liquids journal January 2006
Machine-learning approach for one- and two-body corrections to density functional theory: Applications to molecular and condensed water journal August 2013
A logical calculus of the ideas immanent in nervous activity journal January 1990
Artificial evolution of coumarin dyes for dye sensitized solar cells journal January 2015
Framewise phoneme classification with bidirectional LSTM and other neural network architectures journal July 2005
Assessment and Validation of Machine Learning Methods for Predicting Molecular Atomization Energies journal July 2013
Improving prediction of secondary structure, local backbone angles and solvent accessible surface area of proteins by iterative deep learning journal June 2015
Deep Learning in Drug Discovery journal December 2015
Mold 2 , Molecular Descriptors from 2D Structures for Chemoinformatics and Toxicoinformatics journal June 2008
Finding Nature’s Missing Ternary Oxide Compounds Using Machine Learning and Density Functional Theory journal June 2010
PaDEL-descriptor: An open source software to calculate molecular descriptors and fingerprints journal December 2010
Protein structure prediction from sequence variation journal November 2012
GROMACS 4:  Algorithms for Highly Efficient, Load-Balanced, and Scalable Molecular Simulation journal February 2008
Review: Protein Secondary Structure Prediction Continues to Rise journal May 2001
Quantum-Chemical Descriptors in QSAR/QSPR Studies journal January 1996
Accelerating Density Functional Calculations with Graphics Processing Unit journal July 2008
Predicting Continuous Local Structure and the Effect of Its Substitution for Secondary Structure in Fragment-Free Protein Structure Prediction journal November 2009
Machine Learning of Parameters for Accurate Semiempirical Quantum Chemical Calculations journal April 2015
Quantitative Structure–Property Relationship Modeling of Diverse Materials Properties journal January 2012
Optimal Programming Problems with Inequality Constraints journal November 1963
Machine learning of molecular electronic properties in chemical compound space journal September 2013
Statistical potential for assessment and prediction of protein structures journal November 2006
Big Data Meets Quantum Chemistry Approximations: The Δ-Machine Learning Approach journal April 2015
Representation Learning: A Review and New Perspectives journal August 2013
Deep Learning for Drug-Induced Liver Injury journal October 2015
Crystal structure representations for machine learning models of formation energies journal April 2015
Predicting protein residue–residue contacts using deep networks and boosting journal October 2012
Predicting backbone Cα angles and dihedrals from protein sequences by stacked sparse auto-encoder deep neural network journal September 2014
PISCES: recent improvements to a PDB sequence culling server journal July 2005
Deep architectures for protein contact map prediction journal July 2012
Predicting residue–residue contacts using random forest models journal October 2011
A Kirkwood-Buff Derived Force Field for Aqueous Alkali Halides journal April 2011
ImageNet Large Scale Visual Recognition Challenge journal April 2015
General atomic and molecular electronic structure system journal November 1993
PubChem's BioAssay Database journal December 2011
Consistent blind protein structure generation from NMR chemical shift data journal March 2008
Development and large scale benchmark testing of the PROSPECTOR_3 threading algorithm journal April 2004
Simulation of Osmotic Pressure in Concentrated Aqueous Salt Solutions journal November 2009
The ASTRAL Compendium in 2004 journal January 2004
Learning from the Harvard Clean Energy Project: The Use of Neural Networks to Accelerate Materials Discovery journal September 2015
Machine-learning-assisted materials discovery using failed experiments journal May 2016
A deep learning framework for modeling structural features of RNA-binding protein targets journal October 2015
Modeling Epoxidation of Drug-like Molecules with a Deep Machine Learning Network journal June 2015
Deep learning in neural networks: An overview journal January 2015
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs journal September 1997
Modeling electronic quantum transport with machine learning journal June 2014

Similar Records

How Much Chemistry Does a Deep Neural Network Need to Know to Make Accurate Predictions?
Conference · Mon May 07 00:00:00 EDT 2018 · OSTI ID:1558182

Quantum Neural Networks: Issues, Training, and Applications
Technical Report · Fri Sep 29 00:00:00 EDT 2023 · OSTI ID:2337965

ChemNet: A Transferable and Generalizable Deep Neural Network for Small-Molecule Property Prediction
Conference · Thu Dec 07 23:00:00 EST 2017 · OSTI ID:1415704