How Much Chemistry Does a Deep Neural Network Need to Know to Make Accurate Predictions?
- BATTELLE (PACIFIC NW LAB)
In the last few years, we have seen the rise of deep learning applications in a broad range of computational chemistry research problems. Using human-engineered chemical features, such as molecular descriptors and fingerprints, deep learning models have shown similar, if not better performance that most traditional machine learning algorithms. Recently, we reported on the development of Chemception, a deep convolutional neural network (CNN) architecture for general-purpose small molecule property prediction. On average, Chemception matched the performance of expert-developed QSAR/QSPR models trained on chemical features (molecular fingerprints), despite that it was trained on just 2D images of molecular drawings with minimal chemical information. Here, we investigate the effects of systematically removing and adding basic chemical information to the image channels of the 2D images used to train Chemception. By augmenting our images with only 3 additional basic chemical information, we demonstrate the improvement of Chemception performance – that it is now more accurate than contemporary deep learning models trained on ECFP fingerprints for the prediction of toxicity, activity and solvation free energy, as well as physics-based free energy simulation methods for computing solvation properties. By altering the chemical information content in the image channels, and examining the resulting performance of Chemception, we also identify to two different “learning patterns” in toxicity/activity as compared to solvation free energy, and it parallels the fundamental differences in contemporary chemistry research for predicting toxicity/activity and solvation free energy.
- Research Organization:
- Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
- Sponsoring Organization:
- USDOE
- DOE Contract Number:
- AC05-76RL01830
- OSTI ID:
- 1558182
- Report Number(s):
- PNNL-SA-127201
- Country of Publication:
- United States
- Language:
- English
Similar Records
ChemNet: A Transferable and Generalizable Deep Neural Network for Small-Molecule Property Prediction
Using Rule-Based Models for Weak Supervised Learning: A ChemNet for Transferable Chemical Property Prediction
Generalizable, fast, and accurate DeepQSPR with fastprop
Conference
·
Thu Dec 07 23:00:00 EST 2017
·
OSTI ID:1415704
Using Rule-Based Models for Weak Supervised Learning: A ChemNet for Transferable Chemical Property Prediction
Conference
·
Sun Aug 19 00:00:00 EDT 2018
·
OSTI ID:1764978
Generalizable, fast, and accurate DeepQSPR with fastprop
Journal Article
·
Mon May 12 20:00:00 EDT 2025
· Journal of Cheminformatics
·
OSTI ID:2565981