Using Rule-Based Models for Weak Supervised Learning: A ChemNet for Transferable Chemical Property Prediction
Conference
·
OSTI ID:1764978
- BATTELLE (PACIFIC NW LAB)
With access to large datasets, deep neural networks (DNN) have achieved human-level accuracy in image and speech recognition tasks. However, in chemistry, data is inherently small and fragmented. In this work, we develop an approach of integrating rule-based knowledge into the Chemception CNN model via transfer learning techniques. The resulting model, ChemNet, is a transferable and generalizable deep neural network for chemical property prediction that learns in a semi-supervised manner from large unlabeled chemical databases. When ChemNet is further fine-tuned on 3 smaller datasets to predict chemical properties that it was not originally trained on, we show that ChemNet exceeds the accuracy of existing Chemception CNN models and other contemporary DNN models that were trained using conventional supervised learning approaches. These results indicate that pre-training ChemNet on a large diverse chemical database while incorporating chemistry domain knowledge, enables the development of more generalizable deep neural networks for the prediction of novel chemical properties.
- Research Organization:
- Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
- Sponsoring Organization:
- USDOE
- DOE Contract Number:
- AC05-76RL01830
- OSTI ID:
- 1764978
- Report Number(s):
- PNNL-SA-132274
- Country of Publication:
- United States
- Language:
- English
Similar Records
ChemNet: A Transferable and Generalizable Deep Neural Network for Small-Molecule Property Prediction
How Much Chemistry Does a Deep Neural Network Need to Know to Make Accurate Predictions?
Conference
·
Thu Dec 07 23:00:00 EST 2017
·
OSTI ID:1415704
How Much Chemistry Does a Deep Neural Network Need to Know to Make Accurate Predictions?
Conference
·
Mon May 07 00:00:00 EDT 2018
·
OSTI ID:1558182