Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Using Rule-Based Models for Weak Supervised Learning: A ChemNet for Transferable Chemical Property Prediction

Conference ·
OSTI ID:1764978
With access to large datasets, deep neural networks (DNN) have achieved human-level accuracy in image and speech recognition tasks. However, in chemistry, data is inherently small and fragmented. In this work, we develop an approach of integrating rule-based knowledge into the Chemception CNN model via transfer learning techniques. The resulting model, ChemNet, is a transferable and generalizable deep neural network for chemical property prediction that learns in a semi-supervised manner from large unlabeled chemical databases. When ChemNet is further fine-tuned on 3 smaller datasets to predict chemical properties that it was not originally trained on, we show that ChemNet exceeds the accuracy of existing Chemception CNN models and other contemporary DNN models that were trained using conventional supervised learning approaches. These results indicate that pre-training ChemNet on a large diverse chemical database while incorporating chemistry domain knowledge, enables the development of more generalizable deep neural networks for the prediction of novel chemical properties.
Research Organization:
Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-76RL01830
OSTI ID:
1764978
Report Number(s):
PNNL-SA-132274
Country of Publication:
United States
Language:
English