Evaluating uncertainty-based active learning for accelerating the generalization of molecular property prediction
- Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
Deep learning models have proven to be a powerful tool for the prediction of molecular properties for applications including drug design and the development of energy storage materials. However, in order to learn accurate and robust structure–property mappings, these models require large amounts of data which can be a challenge to collect given the time and resource-intensive nature of experimental material characterization efforts. Additionally, such models fail to generalize to new types of molecular structures that were not included in the model training data. The acceleration of material development through uncertainty-guided experimental design has the promise to significantly reduce the data requirements and enable faster generalization to new types of materials. To evaluate the potential of such approaches for electrolyte design applications, we perform comprehensive evaluation of existing uncertainty quantification methods on the prediction of two relevant molecular properties - aqueous solubility and redox potential. We develop novel evaluation methods to probe the utility of the uncertainty estimates for both in-domain and out-of-domain data sets. Finally, we leverage selected uncertainty estimation methods for active learning to evaluate their capacity to support experimental design.
- Research Organization:
- Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
- Sponsoring Organization:
- USDOE Laboratory Directed Research and Development (LDRD) Program
- Grant/Contract Number:
- AC05-76RL01830
- OSTI ID:
- 2228269
- Report Number(s):
- PNNL-SA--179045
- Journal Information:
- Journal of Cheminformatics, Journal Name: Journal of Cheminformatics Journal Issue: 1 Vol. 15; ISSN 1758-2946
- Publisher:
- Chemistry Central Ltd.Copyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
Towards Efficient Uncertainty estimation in deep learning for robust energy prediction in crystal materials
mystic: software for autonomous discovery and design under uncertainty