Machine learning-based discovery of molecular descriptors that control polymer gas permeation
Journal Article
·
· Journal of Membrane Science
- Columbia University, New York, NY (United States)
- California Institute of Technology (CalTech), Pasadena, CA (United States)
- Brookhaven National Laboratory (BNL), Upton, NY (United States)
While machine learning has found increasing use in predicting the properties of polymeric materials with only a knowledge of chain architecture, determining the molecular factors underpinning properties (“interpretable AI”) has remained less well explored. We show that encoding chain chemistry in commonly employed formats, e.g., binary-valued fingerprints, leads to uniqueness issues during the hashing process to save storage space. This is because the hashing algorithm can map several chemical moieties into the same bit. These issues carry over into the ML algorithms, especially for “inverse” design and interpretable AI, and cannot be avoided by changing the length of the fingerprint. Using MACCS key featurizations of monomer repeats resolves some of these issues, and we show that a few substructures consistently appear in top features for maximizing permeability across several gases and ML models. These are carbon–carbon double bonds (as in polyacetylenes) especially when they are associated with methyl groups (found in branching architectures). Here these results, derived from the limited data set of ~ 500 polymers with experimental gas permeation data, are in agreement with physical insight and thus provide a robust foundation which could further enable study of these material classes through detailed experiments and simulations.
- Research Organization:
- Brookhaven National Laboratory (BNL), Upton, NY (United States)
- Sponsoring Organization:
- King Abdullah University of Science and Technology; USDOE Laboratory Directed Research and Development (LDRD) Program; USDOE Office of Science (SC), Basic Energy Sciences (BES)
- Grant/Contract Number:
- SC0008772; SC0012704
- OSTI ID:
- 2338112
- Alternate ID(s):
- OSTI ID: 2369909
- Report Number(s):
- BNL--225505-2024-JAAM
- Journal Information:
- Journal of Membrane Science, Journal Name: Journal of Membrane Science Vol. 697; ISSN 0376-7388
- Publisher:
- ElsevierCopyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
Fingerprinting Interactions between Proteins and Ligands for Facilitating Machine Learning in Drug Discovery
BeyondFingerprinting: AI-guided discovery of robust materials & processes
Journal Article
·
Thu Jan 04 19:00:00 EST 2024
· Biomolecules
·
OSTI ID:2471231
BeyondFingerprinting: AI-guided discovery of robust materials & processes
Technical Report
·
Sun Sep 01 00:00:00 EDT 2024
·
OSTI ID:2462990