Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Determining best practices for using genetic algorithms in molecular discovery

Journal Article · · Journal of Chemical Physics
DOI:https://doi.org/10.1063/5.0158053· OSTI ID:2246912
Genetic algorithms (GAs) are a powerful tool to search large chemical spaces for inverse molecular design. However, GAs have multiple hyperparameters that have not been thoroughly investigated for chemical space searches. In this tutorial, we examine the general effects of a number of hyperparameters, such as population size, elitism rate, selection method, mutation rate, and convergence criteria, on key GA performance metrics. Here, we show that using a self-termination method with a minimum Spearman’s rank correlation coefficient of 0.8 between generations maintained for 50 consecutive generations along with a population size of 32, a 50% elitism rate, three-way tournament selection, and a 40% mutation rate provides the best balance of finding the overall champion, maintaining good coverage of elite targets, and improving relative speedup for general use in molecular design GAs.
Research Organization:
University of Pittsburgh, PA (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Basic Energy Sciences (BES)
Grant/Contract Number:
SC0019335
OSTI ID:
2246912
Journal Information:
Journal of Chemical Physics, Journal Name: Journal of Chemical Physics Journal Issue: 9 Vol. 159; ISSN 0021-9606
Publisher:
American Institute of Physics (AIP)Copyright Statement
Country of Publication:
United States
Language:
English

References (47)

A Genetic Algorithm Approach to Design Principles for Organic Photovoltaic Materials journal July 2020
A Fukui function‐guided genetic algorithm. Assessment on structural prediction of Sin (n = 12–20) clusters journal April 2017
GAMaterial—A genetic‐algorithm software for material design and discovery journal November 2022
Effect of the Genetic Algorithm Parameters on the Optimisation of Heterogeneous Catalysts journal February 2005
Extended tight‐binding quantum chemistry methods journal August 2020
Genetic algorithms in chemistry journal January 1993
Genetic algorithms in chemistry journal July 2007
Optimization configuration of selective solar absorber using multi-island genetic algorithm journal August 2021
Discovery and Optimization of Materials Using Evolutionary Approaches journal May 2016
Genetic Algorithm Approach for the Optimization of Protein Antifreeze Activity Using Molecular Simulations journal November 2020
A Robust and Accurate Tight-Binding Quantum Chemical Method for Structures, Vibrational Frequencies, and Noncovalent Interactions of Large Molecular Systems Parametrized for All spd-Block Elements ( Z = 1–86) journal April 2017
GAtor: A First-Principles Genetic Algorithm for Molecular Crystal Structure Prediction journal February 2018
GFN2-xTB—An Accurate and Broadly Parametrized Self-Consistent Tight-Binding Quantum Chemical Method with Multipole Electrostatics and Density-Dependent Dispersion Contributions journal January 2019
In Silico Prediction of Hemolytic Toxicity on the Human Erythrocytes for Small Molecules by Machine-Learning and Genetic Algorithm journal June 2019
Heuristic Global Optimization in Chemical Compound Space journal October 2020
Pareto Optimization of Oligomer Polarizability and Dipole Moment Using a Genetic Algorithm journal April 2022
Automatic Conformational Search of Transition States for Catalytic Reactions Using Genetic Algorithm journal November 2019
The XtalOpt Evolutionary Algorithm for Crystal Structure Prediction journal December 2020
Screening Efficient Tandem Organic Solar Cells with Machine Learning and Genetic Algorithms journal March 2023
Intelligent Selection of Metal–Organic Framework Arrays for Methane Sensing via Genetic Algorithms journal May 2019
Genetic Algorithm Based Design and Experimental Characterization of a Highly Thermostable Metalloprotein journal January 2018
Computational Design and Selection of Optimal Organic Photovoltaic Materials journal July 2011
Efficient Computational Screening of Organic Polymer Photovoltaics journal April 2013
Evolutionary design of molecules based on deep learning and a genetic algorithm journal August 2021
Evolving better nanoparticles: Genetic algorithms for optimising cluster geometries journal January 2003
A graph-based genetic algorithm and generative model/Monte Carlo tree search for the exploration of chemical space journal January 2019
Automated exploration of the low-energy chemical space with fast quantum chemical methods journal January 2020
Achieving ultra-narrow bandgap non-halogenated non-fullerene acceptors via vinylene π-bridges for efficient organic solar cells journal January 2021
Illuminating elite patches of chemical space journal January 2020
Alkoxy substitution on IDT-Series and Y-Series non-fullerene acceptors yielding highly efficient organic solar cells journal January 2021
Evaluating fast methods for static polarizabilities on extended conjugated oligomers journal January 2022
Parallel tempered genetic algorithm guided by deep neural networks for inverse molecular design journal January 2022
Graph-based molecular Pareto optimisation journal January 2022
Using genetic algorithms to discover novel ground-state triplet conjugated polymers journal January 2023
A simplified Tamm-Dancoff density functional approach for the electronic excitation spectra of very large molecules journal June 2013
Ultra-fast computation of electronic spectra for large systems by tight-binding based simplified Tamm-Dancoff approximation (sTDA-xTB) journal August 2016
Virtual screening of norbornadiene-based molecular solar thermal energy storage systems using a genetic algorithm journal November 2021
Computational evolution of high-performing unfused non-fullerene acceptors for organic solar cells journal May 2022
Organic photoredox catalysts for CO2 reduction: Driving discovery with genetic algorithms journal May 2022
Global optimization of atomic structure enhanced by machine learning journal June 2022
Inverse molecular design using machine learning: Generative models for matter engineering journal July 2018
Simultaneous Shape and Stacking Sequence Optimization of Laminated Composite Free-Form Shells Using Multi-Island Genetic Algorithm journal October 2019
Open Babel: An open chemical toolbox journal October 2011
UMAP: Uniform Manifold Approximation and Projection journal September 2018
Genetic Algorithm Design of MOF-based Gas Sensor Arrays for CO2-in-Air Sensing journal February 2020
Chemical space exploration: how genetic algorithms find the needle in the haystack journal July 2020
Using a genetic algorithm to find molecules with good docking scores journal May 2021