Determining best practices for using genetic algorithms in molecular discovery.

J Chem Phys

Department of Chemistry, University of Pittsburgh, 219 Parkman Avenue, Pittsburgh, Pennsylvania 15260, USA.

Published: September 2023

AI Article Synopsis

Article Abstract

Genetic algorithms (GAs) are a powerful tool to search large chemical spaces for inverse molecular design. However, GAs have multiple hyperparameters that have not been thoroughly investigated for chemical space searches. In this tutorial, we examine the general effects of a number of hyperparameters, such as population size, elitism rate, selection method, mutation rate, and convergence criteria, on key GA performance metrics. We show that using a self-termination method with a minimum Spearman's rank correlation coefficient of 0.8 between generations maintained for 50 consecutive generations along with a population size of 32, a 50% elitism rate, three-way tournament selection, and a 40% mutation rate provides the best balance of finding the overall champion, maintaining good coverage of elite targets, and improving relative speedup for general use in molecular design GAs.

Download full-text PDF

Source
http://dx.doi.org/10.1063/5.0158053DOI Listing

Publication Analysis

Top Keywords

genetic algorithms
8
molecular design
8
design gas
8
population size
8
elitism rate
8
mutation rate
8
determining best
4
best practices
4
practices genetic
4
algorithms molecular
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!