Issue 2, 2020

Constrained Bayesian optimization for automatic chemical design using variational autoencoders

Abstract

Automatic Chemical Design is a framework for generating novel molecules with optimized properties. The original scheme, featuring Bayesian optimization over the latent space of a variational autoencoder, suffers from the pathology that it tends to produce invalid molecular structures. First, we demonstrate empirically that this pathology arises when the Bayesian optimization scheme queries latent space points far away from the data on which the variational autoencoder has been trained. Secondly, by reformulating the search procedure as a constrained Bayesian optimization problem, we show that the effects of this pathology can be mitigated, yielding marked improvements in the validity of the generated molecules. We posit that constrained Bayesian optimization is a good approach for solving this kind of training set mismatch in many generative tasks involving Bayesian optimization over the latent space of a variational autoencoder.

Graphical abstract: Constrained Bayesian optimization for automatic chemical design using variational autoencoders

Supplementary files

Article information

Article type
Edge Article
Submitted
12 Aug 2019
Accepted
15 Nov 2019
First published
18 Nov 2019
This article is Open Access

All publication charges for this article have been paid for by the Royal Society of Chemistry
Creative Commons BY license

Chem. Sci., 2020,11, 577-586

Constrained Bayesian optimization for automatic chemical design using variational autoencoders

R. Griffiths and J. M. Hernández-Lobato, Chem. Sci., 2020, 11, 577 DOI: 10.1039/C9SC04026A

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements