Deep learning has shown great potential for generating molecules with desired properties. But the cost and time required to obtain relevant property data have limited study to only a few classes of materials for which extensive data have already been collected. We develop a deep learning method that combines a generative model with a property prediction model to fuse small data of one class of molecules with larger data in another class. Common low-level physicochemical properties are jointly embedded into a latent space that can be used to design molecules in the smaller class. The chemical space around the molecules in the training set is explored through local gradient ascent optimization. Based on nine molecules from the original training set, nine new molecules are found to have improved properties while remaining structurally similar to the training molecules thereby easing requirements for entirely new synthesis routes. Validation is performed using an equilibrium thermochemistry code to verify the molecules and target properties. A specific example targeting the Chapman-Jouguet velocity and small data for nitrogen-rich molecules is shown. Despite the relative lack of nitrogen-rich molecule data, the results demonstrate that fusing and joint embedding with plentiful low nitrogen molecular data can produce higher generative performance than using the scarce data alone.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1002/minf.202100011 | DOI Listing |
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!