A simple approach to rotationally invariant machine learning of a vector quantity.

J Chem Phys

J. Heyrovský Institute of Physical Chemistry, Academy of Sciences of the Czech Republic, v.v.i., Dolejškova 3, 18223 Prague 8, Czech Republic.

Published: November 2024

AI Article Synopsis

  • Machine learning (ML) for predicting vector or tensor properties requires maintaining proper invariance with molecular rotation, unlike energy prediction which is simpler due to its scalar nature.
  • The proposed "rotate-predict-rotate" (RPR) technique involves three steps: rotating the molecule to align with its principal axes, predicting the vector property in that orientation, and then transforming the prediction back to the original orientation.
  • This RPR approach ensures covariance for vector properties and can extend to tensors, allowing for rapid training of accurate ML models across numerous molecular configurations.

Article Abstract

Unlike with the energy, which is a scalar property, machine learning (ML) prediction of vector or tensor properties poses the additional challenge of achieving proper invariance (covariance) with respect to molecular rotation. For the energy gradients needed in molecular dynamics (MD), this symmetry is automatically fulfilled when taking analytic derivative of the energy, which is a scalar invariant (using properly invariant molecular descriptors). However, if the properties cannot be obtained by differentiation, other appropriate methods should be applied to retain the covariance. Several approaches have been suggested to properly treat this issue. For nonadiabatic couplings and polarizabilities, for example, it was possible to construct virtual quantities from which the above tensorial properties are obtained by differentiation and thus guarantee the covariance. Another possible solution is to build the rotational equivariance into the design of a neural network employed in the model. Here, we propose a simpler alternative technique, which does not require construction of auxiliary properties or application of special equivariant ML techniques. We suggest a three-step approach, using the molecular tensor of inertia. In the first step, the molecule is rotated using the eigenvectors of this tensor to its principal axes. In the second step, the ML procedure predicts the vector property relative to this orientation, based on a training set where all vector properties were in this same coordinate system. As the third step, it remains to transform the ML estimate of the vector property back to the original orientation. This rotate-predict-rotate (RPR) procedure should thus guarantee proper covariance of a vector property and is trivially extensible also to tensors such as polarizability. The RPR procedure has an advantage that the accurate models can be trained very fast for thousands of molecular configurations, which might be beneficial where many training sets are required (e.g., in active learning). We have implemented the RPR technique, using the MLatom and Newton-X programs for ML and MD, and performed its assessment on the dipole moment along MD trajectories of 1,2-dichloroethane.

Download full-text PDF

Source
http://dx.doi.org/10.1063/5.0230176DOI Listing

Publication Analysis

Top Keywords

vector property
12
machine learning
8
energy scalar
8
properties differentiation
8
rpr procedure
8
vector
6
properties
5
molecular
5
simple approach
4
approach rotationally
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!