We describe a method for the post-hoc interpretation of a neural network (NN) trained on the global and local minima of neutral water clusters. We use the structures recently reported in a newly published database containing over 5 × 10 unique water cluster networks (HO) of size N = 3-30. The structural properties were first characterized using chemical descriptors derived from graph theory, identifying important trends in topology, connectivity, and polygon structure of the networks associated with the various minima. The code to generate the molecular graphs and compute the descriptors is available at https://github.com/exalearn/molecular-graph-descriptors, and the graphs are available alongside the original database at https://sites.uw.edu/wdbase/. A Continuous-Filter Convolutional Neural Network (CF-CNN) was trained on a subset of 500 000 networks to predict the potential energy, yielding a mean absolute error of 0.002 ± 0.002 kcal/mol per water molecule. Clusters of sizes not included in the training set exhibited errors of the same magnitude, indicating that the CF-CNN protocol accurately predicts energies of networks for both smaller and larger sizes than those used during training. The graph-theoretical descriptors were further employed to interpret the predictive power of the CF-CNN. Topological measures, such as the Wiener index, the average shortest path length, and the similarity index, suggested that all networks from the test set were within the range of values as the ones from the training set. The graph analysis suggests that larger errors appear when the mean degree and the number of polygons in the cluster lie further from the mean of the training set. This indicates that the structural space, and not just the chemical space, is an important factor to consider when designing training sets, as predictive errors can result when the structural composition is sufficiently different from the bulk of those in the training set. To this end, the developed descriptors are quite effective in explaining the results of the CF-CNN (a.k.a. the "black box") model.

Download full-text PDF

Source
http://dx.doi.org/10.1063/5.0009933DOI Listing

Publication Analysis

Top Keywords

training set
16
neural network
12
graph-theoretical descriptors
8
continuous-filter convolutional
8
convolutional neural
8
network cf-cnn
8
cf-cnn trained
8
trained global
8
global local
8
neutral water
8

Similar Publications

Only 25% of adults meet both aerobic and strength training recommendations for physical activity. Contingency management interventions have been used to increase physical activity; however, they may be cost prohibitive. Intermittently provided incentives lower costs and are effective for various health behaviors.

View Article and Find Full Text PDF

Background: One repetition maximum (1RM) is a vital metric for exercise professionals, but various testing protocols exist, and their impacts on the resulting 1RM, barbell kinetics, and subsequent muscular performance testing are not well understood. This study aimed to compare two previously established protocols and a novel self-led method for determining bench press 1RM, 1RM barbell kinetics, and subsequent muscular performance measures.

Methods: Twenty-four resistance-trained males (n = 12, 24 ± 6.

View Article and Find Full Text PDF

The purpose of this study was to compare the internal and external load in continuous and intermittent small-sided games (SSG) formats. Eight semi-professional soccer players participated in the study, and they completed three protocols: (a) I-intermittent SSG protocol (Int-I, 4 sets of 4 min with a 3 min recovery); (b) Continuous SSG protocol (Con, 2 sets of 8 min with a 3 min recovery); (c) II-SSG protocol (Int-II, 4 sets of 4 min, where each set includes 1 min of exercise with varying recovery periods (10, 20, 30 s), with a 3 min recovery period between sets). A one-way analysis of variance (ANOVA) was used to analyze the dependent variables, with significance determined at < 0.

View Article and Find Full Text PDF

The aim of this study was to compare the acute effect of three cluster set (CS) intra-set rest intervals (15 s, 30 s, and 45 s) on mechanical performance measures during a flywheel resistance training session. Twelve amateur male field sport athletes attended three training measurement sessions (separated by 14 days of wash-out), consisting of four sets of nine repetitions (as cluster-blocks: 3 + 3 + 3), using a 0.050 kg·m inertial load.

View Article and Find Full Text PDF

The Multiple Frequency Speed of Kick Test (FSKT) is used to investigate which characteristics are necessary for, contribute to, or limit the ability to repeat high-intensity intermittent efforts in taekwondo. This cross-sectional study investigated the relationship between anthropometric and body composition characteristics, muscle power performance, and sport-specific anaerobic performance. Nineteen black belt taekwondo athletes (mean ± SD age: 17.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!