k-Means Clustering in Fingerprint-Based Configuration Selection for Fitting Interatomic Potentials.

J Chem Theory Comput

Department of Physics, Faculty of Mechanical Engineering, Czech Technical University in Prague, Technická 4, Prague 6 16607, Czech Republic.

Published: December 2024

In this study, we present a method for selecting an arbitrary number of distinct configurations from a larger data set by applying k-means clustering to atomistic configuration fingerprints based on the CrystalNN model and the radial distribution function (RDF). This approach improves the accuracy of fitting classical molecular dynamics interatomic potentials to density functional theory (DFT) data for both energies and forces while requiring fewer configurations than random selection. We demonstrate this improvement by fitting an embedded-atom method (EAM) potential for titanium, using subsets of various sizes drawn from an initial set of 1800 configurations. k-Means clustering consistently achieves better precision and lower standard deviations with fewer configurations than random selection. The results also suggest that only about 30 configurations are sufficient to obtain an EAM model that describes the energies and forces of the full set of 1800 configurations well. Additionally, the t-distributed stochastic neighbor embedding (t-SNE) method was used to reduce the configuration fingerprints to a 2D space, and it revealed an overlap between two configuration subsets, with and without a Ti vacancy, indicating similar atomic environments. This similarity is captured by k-means clustering but not by random selection. Furthermore, when the overlapping configurations with vacancies were excluded from the k-means algorithm and used only as a test set, their energy and force predictions showed precision similar to that obtained when they were included. This indicates that overlapping configurations in the 2D t-SNE space indeed imply potential information redundancy among the atomistic configurations.
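The selection idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that each configuration has already been reduced to a fixed-length fingerprint vector, and that one representative per cluster is chosen as the configuration closest to the cluster centroid; the abstract does not state the exact selection rule, so that choice is an assumption. The function names (`kmeans`, `select_representatives`) and the synthetic 2D "fingerprints" are hypothetical stand-ins for real CrystalNN/RDF fingerprints.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, n_iter=50, seed=0):
    """Plain Lloyd's algorithm; returns a list of k centroids."""
    rng = random.Random(seed)
    # Initialise centroids from k distinct data points.
    centroids = [list(p) for p in rng.sample(points, k)]
    for _ in range(n_iter):
        # Assignment step: attach every point to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda c: squared_distance(p, centroids[c]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for c, members in enumerate(clusters):
            if members:
                dim = len(members[0])
                new_centroids.append(
                    [sum(m[d] for m in members) / len(members) for d in range(dim)]
                )
            else:
                # An empty cluster keeps its previous centroid.
                new_centroids.append(centroids[c])
        if new_centroids == centroids:
            break  # converged
        centroids = new_centroids
    return centroids


def select_representatives(points, k, seed=0):
    """Indices of the configurations nearest to each of the k centroids.

    Assumption: "nearest to centroid" is used as the selection rule.
    Duplicates are merged, so fewer than k indices may be returned.
    """
    centroids = kmeans(points, k, seed=seed)
    selected = [
        min(range(len(points)), key=lambda i: squared_distance(points[i], c))
        for c in centroids
    ]
    return sorted(set(selected))


# Synthetic stand-in data: three groups of 2D "fingerprints" (hypothetical).
rng = random.Random(42)
fingerprints = [
    [rng.gauss(cx, 0.3), rng.gauss(cy, 0.3)]
    for cx, cy in [(0.0, 0.0), (5.0, 5.0), (0.0, 5.0)]
    for _ in range(20)
]
selected = select_representatives(fingerprints, k=3)
print(selected)
```

In practice a library implementation such as scikit-learn's `KMeans` would be the more typical choice; the pure-Python version here only keeps the example dependency-free. The selected indices would then identify the configurations passed on to the potential fit, while random selection would draw the same number of indices uniformly.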

Source
http://dx.doi.org/10.1021/acs.jctc.4c01225
