-Means Clustering Coarse-Graining (KMC-CG): A Next Generation Methodology for Determining Optimal Coarse-Grained Mappings of Large Biomolecules.

J Chem Theory Comput

Department of Chemistry, Chicago Center for Theoretical Chemistry, The James Franck Institute, and Institute for Biophysical Dynamics, The University of Chicago, Chicago, Illinois 60637, United States.

Published: December 2023

Coarse-grained (CG) molecular dynamics (MD) has become a method of choice for simulating various large scale biomolecular processes; therefore, the systematic definition of the CG mappings for biomolecules remains an important topic. Appropriate CG mappings can significantly enhance the representability of a CG model and improve its ability to capture critical features of large biomolecules. In this work, we present a systematic and more generalized method called -means clustering coarse-graining (KMC-CG), which builds on the earlier approach of essential dynamics coarse-graining (ED-CG). KMC-CG removes the sequence-dependent constraints of ED-CG, allowing it to explore a more extensive space and thus enabling the discovery of more physically optimal CG mappings. Furthermore, the implementation of the -means clustering algorithm can variationally optimize the CG mapping with efficiency and stability. This new method is tested in three cases: ATP-bound G-actin, the HIV-1 CA pentamer, and the Arp2/3 complex. In these examples, the CG models generated by KMC-CG are seen to better capture the structural, dynamic, and functional domains. KMC-CG therefore provides a robust and consistent approach to generating CG models of large biomolecules that can then be more accurately parametrized by either bottom-up or top-down CG force fields.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10720621PMC
http://dx.doi.org/10.1021/acs.jctc.3c01053DOI Listing

Publication Analysis

Top Keywords

-means clustering
12
large biomolecules
12
clustering coarse-graining
8
coarse-graining kmc-cg
8
kmc-cg
5
kmc-cg generation
4
generation methodology
4
methodology determining
4
determining optimal
4
optimal coarse-grained
4

Similar Publications

Background: Cervical spondylotic myelopathy (CSM) is a debilitating condition that affects the cervical spine, leading to neurological impairments. While the neural mechanisms underlying CSM remain poorly understood, changes in brain network connectivity, particularly within the context of static and dynamic functional network connectivity (sFNC and dFNC), may provide valuable insights into disease pathophysiology. This study investigates brain-wide connectivity alterations in CSM patients using both sFNC and dFNC, combined with machine learning approaches, to explore their potential as biomarkers for disease classification and progression.

View Article and Find Full Text PDF

Flexible and modular latent transition analysis-A tutorial using R.

PLoS One

January 2025

National Institute of Public Health, University of Southern Denmark, Copenhagen K, Denmark.

Latent transition analysis (LTA) is a useful statistical modelling approach for describe transitions between latent classes over time. LTA may be characterized in terms of prevalence at each time point and through transition probabilities over time. Investigating predictors of these transitions is often of key interest.

View Article and Find Full Text PDF

Background: Symptoms frequently associated with endometriosis affect quality of life (QoL). Our aim investigated the hypothesis that cluster analysis can be used to identify homogeneous phenotyping subgroups of women according to the burden of the endometriosis for their QoL, and then to investigate the phenotype differences observed between these subgroups.

Methods: We developed an anonymous online survey, which received responses from 1,586 French women with endometriosis.

View Article and Find Full Text PDF

Background: Amid rapid urbanisation, the health effects of the built-environment have been widely studied, while research on elderly-supportive infrastructure and its interaction with PM (PM, Particulate Matter) exposure remains limited.

Objectives: To examine the effect of PM on cardiovascular hospitalisation risk among the elderly and the moderating role of elderly-supportive infrastructure in Wuhan, a city undergoing rapid urbanisation.

Methods: A time-stratified case-crossover design was adopted in which the K-means cluster analysis was applied to categorize elderly-supportive infrastructure.

View Article and Find Full Text PDF

Background: Assessing player readiness is crucial in elite basketball. This study aims to provide a practical method for monitoring player readiness through the handgrip test and identify associations with wellness scales.

Methods: Fifteen players (age: 25.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!