Markov (state) models (MSMs) and related models of molecular kinetics have recently received a surge of interest, as they can systematically reconcile simulation data from either a few long or many short simulations and allow us to analyze the essential metastable structures, thermodynamics, and kinetics of the molecular system under investigation. However, the estimation, validation, and analysis of such models are far from trivial and involve sophisticated and often numerically sensitive methods. In this work we present the open-source Python package PyEMMA (http://pyemma.org) that provides accurate and efficient algorithms for kinetic model construction. PyEMMA can read all common molecular dynamics data formats, helps in the selection of input features, provides easy access to dimension reduction algorithms such as principal component analysis (PCA) and time-lagged independent component analysis (TICA) and to clustering algorithms such as k-means, and contains estimators for MSMs, hidden Markov models, and several other models. Systematic model validation and error calculation methods are provided. PyEMMA offers a wealth of analysis functions so that the user can conveniently compute molecular observables of interest. We have derived a systematic and accurate way to coarse-grain MSMs to a few states and to illustrate the structures of the metastable states of the system. Plotting functions to produce a manuscript-ready presentation of the results are available. In this work, we demonstrate the features of the software and show new methodological concepts and results produced by PyEMMA.
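To make the central object concrete, the following is a minimal sketch of how an MSM can be estimated from a discretized trajectory (for example, the cluster labels produced by k-means). It is not PyEMMA's API: the function names `count_matrix`, `transition_matrix`, and `stationary_distribution` are hypothetical, the toy trajectory is invented for illustration, and the estimator shown is the simple non-reversible maximum-likelihood one (row-normalized transition counts), which is an assumption rather than the method described in the paper.

```python
def count_matrix(dtraj, n_states, lag):
    """Count transitions i -> j observed at a time separation of `lag` steps."""
    counts = [[0] * n_states for _ in range(n_states)]
    for t in range(len(dtraj) - lag):
        counts[dtraj[t]][dtraj[t + lag]] += 1
    return counts


def transition_matrix(counts):
    """Row-normalize the count matrix into transition probabilities."""
    T = []
    for row in counts:
        total = sum(row)
        if total == 0:
            raise ValueError("state with no outgoing transitions")
        T.append([c / total for c in row])
    return T


def stationary_distribution(T, n_iter=10000, tol=1e-12):
    """Approximate the stationary distribution by power iteration.

    Assumes the chain is ergodic (connected and aperiodic); otherwise
    the iteration may not converge to a unique vector.
    """
    n = len(T)
    pi = [1.0 / n] * n
    for _ in range(n_iter):
        new = [sum(pi[i] * T[i][j] for i in range(n)) for j in range(n)]
        if max(abs(a - b) for a, b in zip(new, pi)) < tol:
            return new
        pi = new
    return pi


# Toy discrete trajectory over two states (illustrative data only).
dtraj = [0, 0, 0, 1, 1, 1, 0, 0, 1, 1, 1, 1, 0, 0, 0, 0]
C = count_matrix(dtraj, n_states=2, lag=1)
T = transition_matrix(C)
pi = stationary_distribution(T)
```

In practice the lag time is varied and the model is validated before any observables are computed. This sketch omits both steps, as well as the reversible estimation and error analysis that a full package provides.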


Source: http://dx.doi.org/10.1021/acs.jctc.5b00743


