An Information-Theory-Based Approach for Optimal Model Reduction of Biomolecules.

J Chem Theory Comput

Physics Department, University of Trento, via Sommarive 14, I-38123 Trento, Italy.

Published: November 2020

In theoretical modeling of a physical system, a crucial step consists of the identification of those degrees of freedom that enable a synthetic yet informative representation of it. While in some cases this selection can be carried out on the basis of intuition and experience, straightforward discrimination of the important features from the negligible ones is difficult for many complex systems, most notably heteropolymers and large biomolecules. We here present a thermodynamics-based theoretical framework to gauge the effectiveness of a given simplified representation by measuring its information content. We employ this method to identify those reduced descriptions of proteins, in terms of a subset of their atoms, that retain the largest amount of information from the original model; we show that these highly informative representations share common features that are intrinsically related to the biological properties of the proteins under examination, thereby establishing a bridge between protein structure, energetics, and function.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7659038PMC
http://dx.doi.org/10.1021/acs.jctc.0c00676DOI Listing

Publication Analysis

Top Keywords

information-theory-based approach
4
approach optimal
4
optimal model
4
model reduction
4
reduction biomolecules
4
biomolecules theoretical
4
theoretical modeling
4
modeling physical
4
physical system
4
system crucial
4

Similar Publications

The Murciano-Granadina goat (MUG) is a renowned dairy breed, known for its adaptability and resilience, as well as for its exceptional milk traits characterized by high protein and fat content, along with low somatic cell counts. These traits are governed by complex biological processes, crucial in shaping phenotypic diversity. Thus, it is imperative to explore the factors regulating milk production and lactation for this breed.

View Article and Find Full Text PDF

Elucidating the molecular dependencies behind the cancer-type specificity of driver mutations may reveal new therapeutic opportunities. We hypothesized that developmental programs would impact the transduction of oncogenic signaling activated by a driver mutation and shape its cancer-type specificity. Therefore, we designed a computational analysis framework by combining single-cell gene expression profiles during fetal organ development, latent factor discovery, and information theory-based differential network analysis to systematically identify transcription factors that selectively respond to driver mutations under the influence of organ-specific developmental programs.

View Article and Find Full Text PDF

Analyzing entropy features in time-series data for pattern recognition in neurological conditions.

Artif Intell Med

April 2024

Department of Brain Sciences, Imperial College London, London, UK; The Great Ormond Street Institute of Child Health, University College London, London, UK; Great Ormond Street Hospital for Children, London, UK; Care Research and Technology Centre, The UK Dementia Research Institute, London, UK. Electronic address:

In the field of medical diagnosis and patient monitoring, effective pattern recognition in neurological time-series data is essential. Traditional methods predominantly based on statistical or probabilistic learning and inference often struggle with multivariate, multi-source, state-varying, and noisy data while also posing privacy risks due to excessive information collection and modeling. Furthermore, these methods often overlook critical statistical information, such as the distribution of data points and inherent uncertainties.

View Article and Find Full Text PDF

The measure of partial mutual information from mixed embedding (PMIME) is an information theory-based measure to accurately identify the direct and directional coupling, termed Granger causality or simply causality, between the observed variables or subsystems of a high-dimensional dynamical and complex system, without any a priori assumptions about the nature of the coupling relationship. In its core, it is a forward selection procedure that aims to iteratively identify the lag-dependence structure of a given observed variable (response) to all the other observed variables (candidate drivers). This model-free approach is capable of detecting nonlinear interactions, abundantly present in real-world complex systems, and it was shown to perform well on multivariate time series of moderately high dimension.

View Article and Find Full Text PDF

Deep learning-based efficient drone-borne sensing of cyanobacterial blooms using a clique-based feature extraction approach.

Sci Total Environ

February 2024

School of Environmental Engineering, University of Seoul, Dongdaemun-gu, Seoul 02504, Republic of Korea. Electronic address:

Article Synopsis
  • Recent advancements in remote sensing and deep learning have led to the development of the Hierarchical Concatenated Variational Autoencoder (HCVAE), which effectively monitors harmful algal blooms (HABs) by analyzing hyperspectral data in bodies of water such as Daecheong Lake in South Korea.
  • * HCVAE utilizes a multi-level hierarchical approach that allows for efficient extraction of key spectral features, enabling accurate estimation of chlorophyll-a (Chl-a) and phycocyanin (PC) concentrations with high agreement to measured values.
  • * The study showcases HCVAE's ability to create spatial distribution maps of algal pigments using drone technology, thereby improving near-real-time monitoring and assessment of HABs.
View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!