Mutual information (MI) is a general measure of statistical dependence with widespread application across the sciences. However, estimating MI between multi-dimensional variables is challenging because the number of samples necessary to converge to an accurate estimate scales unfavorably with dimensionality. In practice, existing techniques can reliably estimate MI in up to tens of dimensions, but fail in higher dimensions, where sufficient sample sizes are infeasible. Here, we explore the idea that underlying low-dimensional structure in high-dimensional data can be exploited to faithfully approximate MI in high-dimensional settings with realistic sample sizes. We develop a method that we call latent MI (LMI) approximation, which applies a nonparametric MI estimator to low-dimensional representations learned by a simple, theoretically-motivated model architecture. Using several benchmarks, we show that unlike existing techniques, LMI can approximate MI well for variables with $> 10^3$ dimensions if their dependence structure has low intrinsic dimensionality. Finally, we showcase LMI on two open problems in biology. First, we approximate MI between protein language model (pLM) representations of interacting proteins, and find that pLMs encode non-trivial information about protein-protein interactions. Second, we quantify cell fate information contained in single-cell RNA-seq (scRNA-seq) measurements of hematopoietic stem cells, and find a sharp transition during neutrophil differentiation when fate information captured by scRNA-seq increases dramatically.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11398540 | PMC |
Oral Radiol
January 2025
Faculty of Dentistry, Department of Oral and Maxillofacial Radiology, Istanbul University, Istanbul, Turkey.
Objectives: This study evaluates the potential of pulp volume/total tooth-volume measurements of canine teeth in relation to chronologic age in patients with cleft lip and palate (CLP). The significance of this study lies in its exploration of the usability of these measurements for age determination in CLP patients, providing a novel perspective to the existing literature.
Methods: Cone beam computed tomography images of 33 patients (16 females, 17 males) with unilateral CLP aged 14-45 years and 33 age- and sex-matched healthy individuals (16 females, 17 males) were retrospectively evaluated.
J Imaging Inform Med
January 2025
Leiden University Medical Center (LUMC), Leiden, the Netherlands.
Rising computed tomography (CT) workloads require more efficient image interpretation methods. Digitally reconstructed radiographs (DRRs), generated from CT data, may enhance workflow efficiency by enabling faster radiological assessments. Various techniques exist for generating DRRs.
View Article and Find Full Text PDFSupport Care Cancer
January 2025
Division of Hematology, Oncology, and Transplantation, University of Minnesota, 516 Delaware Street SE, MMC 480, PWB 14-100, Minneapolis, MN, 55455, USA.
Purpose: As cancer care is increasingly delivered in the home, more tasks and responsibilities fall on patients and their informal care partners. These time costs can present significant mental, physical, and financial burdens, and are undercounted in current measures of time toxicity that only consider care received in formal healthcare settings.
Methods: Semi-structured qualitative interviews were conducted with patients with gastrointestinal cancer and informal care partners at a single tertiary cancer center between March and October 2023.
Arch Gynecol Obstet
January 2025
Faculty of Medicine and Health Sciences, Tel Aviv University, Tel Aviv, Israel.
Purpose: To quantify the separation between maternal blood cell-free (cf)DNA markers in preeclampsia and unaffected pregnancies and compare with existing markers. This approach has not been used in previous studies.
Methods: Comprehensive systematic literature search of PubMed to identify studies measuring total cfDNA, fetal cf(f)DNA or the fetal fraction (FF) in pregnant women.
Nat Methods
January 2025
Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada.
Teravoxel-scale, cellular-resolution images of cleared rodent brains acquired with light-sheet fluorescence microscopy have transformed the way we study the brain. Realizing the potential of this technology requires computational pipelines that generalize across experimental protocols and map neuronal activity at the laminar and subpopulation-specific levels, beyond atlas-defined regions. Here, we present artficial intelligence-based cartography of ensembles (ACE), an end-to-end pipeline that employs three-dimensional deep learning segmentation models and advanced cluster-wise statistical algorithms, to enable unbiased mapping of local neuronal activity and connectivity.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!