Background: Single-cell sequencing (sc-Seq) experiments are producing increasingly large data sets. However, large data sets do not necessarily contain large amounts of information.

Results: Here, we formally quantify the information obtained from a sc-Seq experiment and show that it corresponds to an intuitive notion of gene expression heterogeneity. We demonstrate a natural relation between our notion of heterogeneity and that of cell type, decomposing heterogeneity into that component attributable to differential expression between cell types (inter-cluster heterogeneity) and that remaining (intra-cluster heterogeneity). We test our definition of heterogeneity as the objective function of a clustering algorithm, and show that it is a useful descriptor for gene expression patterns associated with different cell types.

Conclusions: Thus, our definition of gene heterogeneity leads to a biologically meaningful notion of cell type, as groups of cells that are statistically equivalent with respect to their patterns of gene expression. Our measure of heterogeneity, and its decomposition into inter- and intra-cluster, is non-parametric, intrinsic, unbiased, and requires no additional assumptions about expression patterns. Based on this theory, we develop an efficient method for the automatic unsupervised clustering of cells from sc-Seq data, and provide an R package implementation.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10422744PMC
http://dx.doi.org/10.1186/s12859-023-05424-8DOI Listing

Publication Analysis

Top Keywords

gene expression
12
large data
8
data sets
8
sets large
8
heterogeneity
8
cell type
8
expression patterns
8
cell
5
expression
5
information-theoretic approach
4

Similar Publications

Pulmonary hypertension (PH) increases the mortality of preterm infants with bronchopulmonary dysplasia (BPD). There are no curative therapies for this disease. Lung endothelial carnitine palmitoyltransferase 1a (Cpt1a), the rate-limiting enzyme of the carnitine shuttle system, is reduced in a rodent model of BPD.

View Article and Find Full Text PDF

Protocol to generate a 3D atherogenesis-on-chip model for studying endothelial-macrophage crosstalk in atherogenesis.

STAR Protoc

January 2025

Department of Experimental Vascular Medicine, Amsterdam UMC, location AMC, Meibergdreef 9, Amsterdam, the Netherlands; Amsterdam Cardiovascular Sciences, Atherosclerosis & Ischemic Syndromes, Amsterdam, the Netherlands; Laboratory of Angiogenesis and Vascular Metabolism, VIB-KU Leuven Center for Cancer Biology, VIB, 3000 Leuven, Belgium; Laboratory of Angiogenesis and Vascular Metabolism, Department of Oncology, KU Leuven and Leuven Cancer Institute (LKI), 3000 Leuven, Belgium. Electronic address:

The endothelium is the gatekeeper of vessel health, and its dysfunction is pivotal in driving atherogenesis. Here, we present a protocol to replicate endothelial-macrophage crosstalk during atherogenesis, called the "atherogenesis-on-chip" model, based on the Emulate dual-channel perfusion system. We describe a model for studying endothelial-macrophage interactions during atherogenesis in human aortic endothelial cells and human macrophages using qPCR and secretome analysis, fluorescence microscopy, and flow cytometry.

View Article and Find Full Text PDF

Angiogenesis begins as endothelial cells migrate, forming a sprouting tip and subsequent growth-rich stalk cells. Here, we present a protocol for transcriptomic and epigenomic analyses of tip-like cells in cultured endothelial cells. We describe steps for stimulating human umbilical vein endothelial cells (HUVECs) with vascular endothelial cell growth factor (VEGF) to generate tip-like cells.

View Article and Find Full Text PDF

Cadmium (Cd) is a toxic heavy metal which induces vascular disorders. Previous studies suggest that Cd in the bloodstream affects vascular endothelial cells (ECs), potentially contributing to vascular-related diseases. However, the molecular mechanisms of effects of Cd on ECs remain poorly understood.

View Article and Find Full Text PDF

In the present study, we identified 22 significant SNPs, eight stable QTLs and 17 potential candidate genes associated with 100-seed weight in soybean. Soybean is an economically important crop that is rich in seed oil and protein. The 100-seed weight (HSW) is a crucial yield contributing trait.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!