Motivation: Normalization to remove technical or experimental artifacts is critical in the analysis of single-cell RNA-sequencing experiments, even those for which unique molecular identifiers are available. The majority of methods for normalizing single-cell RNA-sequencing data adjust average expression for library size (LS), allowing the variance and other properties of the gene-specific expression distribution to be non-constant in LS. This often results in reduced power and increased false discoveries in downstream analyses, a problem which is exacerbated by the high proportion of zeros present in most datasets.

Results: To address this, we present Dino, a normalization method based on a flexible negative-binomial mixture model of gene expression. As demonstrated in both simulated and case study datasets, by normalizing the entire gene expression distribution, Dino is robust to shallow sequencing, sample heterogeneity and varying zero proportions, leading to improved performance in downstream analyses in a number of settings.

Availability And Implementation: The R package, Dino, is available on GitHub at https://github.com/JBrownBiostat/Dino. The Dino package is further archived and freely available on Zenodo at https://doi.org/10.5281/zenodo.4897558.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9502161PMC
http://dx.doi.org/10.1093/bioinformatics/btab450DOI Listing

Publication Analysis

Top Keywords

single-cell rna-sequencing
12
rna-sequencing data
8
expression distribution
8
downstream analyses
8
gene expression
8
normalization distributional
4
distributional resampling
4
resampling high
4
high throughput
4
throughput single-cell
4

Similar Publications

Background: Bone-invasive Pituitary Neuroendocrine Tumors (BI PitNETs) epitomize an aggressive subtype of pituitary tumors characterized by bone invasion, culminating in extensive skull base bone destruction and fragmentation. This infiltration poses a significant surgical risk due to potential damage to vital nerves and arteries. However, the mechanisms underlying bone invasion caused by PitNETs remain elusive, and effective interventions for PitNET-induced bone invasion are lacking in clinical practice.

View Article and Find Full Text PDF

An essential task in spatial transcriptomics is identifying spatially variable genes (SVGs). Here, we present Celina, a statistical method for systematically detecting cell type-specific SVGs (ct-SVGs)-a subset of SVGs exhibiting distinct spatial expression patterns within specific cell types. Celina utilizes a spatially varying coefficient model to accurately capture each gene's spatial expression pattern in relation to the distribution of cell types across tissue locations, ensuring effective type I error control and high power.

View Article and Find Full Text PDF

Integration of human papillomavirus (HPV) into the host genome drives HPV-positive head and neck squamous cell carcinoma (HPV HNSCC). Whole-genome sequencing of 51 tumors revealed intratumor heterogeneity of HPV integration, with 44% of breakpoints subclonal, and a biased distribution of integration breakpoints across the HPV genome. Four HPV physical states were identified, with at least 49% of tumors progressing without integration.

View Article and Find Full Text PDF

DOGMA-seq and multimodal, single-cell analysis in acute myeloid leukemia.

Int Rev Cell Mol Biol

January 2025

Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, United States; The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, United States. Electronic address:

Acute myeloid leukemia (AML) is a complex cancer, yet advances in recent years from integrated genomics methods have helped improve diagnosis, treatment, and means of patient stratification. A recent example of a powerful, multimodal method is DOGMA-seq, which can measure chromatin accessibility, gene expression, and cell-surface protein levels from the same individual cell simultaneously. Previous bimodal single-cell techniques, such as CITE-seq (Cellular indexing of transcriptomes and epitopes), have only permitted the transcriptome and cell-surface protein expression measurement.

View Article and Find Full Text PDF

Background: The incidence of papillary thyroid carcinoma (PTC) is on the rise globally. It is frequently associated with early lymphatic metastasis, and the poor prognosis tends to be poor once metastasis or recurrence occurs, even with current treatment modalities. Kushenol O, a novel extract derived from Sophora flavescens, has shown remarkable anticancer properties.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!