A key challenge in analyzing single cell RNA-sequencing data is the large number of false zeros, where genes actually expressed in a given cell are incorrectly measured as unexpressed. We present a method based on low-rank matrix approximation which imputes these values while preserving biologically non-expressed genes (true biological zeros) at zero expression levels. We provide theoretical justification for this denoising approach and demonstrate its advantages relative to other methods on simulated and biological datasets.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8752663PMC
http://dx.doi.org/10.1038/s41467-021-27729-zDOI Listing

Publication Analysis

Top Keywords

zero-preserving imputation
4
imputation single-cell
4
single-cell rna-seq
4
rna-seq data
4
data key
4
key challenge
4
challenge analyzing
4
analyzing single
4
single cell
4
cell rna-sequencing
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!