Single-cell RNA sequencing (scRNA-seq) data are commonly affected by technical artifacts known as "doublets," which limit cell throughput and lead to spurious biological conclusions. Here, we present a computational doublet detection tool-DoubletFinder-that identifies doublets using only gene expression data. DoubletFinder predicts doublets according to each real cell's proximity in gene expression space to artificial doublets created by averaging the transcriptional profile of randomly chosen cell pairs. We first use scRNA-seq datasets where the identity of doublets is known to show that DoubletFinder identifies doublets formed from transcriptionally distinct cells. When these doublets are removed, the identification of differentially expressed genes is enhanced. Second, we provide a method for estimating DoubletFinder input parameters, allowing its application across scRNA-seq datasets with diverse distributions of cell types. Lastly, we present "best practices" for DoubletFinder applications and illustrate that DoubletFinder is insensitive to an experimentally validated kidney cell type with "hybrid" expression features.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6853612 | PMC |
http://dx.doi.org/10.1016/j.cels.2019.03.003 | DOI Listing |
Nat Commun
November 2024
Peter MacCallum Cancer Centre, Melbourne, VIC, Australia.
The use of circulating tumour DNA (ctDNA) to profile mutational signatures represents a non-invasive opportunity for understanding cancer mutational processes. Here we present MisMatchFinder, a liquid biopsy approach for mutational signature detection using low-coverage whole-genome sequencing of ctDNA. Through analysis of 375 plasma samples across 9 cancers, we demonstrate that MisMatchFinder accurately infers single-base and doublet-base substitutions, as well as insertions and deletions to enhance the detection of ctDNA and clinically relevant mutational signatures.
View Article and Find Full Text PDFGenomics Proteomics Bioinformatics
December 2024
State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou 510275, China.
Multiplexing across donors has emerged as a popular strategy to increase throughput, reduce costs, overcome technical batch effects, and improve doublet detection in single-cell genomic studies. To eliminate additional experimental steps, endogenous nuclear genome variants are used for demultiplexing pooled single-cell RNA sequencing (scRNA-seq) data by several computational tools. However, these tools have limitations when applied to single-cell sequencing methods that do not cover nuclear genomic regions well, such as single-cell assay for transposase-accessible chromatin with sequencing (scATAC-seq).
View Article and Find Full Text PDFJCO Precis Oncol
September 2024
Department of Surgery, University of California, Irvine, Orange, CA.
Purpose: High-grade appendiceal adenocarcinomas (HGAA) with peritoneal metastases (PMs) are associated with poor survival. Hyperthermic intraperitoneal chemotherapy (HIPEC) is a novel treatment approach for unresectable HGAA-PM. However, its influence on immunogenomic profiles has not yet been fully explored.
View Article and Find Full Text PDFSingle-cell sequencing technologies have revolutionized biomedical research by enabling deconvolution of cell type-specific properties in highly heterogeneous tissue. While robust tools have been developed to handle bioinformatic challenges posed by single-cell RNA and ATAC data, options for emergent modalities such as methylation are much more limited, impeding the utility of results. Here we present Amethyst, a comprehensive R package for atlas-scale single-cell methylation sequencing data analysis.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!