A systematic performance evaluation of clustering methods for single-cell RNA-seq data.

Angelo Duò Mark D Robinson Charlotte Soneson

F1000Res

Institute of Molecular Life Sciences, University of Zurich, Zurich, 8057, Switzerland.

Published: July 2018

Subpopulation identification, usually via some form of unsupervised clustering, is a fundamental step in the analysis of many single-cell RNA-seq data sets. This has motivated the development and application of a broad range of clustering methods, based on various underlying algorithms. Here, we provide a systematic and extensible performance evaluation of 14 clustering algorithms implemented in R, including both methods developed explicitly for scRNA-seq data and more general-purpose methods. The methods were evaluated using nine publicly available scRNA-seq data sets as well as three simulations with varying degree of cluster separability. The same feature selection approaches were used for all methods, allowing us to focus on the investigation of the performance of the clustering algorithms themselves. We evaluated the ability of recovering known subpopulations, the stability and the run time and scalability of the methods. Additionally, we investigated whether the performance could be improved by generating consensus partitions from multiple individual clustering methods. We found substantial differences in the performance, run time and stability between the methods, with SC3 and Seurat showing the most favorable results. Additionally, we found that consensus clustering typically did not improve the performance compared to the best of the combined methods, but that several of the top-performing methods already perform some type of consensus clustering. All the code used for the evaluation is available on GitHub ( https://github.com/markrobinsonuzh/scRNAseq_clustering_comparison). In addition, an R package providing access to data and clustering results, thereby facilitating inclusion of new methods and data sets, is available from Bioconductor ( https://bioconductor.org/packages/DuoClustering2018).

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6134335	PMC
http://dx.doi.org/10.12688/f1000research.15666.3	DOI Listing

Publication Analysis

Top Keywords

methods

clustering methods

data sets

clustering

performance evaluation

evaluation clustering

single-cell rna-seq

rna-seq data

clustering algorithms

scrna-seq data

Similar Publications

Multimodal integration of blood RNA and ctDNA reflects response to immunotherapy in metastatic urothelial cancer.

JCI Insight

January 2025

Medical Oncology Department, Research Institute for Medical Innovation, Radboud University Medical Center, Nijmegen, Netherlands.

Sandra van Wilpe Davide Croci Sara S Fonseca Costa Iris Baw Te Paske Sofie H Tolmeijer

Background: Previously, we demonstrated that changes in circulating tumor DNA (ctDNA) are promising biomarkers for early response prediction (ERP) to immune checkpoint inhibitors (ICI) in metastatic urothelial cancer (mUC). In this study, we investigated the value of whole blood immunotranscriptomics for ERP-ICI and integrated both biomarkers into a multimodal model to boost accuracy.

Methods: Blood samples of 93 patients were collected at baseline and after 2-6 weeks of ICI for ctDNA (N=88) and immunotranscriptome (N=79) analyses.

View Article and Find Full Text PDF

Similar Publications

Characterization of cognitive decline in long-duration type 1 diabetes by cognitive, neuroimaging and pathological examinations.

JCI Insight

January 2025

Dianne Hoppes Nunnally Laboratory Research Division, Joslin Diabetes Center, Boston, United States of America.

Hetal S Shah Matthew N DeSalvo Anastasia Haidar Surya Vishva Teja Jangolla Marc Gregory Yu

Background: We aimed to characterize factors associated with the under-studied complication of cognitive decline in aging people with long-duration type 1 diabetes (T1D).

Methods: Joslin "Medalists" (n = 222; T1D ≥ 50 years) underwent cognitive testing. Medalists (n = 52) and age-matched non-diabetic controls (n = 20) underwent neuro- and retinal imaging.

View Article and Find Full Text PDF

Similar Publications

Interplay between Skeletal Muscle Catabolism and Remodeling of Arteriovenous Fistula via YAP1 Signaling.

J Am Soc Nephrol

January 2025

Selzman Institute for Kidney Health, Section of Nephrology, Department of Medicine, Baylor College of Medicine, Houston, Texas 77030.

Yongdong Wu Tae Hoon Lee Owen H Cheng Eric Peden Qingtian Li

Background: Arteriovenous (AV) fistulas are the preferred access for dialysis but have a high incidence of failure. This study aims to understand the crosstalk between skeletal muscle catabolism and AV fistula maturation failure.

Methods: Skeletal muscle metabolism and AV fistula maturation were evaluated in mice with chronic kidney disease (CKD).

View Article and Find Full Text PDF

Similar Publications

Reporting of concomitant and rescue topical therapies in atopic dermatitis randomised controlled trials evaluating a systemic treatment: a scoping review.

Br J Dermatol

January 2025

Centre of Evidence Based Dermatology, School of Medicine, Faculty of Medicine & Health Sciences, University of Nottingham, UK.

Sophie Leducq Wei Chern Gavin Fong Hywel C Williams Lucy Bradshaw Kim S Thomas

Background: Randomised controlled trials (RCTs) evaluating new systemic treatments for atopic dermatitis (AD) have increased dramatically over the last decade. These trials often incorporate topical therapies either as permitted concomitant or rescue treatments. Differential use of these topicals post-randomisation introduces potential bias as they may nullify or exaggerate treatment responses.

View Article and Find Full Text PDF

Similar Publications

Attention-aware differential learning for predicting peptide-MHC class I binding and T cell receptor recognition.

Brief Bioinform

November 2024

School of Computer Science, Northwestern Polytechnical University, Xi'an, 710129 Shaanxi, China.

Rui Niu Jingwei Wang Yanli Li Jiren Zhou Yang Guo

The identification of neoantigens is crucial for advancing vaccines, diagnostics, and immunotherapies. Despite this importance, a fundamental question remains: how to model the presentation of neoantigens by major histocompatibility complex class I molecules and the recognition of the peptide-MHC-I (pMHC-I) complex by T cell receptors (TCRs). Accurate prediction of pMHC-I binding and TCR recognition remains a significant computational challenge in immunology due to intricate binding motifs and the long-tail distribution of known binding pairs in public databases.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!