scCross: efficient search for rare subpopulations across multiple single-cell samples.

Bioinformatics

ICTEAM/INGI/Artificial Intelligence and Algorithms Group, UCLouvain, Louvain-la-Neuve 1348, Belgium.

Published: June 2024

Motivation: Identifying rare cell types is an important task for capturing the heterogeneity of single-cell data, such as scRNA-seq. The widespread availability of such data makes it possible to aggregate multiple samples, corresponding for example to different donors, into the same study. Yet such aggregated data are often subject to batch effects between samples. Clustering them therefore generally requires data integration methods, which can lead to overcorrection and make rare cells difficult to identify. We present scCross, a biclustering method that identifies rare subpopulations of cells present across multiple single-cell samples. It jointly identifies a group of cells and their specific marker genes by relying on a global sum criterion, computed over the entire subpopulation of cells, rather than on pairwise comparisons between individual cells. This proves robust to the high variability of scRNA-seq data, in particular to batch effects.
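The difference between a sum criterion over a whole group and pairwise comparisons between cells can be illustrated on a toy matrix. The sketch below is not the scCross algorithm or its objective function: the data, the additive batch shift, the helper names (`top_cells_by_sum`, `top_genes_by_sum`, `nearest_neighbour`) and the fixed group sizes are all assumptions made for illustration only.

```python
import numpy as np

# Toy expression matrix (rows = cells, columns = genes); all values are made up.
# Cells 0-2 belong to batch A, cells 3-5 to batch B, which carries an additive
# shift of +4 on every gene to mimic a batch effect.
# Cells 0 and 3 are the "rare" cells: they express marker genes 0 and 1.
X = np.array([
    [5, 5, 1, 1],   # batch A, rare
    [0, 0, 1, 1],   # batch A
    [0, 0, 1, 1],   # batch A
    [9, 9, 5, 5],   # batch B, rare
    [4, 4, 5, 5],   # batch B
    [4, 4, 5, 5],   # batch B
], dtype=float)


def top_cells_by_sum(X, genes, k):
    """Return the k cells with the largest summed expression over `genes`."""
    sums = X[:, genes].sum(axis=1)
    return sorted(np.argsort(-sums, kind="stable")[:k].tolist())


def top_genes_by_sum(X, cells, k):
    """Return the k genes with the largest summed expression over `cells`."""
    sums = X[cells, :].sum(axis=0)
    return sorted(np.argsort(-sums, kind="stable")[:k].tolist())


def nearest_neighbour(X, cell):
    """Index of the cell closest to `cell` in Euclidean distance (pairwise view)."""
    d = np.linalg.norm(X - X[cell], axis=1)
    d[cell] = np.inf  # exclude the cell itself
    return int(np.argmin(d))


# Group-level view: alternate between choosing cells and genes by summed expression.
genes = [0, 1]                           # assumed starting guess for the markers
cells = top_cells_by_sum(X, genes, k=2)  # cells with the highest marker sum
genes = top_genes_by_sum(X, cells, k=2)  # genes with the highest sum over those cells

# Pairwise view: which cell is closest to rare cell 3?
nn = nearest_neighbour(X, 3)

print("cells selected by sum:", cells)
print("genes selected by sum:", genes)
print("nearest neighbour of cell 3:", nn)
```

In this toy setting the summed marker expression groups the two rare cells across batches, whereas the nearest neighbour of rare cell 3 is a non-rare cell from its own batch, because the batch shift dominates the cell-to-cell distance.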

Results: We show through several case studies that scCross is able to identify rare subpopulations across multiple samples without performing prior data integration. In particular, it identifies a cilium subpopulation with potential new ciliary genes from lung cancer cells, which is not detected by typical alternatives. It also highlights rare subpopulations in human pancreas samples sequenced with different protocols, despite visible shifts in expression levels between batches. We further show that scCross outperforms typical alternatives at identifying a target rare cell type in a controlled experiment with artificially created batch effects. Together, these results show that scCross can efficiently identify rare cell subpopulations characterized by specific genes despite the presence of batch effects.

Availability And Implementation: The R and Scala implementation of scCross is freely available on GitHub, at https://github.com/agerniers/scCross/. A snapshot of the code and the data underlying this article are available on Zenodo, at https://zenodo.org/doi/10.5281/zenodo.10471063.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11256925
DOI: http://dx.doi.org/10.1093/bioinformatics/btae371
