To overcome the computational barriers of analyzing large-scale single-cell sequencing data, we introduce MetaQ, a metacell algorithm that scales to arbitrarily large datasets with linear runtime and constant memory usage. Inspired by cellular development, MetaQ conceptualizes each metacell as a collective ancestor of biologically similar cells. By quantizing cells into a discrete codebook, where each entry represents a metacell capable of reconstructing the original cells it quantizes, MetaQ identifies homogeneous cell subsets for efficient and accurate metacell inference. This approach reduces computational complexity from exponential to linear while maintaining or surpassing the performance of existing metacell algorithms. Extensive experiments demonstrate that MetaQ excels in downstream tasks such as cell type annotation, developmental trajectory inference, batch integration, and differential expression analysis. Thanks to its superior efficiency and effectiveness, MetaQ makes analyzing datasets with millions of cells practical, offering a powerful solution for single-cell studies in the era of high-throughput profiling.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1038/s41467-025-56424-6 | DOI Listing |
Nat Commun
January 2025
School of Computer Science, Sichuan University, Chengdu, Sichuan, China.
To overcome the computational barriers of analyzing large-scale single-cell sequencing data, we introduce MetaQ, a metacell algorithm that scales to arbitrarily large datasets with linear runtime and constant memory usage. Inspired by cellular development, MetaQ conceptualizes each metacell as a collective ancestor of biologically similar cells. By quantizing cells into a discrete codebook, where each entry represents a metacell capable of reconstructing the original cells it quantizes, MetaQ identifies homogeneous cell subsets for efficient and accurate metacell inference.
View Article and Find Full Text PDFMetabolites
July 2021
Structural and Computational Biology Unit, European Molecular Biology Laboratory, 69117 Heidelberg, Germany.
Metabolite annotation from imaging mass spectrometry (imaging MS) data is a difficult undertaking that is extremely resource intensive. Here, we adapted METASPACE, cloud software for imaging MS metabolite annotation and data interpretation, to quickly annotate microbial specialized metabolites from high-resolution and high-mass accuracy imaging MS data. Compared with manual ion image and MS1 annotation, METASPACE is faster and, with the appropriate database, more accurate.
View Article and Find Full Text PDFBMC Neurosci
August 2016
Institut de Neuroscienes de la Timone (INT), CNRS & Aix-Marseille University, 27 Boulevard Jean Moulin, 13005 Marseille, France
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!