High-throughput image-based profiling platforms are powerful technologies capable of collecting data from billions of cells exposed to thousands of perturbations in a time- and cost-effective manner. Therefore, image-based profiling data has been increasingly used for diverse biological applications, such as predicting drug mechanism of action or gene function. However, batch effects pose severe limitations to community-wide efforts to integrate and interpret image-based profiling data collected across different laboratories and equipment. To address this problem, we benchmarked seven high-performing scRNA-seq batch correction techniques, representing diverse approaches, using a newly released Cell Painting dataset, the largest publicly accessible image-based dataset. We focused on five different scenarios with varying complexity, and we found that Harmony, a mixture-model based method, consistently outperformed the other tested methods. Our proposed framework, benchmark, and metrics can additionally be used to assess new batch correction methods in the future. Overall, this work paves the way for improvements that allow the community to make best use of public Cell Painting data for scientific discovery.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10516049PMC
http://dx.doi.org/10.1101/2023.09.15.558001DOI Listing

Publication Analysis

Top Keywords

batch correction
12
image-based profiling
12
correction methods
8
profiling data
8
cell painting
8
image-based
5
evaluating batch
4
methods image-based
4
image-based cell
4
profiling
4

Similar Publications

Mitigating matrix effects in oil and gas wastewater analysis: LC-MS/MS method for ethanolamines.

Environ Sci Process Impacts

January 2025

Massachusetts Institute of Technology, Department of Civil and Environmental Engineering, Parsons Laboratory, 15 Vassar Street, Cambridge, Massachusetts 02139, USA.

The high salinity and organic content in oil and gas wastewaters can cause ion suppression during liquid chromatography mass spectrometry (LC/MS) analysis, diminishing the sensitivity and accuracy of measurements in available methods. This suppression is severe for low molecular weight organic compounds such as ethanolamines (, monoethanolamine (MEA), diethanolamine (DEA), triethanolamine (TEA), -methyldiethanolamine (MDEA), and ,-ethyldiethanolamine (EDEA)). Here, we deployed solid phase extraction (SPE), mixed-mode LC, triple quadrupole MS with positive electrospray ionization (ESI), and a suite of stable isotope standards (, one per target compound) to correct for ion suppression by salts and organic matter, SPE losses, and instrument variability.

View Article and Find Full Text PDF

While gas chromatography mass spectrometry (GC-MS) has long been used to identify compounds in complex mixtures, this process is often subjective and time-consuming and leaves a large fraction of seemingly good-quality spectra unidentified. In this work, we describe a set of new mass spectral library-based methods to assist compound identification in complex mixtures. These methods employ mass spectral uniqueness and compound ubiquity of library entries alongside noise reduction and automated comparison of retention indices to library compounds.

View Article and Find Full Text PDF

Highly effective batch effect correction method for RNA-seq count data.

Comput Struct Biotechnol J

December 2024

Department of Computer Science and Information Science, California State University San Marcos, 333 S. Twin Oaks Valley Rd, San Marcos, CA 92096, USA.

RNA sequencing (RNA-seq) has become a cornerstone of transcriptomics, providing detailed insights into gene expression across diverse biological conditions and sample types. However, RNA-seq data are often confounded by batch effects, systematic non-biological variations that compromise data reliability and obscure true biological differences. To address these challenges, we introduce ComBat-ref, a refined batch effect correction method designed to enhance the statistical power and reliability of differential expression analysis in RNA-seq data.

View Article and Find Full Text PDF

Tablet diversion strategy based on in-line NIR tablet press feed frame measurements.

Int J Pharm

January 2025

Pharmaceutical Engineering Research Group (PharmaEng), Department of Pharmaceutical Analysis, Ghent University, Ottergemsesteenweg 460, 9000 Ghent, Belgium. Electronic address:

The tablet diversion strategy, based on in-line near-infrared (NIR) tablet press feed frame measurements, can be a key component of both batch and continuous oral solid dose manufacturing processes. It enables real-time, high-frequency monitoring and control, enhancing process understanding and compliance compared to conventional interval-based sampling methods. Central to this strategy are NIR spectrometers, which serve as PAT systems for in-line blend uniformity monitoring in the feed of the tablet press.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!