Characterizing efficient feature selection for single-cell expression analysis.

Brief Bioinform

Department of Biological Sciences, Ulsan National Institute of Science and Technology (UNIST), 50, UNIST-gil, Ulsan 44919, Republic of Korea.

Published: May 2024

Unsupervised feature selection is a critical step for efficient and accurate analysis of single-cell RNA-seq data. Previous benchmarks used two different criteria to compare feature selection methods: (i) proportion of ground-truth marker genes included in the selected features and (ii) accuracy of cell clustering using ground-truth cell types. Here, we systematically compare the performance of 11 feature selection methods for both criteria. We first demonstrate the discordance between these criteria and suggest using the latter. We then compare the distribution of selected genes in their means between feature selection methods. We show that lowly expressed genes exhibit seriously high coefficients of variation and are mostly excluded by high-performance methods. In particular, high-deviation- and high-expression-based methods outperform the widely used in Seurat package in clustering cells and data visualization. We further show they also enable a clear separation of the same cell type from different tissues as well as accurate estimation of cell trajectories.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11229035PMC
http://dx.doi.org/10.1093/bib/bbae317DOI Listing

Publication Analysis

Top Keywords

feature selection
20
selection methods
12
criteria compare
8
feature
5
selection
5
methods
5
characterizing efficient
4
efficient feature
4
selection single-cell
4
single-cell expression
4

Similar Publications

Although radiotherapy techniques are the primary treatment for head and neck cancer (HNC), they are still associated with substantial toxicity, and side effect. Machine learning (ML) based radiomics models for predicting toxicity mostly rely on features extracted from pre-treatment imaging data. This study aims to compare different models in predicting radiation-induced xerostomia and sticky saliva in both early and late stage of HNC patients using CT and MRI image features along with demographics and dosimetric information.

View Article and Find Full Text PDF

Objective: The study aims to develop continuous trajectory profiles along curves with minimal error. It also focuses on formulating a percentage trajectory transection rate model as a function of geometric parameters (e.g.

View Article and Find Full Text PDF

Purpose: The purpose of the current study was to gain insight on augmentative and alternative communication (AAC) interface designs for children with cortical visual impairment (CVI). Children with CVI frequently require AAC and specific interface supports, and customization may be necessary to support access and use of speech-generating devices.

Method: A focus group methodology was selected to gain feedback from vision professionals on helpful AAC features for children with CVI.

View Article and Find Full Text PDF

Shallow whole-genome sequencing (sWGS) offers a cost-effective approach to detect copy number alterations (CNAs). However, there remains a gap for a standardized workflow specifically designed for sWGS analysis. To address this need, in this work we present SAMURAI, a bioinformatics pipeline specifically designed for analyzing CNAs from sWGS data in a standardized and reproducible manner.

View Article and Find Full Text PDF

Cost-Effective Synthesis of Carbazole-Based Nanoporous Organic Polymers for SO Capture.

ACS Appl Mater Interfaces

January 2025

International Scientific and Technological Cooperation Base of Industrial Solid Waste Cyclic Utilization and Advanced Materials, School of Materials Science and Engineering, North Minzu University, Yinchuan 750021, China.

Sulfur dioxide (SO), a pervasive air pollutant, poses significant environmental and health risks, necessitating advanced materials for its efficient capture. Nanoporous organic polymers (NOPs) have emerged as promising candidates; however, their development is often hindered by high synthesis temperatures, complex precursors, and limited SO selectivity. Herein, we report a room-temperature, cost-effective synthesis of carbazole-based nanoporous organic polymers (CNOPs) using 1,3,5-trioxane and paraldehyde, offering a significant advancement over traditional Friedel-Crafts alkylation methods.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!