Denoising adaptive deep clustering with self-attention mechanism on single-cell sequencing data.

Brief Bioinform

Key Lab of Intelligent Computing and Signal Processing of Ministry of Education, School of Artificial Intelligence, Anhui University, Hefei, 230601, China.

Published: March 2023

Many studies have used single-cell RNA sequencing (scRNA-seq) to study the diversity and biological functions of cells at the single-cell level. Clustering identifies unknown cell types, which is essential for downstream analysis of scRNA-seq samples. However, the high dimensionality, high noise and pervasive dropout of scRNA-seq samples pose a significant challenge to cluster analysis. Herein, we propose scDASFK, a new adaptive fuzzy clustering model based on a denoising autoencoder and a self-attention mechanism. It uses comparative learning to integrate cell similarity information into the clustering method, and a deep denoising network module to denoise the data. scDASFK also includes a self-attention mechanism for further denoising and an adaptive clustering optimization function for iterative clustering. To make the denoised latent features better reflect the cell structure, we introduce a new adaptive feedback mechanism that supervises the denoising process through the clustering results. Experiments on 16 real scRNA-seq datasets show that scDASFK performs well in terms of clustering accuracy, scalability and stability. Overall, scDASFK is an effective clustering model with great potential for the analysis of scRNA-seq samples. The scDASFK model code is freely available at https://github.com/LRX2022/scDASFK.
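To make the fuzzy clustering idea concrete, the following is a minimal NumPy sketch of standard fuzzy c-means, in which every cell receives a soft membership in every cluster. It is an illustration only, not the scDASFK implementation: the abstract does not specify the adaptive objective, the feedback mechanism or the network architecture, and none of them are reproduced here. The function names, the fuzzifier `m` and the use of a plain feature matrix `X` (standing in for denoised latent features) are all assumptions.

```python
import numpy as np


def fuzzy_memberships(X, centers, m=2.0, eps=1e-12):
    """Standard fuzzy c-means membership update (hypothetical helper).

    X: (n_cells, n_features) matrix, e.g. latent features.
    centers: (k, n_features) cluster centers.
    Returns U of shape (n_cells, k); each row sums to 1.
    """
    # Squared Euclidean distance from every cell to every center: (n, k).
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2) + eps
    # u_ij is proportional to d_ij^(-2/(m-1)); on squared distances
    # the exponent becomes -1/(m-1).
    inv = d2 ** (-1.0 / (m - 1.0))
    return inv / inv.sum(axis=1, keepdims=True)


def update_centers(X, U, m=2.0):
    """Centers as means of the cells weighted by memberships raised to m."""
    W = U ** m  # (n, k)
    return (W.T @ X) / W.sum(axis=0)[:, None]


def fuzzy_c_means(X, k, m=2.0, n_iter=100, tol=1e-6, seed=0):
    """Alternate membership and center updates until the centers stop moving."""
    rng = np.random.default_rng(seed)
    # Initialise centers from k distinct cells (an assumption of this sketch).
    centers = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iter):
        U = fuzzy_memberships(X, centers, m)
        new_centers = update_centers(X, U, m)
        shift = np.abs(new_centers - centers).max()
        centers = new_centers
        if shift < tol:
            break
    return fuzzy_memberships(X, centers, m), centers
```

Calling `U, centers = fuzzy_c_means(X, k=2)` returns one membership row per cell, and `U.argmax(axis=1)` gives hard labels when they are needed. In a pipeline like the one the abstract describes, `X` would presumably be the autoencoder's latent representation rather than raw counts.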

Source
http://dx.doi.org/10.1093/bib/bbad021

Publication Analysis

Top Keywords (keyword: count)

scrna-seq samples: 16
self-attention mechanism: 12
clustering: 9
denoising adaptive: 8
analysis scrna-seq: 8
clustering model: 8
scrna-seq: 6
denoising: 5
scdasfk: 5
adaptive deep: 4

Similar Publications

Background: Lung adenocarcinoma is one of the most common malignant tumors worldwide. Its complex molecular mechanisms and high tumor heterogeneity pose significant challenges for clinical treatment. The manganese ion metabolism family plays a crucial role in various biological processes, and the abnormal expression of the NUDT3 gene in multiple cancers has drawn considerable attention.

Introduction: Our aim was to investigate the insufficiently understood differences in the immune system between anti-citrullinated peptide antibody (ACPA)-positive (ACPA+) and ACPA-negative (ACPA-) early rheumatoid arthritis (eRA) patients.

Methods: We performed multiple cytokine assays using sera from drug-naïve ACPA+ and ACPA- eRA patients. Additionally, we conducted single-cell RNA sequencing of CD45+ cells from peripheral blood samples to analyze and compare the distribution and functional characteristics of the cell subsets based on the ACPA status.

Background: Immune cells within tumor tissues play important roles in remodeling the tumor microenvironment, thus affecting tumor progression and the therapeutic response. The current study was designed to identify key markers of plasma cells and explore their role in high-grade serous ovarian cancer (HGSOC).

Methods: We utilized single-cell sequencing data from the Gene Expression Omnibus (GEO) database to identify key immune cell types within HGSOC tissues and to extract related markers via the Seurat package.

Background: Fibrotic skin disease represents a major global healthcare burden, characterized by fibroblast hyperproliferation and excessive accumulation of extracellular matrix components. Immune cells are postulated to play a pivotal role in the development of fibrotic skin disease. Single-cell RNA sequencing has been used to explore the composition and functionality of immune cells present in fibrotic skin diseases.

Motivation: Bispecific antibodies (bsAbs) that bind to two distinct surface antigens on cancer cells are emerging as an appealing therapeutic strategy in cancer immunotherapy. However, considering the vast number of surface proteins, experimental identification of potential antigen pairs that are selectively expressed in cancer cells and not in normal cells is both costly and time-consuming. Recent studies have utilized large bulk RNA-seq databases to propose bispecific targets for various cancers.
