Single-cell ribonucleic acid (RNA)-sequencing (scRNA-seq) data analysis refers to the use of appropriate methods to analyze the dataset generated by RNA-sequencing performed on the single-cell transcriptome. It usually contains three steps: normalization to eliminate the technical noise, dimensionality reduction to facilitate visual understanding and data compression and clustering to divide the data into several similarity-based clusters. In addition, the gene expression data contain a large number of zero counts. These zero counts are considered relevant to random dropout events induced by multiple factors in the sequencing experiments, such as low RNA input, and the stochastic nature of the gene expression pattern at the single-cell level. The zero counts can be eliminated only through the analysis of the scRNA-seq data, and although many methods have been proposed to this end, there is still a lack of research on the combined effect of existing methods. In this paper, we summarize the two kinds of normalization, two kinds of dimension reduction and three kinds of clustering methods widely used in the current mainstream scRNA-seq data analysis. Furthermore, we propose to combine these methods into 12 technology combinations, each with a whole set of scRNA-seq data analysis processes. We evaluated the proposed combinations using Goolam, a publicly available scRNA-seq, by comparing the final clustering results and found the most suitable collection scheme of these classic methods. Our results showed that using appropriate technology combinations can improve the efficiency and accuracy of the scRNA-seq data analysis. The combinations not only satisfy the basic requirements of noise reduction, dimension reduction and cell clustering but also ensure preserving the heterogeneity of cells in downstream analysis. The dataset, Goolam, used in the study can be obtained from the ArrayExpress database under the accession number E-MTAB-3321.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bfgp/elac027DOI Listing

Publication Analysis

Top Keywords

scrna-seq data
24
data analysis
20
data
9
gene expression
8
dimension reduction
8
technology combinations
8
analysis
7
scrna-seq
6
methods
6
combinations
5

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!