Scdrake: a reproducible and scalable pipeline for scRNA-seq data analysis.

Bioinform Adv

Laboratory of Genomics and Bioinformatics, Institute of Molecular Genetics of the Czech Academy of Sciences, Vídeňská 1083, 142 20 Prague 4, Czech Republic.

Published: July 2023

AI Article Synopsis

  • - The text discusses the need for a reliable and automated secondary analysis pipeline for single-cell RNA-seq (scRNA-seq) data, as current methods often rely on custom scripts and lack reproducibility in R.
  • - A solution called scdrake has been developed, which automates various secondary analysis steps, from quality control to differential expression analysis, while ensuring efficiency and reproducibility.
  • - Scdrake is available as a Docker image and can be accessed through GitHub, offering users structured documentation and supplementary data for easier implementation and use.

Article Abstract

Motivation: While the workflow for primary analysis of single-cell RNA-seq (scRNA-seq) data is well established, the secondary analysis of the feature-barcode matrix is usually done by custom scripts. There is no fully automated pipeline in the R statistical environment, which would follow the current best programming practices and requirements for reproducibility.

Results: We have developed scdrake, a fully automated workflow for secondary analysis of scRNA-seq data, which is fully implemented in the R language and built within the drake framework. The pipeline includes quality control, cell and gene filtering, normalization, detection of highly variable genes, dimensionality reduction, clustering, cell type annotation, detection of marker genes, differential expression analysis and integration of multiple samples. The pipeline is reproducible and scalable, has an efficient execution, provides easy extendability and access to intermediate results and outputs rich HTML reports. Scdrake is distributed as a Docker image, which provides a straightforward setup and enhances reproducibility.

Availability And Implementation: The source code and documentation are available under the MIT license at https://github.com/bioinfocz/scdrake and https://bioinfocz.github.io/scdrake, respectively.

Supplementary Information: Supplementary data are available at online.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10351969PMC
http://dx.doi.org/10.1093/bioadv/vbad089DOI Listing

Publication Analysis

Top Keywords

scrna-seq data
12
reproducible scalable
8
secondary analysis
8
fully automated
8
analysis
5
scdrake reproducible
4
pipeline
4
scalable pipeline
4
pipeline scrna-seq
4
data
4

Similar Publications

Single-nucleus RNA sequencing (snRNA-seq), an alternative to single-cell RNA sequencing (scRNA-seq), encounters technical challenges in obtaining high-quality nuclei and RNA, persistently hindering its applications. Here, we present a robust technique for isolating nuclei across various tissue types, remarkably enhancing snRNA-seq data quality. Employing this approach, we comprehensively characterize the depot-dependent cellular dynamics of various cell types underlying mouse adipose tissue remodeling during obesity.

View Article and Find Full Text PDF

Purpose: Previous studies have reported divergent sexual responses to aging; however, specific variations in gene expression between aging males and females and their potential association with age-related retinal diseases remain unclear. This study collected data from public databases and developed a comprehensive comparison of retina between aging females and males.

Methods: Single-cell RNA (scRNA) and bulk RNA sequencing data of the aging retina from females and males in public databases were utilized for integrated analysis to investigate sex-biased expression in retina.

View Article and Find Full Text PDF

Replication timing (RT) allows us to analyze temporal patterns of genome-wide replication, i.e., if genes replicate early or late during the S-phase of the cell cycle.

View Article and Find Full Text PDF

Unlabelled: Quantitative understanding of mitochondrial heterogeneity is necessary for elucidating the precise role of these multifaceted organelles in tumor cell development. We demonstrate an early mechanistic role of mitochondria in initiating neoplasticity by performing quantitative analyses of structure-function of single mitochondrial components coupled with single cell transcriptomics. We demonstrate that the large Hyperfused-Mitochondrial-Networks (HMNs) of keratinocytes promptly get converted to the heterogenous Small-Mitochondrial-Networks (SMNs) as the stem cell enriching dose of the model carcinogen, TCDD, depolarizes mitochondria.

View Article and Find Full Text PDF

Cancer cells within tumors exhibit a wide range of phenotypic states driven by non-genetic mechanisms in addition to extensively studied genetic alterations. Conversions among cancer cell states can result in intratumoral heterogeneity which contributes to metastasis and development of drug resistance. However, mechanisms underlying the initiation and/or maintenance of such phenotypic plasticity are poorly understood.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!