A step-by-step workflow for low-level analysis of single-cell RNA-seq data with Bioconductor.

F1000Res

Cancer Research UK Cambridge Institute, Cambridge, UK; EMBL European Bioinformatics Institute, Cambridge, UK; Wellcome Trust Sanger Institute, Cambridge, UK.

Published: August 2016

Single-cell RNA sequencing (scRNA-seq) is widely used to profile the transcriptome of individual cells. This provides biological resolution that cannot be matched by bulk RNA sequencing, at the cost of increased technical noise and data complexity. The differences between scRNA-seq and bulk RNA-seq data mean that the analysis of the former cannot be performed by recycling bioinformatics pipelines for the latter. Rather, dedicated single-cell methods are required at various steps to exploit the cellular resolution while accounting for technical noise. This article describes a computational workflow for low-level analyses of scRNA-seq data, based primarily on software packages from the open-source Bioconductor project. It covers basic steps including quality control, data exploration and normalization, as well as more complex procedures such as cell cycle phase assignment, identification of highly variable and correlated genes, clustering into subpopulations and marker gene detection. Analyses were demonstrated on gene-level count data from several publicly available datasets involving haematopoietic stem cells, brain-derived cells, T-helper cells and mouse embryonic stem cells. This will provide a range of usage scenarios from which readers can construct their own analysis pipelines.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5112579PMC
http://dx.doi.org/10.12688/f1000research.9501.2DOI Listing

Publication Analysis

Top Keywords

workflow low-level
8
rna-seq data
8
rna sequencing
8
technical noise
8
stem cells
8
data
6
cells
5
step-by-step workflow
4
low-level analysis
4
analysis single-cell
4

Similar Publications

The intelligent selenium-enriched tea withering control system.

Sci Rep

January 2025

College of Intelligent Systems Science and Engineering, Hubei Minzu University, Enshi, 445000, China.

This paper addresses the low level of intelligence in tea processing equipment in Enshi Prefecture by designing an intelligent withering control system based on the STMicroelectronics 32-bit Microcontroller (STM32). This control system can achieve real-time monitoring of the withering environment and automate the control of heating and ventilation dehumidification modules. By integrating IoT technology, relevant users can view the tea production process via mobile devices, enabling intelligent and remote production operations.

View Article and Find Full Text PDF

Lipid analysis of human primary dermal fibroblasts and epidermal keratinocytes after near-infrared exposure using mass spectrometry imaging.

J Biotechnol

December 2024

The Maastricht MultiModal Molecular Imaging (M4I) institute, Division of Imaging Mass Spectrometry (IMS), Maastricht University, Maastricht 6229 ER, The Netherlands. Electronic address:

Photobiomodulation (PBM) therapy is the application of near-infrared (NIR) exposure to injuries or lesions to (among others) improve wound healing, reduce inflammation, and decreases acute and chronic pain. However, the understanding of the molecular mechanism of PBM, more specifically the effects of NIR on skin cells is still lacking behind. Lipids are essential components of cellular membranes that are integral to skin structure and function.

View Article and Find Full Text PDF

High-throughput drug discovery on the microgram scale is now common, making analyte quantitation without molecule-specific calibration imperative. The charged aerosol detector (CAD) was invented to be a next-generation universal liquid chromatography (LC) detector with excellent response universality for nonvolatile analytes as well as sensitivity for nonchromophoric compounds. Although the CAD is a mass flow-sensitive detector, its response to mass is inherently nonlinear, which challenges traditional quantitation.

View Article and Find Full Text PDF

Multimodal representations of biomedical knowledge from limited training whole slide images and reports using deep learning.

Med Image Anal

October 2024

Information Systems Institute, University of Applied Sciences Western Switzerland (HES-SO Valais), Sierre, Switzerland; Department of Neurosciences, University of Padua, Padua, Italy.

The increasing availability of biomedical data creates valuable resources for developing new deep learning algorithms to support experts, especially in domains where collecting large volumes of annotated data is not trivial. Biomedical data include several modalities containing complementary information, such as medical images and reports: images are often large and encode low-level information, while reports include a summarized high-level description of the findings identified within data and often only concerning a small part of the image. However, only a few methods allow to effectively link the visual content of images with the textual content of reports, preventing medical specialists from properly benefitting from the recent opportunities offered by deep learning models.

View Article and Find Full Text PDF

Advances in proteomics and mass spectrometry enable the study of limited cell populations, where high-mass accuracy instruments are typically required. While triple quadrupoles offer fast and sensitive low-mass accuracy measurements, these instruments are effectively restricted to targeted proteomics. Linear ion traps (LITs) offer a versatile, cost-effective alternative capable of both targeted and global proteomics.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!