Stormbow: A Cloud-Based Tool for Reads Mapping and Expression Quantification in Large-Scale RNA-Seq Studies.

ISRN Bioinform

Translational Informatics IT, Janssen Research & Development, LLC, 3210 Merryfield Row, San Diego, CA 92121, USA.

Published: May 2015

AI Article Synopsis

  • RNA-Seq is gaining traction as a more effective alternative to microarrays for analyzing gene expression, bolstered by decreased sequencing costs and a surge in data availability.
  • To address the challenges of processing large RNA-Seq datasets locally, a cloud-based tool named Stormbow was developed, which can handle data analysis in parallel, significantly speeding up the process.
  • In tests, Stormbow processed samples with 100 million reads in 6 to 8 hours for an average cost of $3.50 per sample, making it a scalable, cost-effective, and user-friendly solution for researchers.

Article Abstract

RNA-Seq is becoming a promising replacement to microarrays in transcriptome profiling and differential gene expression study. Technical improvements have decreased sequencing costs and, as a result, the size and number of RNA-Seq datasets have increased rapidly. However, the increasing volume of data from large-scale RNA-Seq studies poses a practical challenge for data analysis in a local environment. To meet this challenge, we developed Stormbow, a cloud-based software package, to process large volumes of RNA-Seq data in parallel. The performance of Stormbow has been tested by practically applying it to analyse 178 RNA-Seq samples in the cloud. In our test, it took 6 to 8 hours to process an RNA-Seq sample with 100 million reads, and the average cost was $3.50 per sample. Utilizing Amazon Web Services as the infrastructure for Stormbow allows us to easily scale up to handle large datasets with on-demand computational resources. Stormbow is a scalable, cost effective, and open-source based tool for large-scale RNA-Seq data analysis. Stormbow can be freely downloaded and can be used out of box to process Illumina RNA-Seq datasets.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4393068PMC
http://dx.doi.org/10.1155/2013/481545DOI Listing

Publication Analysis

Top Keywords

large-scale rna-seq
12
rna-seq
9
stormbow cloud-based
8
rna-seq studies
8
rna-seq datasets
8
data analysis
8
rna-seq data
8
stormbow
6
cloud-based tool
4
tool reads
4

Similar Publications

Motivation: Bispecific antibodies (bsAbs) that bind to two distinct surface antigens on cancer cells are emerging as an appealing therapeutic strategy in cancer immunotherapy. However, considering the vast number of surface proteins, experimental identification of potential antigen pairs that are selectively expressed in cancer cells and not in normal cells is both costly and time-consuming. Recent studies have utilized large bulk RNA-seq databases to propose bispecific targets for various cancers.

View Article and Find Full Text PDF

Targeted barcoding of variable antibody domains and individual transcriptomes of the human B-cell repertoire using Link-Seq.

PNAS Nexus

January 2025

Institute of Bioengineering, School of Engineering, École Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland.

Here, we present Link-Seq, a highly efficient droplet microfluidic method for combined sequencing of antibody-encoding genes and the transcriptome of individual B cells at large scale. The method is based on 3' barcoding of the transcriptome and subsequent single-molecule PCR in droplets, which freely shift the barcode along specific gene regions, such as the antibody heavy- and light-chain genes. Using the immune repertoire of COVID-19 patients and healthy donors as a model system, we obtain up to 91.

View Article and Find Full Text PDF

Deciphering cell states and the cellular ecosystem to improve risk stratification in acute myeloid leukemia.

Brief Bioinform

November 2024

State Key Laboratory of Cellular Stress Biology, Xiang'an Hospital, School of Life Sciences, Faculty of Medicine and Life Sciences, Xiamen University, No. 4221, Xiang'an South Road, Xiamen, Fujian 361102, China.

Acute myeloid leukemia (AML) demonstrates significant cellular heterogeneity in both leukemic and immune cells, providing valuable insights into clinical outcomes. Here, we constructed an AML single-cell transcriptome atlas and proposed sciNMF workflow to systematically dissect underlying cellular heterogeneity. Notably, sciNMF identified 26 leukemic and immune cell states that linked to clinical variables, mutations, and prognosis.

View Article and Find Full Text PDF

The rapid advance of large-scale atlas-level single cell RNA sequences and single-cell chromatin accessibility data provide extraordinary avenues to broad and deep insight into complex biological mechanism. Leveraging the datasets and transfering labels from scRNA-seq to scATAC-seq will empower the exploration of single-cell omics data. However, the current label transfer methods have limited performance, largely due to the lower capable of preserving fine-grained cell populations and intrinsic or extrinsic heterogeneity between datasets.

View Article and Find Full Text PDF

The expansion of single-cell analytical techniques has empowered the exploration of diverse biological questions at the individual cells. Droplet-based single-cell RNA sequencing (scRNA-seq) methods have been particularly widely used due to their high-throughput capabilities and small reaction volumes. While commercial systems have contributed to the widespread adoption of droplet-based scRNA-seq, their relatively high cost limits the ability to profile large numbers of cells and samples.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!