Shark: fishing relevant reads in an RNA-Seq sample.

Bioinformatics

Department of Informatics, Systems and Communication, University of Milano-Bicocca, Milano 20126, Italy.

Published: May 2021

Motivation: Recent advances in high-throughput RNA-Seq technologies allow to produce massive datasets. When a study focuses only on a handful of genes, most reads are not relevant and degrade the performance of the tools used to analyze the data. Removing irrelevant reads from the input dataset leads to improved efficiency without compromising the results of the study.

Results: We introduce a novel computational problem, called gene assignment and we propose an efficient alignment-free approach to solve it. Given an RNA-Seq sample and a panel of genes, a gene assignment consists in extracting from the sample, the reads that most probably were sequenced from those genes. The problem becomes more complicated when the sample exhibits evidence of novel alternative splicing events. We implemented our approach in a tool called Shark and assessed its effectiveness in speeding up differential splicing analysis pipelines. This evaluation shows that Shark is able to significantly improve the performance of RNA-Seq analysis tools without having any impact on the final results.

Availability And Implementation: The tool is distributed as a stand-alone module and the software is freely available at https://github.com/AlgoLab/shark.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8088329PMC
http://dx.doi.org/10.1093/bioinformatics/btaa779DOI Listing

Publication Analysis

Top Keywords

rna-seq sample
8
gene assignment
8
shark fishing
4
fishing relevant
4
reads
4
relevant reads
4
rna-seq
4
reads rna-seq
4
sample
4
sample motivation
4

Similar Publications

Background: MARVEL domain-containing 1 (MARVELD1) has been implicated in the progression of several cancers, but its role in pancreatic adenocarcinoma (PAAD) remains poorly understood.

Methods: RNA-seq data from the TCGA-PAAD and GTEx-Pancreas cohorts were analyzed to assess MARVELD1 expression. Stable MARVELD1 knockdown and overexpression were conducted in BxPC3 and PANC-1 cells.

View Article and Find Full Text PDF

Background: Osteosarcoma is the most common malignant bone tumor in children and adolescents, characterized by high disability and mortality rates. Over the past three decades, therapeutic outcomes have plateaued, underscoring the critical need for innovative therapeutic targets. Solute carrier (SLC) family transporters have been implicated in the malignant progression of a variety of tumors, however, their specific role in osteosarcoma remains poorly understood.

View Article and Find Full Text PDF

Immune checkpoint blockade (ICB) therapies have emerged as a promising avenue for the treatment of various cancers. Despite their success, the efficacy of these treatments is variable across patients and cancer types. Numerous single-cell RNA-sequencing (scRNA-seq) studies have been conducted to unravel cell-specific responses to ICB treatment.

View Article and Find Full Text PDF

Purpose: Investigate the role of lipid metabolism in the tumor immune microenvironment (TIME) of lung adenocarcinoma (LUAD) and identify vital lipid metabolism-related genes (LMRGs) that contribute to immunotherapy outcomes.

Materials And Methods: 1130 LUAD patients were acquired utilizing public databases. Multiple algorithms were used to analyze the contribution of lipid metabolism in TIME.

View Article and Find Full Text PDF

Stemness-associated cell states are linked to chemotherapy resistance in AML. We uncovered a direct mechanistic link between expression of the stem cell transcription factor GATA2 and drug resistance. The GATA-binding protein 2 (GATA2) plays a central role in blood stem cell generation and maintenance.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!