A fast and globally optimal solution for RNA-seq quantification.

Brief Bioinform

School of Life Sciences, Southern University of Science and Technology, 1088 Xueyuan Blvd, Shenzhen 518055, Guangdong, China.

Published: September 2023

Alignment-based RNA-seq quantification methods typically involve a time-consuming alignment process prior to estimating transcript abundances. In contrast, alignment-free RNA-seq quantification methods bypass this step, resulting in significant speed improvements. Existing alignment-free methods rely on the Expectation-Maximization (EM) algorithm for estimating transcript abundances. However, EM algorithms only guarantee locally optimal solutions, leaving room for further accuracy improvement by finding a globally optimal solution. In this study, we present TQSLE, the first alignment-free RNA-seq quantification method that provides a globally optimal solution for transcript abundances estimation. TQSLE adopts a two-step approach: first, it constructs a k-mer frequency matrix A for the reference transcriptome and a k-mer frequency vector b for the RNA-seq reads; then, it directly estimates transcript abundances by solving the linear equation ATAx = ATb. We evaluated the performance of TQSLE using simulated and real RNA-seq data sets and observed that, despite comparable speed to other alignment-free methods, TQSLE outperforms them in terms of accuracy. TQSLE is freely available at https://github.com/yhg926/TQSLE.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bib/bbad298DOI Listing

Publication Analysis

Top Keywords

rna-seq quantification
16
transcript abundances
16
globally optimal
12
optimal solution
12
quantification methods
8
estimating transcript
8
alignment-free rna-seq
8
alignment-free methods
8
k-mer frequency
8
rna-seq
6

Similar Publications

Background: Drought stress is a major environmental constraint affecting crop yields. Plants in agricultural and natural environments have developed various mechanisms to cope with drought stress. Identifying genes associated with drought stress tolerance in potato and elucidating their regulatory mechanisms is crucial for the breeding of new potato germplasms.

View Article and Find Full Text PDF

Background: Alternative cleavage and polyadenylation (APA) is a crucial post-transcriptional gene regulation mechanism that regulates gene expression in eukaryotes by increasing the diversity and complexity of both the transcriptome and proteome. Despite the development of more than a dozen experimental methods over the last decade to identify and quantify APA events, widespread adoption of these methods has been limited by technical, financial, and time constraints. Consequently, APA remains poorly understood in most eukaryotes.

View Article and Find Full Text PDF

Transcriptome and translatome profiling of Col-0 and grp7grp8 under ABA treatment in Arabidopsis.

Sci Data

December 2024

Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China.

Abscisic acid (ABA) is a crucial phytohormone that regulates plant growth and stress responses. While substantial knowledge exists about transcriptional regulation, the molecular mechanisms underlying ABA-triggered translational regulation remain unclear. Recent advances in deep sequencing of ribosome footprints (Ribo-seq) enable the mapping and quantification of mRNA translation efficiency.

View Article and Find Full Text PDF

Background: Extracellular vesicles (EVs) are essential for cell-to-cell communication because they transport functionally active molecules, including proteins, RNA, and lipids, from secretory cells to nearby or distant target cells. Seminal plasma contains a large number of EVs (sEVs) that are phenotypically heterogeneous. The aim of the present study was to identify the RNA species contained in two subsets of porcine sEVs of different sizes, namely small sEVs (S-sEVs) and large sEVs (L-sEVs).

View Article and Find Full Text PDF

RNA sequencing (RNA-seq) is widely adopted for transcriptome analysis but has inherent biases that hinder the comprehensive detection and quantification of alternative splicing. To address this, we present an efficient targeted RNA-seq method that greatly enriches for splicing-informative junction-spanning reads. Local splicing variation sequencing (LSV-seq) utilizes multiplexed reverse transcription from highly scalable pools of primers anchored near splicing events of interest.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!