RNA splicing analysis using heterogeneous and large RNA-seq datasets.

Jorge Vaquero-Garcia, Joseph K Aicher, San Jewell, Matthew R Gazzara, Caleb M Radens, Anupama Jha, Scott S Norton, Nicholas F Lahens, Gregory R Grant, Yoseph Barash

Nat Commun

Department of Genetics, University of Pennsylvania, Philadelphia, PA, USA.

Published: March 2023

The ubiquity of RNA-seq has led to many methods that use RNA-seq data to analyze variations in RNA splicing. However, available methods are not well suited for handling heterogeneous and large datasets. Such datasets scale to thousands of samples across dozens of experimental conditions, exhibit increased variability compared to biological replicates, and involve thousands of unannotated splice variants, resulting in increased transcriptome complexity. We describe here a suite of algorithms and tools implemented in the MAJIQ v2 package to address challenges in detection, quantification, and visualization of splicing variations from such datasets. Using both large-scale synthetic data and GTEx v8 as benchmark datasets, we assess the advantages of MAJIQ v2 compared to existing methods. We then apply the MAJIQ v2 package to analyze differential splicing across 2,335 samples from 13 brain subregions, demonstrating its ability to offer insights into brain subregion-specific splicing regulation.

PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9984406
DOI: http://dx.doi.org/10.1038/s41467-023-36585-y
