Differences in alternative splicing patterns can reveal important markers of phenotypic differentiation, including biomarkers of disease. Emerging large and complex RNA-seq datasets from disease and population studies include multiple confounders such as sex, age, ethnicity and clinical attributes, which demand highly specialized data analysis tools. However, few methods are equipped to handle these new challenges. We describe an implementation of our programs MntJULiP and Jutils, for differential splicing detection and visualization from RNA-seq data, that takes covariates into account. MntJULiP detects intron-level differences in alternative splicing from RNA-seq data using a Bayesian mixture model. Jutils visualizes alternative splicing variation with heatmaps, PCA and sashimi plots, and Venn diagrams. Our tools are scalable and can process thousands of samples within hours. We applied our methods to the collection of GTEx brain RNA-seq samples to deconvolute the effects of sex and age at death on splicing patterns. In particular, clustering of the covariate-adjusted data identifies a subgroup of individuals undergoing a distinct splicing program during aging. MntJULiP and Jutils are implemented in Python and are available from https://github.com/splicebox/.
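As a rough illustration of what adjusting splicing measurements for covariates can look like, the sketch below computes intron usage ratios from read counts and removes the linear effects of sex and age by ordinary least squares. It is not the MntJULiP model, which the abstract describes as a Bayesian mixture model; least squares is swapped in here only because it is simple to show. The function names `splicing_ratios` and `regress_out`, the pseudocount, and the simulated data are all assumptions made for this example.

```python
import numpy as np

def splicing_ratios(counts, pseudocount=0.5):
    """Per-sample usage ratio of each intron within one group of introns.

    counts: array of shape (n_samples, n_introns) with read counts for
    introns that share a splice site (hypothetical input layout).
    """
    counts = np.asarray(counts, dtype=float) + pseudocount
    return counts / counts.sum(axis=1, keepdims=True)

def regress_out(values, covariates):
    """Subtract the fitted linear covariate effects from each column.

    The intercept is kept, so adjusted values stay on the original scale.
    """
    values = np.asarray(values, dtype=float)
    covariates = np.asarray(covariates, dtype=float)
    design = np.column_stack([np.ones(len(covariates)), covariates])
    coef, *_ = np.linalg.lstsq(design, values, rcond=None)
    covariate_effect = design[:, 1:] @ coef[1:]
    return values - covariate_effect

# Simulated data (an assumption, not GTEx): usage of intron A depends on
# sex and age; intron B receives the remaining reads.
rng = np.random.default_rng(0)
n_samples = 200
sex = rng.integers(0, 2, size=n_samples)
age = rng.uniform(20, 70, size=n_samples)
p_intron_a = 0.3 + 0.2 * sex + 0.004 * (age - 45)
reads_a = rng.binomial(100, p_intron_a)
counts = np.column_stack([reads_a, 100 - reads_a])

ratios = splicing_ratios(counts)
adjusted = regress_out(ratios, np.column_stack([sex, age]))
```

After adjustment, the ratios are linearly uncorrelated with both covariates, so any downstream clustering or PCA of `adjusted` reflects variation that sex and age do not explain linearly.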
| Download full-text PDF | Source |
|---|---|
| http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10802308 | PMC |
| http://dx.doi.org/10.1101/2024.01.01.573825 | DOI Listing |
bioRxiv
January 2024
Department of Computer Science, Johns Hopkins University, Baltimore MD 21205.