In this manuscript, we introduce and benchmark Mandalorion v4.1 for the identification and quantification of full-length transcriptome sequencing reads. Mandalorion v4.1 further improves upon the already strong performance of Mandalorion v3.6, which was used in the LRGASP consortium challenge. By processing real and simulated data, we show three main features of Mandalorion. First, Mandalorion-based isoform identification has very high precision and maintains high recall even in the absence of any genome annotation. Second, isoform read counts as quantified by Mandalorion show a high correlation with simulated read counts. Third, isoforms identified by Mandalorion closely reflect the full-length transcriptome sequencing data sets they are based on.


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10351160
DOI: http://dx.doi.org/10.1186/s13059-023-02999-6
