Quantifying the benefit offered by transcript assembly with Scallop-LR on single-molecule long reads.

Genome Biol

Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, 15213, PA, USA.

Published: December 2019

Single-molecule long-read sequencing has been used to improve mRNA isoform identification. However, not all single-molecule long reads represent full transcripts due to incomplete cDNA synthesis and sequencing length limits. This drives a need for long-read transcript assembly. By adding long-read-specific optimizations to Scallop, we developed Scallop-LR, a reference-based long-read transcript assembler. Analyzing 26 PacBio samples, we quantified the benefit of performing transcript assembly on long reads. We demonstrate Scallop-LR identifies more known transcripts and potentially novel isoforms for the human transcriptome than Iso-Seq Analysis and StringTie, indicating that long-read transcript assembly by Scallop-LR can reveal a more complete human transcriptome.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6918626PMC
http://dx.doi.org/10.1186/s13059-019-1883-0DOI Listing

Publication Analysis

Top Keywords

transcript assembly
16
long reads
12
long-read transcript
12
assembly scallop-lr
8
single-molecule long
8
human transcriptome
8
transcript
5
quantifying benefit
4
benefit offered
4
offered transcript
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!