AI Article Synopsis

  • Tumor driver somatic mutations can aid in diagnosis and targeted therapy, primarily detected from tumor DNA, while RNA sequencing (RNA-seq) is becoming popular for measuring gene expression and structural variations.
  • Detecting intermediate long insertions/deletions (indels) using RNA-seq poses challenges, as most analysis tools overlook these due to complexity.
  • This study assesses the sensitivity of various RNA-seq analysis programs for indel detection and concludes that RNA-seq alignment is critical, providing practical recommendations for accurate indel identification to enhance clinical decision-making.

Article Abstract

Driver somatic mutations are a hallmark of a tumor that can be used for diagnosis and targeted therapy. Mutations are primarily detected from tumor DNA. As dynamic molecules of gene activities, transcriptome profiling by RNA sequence (RNA-seq) is becoming increasingly popular, which not only measures gene expression but also structural variations such as mutations and fusion transcripts. Although single-nucleotide variants (SNVs) can be easily identified from RNA-seq, intermediate long insertions/deletions (indels  > 2 bases and less than sequence reads) cause significant challenges and are ignored by most RNA-seq analysis tools. This study evaluates commonly used RNA-seq analysis programs along with variant and somatic mutation callers in a series of data sets with simulated and known indels. The aim is to develop strategies for accurate indel detection. Our results show that the RNA-seq alignment is the most important step for indel identification and the evaluated programs have a wide range of sensitivity to map sequence reads with indels, from not at all to decently sensitive. The sensitivity is impacted by sequence read lengths. Most variant calling programs rely on hard evidence indels marked in the alignment and the programs with realignment may use soft-clipped reads for indel inferencing. Based on the observations, we have provided practical recommendations for indel detection when different RNA-seq aligners are used and demonstrated the best option with highly reliable results. With careful customization of bioinformatics algorithms, RNA-seq can be reliably used for both SNV and indel mutation detection that can be used for clinical decision-making.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5862335PMC
http://dx.doi.org/10.1093/bib/bbw069DOI Listing

Publication Analysis

Top Keywords

indel detection
12
detection rna-seq
12
rna-seq
8
strategies accurate
8
sequence reads
8
rna-seq analysis
8
indel
6
rna-seq data
4
data tool
4
tool evaluation
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!