AI Article Synopsis

  • Most tools for identifying cancer driver genes were designed for whole-exome sequencing, but targeted sequencing is cheaper and provides more detailed data for specific genes.
  • This study tested seven popular driver gene identification tools using targeted sequencing data and found that how these tools account for mutation rates significantly influences their accuracy.
  • The researchers recommend using OncodriveFML, OncodriveCLUSTL, 20/20+, dNdSCv, and ActiveDriver for targeted sequencing, while advising against MutSigCV and DriverML in this context.

Article Abstract

Motivation: Most cancer driver gene identification tools have been developed for whole-exome sequencing data. Targeted sequencing is a popular alternative to whole-exome sequencing for large cancer studies due to its greater depth at a lower cost per tumor. Unlike whole-exome sequencing, targeted sequencing only enables mutation calling for a selected subset of genes. Whether existing driver gene identification tools remain valid in that context has not previously been studied.

Results: We evaluated the validity of seven popular driver gene identification tools when applied to targeted sequencing data. Based on whole-exome data of 14 different cancer types from TCGA, we constructed matching targeted datasets by keeping only the mutations overlapping with the pan-cancer MSK-IMPACT panel and, in the case of breast cancer, also the breast-cancer-specific B-CAST panel. We then compared the driver gene predictions obtained on whole-exome and targeted mutation data for each of the seven tools. Differences in how the tools model background mutation rates were the most important determinant of their validity on targeted sequencing data. Based on our results, we recommend OncodriveFML, OncodriveCLUSTL, 20/20+, dNdSCv, and ActiveDriver for driver gene identification in targeted sequencing data, whereas MutSigCV and DriverML are best avoided in that context.

Availability And Implementation: Code for the analyses is available at https://github.com/SchmidtGroupNKI/TGSdrivergene_validity.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11132814PMC
http://dx.doi.org/10.1093/bioadv/vbae073DOI Listing

Publication Analysis

Top Keywords

driver gene
24
gene identification
20
sequencing data
20
targeted sequencing
20
identification tools
16
whole-exome sequencing
12
sequencing
9
targeted
8
data based
8
data
7

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!