Background: Detecting somatic mutations in whole exome sequencing data of cancer samples has become a popular approach for profiling cancer development, progression and chemotherapy resistance. Several studies have proposed software packages, filters and parametrizations. However, many research groups reported low concordance among different methods. We aimed to develop a pipeline which detects a wide range of single nucleotide mutations with high validation rates. We combined two standard tools - Genome Analysis Toolkit (GATK) and MuTect - to create the GATK-LOD method. As proof of principle, we applied our pipeline to exome sequencing data of hematological (Acute Myeloid and Acute Lymphoblastic Leukemias) and solid (Gastrointestinal Stromal Tumor and Lung Adenocarcinoma) tumors. We performed experiments on simulated data to test the sensitivity and specificity of our pipeline.

Results: The software MuTect presented the highest validation rate (90 %) for mutation detection, but limited number of somatic mutations detected. The GATK detected a high number of mutations but with low specificity. The GATK-LOD increased the performance of the GATK variant detection (from 5 of 14 to 3 of 4 confirmed variants), while preserving mutations not detected by MuTect. However, GATK-LOD filtered more variants in the hematological samples than in the solid tumors. Experiments in simulated data demonstrated that GATK-LOD increased both specificity and sensitivity of GATK results.

Conclusion: We presented a pipeline that detects a wide range of somatic single nucleotide variants, with good validation rates, from exome sequencing data of cancer samples. We also showed the advantage of combining standard algorithms to create the GATK-LOD method, that increased specificity and sensitivity of GATK results. This pipeline can be helpful in discovery studies aimed to profile the somatic mutational landscape of cancer genomes.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5123378PMC
http://dx.doi.org/10.1186/s12859-016-1190-7DOI Listing

Publication Analysis

Top Keywords

sequencing data
16
single nucleotide
12
exome sequencing
12
somatic single
8
somatic mutations
8
data cancer
8
cancer samples
8
pipeline detects
8
detects wide
8
wide range
8

Similar Publications

Background And Objectives: Although multiple sclerosis (MS) can be conceptualized as a network disorder, brain network analyses typically require advanced MRI sequences not commonly acquired in clinical practice. Using conventional MRI, we assessed cross-sectional and longitudinal structural disconnection and morphometric similarity networks in people with MS (pwMS), along with their relationship with clinical disability.

Methods: In this longitudinal monocentric study, 3T structural MRI of pwMS and healthy controls (HC) was retrospectively analyzed.

View Article and Find Full Text PDF

Background: One in five sebaceous tumour (ST) patients may have Lynch syndrome (LS), a hereditary cancer predisposition. LS patients benefit from cancer surveillance and prevention programmes and immunotherapy. Whilst universal tumour mismatch repair (MMR) deficiency testing is recommended in colorectal and endometrial cancers to screen for LS, there is no consensus screening strategy for ST, leading to low testing rates and inequity of care.

View Article and Find Full Text PDF

Cost-benefit tradeoff mediates the transition from rule-based to memory-based processing during practice.

PLoS Biol

January 2025

Cognitive Control Collaborative, University of Iowa, Iowa City, Iowa, United States of America.

Practice not only improves task performance but also changes task execution from rule- to memory-based processing by incorporating experiences from practice. However, how and when this change occurs is unclear. We test the hypothesis that strategy transitions in task learning can result from decision-making guided by cost-benefit analysis.

View Article and Find Full Text PDF

The continued evolution of SARS-CoV-2 variants capable of subverting vaccine and infection-induced immunity suggests the advantage of a broadly protective vaccine against betacoronaviruses (β-CoVs). Recent studies have isolated monoclonal antibodies (mAbs) from SARS-CoV-2 recovered-vaccinated donors capable of neutralizing many variants of SARS-CoV-2 and other β-CoVs. Many of these mAbs target the conserved S2 stem region of the SARS-CoV-2 spike protein, rather than the receptor binding domain contained within S1 primarily targeted by current SARS-CoV-2 vaccines.

View Article and Find Full Text PDF

Antibodies are extensively used in biomedical research, clinical fields, and disease treatment. However, to enhance the reproducibility and reliability of antibody-based experiments, it is crucial to have a detailed understanding of the antibody's target specificity and epitope. In this study, we developed a high-throughput and precise epitope analysis method, DECODE (Decoding Epitope Composition by Optimized-mRNA-display, Data analysis, and Expression sequencing).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!