Background: RNA sequencing allows the study of both gene expression changes and transcribed mutations, providing a highly effective way to gain insight into cancer biology. When planning the sequencing of a large cohort of samples, library size is a fundamental factor affecting both the overall cost and the quality of the results. Here we specifically address how overall library size influences the detection of somatic mutations in RNA-seq data in two acute myeloid leukaemia datasets. RESULTS : We simulated shallower sequencing depths by downsampling 45 acute myeloid leukaemia samples (100 bp PE) that are part of the Leucegene project, which were originally sequenced at high depth. We compared the sensitivity of six methods of recovering validated mutations on the same samples. The methods compared are a combination of three popular callers (MuTect, VarScan, and VarDict) and two filtering strategies. We observed an incremental loss in sensitivity when simulating libraries of 80M, 50M, 40M, 30M and 20M fragments, with the largest loss detected with less than 30M fragments (below 90%, average loss of 7%). The sensitivity in recovering insertions and deletions varied markedly between callers, with VarDict showing the highest sensitivity (60%). Single nucleotide variant sensitivity is relatively consistent across methods, apart from MuTect, whose default filters need adjustment when using RNA-Seq. We also analysed 136 RNA-Seq samples from the TCGA-LAML cohort (50 bp PE) and assessed the change in sensitivity between the initial libraries (average 59M fragments) and after downsampling to 40M fragments. When considering single nucleotide variants in recurrently mutated myeloid genes we found a comparable performance, with a 6% average loss in sensitivity using 40M fragments.

Conclusions: Between 30M and 40M 100 bp PE reads are needed to recover 90-95% of the initial variants on recurrently mutated myeloid genes. To extend this result to another cancer type, an exploration of the characteristics of its mutations and gene expression patterns is suggested.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7708150PMC
http://dx.doi.org/10.1186/s12859-020-03860-4DOI Listing

Publication Analysis

Top Keywords

library size
12
loss sensitivity
12
gene expression
8
acute myeloid
8
myeloid leukaemia
8
average loss
8
single nucleotide
8
variants recurrently
8
recurrently mutated
8
mutated myeloid
8

Similar Publications

High-affinity VNARs targeting human hemoglobin: Screening, stability and binding analysis.

Int J Biol Macromol

January 2025

College of Ocean Food and Biological Engineering, Jimei University, Xiamen 361021, China. Electronic address:

Hemoglobin, composed of α- and β-chains, is essential for oxygen transport and is key in diagnosing and treating gastrointestinal and blood disorders. It also aids in detecting blood contamination and estimating transfusion volumes. Immunological methods, based on antigen-antibody interactions, are distinguished by their high sensitivity and accuracy.

View Article and Find Full Text PDF

Background: Chronic ankle instability (CAI) has been associated with neuromuscular control dysfunction, particularly of the peroneal musculature.

Research Question: How do neuromuscular characteristics of the peroneal muscles, including corticospinal excitability, strength, proprioception (force sense) and electromyographic measures differ in individuals with CAI compared to healthy control counterparts aged 18-45?

Methods: A systematic review with meta-analysis was conducted by retrieving relevant articles from electronic databases including EBSCOhost (CINAHL Complete, AMED, SPORTDiscus), Ovid (MEDLINE, Embase), Web of Science, Scopus and Cochrane Library as well as Grey literature sources. The eligibility and methodological quality of the included case-control and cross-sectional studies were assessed by two reviewers.

View Article and Find Full Text PDF

Assembly and Annotation of the Tetraploid Salsola tragus (Russian thistle) Genome.

Genome Biol Evol

January 2025

Department of Agricultural Biology, 1177 Campus Delivery, Colorado State University, Fort Collins, CO, 80523, USA.

This report presents two phased chromosome-scale genome assemblies of allotetraploid Salsola tragus (2n=4x=36) and fills the current genomics resource gap for this species. Flow cytometry estimated 1C genome size was 1.319 Gbp.

View Article and Find Full Text PDF

Long-acting and extended-release drug delivery strategies have greatly improved treatment for a variety of medical conditions. Special populations, specifically infants, children, young people, and pregnant and postpartum women, could greatly benefit from access to these strategies but are often excluded from clinical trials. We conducted a systematic review of all clinical studies involving the use of a long-acting intramuscular injection or implant in infants, children, young people, and pregnant and postpartum people.

View Article and Find Full Text PDF

Outcomes of Left Ventricular Assist Devices as Destination Therapy: A Systematic Review with Meta-Analysis.

Life (Basel)

January 2025

Internal Medicine Department, College of Medicine, King Faisal University, Al-Ahsa 31982, Saudi Arabia.

Background: Heart failure (HF) is a chronic condition that significantly affects morbidity and mortality. For patients with end-stage HF who are not candidates for heart transplantation, left ventricular assist devices (LVADs) provide mechanical circulatory support as a long-term solution, known as destination therapy (DT).

Objective: This meta-analysis aims to synthesize evidence on the survival rates, complications, and quality-of-life improvements associated with LVADs used as destination therapy in patients with end-stage HF.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!