Detecting large deletions at base pair level by combining split read and paired read data.

BMC Bioinformatics

Department of Computer Science, Tennessee State University, 3500 John A. Merritt Blvd., Nashville, 37221, Tennessee, USA.

Published: October 2017

Background: Genomic structural variants (SV) play a significant role in the onset and progression of cancer. Genomic deletions can create oncogenic fusion genes or cause the loss of tumor suppressing gene function which can lead to tumorigenesis by downregulating these genes. Detecting these variants has clinical importance in the treatment of diseases. Furthermore, it is also clinically important to detect their breakpoint boundaries at high resolution. We have generalized the framework of a previously-published algorithm that located translocations, and we have applied that framework to develop a method to locate deletions at base pair level using next-generation sequencing data. Our method uses abnormally mapped read pairs, and then subsequently maps split reads to identify precise breakpoints.

Results: On a primary prostate cancer dataset and a simulated dataset, our method predicted the number, type, and breakpoints of biologically validated SVs at high accuracy. It also outperformed two existing algorithms on precise breakpoint prediction, which is clinically important.

Conclusion: Our algorithm, called Pegasus, accurately calls deletion breakpoints. However, the method must be extended to allow for germline variant filtering and heterozygous deletion detection. The source code that implements Pegasus can be downloaded from the following URL: http://github.com/mhayes20/Pegasus .

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5657039PMC
http://dx.doi.org/10.1186/s12859-017-1829-zDOI Listing

Publication Analysis

Top Keywords

deletions base
8
base pair
8
pair level
8
detecting large
4
large deletions
4
level combining
4
combining split
4
split read
4
read paired
4
paired read
4

Similar Publications

Novel 327bp Alu element insertion in LDLR exon 17 causes alternative splicing and familial hypercholesterolemia.

J Clin Lipidol

December 2024

Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, 201002, India; Apollo Genomics Institute, Indraprastha Apollo Hospital, New Delhi, 110076, India. Electronic address:

Background: Homozygous familial hypercholesterolemia (HoFH) is a severe form of familial hypercholesterolemia (FH), characterized by high low-density lipoprotein cholesterol (LDL-C) levels and increased coronary artery disease risk. This study reports a novel Alu insertion in the LDLR gene in a consanguineous Indian family, causing FH.

Objective: To identify and characterize the mutation causing HoFH in a proband and their family members.

View Article and Find Full Text PDF

Gene editing technologies, particularly clustered regularly interspersed short palindromic repeats (CRISPR) and CRISPR-associated (Cas) proteins, have revolutionized the ability to modify gene sequences in living cells for therapeutic purposes. Delivery of CRISPR/Cas ribonucleoprotein (RNP) is preferred over its DNA and RNA formats in terms of gene editing effectiveness and low risk of off-target events. However, the intracellular delivery of RNP poses significant challenges and necessitates the development of non-viral vectors.

View Article and Find Full Text PDF

Current progress in CRISPR-Cas systems for rare diseases.

Prog Mol Biol Transl Sci

January 2025

Department of Biotechnology, Faculty of Engineering and Technology, Rama University, Kanpur, Uttar Pradesh, India. Electronic address:

The groundbreaking CRISPR-Cas gene editing method permits exact genetic code alteration. The "CRISPR" DNA protects bacteria from viruses. CRISPR-Cas utilizes a guide RNA to steer the Cas enzyme to the genome's gene editing target.

View Article and Find Full Text PDF

Malic acid markedly affects watermelon flavor. Reducing the malic acid content can significantly increase the sweetness of watermelon. An effective solution strategy is to reduce watermelon malic acid content through molecular breeding technology.

View Article and Find Full Text PDF

Structural variations (SVs) play important roles in genetic diversity, evolution, and carcinogenesis and are, as such, important for human health. However, it remains unclear how spatial proximity of double-strand breaks (DSBs) affects the formation of SVs. To investigate if spatial proximity between two DSBs affects DNA repair, we used data from 3C experiments (Hi-C, ChIA-PET, and ChIP-seq) to identify highly interacting loci on six different chromosomes.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!