LongAGE: defining breakpoints of genomic structural variants through optimal and memory efficient alignments of long reads.

Bioinformatics

Department of Health Sciences Research, Center for Individualized Medicine, Mayo Clinic, Rochester, MN 55905, USA.

Published: May 2021

Summary: Defining the precise location of structural variations (SVs) at single-nucleotide breakpoint resolution is a challenging problem due to large gaps in alignment. Previously, Alignment with Gap Excision (AGE) enabled us to define breakpoints of SVs at single-nucleotide resolution; however, AGE requires a vast amount of memory when aligning a pair of long sequences. To address this, we developed a memory-efficient implementation-LongAGE-based on the classical Hirschberg algorithm. We demonstrate an application of LongAGE for resolving breakpoints of SVs embedded into segmental duplications on Pacific Biosciences (PacBio) reads that can be longer than 10 kb. Furthermore, we observed different breakpoints for a deletion and a duplication in the same locus, providing direct evidence that such multi-allelic copy number variants (mCNVs) arise from two or more independent ancestral mutations.

Availability And Implementation: LongAGE is implemented in C++ and available on Github at https://github.com/Coaxecva/LongAGE.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8128450PMC
http://dx.doi.org/10.1093/bioinformatics/btaa703DOI Listing

Publication Analysis

Top Keywords

svs single-nucleotide
8
breakpoints svs
8
longage defining
4
breakpoints
4
defining breakpoints
4
breakpoints genomic
4
genomic structural
4
structural variants
4
variants optimal
4
optimal memory
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!