A survey of algorithms for the detection of genomic structural variants from long-read sequencing data.

Nat Methods

Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children's Hospital of Philadelphia, Philadelphia, PA, USA.

Published: August 2023

As long-read sequencing technologies are becoming increasingly popular, a number of methods have been developed for the discovery and analysis of structural variants (SVs) from long reads. Long reads enable detection of SVs that could not be previously detected from short-read sequencing, but computational methods must adapt to the unique challenges and opportunities presented by long-read sequencing. Here, we summarize over 50 long-read-based methods for SV detection, genotyping and visualization, and discuss how new telomere-to-telomere genome assemblies and pangenome efforts can improve the accuracy and drive the development of SV callers in the future.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11208083PMC
http://dx.doi.org/10.1038/s41592-023-01932-wDOI Listing

Publication Analysis

Top Keywords

long-read sequencing
12
structural variants
8
long reads
8
survey algorithms
4
algorithms detection
4
detection genomic
4
genomic structural
4
variants long-read
4
sequencing
4
sequencing data
4

Similar Publications

The COVID-19 pandemic has underscored the importance of virus surveillance in public health and wastewater-based epidemiology (WBE) has emerged as a non-invasive, cost-effective method for monitoring SARS-CoV-2 and its variants at the community level. Unfortunately, current variant surveillance methods depend heavily on updated genomic databases with data derived from clinical samples, which can become less sensitive and representative as clinical testing and sequencing efforts decline.In this paper, we introduce HERCULES (High-throughput Epidemiological Reconstruction and Clustering for Uncovering Lineages from Environmental SARS-CoV-2), an unsupervised method that uses long-read sequencing of a single 1 Kb fragment of the Spike gene.

View Article and Find Full Text PDF

Resolving the molecular basis of a Mendelian condition remains challenging owing to the diverse mechanisms by which genetic variants cause disease. To address this, we developed a synchronized long-read genome, methylome, epigenome and transcriptome sequencing approach, which enables accurate single-nucleotide, insertion-deletion and structural variant calling and diploid de novo genome assembly. This permits the simultaneous elucidation of haplotype-resolved CpG methylation, chromatin accessibility and full-length transcript information in a single long-read sequencing run.

View Article and Find Full Text PDF

Chromosome-level genome assembly and characterization of Kaixuan 016: A high-oleic peanut variety with improved agronomic traits developed through gamma-radiation-assisted breeding.

Genomics

January 2025

Shennong Laboratory/ Henan Academy of Crop Molecular Breeding, Henan Academy of Agricultural Sciences/Henan Provincial Key Laboratory for Oil Crops Improvement, Zhengzhou 450002, China. Electronic address:

High-oleic peanuts are increasingly valued in agricultural production and consumer markets. Nevertheless, limited genomic information hinders the integration of genetic analyses and modern breeding strategies. This study details a chromosome-level genome assembly of Kaixuan 016, a high-oleic peanut variety developed through gamma-radiation-assisted breeding, exhibiting enhanced agronomic traits.

View Article and Find Full Text PDF

Objective: This study aims to improve genetic diagnosis in childhood onset epilepsy with neurodevelopmental problems by utilizing RNA sequencing of fibroblasts to identify pathogenic variants that may be missed by exome sequencing and copy number variation analysis.

Methods: We enrolled 41 individuals with childhood onset epilepsy and neurodevelopmental problems who previously had inconclusive genetic testing. Fibroblast samples were cultured and analyzed using RNA sequencing to detect aberrant expression, aberrant splicing, and monoallelic expression using the Detection of RNA Outlier Pipeline (DROP) pipeline.

View Article and Find Full Text PDF

Telomemore enables single-cell analysis of cell cycle and chromatin condensation.

Nucleic Acids Res

January 2025

Laboratory for Molecular Infection Medicine Sweden (MIMS), Umeå University, Biomedicinbyggnaden 6K och 6L, Umeå universitetssjukhus, 901 87, Umeå, Sweden.

Single-cell RNA-seq methods can be used to delineate cell types and states at unprecedented resolution but do little to explain why certain genes are expressed. Single-cell ATAC-seq and multiome (ATAC + RNA) have emerged to give a complementary view of the cell state. It is however unclear what additional information can be extracted from ATAC-seq data besides transcription factor binding sites.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!