Tandem repeats (TRs) play important roles in genomic variation and disease risk in humans. Long-read sequencing allows for the accurate characterization of TRs; however, the underlying bioinformatics perspectives remain challenging. We present and TREAT: is a fast targeted local assembler, cross-compatible across different sequencing platforms. It is integrated in TREAT, an end-to-end workflow for TR characterization, visualization, and analysis across multiple genomes. In a comparison with existing tools based on long-read sequencing data from both Oxford Nanopore Technology (ONT, Simplex and Duplex) and Pacific Bioscience (PacBio, Sequel II and Revio), and TREAT achieve state-of-the-art genotyping and motif characterization accuracy. Applied to clinically relevant TRs, TREAT/ significantly identify individuals with pathogenic TR expansions. When applied to a case-control setting, we replicate previously reported associations of TRs with Alzheimer's disease, including those near or within ( = 2.63 × 10), ( = 6.5 × 10), and ( = 0.04) genes. Finally, we use TREAT/ to systematically evaluate potential biases when genotyping TRs using diverse ONT and PacBio long-read sequencing data sets. We show that, in rare cases (0.06%), long-read sequencing from coverage drops in TRs, including the disease-associated TRs in and genes. Such coverage drops can lead to TR misgenotyping, hampering the accurate characterization of TR alleles. Taken together, our tools can accurately genotype TRs across different sequencing technologies and with minimal requirements, allowing end-to-end analysis and comparisons of TRs in human genomes, with broad applications in research and clinical fields.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11610572PMC
http://dx.doi.org/10.1101/gr.279351.124DOI Listing

Publication Analysis

Top Keywords

long-read sequencing
20
trs
9
sequencing platforms
8
accurate characterization
8
sequencing data
8
coverage drops
8
sequencing
7
long-read
5
characterizing tandem
4
tandem repeat
4

Similar Publications

Genes encoding OXA-48-like carbapenem-hydrolyzing enzymes are often located on plasmids and are abundant among carbapenemase-producing (CPE) worldwide. After a large plasmid-mediated outbreak in 2011, routine screening of patients at risk of CPE carriage on admission and every 7 days during hospitalization was implemented in a large hospital in the Netherlands. The objective of this study was to investigate the dynamics of the hospitals' 2011 outbreak-associated plasmid among CPE collected from 2011 to 2021.

View Article and Find Full Text PDF

Emerging carbapenem-resistant in a tertiary care hospital in Lima, Peru.

Microbiol Spectr

January 2025

Instituto de Medicina Tropical Alexander von Humboldt, Universidad Peruana Cayetano Heredia, Lima, Peru.

The emergence of carbapenem-resistant (CRKP) poses a significant public health threat, particularly in low- and middle-income countries (LMICs) with limited surveillance and treatment options. This study examines the genetic diversity, resistance patterns, and transmission dynamics of 66 CRKP isolates recovered over 5 years (2015-2019) after the first case of CRKP was identified at a tertiary care hospital in Lima, Peru. Our findings reveal a shift from to as the dominant carbapenemase gene after 2017.

View Article and Find Full Text PDF

The surveillance of mobile genetic elements facilitating the spread of antimicrobial resistance genes has been challenging. Here, we tracked both clonal and plasmid transmission in colistin- and carbapenem-resistant using short- and long-read sequencing technologies. We observed three clonal transmissions, all containing Incompatibility group (Inc) L plasmids and New Delhi metallo-beta-lactamase , although not co-located on the same plasmid.

View Article and Find Full Text PDF

The number of high-quality genomes is rapidly increasing across taxa. However, it remains limited for coral reef fish of the Pomacentrid family, with most research focused on anemonefish. Here, we present the first assembly for a Pomacentrid of the genus .

View Article and Find Full Text PDF

The Ralstonia solanacearum Species Complex (RSSC) is the most significant plant pathogen group with a wide host range. It is genetically related but displays distinct biological features, such as restrictive geography occurrence. The RSSC comprises three species: Ralstonia pseudosolanacearum (phylotype I and III), Ralstonia solanacearum (phylotype IIA and IIB), and Ralstonia syzygii (phylotype IV) (Fegan and Prior 2005).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!