Publications by J Fostier

Publications by authors named "J Fostier"

Page 1 of 8

b-move: Faster Lossless Approximate Pattern Matching in a Run-Length Compressed Index.

Lore Depuydt Luca Renders Simon Van de Vyver Lennart Veys Travis Gagie Jan Fostier

Res Sq

November 2024

Background: Due to the increasing availability of high-quality genome sequences, pan-genomes are gradually replacing single consensus reference genomes in many bioinformatics pipelines to better capture genetic diversity. Traditional bioinformatics tools using the FM-index face memory limitations with such large genome collections. Recent advancements in run-length compressed indices like Gagie et al.

View Article and Find Full Text PDF

Lossless Approximate Pattern Matching: Automated Design of Efficient Search Schemes.

Luca Renders Lore Depuydt Sven Rahmann Jan Fostier

J Comput Biol

October 2024

This study introduces a pioneering approach to automate the creation of search schemes for lossless approximate pattern matching. Search schemes are combinatorial structures that define a series of searches over a partitioned pattern. Each search specifies the processing order of these parts and the cumulative lower and upper bounds on the number of errors in each part of the pattern.

View Article and Find Full Text PDF

Faster Maximal Exact Matches with Lazy LCP Evaluation.

Adrián Goga Lore Depuydt Nathaniel K Brown Jan Fostier Travis Gagie

Proc Data Compress Conf

March 2024

MONI (Rossi et al., 2022) is a BWT-based compressed index for computing the matching statistics and maximal exact matches (MEMs) of a pattern (usually a DNA read) with respect to a highly repetitive text (usually a database of genomes) using two operations: LF-steps and longest common extension (LCE) queries on a grammar-compressed representation of the text. In practice, most of the operations are constant-time LF-steps but most of the time is spent evaluating LCE queries.

View Article and Find Full Text PDF

b-move: faster bidirectional character extensions in a run-length compressed index.

Lore Depuydt Luca Renders Simon Van de Vyver Lennart Veys Travis Gagie Jan Fostier

bioRxiv

June 2024

Due to the increasing availability of high-quality genome sequences, pan-genomes are gradually replacing single consensus reference genomes in many bioinformatics pipelines to better capture genetic diversity. Traditional bioinformatics tools using the FM-index face memory limitations with such large genome collections. Recent advancements in run-length compressed indices like Gagie et al.

View Article and Find Full Text PDF

Pan-genome de Bruijn graph using the bidirectional FM-index.

Lore Depuydt Luca Renders Thomas Abeel Jan Fostier

BMC Bioinformatics

October 2023

Background: Pan-genome graphs are gaining importance in the field of bioinformatics as data structures to represent and jointly analyze multiple genomes. Compacted de Bruijn graphs are inherently suited for this purpose, as their graph topology naturally reveals similarity and divergence within the pan-genome. Most state-of-the-art pan-genome graphs are represented explicitly in terms of nodes and edges.

View Article and Find Full Text PDF