Publications by Felipe Llinares-Lopez

Publications by authors named "Felipe Llinares-Lopez"

Page 1 of 1

InterPro: the protein sequence classification resource in 2025.

Matthias Blum Antonina Andreeva Laise Cavalcanti Florentino Sara Rocio Chuguransky Tiago Grego

Nucleic Acids Res

November 2024

InterPro (https://www.ebi.ac.

View Article and Find Full Text PDF

The Pfam protein families database: embracing AI/ML.

Typhaine Paysan-Lafosse Antonina Andreeva Matthias Blum Sara Rocio Chuguransky Tiago Grego

Nucleic Acids Res

November 2024

The Pfam protein families database is a comprehensive collection of protein domains and families used for genome annotation and protein structure and function analysis (https://www.ebi.ac.

View Article and Find Full Text PDF

Deep embedding and alignment of protein sequences.

Felipe Llinares-López Quentin Berthet Mathieu Blondel Olivier Teboul Jean-Philippe Vert

Nat Methods

January 2023

Protein sequence alignment is a key component of most bioinformatics pipelines to study the structures and functions of proteins. Aligning highly divergent sequences remains, however, a difficult task that current algorithms often fail to perform accurately, leaving many proteins or open reading frames poorly annotated. Here we leverage recent advances in deep learning for language modeling and differentiable programming to propose DEDAL (deep embedding and differentiable alignment), a flexible model to align protein sequences and detect homologs.

View Article and Find Full Text PDF

DeepConsensus improves the accuracy of sequences with a gap-aware sequence transformer.

Gunjan Baid Daniel E Cook Kishwar Shafin Taedong Yun Felipe Llinares-López

Nat Biotechnol

February 2023

Article Synopsis

Scientists developed a new way called DeepConsensus to help correct DNA sequences more accurately than an older method called pbccs.
DeepConsensus uses advanced technology to lower errors in the DNA reads by 42%, which means it helps make the sequencing more reliable.
This new approach not only improves the quality of the DNA readings but also enhances how genes are understood and reduces mistakes in identifying genetic variations.

View Article and Find Full Text PDF

CASMAP: detection of statistically significant combinations of SNPs in association mapping.

Felipe Llinares-López Laetitia Papaxanthos Damian Roqueiro Dean Bodenham Karsten Borgwardt

Bioinformatics

August 2019

Summary: Combinatorial association mapping aims to assess the statistical association of higher-order interactions of genetic markers with a phenotype of interest. This article presents combinatorial association mapping (CASMAP), a software package that leverages recent advances in significant pattern mining to overcome the statistical and computational challenges that have hindered combinatorial association mapping. CASMAP can be used to perform region-based association studies and to detect higher-order epistatic interactions of genetic variants.

View Article and Find Full Text PDF

graphkernels: R and Python packages for graph comparison.

Mahito Sugiyama M Elisabetta Ghisu Felipe Llinares-López Karsten Borgwardt

Bioinformatics

February 2018

Summary: Measuring the similarity of graphs is a fundamental step in the analysis of graph-structured data, which is omnipresent in computational biology. Graph kernels have been proposed as a powerful and efficient approach to this problem of graph comparison. Here we provide graphkernels, the first R and Python graph kernel libraries including baseline kernels such as label histogram based kernels, classic graph kernels such as random walk based kernels, and the state-of-the-art Weisfeiler-Lehman graph kernel.

View Article and Find Full Text PDF

Genome-wide genetic heterogeneity discovery with categorical covariates.

Felipe Llinares-López Laetitia Papaxanthos Dean Bodenham Damian Roqueiro

Bioinformatics

June 2017

Motivation: Genetic heterogeneity is the phenomenon that distinct genetic variants may give rise to the same phenotype. The recently introduced algorithm Fast Automatic Interval Search ( FAIS ) enables the genome-wide search of candidate regions for genetic heterogeneity in the form of any contiguous sequence of variants, and achieves high computational efficiency and statistical power. Although FAIS can test all possible genomic regions for association with a phenotype, a key limitation is its inability to correct for confounders such as gender or population structure, which may lead to numerous false-positive associations.

View Article and Find Full Text PDF

Genome-wide detection of intervals of genetic heterogeneity associated with complex traits.

Felipe Llinares-López Dominik G Grimm Dean A Bodenham Udo Gieraths Mahito Sugiyama

Bioinformatics

June 2015

Motivation: Genetic heterogeneity, the fact that several sequence variants give rise to the same phenotype, is a phenomenon that is of the utmost interest in the analysis of complex phenotypes. Current approaches for finding regions in the genome that exhibit genetic heterogeneity suffer from at least one of two shortcomings: (i) they require the definition of an exact interval in the genome that is to be tested for genetic heterogeneity, potentially missing intervals of high relevance, or (ii) they suffer from an enormous multiple hypothesis testing problem due to the large number of potential candidate intervals being tested, which results in either many false positives or a lack of power to detect true intervals.

Results: Here, we present an approach that overcomes both problems: it allows one to automatically find all contiguous sequences of single nucleotide polymorphisms in the genome that are jointly associated with the phenotype.

View Article and Find Full Text PDF

Publications by authors named "Felipe Llinares-Lopez"

InterPro: the protein sequence classification resource in 2025.

The Pfam protein families database: embracing AI/ML.

Deep embedding and alignment of protein sequences.

DeepConsensus improves the accuracy of sequences with a gap-aware sequence transformer.

Article Synopsis

CASMAP: detection of statistically significant combinations of SNPs in association mapping.

graphkernels: R and Python packages for graph comparison.

Genome-wide genetic heterogeneity discovery with categorical covariates.

Genome-wide detection of intervals of genetic heterogeneity associated with complex traits.

A PHP Error was encountered

A PHP Error was encountered