Publications by authors named "Manuel Lafond"

Gene duplication has a central role in evolution; still, little is known on the fates of the duplicated copies, their relative frequency, and on how environmental conditions affect them. Moreover, the lack of rigorous definitions concerning the fate of duplicated genes hinders the development of a global vision of this process. In this paper, we present a new framework aiming at characterizing and formally differentiating the fate of duplicated genes.

View Article and Find Full Text PDF

Motivation: Noncoding RNAs (ncRNAs) express their functions by adopting molecular structures. Specifically, RNA secondary structures serve as a relatively stable intermediate step before tertiary structures, offering a reliable signature of molecular function. Consequently, within an RNA functional family, secondary structures are generally more evolutionarily conserved than sequences.

View Article and Find Full Text PDF

Background: Horizontal gene transfer inference approaches are usually based on gene sequences: parametric methods search for patterns that deviate from a particular genomic signature, while phylogenetic methods use sequences to reconstruct the gene and species trees. However, it is well-known that sequences have difficulty identifying ancient transfers since mutations have enough time to erase all evidence of such events. In this work, we ask whether character-based methods can predict gene transfers.

View Article and Find Full Text PDF

Background: Evolutionary scenarios describing the evolution of a family of genes within a collection of species comprise the mapping of the vertices of a gene tree T to vertices and edges of a species tree S. The relative timing of the last common ancestors of two extant genes (leaves of T) and the last common ancestors of the two species (leaves of S) in which they reside is indicative of horizontal gene transfers (HGT) and ancient duplications. Orthologous gene pairs, on the other hand, require that their last common ancestors coincides with a corresponding speciation event.

View Article and Find Full Text PDF

In phylogenetic networks, picking a cherry consists of removing a leaf that shares a parent with another leaf, or removing a reticulate edge whose endpoints are parents of leaves. Cherry-picking operations were recently shown to have several structural and algorithmic applications in the study of networks, for instance in determining their reconstructibility or in solving the network hybridization and network containment problems. In particular, some networks within certain classes are isomorphic if they can be reduced to a single node by the same sequence of cherry-picking operations.

View Article and Find Full Text PDF

Clustering genes in similarity graphs is a popular approach for orthology prediction. Most algorithms group genes without considering their species, which results in clusters that contain several paralogous genes. Moreover, clustering is known to be problematic when in-paralogs arise from ancient duplications.

View Article and Find Full Text PDF
Article Synopsis
  • All mitochondria are believed to have originated from an ancient endosymbiotic event where an α-proteobacterium merged with a eukaryotic ancestor, leading to significant gene transfers from mitochondria to the nucleus and a reduction in mitochondrial genome size over time.
  • * In some species, like land plants, there is ongoing Endosymbiotic Gene Transfer (EGT) which showcases a varied gene distribution between nuclear and mitochondrial genomes, providing insights into eukaryotic evolution.
  • * The study introduces an algorithm, EndoRex, for tracing the evolution of gene families through duplication, loss, and EGT events, with implementations available on GitHub for further research.*
View Article and Find Full Text PDF

Several implicit methods to infer horizontal gene transfer (HGT) focus on pairs of genes that have diverged only after the divergence of the two species in which the genes reside. This situation defines the edge set of a graph, the later-divergence-time (LDT) graph, whose vertices correspond to genes colored by their species. We investigate these graphs in the setting of relaxed scenarios, i.

View Article and Find Full Text PDF

Assembly is a fundamental task in genome sequencing, and many assemblers have been made available in the last decade. Because of the wide range of possible choices, it can be hard to determine which tool or parameter to use for a specific genome sequencing project. In this paper, we propose a consensus approach that takes the best parts of several contigs datasets produced by different methods, and combines them into a better assembly.

View Article and Find Full Text PDF

Background: The Robinson-Foulds (RF) distance is a well-established measure between phylogenetic trees. Despite a lack of biological justification, it has the advantages of being a proper metric and being computable in linear time. For phylogenetic applications involving genes, however, a crucial aspect of the trees ignored by the RF metric is the type of the branching event (e.

View Article and Find Full Text PDF

Background: The history of gene families-which are equivalent to event-labeled gene trees-can to some extent be reconstructed from empirically estimated evolutionary event-relations containing pairs of orthologous, paralogous or xenologous genes. The question then arises as whether inferred event-labeled gene trees are "biologically feasible" which is the case if one can find a species tree with which the gene tree can be reconciled in a time-consistent way.

Results: In this contribution, we consider event-labeled gene trees that contain speciations, duplications as well as horizontal gene transfer (HGT) and we assume that the species tree is unknown.

View Article and Find Full Text PDF
Article Synopsis
  • The classical gene and species tree reconciliation typically assumes that gene families evolve independently, which is valid for distant genes but not for those in syntenic blocks that may evolve together.
  • The study introduces a challenge of deducing the history of segmental duplication and loss for neighboring genes, expanding the traditional Duplication-Loss reconciliation to accommodate multiple gene trees.
  • The research demonstrates that reconstructing a most parsimonious Super-Reconciliation is NP-hard, offers an exponential-time algorithm to solve it, and presents a polynomial-time algorithm for a rearrangement-inclusive model focused on minimizing duplication and loss events.
View Article and Find Full Text PDF

Background: In the field of genome rearrangement algorithms, models accounting for gene duplication lead often to hard problems. For example, while computing the pairwise distance is tractable in most duplication-free models, the problem is NP-complete for most extensions of these models accounting for duplicated genes. Moreover, problems involving more than two genomes, such as the genome median and the Small Parsimony problem, are intractable for most duplication-free models, with some exceptions, for example the Single-Cut-or-Join (SCJ) model.

View Article and Find Full Text PDF

Background: During cancer progression, malignant cells accumulate somatic mutations that can lead to genetic aberrations. In particular, evolutionary events akin to segmental duplications or deletions can alter the copy-number profile (CNP) of a set of genes in a genome. Our aim is to compute the evolutionary distance between two cells for which only CNPs are known.

View Article and Find Full Text PDF

Reconciling gene trees with a species tree is a fundamental problem to understand the evolution of gene families. Many existing approaches reconcile each gene tree independently. However, it is well-known that the evolution of gene families is interconnected.

View Article and Find Full Text PDF

Motivation: When gene duplication occurs, one of the copies may become free of selective pressure and evolve at an accelerated pace. This has important consequences on the prediction of orthology relationships, since two orthologous genes separated by divergence after duplication may differ in both sequence and function. In this work, we make the distinction between the primary orthologs, which have not been affected by accelerated mutation rates on their evolutionary path, and the secondary orthologs, which have.

View Article and Find Full Text PDF
Article Synopsis
  • The architecture of eukaryotic coding genes allows for the creation of multiple protein isoforms, but current gene phylogeny methods typically focus on just one protein per gene, leading to inaccuracies in gene tree constructions.
  • A new approach is proposed that incorporates alternative protein isoforms in gene tree and protein tree reconstruction, addressing issues using a double reconciliation cost method for better accuracy.
  • The authors offer algorithmic solutions for their new frameworks (MinDRGT and MinDRPGT) and provide tools for implementation available on GitHub, along with testing results using the Ensembl database.
View Article and Find Full Text PDF

Phylogenetic tree reconstruction is usually done by local search heuristics that explore the space of the possible tree topologies via simple rearrangements of their structure. Tree rearrangement heuristics have been used in combination with practically all optimization criteria in use, from maximum likelihood and parsimony to distance-based principles, and in a Bayesian context. Their basic components are rearrangement moves that specify all possible ways of generating alternative phylogenies from a given one, and whose fundamental property is to be able to transform, by repeated application, any phylogeny into any other phylogeny.

View Article and Find Full Text PDF
Article Synopsis
  • This text discusses using the supertree method to reconstruct gene trees from multiple input gene trees, focusing on partial data.
  • The authors develop algorithms to create and correct gene trees, emphasizing reconciliation costs and the preservation of gene relationships such as orthology and paralogy.
  • They demonstrate the effectiveness of these algorithms on Ensembl gene trees, especially in correcting duplication nodes that lack strong support, and provide the C++ source code for public access.
View Article and Find Full Text PDF

Background: Given a gene family, the relations between genes (orthology/paralogy), are represented by a relation graph, where edges connect pairs of orthologous genes and "missing" edges represent paralogs. While a gene tree directly induces a relation graph, the converse is not always true. Indeed, a relation graph is not necessarily "satisfiable", i.

View Article and Find Full Text PDF

Reconstructing ancestral gene orders in a given phylogeny is a classical problem in comparative genomics. Most existing methods compare conserved features in extant genomes in the phylogeny to define potential ancestral gene adjacencies, and either try to reconstruct all ancestral genomes under a global evolutionary parsimony criterion, or, focusing on a single ancestral genome, use a scaffolding approach to select a subset of ancestral gene adjacencies, generally aiming at reducing the fragmentation of the reconstructed ancestral genome. In this paper, we describe an exact algorithm for the Small Parsimony Problem that combines both approaches.

View Article and Find Full Text PDF
Article Synopsis
  • Gene trees often struggle with weak support, but these issues can be improved by integrating information from species trees and genomic context.
  • The new method called ProfileNJ enhances existing gene trees by correcting weak branches using species tree data and distance matrices, making it efficient enough to analyze large databases like Ensembl Compara.
  • A web tool called RefineTree, which includes ProfileNJ and other gene tree correction methods, is now available for researchers to access and utilize for genomic analyses.
View Article and Find Full Text PDF

Background: While tree-oriented methods for inferring orthology and paralogy relations between genes are based on reconciling a gene tree with a species tree, many tree-free methods are also available (usually based on sequence similarity). Recently, the link between orthology relations and gene trees has been formally considered from the perspective of reconstructing phylogenies from orthology relations. In this paper, we consider this link from a correction point of view.

View Article and Find Full Text PDF

Combining a set of trees on partial datasets into a single tree is a classical method for inferring large phylogenetic trees. Ideally, the combined tree should display each input partial tree, which is only possible if input trees do not contain contradictory phylogenetic information. The simplest version of the supertree problem is thus to state whether a set of trees is compatible, and if so, construct a tree displaying them all.

View Article and Find Full Text PDF