Gene tree correction for reconciliation and species tree inference.

Algorithms Mol Biol

Département d'Informatique et de Recherche Opérationnelle, Université de Montréal, CP 6128 succ Centre-Ville, Montréal, H3C 3J7, Québec, Canada.

Published: November 2012

AI Article Synopsis

  • Reconciliation is a method used to infer the evolutionary history of gene families by mapping gene trees onto a known species tree, but relies heavily on the accuracy of the gene trees involved.
  • The paper introduces a way to preprocess gene trees by flagging "non-apparent duplication" (NAD) vertices, which indicate misplaced leaves that distort the reconciliation results.
  • For species tree inference, the authors propose a polynomial-time heuristic that removes the minimum number of species so that the gene trees exhibit no NAD vertices; for reconciliation, they give a polynomial-time algorithm for removing leaves or species that is exact for two special classes of gene trees.

Article Abstract

Background: Reconciliation is the commonly used method for inferring the evolutionary scenario for a gene family. It consists of "embedding" inferred gene trees into a known species tree, revealing the evolution of the gene family by duplications and losses. When a species tree is not known, a natural algorithmic problem is to infer a species tree from a set of gene trees, such that the corresponding reconciliation minimizes the number of duplications and/or losses. The main drawback of reconciliation is that the inferred evolutionary scenario depends strongly on the gene trees considered: a few misplaced leaves may lead to a completely different history, with significantly more duplications and losses.
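The sensitivity described above can be illustrated with a minimal sketch. It assumes the standard LCA mapping (each gene tree vertex is mapped to the lowest species tree vertex containing all of its species, and a vertex is a duplication when it maps to the same place as one of its children). The abstract does not spell out this mapping, and the tuple encoding and function names below are hypothetical, chosen only for illustration.

```python
# Trees are nested tuples; a leaf is a species name (each gene is labelled
# by the species it belongs to). This encoding is an assumption of the sketch.

def species_of(tree):
    """Set of species found at the leaves of a tree."""
    if isinstance(tree, str):
        return frozenset([tree])
    return frozenset().union(*(species_of(child) for child in tree))

def clusters_of(species_tree):
    """Leaf set of every vertex of the species tree."""
    found = [species_of(species_tree)]
    if not isinstance(species_tree, str):
        for child in species_tree:
            found += clusters_of(child)
    return found

def lca(gene_subtree, clusters):
    """LCA mapping: the smallest species-tree cluster containing the subtree."""
    wanted = species_of(gene_subtree)
    return min((c for c in clusters if wanted <= c), key=len)

def count_duplications(gene_tree, species_tree):
    """Number of gene-tree vertices mapped to the same place as a child."""
    clusters = clusters_of(species_tree)

    def visit(node):
        if isinstance(node, str):
            return 0
        here = lca(node, clusters)
        is_dup = any(lca(child, clusters) == here for child in node)
        return int(is_dup) + sum(visit(child) for child in node)

    return visit(gene_tree)

species_tree = (("a", "b"), ("c", "d"))
congruent = (("a", "b"), ("c", "d"))
misplaced = ((("a", "c"), "b"), "d")  # leaf c moved next to a

print(count_duplications(congruent, species_tree))  # 0
print(count_duplications(misplaced, species_tree))  # 2
```

In this toy case a single misplaced leaf turns a duplication-free history into one with two inferred duplications.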

Results: In this paper, we take advantage of certain properties of gene trees in order to preprocess them for reconciliation or species tree inference. We flag certain duplication vertices of a gene tree, the "non-apparent duplication" (NAD) vertices, as resulting from the misplacement of leaves. In the case of species tree inference, we develop a polynomial-time heuristic for removing the minimum number of species leading to a set of gene trees that exhibit no NAD vertices with respect to at least one species tree. In the case of reconciliation, we consider the optimization problem of removing the minimum number of leaves or species leading to a tree without any NAD vertex. We develop a polynomial-time algorithm that is exact for two special classes of gene trees, and show that it performs well on simulated data sets in the general case.
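A small sketch of how NAD vertices might be flagged follows. It assumes the usual definition, which the abstract itself does not state: a duplication vertex (under the LCA mapping) is "apparent" when its two subtrees share at least one species, and "non-apparent" when their species sets are disjoint. The nested-tuple encoding and all function names are hypothetical, and this is not the authors' correction algorithm, only a classifier for the vertices it targets.

```python
# Binary trees as nested tuples; a leaf is the species name of the gene.

def leaf_set(tree):
    """Species labels at the leaves of a nested-tuple tree."""
    if isinstance(tree, str):
        return frozenset([tree])
    left, right = tree
    return leaf_set(left) | leaf_set(right)

def species_clusters(species_tree):
    """Leaf set of every vertex of the species tree."""
    clusters = [leaf_set(species_tree)]
    if not isinstance(species_tree, str):
        for child in species_tree:
            clusters.extend(species_clusters(child))
    return clusters

def lca_map(gene_subtree, clusters):
    """Smallest species-tree cluster containing the subtree's species."""
    species = leaf_set(gene_subtree)
    return min((c for c in clusters if species <= c), key=len)

def classify(gene_tree, species_tree):
    """Label each internal gene-tree vertex (preorder) as
    'speciation', 'AD' (apparent duplication) or 'NAD'."""
    clusters = species_clusters(species_tree)
    result = []

    def visit(node):
        if isinstance(node, str):
            return
        left, right = node
        here = lca_map(node, clusters)
        if here in (lca_map(left, clusters), lca_map(right, clusters)):
            # Duplication: apparent only if both subtrees share a species.
            kind = "AD" if leaf_set(left) & leaf_set(right) else "NAD"
        else:
            kind = "speciation"
        result.append((node, kind))
        visit(left)
        visit(right)

    visit(gene_tree)
    return result

species_tree = (("a", "b"), ("c", "d"))
misplaced = ((("a", "c"), "b"), "d")   # leaf c sits in the wrong place
after_removal = (("a", "b"), "d")      # same tree with leaf c removed

for node, kind in classify(misplaced, species_tree):
    print(kind, node)
```

On this toy input the two upper vertices of `misplaced` are labelled NAD, while `after_removal` contains only speciation vertices, which mirrors the leaf-removal objective stated above on a single hand-picked example.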


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3567949
DOI: http://dx.doi.org/10.1186/1748-7188-7-31

Publication Analysis

Top Keywords (term: count)

species tree: 28
gene trees: 20
tree inference: 12
gene: 10
species: 9
tree: 9
gene tree: 8
reconciliation species: 8
evolutionary scenario: 8
gene family: 8
