A two-stage pruning algorithm for likelihood computation for a population tree.

Genetics

Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138, USA.

Published: October 2008

We have developed a pruning algorithm for likelihood estimation of a tree of populations. This algorithm enables us to compute the likelihood for large trees. Thus, it gives an efficient way of obtaining the maximum-likelihood estimate (MLE) for a given tree topology. Our method utilizes the differences accumulated by random genetic drift in allele count data from single-nucleotide polymorphisms (SNPs), ignoring the effect of mutation after divergence from the common ancestral population. The computation of the maximum-likelihood tree involves both maximizing likelihood over branch lengths of a given topology and comparing the maximum-likelihood across topologies. Here our focus is the maximization of likelihood over branch lengths of a given topology. The pruning algorithm computes arrays of probabilities at the root of the tree from the data at the tips of the tree; at the root, the arrays determine the likelihood. The arrays consist of probabilities related to the number of coalescences and allele counts for the partially coalesced lineages. Computing these probabilities requires an unusual two-stage algorithm. Our computation is exact and avoids time-consuming Monte Carlo methods. We can also correct for ascertainment bias.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2567359PMC
http://dx.doi.org/10.1534/genetics.107.085753DOI Listing

Publication Analysis

Top Keywords

pruning algorithm
12
algorithm likelihood
8
likelihood branch
8
branch lengths
8
lengths topology
8
likelihood
6
tree
6
algorithm
5
two-stage pruning
4
likelihood computation
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!