Probabilistic phylogenetic inference with insertions and deletions.

PLoS Comput Biol

Janelia Farm Research Campus, Howard Hughes Medical Institute, Ashburn, Virginia, United States of America.

Published: September 2008

A fundamental task in sequence analysis is to calculate the probability of a multiple alignment given a phylogenetic tree relating the sequences and an evolutionary model describing how sequences change over time. However, the most widely used phylogenetic models only account for residue substitution events. We describe a probabilistic model of a multiple sequence alignment that accounts for insertion and deletion events in addition to substitutions, given a phylogenetic tree, using a rate matrix augmented by the gap character. Starting from a continuous Markov process, we construct a non-reversible generative (birth-death) evolutionary model for insertions and deletions. The model assumes that insertion and deletion events occur one residue at a time. We apply this model to phylogenetic tree inference by extending the program dnaml in phylip. Using standard benchmarking methods on simulated data and a new "concordance test" benchmark on real ribosomal RNA alignments, we show that the extended program dnamlepsilon improves accuracy relative to the usual approach of ignoring gaps, while retaining the computational efficiency of the Felsenstein peeling algorithm.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2527138PMC
http://dx.doi.org/10.1371/journal.pcbi.1000172DOI Listing

Publication Analysis

Top Keywords

phylogenetic tree
12
insertions deletions
8
evolutionary model
8
insertion deletion
8
deletion events
8
model
5
probabilistic phylogenetic
4
phylogenetic inference
4
inference insertions
4
deletions fundamental
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!