Correcting for sequencing error in maximum likelihood phylogeny inference.

G3 (Bethesda)

Department of Genome Sciences, University of Washington, Seattle, Washington 98195-5065.

Published: November 2014

Accurate phylogenies are critical to taxonomy as well as studies of speciation processes and other evolutionary patterns. Accurate branch lengths in phylogenies are critical for dating and rate measurements. Such accuracy may be jeopardized by unacknowledged sequencing error. We use simulated data to test a correction for DNA sequencing error in maximum likelihood phylogeny inference. Over a wide range of data polymorphism and true error rate, we found that correcting for sequencing error improves recovery of the branch lengths, even if the assumed error rate is up to twice the true error rate. Low error rates have little effect on recovery of the topology. When error is high, correction improves topological inference; however, when error is extremely high, using an assumed error rate greater than the true error rate leads to poor recovery of both topology and branch lengths. The error correction approach tested here was proposed in 2004 but has not been widely used, perhaps because researchers do not want to commit to an estimate of the error rate. This study shows that correction with an approximate error rate is generally preferable to ignoring the issue.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4267948PMC
http://dx.doi.org/10.1534/g3.114.014365DOI Listing

Publication Analysis

Top Keywords

error rate
28
sequencing error
16
error
15
branch lengths
12
true error
12
correcting sequencing
8
error maximum
8
maximum likelihood
8
likelihood phylogeny
8
phylogeny inference
8

Similar Publications

Biokinetic models can optimise pollutant degradation and enhance microbial growth processes, aiding to protect ecosystem protection. Traditional biokinetic approaches (such as Monod, Haldane, etc.) can be challenging, as they require detailed knowledge of the organism's metabolism and the ability to solve numerous kinetic differential equations based on the principles of micro, molecular biology and biochemistry (first engineering principles) which can lead to discrepancies between predicted and actual degradation rates.

View Article and Find Full Text PDF

Prediction of rain garden runoff control effects based on multiple machine learning techniques.

Environ Technol

January 2025

Shaanxi Huashan Road and Bridge Group Co., Ltd., Xi'an, People's Republic of China.

Due to the rapid development of urbanisation, cities frequently experience waterlogging during rainfall. Rain gardens are widely used in new urban construction because they effectively control surface runoff from rainwater, thereby reducing waterlogging. The runoff control effectiveness of rain gardens is influenced by multiple factors.

View Article and Find Full Text PDF

Objective: The study aims to develop continuous trajectory profiles along curves with minimal error. It also focuses on formulating a percentage trajectory transection rate model as a function of geometric parameters (e.g.

View Article and Find Full Text PDF

Objectives: Artificial intelligence (AI) software including Brainomix "e-CTA" which detect large vessel occlusions (LVO) have clinical potential. We hypothesised that in real world use where prevalence is low, its clinical utility may be overstated.

Methods: In this single centre retrospective service evaluation project, data sent to Brainomix from a medium size acute National Health Service (NHS) Trust hospital between 1/3/2022-1/3/2023 was reviewed.

View Article and Find Full Text PDF

An evolved, orthogonal ssDNA generator for targeted hypermutation of multiple genomic loci.

Nucleic Acids Res

January 2025

Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi 214122, China.

Achieving targeted hypermutation of specific genomic sequences without affecting other regions remains a key challenge in continuous evolution. To address this, we evolved a T7 RNA polymerase (RNAP) mutant that synthesizes single-stranded DNA (ssDNA) instead of RNA in vivo, while still exclusively recognizing the T7 promoter. By increasing the error rate of the T7 RNAP mutant, it generates mutated ssDNA that recombines with homologous sequences in the genome, leading to targeted genomic hypermutation.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!