Phylogenetic diversity statistics for all clades in a phylogeny.

Bioinformatics

Department of Computer Science, Iowa State University, Ames, IA 50010, United States.

Published: June 2023

The classic quantitative measure of phylogenetic diversity (PD) has been used to address problems in conservation biology, microbial ecology, and evolutionary biology. PD is the minimum total length of the branches in a phylogeny required to cover a specified set of taxa on the phylogeny. A general goal in applications of PD has been to identify a set of k taxa that maximizes PD on a given phylogeny, and this goal has been mirrored in active research on efficient algorithms for the problem. Other descriptive statistics, such as the minimum PD, average PD, and standard deviation of PD, can provide invaluable insight into the distribution of PD across a phylogeny (relative to a fixed value of k). However, there has been little or no research on computing these statistics, especially for each clade in a phylogeny, where they would enable direct comparisons of PD between clades. We introduce efficient algorithms for computing PD and the associated descriptive statistics for a given phylogeny and each of its clades. In simulation studies, we demonstrate the ability of our algorithms to analyze large-scale phylogenies with applications in ecology and evolutionary biology. The software is available at https://github.com/flu-crew/PD_stats.
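
To make the definitions above concrete, the following is a minimal brute-force sketch in Python. It is not the authors' algorithm and does not use the PD_stats software. It assumes the unrooted reading of PD (the total length of the smallest subtree connecting the chosen taxa), and it enumerates every size-k subset, which is only feasible for very small trees. The toy tree, its branch lengths, and the names phylo_diversity, pd_stats, and clade_leaves are all hypothetical and chosen for illustration.

```python
from itertools import combinations
from statistics import mean, pstdev

# Hypothetical toy tree: child -> (parent, branch length). "root" has no entry.
TREE = {
    "A": ("x", 1.0),
    "B": ("x", 2.0),
    "x": ("root", 1.0),
    "C": ("y", 3.0),
    "D": ("y", 1.0),
    "y": ("root", 2.0),
}
LEAVES = ["A", "B", "C", "D"]


def phylo_diversity(tree, taxa):
    """Total branch length of the smallest subtree connecting `taxa`.

    Assumption: unrooted PD, so the path up to the root is not counted.
    """
    taxa = set(taxa)
    below = {}  # node -> number of selected taxa at or below that node
    for leaf in taxa:
        node = leaf
        while node in tree:  # walk up until the root is reached
            below[node] = below.get(node, 0) + 1
            node = tree[node][0]
    # The branch above a node is needed only if selected taxa lie on both sides.
    return sum(tree[n][1] for n, count in below.items() if count < len(taxa))


def pd_stats(tree, leaves, k):
    """Min, max, mean, and population standard deviation of PD over all k-subsets."""
    values = [phylo_diversity(tree, subset) for subset in combinations(leaves, k)]
    return {
        "min": min(values),
        "max": max(values),
        "mean": mean(values),
        "sd": pstdev(values),
    }


def clade_leaves(tree, leaves, clade_root):
    """Leaves whose path to the root passes through `clade_root`."""
    result = []
    for leaf in leaves:
        node = leaf
        while node != clade_root and node in tree:
            node = tree[node][0]
        if node == clade_root:
            result.append(leaf)
    return result


if __name__ == "__main__":
    print(pd_stats(TREE, LEAVES, 2))                          # whole phylogeny
    print(pd_stats(TREE, clade_leaves(TREE, LEAVES, "x"), 2))  # clade rooted at x
```

Calling pd_stats once per internal node, as in the last line, gives the per-clade comparison the abstract describes. The number of subsets grows combinatorially with the number of taxa, which is why efficient algorithms are needed for large phylogenies.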

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10311342
DOI: http://dx.doi.org/10.1093/bioinformatics/btad263
