One of the Grand Challenges in Science is the construction of the Tree of Life, an evolutionary tree containing several million species, spanning all life on earth. However, the construction of the Tree of Life is enormously computationally challenging, as all the current most accurate methods are either heuristics for NP-hard optimization problems or Bayesian MCMC methods that sample from tree space. One of the most promising approaches for improving scalability and accuracy for phylogeny estimation uses divide-and-conquer: a set of species is divided into overlapping subsets, trees are constructed on the subsets, and then merged together using a "supertree method". Here, we present Exact-RFS-2, the first polynomial-time algorithm to find an optimal supertree of two trees, using the Robinson-Foulds Supertree (RFS) criterion (a major approach in supertree estimation that is related to maximum likelihood supertrees), and we prove that finding the RFS of three input trees is NP-hard. Exact-RFS-2 is available in open source form on Github at https://github.com/yuxilin51/GreedyRFS .

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8240396PMC
http://dx.doi.org/10.1186/s13015-021-00189-2DOI Listing

Publication Analysis

Top Keywords

phylogeny estimation
8
construction tree
8
tree life
8
robinson-foulds supertrees
4
supertrees divide-and-conquer
4
divide-and-conquer phylogeny
4
estimation grand
4
grand challenges
4
challenges science
4
science construction
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!