AI Article Synopsis

  • Phylogenetic placement helps organize new genetic sequences within existing large trees for better taxon identification and tree construction.
  • Current top methods like pplacer and EPA-ng use maximum likelihood for high accuracy but struggle with very large trees, often failing with over 50,000 leaves.
  • The new SCAMPP method enhances these traditional techniques, allowing for accurate placement in ultra-large trees with up to 200,000 leaves and performs better than faster alternatives like APPLES and APPLES-2.

Article Abstract

Phylogenetic placement, the problem of placing a "query" sequence into a precomputed phylogenetic "backbone" tree, is useful for constructing large trees, performing taxon identification of newly obtained sequences, and other applications. The most accurate current methods, such as pplacer and EPA-ng, are based on maximum likelihood and require that the query sequence be provided within a multiple sequence alignment that includes the leaf sequences in the backbone tree. This approach enables high accuracy but also makes these likelihood-based methods computationally intensive on large backbone trees, and can even lead to them failing when the backbone trees are very large (e.g., having 50,000 or more leaves). We present SCAMPP (SCaling AlignMent-based Phylogenetic Placement), a technique to extend the scalability of these likelihood-based placement methods to ultra-large backbone trees. We show that pplacer-SCAMPP and EPA-ng-SCAMPP both scale well to ultra-large backbone trees (even up to 200,000 leaves), with accuracy that improves on APPLES and APPLES-2, two recently developed fast phylogenetic placement methods that scale to ultra-large datasets. EPA-ng-SCAMPP and pplacer-SCAMPP are available at https://github.com/chry04/PLUSplacer.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TCBB.2022.3170386DOI Listing

Publication Analysis

Top Keywords

phylogenetic placement
16
backbone trees
16
scampp scaling
8
scaling alignment-based
8
alignment-based phylogenetic
8
large trees
8
placement methods
8
ultra-large backbone
8
trees
6
phylogenetic
5

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!