Binning aims to recover microbial genomes from metagenomic data. For complex metagenomic communities, the available binning methods are far from satisfactory, which usually do not fully use different types of features and important biological knowledge. We developed a novel ensemble binner, MetaBinner, which generates component results with multiple types of features by k-means and uses single-copy gene information for initialization. It then employs a two-stage ensemble strategy based on single-copy genes to integrate the component results efficiently and effectively. Extensive experimental results on three large-scale simulated datasets and one real-world dataset demonstrate that MetaBinner outperforms the state-of-the-art binners significantly.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9817263PMC
http://dx.doi.org/10.1186/s13059-022-02832-6DOI Listing

Publication Analysis

Top Keywords

communities binning
8
types features
8
metabinner high-performance
4
high-performance stand-alone
4
stand-alone ensemble
4
ensemble binning
4
binning method
4
method recover
4
recover individual
4
individual genomes
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!