RTK: efficient rarefaction analysis of large datasets.

Bioinformatics

Structural & Computational Biology Unit, EMBL, 69117 Heidelberg, Germany.

Published: August 2017

Motivation: The rapidly expanding microbiomics field is generating increasingly large datasets that characterize the microbiota in diverse environments. Although classical numerical ecology methods provide a robust statistical framework for their analysis, currently available software is inadequate for large datasets and for some computationally intensive tasks, such as rarefaction and associated analyses.

Results: Here we present a software package for rarefaction analysis of large count matrices, as well as estimation and visualization of diversity, richness and evenness. Our software is designed for ease of use and operates at least 7x faster than existing solutions while requiring 10x less memory.
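
For readers unfamiliar with the terms, the following is a minimal Python sketch of what rarefaction and the three summary statistics compute for a single sample. It is not the RTK implementation or its API: the function names (`rarefy`, `richness`, `shannon`, `pielou_evenness`), the choice of Shannon entropy as the diversity measure, and the example counts are all assumptions made for illustration. The read-expansion approach is deliberately naive and would not scale to the dataset sizes the abstract targets.

```python
import math
import random
from collections import Counter


def rarefy(counts, depth, rng):
    """Subsample `depth` reads without replacement from one sample's counts."""
    total = sum(counts)
    if depth > total:
        raise ValueError("depth exceeds the sample's total count")
    # Naive approach: expand counts into one taxon index per read,
    # then draw `depth` reads without replacement.
    reads = [taxon for taxon, n in enumerate(counts) for _ in range(n)]
    drawn = Counter(rng.sample(reads, depth))
    return [drawn.get(taxon, 0) for taxon in range(len(counts))]


def richness(counts):
    """Number of taxa observed at least once."""
    return sum(1 for n in counts if n > 0)


def shannon(counts):
    """Shannon diversity H = -sum(p_i * ln p_i) over observed taxa."""
    total = sum(counts)
    return -sum((n / total) * math.log(n / total) for n in counts if n > 0)


def pielou_evenness(counts):
    """Pielou's evenness J = H / ln(S); defined as 0 when S <= 1."""
    s = richness(counts)
    return shannon(counts) / math.log(s) if s > 1 else 0.0


rng = random.Random(42)            # fixed seed so the run is repeatable
sample = [120, 30, 5, 0, 45]       # made-up counts for five taxa
rarefied = rarefy(sample, 50, rng)

print("rarefied counts:", rarefied)
print("richness:", richness(rarefied))
print("Shannon diversity:", shannon(rarefied))
print("Pielou evenness:", pielou_evenness(rarefied))
```

In practice, rarefaction is usually repeated several times per sample and the statistics averaged, since a single random draw can miss rare taxa.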

Availability and Implementation: C++ and R source code (GPL v.2) as well as binaries are available from https://github.com/hildebra/Rarefaction and from CRAN (https://cran.r-project.org/).

Contact: bork@embl.de or falk.hildebrand@embl.de.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5870771
DOI: http://dx.doi.org/10.1093/bioinformatics/btx206
