reval: A Python package to determine best clustering solutions with stability-based relative clustering validation.

Patterns (N Y)

Laboratory for Autism and Neurodevelopmental Disorders, Center for Neuroscience and Cognitive Systems @UniTn, Istituto Italiano di Tecnologia, Rovereto, Italy.

Published: April 2021

Determining the best partition for a dataset can be a challenging task because of the lack of information within an unsupervised learning framework and the absence of a unique clustering validation approach to evaluate clustering solutions. Here we present reval: a Python package that leverages stability-based relative clustering validation methods to select best clustering solutions as the ones that replicate, via supervised learning, on unseen subsets of data. The implementation of relative validation methods can contribute to the theory of clustering by fostering new approaches for the investigation of clustering results in different situations and for different data distributions. This work aims at contributing to this effort by implementing a package that works with multiple clustering and classification algorithms, hence allowing both the automation of the labeling process and the assessment of the stability of different clustering mechanisms.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8085609PMC
http://dx.doi.org/10.1016/j.patter.2021.100228DOI Listing

Publication Analysis

Top Keywords

clustering solutions
12
clustering validation
12
clustering
10
reval python
8
python package
8
best clustering
8
stability-based relative
8
relative clustering
8
validation methods
8
package determine
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!