RSQRT: AN HEURISTIC FOR ESTIMATING THE NUMBER OF CLUSTERS TO REPORT.

Electron Commer Res Appl

Computer Science and Engineering, University of Minnesota, 4-192 Keller Hall, 200 Union St. SE, Minneapolis, MN USA 55455, Tel: 612-625-6092

Published: March 2012

Clustering can be a valuable tool for analyzing large datasets, such as in e-commerce applications. Anyone who clusters must choose how many item clusters, K, to report. Unfortunately, one must guess at K or some related parameter. Elsewhere we introduced a strongly-supported heuristic, RSQRT, which predicts K as a function of the attribute or item count, depending on attribute scales. We conducted a second analysis where we sought confirmation of the heuristic, analyzing data sets from the UCI machine learning benchmark repository. For the 25 studies where sufficient detail was available, we again found strong support. Also, in a side-by-side comparison of 28 studies, RSQRT best-predicted K and the Bayesian information criterion (BIC) predicted K are the same. RSQRT has a lower cost of O(log log n) versus O(n^2) for BIC, and is more widely applicable. Using RSQRT prospectively could be much better than merely guessing.


Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3388514 (PMC)
http://dx.doi.org/10.1016/j.elerap.2011.12.006 (DOI)
