An analysis of classical multidimensional scaling with applications to clustering.

Inf inference

Department of Statistical Sciences, University of Toronto, Toronto, ON M5G 1Z5, Canada.

Published: March 2023

Classical multidimensional scaling is a widely used dimension reduction technique. Yet few theoretical results characterizing its statistical performance exist. This paper provides a theoretical framework for analyzing the quality of embedded samples produced by classical multidimensional scaling. This lays a foundation for various downstream statistical analyses, and we focus on clustering noisy data. Our results provide scaling conditions on the signal-to-noise ratio under which classical multidimensional scaling followed by a distance-based clustering algorithm can recover the cluster labels of all samples. Simulation studies confirm these scaling conditions are sharp. Applications to the cancer gene-expression data, the single-cell RNA sequencing data and the natural language data lend strong support to the methodology and theory.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9893760PMC
http://dx.doi.org/10.1093/imaiai/iaac004DOI Listing

Publication Analysis

Top Keywords

classical multidimensional
16
multidimensional scaling
16
scaling conditions
8
scaling
6
analysis classical
4
multidimensional
4
scaling applications
4
applications clustering
4
clustering classical
4
scaling dimension
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!