From sample similarity to ensemble similarity: probabilistic distance measures in reproducing kernel Hilbert space.

IEEE Trans Pattern Anal Mach Intell

Integrated Data Systems Department, Siemens Corporate Research, 755 College Road East, Princeton, NJ 08540, USA.

Published: June 2006

This paper addresses the problem of characterizing ensemble similarity from sample similarity in a principled manner. Using reproducing kernel as a characterization of sample similarity, we suggest a probabilistic distance measure in the reproducing kernel Hilbert space (RKHS) as the ensemble similarity. Assuming normality in the RKHS, we derive analytic expressions for probabilistic distance measures that are commonly used in many applications, such as Chernoff distance (or the Bhattacharyya distance as its special case), Kullback-Leibler divergence, etc. Since the reproducing kernel implicitly embeds a nonlinear mapping, our approach presents a new way to study these distances whose feasibility and efficiency is demonstrated using experiments with synthetic and real examples. Further, we extend the ensemble similarity to the reproducing kernel for ensemble and study the ensemble similarity for more general data representations.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TPAMI.2006.120DOI Listing

Publication Analysis

Top Keywords

ensemble similarity
20
reproducing kernel
20
sample similarity
12
probabilistic distance
12
similarity probabilistic
8
distance measures
8
kernel hilbert
8
hilbert space
8
similarity
7
ensemble
6

Similar Publications

Exploring Mortality and Prognostic Factors of Heart Failure with In-Hospital and Emergency Patients by Electronic Medical Records: A Machine Learning Approach.

Risk Manag Healthc Policy

January 2025

Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, New Taipei City, 235603, Taiwan.

Purpose: As HF progresses into advanced HF, patients experience a poor quality of life, distressing symptoms, intensive care use, social distress, and eventual hospital death. We aimed to investigate the relationship between morality and potential prognostic factors among in-patient and emergency patients with HF.

Patients And Methods: A case series study: Data are collected from in-hospital and emergency care patients from 2014 to 2021, including their international classification of disease at admission, and laboratory data such as blood count, liver and renal functions, lipid profile, and other biochemistry from the hospital's electrical medical records.

View Article and Find Full Text PDF

Human-induced global warming, primarily attributed to the rise in atmospheric CO, poses a substantial risk to the survival of humanity. While most research focuses on predicting annual CO emissions, which are crucial for setting long-term emission mitigation targets, the precise prediction of daily CO emissions is equally vital for setting short-term targets. This study examines the performance of 14 models in predicting daily CO emissions data from 1/1/2022 to 30/9/2023 across the top four polluting regions (China, India, the USA, and the EU27&UK).

View Article and Find Full Text PDF

Understanding drug-target interactions is crucial for identifying novel lead compounds, enhancing efficacy, and reducing toxicity. Phenotype-based approaches, like analyzing drug-induced gene expression changes, have shown effectiveness in drug discovery and precision medicine. However, experimentally determining gene expression for all relevant chemicals is impractical, limiting large-scale gene expression-based screening.

View Article and Find Full Text PDF

Psoriasis is a chronic, immune-mediated skin disease characterized by lifelong persistence and fluctuating symptoms. The clinical similarities among its subtypes and the diversity of symptoms present challenges in diagnosis. Early diagnosis plays a vital role in preventing the spread of lesions and improving patients' quality of life.

View Article and Find Full Text PDF

Here we report four experiments that explore the nature of perceptual averaging. We examine the evidence that participants recover and store a representation of the mean value of a set of perceptual features that are distributed across the optic array. The extant evidence shows that participants are particularly accurate in estimating the relevant mean value, but we ask whether this might be due to processes that reflect assessing featural similarity rather than computing an average.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!