Nearest-neighbor estimators for the Kullback-Leiber (KL) divergence that are asymptotically unbiased have recently been proposed and demonstrated in a number of applications. However, with a small number of samples, nonparametric methods typically suffer from large estimation bias due to the nonlocality of information derived from nearest-neighbor statistics. In this letter, we show that this estimation bias can be mitigated by modifying the metric function, and we propose a novel method for learning a locally optimal Mahalanobis distance function from parametric generative models of the underlying density distributions. Using both simulations and experiments on a variety of data sets, we demonstrate that this interplay between approximate generative models and nonparametric techniques can significantly improve the accuracy of nearest-neighbor-based estimation of the KL divergence.

Download full-text PDF

Source
http://dx.doi.org/10.1162/neco_a_01092DOI Listing

Publication Analysis

Top Keywords

estimation bias
8
generative models
8
bias reduction
4
reduction metric
4
metric learning
4
learning nearest-neighbor
4
estimation
4
nearest-neighbor estimation
4
estimation kullback-leibler
4
kullback-leibler divergence
4

Similar Publications

Background: In 2023, there were 39.9 million people living with HIV (PLWH) worldwide and 630 000 deaths related to HIV. New strategies are needed, and long-acting antiretrovirals (LAAs) are now widely considered to have great potential to help end the HIV epidemic.

View Article and Find Full Text PDF

Background: During the pandemic, there was concern that underascertainment of COVID-19 outcomes may impact treatment effect estimation in pharmacoepidemiologic studies. We assessed the impact of outcome misclassification on the association between inhaled corticosteroids (ICS) and COVID-19 hospitalisation and death in the United Kingdom during the first pandemic wave using probabilistic bias analysis (PBA).

Methods: Using data from the Clinical Practice Research Datalink Aurum, we defined a cohort with chronic obstructive pulmonary disease (COPD) on 1 March 2020.

View Article and Find Full Text PDF

Prostate cancer (PC) is a common malignancy among men globally. Although genetic, hormonal, and environmental factors contribute to its development, the role of heavy metals remains unclear. This study evaluated serum levels of arsenic, cadmium, lead, mercury, and nickel in PC patients compared to healthy controls.

View Article and Find Full Text PDF

Randomised controlled trials (RCTs) are the gold standard for evaluating health interventions but often face ethical and practical challenges. When RCTs are not feasible, large observational data sets emerge as a pivotal resource, though these data sets may be subject to bias and unmeasured confounding. Traditional statistical (or non-causal) learning methods, while useful, face limitations in fully uncovering causal effects, i.

View Article and Find Full Text PDF

Background: Puncture biopsy is a primary method for obtaining tissue or cell samples from tumors for histopathological diagnosis. However, patients often experience pain, anxiety, and discomfort during the procedure. Virtual reality is a novel technology developed through advancements in computer skill, and it is utilized in healthcare as a cognitive approach to relieve pain and relaxation.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!