Covariance Matrix Adaptation for Multiobjective Multiarmed Bandits.

IEEE Trans Neural Netw Learn Syst

Published: August 2019

Upper confidence bound (UCB) is a successful multiarmed bandit for regret minimization. The covariance matrix adaptation (CMA) for Pareto UCB (CMA-PUCB) algorithm considers stochastic reward vectors with correlated objectives. We upper bound the cumulative pseudoregret of pulling suboptimal arms for the CMA-PUCB algorithm to logarithmic number of arms K , objectives D , and samples n , O (ln(nDK) ∑ (|| Σ ||/∆)) , using a variant of Berstein inequality for matrices, where ∆ is the regret of pulling the suboptimal arm i . For unknown covariance matrices between objectives Σ , we upper bound the approximation of the covariance matrix using the number of samples to O (n ln(nDK) + ln(nDK) ∑ (1/∆)) . Simulations on a three objective stochastic environment show the applicability of our method.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TNNLS.2018.2885123DOI Listing

Publication Analysis

Top Keywords

covariance matrix
12
matrix adaptation
8
cma-pucb algorithm
8
objectives upper
8
upper bound
8
pulling suboptimal
8
samples lnndk
8
lnndk ∑
8
covariance
4
adaptation multiobjective
4

Similar Publications

Purpose: To improve the current method for MRI turbulence quantification which is the intravoxel phase dispersion (IVPD) method. Turbulence is commonly characterized by the Reynolds stress tensor (RST) which describes the velocity covariance matrix. A major source for systematic errors in MRI is the sequence's sensitivity to the variance of the derivatives of velocity, such as the acceleration variance, which can lead to a substantial measurement bias.

View Article and Find Full Text PDF

Objective: To present a remodeling of the electroretinogram waveform using a covariance matrix to identify regions of interest and distinction between a control and attention deficit/hyperactivity disorder (ADHD) group. Electroretinograms were recorded in n = 25 ADHD (16 male; age 11.9 ± 2.

View Article and Find Full Text PDF

In adaptive beamforming, the array signal processing adjusts its sensor delays and weights based on the incoming data. In conventional beamforming, these parameters are instead given from a predefined model. Adaptive beamformers can improve measurement precision by dynamically rejecting spatial interference.

View Article and Find Full Text PDF

Semiparametric estimator for the covariate-specific receiver operating characteristic curve.

Stat Methods Med Res

January 2025

CITMAga and Department of Statistics and Operations Research, Universidade de Vigo, Vigo, Galicia, Spain.

The study of the predictive ability of a marker is mainly based on the accuracy measures provided by the so-called confusion matrix. Besides, the area under the receiver operating characteristic curve has become a popular index for summarizing the overall accuracy of a marker. However, the nature of the relationship between the marker and the outcome, and the role that potential confounders play in this relationship could be fundamental in order to extrapolate the observed results.

View Article and Find Full Text PDF

To better understand the sources of biological diversity in nature, we need information on the mechanisms underlying population divergence. Biological systems with patterns of naturally occurring adaptive variation among populations can provide insight into the genetic architecture of diverging traits and the influence of genetic constraints on responses to selection. Using a system of reproductive character displacement in the North American mushroom-feeding fly Drosophila subquinaria, we assessed patterns of genetic (co)variance among a suite of chemical signaling traits and divergence in this pattern among populations.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!