Online learning in bandits with predicted context.

Proc Mach Learn Res

Department of Statistics, Harvard University, Cambridge, MA, 02138.

Published: May 2024

We consider the contextual bandit problem where at each time, the agent only has access to a noisy version of the context and the error variance (or an estimator of this variance). This setting is motivated by a wide range of applications where the true context for decision-making is unobserved, and only a prediction of the context by a potentially complex machine learning algorithm is available. When the context error is non-vanishing, classical bandit algorithms fail to achieve sublinear regret. We propose the first online algorithm in this setting with sublinear regret guarantees under mild conditions. The key idea is to extend the measurement error model in classical statistics to the online decision-making setting, which is nontrivial due to the policy being dependent on the noisy context observations. We further demonstrate the benefits of the proposed approach in simulation environments based on synthetic and real digital intervention datasets.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11501084PMC

Publication Analysis

Top Keywords

context error
8
sublinear regret
8
context
6
online learning
4
learning bandits
4
bandits predicted
4
predicted context
4
context consider
4
consider contextual
4
contextual bandit
4

Similar Publications

A Complex Intervention to Minimize Medication Error by Nurses in Intensive Care: A Case Study.

Healthcare (Basel)

January 2025

School of Health Sciences, Polytechnic of Leiria, Rua General Norton de Matos, Apartado 4133, 2411-901 Leiria, Portugal.

Medication errors are the most frequent and critical issues in healthcare settings, often leading to worsened clinical outcomes, increased treatment costs, extended hospital stays, and heightened mortality and morbidity rates. These errors are particularly prevalent in intensive care units (ICUs), where the complexity and critical nature of the care elevate the risks. Nurses play a pivotal role in preventing medication errors and require strategies and methods to enhance patient safety.

View Article and Find Full Text PDF

Neuropathological contributions to grey matter atrophy and white matter hyperintensities in amnestic dementia.

Alzheimers Res Ther

January 2025

Laboratory for Clinical Neuroscience, Center for Biomedical Technology, Universidad Politécnica de Madrid, IdISSC, Crta M40, km38, Madrid, 28223, Spain.

Background: Dementia patients commonly present multiple neuropathologies, worsening cognitive function, yet structural neuroimaging signatures of dementia have not been positioned in the context of combined pathology. In this study, we implemented an MRI voxel-based approach to explore combined and independent effects of dementia pathologies on grey and white matter structural changes.

Methods: In 91 amnestic dementia patients with post-mortem brain donation, grey matter density and white matter hyperintensity (WMH) burdens were obtained from pre-mortem MRI and analyzed in relation to Alzheimer's, vascular, Lewy body, TDP-43, and hippocampal sclerosis (HS) pathologies.

View Article and Find Full Text PDF

Analyzing video footage of falls in older adults has emerged as an alternative to traditional lab studies. However, this approach is limited by the labor-intensive process of manually labeling body parts. To address this limitation, we aimed to validate the use of the AI-based pose estimation algorithm (OpenPose) in assessing the hip impact velocity and acceleration of video-captured falls.

View Article and Find Full Text PDF

Evaluation of pre-analytical factors impacting urine test strip and chemistry results.

Clin Chem Lab Med

January 2025

Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, ON, Canada.

Objectives: Careful consideration of the pre-analytical process for urine examination is essential to avoid errors and support accurate results and decision-making. Our objective was to assess the impact of various pre-analytical factors on urine test strip and quantitative chemistry results, including stability, tube type, fill volume, and centrifugation.

Methods: Residual random urine specimens were identified.

View Article and Find Full Text PDF

Research on virtual collection method of layer house temperature for the construction requirements of digital twin system.

Poult Sci

January 2025

Hebei Agricultural University, Baoding, Hebei 071000, China; Key Laboratory of Intelligent Equipment and New Energy Utilization in Livestock and Poultry Farming of Hebei Province, Baoding, Hebei 071000, China.

At present, in the context of the highly intensive development of livestock and poultry breeding, digital management is becoming increasingly important, and digital twin systems are gradually being applied. To solve the contradiction between data acquisition and sensor network congestion, a virtual acquisition method based on historical data and real-time reference of point data is proposed when constructing a digital twin system. Firstly, computational fluid dynamics (CFD) simulation was used to analyze and determine the temperature distribution and environmental characteristics inside the layer house, and the collection area was preliminarily divided according to the CFD simulation results.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!