Thompson Sampling for Non-Stationary Bandit Problems.

Entropy (Basel)

School of Software Engineering, Xi'an Jiaotong University, Xi'an 710049, China.

Published: January 2025

Non-stationary multi-armed bandit (MAB) problems have recently attracted extensive attention. We focus on the abruptly changing scenario where reward distributions remain constant for a certain period and change at unknown time steps. Although Thompson sampling (TS) has shown success in non-stationary settings, there is currently no regret bound analysis for TS with uninformative priors. To address this, we propose two algorithms, discounted TS and sliding-window TS, designed for sub-Gaussian reward distributions. For these algorithms, we establish an upper bound for the expected regret by bounding the expected number of times a suboptimal arm is played. We show that the regret upper bounds of both algorithms are O~(TBT), where is the time horizon and BT is the number of breakpoints. This upper bound matches the lower bound for abruptly changing problems up to a logarithmic factor. Empirical comparisons with other non-stationary bandit algorithms highlight the competitive performance of our proposed methods.

Download full-text PDF

Source
http://dx.doi.org/10.3390/e27010051DOI Listing

Publication Analysis

Top Keywords

thompson sampling
8
non-stationary bandit
8
abruptly changing
8
reward distributions
8
upper bound
8
non-stationary
4
sampling non-stationary
4
bandit problems
4
problems non-stationary
4
non-stationary multi-armed
4

Similar Publications

This study introduces a novel ensemble learning technique namely Multi-Armed Bandit Ensemble (MAB-Ensemble), designed for lane detection in road images intended for autonomous vehicles. The foundation of the proposed MAB-Ensemble technique is inspired in terms of Multi-Armed bandit optimization to facilitate efficient model selection for lane segmentation. The benchmarking dataset namely TuSimple is used for training, validating and testing the proposed and existing lane detection techniques.

View Article and Find Full Text PDF

Infection with the zoonotic nematode, Trichinella murrelli in bobcats (Lynx rufus) from Minnesota, USA.

Vet Parasitol Reg Stud Reports

January 2025

Minnesota Department of Natural Resources, Wildlife Health Program, 5463 West Broadway, Forest Lake, MN 55025, USA.

During a survey for other parasitic infections, tongue and limb muscles of wild bobcats (Lynx rufus) were examined microscopically from a sample of 190 Minnesota bobcats. One batch (n = 50) was studied in 2015, yielding no cases of Trichinella when examined only with hematoxylin and eosin (HE) staining of <25 mg of tissue. In a 2024 cohort (n = 140), this method was paired with a compression of ∼5 g of tissue.

View Article and Find Full Text PDF

Multi-season survey of ixodid tick species collected from domestic dogs in Chad, Africa.

Vet Parasitol Reg Stud Reports

January 2025

Southeastern Cooperative Wildlife Disease Study, Department of Population Health, College of Veterinary Medicine, Wildlife Health Building, 589 D.W. Brooks Dr., University of Georgia, Athens, GA 30602, USA; Warnell School of Forestry and Natural Resources, University of Georgia, Athens, GA 30602, USA; Center for Ecology of Infectious Diseases, University of Georgia, Athens, GA 30602, USA. Electronic address:

Ticks are medically important vectors of pathogens, many of which are zoonotic or impact domestic animal and/or wildlife health. Climate change, landuse modifications, and increasing interactions between domestic animals, wildlife, and humans have resulted in changes in tick-host dynamics and the emergence of novel pathogens worldwide. Therefore, describing the host and geographic ranges of vector species is essential in assessing disease risk, especially in understudied areas, and should be conducted in a One Health context.

View Article and Find Full Text PDF

Background: Cognition is a research priority for people living with chronic kidney disease (CKD), but identification of critical research questions is lacking. This study aimed to determine which cognition-related research questions are most important to CKD stakeholders.

Methods: A modified Delphi technique with 3 survey rounds was used.

View Article and Find Full Text PDF

Cases for a disease can be defined broadly using diagnostic codes, or narrowly using gold-standard confirmation that often is not available in large administrative datasets. These different definitions can have significant impacts on the results and conclusions of studies. We conducted this study to assess how using melanoma phecodes versus histologic confirmation for invasive or in situ melanoma impacts the results of a genome-wide association study (GWAS) using the Million Veteran Program.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!