Many important behavior changes are ; they require individuals to expend effort over a long period with little immediate gratification. Here, an artificial intelligence (AI) agent can provide personalized interventions to help individuals stick to their goals. In these settings, the AI agent must personalize (before the individual disengages) and , to help us understand the behavioral interventions. In this paper, we introduce Behavior Model Reinforcement Learning (BMRL), a framework in which an AI agent intervenes on the parameters of a Markov Decision Process (MDP) belonging to a . Our formulation of the human decision-maker as a planning agent allows us to attribute undesirable human policies (ones that do not lead to the goal) to their maladapted MDP parameters, such as an extremely low discount factor. Furthermore, we propose a class of tractable human models that captures fundamental behaviors in frictionful tasks. Introducing a notion of specific to BMRL, we theoretically and empirically show that AI planning with our human models can lead to helpful policies on a wide range of more complex, ground-truth humans.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11460771PMC

Publication Analysis

Top Keywords

reinforcement learning
8
frictionful tasks
8
human models
8
human
5
learning interventions
4
interventions boundedly
4
boundedly rational
4
rational human
4
human agents
4
agents frictionful
4

Similar Publications

TIMAR: Transition-informed representation for sample-efficient multi-agent reinforcement learning.

Neural Netw

December 2024

CAS Key Laboratory of GIPAS, University of Science and Technology of China, Hefei, China; Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei, China. Electronic address:

In MARL (Multi-Agent Reinforcement Learning), the trial-and-error learning paradigm based on multiple agents requires massive interactions to produce training samples, significantly increasing both the training cost and difficulty. Therefore, enhancing data efficiency is a core issue in MARL. However, in the context of MARL, agent partially observed information leads to a lack of consideration for agent interactions and coordination from an ego perspective under the world model, which becomes the main obstacle to improving the data efficiency of current proposed MARL methods.

View Article and Find Full Text PDF

Secondary motor cortex tracks decision value during the learning of a non-instructed task.

Cell Rep

January 2025

Institut Interdisciplinaire de Neurosciences (IINS), University Bordeaux, CNRS, IINS, UMR 5297, 33000 Bordeaux, France; Centre Broca Nouvelle-Aquitaine, 146, rue Léo-Saignat, 33076 Bordeaux, France. Electronic address:

Optimal decision-making depends on interconnected frontal brain regions, enabling animals to adapt decisions based on internal states, experiences, and contexts. The secondary motor cortex (M2) is key in adaptive behaviors in expert rodents, particularly in encoding decision values guiding complex probabilistic tasks. However, its role in deterministic tasks during initial learning remains uncertain.

View Article and Find Full Text PDF

The present study investigated the neuromodulatory substrates of salience processing and its impact on memory encoding and behaviour, with a specific focus on two distinct types of salience: reward and contextual unexpectedness. 46 Participants performed a novel task paradigm modulating these two aspects independently and allowing for investigating their distinct and interactive effects on memory encoding while undergoing high-resolution fMRI. By using advanced image processing techniques tailored to examine midbrain and brainstem nuclei with high precision, our study additionally aimed to elucidate differential activation patterns in subcortical nuclei in response to reward-associated and contextually unexpected stimuli, including distinct pathways involving in particular dopaminergic modulation.

View Article and Find Full Text PDF

The ability of environmental cues to trigger alcohol-seeking behaviours is thought to facilitate problematic alcohol use. Individuals' tendency to attribute incentive salience to cues may increase the risk of addiction. We sought to study the relationship between incentive salience and alcohol addiction using non-preferring rats to model the heterogeneity of human alcohol consumption, investigating both males and females.

View Article and Find Full Text PDF

Background: Pulmonary arterial hypertension (PAH) is a rare but severe and life-threatening condition that primarily affects the pulmonary blood vessels and the right ventricle of the heart. The limited availability of human tissue for research ~most of which represents only end-stage disease~ has led to a reliance on preclinical animal models. However, these models often fail to capture the heterogeneity and complexity of the human condition.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!