We consider an agent that must choose repeatedly among several actions. Each action has a certain probability of giving the agent an energy reward, and costs may be associated with switching between actions. The agent does not know which action has the highest reward probability, and the probabilities change randomly over time. We study two learning rules that have been widely used to model decision-making processes in animals-one deterministic and the other stochastic. In particular, we examine the influence of the rules' 'learning rate' on the agent's energy gain. We compare the performance of each rule with the best performance attainable when the agent has either full knowledge or no knowledge of the environment. Over relatively short periods of time, both rules are successful in enabling agents to exploit their environment. Moreover, under a range of effective learning rates, both rules are equivalent, and can be expressed by a third rule that requires the agent to select the action for which the current run of unsuccessful trials is shortest. However, the performance of both rules is relatively poor over longer periods of time, and under most circumstances no better than the performance an agent could achieve without knowledge of the environment. We propose a simple extension to the original rules that enables agents to learn about and effectively exploit a changing environment for an unlimited period of time.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3226992PMC
http://dx.doi.org/10.1098/rsif.2007.1348DOI Listing

Publication Analysis

Top Keywords

learning rules
8
knowledge environment
8
periods time
8
rules
6
agent
6
simple learning
4
rules cope
4
cope changing
4
changing environments
4
environments consider
4

Similar Publications

Background And Aims: Infertility, as defined by the World Health Organization, is the inability to conceive after 12 months of regular, unprotected intercourse. This study aimed to identify factors influencing infertility by applying data mining techniques, specifically rule-mining methods, to analyze diverse patient data and uncover relevant insights. This approach involves a thorough analysis of patients' clinical characteristics, dietary habits, and overall conditions to identify complex patterns and relationships that may contribute to infertility.

View Article and Find Full Text PDF

In bioneuronal systems, the synergistic interaction between mechanosensitive piezo channels and neuronal synapses can convert and transmit pressure signals into complex temporal plastic pulses with excitatory and inhibitory features. However, existing artificial tactile neuromorphic systems struggle to replicate the elaborate temporal plasticity observed between excitatory and inhibitory features in biological systems, which is critical for the biomimetic processing and memorizing of tactile information. Here we demonstrate a mechano-gated iontronic piezomemristor with programmable temporal-tactile plasticity.

View Article and Find Full Text PDF

Background: Drug-drug interactions (DDIs) especially antagonistic ones present significant risks to patient safety, underscoring the urgent need for reliable prediction methods. Recently, substructure-based DDI prediction has garnered much attention due to the dominant influence of functional groups and substructures on drug properties. However, existing approaches face challenges regarding the insufficient interpretability of identified substructures and the isolation of chemical substructures.

View Article and Find Full Text PDF

An Iterative Design Method for Advancing Air Traffic Control and Management Training Through Immersive VFR 3D Map Visualization.

IISE Trans Occup Ergon Hum Factors

January 2025

The Bradley Department of Electrical and Computer Engineering, College of Engineering, Virginia Polytechnic Institute and State University, Blacksburg, VA, USA.

OCCUPATIONAL APPLICATIONSInnovative tools that align with modern learners' preferences are essential for training in safety-critical professions like Air Traffic Control/Management. This study evaluated a Virtual Reality Visual Flight Rules 3D Map Visualization Tool designed to meet the Federal Aviation Administration's (FAA) modernization goals. The tool immerses trainee in contextually accurate environments, enhancing engagement and self-paced learning.

View Article and Find Full Text PDF

Development of an Analytical Model for Predicting the Shear Viscosity of Polypropylene Compounds.

Polymers (Basel)

January 2025

Institute for Plastics Processing (IKV) in Industry and Craft, RWTH Aachen University, Seffenter Weg 201, 52074 Aachen, Germany.

The need for an efficient adaptation of existing polypropylene (PP) formulations or the creation of new formulations has become increasingly important in various industries. Variations in viscosity resulting from changes in raw materials, fillers, and additives can have a significant impact on the processing and quality of PP products. This study presents the development of an analytical model designed to predict the shear viscosity of complex PP blends.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!