Reinforcement learning models generally assume that a stimulus is presented that allows a learner to unambiguously identify the state of nature, and the reward received is drawn from a distribution that depends on that state. However, in any natural environment, the stimulus is noisy. When there is state uncertainty, it is no longer immediately obvious how to perform reinforcement learning, since the observed reward cannot be unambiguously allocated to a state of the environment.
View Article and Find Full Text PDFWe consider an agent that must choose repeatedly among several actions. Each action has a certain probability of giving the agent an energy reward, and costs may be associated with switching between actions. The agent does not know which action has the highest reward probability, and the probabilities change randomly over time.
View Article and Find Full Text PDFMany animals nest or roost colonially. At the start of a potential foraging period, they may set out independently or await information from returning foragers. When should such individuals act independently and when should they wait for information? In a social insect colony, for example, information transfer may greatly increase a recruit's probability of finding food, and it is commonly assumed that this will always increase the colony's net energy gain.
View Article and Find Full Text PDF