Publications by authors named "Gershman S"

Limits on information processing capacity impose limits on task performance. We show that male and female mice achieve performance on a perceptual decision task that is near-optimal given their capacity limits, as measured by policy complexity (the mutual information between states and actions). This behavioral profile could be achieved by reinforcement learning with a penalty on high complexity policies, realized through modulation of dopaminergic learning signals.

View Article and Find Full Text PDF

Individual contributors to a collaborative task are often rewarded for going above and beyond-salespeople earn commissions, athletes earn performance bonuses, and companies award special parking spots to their employee of the month. How do we decide when to reward collaborators, and are these decisions closely aligned with how responsible they were for the outcome of a collaboration? In Experiments 1a and 1b ( ), we tested how participants give bonuses, using stimuli and an experiment design that has previously been used to elicit responsibility judgments (Xiang et al., 2023a).

View Article and Find Full Text PDF

Despite overwhelming scientific consensus on the existence of human-caused climate change, public opinion among Americans remains split. Directly informing people of scientific consensus is among the most prominent strategies for climate communication, yet the reasons for its effectiveness and its limitations are not fully understood. Here, we propose that consensus messaging provides information not only about the existence of climate change but also traits of climate scientists themselves.

View Article and Find Full Text PDF

How are people able to understand everyday physical events with such ease? One hypothesis suggests people use an approximate probabilistic simulation of the world. A contrasting hypothesis is that people use a collection of abstractions or features. While it has been noted that the two hypotheses explain complementary aspects of physical reasoning, there has yet to be a model of how these two modes of reasoning can be used together.

View Article and Find Full Text PDF

Humans form sequences of -representations of the current situation-to predict how activity will unfold. Multiple mechanisms have been proposed for how the cognitive system determines when to segment the stream of behavior and switch from one active event model to another. Here, we constructed a computational model that learns knowledge about event classes (event schemas), by combining recurrent neural networks for short-term dynamics with Bayesian inference over event classes for event-to-event transitions.

View Article and Find Full Text PDF

Limits on information processing capacity impose limits on task performance. We show that animals achieve performance on a perceptual decision task that is near-optimal given their capacity limits, as measured by policy complexity (the mutual information between states and actions). This behavioral profile could be achieved by reinforcement learning with a penalty on high complexity policies, realized through modulation of dopaminergic learning signals.

View Article and Find Full Text PDF

Over the last few decades, psychologists have developed precise quantitative models of human recall performance in visual working memory (VWM) tasks. However, these models are tailored to a particular class of artificial stimulus displays and simple feature reports from participants (e.g.

View Article and Find Full Text PDF

We all possess a mental library of schemas that specify how different types of events unfold. How are these schemas acquired? A key challenge is that learning a new schema can catastrophically interfere with old knowledge. One solution to this dilemma is to use interleaved training to learn a single representation that accommodates all schemas.

View Article and Find Full Text PDF
Article Synopsis
  • Adaptive behavior relies on the ability to predict future events, and reinforcement learning theory outlines useful predictive representations and methods for computing them.
  • This review combines insights from reinforcement learning with findings in cognition and neuroscience, highlighting the significance of successor representation and its variations.
  • The alignment of these theories with brain function indicates that certain predictive representations could serve as fundamental components of intelligence.
View Article and Find Full Text PDF

Habituation, the reduction of responding to repetitive stimuli, is often conceptualized as a kind of attentional filter, amplifying salient signals at the expense of non-salient signals. No prior account has explicitly formalized filtering principles that can explain the major characteristics of habituation. In this paper, a simple probabilistic model is developed which permits analysis of the optimal filtering problem.

View Article and Find Full Text PDF

Background: The Oncotype DX Genomic Prostate Score (ODX-GPS) is a gene expression assay that predicts disease aggressiveness. The objective of this study was to identify sociodemographic and regional factors associated with ODX-GPS uptake.

Methods: Data from Surveillance Epidemiology and End Results registries on men with localized prostate cancer with a Gleason score of 3 + 3 or 3 + 4, PSA ≤20 ng/mL, and stage T1c to T2c disease from 2013 through 2017 were linked with ODX-GPS data.

View Article and Find Full Text PDF
Article Synopsis
  • Precision medicine is increasingly important in cancer care, but tumor genomic data has been lacking in the National Cancer Institute's SEER Program, limiting research on molecular subtypes.
  • To improve this, the SEER Program has implemented a centralized process to link cancer cases in their registries with genomic test results from molecular labs, using specialized software and a trusted third party for data handling.
  • Recent linkages have included various OncotypeDX tests and results from other genomic classifiers, which facilitate the research community's access to valuable, de-identified data for cancer studies.
View Article and Find Full Text PDF

The most influential account of phasic dopamine holds that it reports reward prediction errors (RPEs). The RPE-based interpretation of dopamine signaling is, in its original form, probably too simple and fails to explain all the properties of phasic dopamine observed in behaving animals. This Perspective helps to resolve some of the conflicting interpretations of dopamine that currently exist in the literature.

View Article and Find Full Text PDF
Article Synopsis
  • Researchers suggest that humans process events by identifying latent causes (LCs) which aid in learning based on context, but it's uncertain how this works when contexts share common structures.
  • The Latent Cause Network (LCNet) is introduced as a neural network model that effectively infers LCs by balancing shared and context-specific structures through innovative learning mechanisms.
  • LCNet successfully demonstrated its ability to extract shared LCs across tasks, align with human learning patterns, and interpret complex real-life event data, offering a scalable solution for understanding context in learning processes.
View Article and Find Full Text PDF

How do teachers learn about what learners already know? How do learners aid teachers by providing them with information about their background knowledge and what they find confusing? We formalize this collaborative reasoning process using a hierarchical Bayesian model of pedagogy. We then evaluate this model in two online behavioral experiments (N = 312 adults). In Experiment 1, we show that teachers select examples that account for learners' background knowledge, and adjust their examples based on learners' feedback.

View Article and Find Full Text PDF

Board, card or video games have been played by virtually every individual in the world. Games are popular because they are intuitive and fun. These distinctive qualities of games also make them ideal for studying the mind.

View Article and Find Full Text PDF

Psychopathology is vast and diverse. Across distinct disease states, individuals exhibit symptoms that appear counter to the standard view of rationality (expected utility maximization). We argue that some aspects of psychopathology can be described as resource-rational, reflecting a rational trade-off between reward and cognitive resources.

View Article and Find Full Text PDF

As investigations in the biomedical applications of plasma advance, a demand for describing safe and efficacious delivery of plasma is emerging. It is quite clear that not all plasmas are "equal" for all applications. This Perspective discusses limitations of the existing parameters used to define plasma in context of the need for the "right plasma" at the "right dose" for each "disease system.

View Article and Find Full Text PDF

Policy compression is a computational framework that describes how capacity-limited agents trade reward for simpler action policies to reduce cognitive cost. In this study, we present behavioral evidence that humans prefer simpler policies, as predicted by a capacity-limited reinforcement learning model. Across a set of tasks, we find that people exploit structure in the relationships between states, actions, and rewards to "compress" their policies.

View Article and Find Full Text PDF

In order to efficiently divide labor with others, it is important to understand what our collaborators can do (i.e., their competence).

View Article and Find Full Text PDF

Associative learning depends on contingency, the degree to which a stimulus predicts an outcome. Despite its importance, the neural mechanisms linking contingency to behavior remain elusive. Here we examined the dopamine activity in the ventral striatum - a signal implicated in associative learning - in a Pavlovian contingency degradation task in mice.

View Article and Find Full Text PDF

Neuroscience and artificial intelligence (AI) share a long, intertwined history. It has been argued that discoveries in neuroscience were (and continue to be) instrumental in driving the development of new AI technology. Scrutinizing these historical claims yields a more nuanced story, where AI researchers were loosely inspired by the brain, but ideas flowed mostly in the other direction.

View Article and Find Full Text PDF

Computational phenotyping has emerged as a powerful tool for characterizing individual variability across a variety of cognitive domains. An individual's computational phenotype is defined as a set of mechanistically interpretable parameters obtained from fitting computational models to behavioural data. However, the interpretation of these parameters hinges critically on their psychometric properties, which are rarely studied.

View Article and Find Full Text PDF