A mathematical theory of relational generalization in transitive inference.

Proc Natl Acad Sci U S A

Mortimer B. Zuckerman Mind Brain Behavior Institute, Department of Neuroscience, Columbia University, New York, NY 10027.

Published: July 2024

Humans and animals routinely infer relations between different items or events and generalize these relations to novel combinations of items. This allows them to respond appropriately to radically novel circumstances and is fundamental to advanced cognition. However, how learning systems (including the brain) can implement the necessary inductive biases has been unclear. We investigated transitive inference (TI), a classic relational task paradigm in which subjects must learn a relation (A > B and B > C) and generalize it to new combinations of items (A > C). Through mathematical analysis, we found that a broad range of biologically relevant learning models (e.g. gradient flow or ridge regression) perform TI successfully and recapitulate signature behavioral patterns long observed in living subjects. First, we found that models with item-wise additive representations automatically encode transitive relations. Second, for more general representations, a single scalar "conjunctivity factor" determines model behavior on TI and, further, the principle of norm minimization (a standard statistical inductive bias) enables models with fixed, partly conjunctive representations to generalize transitively. Finally, neural networks in the "rich regime," which enables representation learning and improves generalization on many tasks, unexpectedly show poor generalization and anomalous behavior on TI. We find that such networks implement a form of norm minimization (over hidden weights) that yields a local encoding mechanism lacking transitivity. Our findings show how minimal statistical learning principles give rise to a classical relational inductive bias (transitivity), explain empirically observed behaviors, and establish a formal approach to understanding the neural basis of relational abstraction.
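The claim that item-wise additive representations support transitive generalization can be illustrated with a small numerical sketch. The setup below is an assumption made for illustration, not the authors' code: seven items, each trial encoded as a one-hot vector for the first item concatenated with a one-hot vector for the second, training restricted to adjacent pairs, and a ridge penalty of 0.1. The function names (`additive_features`, `fit_ridge`, `run_ti`) are hypothetical.

```python
import numpy as np

def additive_features(i, j, n_items):
    """Additive code: one-hot for the first item, then one-hot for the second."""
    x = np.zeros(2 * n_items)
    x[i] = 1.0
    x[n_items + j] = 1.0
    return x

def fit_ridge(X, y, lam):
    """Closed-form ridge regression: w = (X^T X + lam * I)^{-1} X^T y."""
    d = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(d), X.T @ y)

def run_ti(n_items=7, lam=0.1):
    # Train only on adjacent pairs, in both presentation orders.
    train = [(i, i + 1) for i in range(n_items - 1)]
    train += [(j, i) for i, j in train]
    X = np.array([additive_features(i, j, n_items) for i, j in train])
    # Label +1 when the first item outranks the second (lower index = higher rank).
    y = np.array([1.0 if i < j else -1.0 for i, j in train])
    w = fit_ridge(X, y, lam)
    # Evaluate on every non-adjacent pair, none of which was seen in training.
    test = [(i, j) for i in range(n_items) for j in range(n_items)
            if abs(i - j) > 1]
    return {(i, j): float(additive_features(i, j, n_items) @ w) for i, j in test}

preds = run_ti()
correct = all((p > 0) == (i < j) for (i, j), p in preds.items())
print("all held-out pairs ranked correctly:", correct)
```

In this sketch the fitted weights assign each item a rank-like score, so the sign of the prediction on a held-out pair follows the item order, and the output magnitude grows with the rank distance between the two items.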


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11252811
DOI: http://dx.doi.org/10.1073/pnas.2314511121


Similar Publications

Erectile dysfunction (ED) is the leading cause of sexual dysfunction, affecting hundreds of millions of men worldwide, and has been described as an important public health problem. The association of five novel anthropometric indices related to obesity, lipids, and glucose with ED remains unclear. The objective was to investigate the association of lipid accumulation products index (LAP), triglyceride glucose index (TyG), waist triglyceride index (WTI), weight-adjusted-waist index (WWI), and a body shape index (ABSI) with ED.


Dissolution of CO₂ in water followed by the subsequent hydrolysis reactions is of great importance to the global carbon cycle, and carbon capture and storage. Despite numerous previous studies, the reactions are still not fully understood at the atomistic scale. Here, we combined ab initio molecular dynamics (AIMD) simulations with Markov state models to elucidate the reaction mechanisms and kinetics of CO₂ in supercritical water both in the bulk and nanoconfined states.


The false evidence rate: An approach to frequentist error rate control conditioning on the observed value.

Proc Natl Acad Sci U S A

January 2025

Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford OX3 7BN, United Kingdom.

A p value is conventionally interpreted either as (a) the probability by chance of obtaining more extreme results than those observed or (b) a tool for declaring significance at a prespecified level. Both approaches carry difficulties: (b) does not allow users to make inferences based on the data in hand, and is not rigorously followed by researchers in practice, while (a) is not meaningful as an error rate. Although p values retain an important role, these shortcomings are likely to have contributed significantly to the scientific reproducibility crisis.


The use of winglet devices is an efficient technique for enhancing aerodynamic performance. This study investigates the effects of winglet cant angles on both the aerodynamics and aeroacoustics of a commercial wing, comparing them to other significant parameters using a parametric analysis. A Full Factorial Design method is employed to generate a matrix of experiments, facilitating a detailed exploration of flow physics, with lift-to-drag ratio (L/D) and the integral of Acoustic Power Level (APL) as the primary representatives of aerodynamic and acoustic performance, respectively.


Among expanding discoveries of quantum phases in moiré superlattices, correlated insulators stand out as both the most stable and most commonly observed. Despite the central importance of these states in moiré physics, little is known about their underlying nature. Here, we use pump-probe spectroscopy to show distinct time-domain signatures of correlated insulators at fillings of one (ν = -1) and two (ν = -2) holes per moiré unit cell in the angle-aligned WSe₂/WS₂ system.

