Active object recognition, fundamental to tasks like reading and driving, relies on the ability to make time-sensitive decisions. People exhibit a flexible tradeoff between speed and accuracy, a crucial human skill. However, current computational models struggle to incorporate time. To address this gap, we present the first dataset (with 148 observers) exploring the speed-accuracy tradeoff (SAT) in ImageNet object recognition. Participants performed a 16-way ImageNet categorization task where their responses counted only if they occurred near the time of a fixed-delay beep. Each block of trials allowed one reaction time. As expected, human accuracy increases with reaction time. We compare human performance with that of dynamic neural networks that adapt their computation to the available inference time. Time is a scarce resource for human object recognition, and finding an appropriate analog in neural networks is challenging. Networks can repeat operations by using layers, recurrent cycles, or early exits. We use the repetition count as a network's analog for time. In our analysis, the number of layers, recurrent cycles, and early exits correlates strongly with floating-point operations, making them suitable time analogs. Comparing networks and humans on SAT-fit error, category-wise correlation, and SAT-curve steepness, we find cascaded dynamic neural networks most promising in modeling human speed and accuracy. Surprisingly, convolutional recurrent networks, typically favored in human object recognition modeling, perform the worst on our benchmark.
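
As a rough illustration of the analysis described above, the following minimal Python sketch builds a speed-accuracy curve from per-trial outcomes and checks how strongly a repetition count tracks floating-point operations. It is not the authors' code: the function names `sat_curve` and `pearson`, and the `(time_budget, is_correct)` trial format, are assumptions made for this example.

```python
from collections import defaultdict
from math import sqrt


def sat_curve(trials):
    """Return sorted (time_budget, accuracy) pairs.

    `trials` is an iterable of (time_budget, is_correct) tuples
    (a hypothetical format). For humans the budget would be the beep
    delay; for a network it would be the repetition count (layers,
    recurrent cycles, or early-exit index).
    """
    hits = defaultdict(int)
    counts = defaultdict(int)
    for budget, is_correct in trials:
        counts[budget] += 1
        hits[budget] += int(bool(is_correct))
    return [(b, hits[b] / counts[b]) for b in sorted(counts)]


def pearson(xs, ys):
    """Pearson correlation, e.g. between repetition count and FLOPs."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length sequences with >= 2 items")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)
```

Under these assumptions, a human curve and a network curve computed with `sat_curve` can be compared point by point, and `pearson` gives a quick check of whether a chosen repetition count is a reasonable stand-in for compute.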

Source
http://dx.doi.org/10.1167/jov.25.1.4
