Feedforward object-vision models only tolerate small image variations compared to human.

Masoud Ghodrati Amirhossein Farzmahdi Karim Rajaei Reza Ebrahimpour Seyed-Mahdi Khaligh-Razavi

Front Comput Neurosci

MRC Cognition and Brain Sciences Unit, University of Cambridge Cambridge, UK.

Published: August 2014

Invariant object recognition is a remarkable ability of primates' visual system that its underlying mechanism has constantly been under intense investigations. Computational modeling is a valuable tool toward understanding the processes involved in invariant object recognition. Although recent computational models have shown outstanding performances on challenging image databases, they fail to perform well in image categorization under more complex image variations. Studies have shown that making sparse representation of objects by extracting more informative visual features through a feedforward sweep can lead to higher recognition performances. Here, however, we show that when the complexity of image variations is high, even this approach results in poor performance compared to humans. To assess the performance of models and humans in invariant object recognition tasks, we built a parametrically controlled image database consisting of several object categories varied in different dimensions and levels, rendered from 3D planes. Comparing the performance of several object recognition models with human observers shows that only in low-level image variations the models perform similar to humans in categorization tasks. Furthermore, the results of our behavioral experiments demonstrate that, even under difficult experimental conditions (i.e., briefly presented masked stimuli with complex image variations), human observers performed outstandingly well, suggesting that the models are still far from resembling humans in invariant object recognition. Taken together, we suggest that learning sparse informative visual features, although desirable, is not a complete solution for future progresses in object-vision modeling. We show that this approach is not of significant help in solving the computational crux of object recognition (i.e., invariant object recognition) when the identity-preserving image variations become more complex.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4103258	PMC
http://dx.doi.org/10.3389/fncom.2014.00074	DOI Listing

Publication Analysis

Top Keywords

object recognition

image variations

invariant object

image

object

recognition

complex image

informative visual

visual features

humans invariant

Similar Publications

A Scene Knowledge Integrating Network for Transmission Line Multi-Fitting Detection.

Sensors (Basel)

December 2024

Automation Department, North China Electric Power University, Baoding 071003, China.

Xinhang Chen Xinsheng Xu Jing Xu Wenjie Zheng Qianming Wang

Aiming at the severe occlusion problem and the tiny-scale object problem in the multi-fitting detection task, the Scene Knowledge Integrating Network (SKIN), including the scene filter module (SFM) and scene structure information module (SSIM) is proposed. Firstly, the particularity of the scene in the multi-fitting detection task is analyzed. Hence, the aggregation of the fittings is defined as the scene according to the professional knowledge of the power field and the habit of the operators in identifying the fittings.

View Article and Find Full Text PDF

Similar Publications

Impact of Omega-3 on Endocannabinoid System Expression and Function, Enhancing Cognition and Behavior in Male Mice.

Nutrients

December 2024

Department of Neurosciences, Faculty of Medicine and Nursing, University of the Basque Country UPV/EHU, 48940 Leioa, Spain.

Maitane Serrano Miquel Saumell-Esnaola Garazi Ocerin Gontzal García Del Caño Nagore Puente

: Omega-3 long-chain polyunsaturated fatty acids (PUFAs) support brain cell membrane integrity and help mitigate synaptic plasticity deficits. The endocannabinoid system (ECS) is integral to synaptic plasticity and regulates various brain functions. While PUFAs influence the ECS, the effects of omega-3 on the ECS, cognition, and behavior in a healthy brain remain unclear.

View Article and Find Full Text PDF

Similar Publications

Visualizing Plant Disease Distribution and Evaluating Model Performance for Deep Learning Classification with YOLOv8.

Pathogens

November 2024

College of Information Engineering, Yangzhou University, Yangzhou 225009, China.

Abdul Ghafar Caikou Chen Syed Atif Ali Shah Zia Ur Rehman Gul Rahman

This paper presents a novel methodology for plant disease detection using YOLOv8 (You Only Look Once version 8), a state-of-the-art object detection model designed for real-time image classification and recognition tasks. The proposed approach involves training a custom YOLOv8 model to detect and classify various plant conditions accurately. The model was evaluated using a testing subset to measure its performance in detecting different plant diseases.

View Article and Find Full Text PDF

Similar Publications

A High-Repeatability Three-Dimensional Force Tactile Sensing System for Robotic Dexterous Grasping and Object Recognition.

Micromachines (Basel)

December 2024

School of Aerospace Science and Technology, Xidian University, Xi'an 710071, China.

Yaoguang Shi Xiaozhou Lü Wenran Wang Xiaohui Zhou Wensong Zhu

Robotic devices with integrated tactile sensors can accurately perceive the contact force, pressure, sliding, and other tactile information, and they have been widely used in various fields, including human-robot interaction, dexterous manipulation, and object recognition. To address the challenges associated with the initial value drift, and to improve the durability and accuracy of the tactile detection for a robotic dexterous hand, in this study, a flexible tactile sensor is designed with high repeatability by introducing a supporting layer for pre-separation. The proposed tactile sensor has a detection range of 0-5 N with a resolution of 0.

View Article and Find Full Text PDF

Similar Publications

Org24598, a Selective Glycine Transporter 1 (GlyT1) Inhibitor, Reverses Object Recognition and Spatial Memory Impairments Following Binge-like Ethanol Exposure in Rats.

Molecules

December 2024

Department of Pharmacology and Pharmacodynamics, Medical University of Lublin, Chodzki 4a, 20-093 Lublin, Poland.

Joanna Filarowska-Jurko Pawel Grochecki Ewa Gibuła-Tarlowska Joanna Listos Ewa Kedzierska

The N-methyl-D-aspartate (NMDA) glutamate receptor is a major target of ethanol, and it is implicated in learning and memory formation, and other cognitive functions. Glycine acts as a co-agonist for this receptor. We examined whether Org24598, a selective inhibitor of glycine transporter1 (GlyT1), affects ethanol withdrawal-induced deficits in recognition memory (Novel Object Recognition (NOR) task) and spatial memory (Barnes Maze (BM) task) in rats, and whether the NMDA receptor glycine site participates in this phenomenon.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!