Knowledge-augmented face perception: Prospects for the Bayesian brain-framework to align AI and human vision.

Conscious Cogn

Cluster of Excellence Science of Intelligence, Technische Universität Berlin, Germany; Department of Psychology, Humboldt-Universität zu Berlin, Germany.

Published: May 2022

Human visual perception is efficient, flexible and context-sensitive. The Bayesian brain view explains this with probabilistic perceptual inference integrating prior experience and knowledge through top-down influences. Advances in machine learning, such as Artificial Neural Networks (ANNs), have enabled considerable progress in computer vision. Unlike humans, these networks do not yet adaptively draw on meaningful and task-relevant contextual cues and prior knowledge. We propose ideas to better align human and computer vision, applied to facial expression recognition. We review evidence of knowledge-augmented and context-sensitive face perception in humans and approaches trying to leverage such sources of information in computer vision. We discuss how both fields can establish an epistemic loop: Redesigning synthetic systems with inspiration from the Bayesian brain-framework could make networks more flexible and useful for human-machine interaction. In turn, employing ANNs as scientific tools will widen the scope of empirical research into human knowledge-augmented perception.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.concog.2022.103301DOI Listing

Publication Analysis

Top Keywords

computer vision
12
face perception
8
bayesian brain-framework
8
align human
8
knowledge-augmented face
4
perception
4
perception prospects
4
prospects bayesian
4
brain-framework align
4
human
4

Similar Publications

Physics-Based Synthetic Data Model for Automated Segmentation in Catalysis Microscopy.

Microsc Microanal

January 2025

Fritz-Haber-Institut der Max-Planck-Gesellschaft, Berlin 14195, Germany.

In catalysis research, the amount of microscopy data acquired when imaging dynamic processes is often too much for nonautomated quantitative analysis. Developing machine learned segmentation models is challenged by the requirement of high-quality annotated training data. We thus substitute expert-annotated data with a physics-based sequential synthetic data model.

View Article and Find Full Text PDF

Background: Effective pain recognition and treatment in perioperative environments reduce length of stay and decrease risk of delirium and chronic pain. We sought to develop and validate preliminary computer vision-based approaches for nociception detection in hospitalized patients.

Methods: Prospective observational cohort study using red-green-blue camera detection of perioperative patients.

View Article and Find Full Text PDF

Malaria is a major public health challenge in sub-Saharan Africa. Timely and accurate diagnosis of malaria is vital to reduce the caseload and mortality rates associated with malaria The use of microscopy in malaria screening is the gold standard recommended method by the World Health Organisation (WHO). In Uganda, utilization of microscopy is challenged by insufficient expertise to interpret the images accurately, affecting the efficiency, effectiveness and accuracy of malaria detection and diagnosis.

View Article and Find Full Text PDF

MEVDT: Multi-modal event-based vehicle detection and tracking dataset.

Data Brief

February 2025

Department of Electrical and Computer Engineering, University of Michigan-Dearborn, 4901 Evergreen Rd, Dearborn, 48128 MI, USA.

In this data article, we introduce the Multi-Modal Event-based Vehicle Detection and Tracking (MEVDT) dataset. This dataset provides a synchronized stream of event data and grayscale images of traffic scenes, captured using the Dynamic and Active-Pixel Vision Sensor (DAVIS) 240c hybrid event-based camera. MEVDT comprises 63 multi-modal sequences with approximately 13k images, 5M events, 10k object labels, and 85 unique object tracking trajectories.

View Article and Find Full Text PDF

Glaucoma, a severe eye disease leading to irreversible vision loss if untreated, remains a significant challenge in healthcare due to the complexity of its detection. Traditional methods rely on clinical examinations of fundus images, assessing features like optic cup and disc sizes, rim thickness, and other ocular deformities. Recent advancements in artificial intelligence have introduced new opportunities for enhancing glaucoma detection.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!