Visual tracking and attribute estimation related to age or gender information of multiple person entities in a scene are mature research topics with the advent of deep learning techniques. However, when it comes to indoor images such as video sequences of retail consumers, data are not always adequate or accurate enough to essentially train effective models for consumer detection and tracking under various adverse factors. This in turn affects the quality of recognizing age or gender for those detected instances. In this work, we introduce two novel datasets: comprises 145 video sequences compliant to personal information regulations as far as facial images are concerned and is a set of cropped body images from each sequence that can be used for numerous computer vision tasks. We also propose an end-to-end framework which comprises CNNs as object detectors, LSTMs for motion forecasting of the tracklet association component in a sequence, along with a multi-attribute classification model for apparent demographic estimation of the detected outputs, aiming to capture useful metadata of consumer product preferences. Obtained results on tracking and age/gender prediction are promising with respect to reference systems while they indicate the proposed model's potential for practical consumer metadata extraction.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10708599PMC
http://dx.doi.org/10.3390/s23239510DOI Listing

Publication Analysis

Top Keywords

visual tracking
8
apparent demographic
8
demographic estimation
8
age gender
8
video sequences
8
benchmark consumer
4
consumer visual
4
tracking
4
tracking apparent
4
estimation rgb
4

Similar Publications

In order to address the issue of tracking errors of collision Caenorhabditis elegans, this research proposes an improved particle filter tracking method integrated with cultural algorithm. The particle filter algorithm is enhanced through the integration of the sine cosine algorithm, thereby facilitating uninterrupted tracking of the target C. elegans.

View Article and Find Full Text PDF

Background: Food image recognition, a crucial step in computational gastronomy, has diverse applications across nutritional platforms. Convolutional neural networks (CNNs) are widely used for this task due to their ability to capture hierarchical features. However, they struggle with long-range dependencies and global feature extraction, which are vital in distinguishing visually similar foods or images where the context of the whole dish is crucial, thus necessitating transformer architecture.

View Article and Find Full Text PDF

A Dual-Channel and Frequency-Aware Approach for Lightweight Video Instance Segmentation.

Sensors (Basel)

January 2025

The Higher Educational Key Laboratory for Measuring & Control Technology and Instrumentation of Heilongjiang Province, Harbin University of Science and Technology, Harbin 150080, China.

Video instance segmentation, a key technology for intelligent sensing in visual perception, plays a key role in automated surveillance, robotics, and smart cities. These scenarios rely on real-time and efficient target-tracking capabilities for accurate perception and intelligent analysis of dynamic environments. However, traditional video instance segmentation methods face complex models, high computational overheads, and slow segmentation speeds in time-series feature extraction, especially in resource-constrained environments.

View Article and Find Full Text PDF

Study on the Influence of Rural Highway Landscape Green Vision Rate on Driving Load Based on Factor Analysis.

Sensors (Basel)

January 2025

School of Civil Engineering Architecture and the Environment, Hubei University of Technology, Wuhan 430068, China.

The green vision rate of rural highway greening landscape is a key factor affecting the driver's visual load. Based on this, this paper uses the eye tracking method to study the visual characteristics of drivers in different green vision environments on rural highways in Xianning County. Based on the HSV color space model, this paper obtains four sections of rural highway with a green vision rate of 10~20%, green vision rate of 20~30%, green vision rate of 30~40%, and green vision rate of 40~50%.

View Article and Find Full Text PDF

Gut Colonization of Zebrafish Larvae Induces a Dampened Sensorimotor Response.

Biomedicines

January 2025

Department of Biochemistry, Microbiology, and Immunology, Wayne State University, Detroit, MI 48201, USA.

Cholera is a diarrheal disease prevalent in populations without access to clean water. Cholera is caused by which colonizes the upper small intestine in humans once ingested. A growing number of studies suggest that the gut microbiome composition modulates animal behavior.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!