Visual features are successfully exploited in several applications (e.g., visual search, object recognition and tracking, etc.) due to their ability to efficiently represent image content. Several visual analysis tasks require features to be transmitted over a bandwidth-limited network, thus calling for coding techniques to reduce the required bit budget, while attaining a target level of efficiency. In this paper, we propose, for the first time, a coding architecture designed for local features (e.g., SIFT, SURF) extracted from video sequences. To achieve high coding efficiency, we exploit both spatial and temporal redundancy by means of intraframe and interframe coding modes. In addition, we propose a coding mode decision based on rate-distortion optimization. The proposed coding scheme can be conveniently adopted to implement the analyze-then-compress (ATC) paradigm in the context of visual sensor networks. That is, sets of visual features are extracted from video frames, encoded at remote nodes, and finally transmitted to a central controller that performs visual analysis. This is in contrast to the traditional compress-then-analyze (CTA) paradigm, in which video sequences acquired at a node are compressed and then sent to a central unit for further processing. In this paper, we compare these coding paradigms using metrics that are routinely adopted to evaluate the suitability of visual features in the context of content-based retrieval, object recognition, and tracking. Experimental results demonstrate that, thanks to the significant coding gains achieved by the proposed coding scheme, ATC outperforms CTA with respect to all evaluation metrics.

Download full-text PDF

Source
http://dx.doi.org/10.1109/tip.2014.2312617DOI Listing

Publication Analysis

Top Keywords

visual features
16
extracted video
12
video sequences
12
coding
10
features extracted
8
object recognition
8
recognition tracking
8
visual analysis
8
proposed coding
8
coding scheme
8

Similar Publications

Unlabelled: Considering the similarity in clinical presentations of iris neoplasms of various origins, questions of their noninvasive diagnosis remain relevant. Optical coherence tomography angiography (OCT-A) is one of the imaging method that enables visualization of tumor vessels.

Purpose: This article examines the features of angioarchitecture, vascular network density, and perfusion density of iris melanoma and progressive iris nevus using OCT-A.

View Article and Find Full Text PDF

The social life of creative methods: Filmmaking, fabulation and recovery.

Br J Sociol

December 2024

Artist in Residence, Centre for Health, Arts, Society and the Environment (CHASE), University of Liverpool, Liverpool, UK.

In this article we consider the theoretical and methodological implications of Deleuzian fabulation for research on recovery from drugs and alcohol as an alternative way of making and doing methods in sociology. The article draws on data produced as part of an ongoing interdisciplinary research collaboration, begun in 2019, with the visual artist and filmmaker Melanie Manchot, social scientists Nicole Vitellone and Lena Theodoropoulou, and people in recovery from drugs and alcohol engaged in the production of Manchot's first feature film STEPHEN. This project attends to the methodological practice of filmmaking as a way of thinking with and alongside colleagues from divergent disciplines about the role of methods, concepts and practices for confronting and resisting processes of stigmatisation.

View Article and Find Full Text PDF

The quantity of cable conductors is a crucial parameter in cable manufacturing, and accurately detecting the number of conductors can effectively promote the digital transformation of the cable manufacturing industry. Challenges such as high density, adhesion, and knife mark interference in cable conductor images make intelligent detection of conductor quantity particularly difficult. To address these challenges, this study proposes the YOLO-cable model, which is an improvement made upon the YOLOv10 model.

View Article and Find Full Text PDF

GPS/VIO integrated navigation system based on factor graph and fuzzy logic.

Sci Rep

December 2024

Department of Electrical Engineering, Iran University of Science and Technology, Tehran, 16846-13114, Iran.

In today's technologically advanced landscape, precision in navigation and positioning holds paramount importance across various applications, from robotics to autonomous vehicles. A common predicament in location-based systems is the reliance on Global Positioning System (GPS) signals, which may exhibit diminished accuracy and reliability under certain conditions. Moreover, when integrated with the Inertial Navigation System (INS), the GPS/INS system could not provide a long-term solution for outage problems due to its accumulated errors.

View Article and Find Full Text PDF

Cerebral microbleeds (CMB) represent a feature of cerebral small vessel disease (cSVD), a prominent vascular contributor to age-related cognitive decline, dementia, and stroke. They are visible as spherical hypointense signals on T2*- or susceptibility-weighted magnetic resonance imaging (MRI) sequences. An increasing number of automated CMB detection methods being proposed are based on supervised deep learning (DL).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!