A tree-based context model for object recognition.

IEEE Trans Pattern Anal Mach Intell

Two Sigma Investments, 379 West Broadway, New York, NY 10012, USA.

Published: February 2012

There has been a growing interest in exploiting contextual information in addition to local features to detect and localize multiple object categories in an image. A context model can rule out some unlikely combinations or locations of objects and guide detectors to produce a semantically coherent interpretation of a scene. However, the performance benefit of context models has been limited because most of the previous methods were tested on data sets with only a few object categories, in which most images contain one or two object categories. In this paper, we introduce a new data set with images that contain many instances of different object categories, and propose an efficient model that captures the contextual information among more than a hundred object categories using a tree structure. Our model incorporates global image features, dependencies between object categories, and outputs of local detectors into one probabilistic framework. We demonstrate that our context model improves object recognition performance and provides a coherent interpretation of a scene, which enables a reliable image querying system by multiple object categories. In addition, our model can be applied to scene understanding tasks that local detectors alone cannot solve, such as detecting objects out of context or querying for the most typical and the least typical scenes in a data set.
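As a rough illustration of how a tree structure can encode dependencies between object categories, the sketch below scores a scene by the joint probability of its object-presence labels. It is a minimal sketch, not the model from the paper: the category names, the tree (`PARENT`), the probability values, and the function `log_prob` are all hypothetical and chosen only for illustration, and it leaves out the global image features and local detector outputs that the abstract mentions.

```python
import math

# Hypothetical tree over object categories: child -> parent (root has None).
PARENT = {"sky": None, "road": "sky", "car": "road", "sofa": "sky"}

# Made-up probabilities, for illustration only.
P_ROOT = 0.6  # P(root category is present)
# For each child: (P(present | parent absent), P(present | parent present))
P_CHILD = {
    "road": (0.1, 0.5),
    "car": (0.05, 0.7),
    "sofa": (0.4, 0.02),
}


def log_prob(presence):
    """Log joint probability of a full presence assignment under the tree."""
    total = 0.0
    for cat, parent in PARENT.items():
        if parent is None:
            p_present = P_ROOT
        else:
            p_present = P_CHILD[cat][int(presence[parent])]
        total += math.log(p_present if presence[cat] else 1.0 - p_present)
    return total


# Two hypothetical scenes: a plain street, and a street with a sofa in it.
street = {"sky": True, "road": True, "car": True, "sofa": False}
odd = {"sky": True, "road": True, "car": True, "sofa": True}

# Rank scenes from most typical to least typical under this toy model.
ranked = sorted([("street", street), ("odd", odd)],
                key=lambda item: log_prob(item[1]), reverse=True)
```

Under these made-up numbers the scene containing a sofa next to sky scores lower, which is the kind of signal that could flag an object out of context or rank scenes by typicality. Because each category depends on a single parent, scoring a scene takes time linear in the number of categories.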

Source
http://dx.doi.org/10.1109/TPAMI.2011.119

