Constrained Structure Learning for Scene Graph Generation.

IEEE Trans Pattern Anal Mach Intell

Published: October 2023

AI Article Synopsis

Article Abstract

As a structured prediction task, scene graph generation aims to build a visually-grounded scene graph to explicitly model objects and their relationships in an input image. Currently, the mean field variational Bayesian framework is the de facto methodology used by the existing methods, in which the unconstrained inference step is often implemented by a message passing neural network. However, such formulation fails to explore other inference strategies, and largely ignores the more general constrained optimization models. In this paper, we present a constrained structure learning method, for which an explicit constrained variational inference objective is proposed. Instead of applying the ubiquitous message-passing strategy, a generic constrained optimization method - entropic mirror descent - is utilized to solve the constrained variational inference step. We validate the proposed generic model on various popular scene graph generation benchmarks and show that it outperforms the state-of-the-art methods.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TPAMI.2023.3282889DOI Listing

Publication Analysis

Top Keywords

scene graph
16
graph generation
12
constrained structure
8
structure learning
8
inference step
8
constrained optimization
8
constrained variational
8
variational inference
8
constrained
6
scene
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!