Zero-Shot Human-Object Interaction Detection via Similarity Propagation.

IEEE Trans Neural Netw Learn Syst

Published: September 2023

Human-object interaction (HOI) detection involves identifying interactions represented as [Formula: see text] , requiring the localization of human-object pairs and interaction classification within an image. This work focuses on the challenge of detecting HOIs with unseen objects using the prevalent Transformer architecture. Our empirical analysis reveals that the performance degradation of novel HOI instances primarily arises from misclassifying unseen objects as confusable seen objects. To address this issue, we propose a similarity propagation (SP) scheme that leverages cosine similarity distance to regulate the prediction margin between seen and unseen objects. In addition, we introduce pseudo-supervision for unseen objects based on class semantic similarities during training. Furthermore, we incorporate semantic-aware instance-level and interaction-level contrastive losses with Transformer to enhance intraclass compactness and interclass separability, resulting in improved visual representations. Extensive experiments on two challenging benchmarks, V-COCO and HICO-DET, demonstrate the effectiveness of our model, outperforming current state-of-the-art methods under various zero-shot settings.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TNNLS.2023.3309104DOI Listing

Publication Analysis

Top Keywords

unseen objects
16
human-object interaction
8
similarity propagation
8
objects
5
zero-shot human-object
4
interaction detection
4
detection similarity
4
propagation human-object
4
interaction hoi
4
hoi detection
4

Similar Publications

This paper presents a novel methodology for plant disease detection using YOLOv8 (You Only Look Once version 8), a state-of-the-art object detection model designed for real-time image classification and recognition tasks. The proposed approach involves training a custom YOLOv8 model to detect and classify various plant conditions accurately. The model was evaluated using a testing subset to measure its performance in detecting different plant diseases.

View Article and Find Full Text PDF

Objectives: Dental health is integral to overall well-being, with early detection of issues critical for prevention. This research work focuses on utilizing artificial intelligence and deep learning-based object detection techniques for automated detection of common dental issues in orthopantomography x-ray images, including broken roots, periodontally compromised teeth, and the Kennedy classification of partially edentulous arches.

Methods: An orthopantomography dataset has been used to train several models employing various object detection architectures, hyperparameters, and training techniques.

View Article and Find Full Text PDF

Plant population counts are highly valued by crop producers as important early-season indicators of field health. Traditionally, emergence rate estimates have been acquired through manual counting, an approach that is labor-intensive and relies heavily on sampling techniques. By applying deep learning-based object detection models to aerial field imagery, accurate plant population counts can be obtained for much larger areas of a field.

View Article and Find Full Text PDF

Neural Radiance Fields for Fisheye Driving Scenes Using Edge-Aware Integrated Depth Supervision.

Sensors (Basel)

October 2024

Division of Electronics and Information Engineering, Jeonbuk National University, Jeonju 54896, Republic of Korea.

Neural radiance fields (NeRF) have become an effective method for encoding scenes into neural representations, allowing for the synthesis of photorealistic views of unseen views from given input images. However, the applicability of traditional NeRF is significantly limited by its assumption that images are captured for object-centric scenes with a pinhole camera. Expanding these boundaries, we focus on driving scenarios using a fisheye camera, which offers the advantage of capturing visual information from a wide field of view.

View Article and Find Full Text PDF

Remote sensing anomaly detector can find the objects deviating from the background as potential targets for Earth monitoring. Given the diversity in earth anomaly types, designing a transferring model with cross-modality detection ability should be cost-effective and flexible to new earth observation sources and anomaly types. However, the current anomaly detectors aim to learn the certain background distribution, the trained model cannot be transferred to unseen images.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!