Human-object interaction (HOI) detection involves identifying interactions represented as [Formula: see text] , requiring the localization of human-object pairs and interaction classification within an image. This work focuses on the challenge of detecting HOIs with unseen objects using the prevalent Transformer architecture. Our empirical analysis reveals that the performance degradation of novel HOI instances primarily arises from misclassifying unseen objects as confusable seen objects. To address this issue, we propose a similarity propagation (SP) scheme that leverages cosine similarity distance to regulate the prediction margin between seen and unseen objects. In addition, we introduce pseudo-supervision for unseen objects based on class semantic similarities during training. Furthermore, we incorporate semantic-aware instance-level and interaction-level contrastive losses with Transformer to enhance intraclass compactness and interclass separability, resulting in improved visual representations. Extensive experiments on two challenging benchmarks, V-COCO and HICO-DET, demonstrate the effectiveness of our model, outperforming current state-of-the-art methods under various zero-shot settings.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TNNLS.2023.3309104 | DOI Listing |
Pathogens
November 2024
College of Information Engineering, Yangzhou University, Yangzhou 225009, China.
This paper presents a novel methodology for plant disease detection using YOLOv8 (You Only Look Once version 8), a state-of-the-art object detection model designed for real-time image classification and recognition tasks. The proposed approach involves training a custom YOLOv8 model to detect and classify various plant conditions accurately. The model was evaluated using a testing subset to measure its performance in detecting different plant diseases.
View Article and Find Full Text PDFInt Dent J
December 2024
Department of Prosthodontics and Dental Implantology, College of Dentistry, King Faisal University, Al-Ahsa, KSA. Electronic address:
Objectives: Dental health is integral to overall well-being, with early detection of issues critical for prevention. This research work focuses on utilizing artificial intelligence and deep learning-based object detection techniques for automated detection of common dental issues in orthopantomography x-ray images, including broken roots, periodontally compromised teeth, and the Kennedy classification of partially edentulous arches.
Methods: An orthopantomography dataset has been used to train several models employing various object detection architectures, hyperparameters, and training techniques.
Plant Phenomics
November 2024
Department of Computer Science, University of Saskatchewan, Saskatoon, Canada.
Plant population counts are highly valued by crop producers as important early-season indicators of field health. Traditionally, emergence rate estimates have been acquired through manual counting, an approach that is labor-intensive and relies heavily on sampling techniques. By applying deep learning-based object detection models to aerial field imagery, accurate plant population counts can be obtained for much larger areas of a field.
View Article and Find Full Text PDFSensors (Basel)
October 2024
Division of Electronics and Information Engineering, Jeonbuk National University, Jeonju 54896, Republic of Korea.
Neural radiance fields (NeRF) have become an effective method for encoding scenes into neural representations, allowing for the synthesis of photorealistic views of unseen views from given input images. However, the applicability of traditional NeRF is significantly limited by its assumption that images are captured for object-centric scenes with a pinhole camera. Expanding these boundaries, we focus on driving scenarios using a fisheye camera, which offers the advantage of capturing visual information from a wide field of view.
View Article and Find Full Text PDFIEEE Trans Image Process
November 2024
Remote sensing anomaly detector can find the objects deviating from the background as potential targets for Earth monitoring. Given the diversity in earth anomaly types, designing a transferring model with cross-modality detection ability should be cost-effective and flexible to new earth observation sources and anomaly types. However, the current anomaly detectors aim to learn the certain background distribution, the trained model cannot be transferred to unseen images.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!