Unsupervised Domain Adaptation for Object Detection (UDA-OD) aims to adapt a model trained on a labeled source domain to an unlabeled target domain, addressing challenges posed by domain shifts. However, existing methods often face significant challenges, particularly in detecting small objects and over-relying on classification confidence for pseudo-label selection, which often leads to inaccurate bounding box localization. To address these issues, we propose a novel UDA-OD framework that leverages scale consistency (SC) and Temporal Ensemble Pseudo-Label Selection (TEPLS) to enhance cross-domain robustness and detection performance. Specifically, we introduce Cross-Scale Prediction Consistency (CSPC) to enforce consistent detection across multiple resolutions, improving detection robustness for objects of varying scales. Additionally, we integrate Intra-Class Feature Consistency (ICFC), which employs contrastive learning to align feature representations within each class, further enhancing adaptation. To ensure high-quality pseudo-labels, TEPLS combines temporal localization stability with classification confidence, mitigating the impact of noisy predictions and improving both classification and localization accuracy. Extensive experiments on challenging benchmarks, including Cityscapes to Foggy Cityscapes, Sim10k to Cityscapes, and Virtual Mine to Actual Mine, demonstrate that our method achieves state-of-the-art performance, with notable improvements in small object detection and overall cross-domain robustness. These results highlight the effectiveness of our framework in addressing key limitations of existing UDA-OD approaches.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.3390/s25010230 | DOI Listing |
Sensors (Basel)
January 2025
Instituto de Telecomunicações (IT), Instituto Superior Técnico, Universidade de Lisboa, 1049-001 Lisbon, Portugal.
Shrimp farming is a growing industry, and automating certain processes within aquaculture tanks is becoming increasingly important to improve efficiency. This paper proposes an image-based system designed to address four key tasks in an aquaculture tank with : estimating shrimp length and weight, counting shrimps, and evaluating feed pellet food attractiveness. A setup was designed, including a camera connected to a Raspberry Pi computer, to capture high-quality images around a feeding plate during feeding moments.
View Article and Find Full Text PDFSensors (Basel)
January 2025
School of Mechanical and Electrical Engineering, China University of Mining and Technology (Beijing), Beijing 100083, China.
Unsupervised Domain Adaptation for Object Detection (UDA-OD) aims to adapt a model trained on a labeled source domain to an unlabeled target domain, addressing challenges posed by domain shifts. However, existing methods often face significant challenges, particularly in detecting small objects and over-relying on classification confidence for pseudo-label selection, which often leads to inaccurate bounding box localization. To address these issues, we propose a novel UDA-OD framework that leverages scale consistency (SC) and Temporal Ensemble Pseudo-Label Selection (TEPLS) to enhance cross-domain robustness and detection performance.
View Article and Find Full Text PDFSensors (Basel)
January 2025
Industrial Systems Institute, Athena Research and Innovation Center, 26504 Patras, Greece.
Object detection is a pivotal research domain within computer vision, with applications spanning from autonomous vehicles to medical diagnostics. This comprehensive survey presents an in-depth analysis of the evolution and significant advancements in object detection, emphasizing the critical role of machine learning (ML) and deep learning (DL) techniques. We explore a wide spectrum of methodologies, ranging from traditional approaches to the latest DL models, thoroughly evaluating their performance, strengths, and limitations.
View Article and Find Full Text PDFSensors (Basel)
January 2025
Cognitive Systems Lab, University of Bremen, 28359 Bremen, Germany.
Over recent years, automated Human Activity Recognition (HAR) has been an area of concern for many researchers due to its widespread application in surveillance systems, healthcare environments, and many more. This has led researchers to develop coherent and robust systems that efficiently perform HAR. Although there have been many efficient systems developed to date, still, there are many issues to be addressed.
View Article and Find Full Text PDFSensors (Basel)
December 2024
Department of Computer Science & Artificial Intelligence, Jeonbuk National University, Jeonju-si 54896, Republic of Korea.
Recently, computer vision methods have been widely applied to agricultural tasks, such as robotic harvesting. In particular, fruit harvesting robots often rely on object detection or segmentation to identify and localize target fruits. During the model selection process for object detection, the average precision (AP) score typically provides the de facto standard.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!