Accurate object detection requires correct classification and high-quality localization. Currently, most of the single shot detectors (SSDs) conduct simultaneous classification and regression using a fully convolutional network. Despite high efficiency, this structure has some inappropriate designs for accurate object detection. The first one is the mismatch of bounding box classification, where the classification results of the default bounding boxes are improperly treated as the results of the regressed bounding boxes during the inference. The second one is that only one-time regression is not good enough for high-quality object localization. To solve the problem of classification mismatch, we propose a novel reg-offset-cls (ROC) module including three hierarchical steps: the regression of the default bounding box, the prediction of new feature sampling locations, and the classification of the regressed bounding box with more accurate features. For high-quality localization, we stack two ROC modules together. The input of the second ROC module is the output of the first ROC module. In addition, we inject a feature enhanced (FE) module between two stacked ROC modules to extract more contextual information. The experiments on three different datasets (i.e., MS COCO, PASCAL VOC, and UAVDT) are performed to demonstrate the effectiveness and superiority of our method. Without any bells or whistles, our proposed method outperforms state-of-the-art one-stage methods at a real-time speed. The source code is available at https://github.com/JialeCao001/HSD.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TNNLS.2021.3106641 | DOI Listing |
Sensors (Basel)
December 2024
Department of Automation, North China Electric Power University, Baoding 071003, China.
To address the difficulty in detecting workers' violation behaviors in electric power construction scenarios, this paper proposes an innovative method that integrates knowledge reasoning and progressive multi-level distillation techniques. First, standards, norms, and guidelines in the field of electric power construction are collected to build a comprehensive knowledge graph, aiming to provide accurate knowledge representation and normative analysis. Then, the knowledge graph is combined with the object-detection model in the form of triplets, where detected objects and their interactions are represented as subject-predicate-object relationship.
View Article and Find Full Text PDFSensors (Basel)
December 2024
Faculty of Engineering and IT, University of Technology Sydney, Sydney 2052, Australia.
To achieve high-precision 3D reconstruction, a comprehensive improvement has been made to the binocular structured light calibration method. During the calibration process, the calibration object's imaging quality and the camera parameters' nonlinear optimization effect directly affect the caibration accuracy. Firstly, to address the issue of poor imaging quality of the calibration object under tilted conditions, a pixel-level adaptive fill light method was designed using the programmable light intensity feature of the structured light projector, allowing the calibration object to receive uniform lighting and thus improve the quality of the captured images.
View Article and Find Full Text PDFSensors (Basel)
December 2024
School of Mechanical and Electrical Engineering, Changchun University of Science and Technology, Changchun 130022, China.
In industrial applications, robotic arm grasp detection tasks frequently suffer from inadequate accuracy and success rates, which result in reduced operational efficiency. Although existing methods have achieved some success, limitations remain in terms of detection accuracy, real-time performance, and generalization ability. To address these challenges, this paper proposes an enhanced grasp detection model, G-RCenterNet, based on the CenterNet framework.
View Article and Find Full Text PDFSensors (Basel)
December 2024
School of Computer and Information Engineering, Xinjiang Agricultural University, Urumqi 830052, China.
Reducing damage and missed harvest rates is essential for improving efficiency in unmanned cabbage harvesting. Accurate real-time segmentation of cabbage heads can significantly alleviate these issues and enhance overall harvesting performance. However, the complexity of the growing environment and the morphological variability of field-grown cabbage present major challenges to achieving precise segmentation.
View Article and Find Full Text PDFSensors (Basel)
December 2024
Electrical, Computer, and Biomedical Engineering, Toronto Metropolitan University, Toronto, ON M5B 2K3, Canada.
Autonomous technologies have revolutionized transportation, military operations, and space exploration, necessitating precise localization in environments where traditional GPS-based systems are unreliable or unavailable. While widespread for outdoor localization, GPS systems face limitations in obstructed environments such as dense urban areas, forests, and indoor spaces. Moreover, GPS reliance introduces vulnerabilities to signal disruptions, which can lead to significant operational failures.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!