The problem of pedestrian detection is considered. The design of complexity-aware cascaded pedestrian detectors, combining features of very different complexities, is investigated. A new cascade design procedure is introduced, by formulating cascade learning as the Lagrangian optimization of a risk that accounts for both accuracy and complexity. A boosting algorithm, denoted as complexity aware cascade training (CompACT), is then derived to solve this optimization. CompACT cascades are shown to seek an optimal trade-off between accuracy and complexity by pushing features of higher complexity to the later cascade stages, where only a few difficult candidate patches remain to be classified. This enables the use of features of vastly different complexities in a single detector. In result, the feature pool can be expanded to features previously impractical for cascade design, such as the responses of a deep convolutional neural network (CNN). This is demonstrated through the design of pedestrian detectors with a pool of features whose complexities span orders of magnitude. The resulting cascade generalizes the combination of a CNN with an object proposal mechanism: rather than a pre-processing stage, CompACT cascades seamlessly integrate CNNs in their stages. This enables accurate detection at fairly fast speeds.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TPAMI.2019.2910514DOI Listing

Publication Analysis

Top Keywords

pedestrian detection
8
pedestrian detectors
8
features complexities
8
cascade design
8
accuracy complexity
8
compact cascades
8
cascade
6
features
5
learning complexity-aware
4
complexity-aware cascades
4

Similar Publications

Accurate multi-object tracking (MOT) is essential for autonomous vehicles, enabling them to perceive and interact with dynamic environments effectively. Single-modality 3D MOT algorithms often face limitations due to sensor constraints, resulting in unreliable tracking. Recent multi-modal approaches have improved performance but rely heavily on complex, deep-learning-based fusion techniques.

View Article and Find Full Text PDF

RC-DETR: Improving DETRs in crowded pedestrian detection via rank-based contrastive learning.

Neural Netw

February 2025

Chongqing Key Laboratory of Image Cognition, College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing, 400065, China. Electronic address:

The variants of DEtection TRansformer (DETRs) have achieved impressive performance in general object detection. However, they suffer notable performance degradation in scenarios involving crowded pedestrian detection. This decline primarily occurs during the training phase, where DETRs are constrained solely by pedestrian labels.

View Article and Find Full Text PDF

DV-DETR: Improved UAV Aerial Small Target Detection Algorithm Based on RT-DETR.

Sensors (Basel)

November 2024

School of Electronic and Electrical Engineering, Shanghai University of Engineering Science, Shanghai 201620, China.

For drone-based detection tasks, accurately identifying small-scale targets like people, bicycles, and pedestrians remains a key challenge. In this paper, we propose DV-DETR, an improved detection model based on the Real-Time Detection Transformer (RT-DETR), specifically optimized for small target detection in high-density scenes. To achieve this, we introduce three main enhancements: (1) ResNet18 as the backbone network to improve feature extraction and reduce model complexity; (2) the integration of recalibration attention units and deformable attention mechanisms in the neck network to enhance multi-scale feature fusion and improve localization accuracy; and (3) the use of the Focaler-IoU loss function to better handle the imbalanced distribution of target scales and focus on challenging samples.

View Article and Find Full Text PDF

An Enhanced Feature-Fusion Network for Small-Scale Pedestrian Detection on Edge Devices.

Sensors (Basel)

November 2024

Department of Medical Electronics, School of Biomedical Engineering, Air Force Medical University, Xi'an 710032, China.

Small-scale pedestrian detection is one of the challenges in general object detection. Factors such as complex backgrounds, long distances, and low-light conditions make the image features of small-scale pedestrians less distinct, further increasing the difficulty of detection. To address these challenges, an Enhanced Feature-Fusion YOLO network (EFF-YOLO) for small-scale pedestrian detection is proposed.

View Article and Find Full Text PDF

Evaluating the safety impact of mid-block pedestrian signals (MPS).

Accid Anal Prev

February 2025

Department of Civil, Environmental and Construction Engineering, University of Central Florida, Orlando, FL 32816, USA. Electronic address:

Article Synopsis
  • * Using advanced computer vision and a specific statistical model, the study categorized vehicle-pedestrian interactions into conflict levels and found that MPS significantly reduced moderate and serious conflicts.
  • * Factors like vehicle speed and land-use mix were found to increase the risk of serious conflicts, but the presence of MPS helped mitigate these risks, showing its potential to enhance safety at mid-block crossings.
View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!