AI Article Synopsis

  • Few-shot object detection (FSOD) focuses on developing a detector that can quickly adapt to new tasks with limited data, but existing methods often struggle with efficiency, particularly in terms of speed and computational demands.
  • The authors introduce a new efficient pretrain-transfer framework (PTF) that maintains performance similar to state-of-the-art methods while not increasing computational costs, and they enhance the framework with a knowledge inheritance (KI) initializer to speed up adaptation.
  • Their approach demonstrates significant improvements in adaptation speed (1.8-100× faster) on public benchmarks like PASCAL VOC, COCO, and LVIS, marking a novel effort to address efficiency in FSOD.

Article Abstract

Few-shot object detection (FSOD), which aims at learning a generic detector that can adapt to unseen tasks with scarce training samples, has witnessed consistent improvement recently. However, most existing methods ignore the efficiency issues, e.g., high computational complexity and slow adaptation speed. Notably, efficiency has become an increasingly important evaluation metric for few-shot techniques due to an emerging trend toward embedded AI. To this end, we present an efficient pretrain-transfer framework (PTF) baseline with no computational increment, which achieves comparable results with previous state-of-the-art (SOTA) methods. Upon this baseline, we devise an initializer named knowledge inheritance (KI) to reliably initialize the novel weights for the box classifier, which effectively facilitates the knowledge transfer process and boosts the adaptation speed. Within the KI initializer, we propose an adaptive length re-scaling (ALR) strategy to alleviate the vector length inconsistency between the predicted novel weights and the pretrained base weights. Finally, our approach not only achieves the SOTA results across three public benchmarks, i.e., PASCAL VOC, COCO and LVIS, but also exhibits high efficiency with 1.8-100× faster adaptation speed against the other methods on COCO/LVIS benchmark during few-shot transfer. To our best knowledge, this is the first work to consider the efficiency problem in FSOD. We hope to motivate a trend toward powerful yet efficient few-shot technique development. The codes are publicly available at https://github.com/Ze-Yang/Efficient-FSOD.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TIP.2022.3228162DOI Listing

Publication Analysis

Top Keywords

adaptation speed
12
efficient few-shot
8
few-shot object
8
object detection
8
knowledge inheritance
8
novel weights
8
knowledge
4
detection knowledge
4
few-shot
4
inheritance few-shot
4

Similar Publications

The traditional phase shift measurement technique necessitates two orthogonally oriented fringe patterns to complete the phase measurement, which is time-consuming, and the phase modulation of the traditional fringe image exhibits only a gradient change in a single direction of the horizontal-vertical fringes, or a smooth gradient change in the tangential direction of the circular fringes. To enhance the measurement speed and improve the adaptability to large curvature measured specular surfaces, this paper proposes a phase measurement deflectometry (PMD) technique based on composite circular fringes. The composite circular fringes demonstrate a steeper slope in the phase change, enabling the acquisition of finer surface features under identical measurement conditions, effectively improving the detection sensitivity to small shape changes and enhancing the ability to discern fine details.

View Article and Find Full Text PDF

A high-speed entanglement assisted communication that operates at 10 Gb/s is proposed, which performs a highly efficient, PPLN-waveguide-based, entanglement generation by making the simultaneous use of S- and L-band pumps. The two-pump-based entanglement generation source satisfies the quasi-phase-matching-condition over the entire C-band. To improve the system reliability, our system performs the phase-conjugation on idler photons in contrast to conventional ways of performing the phase-conjugation on signal photons.

View Article and Find Full Text PDF

In this paper, we explore the impact of exposure time on optical-phase measurements collected on light that has propagated through atmospheric-optical turbulence. We model the exposure time by phase averaging over a convective distance, and we quantify the associated impact of imposing an exposure time using the piston- and tilt-removed phase variance. We accomplish this analysis through the development of an analytic solution and wave-optics simulations.

View Article and Find Full Text PDF

Tunas are high-performance pelagic fishes of considerable economic importance and have a suite of biological adaptations for high-speed locomotion. In contrast to our understanding of tuna body and muscle function, mechanosensory systems of tuna are poorly understood. Here we present the discovery of a remarkable sensory lateral line canal within the bilateral tuna keels with tubules that extend to the upper and lower keel surfaces.

View Article and Find Full Text PDF

Introduction: Weeds are a major factor affecting crop yield and quality. Accurate identification and localization of crops and weeds are essential for achieving automated weed management in precision agriculture, especially given the challenges in recognition accuracy and real-time processing in complex field environments. To address this issue, this paper proposes an efficient crop-weed segmentation model based on an improved UNet architecture and attention mechanisms to enhance both recognition accuracy and processing speed.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!