One shot, generic object detection involves searching for a single query object in a larger target image. Relevant approaches have benefited from features that typically model the local similarity patterns. In this paper, we combine local similarity (encoded by local descriptors) with a global context (i.e., a graph structure) of pairwise affinities among the local descriptors, embedding the query descriptors into a low dimensional but discriminatory subspace. Unlike principal components that preserve global structure of feature space, we actually seek a linear approximation to the Laplacian eigenmap that permits us a locality preserving embedding of high dimensional region descriptors. Our second contribution is an accelerated but exact computation of matrix cosine similarity as the decision rule for detection, obviating the computationally expensive sliding window search. We leverage the power of Fourier transform combined with integral image to achieve superior runtime efficiency that allows us to test multiple hypotheses (for pose estimation) within a reasonably short time. Our approach to one shot detection is training-free, and experiments on the standard data sets confirm the efficacy of our model. Besides, low computation cost of the proposed (codebook-free) object detector facilitates rather straightforward query detection in large data sets including movie videos.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TPAMI.2015.2453950DOI Listing

Publication Analysis

Top Keywords

shot detection
8
matrix cosine
8
cosine similarity
8
local similarity
8
local descriptors
8
data sets
8
detection laplacian
4
object
4
laplacian object
4
object fast
4

Similar Publications

Tea bud detection technology is of great significance in realizing automated and intelligent plucking of tea buds. This study proposes a lightweight tea bud identification model based on modified Yolov5 to increase the picking accuracy and labor efficiency of intelligent tea bud picking while lowering the deployment pressure of mobile terminals. The following methods are used to make improvements: the backbone network CSPDarknet-53 of YOLOv5 is replaced with the EfficientNetV2 feature extraction network to reduce the number of parameters and floating-point operations of the model; the neck network of YOLOv5, the Ghost module is introduced to construct the ghost convolution and C3ghost module to further reduce the number of parameters and floating-point operations of the model; replacing the upsampling module of the neck network with the CARAFE upsampling module can aggregate the contextual tea bud feature information within a larger sensory field and improve the mean average precision of the model in detecting tea buds.

View Article and Find Full Text PDF

Raccoons (Procyon lotor) originated in North America and have been introduced to Europe. Due to their close contact with human settlements, they are important reservoirs for zoonotic pathogens, such as Baylisascaris procyonis. The relevance and prevalence of vector-borne pathogens have not yet been fully elucidated.

View Article and Find Full Text PDF

Digital PCR (dPCR) has transformed nucleic acid diagnostics by enabling the absolute quantification of rare mutations and target sequences. However, traditional dPCR detection methods, such as those involving flow cytometry and fluorescence imaging, may face challenges due to high costs, complexity, limited accuracy, and slow processing speeds. In this study, SAM-dPCR is introduced, a training-free open-source bioanalysis paradigm that offers swift and precise absolute quantification of biological samples.

View Article and Find Full Text PDF

A zero-shot attribute-embedded model with a feature difference mapping sigmoid function for compound fault diagnosis of rotating machinery.

ISA Trans

December 2024

State Key Laboratory of Mechanical Transmission for Advanced Equipment, Chongqing University, Chongqing 400044, PR China. Electronic address:

Article Synopsis
  • Current methods for detecting machinery compound faults struggle due to the lack of available training data, as collecting sufficient compound fault samples is often impractical in engineering.
  • The paper introduces a zero-shot attribute-embedded model (ZSAECFD), which allows for diagnosing unseen compound faults using only single fault data by constructing attribute prototypes and utilizing a new activation function, F-sigmoid.
  • The model demonstrates high diagnostic accuracy—81.82% for bearing faults and 88.17% for gear faults—showing its effectiveness compared to traditional methods, even without training on compound fault data.
View Article and Find Full Text PDF

Identifying potential drug-drug interactions (DDIs) before clinical use is essential for patient safety yet remains a significant challenge in drug development. We presented DDI-GPT, a deep learning framework that predicts DDIs by combining knowledge graphs (KGs) and pre-trained large language models (LLMs), enabling early detection of potential drug interactions. We demonstrated that DDI-GPT outperforms current state-of-the-art methods by capturing contextual dependencies between biomedical entities to infer potential DDIs.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!