One shot, generic object detection involves searching for a single query object in a larger target image. Relevant approaches have benefited from features that typically model the local similarity patterns. In this paper, we combine local similarity (encoded by local descriptors) with a global context (i.e., a graph structure) of pairwise affinities among the local descriptors, embedding the query descriptors into a low dimensional but discriminatory subspace. Unlike principal components that preserve global structure of feature space, we actually seek a linear approximation to the Laplacian eigenmap that permits us a locality preserving embedding of high dimensional region descriptors. Our second contribution is an accelerated but exact computation of matrix cosine similarity as the decision rule for detection, obviating the computationally expensive sliding window search. We leverage the power of Fourier transform combined with integral image to achieve superior runtime efficiency that allows us to test multiple hypotheses (for pose estimation) within a reasonably short time. Our approach to one shot detection is training-free, and experiments on the standard data sets confirm the efficacy of our model. Besides, low computation cost of the proposed (codebook-free) object detector facilitates rather straightforward query detection in large data sets including movie videos.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TPAMI.2015.2453950 | DOI Listing |
Sci Rep
December 2024
College of Horticulture and Plant Protection, Henan University of Science and Technology, Luoyang, 47100, China.
Tea bud detection technology is of great significance in realizing automated and intelligent plucking of tea buds. This study proposes a lightweight tea bud identification model based on modified Yolov5 to increase the picking accuracy and labor efficiency of intelligent tea bud picking while lowering the deployment pressure of mobile terminals. The following methods are used to make improvements: the backbone network CSPDarknet-53 of YOLOv5 is replaced with the EfficientNetV2 feature extraction network to reduce the number of parameters and floating-point operations of the model; the neck network of YOLOv5, the Ghost module is introduced to construct the ghost convolution and C3ghost module to further reduce the number of parameters and floating-point operations of the model; replacing the upsampling module of the neck network with the CARAFE upsampling module can aggregate the contextual tea bud feature information within a larger sensory field and improve the mean average precision of the model in detecting tea buds.
View Article and Find Full Text PDFSci Rep
December 2024
Department of Zoology, Musée National d'Histoire Naturelle, Luxembourg, Luxembourg.
Raccoons (Procyon lotor) originated in North America and have been introduced to Europe. Due to their close contact with human settlements, they are important reservoirs for zoonotic pathogens, such as Baylisascaris procyonis. The relevance and prevalence of vector-borne pathogens have not yet been fully elucidated.
View Article and Find Full Text PDFAdv Sci (Weinh)
December 2024
Department of Biomedical Engineering, The Chinese University of Hong Kong, Shatin, Hong Kong SAR, China.
Digital PCR (dPCR) has transformed nucleic acid diagnostics by enabling the absolute quantification of rare mutations and target sequences. However, traditional dPCR detection methods, such as those involving flow cytometry and fluorescence imaging, may face challenges due to high costs, complexity, limited accuracy, and slow processing speeds. In this study, SAM-dPCR is introduced, a training-free open-source bioanalysis paradigm that offers swift and precise absolute quantification of biological samples.
View Article and Find Full Text PDFISA Trans
December 2024
State Key Laboratory of Mechanical Transmission for Advanced Equipment, Chongqing University, Chongqing 400044, PR China. Electronic address:
Identifying potential drug-drug interactions (DDIs) before clinical use is essential for patient safety yet remains a significant challenge in drug development. We presented DDI-GPT, a deep learning framework that predicts DDIs by combining knowledge graphs (KGs) and pre-trained large language models (LLMs), enabling early detection of potential drug interactions. We demonstrated that DDI-GPT outperforms current state-of-the-art methods by capturing contextual dependencies between biomedical entities to infer potential DDIs.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!