Searching by parts: Towards fine-grained image retrieval respecting species correlation.

Gene Expr Patterns

School of Computer Science and Technology, Harbin Institute of Technology, Harbin, 150001, China.

Published: March 2023

Most of the existing works on fine-grained image categorization and retrieval focus on finding similar images from the same species and often give little importance to inter-species similarities. However, these similarities may carry species correlations such as the same ancestors or similar habits, which are helpful in taxonomy and understanding biological traits. In this paper, we devise a new fine-grained retrieval task that searches for similar instances from different species based on body parts. To this end, we propose a two-step strategy. In the first step, we search for visually similar parts to a query image using a deep convolutional neural network (CNN). To improve the quality of the retrieved candidates, structural cues are introduced into the CNN using a novel part-pooling layer, in which the receptive field of each part is adjusted automatically. In the second step, we re-rank the retrieved candidates to improve the species diversity. We achieve this by formulating a novel ranking function that balances between the similarity of the candidates to the queried parts, while decreasing the similarity to the query species. We provide experiments on the benchmark CUB200 dataset and Columbia Dogs dataset, and demonstrate clear benefits of our schemes.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.gep.2023.119304DOI Listing

Publication Analysis

Top Keywords

fine-grained image
8
retrieved candidates
8
species
6
searching parts
4
parts fine-grained
4
image retrieval
4
retrieval respecting
4
respecting species
4
species correlation
4
correlation existing
4

Similar Publications

Chip defect detection is a crucial aspect of the semiconductor production industry, given its significant impact on chip performance. This paper proposes a lightweight neural network with dual decoding paths for LED chip segmentation, named LDDP-Net. Within the LDDP-Net framework, the receptive field of the MobileNetv3 backbone is modified to mitigate information loss.

View Article and Find Full Text PDF

Fine-Grained Building Classification in Rural Areas Based on GF-7 Data.

Sensors (Basel)

January 2025

Bureau of Emergency Management of Pingquan City, Pingquan 067500, China.

Building type information is widely used in various fields, such as disaster management, urbanization studies, and population modelling. Few studies have been conducted on fine-grained building classification in rural areas using China's Gaofen-7 (GF-7) high-resolution stereo mapping satellite data. In this study, we employed a two-stage method combining supervised classification and unsupervised clustering to classify buildings in the rural area of Pingquan, northern China, based on building footprints, building heights, and multispectral information extracted from GF-7 data.

View Article and Find Full Text PDF

Advancing neutron imaging techniques to highest resolution with fluorescent nuclear track detectors.

Sci Rep

January 2025

High Energy Nuclear Physics Laboratory, Cluster for Pioneering Research, RIKEN, Wako, Saitama, 351-0198, Japan.

Neutron imaging is a nondestructive and noninvasive inspection technique with a wide range of potential applications. However, the fundamentals of this technique still need to be improved, one of which involves achieving micrometer scale or even better resolution, which is a challenging task. Recently, a high-resolution neutron imaging device based on fine-grained nuclear emulsions was developed.

View Article and Find Full Text PDF

Enhanced Image Retrieval Using Multiscale Deep Feature Fusion in Supervised Hashing.

J Imaging

January 2025

RCAM Laboratory, Telecommunications Department, Sidi Bel Abbes University, Sidi Bel Abbes 22000, Algeria.

In recent years, deep-network-based hashing has gained prominence in image retrieval for its ability to generate compact and efficient binary representations. However, most existing methods predominantly focus on high-level semantic features extracted from the final layers of networks, often neglecting structural details that are crucial for capturing spatial relationships within images. Achieving a balance between preserving structural information and maximizing retrieval accuracy is the key to effective image hashing and retrieval.

View Article and Find Full Text PDF

A 3D decoupling Alzheimer's disease prediction network based on structural MRI.

Health Inf Sci Syst

December 2025

School of Mathematics and Computing, University of Southern Queensland, 487-535 West Street, Toowoomba, QLD 4350 Australia.

Purpose: This paper aims to develop a three-dimensional (3D) Alzheimer's disease (AD) prediction method, thereby bettering current predictive methods, which struggle to fully harness the potential of structural magnetic resonance imaging (sMRI) data.

Methods: Traditional convolutional neural networks encounter pressing difficulties in accurately focusing on the AD lesion structure. To address this issue, a 3D decoupling, self-attention network for AD prediction is proposed.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!