Person search aims to localize a person of interest in a large image gallery captured by multiple, non-overlapping cameras. Prevalent unified methods have suffered from (1) noisy proposals with mis-detection and occlusion, and (2) large appearance variation within a class, which deteriorates the prototype-based metric learning. To address these problems, we introduce a Prototype-guided Attention Distillation, shortly PAD, which exploits a prototype (a typical representation of an identity) as a guidance to the attention module to consistently highlight identity-inherent regions across different poses. To utilize the knowledge encoded in prototypes for matching unseen IDs, PAD conducts attention distillation to guide student Re-ID queries by deeply mimicking attention maps from the prototype query. Additionally, to address large intra-class variation induced by pose or camera views, we extend PAD with multiple part prototypes representing consistent local regions across different instances. Furthermore, we exploit an adaptive momentum strategy for robust attention distillation in PAD to update more distinct prototypes. Extensive experiments conducted on CUHK-SYSU and PRW demonstrate the effectiveness of PAD, showcasing state-of-the-art performance. Moreover, our distilled attention surprisingly highlights distinguished multiple regions for person search.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TPAMI.2024.3461778DOI Listing

Publication Analysis

Top Keywords

attention distillation
16
person search
12
prototype-guided attention
8
attention
6
pad
5
distillation
4
distillation discriminative
4
person
4
discriminative person
4
search person
4

Similar Publications

Endogenous peptides in Baijiu have primarily focused on finished liquor research, with limited attention given to the peptides in base liquor prior to blending. Liquid chromatography-tandem mass spectrometry (LC-MS) was employed to identify endogenous peptides in the distillates from the first to seventh rounds of soy sauce-flavored Baijiu. Two hundred and five oligopeptides were identified from these distillates, all of which had molecular weights below 1000 Da and were composed of amino acid residues associated with flavor (sweet, sour, and bitter) and biological activity.

View Article and Find Full Text PDF

Purpose: The purpose of this study was to develop a deep learning approach that restores artifact-laden optical coherence tomography (OCT) scans and predicts functional loss on the 24-2 Humphrey Visual Field (HVF) test.

Methods: This cross-sectional, retrospective study used 1674 visual field (VF)-OCT pairs from 951 eyes for training and 429 pairs from 345 eyes for testing. Peripapillary retinal nerve fiber layer (RNFL) thickness map artifacts were corrected using a generative diffusion model.

View Article and Find Full Text PDF

Masked autoencoder of multi-scale convolution strategy combined with knowledge distillation for facial beauty prediction.

Sci Rep

January 2025

School of Electronics and Information Engineering, Wuyi University, Jiangmen, 529020, Guangdong, China.

Facial beauty prediction (FBP) is a leading area of research in artificial intelligence. Currently, there is a small amount of labeled data and a large amount of unlabeled data in the FBP database. The features extracted by the model based on supervised training are limited, resulting in low prediction accuracy.

View Article and Find Full Text PDF

Effects of chitosan-gentianic acid derivatives on myofibrillar proteins in sea bass (Lateolabrax maculatus) during refrigerated storage.

Int J Biol Macromol

January 2025

College of Food Science and Technology, Shanghai Ocean University, Shanghai 201306, China; Shanghai Aquatic Products Processing and Storage Engineering Technology Research Center, Shanghai 201306, China; National Experimental Teaching Demonstration Center for Food Science and Engineering (Shanghai Ocean University), Shanghai 201306, China. Electronic address:

Phenolic acid-chitosan derivatives have received extensive attention due to their greatly enhanced mechanical, antibacterial and antioxidant properties, especially in food preservation. The chitosan-gentianic acid (CS-g-GA) was prepared and its impact on myofibrillar proteins (MPs) in sea bass (Lateolabrax maculatus) during refrigerated storage was investigated in this study. Fish fillets were immersed in distilled water, CS, GA and CS-g-GA solutions, respectively, followed by an 18-day refrigerated storage.

View Article and Find Full Text PDF

With the rapid development of artificial intelligence technology, an increasing number of village-related modeling problems have been addressed. However, first, the exploration of village-related watershed fine-grained classification problems, particularly the multi-view watershed fine-grained classification problem, has been hindered by dataset collection limitations; Second, village-related modeling networks typically employ convolutional modules for attentional modeling to extract salient features, yet they lack global attentional feature modeling capabilities; Lastly, the extensive number of parameters and significant computational demands render village-related watershed fine-grained classification networks infeasible for end-device deployment. To tackle these challenges, we introduce a multi-view attention mechanism designed for precise watershed classification, leveraging knowledge distillation techniques, abbreviated as MANet-KD.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!