Video-based person re-identification (re-id) matches two tracks of persons from different cameras. Features are extracted from the images of a sequence and then aggregated as a track feature. Compared to existing works that aggregate frame features by simply averaging them or using temporal models such as recurrent neural networks, we propose an intelligent feature aggregate method based on reinforcement learning. Specifically, we train an agent to determine which frames in the sequence should be abandoned in the aggregation, which can be treated as a decision making process. By this way, the proposed method avoids introducing noisy information of the sequence and retains these valuable frames when generating a track feature. On benchmark data sets, experimental results show that our method can boost the re-id accuracy obviously based on the state-of-the-art models.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TNNLS.2019.2899588DOI Listing

Publication Analysis

Top Keywords

reinforcement learning
8
video-based person
8
person re-identification
8
track feature
8
feature
4
feature aggregation
4
aggregation reinforcement
4
learning video-based
4
re-identification video-based
4
re-identification re-id
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!