In this article, we provide an intuitive viewing to simplify the Siamese-based trackers by converting the tracking task to a classification. Under this viewing, we perform an in-depth analysis for them through visual simulations and real tracking examples, and find that the failure cases in some challenging situations can be regarded as the issue of missing decisive samples in offline training. Since the samples in the initial (first) frame contain rich sequence-specific information, we can regard them as the decisive samples to represent the whole sequence. To quickly adapt the base model to new scenes, a compact latent network is presented via fully using these decisive samples. Specifically, we present a statistics-based compact latent feature for fast adjustment by efficiently extracting the sequence-specific information. Furthermore, a new diverse sample mining strategy is designed for training to further improve the discrimination ability of the proposed compact latent network. Finally, a conditional updating strategy is proposed to efficiently update the basic models to handle scene variation during the tracking phase. To evaluate the generalization ability and effectiveness and of our method, we apply it to adjust three classical Siamese-based trackers, namely SiamRPN++, SiamFC, and SiamBAN. Extensive experimental results on six recent datasets demonstrate that all three adjusted trackers obtain the superior performance in terms of the accuracy, while having high running speed.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TPAMI.2022.3230064DOI Listing

Publication Analysis

Top Keywords

compact latent
16
latent network
12
decisive samples
12
siamese-based trackers
8
adaptive siamese
4
tracking
4
siamese tracking
4
compact
4
tracking compact
4
latent
4

Similar Publications

Analysis of multi-condition single-cell data with latent embedding multivariate regression.

Nat Genet

January 2025

Genome Biology Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany.

Identifying gene expression differences in heterogeneous tissues across conditions is a fundamental biological task, enabled by multi-condition single-cell RNA sequencing (RNA-seq). Current data analysis approaches divide the constituent cells into clusters meant to represent cell types, but such discrete categorization tends to be an unsatisfactory model of the underlying biology. Here, we introduce latent embedding multivariate regression (LEMUR), a model that operates without, or before, commitment to discrete categorization.

View Article and Find Full Text PDF

In plant breeding and genetics, predictive models traditionally rely on compact representations of high-dimensional data, often using methods like Principal Component Analysis (PCA) and, more recently, Autoencoders (AE). However, these methods do not separate genotype-specific and environment-specific features, limiting their ability to accurately predict traits influenced by both genetic and environmental factors. We hypothesize that disentangling these representations into genotype-specific and environment-specific components can enhance predictive models.

View Article and Find Full Text PDF

Latent membrane protein 1 (LMP1) plays a crucial role in Epstein-Barr virus (EBV)'s ability to establish latency and is involved in developing and progressing EBV-associated cancers. Additionally, EBV-infected cells affect the immune responses, making it challenging for the immune system to eliminate them. Due to the aforementioned reasons, it is crucial to understand the structural features of LMP1, which are essential for the development of novel cancer therapies that target its signaling pathways.

View Article and Find Full Text PDF

Group-equivariant neural networks have emerged as an efficient approach to model complex data, using generalized convolutions that respect the relevant symmetries of a system. These techniques have made advances in both the supervised learning tasks for classification and regression, and the unsupervised tasks to generate new data. However, little work has been done in leveraging the symmetry-aware expressive representations that could be extracted from these approaches.

View Article and Find Full Text PDF

Single-cell RNA sequencing (scRNA-seq) enables high-resolution transcriptional profiling of cell heterogeneity. However, analyzing this noisy, high-dimensional matrix remains challenging. We present scVAG, an integrated deep learning framework combining Variational-Autoencoder (VAE) and Graph Attention Autoencoder (GATE) for enhanced single-cell clustering.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!