Deriving from the gradient vector of a generative model of local features, Fisher vector coding (FVC) has been identified as an effective coding method for image classification. Most, if not all, FVC implementations employ the Gaussian mixture model (GMM) as the generative model for local features. However, the representative power of a GMM can be limited because it essentially assumes that local features can be characterized by a fixed number of feature prototypes, and the number of prototypes is usually small in FVC. To alleviate this limitation, in this work, we break the convention which assumes that a local feature is drawn from one of a few Gaussian distributions. Instead, we adopt a compositional mechanism which assumes that a local feature is drawn from a Gaussian distribution whose mean vector is composed as a linear combination of multiple key components, and the combination weight is a latent random variable. In doing so we greatly enhance the representative power of the generative model underlying FVC. To implement our idea, we design two particular generative models following this compositional approach. In our first model, the mean vector is sampled from the subspace spanned by a set of bases and the combination weight is drawn from a Laplace distribution. In our second model, we further assume that a local feature is composed of a discriminative part and a residual part. As a result, a local feature is generated by the linear combination of discriminative part bases and residual part bases. The decomposition of the discriminative and residual parts is achieved via the guidance of a pre-trained supervised coding method. By calculating the gradient vector of the proposed models, we derive two new Fisher vector coding strategies. The first is termed Sparse Coding-based Fisher Vector Coding (SCFVC) and can be used as the substitute of traditional GMM based FVC. The second is termed Hybrid Sparse Coding-based Fisher vector coding (HSCFVC) since it combines the merits of both pre-trained supervised coding methods and FVC. Using pre-trained Convolutional Neural Network (CNN) activations as local features, we experimentally demonstrate that the proposed methods are superior to traditional GMM based FVC and achieve state-of-the-art performance in various image classification tasks.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TPAMI.2017.2651061DOI Listing

Publication Analysis

Top Keywords

fisher vector
20
vector coding
20
local features
16
local feature
16
image classification
12
generative model
12
assumes local
12
vector
9
coding
8
gradient vector
8

Similar Publications

Detecting Multipartite Entanglement Patterns Using Single-Particle Green's Functions.

Phys Rev Lett

December 2024

Brookhaven National Laboratory, Condensed Matter Physics and Materials Science Division, Upton, New York 11973, USA.

We present a protocol for detecting multipartite entanglement in itinerant many-body electronic systems using single-particle Green's functions. To achieve this, we first establish a connection between the quantum Fisher information and single-particle Green's functions by constructing a set of witness operators built out of single electron creation and destruction operators in a doubled system. This set of witness operators is indexed by a momentum k.

View Article and Find Full Text PDF

: Alzheimer's disease is a progressive neurological condition marked by a decline in cognitive abilities. Early diagnosis is crucial but challenging due to overlapping symptoms among impairment stages, necessitating non-invasive, reliable diagnostic tools. : We applied information geometry and manifold learning to analyze grayscale MRI scans classified into No Impairment, Very Mild, Mild, and Moderate Impairment.

View Article and Find Full Text PDF

Context: Accurate prognosis prediction for cancer patients in palliative care is critical for clinical decision-making and personalized care. Traditional statistical models have been complemented by machine learning approaches; however, their comparative effectiveness remains underexplored.

Objectives: To assess the prognostic accuracy of statistical and machine learning models in predicting 30-day survival in patients with advanced cancer using objective data, such as the result of the blood test.

View Article and Find Full Text PDF

Background: Missed or delayed diagnosis of occult fractures of tibial plateau may cause adverse effects on patients. The objective of this study was to evaluate the diagnostic performance of texture analysis (TA) of knee joint radiographs combined with machine learning (ML) in identifying patients at risk of tibial plateau occult fractures.

Methods: A total of 169 patients with negative fracture on knee X-ray films from 2018 to 2022 who were diagnosed with occult tibial plateau fractures or no fractures by subsequent magnetic resonance imaging (MRI) examination were retrospectively enrolled.

View Article and Find Full Text PDF

Long-term risks of gene therapy are not fully understood. In this study, we evaluated safety outcomes in 783 patients over more than 2,200 total patient-years of observation from 38 T cell therapy trials. The trials employed integrating gammaretroviral or lentiviral vectors to deliver engineered receptors to target HIV-1 infection or cancer.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!