Learning deep hierarchical visual feature coding.

IEEE Trans Neural Netw Learn Syst

Published: December 2014

In this paper, we propose a hybrid architecture that combines the image modeling strengths of the bag of words framework with the representational power and adaptability of learning deep architectures. Local gradient-based descriptors, such as SIFT, are encoded via a hierarchical coding scheme composed of spatial aggregating restricted Boltzmann machines (RBM). For each coding layer, we regularize the RBM by encouraging representations to fit both sparse and selective distributions. Supervised fine-tuning is used to enhance the quality of the visual representation for the categorization task. We performed a thorough experimental evaluation using three image categorization data sets. The hierarchical coding scheme achieved competitive categorization accuracies of 79.7% and 86.4% on the Caltech-101 and 15-Scenes data sets, respectively. The visual representations learned are compact and the model's inference is fast, as compared with sparse coding methods. The low-level representations of descriptors that were learned using this method result in generic features that we empirically found to be transferrable between different image data sets. Further analysis reveal the significance of supervised fine-tuning when the architecture has two layers of representations as opposed to a single layer.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TNNLS.2014.2307532DOI Listing

Publication Analysis

Top Keywords

data sets
12
learning deep
8
hierarchical coding
8
coding scheme
8
supervised fine-tuning
8
coding
5
deep hierarchical
4
hierarchical visual
4
visual feature
4
feature coding
4

Similar Publications

Fully Synthetic Data for Complex Surveys.

Surv Methodol

December 2024

Department of Statistical Science, 214a Old Chemistry Building, Duke University, Durham, NC 27708-0251.

When seeking to release public use files for confidential data, statistical agencies can generate fully synthetic data. We propose an approach for making fully synthetic data from surveys collected with complex sampling designs. Our approach adheres to the general strategy proposed by Rubin (1993).

View Article and Find Full Text PDF

Blood carries some of the most valuable biomarkers for disease screening as it interacts with various tissues and organs in the body. Human blood serum is a reservoir of high molecular weight fraction (HMWF) and low molecular weight fraction (LMWF) proteins. The LMWF proteins are considered disease marker proteins and are often suppressed by HMWF proteins during analysis.

View Article and Find Full Text PDF

Accurate drug-target binding affinity (DTA) prediction is crucial in drug discovery. Recently, deep learning methods for DTA prediction have made significant progress. However, there are still two challenges: (1) recent models always ignore the correlations in drug and target data in the drug/target representation process and (2) the interaction learning of drug-target pairs always is by simple concatenation, which is insufficient to explore their fusion.

View Article and Find Full Text PDF

Here, we apply SuperResNET network analysis of dSTORM single-molecule localization microscopy (SMLM) to determine how the clathrin endocytosis inhibitors pitstop 2, dynasore and Latrunculin A alter the morphology of clathrin-coated pits. SuperResNET analysis of HeLa and Cos7 cells identifies: small oligomers (Class I); pits and vesicles (Class II); and larger clusters corresponding to fused pits or clathrin plaques (Class III). Pitstop 2 and dynasore induce distinct homogeneous populations of Class II structures in HeLa cells suggesting that they arrest endocytosis at different stages.

View Article and Find Full Text PDF

Background: Endogenous Alu RNAs form double-stranded RNAs recognized by double-stranded RNA sensors and activate IRF and NF-kB transcriptional paths and innate immunity. Deamination of adenosines to inosines by the ADAR family of enzymes, a process termed A-to-I editing, disrupts double-stranded RNA structure and prevents innate immune activation. Innate immune activation is observed in Alzheimer's disease, the most common form of dementia.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!