Learning deep hierarchical visual feature coding.

Hanlin Goh Nicolas Thome Matthieu Cord Joo-Hwee Lim

IEEE Trans Neural Netw Learn Syst

Published: December 2014

In this paper, we propose a hybrid architecture that combines the image modeling strengths of the bag of words framework with the representational power and adaptability of learning deep architectures. Local gradient-based descriptors, such as SIFT, are encoded via a hierarchical coding scheme composed of spatial aggregating restricted Boltzmann machines (RBM). For each coding layer, we regularize the RBM by encouraging representations to fit both sparse and selective distributions. Supervised fine-tuning is used to enhance the quality of the visual representation for the categorization task. We performed a thorough experimental evaluation using three image categorization data sets. The hierarchical coding scheme achieved competitive categorization accuracies of 79.7% and 86.4% on the Caltech-101 and 15-Scenes data sets, respectively. The visual representations learned are compact and the model's inference is fast, as compared with sparse coding methods. The low-level representations of descriptors that were learned using this method result in generic features that we empirically found to be transferrable between different image data sets. Further analysis reveal the significance of supervised fine-tuning when the architecture has two layers of representations as opposed to a single layer.

Download full-text PDF	Source
http://dx.doi.org/10.1109/TNNLS.2014.2307532	DOI Listing

Publication Analysis

Top Keywords

data sets

learning deep

hierarchical coding

coding scheme

supervised fine-tuning

coding

deep hierarchical

hierarchical visual

visual feature

feature coding

Similar Publications

Fully Synthetic Data for Complex Surveys.

Surv Methodol

December 2024

Department of Statistical Science, 214a Old Chemistry Building, Duke University, Durham, NC 27708-0251.

Shirley Mathur Yajuan Si Jerome P Reiter

When seeking to release public use files for confidential data, statistical agencies can generate fully synthetic data. We propose an approach for making fully synthetic data from surveys collected with complex sampling designs. Our approach adheres to the general strategy proposed by Rubin (1993).

View Article and Find Full Text PDF

Similar Publications

SERS profiling of blood serum filtrate components from patients with type II diabetes using 100 kDa filtration devices.

RSC Adv

January 2025

Département de Chimie, Faculté des Sciences et de Génie, Université Laval Québec QC G1V 0A6 Canada.

Zainub Shoukat Rafia Atta Muhammad Irfan Majeed Haq Nawaz Nosheen Rashid

Blood carries some of the most valuable biomarkers for disease screening as it interacts with various tissues and organs in the body. Human blood serum is a reservoir of high molecular weight fraction (HMWF) and low molecular weight fraction (LMWF) proteins. The LMWF proteins are considered disease marker proteins and are often suppressed by HMWF proteins during analysis.

View Article and Find Full Text PDF

Similar Publications

Deep Drug-Target Binding Affinity Prediction Base on Multiple Feature Extraction and Fusion.

ACS Omega

January 2025

School of Computer Science and Technology, Zhejiang Sci-Tech University, Hangzhou 310018, China.

Zepeng Li Yuni Zeng Mingfeng Jiang Bo Wei

Accurate drug-target binding affinity (DTA) prediction is crucial in drug discovery. Recently, deep learning methods for DTA prediction have made significant progress. However, there are still two challenges: (1) recent models always ignore the correlations in drug and target data in the drug/target representation process and (2) the interaction learning of drug-target pairs always is by simple concatenation, which is insufficient to explore their fusion.

View Article and Find Full Text PDF

Similar Publications

SuperResNET: Single molecule network analysis detects changes to clathrin structure by small molecule inhibitors.

J Cell Sci

January 2025

Department of Cellular & Physiological Sciences, Life Sciences Institute, University of British Columbia, Vancouver, BC V6T 1Z3, Canada.

Timothy H Wong Ismail M Khater Christian Hallgrimson Y Lydia Li Ghassan Hamarneh

Here, we apply SuperResNET network analysis of dSTORM single-molecule localization microscopy (SMLM) to determine how the clathrin endocytosis inhibitors pitstop 2, dynasore and Latrunculin A alter the morphology of clathrin-coated pits. SuperResNET analysis of HeLa and Cos7 cells identifies: small oligomers (Class I); pits and vesicles (Class II); and larger clusters corresponding to fused pits or clathrin plaques (Class III). Pitstop 2 and dynasore induce distinct homogeneous populations of Class II structures in HeLa cells suggesting that they arrest endocytosis at different stages.

View Article and Find Full Text PDF

Similar Publications

Increased unedited Alu RNA patterns found in cortex extracellular vesicles in Alzheimer's disease resemble hippocampus vasculature Alu RNA editing patterns but not cortex Alu RNA editing patterns.

J Alzheimers Dis

January 2025

Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA.

Philip S Crooke John T Tossberg Thomas M Aune

Background: Endogenous Alu RNAs form double-stranded RNAs recognized by double-stranded RNA sensors and activate IRF and NF-kB transcriptional paths and innate immunity. Deamination of adenosines to inosines by the ADAR family of enzymes, a process termed A-to-I editing, disrupts double-stranded RNA structure and prevents innate immune activation. Innate immune activation is observed in Alzheimer's disease, the most common form of dementia.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!