While deep convolutional neural networks have shown a remarkable success in image classification, the problems of inter-class similarities, intra-class variances, the effective combination of multi-modal data, and the spatial variability in images of objects remain to be major challenges. To address these problems, this paper proposes a novel framework to learn a discriminative and spatially invariant classification model for object and indoor scene recognition using multi-modal RGB-D imagery. This is achieved through three postulates: 1) spatial invariance $-$ this is achieved by combining a spatial transformer network with a deep convolutional neural network to learn features which are invariant to spatial translations, rotations, and scale changes, 2) high discriminative capability $-$ this is achieved by introducing Fisher encoding within the CNN architecture to learn features which have small inter-class similarities and large intra-class compactness, and 3) multi-modal hierarchical fusion$-$ this is achieved through the regularization of semantic segmentation to a multi-modal CNN architecture, where class probabilities are estimated at different hierarchical levels (i.e., image- and pixel-levels), and fused into a Conditional Random Field (CRF)-based inference hypothesis, the optimization of which produces consistent class labels in RGB-D images. Extensive experimental evaluations on RGB-D object and scene datasets, and live video streams (acquired from Kinect) show that our framework produces superior object and scene classification results compared to the state-of-the-art methods.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TPAMI.2017.2747134DOI Listing

Publication Analysis

Top Keywords

discriminative spatially
8
spatially invariant
8
rgb-d object
8
deep convolutional
8
convolutional neural
8
inter-class similarities
8
$-$ achieved
8
learn features
8
cnn architecture
8
object scene
8

Similar Publications

Intranasal iron administration induces iron deposition, immunoactivation, and cell-specific vulnerability in the olfactory bulb of C57BL/6 mice.

Zool Res

January 2025

School of Basic Medicine, Institute of Brain Science and Disease, Shandong Provincial Key Laboratory of Pathogenesis and Prevention of Brain Diseases, Qingdao University, Qingdao, Shandong, 266071, China. E-mail:

Iron is the most abundant transition metal in the brain and is essential for brain development and neuronal function; however, its abnormal accumulation is also implicated in various neurological disorders. The olfactory bulb (OB), an early target in neurodegenerative diseases, acts as a gateway for environmental toxins and contains diverse neuronal populations with distinct roles. This study explored the cell-specific vulnerability to iron in the OB using a mouse model of intranasal administration of ferric ammonium citrate (FAC).

View Article and Find Full Text PDF

Amyloid-β deposition in basal frontotemporal cortex is associated with selective disruption of temporal mnemonic discrimination.

J Neurosci

January 2025

Department of Neurobiology and Behavior and Center for the Neurobiology of Learning and Memory, University of California, Irvine, Irvine, California 92697 USA

Cerebral amyloid-beta (Aβ) accumulation, a hallmark pathology of Alzheimer's disease (AD), precedes clinical impairment by two to three decades. However, it is unclear whether Aβ contributes to subtle memory deficits observed during the preclinical stage. The heterogenous emergence of Aβ deposition may selectively impact certain memory domains, which rely on distinct underlying neural circuits.

View Article and Find Full Text PDF

Background: In the realm of breast cancer diagnosis and treatment, accurately discerning molecular subtypes is of paramount importance, especially when aiming to avoid invasive tests. The updated guidelines for diagnosing and treating HER2 positive advanced breast cancer, as presented at the 2021 National Breast Cancer Conference and the Annual Meeting of the Chinese Society of Clinical Oncology, highlight the significance of this approach. A new generation of drug-antibody combinations has emerged, expanding the array of treatment options for HER2 positive advanced breast cancer and significantly improving patient survival rates.

View Article and Find Full Text PDF

Colorectal cancer is the second leading cause of cancer-related deaths worldwide, and its development typically involves complex metabolic reprogramming. By mapping the spatial distributions of metabolites and -glycans in heterogeneous colorectal cancer tissues, we can elucidate cancer-associated metabolic and -glycan changes. Herein, we combine mass spectrometry imaging-based metabolomics and -glycomics to characterize the spatially resolved reprogramming of metabolites and -glycans in colorectal cancer tissues.

View Article and Find Full Text PDF

The touristification of Old Havana is resulting in unique patterns of gentrification that rely on a new spatial imaginary, the enforcement of which is resulting in the loss of places for residents to be young. The Cuban state's preservation of significant proportions of social housing as part of its investments in the heritage tourism industry is disrupting common housing-led displacement in the city. The neighbourhood's economic transition is concentrated instead in public spaces, as squares and streets are taken over by new tourist-serving businesses.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!