Convolution-Based Encoding of Depth Images for Transfer Learning in RGB-D Scene Classification.

Sensors (Basel)

Department of Computer Engineering, College of Computer and Information Sciences, King Saud University, P.O. Box 57168, Riyadh 11543, Saudi Arabia.

Published: November 2021

Classification of indoor environments is a challenging problem. The availability of low-cost depth sensors has opened up a new research area of using depth information in addition to color image (RGB) data for scene understanding. Transfer learning of deep convolutional networks with pairs of RGB and depth (RGB-D) images has to deal with integrating these two modalities. Single-channel depth images are often converted to three-channel images by extracting horizontal disparity, height above ground, and the angle of the pixel's local surface normal (HHA) to apply transfer learning using networks trained on the Places365 dataset. The high computational cost of HHA encoding can be a major disadvantage for the real-time prediction of scenes, although this may be less important during the training phase. We propose a new, computationally efficient encoding method that can be integrated with any convolutional neural network. We show that our encoding approach performs equally well or better in a multimodal transfer learning setup for scene classification. Our encoding is implemented in a customized and pretrained VGG16 Net. We address the class imbalance problem seen in the image dataset using a method based on the synthetic minority oversampling technique (SMOTE) at the feature level. With appropriate image augmentation and fine-tuning, our network achieves scene classification accuracy comparable to that of other state-of-the-art architectures.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8659746PMC
http://dx.doi.org/10.3390/s21237950DOI Listing

Publication Analysis

Top Keywords

transfer learning
16
scene classification
12
depth images
8
depth
5
convolution-based encoding
4
encoding depth
4
images
4
transfer
4
images transfer
4
learning
4

Similar Publications

Objective: The first objective is to develop a nuchal thickness reference chart. The second objective is to compare rule-based algorithms and machine learning models in predicting small-for-gestational-age infants.

Method: This retrospective study involved singleton pregnancies at University Malaya Medical Centre, Malaysia, developed a nuchal thickness chart and evaluated its predictive value for small-for-gestational-age using Malaysian and Singapore cohorts.

View Article and Find Full Text PDF

Radiography is a field of medicine inherently intertwined with technology. The dependency on technology is very high for obtaining images in ultrasound (US), computed tomography (CT), and magnetic resonance imaging (MRI). Although the reduction in radiation dose is not applicable in US and MRI, advancements in technology have made it possible in CT, with ongoing studies aimed at further optimization.

View Article and Find Full Text PDF

Purpose: Patients with advanced non-small cell lung cancer (NSCLC) have varying responses to immunotherapy, but there are no reliable, accepted biomarkers to accurately predict its therapeutic efficacy. The present study aimed to construct individualized models through automatic machine learning (autoML) to predict the efficacy of immunotherapy in patients with inoperable advanced NSCLC.

Methods: A total of 63 eligible participants were included and randomized into training and validation groups.

View Article and Find Full Text PDF

Plastic waste management is one of the key issues in global environmental protection. Integrating spectroscopy acquisition devices with deep learning algorithms has emerged as an effective method for rapid plastic classification. However, the challenges in collecting plastic samples and spectroscopy data have resulted in a limited number of data samples and an incomplete comparison of relevant classification algorithms.

View Article and Find Full Text PDF

Background And Aim: Discriminating between idiosyncratic drug-induced liver injury (DILI) and autoimmune hepatitis (AIH) is critical yet challenging. We aim to develop and validate a machine learning (ML)-based model to aid in this differentiation.

Methods: This multicenter cohort study utilised a development set from Beijing Friendship Hospital, with retrospective and prospective validation sets from 10 tertiary hospitals across various regions of China spanning January 2009 to May 2023.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!