Multi-Channel Convolutional Neural Network Based 3D Object Detection for Indoor Robot Environmental Perception.

Sensors (Basel)

ST Engineering-NTU Corporate Laboratory, School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore 637335, Singapore.

Published: February 2019

Environmental perception is a vital feature for service robots when working in an indoor environment for a long time. The general 3D reconstruction is a low-level geometric information description that cannot convey semantics. In contrast, higher level perception similar to humans requires more abstract concepts, such as objects and scenes. Moreover, the 2D object detection based on images always fails to provide the actual position and size of an object, which is quite important for a robot's operation. In this paper, we focus on the 3D object detection to regress the object's category, 3D size, and spatial position through a convolutional neural network (CNN). We propose a multi-channel CNN for 3D object detection, which fuses three input channels including RGB, depth, and bird's eye view (BEV) images. We also propose a method to generate 3D proposals based on 2D ones in the RGB image and semantic prior. Training and test are conducted on the modified NYU V2 dataset and SUN RGB-D dataset in order to verify the effectiveness of the algorithm. We also carry out the actual experiments in a service robot to utilize the proposed 3D object detection method to enhance the environmental perception of the robot.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6412344PMC
http://dx.doi.org/10.3390/s19040893DOI Listing

Publication Analysis

Top Keywords

object detection
20
environmental perception
12
convolutional neural
8
neural network
8
object
6
detection
5
multi-channel convolutional
4
network based
4
based object
4
detection indoor
4

Similar Publications

Object detection in motion management scenarios based on deep learning.

PLoS One

January 2025

School of Physical Education, Jinjiang College, Sichuan University, Chengdu, Sichuan Province, People's Republic of China.

In athletes' competitions and daily training, in order to further strengthen the athletes' sports level, it is usually necessary to analyze the athletes' sports actions at a specific moment, in which it is especially important to quickly and accurately identify the categories and positions of the athletes, sports equipment, field boundaries and other targets in the sports scene. However, the existing detection methods failed to achieve better detection results, and the analysis found that the reasons for this phenomenon mainly lie in the loss of temporal information, multi-targeting, target overlap, and coupling of regression and classification tasks, which makes it more difficult for these network models to adapt to the detection task in this scenario. Based on this, we propose for the first time a supervised object detection method for scenarios in the field of motion management.

View Article and Find Full Text PDF

The visual environment of sign language users is markedly distinct in its spatiotemporal parameters compared to that of non-signers. Although the importance of temporal and spectral resolution in the auditory modality for language development is well established, the spectrotemporal parameters of visual attention necessary for sign language comprehension remain less understood. This study investigates visual temporal resolution in learners of American Sign Language (ASL) at various stages of acquisition to determine how experience with sign language affects perceptual sampling.

View Article and Find Full Text PDF

Background: Grid cells are spatially modulated cells in the entorhinal cortex (EC) that fire in a hexagonally patterned grid which tiles the environment. These cells are assumed important in human spatial navigation. The EC is vulnerable to neurodegenerative processes in both normal aging and Alzheimer's disease and decline in grid cell function may be a key factor in understanding age-related navigational decline.

View Article and Find Full Text PDF

Basic Science and Pathogenesis.

Alzheimers Dement

December 2024

Laboratory of Neuroscience (LIM27), Departamento e Instituto de Psiquiatria, Hospital das Clínicas, Faculdade de Medicina da Universidade de São Paulo, São Paulo, São Paulo, Brazil.

Background: Nearly all individuals with Alzheimer's disease (AD) develop neuropsychiatric symptoms (NPS). Lithium is a mood-stabilizer and is efficient in reducing disruptive behaviors in bipolar-disorder; this characteristic could be an opportunity to investigate the use of lithium in treating behavioral changes in AD.

Method: We tested lithium carbonate treatment in 3xTg-AD and age-matched Wild-type male mice (CEUA/PROCESS: 1605/2020; 4127240122).

View Article and Find Full Text PDF

Background: Alzheimer's disease (AD) has traditionally been recognized as progressive dementia with brain deposits of amyloid (Aβ) and Tau (MAPT) proteins starting 20 and 10 years before the onset of clinical symptoms. Aggregation and deposition of Aβ and Tau proteins have been successfully studied in vitro, cell cultures, and animal models, but clinical deficits have been more difficult to assess. Behavior in mice is a complex phenomenon and subject to variation based on mouse interest, moods, stress-induced distraction, and other undefined parameters.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!