Refined Voting and Scene Feature Fusion for 3D Object Detection in Point Clouds.

Comput Intell Neurosci

Fujian Key Laboratory of Sensing and Computing for Smart Cities, School of Informatics, Xiamen University, Xiamen 361005, China.

Published: January 2023

AI Article Synopsis

  • 3D object detection in lidar point clouds is crucial for understanding 3D visual worlds, but existing methods can generate inaccurate and redundant bounding boxes due to voting errors in Hough voting techniques.
  • The proposed RSFF-Net network improves indoor 3D object detection through refined voting and fusion of scene features, utilizing three key modules: geometric function, refined voting, and scene constraint.
  • RSFF-Net demonstrates strong performance on indoor object detection benchmarks, including ScanNet V2 and SUN RGB-D, by effectively integrating geometric features and contextual scene relationships.

Article Abstract

An essential task for 3D visual world understanding is 3D object detection in lidar point clouds. To predict directly bounding box parameters from point clouds, existing voting-based methods use Hough voting to obtain the centroid of each object. However, it may be difficult for the inaccurately voted centers to regress boxes accurately, leading to the generation of redundant bounding boxes. For objects in indoor scenes, there are several co-occurrence patterns for objects in indoor scenes. Concurrently, semantic relations between object layouts and scenes can be used as prior context to guide object detection. We propose a simple, yet effective network, RSFF-Net, which adds refined voting and scene feature fusion for indoor 3D object detection. The RSFF-Net consists of three modules: geometric function, refined voting, and scene constraint. First, a geometric function module is used to capture the geometric features of the nearest object of the voted points. Then, the coarse votes are revoted by a refined voting module, which is based on the fused feature between the coarse votes and geometric features. Finally, a scene constraint module is used to add the association information between candidate objects and scenes. RSFF-Net achieves competitive results on indoor 3D object detection benchmarks: ScanNet V2 and SUN RGB-D.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9815914PMC
http://dx.doi.org/10.1155/2022/3023934DOI Listing

Publication Analysis

Top Keywords

object detection
20
refined voting
16
voting scene
12
point clouds
12
scene feature
8
feature fusion
8
object
8
objects indoor
8
indoor scenes
8
indoor object
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!