An efficient point cloud semantic segmentation network with multiscale super-patch transformer.

Yongwei Miao Yuliang Sun Yimin Zhang Jinrong Wang Xudong Zhang

Sci Rep

School of Information Science and Technology, Zhejiang Shuren University, Hangzhou, 310015, China.

Published: June 2024

Efficient semantic segmentation of large-scale point clouds is essential for understanding 3D environments, but the diverse shapes and occlusions make it challenging to train deep neural networks effectively.
The proposed multiscale super-patch transformer network (MSSPTNet) uses a novel approach that involves extracting super-patches and aggregating local features while leveraging contextual information through a self-attention mechanism.
Experimental results show that MSSPTNet outperforms traditional segmentation networks in terms of efficiency, especially in indoor scenes with repetitive structures, achieving much faster training times.

Efficient semantic segmentation of large-scale point cloud scenes is a fundamental and essential task for perception or understanding the surrounding 3d environments. However, due to the vast amount of point cloud data, it is always a challenging to train deep neural networks efficiently and also difficult to establish a unified model to represent different shapes effectively due to their variety and occlusions of scene objects. Taking scene super-patch as data representation and guided by its contextual information, we propose a novel multiscale super-patch transformer network (MSSPTNet) for point cloud segmentation, which consists of a multiscale super-patch local aggregation (MSSPLA) module and a super-patch transformer (SPT) module. Given large-scale point cloud data as input, a dynamic region-growing algorithm is first adopted to extract scene super-patches from the sampling points with consistent geometric features. Then, the MSSPLA module aggregates local features and their contextual information of adjacent super-patches at different scales. Owing to the self-attention mechanism, the SPT module exploits the similarity among scene super-patches in high-level feature space. By combining these two modules, our MSSPTNet can effectively learn both local and global features from the input point clouds. Finally, the interpolating upsampling and multi-layer perceptrons are exploited to generate semantic labels for the original point cloud data. Experimental results on the public S3DIS dataset demonstrate its efficiency of the proposed network for segmenting large-scale point cloud scenes, especially for those indoor scenes with a large number of repetitive structures, i.e., the network training of our MSSPTNet is much faster than other segmentation networks by a factor of tens to hundreds.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11199674	PMC
http://dx.doi.org/10.1038/s41598-024-63451-8	DOI Listing

Publication Analysis

Top Keywords

point cloud

multiscale super-patch

super-patch transformer

large-scale point

cloud data

semantic segmentation

cloud scenes

msspla module

spt module

scene super-patches

Similar Publications

LiDAR point cloud denoising for individual tree extraction based on the Noise4Denoise.

Front Plant Sci

January 2025

Research Institute of Forest Policy and Information, Chinese Academy of Forestry, Beijing, China.

Xiangfei Lu Zongyu Ye Liyong Fu Huaiyi Wang Kaiyu Wang

The processing of LiDAR point cloud data is of critical importance in the context of forest resource surveys, as well as representing a pivotal element in the realm of forest physiological and ecological studies.Nonetheless, conventional denoising algorithms frequently exhibit deficiencies with regard to adaptability and denoising efficacy, particularly when employed in relation to disparate datasets.To address these issues, this study introduces DEN4, an unsupervised, deep learning-based point cloud denoising algorithm designed to improve the accuracy of single tree segmentation in LiDAR point clouds.

View Article and Find Full Text PDF

Similar Publications

The application of design of experiments and artificial neural networks in the evaluation of the impact of acidic conditions on cloud point extraction.

J Chromatogr A

January 2025

Department of Physical Pharmacy and Pharmacokinetics, Poznań University of Medical Sciences, Rokietnicka 3 Street, Poznań 60-806, Poland. Electronic address:

Bartosz Sznek Andrzej Czyrski

This study aimed to analyze the impact of acidic conditions on the recovery of ciprofloxacin and levofloxacin for cloud point extraction with the Design of Experiments and Artificial Neural Networks. The design included 27 experiments featuring three repetitions of the central point for both drugs. The tested parameters included Triton X-114 concentration, HCl concentration, NaCl concentration, and incubation temperature, which were coded at five levels.

View Article and Find Full Text PDF

Similar Publications

An autonomous navigation method for orchard mobile robots based on octree 3D point cloud optimization.

Front Plant Sci

January 2025

Institute of Agricultural Facilities and Equipment, Jiangsu Academy of Agricultural Sciences, Nanjing, Jiangsu, China.

Hailong Li Kai Huang Yuanhao Sun Xiaohui Lei Quanchun Yuan

Three-dimensional (3D) LiDAR is crucial for the autonomous navigation of orchard mobile robots, offering comprehensive and accurate environmental perception. However, the increased richness of information provided by 3D LiDAR also leads to a higher computational burden for point cloud data processing, posing challenges to real-time navigation. To address these issues, this paper proposes a 3D point cloud optimization method based on the octree data structure for autonomous navigation of orchard mobile robots.

View Article and Find Full Text PDF

Similar Publications

Predicting the location of coordinated metal ion-ligand binding sites using geometry-aware graph neural networks.

Comput Struct Biotechnol J

December 2024

Department of Electrical Engineering and Computer Science, Bond Life Sciences Center, University of Missouri, Columbia, MO, USA.

Clement Essien Ning Wang Yang Yu Salhuldin Alqarghuli Yongfang Qin

More than 50 % of proteins bind to metal ions. Interactions between metal ions and proteins, especially coordinated interactions, are essential for biological functions, such as maintaining protein structure and signal transport. Physiological metal-ion binding prediction is pivotal for both elucidating the biological functions of proteins and for the design of new drugs.

View Article and Find Full Text PDF

Similar Publications

Navigating the Complex Terrain of Methane Synthesis: Multienzyme Control Points and Data-Driven Strategies.

ACS Omega

January 2025

Department of Biotechnology and Food Science, Durban University of Technology, Durban 4001, South Africa.

Nikwando Mohlomi Rosina Nkuna Kugenthiren Permaul Suren Singh Adarsh Kumar Puri

Anaerobic digestion is a crucial process in wastewater treatment, renowned for its sustainable biogas production capabilities and the simultaneous reduction of environmental pollution. However, dysregulation of vital biological processes and pathways can lead to reduced efficiency and suboptimal biogas output, which can be seen through low counts per million of sequences related to three critical control points for methane synthesis. Namely, tetrahydromethanopterin S-methyltransferase (MTR), methyl-coenzyme reductase M (MCR), and CoB/CoM heterodisulfide oxidoreductase (HDR) are the last reactions that must occur.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!