In this paper, we come up with a simple yet effective approach for instance segmentation on 3D point cloud with strong robustness. Previous top-performing methods for this task adopt a bottom-up strategy, which often involves various inefficient operations or complex pipelines, such as grouping over-segmented components, introducing heuristic post-processing steps, and designing complex loss functions. As a result, the inevitable variations of the instances sizes make it vulnerable and sensitive to the values of pre-defined hyper-parameters. To this end, we instead propose a novel pipeline that applies dynamic convolution to generate instance-aware parameters in response to the characteristics of the instances. The representation capability of the parameters is greatly improved by gathering homogeneous points that have identical semantic categories and close votes for the geometric centroids. Instances are then decoded via several simple convolution layers, where the parameters are generated depending on the input. In addition, to introduce a large context and maintain limited computational overheads, a light-weight transformer is built upon the bottleneck layer to capture the long-range dependencies. With the only post-processing step, non-maximum suppression (NMS), we demonstrate a simpler and more robust approach that achieves promising performance on various datasets: ScanNetV2, S3DIS, and PartNet. The consistent improvements on both voxel- and point-based architectures imply the effectiveness of the proposed method. Code is available at: https://git.io/DyCo3D.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TPAMI.2022.3216926DOI Listing

Publication Analysis

Top Keywords

dynamic convolution
8
point cloud
8
instance segmentation
8
convolution point
4
cloud instance
4
segmentation paper
4
paper simple
4
simple effective
4
effective approach
4
approach instance
4

Similar Publications

To enhance the intelligent classification of computer vulnerabilities and improve the efficiency and accuracy of network security management, this study delves into the application of a comprehensive classification system that integrates the Memristor Neural Network (MNN) and an improved Temporal Convolutional Neural Network (TCNN) in network security management. This system not only focuses on the precise classification of vulnerability data but also emphasizes its core role in strengthening the network security management framework. Firstly, the study designs and implements a neural network model based on memristors.

View Article and Find Full Text PDF

It is of great significance to realize the accurate prediction of the key output response of the chemical synthetic ammonia process for optimizing system performance and operation monitoring. Because many key intermediate variables of complex systems are difficult to measure comprehensively, there are great difficulties and errors in mechanism analysis and identification modeling techniques. Based on random forest (RF) variable selection, a deep neural network combining temporal convolutional network (TCN) and transformer is proposed to predict the output variables of the synthetic ammonia process.

View Article and Find Full Text PDF

This study introduces a novel ensemble learning technique namely Multi-Armed Bandit Ensemble (MAB-Ensemble), designed for lane detection in road images intended for autonomous vehicles. The foundation of the proposed MAB-Ensemble technique is inspired in terms of Multi-Armed bandit optimization to facilitate efficient model selection for lane segmentation. The benchmarking dataset namely TuSimple is used for training, validating and testing the proposed and existing lane detection techniques.

View Article and Find Full Text PDF

Co-Infection of Mosquitoes with Rift Valley Fever Phlebovirus Strains Results in Efficient Viral Reassortment.

Viruses

January 2025

Center of Excellence for Emerging and Zoonotic Animal Diseases, Diagnostic Medicine/Pathobiology, Kansas State University, Manhattan, KS 66506, USA.

Rift Valley fever phlebovirus (RVFV) is a zoonotic mosquito-borne pathogen endemic to sub-Saharan Africa and the Arabian Peninsula which causes Rift Valley fever in ruminant livestock and humans. Co-infection with divergent viral strains can produce reassortment among the L, S, and M segments of the RVFV genome. Reassortment events can produce novel genotypes with altered virulence, transmission dynamics, and/or mosquito host range.

View Article and Find Full Text PDF

TBF-YOLOv8n: A Lightweight Tea Bud Detection Model Based on YOLOv8n Improvements.

Sensors (Basel)

January 2025

School of Electrical and Electronic Engineering, Wuhan Polytechnic University, Wuhan 430048, China.

Tea bud localization detection not only ensures tea quality, improves picking efficiency, and advances intelligent harvesting, but also fosters tea industry upgrades and enhances economic benefits. To solve the problem of the high computational complexity of deep learning detection models, we developed the Tea Bud DSCF-YOLOv8n (TBF-YOLOv8n)lightweight detection model. Improvement of the Cross Stage Partial Bottleneck Module with Two Convolutions(C2f) module via efficient Distributed Shift Convolution (DSConv) yields the C2f module with DSConv(DSCf)module, which reduces the model's size.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!