OD-MVSNet: Omni-dimensional dynamic multi-view stereo network.

PLoS One

College of Information Science and Electrical Engineering, Shandong Jiaotong University, Jinan, Shandong, China.

Published: August 2024

Multi-view stereo based on learning is a critical task in three-dimensional reconstruction, enabling the effective inference of depth maps and the reconstruction of fine-grained scene geometry. However, the results obtained by current popular 3D reconstruction methods are not precise, and achieving high-accuracy scene reconstruction remains challenging due to the pervasive impact of feature extraction and the poor correlation between cost and volume. In addressing these issues, we propose a cascade deep residual inference network to enhance the efficiency and accuracy of multi-view stereo depth estimation. This approach builds a cost-volume pyramid from coarse to fine, generating a lightweight, compact network to improve reconstruction results. Specifically, we introduce the omni-dimensional dynamic atrous spatial pyramid pooling (OSPP), a multiscale feature extraction module capable of generating dense feature maps with multiscale contextual information. The feature maps encoded by the OSPP module can generate dense point clouds without consuming significant memory. Furthermore, to alleviate the issue of feature mismatch in cost volume regularization, we propose a normalization-based 3D attention module. The 3D attention module aggregates crucial information within the cost volume across the dimensions of channel, spatial, and depth. Through extensive experiments on benchmark datasets, notably DTU, we found that the OD-MVSNet model outperforms the baseline model by approximately 1.4% in accuracy loss, 0.9% in completeness loss, and 1.2% in overall loss, demonstrating the effectiveness of our module.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11326553PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0309029PLOS

Publication Analysis

Top Keywords

multi-view stereo
12
cost volume
12
omni-dimensional dynamic
8
feature extraction
8
feature maps
8
attention module
8
reconstruction
5
feature
5
module
5
od-mvsnet omni-dimensional
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!