Self-supervised monocular depth estimation has shown impressive results in static scenes. It relies on the multi-view consistency assumption for training networks, however, that is violated in dynamic object regions and occlusions. Consequently, existing methods show poor accuracy in dynamic scenes, and the estimated depth map is blurred at object boundaries because they are usually occluded in other training views. In this paper, we propose SC-DepthV3 for addressing the challenges. Specifically, we introduce an external pretrained monocular depth estimation model for generating single-image depth prior, namely pseudo-depth, based on which we propose novel losses to boost self-supervised training. As a result, our model can predict sharp and accurate depth maps, even when training from monocular videos of highly dynamic scenes. We demonstrate the significantly superior performance of our method over previous methods on six challenging datasets, and we provide detailed ablation studies for the proposed terms.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TPAMI.2023.3322549DOI Listing

Publication Analysis

Top Keywords

monocular depth
12
depth estimation
12
dynamic scenes
12
self-supervised monocular
8
depth
6
sc-depthv3 robust
4
robust self-supervised
4
monocular
4
dynamic
4
estimation dynamic
4

Similar Publications

Monocular meta-imaging camera sees depth.

Light Sci Appl

January 2025

Innovation Center for Advanced Medical Imaging and Intelligent Medicine, Guangzhou Institute of Technology, Xidian University, Guangzhou, 510555, Guangdong, China.

A novel monocular depth-sensing camera based on meta-imaging sensor technology has been developed, offering more precise depth sensing with millimeter-level accuracy and enhanced robustness compared to conventional 2D and light-field cameras.

View Article and Find Full Text PDF

Deep Neural Networks for Accurate Depth Estimation with Latent Space Features.

Biomimetics (Basel)

December 2024

School of Artificial Intelligence, Tongmyong University, Busan 48520, Republic of Korea.

Depth estimation plays a pivotal role in advancing human-robot interactions, especially in indoor environments where accurate 3D scene reconstruction is essential for tasks like navigation and object handling. Monocular depth estimation, which relies on a single RGB camera, offers a more affordable solution compared to traditional methods that use stereo cameras or LiDAR. However, despite recent progress, many monocular approaches struggle with accurately defining depth boundaries, leading to less precise reconstructions.

View Article and Find Full Text PDF

Laparoscopic video tracking primarily focuses on two target types: surgical instruments and anatomy. The former could be used for skill assessment, while the latter is necessary for the projection of virtual overlays. Where instrument and anatomy tracking have often been considered two separate problems, in this article, a method is proposed for joint tracking of all structures simultaneously.

View Article and Find Full Text PDF

Purpose: This study evaluated the objective changes in the contralateral eye after unilateral cataract surgery.

Methods: The study was designed as retrospective observational study. It included 44 patients who underwent unilateral cataract surgery.

View Article and Find Full Text PDF

Purpose: To describe the visual, refractive, functional, and patient satisfaction outcomes of the Bi-Flex POB-MA 877PEY (Elon, Medicontur Medical Engineering Ltd. Zsámbék, Hungary) extended depth-of-focus intraocular lens (EDoF IOL).

Patients And Methods: This was a prospective longitudinal descriptive study.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!