Self-supervised monocular depth estimation has shown impressive results in static scenes. It relies on the multi-view consistency assumption for training networks, however, that is violated in dynamic object regions and occlusions. Consequently, existing methods show poor accuracy in dynamic scenes, and the estimated depth map is blurred at object boundaries because they are usually occluded in other training views. In this paper, we propose SC-DepthV3 for addressing the challenges. Specifically, we introduce an external pretrained monocular depth estimation model for generating single-image depth prior, namely pseudo-depth, based on which we propose novel losses to boost self-supervised training. As a result, our model can predict sharp and accurate depth maps, even when training from monocular videos of highly dynamic scenes. We demonstrate the significantly superior performance of our method over previous methods on six challenging datasets, and we provide detailed ablation studies for the proposed terms.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TPAMI.2023.3322549 | DOI Listing |
Light Sci Appl
January 2025
Innovation Center for Advanced Medical Imaging and Intelligent Medicine, Guangzhou Institute of Technology, Xidian University, Guangzhou, 510555, Guangdong, China.
A novel monocular depth-sensing camera based on meta-imaging sensor technology has been developed, offering more precise depth sensing with millimeter-level accuracy and enhanced robustness compared to conventional 2D and light-field cameras.
View Article and Find Full Text PDFBiomimetics (Basel)
December 2024
School of Artificial Intelligence, Tongmyong University, Busan 48520, Republic of Korea.
Depth estimation plays a pivotal role in advancing human-robot interactions, especially in indoor environments where accurate 3D scene reconstruction is essential for tasks like navigation and object handling. Monocular depth estimation, which relies on a single RGB camera, offers a more affordable solution compared to traditional methods that use stereo cameras or LiDAR. However, despite recent progress, many monocular approaches struggle with accurately defining depth boundaries, leading to less precise reconstructions.
View Article and Find Full Text PDFLaparoscopic video tracking primarily focuses on two target types: surgical instruments and anatomy. The former could be used for skill assessment, while the latter is necessary for the projection of virtual overlays. Where instrument and anatomy tracking have often been considered two separate problems, in this article, a method is proposed for joint tracking of all structures simultaneously.
View Article and Find Full Text PDFKorean J Ophthalmol
December 2024
Department of Ophthalmology, Chung-Ang University Hospital, Seoul, Korea.
Purpose: This study evaluated the objective changes in the contralateral eye after unilateral cataract surgery.
Methods: The study was designed as retrospective observational study. It included 44 patients who underwent unilateral cataract surgery.
Clin Ophthalmol
December 2024
Department of Ophthalmology, Donostia University Hospital, Donostia-San Sebastián, Spain.
Purpose: To describe the visual, refractive, functional, and patient satisfaction outcomes of the Bi-Flex POB-MA 877PEY (Elon, Medicontur Medical Engineering Ltd. Zsámbék, Hungary) extended depth-of-focus intraocular lens (EDoF IOL).
Patients And Methods: This was a prospective longitudinal descriptive study.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!