The Coarse-To-Fine (CTF) matching scheme has been widely applied to reduce computational complexity and matching ambiguity in stereo matching and optical flow tasks by converting image pairs into multi-scale representations and performing matching from coarse to fine levels. Despite its efficiency, it suffers from several weaknesses, such as tending to blur the edges and miss small structures like thin bars and holes. We find that the pixels of small structures and edges are often assigned with wrong disparity/flow in the upsampling process of the CTF framework, introducing errors to the fine levels and leading to such weaknesses. We observe that these wrong disparity/flow values can be avoided if we select the best-matched value among their neighborhood, which inspires us to propose a novel differentiable Neighbor-Search Upsampling (NSU) module. The NSU module first estimates the matching scores and then selects the best-matched disparity/flow for each pixel from its neighbors. It effectively preserves finer structure details by exploiting the information from the finer level while upsampling the disparity/flow. The proposed module can be a drop-in replacement of the naive upsampling in the CTF matching framework and allows the neural networks to be trained end-to-end. By integrating the proposed NSU module into a baseline CTF matching network, we design our Detail Preserving Coarse-To-Fine (DPCTF) matching network. Comprehensive experiments demonstrate that our DPCTF can boost performances for both stereo matching and optical flow tasks. Notably, our DPCTF achieves new state-of-the-art performances for both tasks - it outperforms the competitive baseline (Bi3D) by 28.8% (from 0.73 to 0.52) on EPE of the FlyingThings3D stereo dataset, and ranks first in KITTI flow 2012 benchmark. The code is available at https://github.com/Deng-Y/DPCTF.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TIP.2021.3088635DOI Listing

Publication Analysis

Top Keywords

stereo matching
12
matching optical
12
optical flow
12
ctf matching
12
nsu module
12
matching
11
detail preserving
8
preserving coarse-to-fine
8
flow tasks
8
fine levels
8

Similar Publications

Objective: Somatic variants causing epilepsy are challenging to detect, as they are only present in a subset of brain cells (e.g., mosaic), resulting in low variant allele frequencies.

View Article and Find Full Text PDF

We report a stereo-differentiating dynamic kinetic asymmetric Rh(I)-catalyzed Pauson-Khand reaction, which provides access to an array of thapsigargin stereoisomers. Using catalyst-control, a consistent stereochemical outcome is achieved at C2─for both matched and mismatched cases─regardless of the allene-yne C8 stereochemistry. The stereochemical configuration for all stereoisomers was assigned by comparing experimental vibrational circular dichroism (VCD) and C NMR to DFT-computed spectra.

View Article and Find Full Text PDF

This paper presents a novel method to enhance ground truth disparity maps generated by Semi-Global Matching (SGM) using Maximum a Posteriori (MAP) estimation. SGM, while not producing visually appealing outputs like neural networks, offers high disparity accuracy in valid regions and avoids the generalization issues often encountered with neural network-based disparity estimation. However, SGM struggles with occlusions and textureless areas, leading to invalid disparity values.

View Article and Find Full Text PDF

: There remains a lack of compelling objective evidence on whether stereopsis is necessary for an ophthalmic surgical career. It is also unclear if high-grade stereoacuity correlates with better surgical performance. The present study attempts to address this question by comparing the simulated surgical performance of subjects with different levels of stereoacuity using a virtual reality (VR) intraocular surgical simulator (EYESi, VRmagic, Mannheim, Germany).

View Article and Find Full Text PDF

A Near-Infrared Imaging System for Robotic Venous Blood Collection.

Sensors (Basel)

November 2024

Jiangsu Key Laboratory of Bionic Materials and Equipment, Nanjing 210016, China.

Venous blood collection is a widely used medical diagnostic technique, and with rapid advancements in robotics, robotic venous blood collection has the potential to replace traditional manual methods. The success of this robotic approach is heavily dependent on the quality of vein imaging. In this paper, we develop a vein imaging device based on the simulation analysis of vein imaging parameters and propose a U-Net+ResNet18 neural network for vein image segmentation.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!