Video frame interpolation aims to improve users' watching experiences by generating high-frame-rate videos from low-frame-rate ones. Existing approaches typically focus on synthesizing intermediate frames using high-quality reference images. However, the captured reference frames may suffer from inevitable spatial degradations such as motion blur, sensor noise, etc. Few studies have approached the joint video enhancement problem, namely synthesizing high-frame-rate and high-quality results from low-frame-rate degraded inputs. In this paper, we propose a unified optimization framework for video frame interpolation with spatial degradations. Specifically, we develop a frame interpolation module with a pyramid structure to cyclically synthesize high-quality intermediate frames. The pyramid module features adjustable spatial receptive field and temporal scope, thus contributing to controllable computational complexity and restoration ability. Besides, we propose an inter-pyramid recurrent module to connect sequential models to exploit the temporal relationship. The pyramid module integrates the recurrent module, thus can iteratively synthesize temporally smooth results. And the pyramid modules share weights across iterations, thus it does not expand the model's parameter size. Our model can be generalized to several applications such as up-converting the frame rate of videos with motion blur, reducing compression artifacts, and jointly super-resolving low-resolution videos. Extensive experimental results demonstrate that our method performs favorably against state-of-the-art methods on various video frame interpolation and enhancement tasks.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TIP.2020.3033617DOI Listing

Publication Analysis

Top Keywords

frame interpolation
20
video frame
16
interpolation enhancement
8
framework video
8
intermediate frames
8
spatial degradations
8
motion blur
8
pyramid module
8
recurrent module
8
video
5

Similar Publications

Enhancing space-time video super-resolution via spatial-temporal feature interaction.

Neural Netw

December 2024

College of Electronic and Information Engineering, Tongji University, China; Shanghai Institute of Intelligent Science and Technology, Tongji University, China. Electronic address:

The target of space-time video super-resolution (STVSR) is to increase both the frame rate (also referred to as the temporal resolution) and the spatial resolution of a given video. Recent approaches solve STVSR using end-to-end deep neural networks. A popular solution is to first increase the frame rate of the video; then perform feature refinement among different frame features; and at last, increase the spatial resolutions of these features.

View Article and Find Full Text PDF

Background: Shear wave elastography (SWE) is a technique that quantifies tissue stiffness by assessing the speed of shear waves propagating after being excited by acoustic radiation force. SWE allows the quantification of elastic tissue properties and serves as an adjunct to conventional ultrasound techniques, aiding in tissue characterization. To capture this transient propagation of the shear wave, the ultrasound device must be able to reach very high frame rates.

View Article and Find Full Text PDF

This study demonstrates high volume rate bistatic 3-D vascular strain imaging, to overcome well-known challenges caused by the anisotropic resolution and contrast inherent to ultrasound imaging. Approach. Using two synchronized 32x32 element matrix arrays (3.

View Article and Find Full Text PDF

Real-time 3D MR guided radiation therapy through orthogonal MR imaging and manifold learning.

Med Phys

December 2024

Department of Radiation Oncology, Duke Cancer Center, Duke University, Durham, North Carolina, USA.

Background: In magnetic resonance image (MRI)-guided radiotherapy (MRgRT), 2D rapid imaging is commonly used to track moving targets with high temporal frequency to minimize gating latency. However, anatomical motion is not constrained to 2D, and a portion of the target may be missed during treatment if 3D motion is not evaluated. While some MRgRT systems attempt to capture 3D motion by sequentially tracking motion in 2D orthogonal imaging planes, this approach assesses 3D motion via independent 2D measurements at alternating instances, lacking a simultaneous 3D motion assessment in both imaging planes.

View Article and Find Full Text PDF

Respiration-correlated cone-beam computed tomography (4D-CBCT) is an X-ray-based imaging modality that uses reconstruction algorithms to produce time-varying volumetric images of moving anatomy over a cycle of respiratory motion. The quality of the produced images is affected by the number of CBCT projections available for reconstruction. Interpolation techniques have been used to generate intermediary projections to be used, along with the original projections, for reconstruction.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!