A novel method, unsupervised video matting via sparse and low-rank representation, is proposed which can achieve high quality in a variety of challenging examples featuring illumination changes, feature ambiguity, topology changes, transparency variation, dis-occlusion, fast motion and motion blur. Some previous matting methods introduced a nonlocal prior to search samples for estimating the alpha matte, which have achieved impressive results on some data. However, on one hand, searching inadequate or excessive samples may miss good samples or introduce noise; on the other hand, it is difficult to construct consistent nonlocal structures for pixels with similar features, yielding video mattes with spatial and temporal inconsistency. In this paper, we proposed a novel video matting method to achieve spatially and temporally consistent matting result. Toward this end, a sparse and low-rank representation model is introduced to pursue consistent nonlocal structures for pixels with similar features. The sparse representation is used to adaptively select best samples and accurately construct the nonlocal structures for all pixels, while the low-rank representation is used to globally ensure consistent nonlocal structures for pixels with similar features. The two representations are combined to generate spatially and temporally consistent video mattes. We test our method on lots of dataset including the benchmark dataset for image matting and dataset for video matting. Our method has achieved the best performance among all unsupervised matting methods in the public alpha matting evaluation dataset for images.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TPAMI.2019.2895331 | DOI Listing |
Sensors (Basel)
April 2024
School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China.
In order to enhance the matting performance in multi-person dynamic scenarios, we introduce a robust, real-time, high-resolution, and controllable human video matting method that achieves state of the art on all metrics. Unlike most existing methods that perform video matting frame by frame as independent images, we design a unified architecture using a controllable generation model to solve the problem of the lack of overall semantic information in multi-person video. Our method, called ControlMatting, uses an independent recurrent architecture to exploit temporal information in videos and achieves significant improvements in temporal coherence and detailed matting quality.
View Article and Find Full Text PDFJ Endourol
August 2024
Department of Uro-Oncology and Robotic Surgery, HCG Cancer Hospital, Bengaluru, Karnataka, India.
To report outcomes of multicenter series of penile cancer patients undergoing robot-assisted video endoscopic inguinal lymph node dissection (RA-VEIL). In this retrospective analysis from 3 tertiary care centers in India, consecutive intermediate-/high-risk carcinoma penis (CaP) patients with nonpalpable inguinal lymphadenopathy and/or nonbulky (<3 cm) mobile inguinal lymphadenopathy undergoing RA-VEIL were included. Patients with matted/bulky (>3 cm) and fixed lymphadenopathy were excluded.
View Article and Find Full Text PDFCurr Med Imaging
August 2024
Department of Ultrasound, Shenzhen People's Hospital (The Second Clinical Medical College, Jinan University, The First Affiliated Hospital, Southern University of Science and Technology), Shenzhen 518020, Guangdong, China.
Background And Objective: The incidence of stroke is rising, and it is the second major cause of mortality and the third leading cause of disability around the globe. The goal of this study was to rapidly and accurately identify carotid plaques and automatically quantify plaque burden using our automated tracking and segmentation US-video system.
Methods: We collected 88 common carotid artery transection videos (11048 frames) with a history of atherosclerosis or risk factors for atherosclerosis, which were randomly divided into training, test, and validation sets using a 6:3:1 ratio.
Image matting is a fundamental and challenging problem in computer vision and graphics. Most existing matting methods leverage a user-supplied trimap as an auxiliary input to produce good alpha matte. However, obtaining high-quality trimap itself is arduous.
View Article and Find Full Text PDFBiomimetics (Basel)
July 2023
Key Laboratory of Big Data and Intelligent Robot (SCUT), MOE of China, School of Software Engineering, South China University of Technology, Guangzhou 510006, China.
Natural image matting is an essential technique for image processing that enables various applications, such as image synthesis, video editing, and target tracking. However, the existing image matting methods may fail to produce satisfactory results when computing resources are limited. Sampling-based methods can reduce the dimensionality of the decision space and, therefore, reduce computational resources by employing different sampling strategies.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!