IEEE Trans Neural Netw Learn Syst
February 2020
Convolutional sparse coding (CSC) is a useful tool in many image and audio applications. Maximizing the performance of CSC requires that the dictionary used to store the features of signals can be learned from real data. The so-called convolutional dictionary learning (CDL) problem is formulated within a nonconvex, nonsmooth optimization framework.
View Article and Find Full Text PDFIEEE Trans Image Process
July 2019
In this paper, we propose a novel approach to convolutional sparse representation with the aim of resolving the dictionary learning problem. The proposed method, referred to as the adaptive alternating direction method of multipliers (AADMM), employs constraints comprising non-convex, non-smooth terms, such as the l -norm imposed on the coefficients and the unit-norm sphere imposed on the length of each dictionary element. The proposed scheme incorporates a novel parameter adaption scheme that enables ADMM to achieve convergence more quickly, as evidenced by numerical and theoretical analysis.
View Article and Find Full Text PDFIn this paper, we present a theoretical analysis of the distortion in multilayer coding structures. Specifically, we analyze the prediction structure used to achieve temporal, spatial, and quality scalability of scalable video coding (SVC) and show that the average peak signal-to-noise ratio (PSNR) of SVC is a weighted combination of the bit rates assigned to all the streams. Our analysis utilizes the end user's preference for certain resolutions.
View Article and Find Full Text PDFIEEE Trans Image Process
January 2009
Performing optimal bit-allocation with 3-D wavelet coding methods is difficult because energy is not conserved after applying the motion-compensated temporal filtering (MCTF) process and the spatial wavelet transform. The problem cannot be solved by extending the 2-D wavelet coefficients weighting method directly and then applying the result to 3-D wavelet coefficients, since this approach does not consider the complicated pixel connectivity that results from the lifting-based MCTF process. In this paper, we propose a novel weighting method, which takes account of the pixel connectivity, to solve the problem and derive the effect of the quantization error of a subband on the reconstruction error of a group of pictures.
View Article and Find Full Text PDF