Progressive Dictionary Learning With Hierarchical Predictive Structure for Low Bit-Rate Scalable Video Coding.

Wenrui Dai Yangmei Shen Hongkai Xiong Xiaoqian Jiang Junni Zou David Taubman

IEEE Trans Image Process

Published: June 2017

Dictionary learning has emerged as a promising alternative to the conventional hybrid coding framework. However, the rigid structure of sequential training and prediction degrades its performance in scalable video coding. This paper proposes a progressive dictionary learning framework with hierarchical predictive structure for scalable video coding, especially in low bitrate region. For pyramidal layers, sparse representation based on spatio-temporal dictionary is adopted to improve the coding efficiency of enhancement layers with a guarantee of reconstruction performance. The overcomplete dictionary is trained to adaptively capture local structures along motion trajectories as well as exploit the correlations between the neighboring layers of resolutions. Furthermore, progressive dictionary learning is developed to enable the scalability in temporal domain and restrict the error propagation in a closed-loop predictor. Under the hierarchical predictive structure, online learning is leveraged to guarantee the training and prediction performance with an improved convergence rate. To accommodate with the state-of-the-art scalable extension of H.264/AVC and latest High Efficiency Video Coding (HEVC), standardized codec cores are utilized to encode the base and enhancement layers. Experimental results show that the proposed method outperforms the latest scalable extension of HEVC and HEVC simulcast over extensive test sequences with various resolutions.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5638692	PMC
http://dx.doi.org/10.1109/TIP.2017.2692882	DOI Listing

Publication Analysis

Top Keywords

dictionary learning

video coding

progressive dictionary

hierarchical predictive

predictive structure

scalable video

training prediction

enhancement layers

scalable extension

coding

Similar Publications

CircWaveDL: Modeling of optical coherence tomography images based on a new supervised tensor-based dictionary learning for classification of macular abnormalities.

Artif Intell Med

December 2024

Medical Image and Signal Processing Research Center, Isfahan University of Medical Sciences, Isfahan 81746-73461, Iran. Electronic address:

Roya Arian Alireza Vard Rahele Kafieh Gerlind Plonka Hossein Rabbani

Modeling Optical Coherence Tomography (OCT) images is crucial for numerous image processing applications and aids ophthalmologists in the early detection of macular abnormalities. Sparse representation-based models, particularly dictionary learning (DL), play a pivotal role in image modeling. Traditional DL methods often transform higher-order tensors into vectors and then aggregate them into a matrix, which overlooks the inherent multi-dimensional structure of the data.

View Article and Find Full Text PDF

Similar Publications

Compressive electron backscatter diffraction imaging.

J Microsc

January 2025

Department of Mechanical, Materials and Aerospace Engineering, University of Liverpool, Liverpool, UK.

Zoë Broad Alex W Robinson Jack Wells Daniel Nicholls Amirafshar Moshtaghpour

Electron backscatter diffraction (EBSD) has developed over the last few decades into a valuable crystallographic characterisation method for a wide range of sample types. Despite these advances, issues such as the complexity of sample preparation, relatively slow acquisition, and damage in beam-sensitive samples, still limit the quantity and quality of interpretable data that can be obtained. To mitigate these issues, here we propose a method based on the subsampling of probe positions and subsequent reconstruction of an incomplete data set.

View Article and Find Full Text PDF

Similar Publications

The integrated teaching practice of medical cloud dictionary development and project-based learning.

BMC Med Educ

January 2025

The First Clinical Medicine School of Guangdong Pharmaceutical University, Guangdong, People's Republic of China.

Jiayi Zhang Jiexuan Chen Hongbin Guo Wei Liu Mingzhe Li

Objective: This study examines a novel teaching model that integrates the development and use of a Medical Cloud Dictionary with project-based learning (PBL). We investigate whether this integrated approach improves teaching effectiveness, enhances student learning outcomes, and reduces teaching pressure compared to traditional PBL.

Methods: One hundred student volunteers were randomly assigned to an experimental group (n = 50) and a control group (n = 50).

View Article and Find Full Text PDF

Similar Publications

Hybrid natural language processing tool for semantic annotation of medical texts in Spanish.

BMC Bioinformatics

January 2025

Centro de Salud Retiro, Hospital Universitario Gregorio Marañon, C/Lope de Rueda, 43, 28009, Madrid, Spain.

Leonardo Campillos-Llanos Ana Valverde-Mateos Adrián Capllonch-Carrión

Background: Natural language processing (NLP) enables the extraction of information embedded within unstructured texts, such as clinical case reports and trial eligibility criteria. By identifying relevant medical concepts, NLP facilitates the generation of structured and actionable data, supporting complex tasks like cohort identification and the analysis of clinical records. To accomplish those tasks, we introduce a deep learning-based and lexicon-based named entity recognition (NER) tool for texts in Spanish.

View Article and Find Full Text PDF

Similar Publications

Public Health Discussions on Social Media: Evaluating Automated Sentiment Analysis Methods.

JMIR Form Res

January 2025

Department of Health Administration, The College of Health Professions, Central Michigan University, Mt Pleasant, MI, United States.

Lisa M Gandy Lana V Ivanitskaya Leeza L Bacon Rodina Bizri-Baryak

Article Synopsis

Sentiment analysis is a key method for analyzing text, especially in social media research, where the choice between manual and automated methods is crucial.
The study compared several sentiment analysis tools, including VADER, TEXT2DATA, LIWC-22, and ChatGPT 4.0, against manually coded sentiment scores from YouTube comments on the opioid crisis, assessing factors like ease of use and cost.
Findings revealed that LIWC-22 excelled in identifying sentiment patterns, while VADER was best at classifying negative comments, but overall, automated tools showed only fair agreement with manual coding, with ChatGPT performing poorly.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!