Dictionary learning has emerged as a promising alternative to the conventional hybrid coding framework. However, the rigid structure of sequential training and prediction degrades its performance in scalable video coding. This paper proposes a progressive dictionary learning framework with hierarchical predictive structure for scalable video coding, especially in low bitrate region. For pyramidal layers, sparse representation based on spatio-temporal dictionary is adopted to improve the coding efficiency of enhancement layers with a guarantee of reconstruction performance. The overcomplete dictionary is trained to adaptively capture local structures along motion trajectories as well as exploit the correlations between the neighboring layers of resolutions. Furthermore, progressive dictionary learning is developed to enable the scalability in temporal domain and restrict the error propagation in a closed-loop predictor. Under the hierarchical predictive structure, online learning is leveraged to guarantee the training and prediction performance with an improved convergence rate. To accommodate with the state-of-the-art scalable extension of H.264/AVC and latest High Efficiency Video Coding (HEVC), standardized codec cores are utilized to encode the base and enhancement layers. Experimental results show that the proposed method outperforms the latest scalable extension of HEVC and HEVC simulcast over extensive test sequences with various resolutions.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5638692PMC
http://dx.doi.org/10.1109/TIP.2017.2692882DOI Listing

Publication Analysis

Top Keywords

dictionary learning
16
video coding
16
progressive dictionary
12
hierarchical predictive
12
predictive structure
12
scalable video
12
training prediction
8
enhancement layers
8
scalable extension
8
coding
6

Similar Publications

Modeling Optical Coherence Tomography (OCT) images is crucial for numerous image processing applications and aids ophthalmologists in the early detection of macular abnormalities. Sparse representation-based models, particularly dictionary learning (DL), play a pivotal role in image modeling. Traditional DL methods often transform higher-order tensors into vectors and then aggregate them into a matrix, which overlooks the inherent multi-dimensional structure of the data.

View Article and Find Full Text PDF

Compressive electron backscatter diffraction imaging.

J Microsc

January 2025

Department of Mechanical, Materials and Aerospace Engineering, University of Liverpool, Liverpool, UK.

Electron backscatter diffraction (EBSD) has developed over the last few decades into a valuable crystallographic characterisation method for a wide range of sample types. Despite these advances, issues such as the complexity of sample preparation, relatively slow acquisition, and damage in beam-sensitive samples, still limit the quantity and quality of interpretable data that can be obtained. To mitigate these issues, here we propose a method based on the subsampling of probe positions and subsequent reconstruction of an incomplete data set.

View Article and Find Full Text PDF

Objective: This study examines a novel teaching model that integrates the development and use of a Medical Cloud Dictionary with project-based learning (PBL). We investigate whether this integrated approach improves teaching effectiveness, enhances student learning outcomes, and reduces teaching pressure compared to traditional PBL.

Methods: One hundred student volunteers were randomly assigned to an experimental group (n = 50) and a control group (n = 50).

View Article and Find Full Text PDF

Background: Natural language processing (NLP) enables the extraction of information embedded within unstructured texts, such as clinical case reports and trial eligibility criteria. By identifying relevant medical concepts, NLP facilitates the generation of structured and actionable data, supporting complex tasks like cohort identification and the analysis of clinical records. To accomplish those tasks, we introduce a deep learning-based and lexicon-based named entity recognition (NER) tool for texts in Spanish.

View Article and Find Full Text PDF

Public Health Discussions on Social Media: Evaluating Automated Sentiment Analysis Methods.

JMIR Form Res

January 2025

Department of Health Administration, The College of Health Professions, Central Michigan University, Mt Pleasant, MI, United States.

Article Synopsis
  • Sentiment analysis is a key method for analyzing text, especially in social media research, where the choice between manual and automated methods is crucial.
  • The study compared several sentiment analysis tools, including VADER, TEXT2DATA, LIWC-22, and ChatGPT 4.0, against manually coded sentiment scores from YouTube comments on the opioid crisis, assessing factors like ease of use and cost.
  • Findings revealed that LIWC-22 excelled in identifying sentiment patterns, while VADER was best at classifying negative comments, but overall, automated tools showed only fair agreement with manual coding, with ChatGPT performing poorly.
View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!