Dictionary learning has emerged as a promising alternative to the conventional hybrid coding framework. However, the rigid structure of sequential training and prediction degrades its performance in scalable video coding. This paper proposes a progressive dictionary learning framework with hierarchical predictive structure for scalable video coding, especially in low bitrate region. For pyramidal layers, sparse representation based on spatio-temporal dictionary is adopted to improve the coding efficiency of enhancement layers with a guarantee of reconstruction performance. The overcomplete dictionary is trained to adaptively capture local structures along motion trajectories as well as exploit the correlations between the neighboring layers of resolutions. Furthermore, progressive dictionary learning is developed to enable the scalability in temporal domain and restrict the error propagation in a closed-loop predictor. Under the hierarchical predictive structure, online learning is leveraged to guarantee the training and prediction performance with an improved convergence rate. To accommodate with the state-of-the-art scalable extension of H.264/AVC and latest High Efficiency Video Coding (HEVC), standardized codec cores are utilized to encode the base and enhancement layers. Experimental results show that the proposed method outperforms the latest scalable extension of HEVC and HEVC simulcast over extensive test sequences with various resolutions.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5638692 | PMC |
http://dx.doi.org/10.1109/TIP.2017.2692882 | DOI Listing |
Artif Intell Med
December 2024
Medical Image and Signal Processing Research Center, Isfahan University of Medical Sciences, Isfahan 81746-73461, Iran. Electronic address:
Modeling Optical Coherence Tomography (OCT) images is crucial for numerous image processing applications and aids ophthalmologists in the early detection of macular abnormalities. Sparse representation-based models, particularly dictionary learning (DL), play a pivotal role in image modeling. Traditional DL methods often transform higher-order tensors into vectors and then aggregate them into a matrix, which overlooks the inherent multi-dimensional structure of the data.
View Article and Find Full Text PDFJ Microsc
January 2025
Department of Mechanical, Materials and Aerospace Engineering, University of Liverpool, Liverpool, UK.
Electron backscatter diffraction (EBSD) has developed over the last few decades into a valuable crystallographic characterisation method for a wide range of sample types. Despite these advances, issues such as the complexity of sample preparation, relatively slow acquisition, and damage in beam-sensitive samples, still limit the quantity and quality of interpretable data that can be obtained. To mitigate these issues, here we propose a method based on the subsampling of probe positions and subsequent reconstruction of an incomplete data set.
View Article and Find Full Text PDFBMC Med Educ
January 2025
The First Clinical Medicine School of Guangdong Pharmaceutical University, Guangdong, People's Republic of China.
Objective: This study examines a novel teaching model that integrates the development and use of a Medical Cloud Dictionary with project-based learning (PBL). We investigate whether this integrated approach improves teaching effectiveness, enhances student learning outcomes, and reduces teaching pressure compared to traditional PBL.
Methods: One hundred student volunteers were randomly assigned to an experimental group (n = 50) and a control group (n = 50).
BMC Bioinformatics
January 2025
Centro de Salud Retiro, Hospital Universitario Gregorio Marañon, C/Lope de Rueda, 43, 28009, Madrid, Spain.
Background: Natural language processing (NLP) enables the extraction of information embedded within unstructured texts, such as clinical case reports and trial eligibility criteria. By identifying relevant medical concepts, NLP facilitates the generation of structured and actionable data, supporting complex tasks like cohort identification and the analysis of clinical records. To accomplish those tasks, we introduce a deep learning-based and lexicon-based named entity recognition (NER) tool for texts in Spanish.
View Article and Find Full Text PDFJMIR Form Res
January 2025
Department of Health Administration, The College of Health Professions, Central Michigan University, Mt Pleasant, MI, United States.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!