Image compression distortion can cause performance degradation of machine analysis tasks, therefore recent years have witnessed fast progress in developing deep image compression methods optimized for machine perception. However, the investigation still lacks for saliency segmentation. First, in this paper we propose a deep compression network increasing local signal fidelity of important image pixels for saliency segmentation, which is different from existing methods utilizing the analysis network loss for backward propagation. By this means, these two types of networks can be decoupled to improve the compatibility of proposed compression method for diverse saliency segmentation networks. Second, pixel-level bit weights are modeled with probability distribution in the proposed bit allocation method. The ascending cosine roll-down (ACRD) function allocates bits to those important pixels, which fits the essence that saliency segmentation can be regarded as the pixel-level bi-classification task. Third, the compression network is trained without the help of saliency segmentation, where latent representations are decomposed into base and enhancement channels. Base channels are retained in the whole image, while enhancement channels are utilized only for important pixels, and therefore more bits can benefit saliency segmentation via enhancement channels. Extensive experimental results demonstrate that the proposed method can save an average of 10.34% bitrate compared with the state-of-the-art deep image compression method, where the rate-accuracy (R-A) performances are evaluated on sixteen downstream saliency segmentation networks with five conventional SOD datasets.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TIP.2024.3504282DOI Listing

Publication Analysis

Top Keywords

saliency segmentation
32
image compression
16
deep image
12
enhancement channels
12
saliency
8
bit allocation
8
compression network
8
compression method
8
segmentation networks
8
compression
7

Similar Publications

In the diagnosis and treatment of brain tumors, the automatic classification and segmentation of medical images play a pivotal role. Early detection facilitates timely intervention, significantly improving patient survival rates. This study introduces a novel method for the automated classification and segmentation of brain tumors, aiming to enhance both diagnostic accuracy and efficiency.

View Article and Find Full Text PDF

Optical Coherence Tomography (OCT) imaging is extensively utilized for non-invasive observation of pathological conditions, such as retinal fluid-associated diseases. Accurate fluid segmentation in OCT images is therefore critical for quantifying disease severity and aiding clinical decision-making. However, achieving precise segmentation remains challenging due to pathological variations in shape and size, uncertain boundaries, and low contrast of fluid.

View Article and Find Full Text PDF

Image compression distortion can cause performance degradation of machine analysis tasks, therefore recent years have witnessed fast progress in developing deep image compression methods optimized for machine perception. However, the investigation still lacks for saliency segmentation. First, in this paper we propose a deep compression network increasing local signal fidelity of important image pixels for saliency segmentation, which is different from existing methods utilizing the analysis network loss for backward propagation.

View Article and Find Full Text PDF

Semantic segmentation of volumetric medical images is essential for accurate delineation of anatomic structures and pathology, enabling quantitative analysis in precision medicine applications. While volumetric segmentation has been extensively studied, most existing methods require full supervision and struggle to generalize to new classes at inference time, particularly for irregular, ill-defined targets such as tumors, where fine-grained, high-salience segmentation is required. Consequently, conventional semantic segmentation methods cannot easily offer zero/few-shot generalization to segment objects of interest beyond their closed training set.

View Article and Find Full Text PDF

Introduction: Providing tools for comprehensively evaluating scintigraphy images could enhance transthyretin amyloid cardiomyopathy (ATTR-CM) diagnosis. This study aims to automatically detect and score ATTR-CM in total body scintigraphy images using deep learning on multi-tracer, multi-scanner, and multi-center datasets.

Methods: In the current study, we employed six datasets (from 12 cameras) for various tasks and purposes.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!