RGBD Salient Object Detection via Deep Fusion.

Liangqiong Qu Shengfeng He Jiawei Zhang Jiandong Tian Yandong Tang Qingxiong Yang

IEEE Trans Image Process

Published: May 2017

Numerous efforts have been made to design various low-level saliency cues for RGBD saliency detection, such as color and depth contrast features as well as background and color compactness priors. However, how these low-level saliency cues interact with each other and how they can be effectively incorporated to generate a master saliency map remain challenging problems. In this paper, we design a new convolutional neural network (CNN) to automatically learn the interaction mechanism for RGBD salient object detection. In contrast to existing works, in which raw image pixels are fed directly to the CNN, the proposed method takes advantage of the knowledge obtained in traditional saliency detection by adopting various flexible and interpretable saliency feature vectors as inputs. This guides the CNN to learn a combination of existing features to predict saliency more effectively, which presents a less complex problem than operating on the pixels directly. We then integrate a superpixel-based Laplacian propagation framework with the trained CNN to extract a spatially consistent saliency map by exploiting the intrinsic structure of the input image. Extensive quantitative and qualitative experimental evaluations on three data sets demonstrate that the proposed method consistently outperforms the state-of-the-art methods.

Download full-text PDF	Source
http://dx.doi.org/10.1109/TIP.2017.2682981	DOI Listing

Publication Analysis

Top Keywords

rgbd salient

salient object

object detection

saliency

low-level saliency

saliency cues

saliency detection

saliency map

proposed method

detection

Similar Publications

Salient Object Detection From Arbitrary Modalities.

IEEE Trans Image Process

November 2024

Nianchang Huang Yang Yang Ruida Xi Qiang Zhang Jungong Han

Toward desirable saliency prediction, the types and numbers of inputs for a salient object detection (SOD) algorithm may dynamically change in many real-life applications. However, existing SOD algorithms are mainly designed or trained for one particular type of inputs, failing to be generalized to other types of inputs. Consequentially, more types of SOD algorithms need to be prepared in advance for handling different types of inputs, raising huge hardware and research costs.

View Article and Find Full Text PDF

Similar Publications

DMGNet: Depth mask guiding network for RGB-D salient object detection.

Neural Netw

December 2024

School of Electrical Engineering, Yanshan University, Qinhuangdao, Hebei 066004, China. Electronic address:

Yinggan Tang Mengyao Li

Though depth images can provide supplementary spatial structural cues for salient object detection (SOD) task, inappropriate utilization of depth features may introduce noisy or misleading features, which may greatly destroy SOD performance. To address this issue, we propose a depth mask guiding network (DMGNet) for RGB-D SOD. In this network, a depth mask guidance module (DMGM) is designed to pre-segment the salient objects from depth images and then create masks using pre-segmented objects to guide the RGB subnetwork to extract more discriminative features.

View Article and Find Full Text PDF

Similar Publications

CalibNet: Dual-Branch Cross-Modal Calibration for RGB-D Salient Instance Segmentation.

IEEE Trans Image Process

August 2024

Jialun Pei Tao Jiang He Tang Nian Liu Yueming Jin

In this study, we propose a novel approach for RGB-D salient instance segmentation using a dual-branch cross-modal feature calibration architecture called CalibNet. Our method simultaneously calibrates depth and RGB features in the kernel and mask branches to generate instance-aware kernels and mask features. CalibNet consists of three simple modules, a dynamic interactive kernel (DIK) and a weight-sharing fusion (WSF), which work together to generate effective instance-aware kernels and integrate cross-modal features.

View Article and Find Full Text PDF

Similar Publications

SLMSF-Net: A Semantic Localization and Multi-Scale Fusion Network for RGB-D Salient Object Detection.

Sensors (Basel)

February 2024

School of Information and Electronic Engineering, Zhejiang University of Science and Technology, Hangzhou 310023, China.

Yanbin Peng Zhinian Zhai Mingkun Feng

Salient Object Detection (SOD) in RGB-D images plays a crucial role in the field of computer vision, with its central aim being to identify and segment the most visually striking objects within a scene. However, optimizing the fusion of multi-modal and multi-scale features to enhance detection performance remains a challenge. To address this issue, we propose a network model based on semantic localization and multi-scale fusion (SLMSF-Net), specifically designed for RGB-D SOD.

View Article and Find Full Text PDF

Similar Publications

EM-Trans: Edge-Aware Multimodal Transformer for RGB-D Salient Object Detection.

IEEE Trans Neural Netw Learn Syst

February 2024

Geng Chen Qingyue Wang Bo Dong Ruitao Ma Nian Liu

RGB-D salient object detection (SOD) has gained tremendous attention in recent years. In particular, transformer has been employed and shown great potential. However, existing transformer models usually overlook the vital edge information, which is a major issue restricting the further improvement of SOD accuracy.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!