On scene segmentation and histograms-based curve evolution.

IEEE Trans Pattern Anal Mach Intell

Department of Computer Science, Technion - Israel Institute of Technology, Haifa, Israel.

Published: September 2009

We consider curve evolution based on comparing distributions of features, and its applications for scene segmentation. In the first part, we promote using cross-bin metrics such as the Earth Mover's Distance (EMD), instead of standard bin-wise metrics as the Bhattacharyya or Kullback-Leibler metrics. To derive flow equations for minimizing functionals involving the EMD, we employ a tractable expression for calculating EMD between one-dimensional distributions. We then apply the derived flows to various examples of single image segmentation, and to scene analysis using video data. In the latter, we consider the problem of segmenting a scene to spatial regions in which different activities occur. We use a nonparametric local representation of the regions by considering multiple one-dimensional histograms of normalized spatiotemporal derivatives. We then obtain semisupervised segmentation of regions using the flows derived in the first part of the paper. Our results are demonstrated on challenging surveillance scenes, and compare favorably with state-of-the-art results using parametric representations by dynamic systems or mixtures of them.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TPAMI.2009.21DOI Listing

Publication Analysis

Top Keywords

scene segmentation
8
curve evolution
8
scene
4
segmentation histograms-based
4
histograms-based curve
4
evolution consider
4
consider curve
4
evolution based
4
based comparing
4
comparing distributions
4

Similar Publications

A reward shaping deep deterministic policy gradient (RS-DDPG) and simultaneous localization and mapping (SLAM) path tracking algorithm is proposed to address the issues of low accuracy and poor robustness of target path tracking for robotic control during maneuver. RS-DDPG algorithm is based on deep reinforcement learning (DRL) and designs a reward function to optimize the parameters of DDPG to achieve the required tracking accuracy and stability. A visual SLAM algorithm based on semantic segmentation and geometric information is proposed to address the issues of poor robustness and susceptibility to interference from dynamic objects in dynamic scenes for SLAM based on visual sensors.

View Article and Find Full Text PDF

This paper presents a synthetic holographic stereogram printing approach that integrates neural radiance fields (NeRF) with the effective perspective images segmentation and mosaicking (EPISM) method. Sparse perspectives of a 3D scene are captured through random sampling and used to train a NeRF model with multi-resolution hash encoding, enabling rapid construction of an implicit scene representation. The EPISM method calculates the camera pose parameters needed for parallax images, which are rendered through the trained neural network.

View Article and Find Full Text PDF

Travelable area boundaries not only constrain the movement of field robots but also indicate alternative guiding routes for dynamic objects. Publicly available road boundary datasets have outlined boundaries by binary segmentation labels. However, hard post-processes have to be done to extract from detected boundaries further semantics including the shapes of the boundaries and guiding routes, which poses challenges to a real-time visual navigation system without detailed prior maps.

View Article and Find Full Text PDF

Objective: To develop a deep learning (DL) model for carotid plaque detection based on CTA images and evaluate the clinical application feasibility and value of the model.

Methods: We retrospectively collected data from patients with carotid atherosclerotic plaques who underwent continuous CTA examinations of the head and neck at a tertiary hospital from October 2020 to October 2022. The model combined ResUNet with the Pyramid Scene Parsing Network (PSPNet) to enhance plaque segmentation.

View Article and Find Full Text PDF

The advancement of neural radiance fields (NeRFs) has facilitated the high-quality 3D reconstruction of complex scenes. However, for most NeRFs, reconstructing 3D tissues from endoscopy images poses significant challenges due to the occlusion of soft tissue regions by invalid pixels, deformations in soft tissue, and poor image quality, which severely limits their application in endoscopic scenarios. To address the above issues, we propose a novel framework to reconstruct high-fidelity soft tissue scenes from low-quality endoscopic images.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!