Video segmentation is an important building block for high level applications such as scene understanding and interaction analysis. While outstanding results are achieved in this field by state-of-the-art learning and model based methods, they are restricted to certain types of scenes or require a large amount of annotated training data to achieve object segmentation in generic scenes. On the other hand, RGBD data, widely available with the introduction of consumer depth sensors, provides actual world 3D geometry compared to 2D images. The explicit geometry in RGBD data greatly helps in computer vision tasks, but the lack of annotations in this type of data may also hinder the extension of learning based methods to RGBD. In this paper, we present a novel generic segmentation approach for 3D point cloud video (stream data) thoroughly exploiting the explicit geometry in RGBD. Our proposal is only based on low level features, such as connectivity and compactness. We exploit temporal coherence by representing the rough estimation of objects in a single frame with a hierarchical structure, and propagating this hierarchy along time. The hierarchical structure provides an efficient way to establish temporal correspondences at different scales of object-connectivity, and to temporally manage the splits and merges of objects. This allows updating the segmentation according to the evidence observed in the history. The proposed method is evaluated on several challenging datasets, with promising results for the presented approach.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TIP.2018.2811541DOI Listing

Publication Analysis

Top Keywords

point cloud
8
cloud video
8
video segmentation
8
segmentation generic
8
generic scenes
8
based methods
8
rgbd data
8
explicit geometry
8
geometry rgbd
8
hierarchical structure
8

Similar Publications

Copy number variation (CNV) is an important part of human genetic variations, which is associated with various kinds of diseases. To tackle the limitations of traditional CNV detection methods, such as restricted detection types, high error rates, and challenges in precisely identifying the location of variant breakpoints, a new method called MSCNV (copy number variations detection method for multi-strategies integration based on a one-class support vector machine model) is proposed. MSCNV establishes a multi-signal channel that integrates three strategies: read depth, split read, and read pair.

View Article and Find Full Text PDF

The attention mechanism has significantly progressed in various point cloud tasks. Benefiting from its significant competence in capturing long-range dependencies, research in point cloud completion has achieved promising results. However, the typically disordered point cloud data features complicated non-Euclidean geometric structures and exhibits unpredictable behavior.

View Article and Find Full Text PDF

WildPose: A long-range 3D wildlife motion capture system.

J Exp Biol

January 2025

African Robotics Unit, University of Cape Town, Cape Town, 7700, Western Cape, South Africa.

Understanding and monitoring wildlife behavior is crucial in ecology and biomechanics, yet challenging due to the limitations of current methods. To address this issue, we introduce WildPose, a novel long-range motion capture system specifically tailored for free-ranging wildlife observation. This system combines an electronically controllable zoom-lens camera with a LiDAR to capture both 2D videos and 3D point cloud data, thereby allowing researchers to observe high-fidelity animal morphometrics, behavior and interactions in a completely remote manner.

View Article and Find Full Text PDF

Fast surface reconstruction algorithm with adaptive step size.

PLoS One

January 2025

School of Mathematics and Finance, Hunan University of Humanities, Science and Technology, Loudi, China.

In (Dai et al. 2023), the authors proposed a fast algorithm for surface reconstruction that converges rapidly from point cloud data by alternating Anderson extrapolation with implicit progressive iterative approximation (I-PIA). This algorithm employs a fixed step size during iterations to enhance convergence.

View Article and Find Full Text PDF

Generative design of crystal structures by point cloud representations and diffusion model.

iScience

January 2025

School of Physics and Electronic Engineering, Jiangsu University, Zhenjiang, Jiangsu 212013, P.R. China.

Efficiently generating energetically stable crystal structures has long been a challenge in material design, primarily due to the immense arrangement of atoms in a crystal lattice. To facilitate the discovery of stable materials, we present a framework for the generation of synthesizable materials leveraging a point cloud representation to encode intricate structural information. At the heart of this framework lies the introduction of a diffusion model as its foundational pillar.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!