Keyframe image processing of semantic 3D point clouds based on deep learning.

Front Neurorobot

Department of English, Faculty of Arts and Humanities, University of Macau, Zhuhai, China.

Published: January 2023

AI Article Synopsis

Article Abstract

With the rapid development of web technologies and the popularity of smartphones, users are uploading and sharing a large number of images every day. Therefore, it is a very important issue nowadays to enable users to discover exactly the information they need in the vast amount of data and to make it possible to integrate their large amount of image material efficiently. However, traditional content-based image retrieval techniques are based on images, and there is a "semantic gap" between this and people's understanding of images. To address this "semantic gap," a keyframe image processing method for 3D point clouds is proposed, and based on this, a U-Net-based binary data stream semantic segmentation network is established for keyframe image processing of 3D point clouds in combination with deep learning techniques.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9890954PMC
http://dx.doi.org/10.3389/fnbot.2022.988024DOI Listing

Publication Analysis

Top Keywords

keyframe image
12
image processing
12
point clouds
12
deep learning
8
"semantic gap"
8
processing semantic
4
semantic point
4
clouds based
4
based deep
4
learning rapid
4

Similar Publications

Event-Based Visual/Inertial Odometry for UAV Indoor Navigation.

Sensors (Basel)

December 2024

SOTI Aerospace, SOTI Inc., Mississauga, ON L5N 8L9, Canada.

Indoor navigation is becoming increasingly essential for multiple applications. It is complex and challenging due to dynamic scenes, limited space, and, more importantly, the unavailability of global navigation satellite system (GNSS) signals. Recently, new sensors have emerged, namely event cameras, which show great potential for indoor navigation due to their high dynamic range and low latency.

View Article and Find Full Text PDF

Background: Accurate preoperative prediction of cervical lymph node metastasis (LNM) for papillary thyroid carcinoma (PTC) patients is essential for disease staging and individualized treatment planning, which can improve prognosis and facilitate better management.

Purpose: To establish a fully automated deep learning-enabled model (FADLM) for automated tumor segmentation and cervical LNM prediction in PTC using ultrasound (US) video keyframes.

Methods: The bicentral study retrospective enrolled 518 PTC patients, who were then randomly divided into the training (Hospital 1, n = 340), internal test (Hospital 1, n = 83), and external test cohorts (Hospital 2, n = 95).

View Article and Find Full Text PDF

This study aims to improve the helicopter electric power inspection process by using the feature embedding convolution (FEC) model to solve the problems of small scope and poor real-time inspection. First, simulation experiments and model analysis determine the keyframe and flight trajectory. Second, an improved FEC model is proposed, extracting features from aerial images in large ranges in real time and accurately identifying and classifying electric power inspection targets.

View Article and Find Full Text PDF

IG-Net: An Instrument-guided real-time semantic segmentation framework for prostate dissection during surgery for low rectal cancer.

Comput Methods Programs Biomed

December 2024

Division of Colorectal Surgery, Department of General Surgery, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, No. 1 Shuai Fu Yuan, Dongcheng District, Beijing, 100730, China. Electronic address:

Background And Objective: Accurate prostate dissection is crucial in transanal surgery for patients with low rectal cancer. Improper dissection can lead to adverse events such as urethral injury, severely affecting the patient's postoperative recovery. However, unclear boundaries, irregular shape of the prostate, and obstructive factors such as smoke present significant challenges for surgeons.

View Article and Find Full Text PDF

We present the EuroCity Persons (ECP) 2.0 dataset, a novel image dataset for person detection, tracking and prediction in traffic. The dataset was collected on-board a vehicle driving through 29 cities in 11 European countries.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!