A pothole video dataset for semantic segmentation.

Data Brief

Department of Computer Science and Electronics, Universitas Gadjah Mada, Indonesia.

Published: April 2024

This paper introduces a video dataset for semantic segmentation of road potholes. This dataset contains 619 high-resolution videos captured in January 2023, covering locations in eight villages within the Hulu Sungai Tengah regency of South Kalimantan, Indonesia. The dataset is divided into three main folders, namely train, val, and test. The train, val, and test folders contain 372 videos for training, 124 videos for validation, and 123 videos for testing, respectively. Each of these main folders has two subfolders, ``RGB'' for the video in the RGB format and ``mask'' for the ground truth segmentation. These videos are precisely two seconds long, containing 48 frames each, and all are in MP4 format. The dataset offers remarkable flexibility, accommodating various research needs, from full-video segmentation to frame extraction. It enables researchers to create ground truth annotations and change the combination of videos in the folders according to their needs. This resource is an asset for researchers, engineers, policymakers, and anyone interested in advancing algorithms for pothole detection and analysis. This dataset allows for benchmarking semantic segmentation algorithms, conducting comparative studies on pothole detection methods, and exploring innovative approaches, offering valuable contributions to the computer vision community.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10867608PMC
http://dx.doi.org/10.1016/j.dib.2024.110131DOI Listing

Publication Analysis

Top Keywords

semantic segmentation
12
video dataset
8
dataset semantic
8
main folders
8
train val
8
val test
8
ground truth
8
pothole detection
8
dataset
6
videos
6

Similar Publications

Physics-Based Synthetic Data Model for Automated Segmentation in Catalysis Microscopy.

Microsc Microanal

January 2025

Fritz-Haber-Institut der Max-Planck-Gesellschaft, Berlin 14195, Germany.

In catalysis research, the amount of microscopy data acquired when imaging dynamic processes is often too much for nonautomated quantitative analysis. Developing machine learned segmentation models is challenged by the requirement of high-quality annotated training data. We thus substitute expert-annotated data with a physics-based sequential synthetic data model.

View Article and Find Full Text PDF

Glaucoma, a severe eye disease leading to irreversible vision loss if untreated, remains a significant challenge in healthcare due to the complexity of its detection. Traditional methods rely on clinical examinations of fundus images, assessing features like optic cup and disc sizes, rim thickness, and other ocular deformities. Recent advancements in artificial intelligence have introduced new opportunities for enhancing glaucoma detection.

View Article and Find Full Text PDF

Background: EUS-guided fine-needle biopsy is the procedure of choice for the diagnosis of pancreatic ductal adenocarcinoma (PDAC). Nevertheless, the samples obtained are small and require expertise in pathology, whereas the diagnosis is difficult in view of the scarcity of malignant cells and the important desmoplastic reaction of these tumors. With the help of artificial intelligence, the deep learning architectures produce a fast, accurate, and automated approach for PDAC image segmentation based on whole-slide imaging.

View Article and Find Full Text PDF

Glo-net: A dual task branch based neural network for multi-class glomeruli segmentation.

Comput Biol Med

January 2025

Jiangsu Key Laboratory of Intelligent Medical Image Computing, School of Future Technology, Nanjing University of Information Science and Technology, Nanjing, 210044, China. Electronic address:

Accurate segmentation and classification of glomeruli are fundamental to histopathology slide analysis in renal pathology, which helps to characterize individual kidney disease. Accurate segmentation of glomeruli of different types faces two main challenges compared to traditional primitives segmentation in computational image analysis. Limited by small kernel size, traditional convolutional neural networks could hardly understand the complete context information of different glomeruli.

View Article and Find Full Text PDF

Data-Efficient Bone Segmentation Using Feature Pyramid- Based SegFormer.

Sensors (Basel)

December 2024

Master's Program in Information and Computer Science, Doshisha University, Kyoto 610-0394, Japan.

The semantic segmentation of bone structures demands pixel-level classification accuracy to create reliable bone models for diagnosis. While Convolutional Neural Networks (CNNs) are commonly used for segmentation, they often struggle with complex shapes due to their focus on texture features and limited ability to incorporate positional information. As orthopedic surgery increasingly requires precise automatic diagnosis, we explored SegFormer, an enhanced Vision Transformer model that better handles spatial awareness in segmentation tasks.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!