This paper introduces a video dataset for semantic segmentation of road potholes. The dataset contains 619 high-resolution videos captured in January 2023 in eight villages within the Hulu Sungai Tengah regency of South Kalimantan, Indonesia. It is divided into three main folders, train, val, and test, which contain 372 videos for training, 124 for validation, and 123 for testing, respectively. Each main folder has two subfolders: "RGB" for the videos in RGB format and "mask" for the ground truth segmentation. Every video is in MP4 format, exactly two seconds long, and contains 48 frames. The dataset accommodates a range of research needs, from full-video segmentation to frame extraction. Researchers can also create their own ground truth annotations and change the combination of videos in the folders according to their needs. The resource is aimed at researchers, engineers, policymakers, and anyone interested in advancing algorithms for pothole detection and analysis. It supports benchmarking semantic segmentation algorithms, comparative studies of pothole detection methods, and the exploration of new approaches in computer vision.
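As an illustration of the folder layout described above, the following is a minimal sketch of how the RGB and mask videos of one split might be paired, together with the totals implied by the stated counts. The folder names come from the abstract; the function names, the `.mp4` file extension pattern, and in particular the assumption that a mask video carries the same file name as its RGB video are hypothetical and should be checked against the downloaded dataset.

```python
from pathlib import Path

# Counts and clip properties as stated in the abstract.
SPLIT_COUNTS = {"train": 372, "val": 124, "test": 123}
FRAMES_PER_VIDEO = 48
SECONDS_PER_VIDEO = 2


def pair_videos(root, split):
    """Return sorted (rgb_path, mask_path) pairs for one split.

    Assumption (not stated in the abstract): a mask video has the same
    file name as its RGB video. RGB videos without a matching mask file
    are skipped.
    """
    rgb_dir = Path(root) / split / "RGB"
    mask_dir = Path(root) / split / "mask"
    pairs = []
    for rgb_path in sorted(rgb_dir.glob("*.mp4")):
        mask_path = mask_dir / rgb_path.name
        if mask_path.is_file():
            pairs.append((rgb_path, mask_path))
    return pairs


def dataset_summary():
    """Derive totals implied by the stated per-split counts."""
    total_videos = sum(SPLIT_COUNTS.values())
    return {
        "total_videos": total_videos,
        "total_frames": total_videos * FRAMES_PER_VIDEO,
        "fps": FRAMES_PER_VIDEO / SECONDS_PER_VIDEO,
        "split_fraction": {
            name: round(count / total_videos, 3)
            for name, count in SPLIT_COUNTS.items()
        },
    }
```

Under these assumptions, `pair_videos("path/to/dataset", "train")` would list the training pairs, and `dataset_summary()` reproduces the 619-video total along with the frame rate implied by 48 frames over two seconds.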
| Download full-text PDF | Source |
|---|---|
| http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10867608 | PMC |
| http://dx.doi.org/10.1016/j.dib.2024.110131 | DOI Listing |
Microsc Microanal
January 2025
Fritz-Haber-Institut der Max-Planck-Gesellschaft, Berlin 14195, Germany.
In catalysis research, the amount of microscopy data acquired when imaging dynamic processes is often too much for nonautomated quantitative analysis. Developing machine learned segmentation models is challenged by the requirement of high-quality annotated training data. We thus substitute expert-annotated data with a physics-based sequential synthetic data model.
MethodsX
June 2025
Assistant Professor, Department of Electronics and Communication Engineering, Vel Tech Rangarajan Dr. Sagunthala R&D Institute of Science and Technology, Tamil Nadu, 600062, India.
Glaucoma, a severe eye disease leading to irreversible vision loss if untreated, remains a significant challenge in healthcare due to the complexity of its detection. Traditional methods rely on clinical examinations of fundus images, assessing features like optic cup and disc sizes, rim thickness, and other ocular deformities. Recent advancements in artificial intelligence have introduced new opportunities for enhancing glaucoma detection.
Endosc Ultrasound
December 2024
Department of Gastroenterology, Ponderas Academic Hospital, Bucharest, Romania.
Background: EUS-guided fine-needle biopsy is the procedure of choice for the diagnosis of pancreatic ductal adenocarcinoma (PDAC). Nevertheless, the samples obtained are small and require expertise in pathology, whereas the diagnosis is difficult in view of the scarcity of malignant cells and the important desmoplastic reaction of these tumors. With the help of artificial intelligence, the deep learning architectures produce a fast, accurate, and automated approach for PDAC image segmentation based on whole-slide imaging.
Comput Biol Med
January 2025
Jiangsu Key Laboratory of Intelligent Medical Image Computing, School of Future Technology, Nanjing University of Information Science and Technology, Nanjing, 210044, China.
Accurate segmentation and classification of glomeruli are fundamental to histopathology slide analysis in renal pathology, which helps to characterize individual kidney disease. Accurate segmentation of glomeruli of different types faces two main challenges compared to traditional primitives segmentation in computational image analysis. Limited by small kernel size, traditional convolutional neural networks could hardly understand the complete context information of different glomeruli.
Sensors (Basel)
December 2024
Master's Program in Information and Computer Science, Doshisha University, Kyoto 610-0394, Japan.
The semantic segmentation of bone structures demands pixel-level classification accuracy to create reliable bone models for diagnosis. While Convolutional Neural Networks (CNNs) are commonly used for segmentation, they often struggle with complex shapes due to their focus on texture features and limited ability to incorporate positional information. As orthopedic surgery increasingly requires precise automatic diagnosis, we explored SegFormer, an enhanced Vision Transformer model that better handles spatial awareness in segmentation tasks.