Dataset for detecting motorcyclists in pedestrian areas.

Data Brief

Computer Eng., Universidad Tecnologica de Bolívar, Km 1 vía a Turbaco, Cartagena, 130010, Bolívar, Colombia.

Published: October 2023

This paper presents a semi-automated, scalable, and homologous methodology towards IoT implemented in Python for extracting and integrating images in pedestrian and motorcyclist areas on the road for constructing a multiclass object classifier. It consists of two stages. The first stage deals with creating a non-debugged data set by acquiring images related to the semantic context previously mentioned, using an embedded device connected 24/7 via Wi-Fi to a free and public CCTV service in Medellin, Colombia. Through artificial vision techniques, and automatically performs a comparative chronological analysis to download the images observed by 80 cameras that report data asynchronously. The second stage proposes two algorithms focused on debugging the previously obtained data set. The first one facilitates the user in labeling the data set not debugged through Regions of Interest (ROI) and hotkeys. It decomposes the information in the nth image of the data set in the same dictionary to store it in a binary Pickle file. The second one is nothing more than an observer of the classification performed by the user through the first algorithm to allow the user to verify if the information contained in the Pickle file built is correct.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10558723PMC
http://dx.doi.org/10.1016/j.dib.2023.109610DOI Listing

Publication Analysis

Top Keywords

data set
16
pickle file
8
data
5
dataset detecting
4
detecting motorcyclists
4
motorcyclists pedestrian
4
pedestrian areas
4
areas paper
4
paper presents
4
presents semi-automated
4

Similar Publications

Background: Comprehensive clinical data regarding factors influencing the individual disease course of patients with movement disorders treated with deep brain stimulation might help to better understand disease progression and to develop individualized treatment approaches.

Methods: The clinical core data set was developed by a multidisciplinary working group within the German transregional collaborative research network ReTune. The development followed standardized methodology comprising review of available evidence, a consensus process and performance of the first phase of the study.

View Article and Find Full Text PDF

Objectives: The aim of this study was to determine the status of tertiary lymphoid structures (TLSs) using radiomic features in patients with invasive pulmonary adenocarcinoma (IA).

Methods: In this retrospective study, patients with IA from November 2015 to March 2024 were recruited from two independent centers (center 1, training and internal test data set; center 2, external test data set). TLS was divided into two groups according to hematoxylin-eosin staining.

View Article and Find Full Text PDF

A clinical narrative corpus on nut allergy: annotation schema, guidelines and use case.

Sci Data

January 2025

Computer Science and Engineering Department, Universidad Carlos III de Madrid, Av. Universidad, 30, Leganés, 28911, Madrid, Spain.

This article describes a dataset on nut allergy extracted from Spanish clinical records provided by the Hospital Universitario Fundación de Alcorcón (HUFA) in Madrid, Spain, in collaboration with its Allergology Unit and Information Systems and Technologies Department. There are few publicly available clinical texts in Spanish and having more is essential as a valuable resource to train and test information extraction systems. In total, 828 clinical notes in Spanish were employed and several experts participated in the annotation process by categorizing the annotated entities into medical semantic groups related to allergies.

View Article and Find Full Text PDF

The distinctive characteristics of an individual's T cell receptor repertoire are crucial in recognizing and responding to a diverse array of antigens, contributing to immune specificity and adaptability. The repertoire, famously vast due to a series of cellular mechanisms, can be quantified using repertoire sequencing. In this study, we sampled the repertoire of 85 women: ovarian cancer patients (OC) and healthy donors (HD), generating a dataset of T cell clones and their abundance.

View Article and Find Full Text PDF

Automatic Compliance Checking (ACC) within the Architecture, Engineering, and Construction (AEC) sector necessitates automating the interpretation of building regulations to achieve its full potential. Converting textual rules into machine-readable formats is challenging due to the complexities of natural language and the scarcity of resources for advanced Machine Learning (ML). Addressing these challenges, we introduce CODE-ACCORD, a dataset of 862 sentences from the building regulations of England and Finland.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!