The ability to train on a large dataset of labeled samples is critical to the success of deep learning in many domains. In this paper, we focus on motor vehicle classification and localization from a single video frame and introduce the "MIOvision Traffic Camera Dataset" (MIO-TCD) in this context. MIO-TCD is the largest dataset for motorized traffic analysis to date. It includes 11 traffic object classes such as cars, trucks, buses, motorcycles, bicycles, pedestrians. It contains 786,702 annotated images acquired at different times of the day and different periods of the year by hundreds of traffic surveillance cameras deployed across Canada and the United States. The dataset consists of two parts: a "localization dataset", containing 137,743 full video frames with bounding boxes around traffic objects, and a "classification dataset", containing 648,959 crops of traffic objects from the 11 classes. We also report results from the 2017 CVPR MIO-TCD Challenge, that leveraged this dataset, and compare them with results for state-of-the-art deep learning architectures. These results demonstrate the viability of deep learning methods for vehicle localization and classification from a single video frame in real-life traffic scenarios. The topperforming methods achieve both accuracy and Kappa score above 96% on the classification dataset and mean-average precision of 77% on the localization dataset. We also identify scenarios in which state-of-the-art methods still fail and we suggest avenues to address these challenges. Both the dataset and detailed results are publicly available on-line [1].

Download full-text PDF

Source
http://dx.doi.org/10.1109/TIP.2018.2848705DOI Listing

Publication Analysis

Top Keywords

deep learning
12
dataset
8
vehicle classification
8
classification localization
8
single video
8
video frame
8
traffic objects
8
traffic
7
mio-tcd
4
mio-tcd benchmark
4

Similar Publications

Objective: Segmentation of individual thigh muscles in MRI images is essential for monitoring neuromuscular diseases and quantifying relevant biomarkers such as fat fraction (FF). Deep learning approaches such as U-Net have demonstrated effectiveness in this field. However, the impact of reducing neural network complexity remains unexplored in the FF quantification in individual muscles.

View Article and Find Full Text PDF

A new vision of the role of the cerebellum in pain processing.

J Neural Transm (Vienna)

January 2025

Postgraduate Program in Physical Therapy (PPGFT), Department of Physical Therapy (DFisio), University of São Carlos (UFSCar), Washington Luis Road, Km 235, São Carlos, São Paulo, 13565-905, Brazil.

The cerebellum is a structure in the suprasegmental nervous system classically known for its involvement in motor functions such as motor planning, coordination, and motor learning. However, with scientific advances, other functions of the cerebellum, such as cognitive, emotional, and autonomic processing, have been discovered. Currently, there is a body of evidence demonstrating the involvement of the cerebellum in nociception and pain processing.

View Article and Find Full Text PDF

Background: Recent advances in artificial intelligence have facilitated the automatic diagnosis of middle ear diseases using endoscopic tympanic membrane imaging.

Aim: We aimed to develop an automated diagnostic system for middle ear diseases by applying deep learning techniques to tympanic membrane images obtained during routine clinical practice.

Material And Methods: To augment the training dataset, we explored the use of generative adversarial networks (GANs) to produce high-quality synthetic tympanic images that were subsequently added to the training data.

View Article and Find Full Text PDF

Integrating Model-Informed Drug Development With AI: A Synergistic Approach to Accelerating Pharmaceutical Innovation.

Clin Transl Sci

January 2025

Global Biometrics and Data Management, Pfizer Research and Development, New York, New York, USA.

The pharmaceutical industry constantly strives to improve drug development processes to reduce costs, increase efficiencies, and enhance therapeutic outcomes for patients. Model-Informed Drug Development (MIDD) uses mathematical models to simulate intricate processes involved in drug absorption, distribution, metabolism, and excretion, as well as pharmacokinetics and pharmacodynamics. Artificial intelligence (AI), encompassing techniques such as machine learning, deep learning, and Generative AI, offers powerful tools and algorithms to efficiently identify meaningful patterns, correlations, and drug-target interactions from big data, enabling more accurate predictions and novel hypothesis generation.

View Article and Find Full Text PDF

Self-Driving Microscopes: AI Meets Super-Resolution Microscopy.

Small Methods

January 2025

Dept. Chemical Engineering and Biotechnology, University of Cambridge, Cambridge, CB3 0AS, UK.

The integration of Machine Learning (ML) with super-resolution microscopy represents a transformative advancement in biomedical research. Recent advances in ML, particularly deep learning (DL), have significantly enhanced image processing tasks, such as denoising and reconstruction. This review explores the growing potential of automation in super-resolution microscopy, focusing on how DL can enable autonomous imaging tasks.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!