MIO-TCD: A new benchmark dataset for vehicle classification and localization.

Zhiming Luo, Frederic B-Charron, Carl Lemaire, Janusz Konrad, Shaozi Li, Akshaya Mishra, Andrew Achkar, Justin Eichel, Pierre-Marc Jodoin

IEEE Trans Image Process

Published: June 2018

The ability to train on a large dataset of labeled samples is critical to the success of deep learning in many domains. In this paper, we focus on motor vehicle classification and localization from a single video frame and introduce the "MIOvision Traffic Camera Dataset" (MIO-TCD) in this context. MIO-TCD is the largest dataset for motorized traffic analysis to date. It includes 11 traffic object classes, such as cars, trucks, buses, motorcycles, bicycles, and pedestrians. It contains 786,702 annotated images acquired at different times of the day and different periods of the year by hundreds of traffic surveillance cameras deployed across Canada and the United States. The dataset consists of two parts: a "localization dataset", containing 137,743 full video frames with bounding boxes around traffic objects, and a "classification dataset", containing 648,959 crops of traffic objects from the 11 classes. We also report results from the 2017 CVPR MIO-TCD Challenge, which leveraged this dataset, and compare them with results for state-of-the-art deep learning architectures. These results demonstrate the viability of deep learning methods for vehicle localization and classification from a single video frame in real-life traffic scenarios. The top-performing methods achieve both accuracy and Kappa score above 96% on the classification dataset and a mean average precision of 77% on the localization dataset. We also identify scenarios in which state-of-the-art methods still fail and suggest avenues to address these challenges. Both the dataset and detailed results are publicly available online [1].
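The abstract reports accuracy and a Kappa score for the classification dataset without defining them. The sketch below shows how the two metrics relate, assuming that "Kappa score" refers to Cohen's kappa in its standard form (observed agreement corrected for chance agreement). The function name and the toy labels are hypothetical and for illustration only; this is not the challenge's evaluation code, and the numbers are not results from the paper.

```python
from collections import Counter


def accuracy_and_kappa(y_true, y_pred):
    """Return (accuracy, Cohen's kappa) for two equal-length label sequences."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("y_true and y_pred must be non-empty and of equal length")

    n = len(y_true)

    # Observed agreement: fraction of samples labeled correctly (plain accuracy).
    observed = sum(t == p for t, p in zip(y_true, y_pred)) / n

    # Chance agreement: probability that truth and prediction coincide if each
    # were drawn independently from its own marginal class distribution.
    true_counts = Counter(y_true)
    pred_counts = Counter(y_pred)
    expected = sum(true_counts[c] * pred_counts[c] for c in true_counts) / (n * n)

    if expected == 1.0:
        # Only one class appears in both sequences; kappa is undefined.
        return observed, float("nan")

    kappa = (observed - expected) / (1.0 - expected)
    return observed, kappa


# Toy labels (made up for illustration, not taken from MIO-TCD).
y_true = ["car", "car", "bus", "bus"]
y_pred = ["car", "car", "bus", "car"]
acc, kappa = accuracy_and_kappa(y_true, y_pred)
print(f"accuracy={acc:.2f}, kappa={kappa:.2f}")
```

Because kappa subtracts the agreement expected by chance, it is typically lower than accuracy when the class distribution is imbalanced, which is one reason a benchmark may report both.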

DOI: http://dx.doi.org/10.1109/TIP.2018.2848705

Publication Analysis

Top Keywords

deep learning

dataset

vehicle classification

classification localization

single video

video frame

traffic objects

traffic

mio-tcd

mio-tcd benchmark

Similar Publications

Importance of neural network complexity for the automatic segmentation of individual thigh muscles in MRI images from patients with neuromuscular diseases.

MAGMA

January 2025

Aix Marseille Univ, CNRS, CRMBM, Marseille, France.

Sandra Martin, Rémi André, Amira Trabelsi, Constance P Michel, Etienne Fortanier

Objective: Segmentation of individual thigh muscles in MRI images is essential for monitoring neuromuscular diseases and quantifying relevant biomarkers such as fat fraction (FF). Deep learning approaches such as U-Net have demonstrated effectiveness in this field. However, the impact of reducing neural network complexity remains unexplored in the FF quantification in individual muscles.

A new vision of the role of the cerebellum in pain processing.

J Neural Transm (Vienna)

January 2025

Postgraduate Program in Physical Therapy (PPGFT), Department of Physical Therapy (DFisio), University of São Carlos (UFSCar), Washington Luis Road, Km 235, São Carlos, São Paulo, 13565-905, Brazil.

José Mário Prati, Anna Carolyna Gianlorenço

The cerebellum is a structure in the suprasegmental nervous system classically known for its involvement in motor functions such as motor planning, coordination, and motor learning. However, with scientific advances, other functions of the cerebellum, such as cognitive, emotional, and autonomic processing, have been discovered. Currently, there is a body of evidence demonstrating the involvement of the cerebellum in nociception and pain processing.

Deep learning multi-classification of middle ear diseases using synthetic tympanic images.

Acta Otolaryngol

January 2025

Department of Otorhinolaryngology, Institute of Science Tokyo, Tokyo, Japan.

Yoshimaru Mizoguchi, Taku Ito, Masato Yamada, Takeshi Tsutsumi

Background: Recent advances in artificial intelligence have facilitated the automatic diagnosis of middle ear diseases using endoscopic tympanic membrane imaging.

Aim: We aimed to develop an automated diagnostic system for middle ear diseases by applying deep learning techniques to tympanic membrane images obtained during routine clinical practice.

Material And Methods: To augment the training dataset, we explored the use of generative adversarial networks (GANs) to produce high-quality synthetic tympanic images that were subsequently added to the training data.

Integrating Model-Informed Drug Development With AI: A Synergistic Approach to Accelerating Pharmaceutical Innovation.

Clin Transl Sci

January 2025

Global Biometrics and Data Management, Pfizer Research and Development, New York, New York, USA.

Karthik Raman, Rukmini Kumar, Cynthia J Musante, Subha Madhavan

The pharmaceutical industry constantly strives to improve drug development processes to reduce costs, increase efficiencies, and enhance therapeutic outcomes for patients. Model-Informed Drug Development (MIDD) uses mathematical models to simulate intricate processes involved in drug absorption, distribution, metabolism, and excretion, as well as pharmacokinetics and pharmacodynamics. Artificial intelligence (AI), encompassing techniques such as machine learning, deep learning, and Generative AI, offers powerful tools and algorithms to efficiently identify meaningful patterns, correlations, and drug-target interactions from big data, enabling more accurate predictions and novel hypothesis generation.

Self-Driving Microscopes: AI Meets Super-Resolution Microscopy.

Small Methods

January 2025

Dept. Chemical Engineering and Biotechnology, University of Cambridge, Cambridge, CB3 0AS, UK.

Edward N Ward, Anna Scheeder, Max Barysevich, Clemens F Kaminski

The integration of Machine Learning (ML) with super-resolution microscopy represents a transformative advancement in biomedical research. Recent advances in ML, particularly deep learning (DL), have significantly enhanced image processing tasks, such as denoising and reconstruction. This review explores the growing potential of automation in super-resolution microscopy, focusing on how DL can enable autonomous imaging tasks.
