Attention-Based Multi-Scale Convolutional Neural Network (A+MCNN) for Multi-Class Classification in Road Images.

Sensors (Basel)

Civil, Environmental, and Construction, Engineering Department, University of Central Florida, Orlando, FL 32816, USA.

Published: July 2021

Automated pavement distress recognition is a key step in smart infrastructure assessment. Advances in deep learning and computer vision have improved the automated recognition of pavement distresses in road surface images. This task remains challenging due to the high variation of defects in shapes and sizes, demanding a better incorporation of contextual information into deep networks. In this paper, we show that an attention-based multi-scale convolutional neural network (A+MCNN) improves the automated classification of common distress and non-distress objects in pavement images by (i) encoding contextual information through multi-scale input tiles and (ii) employing a mid-fusion approach with an attention module for heterogeneous image contexts from different input scales. A+MCNN is trained and tested with four distress classes (crack, crack seal, patch, pothole), five non-distress classes (joint, marker, manhole cover, curbing, shoulder), and two pavement classes (asphalt, concrete). A+MCNN is compared with four deep classifiers that are widely used in transportation applications and a generic CNN classifier (as the control model). The results show that A+MCNN consistently outperforms the baselines by 1∼26% on average in terms of the F-score. A comprehensive discussion is also presented regarding how these classifiers perform differently on different road objects, which has been rarely addressed in the existing literature.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8347086PMC
http://dx.doi.org/10.3390/s21155137DOI Listing

Publication Analysis

Top Keywords

attention-based multi-scale
8
multi-scale convolutional
8
convolutional neural
8
neural network
8
network a+mcnn
8
a+mcnn
5
a+mcnn multi-class
4
multi-class classification
4
classification road
4
road images
4

Similar Publications

Motivation: The classification task based on whole-slide images (WSIs) is a classic problem in computational pathology. Multiple Instance Learning (MIL) provides a robust framework for analyzing whole slide images with slide-level labels at gigapixel resolution. However, existing MIL models typically focus on modeling the relationships between instances while neglecting the variability across the channel dimensions of instances, which prevents the model from fully capturing critical information in the channel dimension.

View Article and Find Full Text PDF

Traffic flow prediction is a pivotal element in Intelligent Transportation Systems (ITSs) that provides significant opportunities for real-world applications. Capturing complex and dynamic spatio-temporal patterns within traffic data remains a significant challenge for traffic flow prediction. Different approaches to effectively modeling complex spatio-temporal correlations within traffic data have been proposed.

View Article and Find Full Text PDF

Background And Objective: Accurate extraction of retinal vascular components is vital in diagnosing and treating retinal diseases. Achieving precise segmentation of retinal blood vessels is challenging due to their complex structure and overlapping vessels with other anatomical features. Existing deep neural networks often suffer from false positives at vessel branches or missing fragile vessel patterns.

View Article and Find Full Text PDF

Cloud Removal in the Tibetan Plateau Region Based on Self-Attention and Local-Attention Models.

Sensors (Basel)

December 2024

School of Surveying and Geo-Informatics, Shandong Jianzhu University, Fengming Road, Jinan 250101, China.

Optical remote sensing images have a wide range of applications but are often affected by cloud cover, which interferes with subsequent analysis. Therefore, cloud removal has become indispensable in remote sensing data processing. The Tibetan Plateau, as a sensitive region to climate change, plays a crucial role in the East Asian water cycle and regional climate due to its snow cover.

View Article and Find Full Text PDF

Brain-computer interfaces (BCIs) establish a direct communication pathway between the brain and external devices and have been widely applied in upper limb rehabilitation for hemiplegic patients. However, significant individual variability in motor imagery electroencephalogram (MI-EEG) signals leads to poor generalization performance of MI-based BCI decoding methods to new patients. This paper proposes a Multi-scale Frequency domain Feature-based Dynamic graph Attention Network (MFF-DANet) for upper limb MI decoding in hemiplegic patients.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!