Multi-Resolution Learning and Semantic Edge Enhancement for Super-Resolution Semantic Segmentation of Urban Scene Images.

Sensors (Basel)

College of Electronic and Information Engineering, Tongji University, Shanghai 201804, China.

Published: July 2024

Super-resolution semantic segmentation (SRSS) is a technique that aims to obtain high-resolution semantic segmentation results based on resolution-reduced input images. SRSS can significantly reduce computational cost and enable efficient, high-resolution semantic segmentation on mobile devices with limited resources. Some of the existing methods require modifications of the original semantic segmentation network structure or add additional and complicated processing modules, which limits the flexibility of actual deployment. Furthermore, the lack of detailed information in the low-resolution input image renders existing methods susceptible to misdetection at the semantic edges. To address the above problems, we propose a simple but effective framework called multi-resolution learning and semantic edge enhancement-based super-resolution semantic segmentation (MS-SRSS) which can be applied to any existing encoder-decoder based semantic segmentation network. Specifically, a multi-resolution learning mechanism (MRL) is proposed that enables the feature encoder of the semantic segmentation network to improve its feature extraction ability. Furthermore, we introduce a semantic edge enhancement loss (SEE) to alleviate the false detection at the semantic edges. We conduct extensive experiments on the three challenging benchmarks, Cityscapes, Pascal Context, and Pascal VOC 2012, to verify the effectiveness of our proposed MS-SRSS method. The experimental results show that, compared with the existing methods, our method can obtain the new state-of-the-art semantic segmentation performance.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11280809	PMC
http://dx.doi.org/10.3390/s24144522	DOI Listing

Publication Analysis

Top Keywords

semantic segmentation

semantic

multi-resolution learning

semantic edge

super-resolution semantic

existing methods

segmentation network

segmentation

learning semantic

edge enhancement

Similar Publications

EGNet: 3D Semantic Segmentation Through Point-Voxel-Mesh Data for Euclidean-Geodesic Feature Fusion.

Sensors (Basel)

December 2024

School of Computer Science and Technology, Changchun University of Science and Technology, Changchun 130022, China.

Qi Li Yu Song Xiaoqian Jin Yan Wu Hang Zhang

With the advancement of service robot technology, the demand for higher boundary precision in indoor semantic segmentation has increased. Traditional methods of extracting Euclidean features using point cloud and voxel data often neglect geodesic information, reducing boundary accuracy for adjacent objects and consuming significant computational resources. This study proposes a novel network, the Euclidean-geodesic network (EGNet), which uses point cloud-voxel-mesh data to characterize detail, contour, and geodesic features, respectively.

View Article and Find Full Text PDF

Similar Publications

HKAN: A Hybrid Kolmogorov-Arnold Network for Robust Fabric Defect Segmentation.

Sensors (Basel)

December 2024

School of Computer and Artificial Intelligence, Wuhan Textile Unversity, Wuhan 430200, China.

Min Li Pei Ye Shuqin Cui Ping Zhu Junping Liu

Currently, fabric defect detection methods predominantly rely on CNN models. However, due to the inherent limitations of CNNs, such models struggle to capture long-distance dependencies in images and fail to accurately detect complex defect features. While Transformers excel at modeling long-range dependencies, their quadratic computational complexity poses significant challenges.

View Article and Find Full Text PDF

Similar Publications

Advanced Monocular Outdoor Pose Estimation in Autonomous Systems: Leveraging Optical Flow, Depth Estimation, and Semantic Segmentation with Dynamic Object Removal.

Sensors (Basel)

December 2024

Electrical, Computer, and Biomedical Engineering, Toronto Metropolitan University, Toronto, ON M5B 2K3, Canada.

Alireza Ghasemieh Rasha Kashef

Autonomous technologies have revolutionized transportation, military operations, and space exploration, necessitating precise localization in environments where traditional GPS-based systems are unreliable or unavailable. While widespread for outdoor localization, GPS systems face limitations in obstructed environments such as dense urban areas, forests, and indoor spaces. Moreover, GPS reliance introduces vulnerabilities to signal disruptions, which can lead to significant operational failures.

View Article and Find Full Text PDF

Similar Publications

Development of a Grape Cut Point Detection System Using Multi-Cameras for a Grape-Harvesting Robot.

Sensors (Basel)

December 2024

Laboratory of Bio-Mechatronics, Faculty of Engineering, Kitami Institute of Technology, Koentyo 165, Kitami Shi 090-8507, Hokkaido, Japan.

Liangliang Yang Tomoki Noguchi Yohei Hoshino

Harvesting grapes requires a large amount of manual labor. To reduce the labor force for the harvesting job, in this study, we developed a robot harvester for the vine grapes. In this paper, we proposed an algorithm that using multi-cameras, as well as artificial intelligence (AI) object detection methods, to detect the thin stem and decide the cut point.

View Article and Find Full Text PDF

Similar Publications

Infrared Aircraft Detection Algorithm Based on High-Resolution Feature-Enhanced Semantic Segmentation Network.

Sensors (Basel)

December 2024

College of Information Engineering, Henan University of Science and Technology, Luoyang 471023, China.

Gang Liu Jiangtao Xi Chao Ma Huixiang Chen

In order to achieve infrared aircraft detection under interference conditions, this paper proposes an infrared aircraft detection algorithm based on high-resolution feature-enhanced semantic segmentation network. Firstly, the designed location attention mechanism is utilized to enhance the current-level feature map by obtaining correlation weights between pixels at different positions. Then, it is fused with the high-level feature map rich in semantic features to construct a location attention feature fusion network, thereby enhancing the representation capability of target features.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!