Semantic segmentation of high-resolution images from remote sensing is crucial across various sectors. However, due to limitations in computational resources and the complexity of network architectures, many sophisticated semantic segmentation models struggle with efficiency in real-world applications, leading to an interest in developing lightweight model like borders. These models often employ a dual-branch structure, which balances processing speed and performance effectively. Yet, this design typically falls short in leveraging shallow structural information to enrich the dual branches with comprehensive multiscale data. Additionally, the lightweight components struggle to capture the global contextual details of feature sets efficiently. When compared to state-of-the-art models, lightweight semantic segmentation models usually exhibit performance gaps. To address these issues, we introduce a novel approach that incorporates a deep-shallow interaction mechanism with an attention module to improve water body segmentation efficiency. This method spatially adjusts feature representations to better identify water-related data, utilizing a U-Net frame work to enhance the accuracy of edge detection in water zones by providing more precise local positioning information. The attention mechanism processes and merges low and high-level data separately in different dimensions, allowing for the effective distinction of water areas from their surroundings by blending spatial attributes with in-depth context insights. Experimental outcomes demonstrate a remarkable 95% accuracy, showcasing the proposed method's superiority over existing models.

Download full-text PDF

Source
http://dx.doi.org/10.1038/s41598-024-84134-4DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11696499PMC

Publication Analysis

Top Keywords

semantic segmentation
12
attention mechanism
8
remote sensing
8
segmentation models
8
models
5
dsia u-net
4
u-net deep
4
deep shallow
4
shallow interaction
4
interaction attention
4

Similar Publications

In this study, we explore an enhancement to the U-Net architecture by integrating SK-ResNeXt as the encoder for Land Cover Classification (LCC) tasks using Multispectral Imaging (MSI). SK-ResNeXt introduces cardinality and adaptive kernel sizes, allowing U-Net to better capture multi-scale features and adjust more effectively to variations in spatial resolution, thereby enhancing the model's ability to segment complex land cover types. We evaluate this approach using the Five-Billion-Pixels dataset, composed of 150 large-scale RGB-NIR images and over 5 billion labeled pixels across 24 categories.

View Article and Find Full Text PDF

Multi-scale multi-object semi-supervised consistency learning for ultrasound image segmentation.

Neural Netw

December 2024

Department of Mechanical Engineering, National University of Singapore, 9 Engineering Drive 1, Singapore 117575, Singapore. Electronic address:

Manual annotation of ultrasound images relies on expert knowledge and requires significant time and financial resources. Semi-supervised learning (SSL) exploits large amounts of unlabeled data to improve model performance under limited labeled data. However, it faces two challenges: fusion of contextual information at multiple scales and bias of spatial information between multiple objects.

View Article and Find Full Text PDF

Detecting brain tumours (BT) early improves treatment possibilities and increases patient survival rates. Magnetic resonance imaging (MRI) scanning offers more comprehensive information, such as better contrast and clarity, than any alternative scanning process. Manually separating BTs from several MRI images gathered in medical practice for cancer analysis is challenging and time-consuming.

View Article and Find Full Text PDF

Motivation: Skeletal muscle cells (skMCs) combine together to create long, multi-nucleated structures called myotubes. By studying the size, length, and number of nuclei in these myotubes, we can gain a deeper understanding of skeletal muscle development. However, human experimenters may often derive unreliable results owing to the unusual shape of the myotube, which causes significant measurement variability.

View Article and Find Full Text PDF

Background: Neuropsychological language assessment batteries usually include connected speech tasks (e.g. the description of a picture).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!