An improved feature extraction algorithm for robust Swin Transformer model in high-dimensional medical image analysis.

Comput Biol Med

Department of Computer Science and Engineering, Kamala Nehru Institute of Technology Sultanpur, Kadipur Rd, Sultanpur, Affiliated to AKTU, Uttar Pradesh, 228118, India. Electronic address:

Published: April 2025

The Swin Transformer is recently developed transformer architecture with promising results in various computer vision tasks. Medical image analysis is a complex and critical task that requires high dimensional feature extraction. The significant challenge in medical image analysis is the limited availability of annotated data for training. It has been proposed that a multitask learning scheme be put in place. Swin Transformer can be trained for all the medical image analysis tasks simultaneously so that general features can be learned from the model and used for other new tasks and data. In most cases, the medical images have poor properties such as noise, artifacts, and low contrast. The Swin Transformer presents an adaptive attention mechanism: its attention weights are learned dynamically according to input quality. It could selectively focus on essential regions in an image while discarding noise or irrelevant information. Medical images may have very complex anatomical structures. In this sense, an iterative transformer encoder is proposed to form a hierarchical structure with gradually decreasing dimensionality between layers-so that the attention mechanism is applied at different scales, capturing local and long-range relationships between image patches. This research proposes a robust Swin Transformer architecture for high-dimensional feature extraction in medical images. The proposed algorithm reached 80.76 % accuracy, 80.28 % precision, 78.04 % recall, 76.46 % F1-Score and 73.46 % critical success index.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.compbiomed.2025.109822DOI Listing

Publication Analysis

Top Keywords

swin transformer
20
medical image
16
image analysis
16
feature extraction
12
medical images
12
robust swin
8
transformer architecture
8
attention mechanism
8
transformer
7
medical
7

Similar Publications

Primary bone tumors (PBTs) present significant diagnostic challenges due to their heterogeneous nature and similarities with bone infections. This study aimed to develop an ensemble deep learning framework that integrates multicenter radiographs and extensive clinical features to accurately differentiate between PBTs and bone infections. We compared the performance of the ensemble model with four imaging models based solely on radiographs utilizing EfficientNet B3, EfficientNet B4, Vision Transformer, and Swin Transformers.

View Article and Find Full Text PDF

Disease detection on exterior surfaces of buildings using deep learning in China.

Sci Rep

March 2025

School of Housing, Building and Planning, Universiti Sains Malaysia, Penang, 11700, Malaysia.

Urban infrastructure, particularly in ageing cities, faces significant challenges in maintaining building aesthetics and structural integrity. Traditional methods for detecting diseases on building exteriors, such as manual inspections, are often inefficient, costly, and prone to errors, leading to incomplete assessments and delayed maintenance actions. This study explores the application of advanced deep learning techniques to accurately detect diseases on the exterior surfaces of buildings in urban environments, aiming to enhance detection efficiency and accuracy while providing a real-time monitoring solution that can be widely implemented in infrastructure health management.

View Article and Find Full Text PDF

Integrated multi-omics characterization of neuroblastoma with bone or bone marrow metastasis.

Genes Dis

May 2025

Department of Pediatric Surgical Oncology, The Children's Hospital of Chongqing Medical University, National Clinical Research Center for Child Health and Disorders, Ministry of Education Key Laboratory of Child Development and Disorders, Chongqing Key Laboratory of Pediatric Metabolism and Inflammatory Diseases, Chongqing 400014, China.

The pathogenesis of neuroblastoma with bone or bone marrow metastasis (NB-BBM) and its complex immune microenvironment remain poorly elucidated, hampering the advancement of effective risk prediction for BBM and limiting therapeutic strategies. Feature recognition of 142 paraffin-embedded hematoxylin-eosin-stained tumor section images was conducted using a Swin-Transformer for pathological histology to predict NB-BBM occurrence. Single-cell transcriptomics identified a tumor cell subpopulation (NB3) and two tumor-associated macrophage (TAM) subpopulations (SPP1 TAMs and IGHM TAMs) closely associated with BBM and highlighted transketolase (TKT) as a key molecular marker for metastatic progression in NB.

View Article and Find Full Text PDF

Purpose: Despite major advances in Computer Assisted Diagnosis (CAD), the need for carefully labeled training data remains an important clinical translation barrier. This work aims to overcome this barrier for ultrasound video-based CAD, using video-level classification labels combined with a novel training strategy to improve the generalization performance of state-of-the-art (SOTA) video classifiers.

Methods: SOTA video classifiers were trained and evaluated on a novel ultrasound video dataset of liver and kidney pathologies, and they all struggled to generalize, especially for kidney pathologies.

View Article and Find Full Text PDF

Super-resolution (SR) techniques present a suitable solution to increase the image resolution acquired using an ultrasound device characterized by a low image resolution. This can be particularly beneficial in low-resource imaging settings. This work surveys advanced SR techniques applied to enhance the resolution and quality of fetal ultrasound images, focusing Dual back-projection based internal learning (DBPISR) technique, which utilizes internal learning for blind super-resolution, as opposed to blind super-resolution generative adversarial network (BSRGAN), real-world enhanced super-resolution generative adversarial network (Real-ESRGAN), swin transformer for image restoration (SwinIR) and SwinIR-Large.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!