End to end vision transformer architecture for brain stroke assessment based on multi-slice classification and localization using computed tomography.

Comput Med Imaging Graph

Department of Mechanical Engineering, College of Engineering, University of Saskatchewan, S7N 5A9 Saskatoon, SK, Canada. Electronic address:

Published: October 2023

AI Article Synopsis

  • Brain stroke is a major cause of disability and death, highlighting the need for improved, automated diagnostic methods for better patient outcomes.
  • This study enhances the Vision Transformer (ViT) architecture to classify CT scan slices into categories like Normal, Infarction, and Hemorrhage and localize strokes effectively.
  • The modified ViT framework achieved 87.51% accuracy in classifying slices and demonstrated high precision in patient-wise stroke localization, showcasing its potential for transforming stroke diagnosis and treatment.

Article Abstract

Background: Brain stroke is a leading cause of disability and death worldwide, and early diagnosis and treatment are critical to improving patient outcomes. Current stroke diagnosis methods are subjective and prone to errors, as radiologists rely on manual selection of the most important CT slice. This highlights the need for more accurate and reliable automated brain stroke diagnosis and localization methods to improve patient outcomes.

Purpose: In this study, we aimed to enhance the vision transformer architecture for the multi-slice classification of CT scans of each patient into three categories, including Normal, Infarction, Hemorrhage, and patient-wise stroke localization, based on end-to-end vision transformer architecture. This framework can provide an automated, objective, and consistent approach to stroke diagnosis and localization, enabling personalized treatment plans based on the location and extent of the stroke.

Methods: We modified the Vision Transformer (ViT) in combination with neural network layers for the multi-slice classification of brain CT scans of each patient into normal, infarction, and hemorrhage classes. For stroke localization, we used the ViT architecture and convolutional neural network layers to detect stroke and localize it by bounding boxes for infarction and hemorrhage regions in a patient-wise manner based on multi slices.

Results: Our proposed framework achieved an overall accuracy of 87.51% in classifying brain CT scan slices and showed high precision in localizing the stroke patient-wise. Our results demonstrate the potential of our method for accurate and reliable stroke diagnosis and localization.

Conclusion: Our study enhanced ViT architecture for automated stroke diagnosis and localization using brain CT scans, which could have significant implications for stroke management and treatment. The use of deep learning algorithms can provide a more objective and consistent approach to stroke diagnosis and potentially enable personalized treatment plans based on the location and extent of the stroke. Further studies are needed to validate our method on larger and more diverse datasets and to explore its clinical utility in real-world settings.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.compmedimag.2023.102294DOI Listing

Publication Analysis

Top Keywords

stroke diagnosis
24
vision transformer
16
stroke
14
transformer architecture
12
brain stroke
12
multi-slice classification
12
diagnosis localization
12
infarction hemorrhage
12
accurate reliable
8
scans patient
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!