Lightweight Transformer exhibits comparable performance to LLMs for Seizure Prediction: A case for light-weight models for EEG data.

Proc IEEE Int Conf Big Data

Knight Foundation School of Computing and Information Sciences, Florida International University, Miami, FL 33172, USA.

Published: December 2024

Predicting seizures ahead of time will have a significant positive clinical impact for people with epilepsy. Advances in machine learning/artificial intelligence (ML/AI) have provided the tools needed to perform such predictive tasks. To date, advanced deep learning (DL) architectures such as the convolutional neural network (CNN) and long short-term memory (LSTM) have been used with mixed results. However, the highly connected activity exhibited by epileptic seizures necessitates more complex ML techniques that can better capture the interconnected neurological processes. Other challenges include the variability of EEG sensor data quality, differing epilepsy and seizure profiles, a lack of annotated datasets, and the absence of ML-ready benchmarks. In addition, successful models will need to perform inference in near real time using limited hardware compute capacity. To address these challenges, we propose a lightweight architecture, called ESPFormer, whose novelty lies in its simplicity, smaller model size, and lower computational footprint for real-time inference compared to other works in the literature. To quantify the performance of this lightweight model, we compared it with a custom-designed residual neural network (ResNet), a pre-trained vision transformer (ViT), and a pre-trained large language model (LLM). We tested ESPFormer on MLSPred-Bench, which is the largest patient-independent seizure prediction dataset, comprising 12 benchmarks. Our results demonstrate that ESPFormer provides the best prediction accuracy on 4 of the 12 benchmarks, with an average improvement of 2.65% compared to the LLM, 3.35% compared to the ViT, and 17.65% compared to the ResNet, and comparable results on the other benchmarks. Our results indicate that a lightweight transformer architecture may outperform resource-intensive LLM-based models for real-time EEG-based seizure prediction.
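
As a rough illustration of why model size matters for real-time inference on limited hardware, the following Python sketch counts the trainable parameters of a standard transformer encoder stack at two sizes. Both configurations are hypothetical and chosen only for illustration; neither is taken from ESPFormer or from the LLM evaluated in the paper, and embeddings and task heads are ignored.

```python
def encoder_layer_params(d_model: int, d_ff: int) -> int:
    """Trainable parameters in one standard transformer encoder layer."""
    # Query, key, value and output projections, each d_model x d_model plus a bias.
    attention = 4 * (d_model * d_model + d_model)
    # Two-layer feed-forward block: d_model -> d_ff -> d_model, with biases.
    feed_forward = (d_model * d_ff + d_ff) + (d_ff * d_model + d_model)
    # Two LayerNorms, each with a scale and a shift vector.
    layer_norms = 2 * (2 * d_model)
    return attention + feed_forward + layer_norms


def encoder_params(n_layers: int, d_model: int, d_ff: int) -> int:
    """Parameters in a stack of identical encoder layers."""
    return n_layers * encoder_layer_params(d_model, d_ff)


# Hypothetical sizes (assumptions, not values reported in the paper).
small_total = encoder_params(n_layers=2, d_model=64, d_ff=128)
large_total = encoder_params(n_layers=12, d_model=768, d_ff=3072)

print(f"small encoder: {small_total:,} parameters")
print(f"large encoder: {large_total:,} parameters")
print(f"ratio: {large_total / small_total:.0f}x")
```

Under these assumed sizes the larger stack has over a thousand times more encoder parameters than the smaller one, which is the kind of gap that makes a compact model attractive when inference has to run on constrained hardware.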

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11877310
DOI: http://dx.doi.org/10.1109/bigdata62323.2024.10825319

Similar Publications

The integration of gaze/eye tracking into virtual and augmented reality devices has unlocked new possibilities, offering a novel human-computer interaction (HCI) modality for on-device extended reality (XR). Emerging applications in XR, such as low-effort user authentication, mental health diagnosis, and foveated rendering, demand real-time eye tracking at high frequencies, a capability that current solutions struggle to deliver. To address this challenge, we present EX-Gaze, an event-based real-time eye tracking system designed for on-device extended reality.

Vision Mamba and xLSTM-UNet for medical image segmentation.

Sci Rep

March 2025

School of Information Science and Engineering, Yunnan University, Yunnan, 650504, China.

Deep learning-based medical image segmentation methods are generally divided into convolutional neural networks (CNNs) and Transformer-based models. Traditional CNNs are limited by their receptive field, making it challenging to capture long-range dependencies. While Transformers excel at modeling global information, their high computational complexity restricts their practical application in clinical scenarios.

In response to issues with existing classical semantic segmentation models, such as inaccurate landslide edge extraction in high-resolution images, large numbers of network parameters, and long training times, this paper proposes a lightweight landslide detection model, Landslide Detection Network (LDNet), based on DeepLabv3+ and a dual attention mechanism. LDNet uses the lightweight network MobileNetv2 to replace the Xception backbone of DeepLabv3+, thereby reducing model parameters and improving training speed. Additionally, the model incorporates a dual attention mechanism from the lightweight Convolutional Block Attention Module to more accurately and efficiently detect landslide features.

Waste management handles all kinds of waste, including household, industrial, municipal, organic, biomedical, biological, and radioactive wastes. People still face challenges in proper disposal methods for different types of waste, including landfill-bound items, recyclable materials, and biodegradable waste. Inadequate waste management poses a significant and multifaceted global challenge.
