Enhancing Spam Message Classification and Detection Using Transformer-Based Embedding and Ensemble Learning.

Sensors (Basel)

Department of Information Systems, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia.

Published: April 2023

Over the last decade, the Short Message Service (SMS) has become a primary communication channel. Nevertheless, its popularity has also given rise to the so-called SMS spam. These messages, i.e., spam, are annoying and potentially malicious by exposing SMS users to credential theft and data loss. To mitigate this persistent threat, we propose a new model for SMS spam detection based on pre-trained Transformers and Ensemble Learning. The proposed model uses a text embedding technique that builds on the recent advancements of the GPT-3 Transformer. This technique provides a high-quality representation that can improve detection results. In addition, we used an Ensemble Learning method where four machine learning models were grouped into one model that performed significantly better than its separate constituent parts. The experimental evaluation of the model was performed using the SMS Spam Collection Dataset. The obtained results showed a state-of-the-art performance that exceeded all previous works with an accuracy that reached 99.91%.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10146782PMC
http://dx.doi.org/10.3390/s23083861DOI Listing

Publication Analysis

Top Keywords

ensemble learning
12
sms spam
12
model performed
8
sms
5
enhancing spam
4
spam message
4
message classification
4
classification detection
4
detection transformer-based
4
transformer-based embedding
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!