Improving sentiment classification using a RoBERTa-based hybrid model.

Front Hum Neurosci

Department of Information Technology, Faculty of Computers and Information, Menoufia University, Shibin El Kom, Egypt.

Published: December 2023

Introduction: Several attempts have been made to enhance text-based sentiment analysis's performance. The classifiers and word embedding models have been among the most prominent attempts. This work aims to develop a hybrid deep learning approach that combines the advantages of transformer models and sequence models with the elimination of sequence models' shortcomings.

Methods: In this paper, we present a hybrid model based on the transformer model and deep learning models to enhance sentiment classification process. Robustly optimized BERT (RoBERTa) was selected for the representative vectors of the input sentences and the Long Short-Term Memory (LSTM) model in conjunction with the Convolutional Neural Networks (CNN) model was used to improve the suggested model's ability to comprehend the semantics and context of each input sentence. We tested the proposed model with two datasets with different topics. The first dataset is a Twitter review of US airlines and the second is the IMDb movie reviews dataset. We propose using word embeddings in conjunction with the SMOTE technique to overcome the challenge of imbalanced classes of the Twitter dataset.

Results: With an accuracy of 96.28% on the IMDb reviews dataset and 94.2% on the Twitter reviews dataset, the hybrid model that has been suggested outperforms the standard methods.

Discussion: It is clear from these results that the proposed hybrid RoBERTa-(CNN+ LSTM) method is an effective model in sentiment classification.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10733963PMC
http://dx.doi.org/10.3389/fnhum.2023.1292010DOI Listing

Publication Analysis

Top Keywords

sentiment classification
12
hybrid model
12
reviews dataset
12
model
8
deep learning
8
hybrid
5
improving sentiment
4
classification roberta-based
4
roberta-based hybrid
4
model introduction
4

Similar Publications

Sentiment analysis has become a difficult and important task in the current world. Because of several features of data, including abbreviations, length of tweet, and spelling error, there should be some other non-conventional methods to achieve the accurate results and overcome the current issue. In other words, because of those issues, conventional approaches cannot perform well and accomplish results with high efficiency.

View Article and Find Full Text PDF

Perspectives surrounding robotic total hip arthroplasty: a cross-sectional analysis using natural language processing.

Can J Surg

January 2025

From the Faculty of Medicine, Université de Montréal, Montréal, Que. (Levett); the Department of Neurology and Neurosurgery, McGill University, Montréal, Que. (Elkaim); the Department of Orthopaedic Surgery, McGill University, Jewish General Hospital, Montréal, Que. (Zukor, Huk, Antoniou)

Background: Robotic technology has been used in total hip arthroplasty (THA) for several years. Despite the advances in this field, perspectives surrounding robotic THA are not fully understood. This study aimed to characterize the landscape of robotic THA on social media.

View Article and Find Full Text PDF

Objectives: Diabetes distress can negatively affect the well-being of individuals with type 1 diabetes (T1D). Voice-based (VB) technology can be used to develop inexpensive and ecological tools for managing diabetes distress. This study explored the competencies to engage with digital health services, needs and preferences of individuals with T1D or caring for a child with this condition regarding VB technology to inform the tailoring of a co-designed tool for supporting diabetes distress management.

View Article and Find Full Text PDF

Background: The pervasiveness of drug culture has become evident in popular music and social media. Previous research has examined drug abuse content in both social media and popular music; however, to our knowledge, the intersection of drug abuse content in these 2 domains has not been explored. To address the ongoing drug epidemic, we analyzed drug-related content on Twitter (subsequently rebranded X), with a specific focus on lyrics.

View Article and Find Full Text PDF

Clinical Manifestations.

Alzheimers Dement

December 2024

ki elements UG, Saarbrücken, Germany.

Future clinical trials targeting Alzheimer's disease (AD) on new disease modifying drugs necessitate a paradigm shift towards early identification of individuals at risk. Emerging evidence indicates that subtle alterations in language and speech characteristics may manifest concurrently with the progression of neurodegenerative disorders like AD. These changes manifest as discernible variations, assessable through semantic nuances, word choices, sentiment, grammar usage (linguistic features), and phonetic/acoustic traits (paralinguistic features).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!