AI Article Synopsis

  • Sentiment analysis plays a crucial role in natural language applications, and this paper explores two models specifically for Arabic sentiment analysis using ASTD and ATDFS datasets in both 2-class and multiclass settings.
  • The two proposed models, MC1 (a 2-layer CNN with global average pooling) and MC2 (a 2-layer CNN with max pooling followed by a BiGRU), achieved notable accuracy improvements over previous studies, reaching 73.17% on the challenging 4-class task and 90.06% on the simpler 2-class task.
  • The research emphasizes the importance of effective Arabic preprocessing, incorporating innovative techniques, and finding that processing emoticons and using a custom stoplist can significantly enhance model performance,

Article Abstract

Sentiment analysis is an essential process which is important to many natural language applications. In this paper, we apply two models for Arabic sentiment analysis to the ASTD and ATDFS datasets, in both 2-class and multiclass forms. Model MC1 is a 2-layer CNN with global average pooling, followed by a dense layer. MC2 is a 2-layer CNN with max pooling, followed by a BiGRU and a dense layer. On the difficult ASTD 4-class task, we achieve 73.17%, compared to 65.58% reported by Attia et al., 2018. For the easier 2-class task, we achieve 90.06% with MC1 compared to 85.58% reported by Kwaik et al., 2019. We carry out experiments on various data splits, to match those used by other researchers. We also pay close attention to Arabic preprocessing and include novel steps not reported in other works. In an ablation study, we investigate the effect of two steps in particular, the processing of emoticons and the use of a custom stoplist. On the 4-class task, these can make a difference of up to 4.27% and 5.48%, respectively. On the 2-class task, the maximum improvements are 2.95% and 3.87%.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8449738PMC
http://dx.doi.org/10.1155/2021/5538791DOI Listing

Publication Analysis

Top Keywords

sentiment analysis
12
arabic sentiment
8
2-layer cnn
8
dense layer
8
4-class task
8
task achieve
8
2-class task
8
improving arabic
4
analysis cnn-based
4
cnn-based architectures
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!