Sentiment classification is a form of data analytics where people's feelings and attitudes toward a topic are mined from data. This tantalizing power to "predict the zeitgeist" means that sentiment classification has long attracted interest, but with mixed results. However, the recent development of the BERT framework and its pretrained neural language models is seeing new-found success for sentiment classification. BERT models are trained to capture word-level information via mask language modeling and sentence-level contexts via next sentence prediction tasks. Out of the box, they are adequate models for some natural language processing tasks. However, most models are further fine-tuned with domain-specific information to increase accuracy and usefulness. Motivated by the idea that a further fine-tuning step would improve the performance for downstream sentiment classification tasks, we developed TopicBERT-a BERT model fine-tuned to recognize topics at the corpus level in addition to the word and sentence levels. TopicBERT comprises two variants: TopicBERT-ATP (aspect topic prediction), which captures topic information via an auxiliary training task, and TopicBERT-TA, where topic representation is directly injected into a topic augmentation layer for sentiment classification. With TopicBERT-ATP, the topics are predetermined by an LDA mechanism and collapsed Gibbs sampling. With TopicBERT-TA, the topics can change dynamically during the training. Experimental results show that both approaches deliver the state-of-the-art performance in two different domains with SemEval 2014 Task 4. However, in a test of methods, direct augmentation outperforms further training. Comprehensive analyses in the form of ablation, parameter, and complexity studies accompany the results.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TNNLS.2021.3094987DOI Listing

Publication Analysis

Top Keywords

sentiment classification
24
neural language
8
model fine-tuned
8
sentiment
6
classification
6
topic
5
topicbert topic-enhanced
4
topic-enhanced neural
4
language
4
language model
4

Similar Publications

Fears about the destabilizing impact of misinformation online have motivated individuals and platforms to respond. Individuals have increasingly challenged others' online claims with fact-checks in pursuit of a healthier information ecosystem and to break down echo chambers of self-reinforcing opinion. Using Twitter (now X) data, here we show the consequences of individual misinformation tagging: tagged posters had explored novel political information and expanded topical interests immediately prior, but being tagged caused posters to retreat into information bubbles.

View Article and Find Full Text PDF

Trump tariffs and the U.S. defense industry.

PLoS One

January 2025

Netherlands Defense Academy, Breda, The Netherlands.

In March 2018, U.S. President Trump announced that the U.

View Article and Find Full Text PDF

To enhance emergency management and public opinion governance, improve the accuracy of forecasting group emotional responses, and elucidate the complex pathways of multi-factor coupling in the formation of group emotions, this study constructs a theoretical framework grounded in the social combustion theory. Through web scraping and text sentiment analysis, group emotional tendencies were measured in 40 public emergency cases from the past five years. Using the fuzzy-set qualitative comparative analysis (fsQCA) method, the study explored the coupling, configuration effect, and formation pathways of factors such as "burning substance", "accelerant", and "ignition" in the emergence of group emotions.

View Article and Find Full Text PDF

Background Understanding the attitudes and perceptions of the general population is necessary for organizing health promotion initiatives. During outbreaks, social media has a significant impact on creating social perceptions. This study aims to identify and examine the emotions expressed and topics of discussion among Indian citizens related to COVID-19 third wave, from the messages posted on Twitter using text mining techniques.

View Article and Find Full Text PDF

Optical semantic communication through multimode fiber: from symbol transmission to sentiment analysis.

Light Sci Appl

January 2025

Wuhan National Laboratory for Optoelectronics, Next Generation Internet Access National Engineering Laboratory, and Hubei Optics Valley Laboratory, School of Optical and Electronic Information, Huazhong University of Science and Technology, Wuhan, 430074, China.

We propose and validate a novel optical semantic transmission scheme using multimode fiber (MMF). By leveraging the frequency sensitivity of intermodal dispersion in MMFs, we achieve high-dimensional semantic encoding and decoding in the frequency domain. Our system maps symbols to 128 distinct frequencies spaced at 600 kHz intervals, demonstrating a seven-fold increase in capacity compared to conventional communication encoding.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!