BEmoC: A Corpus for Identifying Emotion in Bengali Texts.

SN Comput Sci

Department of CSE, CUET, Chittagong, 4349 Bangladesh.

Published: January 2022

Emotion classification in text has growing interest among NLP experts due to the enormous availability of people's emotions and its emergence on various Web 2.0 applications/services. Emotion classification in the Bengali texts is also gradually being considered as an important task for sports, e-commerce, entertainments, and security applications. However, It is a very critical task to develop an automatic emotion classification system for low-resource languages such as, Bengali. Scarcity of resources and deficiency of benchmark corpora make the task more complicated. Thus, the development of a benchmark corpus is the prerequisite to develop an emotion classifier for Bengali texts. This paper describes the development of an emotional corpus (hereafter called 'BEmoC') for classifying six emotions in Bengali texts. The corpus development process consists of four key steps: data crawling, pre-processing, labelling, and verification. A total of 7000 texts are labelled into six basic emotion categories such as anger, fear, surprise, sadness, joy, and disgust, respectively. Dataset evaluation with 0.969 Cohen's score indicates the close agreement between the corpus annotators and the expert. The analysis of evaluation also represents that the distribution of emotion words obeys Zipf's law. Moreover, the results of BEmoC analysis shown in terms of coding reliability, emotion density, and most frequent emotion words, respectively.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8762637PMC
http://dx.doi.org/10.1007/s42979-022-01028-wDOI Listing

Publication Analysis

Top Keywords

bengali texts
16
emotion classification
12
emotion
9
bengali
5
texts
5
bemoc corpus
4
corpus identifying
4
identifying emotion
4
emotion bengali
4
texts emotion
4

Similar Publications

The life and accomplishments of Madhusudan Gupta, a significant person in Indian medical history, are discussed in this review article. Born into an aristocratic Bengali family, Gupta initially showed little interest in formal education. However, his enrolment in Sanskrit College and subsequent involvement with Calcutta Medical College (CMC) marked a turning point in his life.

View Article and Find Full Text PDF

This study addresses the pervasive challenges of low hepatitis B (HBV) and hepatitis C (HCV) testing rates coupled with the stigma associated with these diseases in low- and middle-income countries (LMICs) with a special focus on Bangladesh. This study aims to introduce an innovative crowdsourcing intervention that involves medical students, a crucial cohort with the potential to shape healthcare attitudes. Through a structured crowdsourcing approach, the study designs and implements a digital intervention to counter stigma and promote testing among medical students in Dhaka, Bangladesh.

View Article and Find Full Text PDF

Covid text identification (CTI) is a crucial research concern in natural language processing (NLP). Social and electronic media are simultaneously adding a large volume of Covid-affiliated text on the World Wide Web due to the effortless access to the Internet, electronic gadgets and the Covid outbreak. Most of these texts are uninformative and contain misinformation, disinformation and malinformation that create an infodemic.

View Article and Find Full Text PDF

Background: There are a myriad of language cues that indicate depression in written texts, and natural language processing (NLP) researchers have proven the ability of machine learning and deep learning approaches to detect these cues. However, to date, these approaches bridging NLP and the domain of mental health for Bengali literature are not comprehensive. The Bengali-speaking population can express emotions in their native language in greater detail.

View Article and Find Full Text PDF

BEmoC: A Corpus for Identifying Emotion in Bengali Texts.

SN Comput Sci

January 2022

Department of CSE, CUET, Chittagong, 4349 Bangladesh.

Emotion classification in text has growing interest among NLP experts due to the enormous availability of people's emotions and its emergence on various Web 2.0 applications/services. Emotion classification in the Bengali texts is also gradually being considered as an important task for sports, e-commerce, entertainments, and security applications.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!