Emotions are integral to human social interactions, with diverse responses elicited by various situational contexts. Particularly, the prevalence of negative emotional states has been correlated with negative outcomes for mental health, necessitating a comprehensive analysis of their occurrence and impact on individuals. In this paper, we introduce a novel dataset named DepressionEmo designed to detect 8 emotions associated with depression by 6037 examples of long Reddit user posts. This dataset was created through a majority vote over inputs by zero-shot classifications from pre-trained models and validating the quality by annotators and ChatGPT, exhibiting an acceptable level of inter-rater reliability between annotators. The correlation between emotions, and linguistic analysis are conducted on DepressionEmo. Besides, we provide several text classification methods classified into two groups: machine learning methods such as SVM, XGBoost, and LightGBM; and deep learning methods such as BERT, BART, GAN-BERT, and T5. Despite achieving the same F1 Macro score of 0.76 as BART, the pretrained BERT model, bert-base-uncased, stands out as the most efficient model in our experiments due to its lower number of parameters. Across all emotions, the highest F1 Macro value is achieved by suicide intent, indicating a certain value of our dataset in identifying emotions in individuals with depression symptoms through text analysis. The curated dataset is publicly available at: https://github.com/abuBakarSiddiqurRahman/DepressionEmo.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.jad.2024.08.013DOI Listing

Publication Analysis

Top Keywords

novel dataset
8
learning methods
8
emotions
6
dataset
5
depressionemo novel
4
dataset multilabel
4
multilabel classification
4
classification depression
4
depression emotions
4
emotions emotions
4

Similar Publications

Objective: A comprehensive bioinformatics analysis was conducted to investigate potential new diagnostic biomarkers and immune infiltration characteristics associated with tubulointerstitial injury in lupus nephritis (LN), and to examine possible correlations between key genes and infiltrating immune cells.

Methods: The GSE32591, GSE113342, and GSE200306 datasets were downloaded from the Gene Expression Omnibus database and differentially expressed genes (DEGs) were identified in the pooled dataset. Support vector machine-recursive feature elimination analysis and the least absolute shrinkage and selection operator regression model were used to screen for possible markers, and the compositional patterns of the 22 types of immune cell fractions in LN were determined using CIBERSORT.

View Article and Find Full Text PDF

Background: Several clinical trials have shown that immunotherapy plays a pivotal role in the treatment of patients with metastatic synovial sarcoma. Immune-related genes (IRGs) have been demonstrated to predict the immunotherapy response in certain malignant tumours. However, the clinical significance of IRGs in patients with synovial sarcoma (SS) is still unclear.

View Article and Find Full Text PDF

A novel methodology for dataset augmentation in the semantic segmentation of coil-coated surface degradation is presented in this study. Deep convolutional generative adversarial networks (DCGAN) are employed to generate synthetic input-target pairs, which closely resemble real-world data, with the goal of expanding an existing dataset. These augmented datasets are used to train two state-of-the-art models, U-net, and DeepLabV3, for the precise detection of degradation areas around scribes.

View Article and Find Full Text PDF

Quantitative site-specific N-glycosylation analysis reveals IgG glyco-signatures for pancreatic cancer diagnosis.

Clin Proteomics

December 2024

Department of Pancreatic Surgery and Institutes for Systems Genetics, West China Hospital, Sichuan University, Keyuan 4th Road, Gaopeng Avenue, Hi-tech Zone, Chengdu, Sichuan, 610041, China.

Background: Pancreatic cancer is a highly aggressive tumor with a poor prognosis due to a low early detection rate and a lack of biomarkers. Most of pancreatic cancer is pancreatic ductal adenocarcinoma (PDAC). Alterations in the N-glycosylation of plasma immunoglobulin G (IgG) have been shown to be closely associated with the onset and development of several cancers and could be used as biomarkers for diagnosis.

View Article and Find Full Text PDF

TD-STrans: Tri-domain sparse-view CT reconstruction based on sparse transformer.

Comput Methods Programs Biomed

December 2024

Department of Information and Communication Engineering, North University of China, Taiyuan 030051, China; The State Key Lab for Electronic Testing Technology, North University of China, Taiyuan 030051, China. Electronic address:

Background And Objective: Sparse-view computed tomography (CT) speeds up scanning and reduces radiation exposure in medical diagnosis. However, when the projection views are severely under-sampled, deep learning-based reconstruction methods often suffer from over-smoothing of the reconstructed images due to the lack of high-frequency information. To address this issue, we introduce frequency domain information into the popular projection-image domain reconstruction, proposing a Tri-Domain sparse-view CT reconstruction model based on Sparse Transformer (TD-STrans).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!