
Using large language models for extracting and pre-annotating texts on mental health from noisy data in a low-resource language

AI Article Synopsis

  • Recent advancements in large language models (LLMs) have the potential to improve conversational agents in mental healthcare, but challenges such as limited training data and privacy concerns persist.
  • A proposed solution involves leveraging human-AI annotation systems based on public domain discussions on social media, which require extensive cleaning to be effective.
  • Our research shows that while zero-shot classification offers some benefits for categorizing discussions about psychiatric disorders, fine-tuning LLMs significantly enhances accuracy, though it comes with a trade-off in processing speed.

Article Abstract

Recent advancements in large language models (LLMs) have opened new possibilities for developing conversational agents (CAs) in various subfields of mental healthcare. However, this progress is hindered by limited access to high-quality training data, often due to privacy concerns and the high cost of annotation for low-resource languages. A potential solution is to create human-AI annotation systems that utilize the extensive public domain user-to-user and user-to-professional discussions on social media. These discussions, however, are extremely noisy, necessitating the adaptation of LLMs for fully automatic cleaning and pre-classification to reduce human annotation effort. To date, research on LLM-based annotation in the mental health domain is extremely scarce. In this article, we explore the potential of zero-shot classification using four LLMs to select and pre-classify texts into topics representing psychiatric disorders, to facilitate the future development of CAs for disorder-specific counseling. We use 64,404 Russian-language texts from online discussion threads labeled with the seven most commonly discussed disorders: depression, neurosis, paranoia, anxiety disorder, bipolar disorder, obsessive-compulsive disorder, and borderline personality disorder. Our research shows that while preliminary data filtering using zero-shot technology slightly improves classification, LLM fine-tuning makes a far larger contribution to its quality. Both standard and natural language inference (NLI) modes of fine-tuning increase classification accuracy more than threefold compared to non-fine-tuned training with preliminarily filtered data. Although NLI fine-tuning achieves slightly higher accuracy (0.64) than the standard approach, it is six times slower, indicating a need for further experimentation with NLI hypothesis engineering.
Additionally, we demonstrate that lemmatization does not affect classification quality and that multilingual models using texts in their original language perform slightly better than English-only models using automatically translated texts. Finally, we introduce our dataset and model as the first openly available Russian-language resource for developing conversational agents in the domain of mental health counseling.
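The NLI-style classification the abstract describes can be sketched as follows. This is a minimal illustration, not the paper's implementation: the hypothesis template is an assumption, and `entailment_score` is a hypothetical keyword-overlap stand-in for a real NLI cross-encoder, used only so the sketch runs without a model download.

```python
# Sketch of NLI-mode zero-shot topic classification: each candidate
# disorder label is turned into a hypothesis, and the label whose
# hypothesis gets the highest entailment score against the text wins.

LABELS = [
    "depression", "neurosis", "paranoia", "anxiety disorder",
    "bipolar disorder", "obsessive-compulsive disorder",
    "borderline personality disorder",
]

# Assumed template; real NLI hypothesis engineering (as the abstract
# notes) would experiment with this wording.
HYPOTHESIS_TEMPLATE = "This text discusses {}."

def entailment_score(premise: str, hypothesis: str) -> float:
    """Hypothetical stand-in for an NLI model's P(entailment).

    A real system would run a cross-encoder over (premise, hypothesis);
    here we use word overlap purely so the sketch is runnable.
    """
    hyp_words = set(hypothesis.lower().rstrip(".").split())
    prem_words = set(premise.lower().split())
    return len(hyp_words & prem_words) / max(len(hyp_words), 1)

def classify(text: str, labels=LABELS) -> str:
    """Return the label whose hypothesis scores highest for the text."""
    scores = {
        label: entailment_score(text, HYPOTHESIS_TEMPLATE.format(label))
        for label in labels
    }
    return max(scores, key=scores.get)

print(classify("I have constant anxiety and a diagnosed anxiety disorder"))
# → anxiety disorder
```

In a real pipeline the scorer would be a fine-tuned multilingual NLI model, which is why the NLI mode is slower: it requires one model pass per (text, label) pair rather than one pass per text.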


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11623104
DOI: http://dx.doi.org/10.7717/peerj-cs.2395

Publication Analysis

Top Keywords

mental health: 12
large language: 8
language models: 8
developing conversational: 8
conversational agents: 8
texts: 5
models: 4
models extracting: 4
extracting pre-annotating: 4
pre-annotating texts: 4

