Validity of Chatbot Use for Mental Health Assessment: Experimental Study.

JMIR Mhealth Uhealth

Department of Public Mental Health, Central Institute of Mental Health, Medical Faculty Mannheim, Heidelberg University, Mannheim, Germany.

Published: October 2022

Background: Mental disorders in adolescence and young adulthood are major public health concerns. Digital tools such as text-based conversational agents (ie, chatbots) are a promising technology for facilitating mental health assessment. However, the human-like interaction style of chatbots may induce potential biases, such as socially desirable responding (SDR), and may require further effort to complete assessments.

Objective: This study aimed to investigate the convergent and discriminant validity of chatbots for mental health assessments, the effect of assessment mode on SDR, and the effort required by participants for assessments using chatbots compared with established modes.

Methods: In a counterbalanced within-subject design, we assessed 2 different constructs-psychological distress (Kessler Psychological Distress Scale and Brief Symptom Inventory-18) and problematic alcohol use (Alcohol Use Disorders Identification Test-3)-in 3 modes (chatbot, paper-and-pencil, and web-based), and examined convergent and discriminant validity. In addition, we investigated the effect of mode on SDR, controlling for perceived sensitivity of items and individuals' tendency to respond in a socially desirable way, and we also assessed the perceived social presence of modes. Including a between-subject condition, we further investigated whether SDR is increased in chatbot assessments when applied in a self-report setting versus when human interaction may be expected. Finally, the effort (ie, complexity, difficulty, burden, and time) required to complete the assessments was investigated.

Results: A total of 146 young adults (mean age 24, SD 6.42 years; n=67, 45.9% female) were recruited from a research panel for laboratory experiments. The results revealed high positive correlations (all P<.001) of measures of the same construct across different modes, indicating the convergent validity of chatbot assessments. Furthermore, there were no correlations between the distinct constructs, indicating discriminant validity. Moreover, there were no differences in SDR between modes and whether human interaction was expected, although the perceived social presence of the chatbot mode was higher than that of the established modes (P<.001). Finally, greater effort (all P<.05) and more time were needed to complete chatbot assessments than for completing the established modes (P<.001).

Conclusions: Our findings suggest that chatbots may yield valid results. Furthermore, an understanding of chatbot design trade-offs in terms of potential strengths (ie, increased social presence) and limitations (ie, increased effort) when assessing mental health were established.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9664331PMC
http://dx.doi.org/10.2196/28082DOI Listing

Publication Analysis

Top Keywords

mental health
12
health assessment
8
socially desirable
8
convergent discriminant
8
discriminant validity
8
mode sdr
8
validity chatbot
4
mental
4
chatbot mental
4
health
4

Similar Publications

Investigating Smartphone-Based Sensing Features for Depression Severity Prediction: Observation Study.

J Med Internet Res

January 2025

Department of Clinical Psychology and Psychotherapy, Institute of Psychology and Education, Ulm University, Ulm, Germany.

Background: Unobtrusively collected objective sensor data from everyday devices like smartphones provide a novel paradigm to infer mental health symptoms. This process, called smart sensing, allows a fine-grained assessment of various features (eg, time spent at home based on the GPS sensor). Based on its prevalence and impact, depression is a promising target for smart sensing.

View Article and Find Full Text PDF

Aim: To investigate the detection and initial management of first psychotic episodes, as well as established schizophrenia, within the primary care of the Andalusian Health System.

Background: Delay in detecting and treating psychosis is associated with slower recovery, higher relapse risk, and poorer long-term outcomes. Often, psychotic episodes go unnoticed for years before a diagnosis is established.

View Article and Find Full Text PDF

This study aimed to assess measurement invariance for the Five-Factor Inventory for (Oltmanns & Widiger, 2020) across nine national samples from four continents ( = 6,342), and to validate a French translation in seven French-speaking national samples. All were convenience samples of adults. Exploratory factor analyses supported a four-factor structure in the French-speaking Western samples (Belgium, Canada, France, and Switzerland) while a three-factor structure was preferred in the French-speaking African samples (Burkina Faso and Togo), and no adequate structure was found in the Indian sample.

View Article and Find Full Text PDF

Antagonism is a personality domain located in most major trait models and is central to multiple personality disorders. This construct has been linked to many societally harmful externalizing behaviors (e.g.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!