Objective: This study aimed to investigate the performance of generative pre-trained transformer-4 (GPT-4) on the Certification Test for Mental Health Management and whether tuned prompts could improve its performance.

Methods: This study used a 3 × 2 factorial design to examine the performance according to test difficulty (courses) and prompt conditions. We prepared 200 multiple-choice questions (600 questions overall) for each course using the Certification Test for Mental Health Management (levels I-III) and essay questions from the level I test for the previous four examinations. Two conditions were used: a simple prompt condition using the questions as prompts and tuned prompt condition using techniques to obtain better answers. GPT-4 (gpt-4-0613) was adopted and implemented using the OpenAI API.

Results: The simple prompt condition scores were 74.5, 71.5, and 64.0 for levels III, II, and I, respectively. The tuned and simple prompt condition scores had no significant differences (Odds ratio = 1.03, 95% Confidence interval; 0.65-1.62, p = 0.908). Incorrect answers were observed in the simple prompt condition because of the inability to make choices, whereas no incorrect answers were observed in the tuned prompt condition. The average score for the essay questions under the simple prompt condition was 22.5 out of 50 points (45.0%).

Conclusion: GPT-4 had a sufficient knowledge network for occupational mental health, surpassing the criteria for levels II and III tests. For the level I test, which required the ability to describe more advanced knowledge accurately, GPT-4 did not meet the criteria. External information may be needed when using GPT-4 at this level. Although the tuned prompts did not significantly improve the performance, they were promising in avoiding unintended outputs and organizing output formats. UMIN trial registration: UMIN-CTR ID = UMIN000053582.

Download full-text PDF

Source
http://dx.doi.org/10.1539/sangyoeisei.2024-017-BDOI Listing

Publication Analysis

Top Keywords

prompt condition
28
simple prompt
20
mental health
16
certification test
12
test mental
12
health management
12
generative pre-trained
8
pre-trained transformer-4
8
tuned prompts
8
prompts improve
8

Similar Publications

Background: Severe acute malnutrition (SAM) is a severe condition causing bilateral pitting edema or signs of wasting in children, with a high mortality risk. An outpatient therapeutic program is recommended for managing SAM children without complications, but there is limited information on recovery time and its determinants.

Objective: This study aims to assess the time to recovery and its predictors among children aged 6-59 months with SAM admitted to the Outpatient therapeutic program in the Borena zone, Oromia region, Southern Ethiopia in 2023.

View Article and Find Full Text PDF

Parenthood inevitably includes caring for a child suffering from mild-moderate illness requiring access to health care. Most childhood illnesses can be managed in the community, and parents are encouraged to attend the most suitable primary care service for their needs. Yet the number of children visiting emergency departments with non-urgent illness continues to rise annually, with child attendance representing over 25% of the total workload.

View Article and Find Full Text PDF

Objective: The objective of this study is to present the clinical characteristics of immunoglobulin G4-related diseases (IgG4-RD) patients and describe associated overlap with autoimmune rheumatic diseases (ARDs).

Patients And Methods: This cross-sectional study included 81 patients with IgG4-RD who were recruited from 13 specialized rheumatology departments and centers across the country in collaboration with the Egyptian College of Rheumatology (ECR). Patients underwent a thorough history-taking and clinical examination.

View Article and Find Full Text PDF

Basic Science and Pathogenesis.

Alzheimers Dement

December 2024

Laboratory of Neuro Imaging (LONI), University of Southern California, Los Angeles, CA, USA.

Background: An elevated neutrophil-lymphocyte ratio (NLR) has been associated with Alzheimer's disease (AD). However, an elevated NLR has also been implicated in many other conditions that are risk factors for AD, prompting investigation into whether the NLR is directly linked with AD pathology or a result of underlying comorbidities.

Method: We explored the relationship between the NLR and AD biomarkers in the cerebrospinal fluid (CSF) of cognitively unimpaired (CU) subjects.

View Article and Find Full Text PDF

Basic Science and Pathogenesis.

Alzheimers Dement

December 2024

Memory and Aging Center, UCSF Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, CA, USA.

Background: Synaptic loss, a key indicator of cognitive decline in neurodegenerative diseases, lacks a clinical biomarker, but emerging PET-scan tracers targeting synaptic vesicle protein 2A (SV2A) show promise. The current understanding of regional changes in neurodegenerative disorders and the distribution of SV2A in the human brain is quite limited. This knowledge gap presents challenges when assessing the feasibility of using SV2A tracers in therapeutic applications.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!