ChatGPT and trainee performances in the management of maxillofacial patients.

Mélissa Peters Maxime Le Clercq Antoine Yanni Xavier Vanden Eynden Lalmand Martin Noémie Vanden Haute Szonja Tancredi Céline De Passe Edward Boutremans Jerome Lechien Didier Dequanter

J Stomatol Oral Maxillofac Surg

Department of Stomatology, Oral & Maxillofacial Surgery, CHU Saint Pierre, Brussels, Belgium; Faculty of Medicine, Department of Human Anatomy and Experimental Oncology UMONS, Mons, Belgium.

Published: September 2024

Introduction: ChatGPT is an artificial intelligence based large language model with the ability to generate human-like response to text input, its performance has already been the subject of several studies in different fields. The aim of this study was to evaluate the performance of ChatGPT in the management of maxillofacial clinical cases.

Materials And Methods: A total of 38 clinical cases consulting at the Stomatology-Maxillofacial Surgery Department were prospectively recruited and presented to ChatGPT, which was interrogated for diagnosis, differential diagnosis, management and treatment. The performance of trainees and ChatGPT was compared by three blinded board-certified maxillofacial surgeons using the AIPI score.

Results: The average total AIPI score assigned to the practitioners was 18.71 and 16.39 to ChatGPT, significantly lower (p < 0.001). According to the experts, ChatGPT was significantly less effective for diagnosis and treatment (p < 0.001). Following two of the three experts, ChatGPT was significantly less effective in considering patient data (p = 0.001) and suggesting additional examinations (p < 0.0001). The primary diagnosis proposed by ChatGPT was judged by the experts as not plausible and /or incomplete in 2.63 % to 18 % of the cases, the additional examinations were associated with inadequate examinations in 2.63 %, to 21.05 % of the cases and proposed an association of pertinent, but incomplete therapeutic findings in 18.42 % to 47.37 % of the cases, while the therapeutic findings were considered pertinent, necessary and inadequate in 18.42 % of cases.

Conclusions: ChatGPT appears less efficient in diagnosis, the selection of the most adequate additional examination and the proposition of pertinent and necessary therapeutic approaches.

Download full-text PDF	Source
http://dx.doi.org/10.1016/j.jormas.2024.102090	DOI Listing

Publication Analysis

Top Keywords

chatgpt

management maxillofacial

experts chatgpt

chatgpt effective

additional examinations

therapeutic findings

diagnosis

chatgpt trainee

trainee performances

performances management

Similar Publications

Preparing the AI-assisted animal scientist: faculty and student perspectives on enhancing animal science education with artificial intelligence.

Anim Front

December 2024

Department of Animal Science, Iowa State University, Ames, Iowa 50011, USA.

Allison Baumhover Stephanie L Hansen

View Article and Find Full Text PDF

Similar Publications

Me-LLaMA: Medical Foundation Large Language Models for Comprehensive Text Analysis and Beyond.

Res Sq

December 2024

Qianqian Xie Qingyu Chen Aokun Chen Cheng Peng Yan Hu

Recent advancements in large language models (LLMs) like ChatGPT and LLaMA have shown significant potential in medical applications, but their effectiveness is limited by a lack of specialized medical knowledge due to general-domain training. In this study, we developed Me-LLaMA, a new family of open-source medical LLMs that uniquely integrate extensive domain-specific knowledge with robust instruction-following capabilities. Me-LLaMA comprises foundation models (Me-LLaMA 13B and 70B) and their chat-enhanced versions, developed through comprehensive continual pretraining and instruction tuning of LLaMA2 models using both biomedical literature and clinical notes.

View Article and Find Full Text PDF

Similar Publications

Expert Consensus on Developing Information and Communication Technology-Based Patient Education Guidelines for Rheumatic Diseases in the Korea.

J Korean Med Sci

January 2025

Department of Rheumatology, Hanyang University Hospital for Rheumatic Diseases, Seoul, Korea.

Junghee Yoon Soo-Kyung Cho Se Rim Choi Soo-Bin Lee Juhee Cho

Background: This study aimed to identify key priorities for the development of guidelines for information and communication technology (ICT)-based patient education tailored to the needs of patients with rheumatic diseases (RDs) in the Republic of Korea, based on expert consensus.

Methods: A two-round modified Delphi study was conducted with 20 rheumatology, patient education, and digital health literacy experts. A total of 35 items covering 7 domains and 18 subdomains were evaluated.

View Article and Find Full Text PDF

Similar Publications

Gene expression and immunohistochemistry analysis of ADAMTS-1 and its substrates in odontogenic keratocyst.

Diagn Pathol

January 2025

Cell Culture Laboratory, School of Dentistry, Federal University of Para, Rua Augusto Correa, 01 Guama, Belem, PA, 66075110, Brazil.

Osvaldo Rodrigues de Souza Neto Antonia Taiane Lopes de Moraes Hellen Thais Fuzii Antonio Guilherme Maneschy Faria Vanessa Morais Freitas

Background: Considering the significant participation of the microenvironment in the local aggressiveness of odontogenic keratocysts, this study aims to evaluate the expression of ADAMTS-1 and its substrates, versican, aggrecan and brevican in this locally invasive odontogenic cyst.

Methods: Immunohistochemistry and polymerase chain reaction (PCR) were conducted on 30 cases of odontogenic keratocysts (OKCs) and 20 dental follicles (DFs).

Results: The immunohistochemical expression of these proteins was predominantly cytoplasmic and granular across all samples.

View Article and Find Full Text PDF

Similar Publications

Evaluation of Generative Artificial Intelligence Models in Predicting Pediatric Emergency Severity Index Levels.

Pediatr Emerg Care

January 2025

University of California Davis School of Medicine, Sacramento, CA.

Brandon Ho Meng Lu Xuan Wang Russell Butler Joshua Park

Objective: Evaluate the accuracy and reliability of various generative artificial intelligence (AI) models (ChatGPT-3.5, ChatGPT-4.0, T5, Llama-2, Mistral-Large, and Claude-3 Opus) in predicting Emergency Severity Index (ESI) levels for pediatric emergency department patients and assess the impact of medically oriented fine-tuning.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!