Innovation and application of Large Language Models (LLMs) in dentistry - a scoping review.

BDJ Open

Resident, Operative Dentistry & Endodontics, Aga Khan University Hospital, Karachi, Pakistan.

Published: December 2024

Objective: Large Language Models (LLMs) have revolutionized healthcare, yet their integration in dentistry remains underexplored. Therefore, this scoping review aims to systematically evaluate current literature on LLMs in dentistry.

Data Sources: The search covered PubMed, Scopus, IEEE Xplore, and Google Scholar, with studies selected based on predefined criteria. Data were extracted to identify applications, evaluation metrics, prompting strategies, and deployment levels of LLMs in dental practice.

Results: From 4079 records, 17 studies met the inclusion criteria. ChatGPT was the predominant model, mainly used for post-operative patient queries. Likert scale was the most reported evaluation metric, and only two studies employed advanced prompting strategies. Most studies were at level 3 of deployment, indicating practical application but requiring refinement.

Conclusion: LLMs showed extensive applicability in dental specialties; however, reliance on ChatGPT necessitates diversified assessments across multiple LLMs. Standardizing reporting practices and employing advanced prompting techniques are crucial for transparency and reproducibility, necessitating continuous efforts to optimize LLM utility and address existing challenges.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11609263PMC
http://dx.doi.org/10.1038/s41405-024-00277-6DOI Listing

Publication Analysis

Top Keywords

large language
8
language models
8
models llms
8
scoping review
8
prompting strategies
8
advanced prompting
8
llms
5
innovation application
4
application large
4
llms dentistry
4

Similar Publications

The use of large language models in detecting Chinese ultrasound report errors.

NPJ Digit Med

January 2025

Department of Diagnostic Ultrasound Imaging & Interventional Therapy, Zhejiang Cancer Hospital, Hangzhou, Zhejiang, China.

This retrospective study evaluated the efficacy of large language models (LLMs) in improving the accuracy of Chinese ultrasound reports. Data from three hospitals (January-April 2024) including 400 reports with 243 errors across six categories were analyzed. Three GPT versions and Claude 3.

View Article and Find Full Text PDF

A phenotype-based AI pipeline outperforms human experts in differentially diagnosing rare diseases using EHRs.

NPJ Digit Med

January 2025

Department of Computer Science and Technology & Institute for Artificial Intelligence & BNRist, Tsinghua University, Beijing, China.

Rare diseases, affecting ~350 million people worldwide, pose significant challenges in clinical diagnosis due to the lack of experienced physicians and the complexity of differentiating between numerous rare diseases. To address these challenges, we introduce PhenoBrain, a fully automated artificial intelligence pipeline. PhenoBrain utilizes a BERT-based natural language processing model to extract phenotypes from clinical texts in EHRs and employs five new diagnostic models for differential diagnoses of rare diseases.

View Article and Find Full Text PDF

Recent advancements of large language models (LLMs) like generative pre-trained transformer 4 (GPT-4) have generated significant interest among the scientific community. Yet, the potential of these models to be utilized in clinical settings remains largely unexplored. In this study, we investigated the abilities of multiple LLMs and traditional machine learning models to analyze emergency department (ED) reports and determine if the corresponding visits were due to symptomatic kidney stones.

View Article and Find Full Text PDF

Introduction: Prehospital identification of stroke patients with large vessel occlusion (LVO) is crucial to optimize transport to an endovascular thrombectomy (EVT)-capable center. Existing scores require medical or paramedical expertise and specific teachings. We aimed to validate a simple prehospital phone-based score for LVO identification.

View Article and Find Full Text PDF

Objective: This study aimed to assess people's preference between traditional and Artificial Intelligence (AI)-generated colon cancer staging Patient Education Materials (PEMs).

Methods: We assessed preference among patients and companions being seen for a non-cancer diagnosis at the UT Health Austin Colon and Rectal Surgery Clinic. Participants were blinded to the study concept of AI and generation method of PEMs (Traditional: National Cancer Institute and the American Cancer Society; AI-generated: ChatGPT and Google Bard).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!