ChatGPT in healthcare: A taxonomy and systematic review.

Jianning Li Amin Dada Behrus Puladi Jens Kleesiek Jan Egger

Comput Methods Programs Biomed

Institute for Artificial Intelligence in Medicine, University Hospital Essen (AöR), Girardetstraße 2, 45131 Essen, Germany; Center for Virtual and Extended Reality in Medicine (ZvRM), University Hospital Essen, University Medicine Essen, Hufelandstraße 55, 45147 Essen, Germany. Electronic address:

Published: March 2024

The release of ChatGPT by OpenAI has generated significant interest among the public and medical professionals, showcasing the ‘productization’ of advanced technologies that make AI more accessible.
A systematic review of publications on ChatGPT's applications in healthcare reveals it performs only moderately and is not reliable for clinical use, as it wasn't designed for such applications.
The study suggests that developing specialized NLP models trained on biomedical datasets is a better approach for addressing serious clinical needs.

The recent release of ChatGPT, a chat bot research project/product of natural language processing (NLP) by OpenAI, stirs up a sensation among both the general public and medical professionals, amassing a phenomenally large user base in a short time. This is a typical example of the 'productization' of cutting-edge technologies, which allows the general public without a technical background to gain firsthand experience in artificial intelligence (AI), similar to the AI hype created by AlphaGo (DeepMind Technologies, UK) and self-driving cars (Google, Tesla, etc.). However, it is crucial, especially for healthcare researchers, to remain prudent amidst the hype. This work provides a systematic review of existing publications on the use of ChatGPT in healthcare, elucidating the 'status quo' of ChatGPT in medical applications, for general readers, healthcare professionals as well as NLP scientists. The large biomedical literature database PubMed is used to retrieve published works on this topic using the keyword 'ChatGPT'. An inclusion criterion and a taxonomy are further proposed to filter the search results and categorize the selected publications, respectively. It is found through the review that the current release of ChatGPT has achieved only moderate or 'passing' performance in a variety of tests, and is unreliable for actual clinical deployment, since it is not intended for clinical applications by design. We conclude that specialized NLP models trained on (bio)medical datasets still represent the right direction to pursue for critical clinical applications.

Download full-text PDF	Source
http://dx.doi.org/10.1016/j.cmpb.2024.108013	DOI Listing

Publication Analysis

Top Keywords

chatgpt healthcare

systematic review

release chatgpt

general public

clinical applications

chatgpt

healthcare taxonomy

taxonomy systematic

review release

chatgpt chat

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!