Background: The utility of large language model (LLM)-based artificial intelligence (AI) chatbots in many aspects of healthcare is becoming apparent, though their ability to address patient concerns remains unknown. We sought to evaluate the performance of two well-known, freely accessible chatbots, ChatGPT and Google Bard, in responding to common questions about stroke rehabilitation posed by patients and their caregivers.
Methods: We collected questions from outpatients and their caregivers through a survey, categorised them by theme, and created representative questions to be posed to both chatbots. We then evaluated the chatbots' responses based on accuracy, safety, relevance, and readability. Interrater agreement was also tracked.
Results: Although both chatbots achieved similar overall scores, Google Bard performed slightly better in relevance and safety. Both provided readable responses with some general accuracy, but both also produced hallucinated content, were often not specific, and failed to recognise emotionally charged situations that could turn dangerous. Additionally, interrater agreement was low, highlighting the variability in physician acceptance of the chatbots' responses.
Conclusions: AI chatbots show potential in patient-facing support roles, but issues remain regarding safety, accuracy, and relevance. Future chatbots should address these problems to ensure that they can reliably and independently manage the concerns and questions of stroke patients and their caregivers.
Download full-text PDF | Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11111889 | PMC |
http://dx.doi.org/10.3389/fdgth.2024.1395501 | DOI Listing |
Updates Surg
January 2025
Alluri Sitarama Raju Academy of Medical Sciences, Eluru, India.
It is increasingly important for patients to have easy access to information about their medical conditions to improve their understanding of, and participation in, health care decisions. Artificial Intelligence (AI) has proven to be a fast, efficient, and effective tool for educating patients about their health care conditions. The aim of the study is to compare the responses provided by two AI tools, ChatGPT and Google Gemini, and to assess the conciseness and understandability of the information provided for three medical conditions: deep vein thrombosis, decubitus ulcers, and hemorrhoids.
Well-designed effective interventions promoting sustainable diets are urgently needed to benefit both human and planetary health. This study evaluated the feasibility, acceptability, and potential impact of a pilot blended digital intervention aimed at promoting sustainable diets. We conducted a series of ABA n-of-1 trials with baseline, intervention, and follow-up phases over the course of a year, involving twelve participants.
Trends Cancer
January 2025
Else Kroener Fresenius Center for Digital Health, Medical Faculty Carl Gustav Carus, Dresden University of Technology (TUD), Dresden, Germany; Department of Medicine I, University Hospital Dresden, Dresden, Germany; Medical Oncology, National Center for Tumor Diseases (NCT), University Hospital Heidelberg, Heidelberg, Germany.
The development of new therapeutic strategies such as immune checkpoint inhibitors (ICIs) and targeted therapies has increased the complexity of the treatment landscape for solid tumors. At the current rate of annual FDA approvals, the potential treatment options could increase tenfold over the next 5 years. The cost of personalized medicine technologies limits their accessibility, thus increasing socioeconomic disparities in the treated population.
J Voice
January 2025
Department of Statistics, Purdue University, Mathematical Sciences Building, 150 N. University Street, Room 231, West Lafayette, IN 47907.
Background: Methods to elicit the vital capacity (VC) include forced vital capacity (FVC) and slow vital capacity (SVC). Because the FVC maneuver can be affected by air trapping or inefficiencies in lung emptying vs. the SVC, the SVC-FVC difference may be substantial and diagnostically meaningful in elderly individuals and patients with respiratory obstruction.
Clin Rehabil
January 2025
School of Nursing, The Hong Kong Polytechnic University, Kowloon, Hong Kong.
Objective: To map evidence on the characteristics, effectiveness, and potential mechanisms of motor imagery interventions targeting cognitive function and depression in adults with neurological disorders and/or mobility impairments.
Data Sources: Six English databases (The Cochrane Library, PubMed, Embase, Scopus, Web of Science, and PsycINFO), two Chinese databases (CNKI and WanFang), and a gray literature database were searched from inception to December 2024.
Review Methods: This scoping review followed the Joanna Briggs Institute Scoping Review methodology.