Pharmacogenetics represents one of the most promising areas of precision medicine, with several guidelines for genetics-guided treatment ready for clinical use. Despite this, implementation has been slow, with few health systems incorporating the technology into their standard of care. One major barrier to uptake is the lack of education and awareness of pharmacogenetics among clinicians and patients. The introduction of large language models (LLMs) like GPT-4 has raised the possibility of medical chatbots that deliver timely information to clinicians, patients, and researchers with a simple interface. Although state-of-the-art LLMs have shown impressive performance at advanced tasks like medical licensing exams, in practice they still often provide false information, which is particularly hazardous in a clinical context. To quantify the extent of this issue, we developed a series of automated and expert-scored tests to evaluate the performance of chatbots in answering pharmacogenetics questions from the perspective of clinicians, patients, and researchers. We applied this benchmark to state-of-the-art LLMs and found that newer models like GPT-4o greatly outperform their predecessors, but still fall short of the standards required for clinical use. Our benchmark will be a valuable public resource for subsequent developments in this space as we work towards better clinical AI for pharmacogenetics.
