Red teaming, the practice of adversarially exposing unexpected or undesired model behaviors, is critical towards improving equity and accuracy of large language models, but non-model creator-affiliated red teaming is scant in healthcare. We convened teams of clinicians, medical and engineering students, and technical professionals (80 participants total) to stress-test models with real-world clinical cases and categorize inappropriate responses along axes of safety, privacy, hallucinations/accuracy, and bias. Six medically-trained reviewers re-analyzed prompt-response pairs and added qualitative annotations. Of 376 unique prompts (1504 responses), 20.1% were inappropriate (GPT-3.5: 25.8%; GPT-4.0: 16%; GPT-4.0 with Internet: 17.8%). Subsequently, we show the utility of our benchmark by testing GPT-4o, a model released after our event (20.4% inappropriate). 21.5% of responses appropriate with GPT-3.5 were inappropriate in updated models. We share insights for constructing red teaming prompts, and present our benchmark for iterative model assessments.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11889229PMC
http://dx.doi.org/10.1038/s41746-025-01542-0DOI Listing

Publication Analysis

Top Keywords

red teaming
16
red
4
teaming chatgpt
4
chatgpt medicine
4
medicine yield
4
yield real-world
4
real-world insights
4
model
4
insights model
4
model behavior
4

Similar Publications

Background: Health inequality in Latin America is particularly severe for individuals living with HIV (PLHIV) and key populations, such as men who have sex with men, transgender women, people who use drugs, and sex workers. Despite regional programs aimed at reducing health inequalities, such as the Sustainable Development Goals and the Sustainable Health Agenda for the Americas 2018-2030, the COVID-19 health crisis has exposed significant shortcomings in national healthcare systems for PLHIV and key populations. The multi-country, community-based research program, EPIC, was developed by Coalition PLUS within an network of community-based organizations engaged in the response to HIV and viral hepatitis.

View Article and Find Full Text PDF

Objectives: To compare strength parameters and pain ratings across three different positions forisometric hip abduction and adduction strength testing. Design: Cross-sectional study. Setting: Two elite European football academies.

View Article and Find Full Text PDF

Red teaming, the practice of adversarially exposing unexpected or undesired model behaviors, is critical towards improving equity and accuracy of large language models, but non-model creator-affiliated red teaming is scant in healthcare. We convened teams of clinicians, medical and engineering students, and technical professionals (80 participants total) to stress-test models with real-world clinical cases and categorize inappropriate responses along axes of safety, privacy, hallucinations/accuracy, and bias. Six medically-trained reviewers re-analyzed prompt-response pairs and added qualitative annotations.

View Article and Find Full Text PDF

Phosphatidylethanol clearance after packed red blood cell transfusion.

Clin Biochem

March 2025

Department of Laboratory Medicine and Pathology, Mayo Clinic Arizona, United States. Electronic address:

Objectives: Phosphatidylethanol (PEth) is a long-term marker of alcohol consumption used clinically for evaluating abstinence in patients including transplant candidates. Packed red blood cell (pRBC) transfusion can introduce exogenous PEth to recipients, complicating interpretation. This study evaluated the kinetics and duration of PEth 16:0/18:1 positivity post-transfusion.

View Article and Find Full Text PDF

Current management and future perspectives of covert hepatic encephalopathy in Japan: a nationwide survey.

J Gastroenterol

March 2025

Department of Gastroenterology/Internal Medicine, Graduate School of Medicine, Gifu University, 1-1 Yanagido, Gifu, 501-1194, Japan.

Background: Covert hepatic encephalopathy (CHE) leads to devastating outcomes in patients with cirrhosis. This study aims to elucidate the current management and future perspectives of CHE in Japan.

Methods: A questionnaire-based cross-sectional study was conducted among physicians involved in managing cirrhosis in Japan.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!