A PHP Error was encountered

Severity: Warning

Message: file_get_contents(https://...@pubfacts.com&api_key=b8daa3ad693db53b1410957c26c9a51b4908&a=1): Failed to open stream: HTTP request failed! HTTP/1.1 429 Too Many Requests

Filename: helpers/my_audit_helper.php

Line Number: 176

Backtrace:

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 176
Function: file_get_contents

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 250
Function: simplexml_load_file_from_url

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 3122
Function: getPubMedXML

File: /var/www/html/application/controllers/Detail.php
Line: 575
Function: pubMedSearch_Global

File: /var/www/html/application/controllers/Detail.php
Line: 489
Function: pubMedGetRelatedKeyword

File: /var/www/html/index.php
Line: 316
Function: require_once

Constructing synthetic datasets with generative artificial intelligence to train large language models to classify acute renal failure from clinical notes. | LitMetric

Constructing synthetic datasets with generative artificial intelligence to train large language models to classify acute renal failure from clinical notes.

J Am Med Inform Assoc

Division of Perioperative Informatics, Department of Anesthesiology, University of California, San Diego, La Jolla, CA 92037, United States.

Published: May 2024

Objectives: To compare performances of a classifier that leverages language models when trained on synthetic versus authentic clinical notes.

Materials And Methods: A classifier using language models was developed to identify acute renal failure. Four types of training data were compared: (1) notes from MIMIC-III; and (2, 3, and 4) synthetic notes generated by ChatGPT of varied text lengths of 15 (GPT-15 sentences), 30 (GPT-30 sentences), and 45 (GPT-45 sentences) sentences, respectively. The area under the receiver operating characteristics curve (AUC) was calculated from a test set from MIMIC-III.

Results: With RoBERTa, the AUCs were 0.84, 0.80, 0.84, and 0.76 for the MIMIC-III, GPT-15, GPT-30- and GPT-45 sentences training sets, respectively.

Discussion: Training language models to detect acute renal failure from clinical notes resulted in similar performances when using synthetic versus authentic training data.

Conclusion: The use of training data derived from protected health information may not be needed.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11105122PMC
http://dx.doi.org/10.1093/jamia/ocae081DOI Listing

Publication Analysis

Top Keywords

language models
16
acute renal
12
renal failure
12
failure clinical
8
clinical notes
8
synthetic versus
8
versus authentic
8
training data
8
gpt-45 sentences
8
training
5

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!