A PHP Error was encountered

Severity: Warning

Message: file_get_contents(https://...@pubfacts.com&api_key=b8daa3ad693db53b1410957c26c9a51b4908&a=1): Failed to open stream: HTTP request failed! HTTP/1.1 429 Too Many Requests

Filename: helpers/my_audit_helper.php

Line Number: 176

Backtrace:

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 176
Function: file_get_contents

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 250
Function: simplexml_load_file_from_url

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 3122
Function: getPubMedXML

File: /var/www/html/application/controllers/Detail.php
Line: 575
Function: pubMedSearch_Global

File: /var/www/html/application/controllers/Detail.php
Line: 489
Function: pubMedGetRelatedKeyword

File: /var/www/html/index.php
Line: 316
Function: require_once

Evaluating Report Text Variation and Informativeness: Natural Language Processing of CT Chest Imaging for Pulmonary Embolism. | LitMetric

AI Article Synopsis

  • The study aimed to analyze language variability in free text reports from pulmonary embolus (PE) CT studies and explore how well machine learning can predict PE diagnoses from these reports.
  • Data was collected from 1,133 chest CTs conducted in 2016, revealing significant differences in report lengths and terminology, with many terms being rarely used.
  • The machine learning analysis showed high sensitivity but low specificity and positive predictive value in diagnosing PE, highlighting the challenges of interpreting free text reports and suggesting the need for structured reporting formats.

Article Abstract

Objective: The aim of this study was to quantify the variability of language in free text reports of pulmonary embolus (PE) studies and to gauge the informativeness of free text to predict PE diagnosis using machine learning as proxy for human understanding.

Materials And Methods: All 1,133 consecutive chest CTs with contrast studies performed under a PE protocol and ordered in the emergency department in 2016 were selected from our departmental electronic workflow system. We used commercial text-mining and predictive analytics software to parse and describe all report text and to generate a suite of machine learning rules that sought to predict the "gold standard" radiological diagnosis of PE.

Results: There was extensive variation in the length of Findings section and Impression section texts across the reports, only marginally associated with a positive PE diagnosis. A marked concentration of terms was found: for example, 20 words were used in the Findings section of 93% of the reports, and 896 of 2,296 distinct words were each used in only one report's Impression section. In the validation set, machine learning rules had perfect sensitivity but imperfect specificity, a low positive predictive value of 73%, and a misclassification rate of 3%.

Conclusion: Use of free text reporting was associated with extensive variability in report length and report terms used. Interpretation of the free text was a difficult machine learning task and suggests potential difficulty for human recipients in fully understanding such reports. These results support the prospective assessment of the impact of a fully structured report template with at least some mandatory discrete fields on ease of use of reports and their understanding.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.jacr.2017.12.017DOI Listing

Publication Analysis

Top Keywords

free text
16
machine learning
16
report text
8
learning rules
8
text
6
reports
5
evaluating report
4
text variation
4
variation informativeness
4
informativeness natural
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!