The rising incidence and mortality of cancer have led to a growing body of research in the field. To learn from preexisting data, it has become important to capture as much information as possible on disease type, stage, treatment, and outcomes. Medical imaging reports are rich in this kind of information but exist only as free text, and extracting information from such unstructured reports is labor-intensive. Natural Language Processing (NLP) tools can make this extraction less time-consuming and more effective. In this study, we developed and compared different models for the classification of lung carcinoma reports using clinical concepts. The study was approved by the institutional ethics committee as a retrospective study with a waiver of informed consent. A clinical concept-based classification pipeline for lung carcinoma radiology reports was developed using rule-based and machine learning models, which were then compared. The machine learning models were XGBoost and two deep learning architectures based on bidirectional long short-term memory (Bi-LSTM) networks. A corpus of 1700 radiology reports, including computed tomography (CT) and positron emission tomography/computed tomography (PET/CT) reports, was used for development and testing. A further 501 radiology reports from the MIMIC-III Clinical Database version 1.4 were used for external validation. The pipeline achieved an overall F1 score of 0.94 on the internal set and 0.74 on external validation, with the rule-based algorithm using expert input giving the best performance. Among the machine learning models, the Bi-LSTM_dropout model performed better than the XGBoost model and the Bi-LSTM_simple model on the internal set, whereas on external validation the Bi-LSTM_simple model performed relatively better than the other two.
This pipeline can be used for clinical concept-based classification of lung carcinoma radiology reports from a large corpus and also for automated annotation of these reports.
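
To make the idea of rule-based, concept-level classification and its F1 evaluation more concrete, the following is a minimal Python sketch. It is not the pipeline described in the abstract: the concept labels, the regular expressions, the simple negation check, and the choice of micro-averaged F1 are all assumptions made for illustration, since the abstract does not specify the rules, the concept set, or the averaging method.

```python
import re

# Hypothetical concept lexicon: labels and patterns are illustrative only,
# not the clinical concepts or expert rules used in the study.
CONCEPT_PATTERNS = {
    "mass": re.compile(r"\b(mass|nodule|lesion)\b", re.IGNORECASE),
    "metastasis": re.compile(r"\bmetasta(sis|ses|tic)\b", re.IGNORECASE),
    "effusion": re.compile(r"\bpleural effusion\b", re.IGNORECASE),
}

# Assumed, very simple negation rule: a negation cue anywhere earlier
# in the same sentence cancels the concept mention.
NEGATION = re.compile(r"\b(no|without|negative for)\b[^.]*$", re.IGNORECASE)


def extract_concepts(report):
    """Return the set of non-negated concept labels found in a report."""
    found = set()
    # Split into sentences on whitespace that follows ., ! or ?
    for sentence in re.split(r"(?<=[.!?])\s+", report):
        for label, pattern in CONCEPT_PATTERNS.items():
            match = pattern.search(sentence)
            if match and not NEGATION.search(sentence[:match.start()]):
                found.add(label)
    return found


def micro_f1(gold, predicted):
    """Micro-averaged F1 over per-report label sets (an assumed metric variant)."""
    tp = sum(len(g & p) for g, p in zip(gold, predicted))
    fp = sum(len(p - g) for g, p in zip(gold, predicted))
    fn = sum(len(g - p) for g, p in zip(gold, predicted))
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    text = (
        "A 3 cm mass is seen in the right upper lobe. "
        "No pleural effusion. No evidence of metastatic disease."
    )
    print(extract_concepts(text))
```

A real system would need a far richer lexicon, proper negation and uncertainty handling, and expert-curated rules; the sketch only shows the general shape of mapping free-text sentences to concept labels and scoring the result against annotated labels.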

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10287609
DOI: http://dx.doi.org/10.1007/s10278-023-00787-z
