Purpose: Radiology reports mostly contain free-text, which makes it challenging to obtain structured data. Natural language processing (NLP) techniques transform free-text reports into machine-readable document vectors that are important for creating reliable, scalable methods for data analysis. The aim of this study is to classify unstructured radiograph reports according to fractures of the distal fibula and to find the best text mining method.

Materials & Methods: We established a novel German language report dataset: a designated search engine was used to identify radiographs of the ankle and the reports were manually labeled according to fractures of the distal fibula. This data was used to establish a machine learning pipeline, which implemented the text representation methods bag-of-words (BOW), term frequency-inverse document frequency (TF-IDF), principal component analysis (PCA), non-negative matrix factorization (NMF), latent Dirichlet allocation (LDA), and document embedding (doc2vec). The extracted document vectors were used to train neural networks (NN), support vector machines (SVM), and logistic regression (LR) to recognize distal fibula fractures. The results were compared via cross-tabulations of the accuracy (acc) and area under the curve (AUC).

Results: In total, 3268 radiograph reports were included, of which 1076 described a fracture of the distal fibula. Comparison of the text representation methods showed that BOW achieved the best results (AUC = 0.98; acc = 0.97), followed by TF-IDF (AUC = 0.97; acc = 0.96), NMF (AUC = 0.93; acc = 0.92), PCA (AUC = 0.92; acc = 0.9), LDA (AUC = 0.91; acc = 0.89) and doc2vec (AUC = 0.9; acc = 0.88). When comparing the different classifiers, NN (AUC = 0,91) proved to be superior to SVM (AUC = 0,87) and LR (AUC = 0,85).

Conclusion: An automated classification of unstructured reports of radiographs of the ankle can reliably detect findings of fractures of the distal fibula. A particularly suitable feature extraction method is the BOW model.

Key Points: · The aim was to classify unstructured radiograph reports according to distal fibula fractures.. · Our automated classification system can reliably detect fractures of the distal fibula.. · A particularly suitable feature extraction method is the BOW model..

Citation Format: · Dewald CL, Balandis A, Becker LS et al. Automated Classification of Free-Text Radiology Reports: Using Different Feature Extraction Methods to Identify Fractures of the Distal Fibula. Fortschr Röntgenstr 2023; 195: 713 - 719.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10368466PMC
http://dx.doi.org/10.1055/a-2061-6562DOI Listing

Publication Analysis

Top Keywords

distal fibula
36
fractures distal
24
automated classification
16
feature extraction
16
radiology reports
12
radiograph reports
12
reports
9
distal
9
fibula
9
classification free-text
8

Similar Publications

Distal fibula fractures involving the ankle are one of the most common fractures, often requiring open reduction and internal fixation with plates and screws. The increased incidence of potential wound complications arising from open reduction methods led to a rejuvenated interest in the application of minimally invasive methods like intramedullary nailing of the fibula in the management of ankle fractures and isolated distal fibular fractures. A literature search was performed using Medline, Cochrane, and Embase from 1993 to 2023.

View Article and Find Full Text PDF
Article Synopsis
  • The study aims to improve visualization of arteries during endovascular procedures for peripheral artery disease by using an image registration technique that fuses X-ray and CT angiography images.
  • The method involved aligning digital images based on the positions of the bones and achieved successful registration in most cases, with accurate alignment of less than 1 mm in distance.
  • The results indicate that this technique is clinically viable for guiding interventions, as it allows for early detection of potential complications like guidewire perforations while maintaining a reasonable processing time.
View Article and Find Full Text PDF

The tibiofibular mortise - anatomical controversies and their clinical importance: a historical and pictorial essay.

Int Orthop

January 2025

Institute of Anatomy, First Faculty of Medicine, Charles University, U Nemocnice 3, Prague 2, Prague, Czech Republic.

Introduction: During 280 years of studies of the anatomy of the distal tibiofibular articulation, there have arisen many unclear issues regarding the description of individual structures and their terminology. These historical inaccuracies were subsequently reflected in the clinical practice.

Materials And Methods: A literature search of original publications and historical sources was performed.

View Article and Find Full Text PDF

Prevalence of Complications Due to Transphyseal Hematogenous Osteomyelitis.

J Bone Joint Surg Am

December 2024

Pediatric Orthopaedic Unit, Pediatric Surgery Service, Geneva University Hospitals, Geneva, Switzerland.

Background: Transphyseal hematogenous osteomyelitis (THO) is a common infectious condition, being present in 25% of patients with hematogenous osteomyelitis. A large proportion of pediatric hematogenous osteomyelitis infections can spread through the growth cartilage and therefore may be potentially responsible for growth disorders, leading to limb-length discrepancy or angular deformities. The purpose of the present study was to identify both the prevalence of complications caused by transphyseal osteomyelitis and factors influencing their occurrence.

View Article and Find Full Text PDF

The purpose of this study was to establish typical dose values at orthopaedic operating rooms of the Larnaca General Hospital (LGH). Kerma area product (KAP), fluoroscopy time (FT) and cumulative air-kerma (K) measurements were collected for 821 patients who underwent common and reproducible trauma surgery over a five-year period, with three mobile C-arm systems; two equipped with an image-intensifier and one with a flat-panel detector. Dose indices were automatically extracted from radiation dose structured reports or DICOM meta-data files archived in the PACS, using custom-made software.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!