Automated Classification of Radiographic Knee Osteoarthritis Severity Using Deep Neural Networks.

Radiol Artif Intell

Departments of Biomedical Data Science (K.A.T., S.L.F., G.R.V.), Bioengineering (Ł.K., S.L.D.), and Radiology (G.E.G.), Stanford University, Clark Center, 318 Campus Dr, Room S321, Stanford, CA 94305; Department of Radiology, Erasmus University Rotterdam, Rotterdam, the Netherlands (E.H.G.O.); and Department of Mechanical Engineering, Carnegie Mellon University, Pittsburgh, Pa (E.H.).

Published: March 2020

Purpose: To develop an automated model for staging knee osteoarthritis severity from radiographs and to compare its performance to that of musculoskeletal radiologists.

Materials And Methods: Radiographs from the Osteoarthritis Initiative staged by a radiologist committee using the Kellgren-Lawrence (KL) system were used. Before using the images as input to a convolutional neural network model, they were standardized and augmented automatically. The model was trained with 32 116 images, tuned with 4074 images, evaluated with a 4090-image test set, and compared to two individual radiologists using a 50-image test subset. Saliency maps were generated to reveal features used by the model to determine KL grades.

Results: With committee scores used as ground truth, the model had an average F1 score of 0.70 and an accuracy of 0.71 for the full test set. For the 50-image subset, the best individual radiologist had an average F1 score of 0.60 and an accuracy of 0.60; the model had an average F1 score of 0.64 and an accuracy of 0.66. Cohen weighted κ between the committee and model was 0.86, comparable to intraexpert repeatability. Saliency maps identified sites of osteophyte formation as influential to predictions.

Conclusion: An end-to-end interpretable model that takes full radiographs as input and predicts KL scores with state-of-the-art accuracy, performs as well as musculoskeletal radiologists, and does not require manual image preprocessing was developed. Saliency maps suggest the model's predictions were based on clinically relevant information. © RSNA, 2020.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7104788PMC
http://dx.doi.org/10.1148/ryai.2020190065DOI Listing

Publication Analysis

Top Keywords

saliency maps
12
average score
12
knee osteoarthritis
8
osteoarthritis severity
8
model
8
test set
8
model average
8
automated classification
4
classification radiographic
4
radiographic knee
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!