"What is relevant in a text document?": An interpretable machine learning approach.

Leila Arras, Franziska Horn, Grégoire Montavon, Klaus-Robert Müller, Wojciech Samek

PLoS One

Machine Learning Group, Fraunhofer Heinrich Hertz Institute, Berlin, Germany.

Published: October 2017

Text documents can be described by a number of abstract concepts such as semantic category, writing style, or sentiment. Machine learning (ML) models have been trained to automatically map documents to these abstract concepts, making it possible to annotate very large text collections, more than a human could process in a lifetime. Besides predicting the text's category very accurately, it is also highly desirable to understand how and why the categorization process takes place. In this paper, we demonstrate that such understanding can be achieved by tracing the classification decision back to individual words using layer-wise relevance propagation (LRP), a recently developed technique for explaining predictions of complex non-linear classifiers. We train two word-based ML models, a convolutional neural network (CNN) and a bag-of-words SVM classifier, on a topic categorization task and adapt the LRP method to decompose the predictions of these models onto words. The resulting scores indicate how much each individual word contributes to the overall classification decision. This enables one to distill relevant information from text documents without an explicit semantic information extraction step. We further use the word-wise relevance scores to generate novel vector-based document representations that capture semantic information. Based on these document vectors, we introduce a measure of model explanatory power and show that, although the SVM and CNN models perform similarly in terms of classification accuracy, the CNN exhibits a higher level of explainability, which makes it more comprehensible for humans and potentially more useful for other applications.
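As a rough illustration of what decomposing a prediction onto words can look like, the sketch below handles only the simplest case: a linear bag-of-words classifier with score f(x) = w·x + b. It is not the authors' implementation. The rule for sharing the bias equally among the words present in the document, the toy vocabulary, the weights, and the function names `word_relevances` and `document_vector` are all assumptions made for this example. The relevance-weighted sum of word embeddings is likewise just one plausible way to turn word-wise scores into a document vector.

```python
import numpy as np

def word_relevances(weights, bias, counts):
    """Decompose a linear score f(x) = w.x + b onto vocabulary entries.

    Each entry i receives w_i * x_i. The bias is shared equally among the
    words that occur in the document (an assumption of this sketch), so the
    relevances sum to f(x).
    """
    weights = np.asarray(weights, dtype=float)
    counts = np.asarray(counts, dtype=float)
    present = counts > 0
    n_present = int(present.sum())
    relevance = weights * counts
    if n_present > 0:
        relevance[present] += bias / n_present
    return relevance

def document_vector(relevance, embeddings):
    """Relevance-weighted sum of word embeddings (one row per vocabulary entry)."""
    return relevance @ np.asarray(embeddings, dtype=float)

# Hypothetical toy data: weights of a linear model for a "space" topic.
vocab = ["orbit", "shuttle", "the", "goalie"]
weights = np.array([1.5, 2.0, 0.0, -1.0])
bias = -0.5
counts = np.array([2, 1, 3, 0])  # bag-of-words counts for one document

relevance = word_relevances(weights, bias, counts)
score = float(weights @ counts + bias)

# Words sorted from most to least relevant for this decision.
ranking = [vocab[i] for i in np.argsort(-relevance)]
```

In this toy setup the per-word scores add up to the classifier output, which is the conservation property that relevance decompositions aim for. Words absent from the document receive zero relevance.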

Full text:
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5553725
PLOS: http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0181142
