Text documents can be described by a number of abstract concepts such as semantic category, writing style, or sentiment. Machine learning (ML) models have been trained to automatically map documents to these abstract concepts, allowing to annotate very large text collections, more than could be processed by a human in a lifetime. Besides predicting the text's category very accurately, it is also highly desirable to understand how and why the categorization process takes place. In this paper, we demonstrate that such understanding can be achieved by tracing the classification decision back to individual words using layer-wise relevance propagation (LRP), a recently developed technique for explaining predictions of complex non-linear classifiers. We train two word-based ML models, a convolutional neural network (CNN) and a bag-of-words SVM classifier, on a topic categorization task and adapt the LRP method to decompose the predictions of these models onto words. Resulting scores indicate how much individual words contribute to the overall classification decision. This enables one to distill relevant information from text documents without an explicit semantic information extraction step. We further use the word-wise relevance scores for generating novel vector-based document representations which capture semantic information. Based on these document vectors, we introduce a measure of model explanatory power and show that, although the SVM and CNN models perform similarly in terms of classification accuracy, the latter exhibits a higher level of explainability which makes it more comprehensible for humans and potentially more useful for other applications.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5553725PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0181142PLOS

Publication Analysis

Top Keywords

relevant text
8
machine learning
8
text documents
8
abstract concepts
8
classification decision
8
"what relevant
4
text
4
text document?"
4
document?" interpretable
4
interpretable machine
4

Similar Publications

Objective: Public Involvement (PI) in applied health and social care research has grown exponentially in the UK. This review aims to synthesise published UK evidence that evaluates the process and/or outcome(s) of PI in applied health and social care research to identify key contextual factors, effective strategies, outcomes and public partner experiences underpinning meaningful PI in research.

Methods: Following a pre-registered protocol, we systematically searched four databases and two key journals for studies conducted within the UK between January 2006 and July 2024.

View Article and Find Full Text PDF

Unlabelled: Since the inception of transplantation, it has been crucial to ensure that organ or tissue donations are made with valid informed consent to avoid concerns about coercion or exploitation. This issue is particularly challenging when it comes to infants and younger children, insofar as they are unable to provide consent. Despite their vulnerability, infants' organs and tissues are considered valuable for biomedical purposes due to their size and unique properties.

View Article and Find Full Text PDF

Background: Therapeutic hypothermia improves outcomes in experimental stroke models, especially after ischemia-reperfusion injury. In recent years, the safety and efficacy of hypothermia combining thrombolysis or mechanical thrombectomy have attracted widespread attention. The primary objective of the study was to evaluate the effectiveness and safety of hypothermia by combining reperfusion therapy in acute ischemic stroke patients.

View Article and Find Full Text PDF

Background: Recently, deep learning has become a popular area of research, and has revolutionized the diagnosis and prediction of ocular diseases, especially fundus diseases. This study aimed to conduct a bibliometric analysis of deep learning in the field of ophthalmology to describe international research trends and examine the current research directions.

Methods: This cross-sectional bibliometric analysis examined the development of research on deep learning in the field of ophthalmology and its sub-topics from 2015 to 2024.

View Article and Find Full Text PDF

Objectives: To investigate baseline patient characteristics associated with the risk of computed tomography (CT)-based sarcopenia and assess whether sarcopenia and other morphometric parameters influence survival outcomes in patients with liver metastases and cholangiocarcinoma after Yttrium-90 radioembolization.

Materials And Methods: We retrospectively analyzed 120 cancer patients (mean age, 62 ± 13.3 years, 61 men) who underwent preprocedural CT.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!