Unmasking Clever Hans predictors and assessing what machines really learn.

Nat Commun

Department of Electrical Engineering and Computer Science, Technische Universität Berlin, Marchstr. 23, 10587, Berlin, Germany.

Published: March 2019

Current learning machines have successfully solved hard application problems, reaching high accuracy and displaying seemingly intelligent behavior. Here we apply recent techniques for explaining decisions of state-of-the-art learning machines and analyze various tasks from computer vision and arcade games. This showcases a spectrum of problem-solving behaviors ranging from naive and short-sighted, to well-informed and strategic. We observe that standard performance evaluation metrics can be oblivious to distinguishing these diverse problem solving behaviors. Furthermore, we propose our semi-automated Spectral Relevance Analysis that provides a practically effective way of characterizing and validating the behavior of nonlinear learning machines. This helps to assess whether a learned model indeed delivers reliably for the problem that it was conceived for. Furthermore, our work intends to add a voice of caution to the ongoing excitement about machine intelligence and pledges to evaluate and judge some of these recent successes in a more nuanced manner.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6411769PMC
http://dx.doi.org/10.1038/s41467-019-08987-4DOI Listing

Publication Analysis

Top Keywords

learning machines
12
unmasking clever
4
clever hans
4
hans predictors
4
predictors assessing
4
machines
4
assessing machines
4
machines learn
4
learn current
4
current learning
4

Similar Publications

Background: Recent research has revealed the potential value of machine learning (ML) models in improving prognostic prediction for patients with trauma. ML can enhance predictions and identify which factors contribute the most to posttraumatic mortality. However, no studies have explored the risk factors, complications, and risk prediction of preoperative and postoperative traumatic coagulopathy (PPTIC) in patients with trauma.

View Article and Find Full Text PDF

Aim: o point out how novel analysis tools of AI can make sense of the data acquired during OL and OC diagnosis and treatment in an effort to help improve and standardize the patient pathway for these disease.

Material And Methods: ultilizing programmed detection of heterogeneus OL and OC habitats through radiomics and correlate to imaging based tumor grading plus a literature review.

Results: new analysis pipelines have been generated for integrating imaging and patient demographic data and identify new multi-omic biomarkers of response prediction and tumour grading using cutting-edge artificial intelligence (AI) in OL and OC.

View Article and Find Full Text PDF

Background: Hematologic changes after splenectomy and hyperthermic intraperitoneal chemotherapy (HIPEC) can complicate postoperative assessment of infection. This study aimed to develop a machine-learning model to predict postoperative infection after cytoreductive surgery (CRS) and HIPEC with splenectomy.

Methods: The study enrolled patients in the national TriNetX database and at the Johns Hopkins Hospital (JHH) who underwent splenectomy during CRS/HIPEC from 2010 to 2024.

View Article and Find Full Text PDF

The drug combination is an attractive approach for cancer treatment. PARP and kinase inhibitors have recently been explored against cancer cells, but their combination has not been investigated comprehensively. In this study, we used various drug combination databases to build ML models for drug combinations against brain cancer cells.

View Article and Find Full Text PDF

Background: The prognostic prediction of pancreatic ductal adenocarcinoma (PDAC) remains challenging. This study aimed to develop a radiomics model to predict Ki-67 expression status in PDAC patients using radiomics features from dual-phase enhanced CT, and integrated clinical characteristics to create a radiomics-clinical nomogram for prognostic prediction.

Methods: In this retrospective study, data were collected from 124 PDAC patients treated surgically at a single center, from January 2017 to March 2023.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!