Calculation of exact Shapley values for explaining support vector machine models using the radial basis function kernel.

Sci Rep

Department of Life Science Informatics and Data Science, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Friedrich-Hirzebruch-Allee 5/6, 53115, Bonn, Germany.

Published: November 2023

Machine learning (ML) algorithms are extensively used in pharmaceutical research. Most ML models have black-box character, thus preventing the interpretation of predictions. However, rationalizing model decisions is of critical importance if predictions should aid in experimental design. Accordingly, in interdisciplinary research, there is growing interest in explaining ML models. Methods devised for this purpose are a part of the explainable artificial intelligence (XAI) spectrum of approaches. In XAI, the Shapley value concept originating from cooperative game theory has become popular for identifying features determining predictions. The Shapley value concept has been adapted as a model-agnostic approach for explaining predictions. Since the computational time required for Shapley value calculations scales exponentially with the number of features used, local approximations such as Shapley additive explanations (SHAP) are usually required in ML. The support vector machine (SVM) algorithm is one of the most popular ML methods in pharmaceutical research and beyond. SVM models are often explained using SHAP. However, there is only limited correlation between SHAP and exact Shapley values, as previously demonstrated for SVM calculations using the Tanimoto kernel, which limits SVM model explanation. Since the Tanimoto kernel is a special kernel function mostly applied for assessing chemical similarity, we have developed the Shapley value-expressed radial basis function (SVERAD), a computationally efficient approach for the calculation of exact Shapley values for SVM models based upon radial basis function kernels that are widely applied in different areas. SVERAD is shown to produce meaningful explanations of SVM predictions.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10638308PMC
http://dx.doi.org/10.1038/s41598-023-46930-2DOI Listing

Publication Analysis

Top Keywords

exact shapley
12
shapley values
12
radial basis
12
basis function
12
calculation exact
8
shapley
8
support vector
8
vector machine
8
shapley concept
8
svm models
8

Similar Publications

Protocol to calculate and compare exact Shapley values for different kernels in support vector machine models using binary features.

STAR Protoc

December 2024

Department of Life Science Informatics and Data Science, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Friedrich-Hirzebruch-Allee 5/6, 53115 Bonn, Germany; Lamarr Institute for Machine Learning and Artificial Intelligence, Friedrich-Hirzebruch-Allee 5/6, 53115 Bonn, Germany. Electronic address:

The Shapley value formalism from cooperative game theory was adapted to explain predictions of machine learning models. Here, we present a protocol to calculate and compare exact Shapley values for support vector machine models with commonly used kernels and binary input features. We describe steps for installing software, preparing data, and calculating Shapley values with customizable Python scripts.

View Article and Find Full Text PDF

Background: Pathological axillary lymph node (pALN) burden is an important factor for treatment decision-making in clinical T1-T2 (cT1-T2) stage breast cancer. Preoperative assessment of the pALN burden and prognosis aids in the individualized selection of therapeutic approaches.

Purpose: To develop and validate a machine learning (ML) model based on clinicopathological and MRI characteristics for assessing pALN burden and survival in patients with cT1-T2 stage breast cancer.

View Article and Find Full Text PDF

Prediction of sustained opioid use in children and adolescents using machine learning.

Br J Anaesth

August 2024

School of Public Health, Faculty of Medical and Health Sciences, Tel Aviv University, Tel Aviv, Israel; School of Public Health, Faculty of Medical and Health Sciences, Porter School of the Environment and Earth Sciences, Faculty of Exact Sciences, Tel Aviv University, Tel Aviv, Israel. Electronic address:

Article Synopsis
  • The study focused on developing a machine learning classifier to distinguish between occasional and sustained opioid users among children and adolescents in outpatient settings.
  • Data from over 29,000 patients under 19 years old was analyzed, using various health-related factors to predict long-term opioid use after their first prescription.
  • The model showed good predictive performance, with a notable ability to identify at-risk patients, and the online tool developed can help clinicians assess opioid use risk effectively.
View Article and Find Full Text PDF

Psilocybin, ketamine, and MDMA are psychoactive compounds that exert behavioral effects with distinguishable but also overlapping features. The growing interest in using these compounds as therapeutics necessitates preclinical assays that can accurately screen psychedelics and related analogs. We posit that a promising approach may be to measure drug action on markers of neural plasticity in native brain tissues.

View Article and Find Full Text PDF

Protocol to explain support vector machine predictions via exact Shapley value computation.

STAR Protoc

June 2024

Department of Life Science Informatics and Data Science, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Friedrich-Hirzebruch-Allee 5/6, 53115 Bonn, Germany; Lamarr Institute for Machine Learning and Artificial Intelligence, Friedrich-Hirzebruch-Allee 5/6, 53115 Bonn, Germany. Electronic address:

Shapley values from cooperative game theory are adapted for explaining machine learning predictions. For large feature sets used in machine learning, Shapley values are approximated. We present a protocol for two techniques for explaining support vector machine predictions with exact Shapley value computation.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!