For supervised classification problems involving design, control, and other practical purposes, users are not only interested in finding a highly accurate classifier but they also demand that the obtained classifier be easily interpretable. While the definition of interpretability of a classifier can vary from case to case, here, by a humanly interpretable classifier, we restrict it to be expressed in simplistic mathematical terms. As a novel approach, we represent a classifier as an assembly of simple mathematical rules using a nonlinear decision tree (NLDT). Each conditional (nonterminal) node of the tree represents a nonlinear mathematical rule (split-rule) involving features in order to partition the dataset in the given conditional node into two nonoverlapping subsets. This partitioning is intended to minimize the impurity of the resulting child nodes. By restricting the structure of the split-rule at each conditional node and depth of the decision tree, the interpretability of the classifier is ensured. The nonlinear split-rule at a given conditional node is obtained using an evolutionary bilevel optimization algorithm, in which while the upper level focuses on arriving at an interpretable structure of the split-rule, the lower level achieves the most appropriate weights (coefficients) of individual constituents of the rule to minimize the net impurity of two resulting child nodes. The performance of the proposed algorithm is demonstrated on a number of controlled test problems, existing benchmark problems, and industrial problems. Results on 2-500 feature problems are encouraging and open up further scopes of applying the proposed approach to more challenging and complex classification tasks.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TCYB.2020.3033003 | DOI Listing |
Chaos
January 2025
Classe di Scienze, Scuola Normale Superiore, Piazza dei Cavalieri 7, 56126 Pisa, Italy.
Modeling how a shock propagates in a temporal network and how the system relaxes back to equilibrium is challenging but important in many applications, such as financial systemic risk. Most studies, so far, have focused on shocks hitting a link of the network, while often it is the node and its propensity to be connected that are affected by a shock. Using the configuration model-a specific exponential random graph model-as a starting point, we propose a vector autoregressive (VAR) framework to analytically compute the Impulse Response Function (IRF) of a network metric conditional to a shock on a node.
View Article and Find Full Text PDFAim: Lymph node metastasis is an adverse prognostic factor in pancreatic ductal adenocarcinoma. However, it remains a challenge to predict lymph node metastasis using preoperative imaging alone. We used machine learning (combining preoperative imaging findings, tumor markers, and clinical information) to create a novel prediction model for lymph node metastasis in resectable pancreatic ductal adenocarcinoma.
View Article and Find Full Text PDFSurgery
January 2025
Division of Colon & Rectal Surgery, Department of Surgery, University of Minnesota, Minneapolis, MN. Electronic address:
Sci Rep
January 2025
Faculty of Education and Arts, Australian Catholic University, Sydney, NSW, 2118, Australia.
Every node in a network is said to be resolved if it can be uniquely identified by a vector of distances to a specific set of nodes. The metric dimension is equivalent to the least possible cardinal number of a resolving set. Conditional resolving sets are obtained by imposing various constraints on resolving set.
View Article and Find Full Text PDFPLoS One
December 2024
Department of Computer Science and Engineering, National Sun Yat-sen University, Kaohsiung, Taiwan.
Kawasaki Disease (KD) is a rare febrile illness affecting infants and young children, potentially leading to coronary artery complications and, in severe cases, mortality if untreated. However, KD is frequently misdiagnosed as a common fever in clinical settings, and the inherent data imbalance further complicates accurate prediction when using traditional machine learning and statistical methods. This paper introduces two advanced approaches to address these challenges, enhancing prediction accuracy and generalizability.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!