For supervised classification problems involving design, control, and other practical purposes, users are not only interested in finding a highly accurate classifier but they also demand that the obtained classifier be easily interpretable. While the definition of interpretability of a classifier can vary from case to case, here, by a humanly interpretable classifier, we restrict it to be expressed in simplistic mathematical terms. As a novel approach, we represent a classifier as an assembly of simple mathematical rules using a nonlinear decision tree (NLDT). Each conditional (nonterminal) node of the tree represents a nonlinear mathematical rule (split-rule) involving features in order to partition the dataset in the given conditional node into two nonoverlapping subsets. This partitioning is intended to minimize the impurity of the resulting child nodes. By restricting the structure of the split-rule at each conditional node and depth of the decision tree, the interpretability of the classifier is ensured. The nonlinear split-rule at a given conditional node is obtained using an evolutionary bilevel optimization algorithm, in which while the upper level focuses on arriving at an interpretable structure of the split-rule, the lower level achieves the most appropriate weights (coefficients) of individual constituents of the rule to minimize the net impurity of two resulting child nodes. The performance of the proposed algorithm is demonstrated on a number of controlled test problems, existing benchmark problems, and industrial problems. Results on 2-500 feature problems are encouraging and open up further scopes of applying the proposed approach to more challenging and complex classification tasks.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TCYB.2020.3033003DOI Listing

Publication Analysis

Top Keywords

conditional node
12
bilevel optimization
8
nonlinear decision
8
classification problems
8
interpretability classifier
8
decision tree
8
impurity child
8
child nodes
8
structure split-rule
8
split-rule conditional
8

Similar Publications

Modeling shock propagation and resilience in financial temporal networks.

Chaos

January 2025

Classe di Scienze, Scuola Normale Superiore, Piazza dei Cavalieri 7, 56126 Pisa, Italy.

Modeling how a shock propagates in a temporal network and how the system relaxes back to equilibrium is challenging but important in many applications, such as financial systemic risk. Most studies, so far, have focused on shocks hitting a link of the network, while often it is the node and its propensity to be connected that are affected by a shock. Using the configuration model-a specific exponential random graph model-as a starting point, we propose a vector autoregressive (VAR) framework to analytically compute the Impulse Response Function (IRF) of a network metric conditional to a shock on a node.

View Article and Find Full Text PDF

Aim: Lymph node metastasis is an adverse prognostic factor in pancreatic ductal adenocarcinoma. However, it remains a challenge to predict lymph node metastasis using preoperative imaging alone. We used machine learning (combining preoperative imaging findings, tumor markers, and clinical information) to create a novel prediction model for lymph node metastasis in resectable pancreatic ductal adenocarcinoma.

View Article and Find Full Text PDF
Article Synopsis
  • National guidelines recommend segmental colectomy for appendiceal neuroendocrine neoplasms larger than 2.0 cm due to risks of lymph node involvement, with conditional overall survival becoming an important prognostic tool.
  • A study of 3,541 patients revealed that 16% had positive lymph nodes, and factors like size, depth of invasion, and lymphovascular involvement were correlated with metastasis.
  • The findings suggest that more clinicopathologic factors should influence decisions on surgical treatment and surveillance for better survival rates, especially for patients with more aggressive histologies.
View Article and Find Full Text PDF

Metric dimension of star fan graph.

Sci Rep

January 2025

Faculty of Education and Arts, Australian Catholic University, Sydney, NSW, 2118, Australia.

Every node in a network is said to be resolved if it can be uniquely identified by a vector of distances to a specific set of nodes. The metric dimension is equivalent to the least possible cardinal number of a resolving set. Conditional resolving sets are obtained by imposing various constraints on resolving set.

View Article and Find Full Text PDF

Kawasaki Disease (KD) is a rare febrile illness affecting infants and young children, potentially leading to coronary artery complications and, in severe cases, mortality if untreated. However, KD is frequently misdiagnosed as a common fever in clinical settings, and the inherent data imbalance further complicates accurate prediction when using traditional machine learning and statistical methods. This paper introduces two advanced approaches to address these challenges, enhancing prediction accuracy and generalizability.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!