Integrated Evolutionary Learning: An Artificial Intelligence Approach to Joint Learning of Features and Hyperparameters for Optimized, Explainable Machine Learning.

Nina de Lacy Michael J Ramshaw J Nathan Kutz

Front Artif Intell

Department of Applied Mathematics, AI Institute in Dynamic Systems, University of Washington, Seattle, WA, United States.

Published: April 2022

Artificial intelligence and machine learning techniques have proved fertile methods for attacking difficult problems in medicine and public health. These techniques have garnered strong interest for the analysis of the large, multi-domain open science datasets that are increasingly available in health research. Discovery science in large datasets is challenging given the unconstrained nature of the learning environment where there may be a large number of potential predictors and appropriate ranges for model hyperparameters are unknown. As well, it is likely that explainability is at a premium in order to engage in future hypothesis generation or analysis. Here, we present a novel method that addresses these challenges by exploiting evolutionary algorithms to optimize machine learning discovery science while exploring a large solution space and minimizing bias. We demonstrate that our approach, called (IEL), provides an automated, adaptive method for jointly learning features and hyperparameters while furnishing explainable models where the original features used to make predictions may be obtained even with artificial neural networks. In IEL the machine learning algorithm of choice is nested inside an evolutionary algorithm which selects features hyperparameters over generations on the basis of an information function to converge on an optimal solution. We apply IEL to three gold standard machine learning algorithms in challenging, heterogenous biobehavioral data: deep learning with artificial neural networks, decision tree-based techniques and baseline linear models. Using our novel IEL approach, artificial neural networks achieved ≥ 95% accuracy, sensitivity and specificity and 45-73% in classification and substantial gains over default settings. IEL may be applied to a wide range of less- or unconstrained discovery science problems where the practitioner wishes to jointly learn features and hyperparameters in an adaptive, principled manner within the same algorithmic process. This approach offers significant flexibility, enlarges the solution space and mitigates bias that may arise from manual or semi-manual hyperparameter tuning and feature selection and presents the opportunity to select the inner machine learning algorithm based on the results of optimized learning for the problem at hand.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9038845	PMC
http://dx.doi.org/10.3389/frai.2022.832530	DOI Listing

Publication Analysis

Top Keywords

machine learning

features hyperparameters

learning

learning artificial

discovery science

artificial neural

neural networks

artificial intelligence

learning features

solution space

Similar Publications

Network-Based Identification of Key Toxic Compounds in Airborne Chemical Exposome.

Environ Sci Technol

January 2025

State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, China.

Weican Zhang Shenxi Deng Xi-En Zhang Cha Huang Qian Liu

Air pollution is a leading contributor to the global disease burden. However, the complex nature of the chemicals to which humans are exposed through inhalation has obscured the identification of the key compounds responsible for diseases. Here, we develop a network topology-based framework to identify key toxic compounds in the airborne chemical exposome.

View Article and Find Full Text PDF

Similar Publications

Blockchain-enabled digital twin system for brain stroke prediction.

Brain Inform

January 2025

Department of Computing, Glasgow Caledonian University, Glasgow, G4 0BA, Scotland.

Venkatesh Upadrista Sajid Nazir Huaglory Tianfield

A digital twin is a virtual model of a real-world system that updates in real-time. In healthcare, digital twins are gaining popularity for monitoring activities like diet, physical activity, and sleep. However, their application in predicting serious conditions such as heart attacks, brain strokes and cancers remains under investigation, with current research showing limited accuracy in such predictions.

View Article and Find Full Text PDF

Similar Publications

Clinical utility of tumor-infiltrating lymphocyte evaluation by two different methods in breast cancer patients treated with neoadjuvant chemotherapy.

Breast Cancer

January 2025

Division of Breast and Endocrine Surgery, Department of Surgery, School of Medicine, Hyogo Medical University, 1-1 Mukogawa-cho, Nishinomiya, Hyogo, 663-8501, Japan.

Masayuki Nagahashi Eri Ishikawa Takahiro Nagai Haruka Kanaoka Aoi Oshiro

Purpose: The aim of this study was to examine the clinical utility of tumor-infiltrating lymphocytes (TILs) evaluated by "average" and "hot-spot" methods in breast cancer patients.

Methods: We examined 367 breast cancer patients without neoadjuvant chemotherapy (NAC) by average and hot-spot methods to determine the consistency of TIL scores between biopsy and surgical specimens. TIL scores before NAC were also compared with the pathological complete response (pCR) rate and clinical outcomes in 144 breast cancer patients that received NAC.

View Article and Find Full Text PDF

Similar Publications

Lateral End-Range Movement Profile and Shot Effectiveness During Grand Slam Tennis Match-Play.

Eur J Sport Sci

February 2025

School of Human Sciences (Exercise and Sport Science), The University of Western Australia, Perth, Australia.

Cameron Armstrong Peter Peeling Alistair Murphy Berwin A Turlach Machar Reid

End-range movements are among the most demanding but least understood in the sport of tennis. Using male Hawk-Eye data from match-play during the 2021-2023 Australian Open tournaments, we evaluated the speed, deceleration, acceleration, and shot quality characteristics of these types of movement in men's Grand Slam tennis. Lateral end-range movements that incorporated a change of direction (CoD) were identified for analysis using k-means (end-range) and random forest (CoD) machine learning models.

View Article and Find Full Text PDF

Similar Publications

Predictive Optimization of Patient No-Show Management in Primary Healthcare Using Machine Learning.

J Med Syst

January 2025

Department of Computing, University of North Florida, 1 UNF Dr., Jacksonville, 32246, FL, USA.

Andrés Leiva-Araos Cristián Contreras Hemani Kaushal Zornitza Prodanoff

The "no-show" problem in healthcare refers to the prevalent phenomenon where patients schedule appointments with healthcare providers but fail to attend them without prior cancellation or rescheduling. In addressing this issue, our study delves into a multivariate analysis over a five-year period involving 21,969 patients. Our study introduces a predictive model framework that offers a holistic approach to managing the no-show problem in healthcare, incorporating elements into the objective function that address not only the accurate prediction of no-shows but also the management of service capacity, overbooking, and idle resource allocation resulting from mispredictions.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!