Artificial intelligence and machine learning techniques have proved fertile methods for attacking difficult problems in medicine and public health. These techniques have garnered strong interest for the analysis of the large, multi-domain open science datasets that are increasingly available in health research. Discovery science in large datasets is challenging because the learning environment is unconstrained: there may be a large number of potential predictors, and appropriate ranges for model hyperparameters are unknown. Explainability is also likely to be at a premium, since it supports future hypothesis generation and analysis. Here, we present a novel method that addresses these challenges by exploiting evolutionary algorithms to optimize machine learning discovery science while exploring a large solution space and minimizing bias. We demonstrate that our approach, called IEL, provides an automated, adaptive method for jointly learning features and hyperparameters while furnishing explainable models in which the original features used to make predictions may be obtained, even with artificial neural networks. In IEL, the machine learning algorithm of choice is nested inside an evolutionary algorithm, which selects features and hyperparameters over generations on the basis of an information function to converge on an optimal solution. We apply IEL to three gold standard machine learning algorithms in challenging, heterogeneous biobehavioral data: deep learning with artificial neural networks, decision tree-based techniques and baseline linear models. Using our novel IEL approach, artificial neural networks achieved ≥ 95% accuracy, sensitivity and specificity and 45-73% in classification and substantial gains over default settings.
IEL may be applied to a wide range of less-constrained or unconstrained discovery science problems where the practitioner wishes to jointly learn features and hyperparameters in an adaptive, principled manner within the same algorithmic process. This approach offers significant flexibility, enlarges the solution space and mitigates bias that may arise from manual or semi-manual hyperparameter tuning and feature selection. It also presents the opportunity to select the inner machine learning algorithm based on the results of optimized learning for the problem at hand.
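
The nesting described in the abstract, an inner learner wrapped by an evolutionary loop that jointly selects features and hyperparameters, can be illustrated with a toy sketch. This is not the authors' implementation. Everything in it is an assumption made for illustration: the synthetic data, the k-nearest-neighbours inner learner, the fitness (validation accuracy minus a small per-feature penalty, standing in for the information function) and all function names.

```python
import random

def make_data(rng, n=120, d=8):
    """Synthetic binary task (assumption): only features 0 and 1 carry signal."""
    X = [[rng.gauss(0, 1) for _ in range(d)] for _ in range(n)]
    y = [1 if row[0] + row[1] > 0 else 0 for row in X]
    return X, y

def knn_accuracy(train, valid, mask, k):
    """Inner learner: k-nearest neighbours restricted to the masked features."""
    cols = [j for j, keep in enumerate(mask) if keep]
    if not cols:
        return 0.0  # an empty feature set cannot predict anything
    (Xt, yt), (Xv, yv) = train, valid
    correct = 0
    for xv, label in zip(Xv, yv):
        dists = sorted((sum((xv[j] - xt[j]) ** 2 for j in cols), yi)
                       for xt, yi in zip(Xt, yt))
        votes = sum(yi for _, yi in dists[:k])  # count of class-1 neighbours
        correct += int((votes * 2 > k) == bool(label))
    return correct / len(yv)

def fitness(ind, train, valid, penalty=0.01):
    """Stand-in for an information function: accuracy minus a feature-count penalty."""
    mask, k = ind
    return knn_accuracy(train, valid, mask, k) - penalty * sum(mask)

def mutate(ind, rng, k_max=15):
    """Flip one feature bit and nudge k by -2, 0 or +2, clamped to [1, k_max]."""
    mask, k = list(ind[0]), ind[1]
    j = rng.randrange(len(mask))
    mask[j] = 1 - mask[j]
    k = min(k_max, max(1, k + rng.choice([-2, 0, 2])))
    return (tuple(mask), k)

def crossover(a, b, rng):
    """Uniform crossover on the feature mask; k is inherited from either parent."""
    mask = tuple(rng.choice(pair) for pair in zip(a[0], b[0]))
    return (mask, rng.choice([a[1], b[1]]))

def evolve(train, valid, rng, d, pop_size=12, generations=10):
    """Outer loop: each individual is a (feature mask, k) pair, learned jointly."""
    pop = [(tuple(rng.randint(0, 1) for _ in range(d)), rng.choice(range(1, 16, 2)))
           for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        scored = sorted(pop, key=lambda ind: fitness(ind, train, valid), reverse=True)
        history.append(fitness(scored[0], train, valid))
        parents = scored[: pop_size // 2]  # the fitter half survives unchanged
        children = [mutate(crossover(rng.choice(parents), rng.choice(parents), rng), rng)
                    for _ in range(pop_size - len(parents))]
        pop = parents + children
    best = max(pop, key=lambda ind: fitness(ind, train, valid))
    return best, history

rng = random.Random(0)
X, y = make_data(rng)
train, valid = (X[:80], y[:80]), (X[80:], y[80:])
best, history = evolve(train, valid, rng, d=len(X[0]))
selected = [j for j, keep in enumerate(best[0]) if keep]
print("selected features:", selected, "k:", best[1])
```

Because each individual carries an explicit feature mask, the features behind the final model can be read off directly, which is the sense of explainability this sketch aims at. Swapping the inner learner only requires replacing `knn_accuracy`; the outer loop is unchanged.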

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9038845
DOI: http://dx.doi.org/10.3389/frai.2022.832530
