Design of experiments (DOE) is an established method to allocate resources for efficient parameter space exploration. Model based active learning (AL) data sampling strategies have shown potential for further optimization. This paper introduces a workflow for conducting DOE comparative studies using automated machine learning. Based on a practical definition of model complexity in the context of machine learning, the interplay of systematic data generation and model performance is examined considering various sources of uncertainty: this includes uncertainties caused by stochastic sampling strategies, imprecise data, suboptimal modeling, and model evaluation. Results obtained from electrical circuit models with varying complexity show that not all AL sampling strategies outperform conventional DOE strategies, depending on the available data volume, the complexity of the dataset, and data uncertainties. Trade-offs in resource allocation strategies, in particular between identical replication of data points for statistical noise reduction and broad sampling for maximum parameter space exploration, and their impact on subsequent machine learning analysis are systematically investigated. Results indicate that replication oriented strategies should not be dismissed but may prove advantageous for cases with non-negligible noise impact and intermediate resource availability. The provided workflow can be used to simulate practical experimental conditions for DOE testing and DOE selection.

Download full-text PDF

Source
http://dx.doi.org/10.1038/s41598-024-83581-3DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11688508PMC

Publication Analysis

Top Keywords

sampling strategies
12
machine learning
12
design experiments
8
experiments doe
8
doe selection
8
parameter space
8
space exploration
8
data
7
strategies
7
doe
6

Similar Publications

Background: The potential therapeutic role of magnesium (Mg) in type 2 diabetes mellitus (T2DM) remains insufficiently studied despite its known involvement in critical processes like lipid metabolism and insulin sensitivity. This study examines the impact of Mg-focused nutritional education on lipid profile parameters, total cholesterol (TC), triglycerides (TG), low-density lipoprotein cholesterol (LDL-C), and high-density lipoprotein cholesterol (HDL-C) in T2DM patients.

Methods: Thirty participants with T2DM were recruited for this within-subject experimental study.

View Article and Find Full Text PDF

Need For A Strategic Approach To Knowledge Transfer And Exchange: Late-phase clinical trials and systematic reviews find results that have the potential to improve health outcomes for people. However, there are often delays in these results influencing clinical practice. We developed a knowledge transfer and exchange strategy to support research teams, aiming to identify activities along the research process to maximise and accelerate the research impact.

View Article and Find Full Text PDF

Background: Self-neglect is a significant global public health issue, compromising the health, safety, and well-being of older adults. Despite extensive research on the prevalence and risk factors of self-neglect, the underlying psychosocial mechanisms remain underexplored. Social isolation and aging attitudes have been identified as important correlates of self-neglect; however, the precise interplay between these variables, particularly the mediating role of aging attitudes, has yet to be fully examined in the context of rural older adults.

View Article and Find Full Text PDF

Background: Rupture of extensor pollicis longus tendon (EPL) is a known complication following a distal radius fracture (DRF). Although the precise mechanisms behind these ruptures remain unclear, vascular impairment is thought to play a significant role. Additionally, the impact of an EPL rupture on microstructure of the tendon and muscle is not well understood, but such information could be important in guiding treatment strategies.

View Article and Find Full Text PDF

Genome-wide development of simple sequence repeat (SSR) markers at 2-Mb intervals in lotus (Nelumbo Adans.).

BMC Genomics

January 2025

Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, No. 3888 Chenhua Road, Songjiang District, Shanghai, 201602, China.

Background: Despite the rapid advancement of high-throughput sequencing, simple sequence repeats (SSRs) remain indispensable molecular markers for various applied and research tasks owing to their cost-effectiveness and ease of use. However, existing SSR markers cannot meet the growing demand for research on lotus (Nelumbo Adans.) given their scarcity and weak connections to the lotus genome.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!