AI Article Synopsis

  • This study examines how artificial agents perform in a noisy T-maze using Markov decision processes and specific learning algorithms (SF and PF).
  • It assesses the impact of hyperparameters like reward learning rate (αr) and eligibility trace decay rate (λ) on the agents' adaptability, revealing that an αr of 0.9 consistently leads to better outcomes.
  • The research highlights the importance of optimizing these hyperparameters for enhancing the effectiveness of learning algorithms, particularly in transfer learning contexts, and offers valuable insights for future work in AI and neuroscience.

Article Abstract

In this study, we investigate the adaptability of artificial agents within a noisy T-maze that use Markov decision processes (MDPs) and successor feature (SF) and predecessor feature (PF) learning algorithms. Our focus is on quantifying how varying the hyperparameters, specifically the reward learning rate (αr) and the eligibility trace decay rate (λ), can enhance their adaptability. Adaptation is evaluated by analyzing the hyperparameters of cumulative reward, step length, adaptation rate, and adaptation step length and the relationships between them using Spearman's correlation tests and linear regression. Our findings reveal that an αr of 0.9 consistently yields superior adaptation across all metrics at a noise level of 0.05. However, the optimal setting for λ varies by metric and context. In discussing these results, we emphasize the critical role of hyperparameter optimization in refining the performance and transfer learning efficacy of learning algorithms. This research advances our understanding of the functionality of PF and SF algorithms, particularly in navigating the inherent uncertainty of transfer learning tasks. By offering insights into the optimal hyperparameter configurations, this study contributes to the development of more adaptive and robust learning algorithms, paving the way for future explorations in artificial intelligence and neuroscience.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11479366PMC
http://dx.doi.org/10.3390/s24196419DOI Listing

Publication Analysis

Top Keywords

transfer learning
12
learning algorithms
12
learning
8
step length
8
investigating transfer
4
learning noisy
4
noisy environments
4
environments study
4
study predecessor
4
predecessor successor
4

Similar Publications

Deep learning-based design and experimental validation of a medicine-like human antibody library.

Brief Bioinform

November 2024

Biotherapeutics Molecule Discovery, Boehringer Ingelheim Pharmaceutical Inc., 900 Ridgebury Road, Ridgefield, CT 06877, United States.

Antibody generation requires the use of one or more time-consuming methods, namely animal immunization, and in vitro display technologies. However, the recent availability of large amounts of antibody sequence and structural data in the public domain along with the advent of generative deep learning algorithms raises the possibility of computationally generating novel antibody sequences with desirable developability attributes. Here, we describe a deep learning model for computationally generating libraries of highly human antibody variable regions whose intrinsic physicochemical properties resemble those of the variable regions of the marketed antibody-based biotherapeutics (medicine-likeness).

View Article and Find Full Text PDF

Accurate survival prediction of patients with long-bone metastases is challenging, but important for optimizing treatment. The Skeletal Oncology Research Group (SORG) machine learning algorithm (MLA) has been previously developed and internally validated to predict 90-day and 1-year survival. External validation showed promise in the United States and Taiwan.

View Article and Find Full Text PDF

Background: Learning health systems (LHS) have the potential to use health data in real time through rapid and continuous cycles of data interrogation, implementing insights to practice, feedback, and practice change. However, there is a lack of an appropriately skilled interprofessional informatics workforce that can leverage knowledge to design innovative solutions. Therefore, there is a need to develop tailored professional development training in digital health, to foster skilled interprofessional learning communities in the health care workforce in Australia.

View Article and Find Full Text PDF

Detecting anomalies in smart wearables for hypertension: a deep learning mechanism.

Front Public Health

January 2025

Department of Computer Science, College of Engineering and Computer Science, Jazan University, Jazan, Saudi Arabia.

Introduction: The growing demand for real-time, affordable, and accessible healthcare has underscored the need for advanced technologies that can provide timely health monitoring. One such area is predicting arterial blood pressure (BP) using non-invasive methods, which is crucial for managing cardiovascular diseases. This research aims to address the limitations of current healthcare systems, particularly in remote areas, by leveraging deep learning techniques in Smart Health Monitoring (SHM).

View Article and Find Full Text PDF

Background: Large language models (LLMs) have demonstrated impressive performance on medical licensing and diagnosis-related exams. However, comparative evaluations to optimize LLM performance and ability in the domain of comprehensive medication management (CMM) are lacking. The purpose of this evaluation was to test various LLMs performance optimization strategies and performance on critical care pharmacotherapy questions used in the assessment of Doctor of Pharmacy students.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!