Deep reinforcement learning (DRL) recently has attained remarkable results in various domains, including games, robotics, and recommender system. Nevertheless, an urgent problem in the practical application of DRL is fast adaptation. To this end, this article proposes a new and versatile metalearning approach called fast task adaptation via metalearning (FTAML), which leverages the strengths of the model-based methods and gradient-based metalearning methods for training the initial parameters of the model, such that the model is able to efficiently master unseen tasks with a little amount of data from the tasks. The proposed algorithm makes it possible to separate task optimization and task identification, specifically, the model-based learner helps to identify the pattern of a task, while the gradient-based metalearner is capable of consistently improving the performance with only a few gradient update steps through making use of the task embedding produced by the model-based learner. In addition, the choice of network for the model-based learner in the proposed method is also discussed, and the performance of networks with different depths is explored. Finally, the simulation results on reinforcement learning problems demonstrate that the proposed approach outperforms compared metalearning algorithms and delivers a new state-of-the-art performance on a variety of challenging control tasks.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TCYB.2020.3028378DOI Listing

Publication Analysis

Top Keywords

model-based learner
12
fast task
8
task adaptation
8
reinforcement learning
8
model-based
5
task
5
adaptation based
4
based combination
4
combination model-based
4
model-based gradient-based
4

Similar Publications

Infectious diseases are a global public health problem that poses a threat to human society. Since the 1970s, constantly mutated new infectious viruses have been quietly attacking humanity, and at least one new type of infectious disease is discovered every year. Therefore, early warning of infectious diseases will greatly reduce the socio-economic harm of infectious diseases.

View Article and Find Full Text PDF

Trade-offs, control conditions, and alternative designs in the experimental study of cultural evolution.

Proc Natl Acad Sci U S A

November 2024

Centre for Ecology and Conservation, University of Exeter, Cornwall TR10 9FE, United Kingdom.

Although the theoretical foundations of the modern field of cultural evolution have been in place for over 50 y, laboratory experiments specifically designed to test cultural evolutionary theory have only existed for the last two decades. Here, we review the main experimental designs used in the field of cultural evolution, as well as major findings related to the generation of cultural variation, content- and model-based biases, cumulative cultural evolution, and nonhuman culture. We then identify methodological advances that demonstrate the iterative improvement of cultural evolution experimental methods.

View Article and Find Full Text PDF

StackIL10: A stacking ensemble model for the improved prediction of IL-10 inducing peptides.

PLoS One

November 2024

Department of Information and Communication Engineering, Yeungnam University, Gyeongsangbuk-do, Gyeongsan-si, South Korea.

Interleukin-10, a highly effective cytokine recognized for its anti-inflammatory properties, plays a critical role in the immune system. In addition to its well-documented capacity to mitigate inflammation, IL-10 can unexpectedly demonstrate pro-inflammatory characteristics under specific circumstances. The presence of both aspects emphasizes the vital need to identify the IL-10-induced peptide.

View Article and Find Full Text PDF

Humans adapt their locomotion seamlessly in response to changes in the body or the environment. It is unclear how such adaptation improves performance measures like energy consumption or symmetry while avoiding falling. Here, we model locomotor adaptation as interactions between a stabilizing controller that reacts quickly to perturbations and a reinforcement learner that gradually improves the controller's performance through local exploration and memory.

View Article and Find Full Text PDF

Using the super-learner to predict the chemical acute toxicity on rats.

J Hazard Mater

December 2024

School of Materials Science and Engineering, Beihang University, Beijing 100191, China; School of Public Health, Southeast University, Nanjing, China. Electronic address:

With the rapid increase in the number of commercial chemicals, testing methods regarding on median lethal dose (LD) relying animal experiments face challenges such as high costs and ethical concerns. Classical quantitative structure-activity relationship models relying on single algorithm always lack interpretability and precision, given the complexity of the mechanisms underlying acute toxicity. To address these issues, this study has developed a predictive framework using an ensemble learning model based on Super-learner.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!