Efficient Actor-Critic Algorithm with Hierarchical Model Learning and Planning.

Comput Intell Neurosci

College of Electronic & Information Engineering, Suzhou University of Science and Technology, Jiangsu, Suzhou 215000, China.

Published: February 2017

To improve the convergence rate and the sample efficiency, two efficient learning methods AC-HMLP and RAC-HMLP (AC-HMLP with -regularization) are proposed by combining actor-critic algorithm with hierarchical model learning and planning. The hierarchical models consisting of the local and the global models, which are learned at the same time during learning of the value function and the policy, are approximated by local linear regression (LLR) and linear function approximation (LFA), respectively. Both the local model and the global model are applied to generate samples for planning; the former is used only if the state-prediction error does not surpass the threshold at each time step, while the latter is utilized at the end of each episode. The purpose of taking both models is to improve the sample efficiency and accelerate the convergence rate of the whole algorithm through fully utilizing the local and global information. Experimentally, AC-HMLP and RAC-HMLP are compared with three representative algorithms on two Reinforcement Learning (RL) benchmark problems. The results demonstrate that they perform best in terms of convergence rate and sample efficiency.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5066029PMC
http://dx.doi.org/10.1155/2016/4824072DOI Listing

Publication Analysis

Top Keywords

convergence rate
12
sample efficiency
12
actor-critic algorithm
8
algorithm hierarchical
8
hierarchical model
8
model learning
8
learning planning
8
rate sample
8
ac-hmlp rac-hmlp
8
local global
8

Similar Publications

Relationship between the Geriatric Nutrition Risk Index and the prognosis of severe coronavirus disease 2019 in Korea.

Tuberc Respir Dis (Seoul)

January 2025

Department of Internal Medicine, Pusan National University Yangsan Hospital, Pusan National University School of Medicine, Yangsan, Republic of Korea.

Background: Malnutrition exacerbates the prognosis of various diseases; however, its specific impact on severe coronavirus disease (COVID-19) outcomes remains underexplored.

Methods: This multicenter study in Korea assessed the nutritional status of 1,088 adults with severe COVID-19 using the Geriatric Nutritional Risk Index (GNRI) based on serum albumin levels and body weight. The patients were divided into the GNRI>98 (no-risk) and GNRI≤98 (risk) groups.

View Article and Find Full Text PDF

This paper asserts that the Nobel Prize for Medicine/Physiology that Hermann J. Muller received in 1946 was a front to enhance the legitimacy, acceptance, and application of eugenics, a strategy to guide the direction and rate of human evolutionary change. Seven of the nine people nominating (1932-1946) Muller were proponents of eugenics with Muller being among the most visible of the scientific leaders.

View Article and Find Full Text PDF

Background: Testicular germ cell tumors (TGCTs) are the most common cancers among young men in the United States. Incidence rates among non-Hispanic White (NHW) men historically have been much higher than the rates among other men. To study whether this pattern had changed, the authors examined trends in TGCT incidence for the years 1992-2021.

View Article and Find Full Text PDF

Increased gastrointestinal bleeding-related mortality during the COVID-19 pandemic.

Therap Adv Gastroenterol

January 2025

F. Widjaja Inflammatory Bowel Disease Institute, Cedars-Sinai Medical Center, 8700 Beverly Boulevard, West Hollywood 90048, CA, USAKarsh Division of Gastroenterology and Hepatology, Cedars-Sinai Medical Center, Los Angeles, CA, USA.

Background: Despite its significant health burden, there is a lack of national-level temporal patterns in gastrointestinal bleeding (GIB) mortality.

Objectives: To comprehensively decipher the annual and monthly trend of GIB-related mortality in the United States.

Design: Cross-sectional study.

View Article and Find Full Text PDF

This study investigated the impact of aspect ratio and crystal size distribution on the mother liquor content and drying rate of l-glutamic acid (LGA). LGA cooling crystallization was performed using two methods: spontaneous nucleation and seeding. First, to identify various crystalline forms of LGA and obtain α-form seeds, cooling crystallization was carried out through spontaneous nucleation and seeding.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!