To improve the convergence rate and the sample efficiency, two efficient learning methods AC-HMLP and RAC-HMLP (AC-HMLP with -regularization) are proposed by combining actor-critic algorithm with hierarchical model learning and planning. The hierarchical models consisting of the local and the global models, which are learned at the same time during learning of the value function and the policy, are approximated by local linear regression (LLR) and linear function approximation (LFA), respectively. Both the local model and the global model are applied to generate samples for planning; the former is used only if the state-prediction error does not surpass the threshold at each time step, while the latter is utilized at the end of each episode. The purpose of taking both models is to improve the sample efficiency and accelerate the convergence rate of the whole algorithm through fully utilizing the local and global information. Experimentally, AC-HMLP and RAC-HMLP are compared with three representative algorithms on two Reinforcement Learning (RL) benchmark problems. The results demonstrate that they perform best in terms of convergence rate and sample efficiency.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5066029 | PMC |
http://dx.doi.org/10.1155/2016/4824072 | DOI Listing |
Tuberc Respir Dis (Seoul)
January 2025
Department of Internal Medicine, Pusan National University Yangsan Hospital, Pusan National University School of Medicine, Yangsan, Republic of Korea.
Background: Malnutrition exacerbates the prognosis of various diseases; however, its specific impact on severe coronavirus disease (COVID-19) outcomes remains underexplored.
Methods: This multicenter study in Korea assessed the nutritional status of 1,088 adults with severe COVID-19 using the Geriatric Nutritional Risk Index (GNRI) based on serum albumin levels and body weight. The patients were divided into the GNRI>98 (no-risk) and GNRI≤98 (risk) groups.
J Occup Environ Hyg
January 2025
Finance Department, University of Texas at Austin, Austin, Texas.
This paper asserts that the Nobel Prize for Medicine/Physiology that Hermann J. Muller received in 1946 was a front to enhance the legitimacy, acceptance, and application of eugenics, a strategy to guide the direction and rate of human evolutionary change. Seven of the nine people nominating (1932-1946) Muller were proponents of eugenics with Muller being among the most visible of the scientific leaders.
View Article and Find Full Text PDFCancer
January 2025
Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, Maryland, USA.
Background: Testicular germ cell tumors (TGCTs) are the most common cancers among young men in the United States. Incidence rates among non-Hispanic White (NHW) men historically have been much higher than the rates among other men. To study whether this pattern had changed, the authors examined trends in TGCT incidence for the years 1992-2021.
View Article and Find Full Text PDFTherap Adv Gastroenterol
January 2025
F. Widjaja Inflammatory Bowel Disease Institute, Cedars-Sinai Medical Center, 8700 Beverly Boulevard, West Hollywood 90048, CA, USAKarsh Division of Gastroenterology and Hepatology, Cedars-Sinai Medical Center, Los Angeles, CA, USA.
Background: Despite its significant health burden, there is a lack of national-level temporal patterns in gastrointestinal bleeding (GIB) mortality.
Objectives: To comprehensively decipher the annual and monthly trend of GIB-related mortality in the United States.
Design: Cross-sectional study.
RSC Adv
January 2025
Department of Chemical & Biological Engineering, Hanbat National University Daejeon 34158 Republic of Korea +82 42 8211530.
This study investigated the impact of aspect ratio and crystal size distribution on the mother liquor content and drying rate of l-glutamic acid (LGA). LGA cooling crystallization was performed using two methods: spontaneous nucleation and seeding. First, to identify various crystalline forms of LGA and obtain α-form seeds, cooling crystallization was carried out through spontaneous nucleation and seeding.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!