Approximation biases of value functions are considered a key problem in reinforcement learning (RL). In particular, existing RL algorithms are hindered by overestimation and underestimation biases, i.e., value mismatching between RL's actual returns and action-value approximations limits the performance of RL algorithms. In this article, we first develop a new synthesis loss function for RL's action-value estimation integrating a regularization term and a modified "clipped double Q-learning" structure for solving overestimation and underestimation biases. To minimize the differences between action-value estimations and actual returns in RL, we develop a new discrepancy function to determine the type and magnitude of approximation biases. Then, two coefficients embedded in the synthesis loss are automatically tuned by minimizing the discrepancy function during training to minimize approximation biases. We further design a new actor-critic (AC) algorithm, named AC with synthesis loss (ACSL), by integrating the synthesis loss function and an error-controlled mechanism. Experimental results on continuous control tasks illustrate that the proposed ACSL algorithm outperforms other cutting-edge RL methods in many tasks and that the proposed synthesis loss function is easily implemented into other algorithms and significantly reduces approximation biases while improving performance. The proposed method can successfully handle many complex continuous control tasks and can greatly outperform other state-of-the-art algorithms on most tasks.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TCYB.2024.3388470DOI Listing

Publication Analysis

Top Keywords

synthesis loss
24
approximation biases
20
loss function
12
overestimation underestimation
8
underestimation biases
8
actual returns
8
discrepancy function
8
continuous control
8
control tasks
8
biases
7

Similar Publications

Copper homeostasis and pregnancy complications: a comprehensive review.

J Assist Reprod Genet

January 2025

Department of Obstetrics, Guangzhou Women and Children's Medical Center, Guangzhou Medical University, Guangzhou, China.

Pregnancy complications pose challenges for both pregnant women and obstetricians globally, with the pathogenesis of many remaining poorly understood. Recently coined as a mode of cell death, cuproptosis has been proposed but remains largely unexplored. This process involves copper overload, resulting in the accumulation of fatty acylated proteins and subsequent loss of iron-sulfur cluster proteins.

View Article and Find Full Text PDF

Purpose: Sulfur mustard gas (SM) exposure to eyes causes multiple corneal injuries including stromal cell loss in vivo. However, mechanisms mediating stromal cell loss/death remains elusive. This study sought to test the novel hypothesis that SM-induced toxicity to human corneal stromal fibroblasts involves ferroptosis mechanism via p38 MAPK signaling.

View Article and Find Full Text PDF

Purpose: Alteration of visual acuity in wet age-related macular degeneration (AMD) is mostly driven by vascular endothelial growth factor A (VEGF-A)-induced edema from leaky newly forming blood vessels below the retina layers. To date, all therapies aimed at alleviation of this process have relied on inhibition of VEGF-A activity. Although effective in preventing vascular leak and edema, this approach also leads to the loss of normal vasculature and multiple related side effects.

View Article and Find Full Text PDF

Tumor-derived extracellular vesicle (tEV)-associated RNAs hold promise as diagnostic biomarkers, but their clinical use is hindered by the rarity of tEVs among nontumor EVs. Here, we present EV-CLIP, a highly sensitive droplet-based digital method for profiling EV RNA. EV-CLIP utilizes the fusion of EVs with charged liposomes (CLIPs) in a microfluidic chip.

View Article and Find Full Text PDF

(PA) is an opportunistic gram-negative pathogen that can infect the cornea, leading to permanent vision loss. Autophagy is a cannibalistic process that drives cytoplasmic components to the lysosome for degradation and/or recycling. Autophagy has been shown to play a key role in the removal of intracellular pathogens and, as such, is an important component of the innate immune response.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!