Reinforcement learning strategies in cancer chemotherapy treatments: A review.

Comput Methods Programs Biomed

Faculty of Information Technology, University of Moratuwa, Katubedda, Moratuwa, Sri Lanka.

Published: February 2023

Background And Objective: Cancer is one of the major causes of death worldwide and chemotherapies are the most significant anti-cancer therapy, in spite of the emerging precision cancer medicines in the last 2 decades. The growing interest in developing the effective chemotherapy regimen with optimal drug dosing schedule to benefit the clinical cancer patients has spawned innovative solutions involving mathematical modeling since the chemotherapy regimens are administered cyclically until the futility or the occurrence of intolerable adverse events. Thus, in this present work, we reviewed the emerging trends involved in forming a computational solution from the aspect of reinforcement learning.

Methods: Initially, this survey in-depth focused on the details of the dynamic treatment regimens from a broad perspective and then narrowed down to inspirations from reinforcement learning that were advantageous to chemotherapy dosing, including both offline reinforcement learning and supervised reinforcement learning.

Results: The insights established in the chemotherapy-planning problem associated with the Reinforcement Learning (RL) has been discussed in this study. It showed that the researchers were able to widen their perspectives in comprehending the theoretical basis, dynamic treatment regimens (DTR), use of the adaptive control on DTR, and the associated RL techniques.

Conclusions: This study reviewed the recent researches relevant to the topic, and highlighted the challenges, open questions, possible solutions, and future steps in inventing a realistic solution for the aforementioned problem.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.cmpb.2022.107280DOI Listing

Publication Analysis

Top Keywords

reinforcement learning
16
dynamic treatment
8
treatment regimens
8
reinforcement
6
learning strategies
4
cancer
4
strategies cancer
4
chemotherapy
4
cancer chemotherapy
4
chemotherapy treatments
4

Similar Publications

Transient changes in the firing of midbrain dopamine neurons have been closely tied to the unidimensional value-based prediction error contained in temporal difference reinforcement learning models. However, whereas an abundance of work has now shown how well dopamine responses conform to the predictions of this hypothesis, far fewer studies have challenged its implicit assumption that dopamine is not involved in learning value-neutral features of reward. Here, we review studies in rats and humans that put this assumption to the test, and which suggest that dopamine transients provide a much richer signal that incorporates information that goes beyond integrated value.

View Article and Find Full Text PDF

Rising interest in artificial intelligence in education reinforces the demand for evidence-based implementation. This study investigates how tutor agents' physical embodiment and anthropomorphism (student-reported sociability, animacy, agency, and disturbance) relate to affective (on-task enjoyment) and cognitive (task performance) learning within an intelligent tutoring system (ITS). Data from 56 students (M = 17.

View Article and Find Full Text PDF

In illicit drug markets, the most recently expanding new synthetic opioid subclass is benzimidazoles, also known as nitazenes, which were originally developed as analgesics in the 1950s. The emergence of this classical, potent drug family has attracted extensive research interest in the field of forensic toxicology; however, information on their psychological and physical dependence is very limited. Herein, we evaluated the rewarding effects of four nitazene analogs using a battery of in vivo experiments, with a positive control drug (isotonitazene).

View Article and Find Full Text PDF

VCSAP: Online reinforcement learning exploration method based on visitation count of state-action pairs.

Neural Netw

January 2025

Key Laboratory of Symbolic Computation and Knowledge Engineering (Jilin University), Changchun 130012, China; College of Computer Science and Technology, Jilin University, Changchun 130012, China; College of Software, Jilin University, Changchun 130012, China. Electronic address:

In the domain of online reinforcement learning, strategies that leverage inherent rewards for exploration tend to achieve commendable outcomes within contexts characterized by deceptive or sparse rewards. Counting through the visitation of states is an efficient count-based exploration method to get the proper intrinsic reward. However, only the novelty of the states encountered by the agent is considered in this exploration method, resulting in the over-exploration of a certain state-action pair and falling into a locally optimal solution.

View Article and Find Full Text PDF

Evaluation of a Global Initiative for Asthma Education and Implementation Program to Improve Asthma Care Quality (CARE4ALL): Protocol for a Multicenter, Single-Arm Study.

JMIR Res Protoc

January 2025

National Center for Respiratory Medicine; State Key Laboratory of Respiratory Health and Multimorbidity; National Clinical Research Center for Respiratory Diseases; Institute of Respiratory Medicine, Chinese Academy of Medical Sciences; Department of Pulmonary and Critical Care Medicine, Center of Respiratory Medicine, China-Japan Friendship Hospital, Beijing, China.

Background: Poor symptom control and exacerbations of asthma diminish quality of life and pose a significant burden to patients and society. Implementing evidence-based management as recommended by the Global Initiative for Asthma (GINA), especially introducing inhaled corticosteroid-containing treatments, has the potential to vastly reduce exacerbations and the high burden of asthma in China. However, domestic implementation of the GINA recommendations has been unsatisfactory, especially in lower-level hospitals; thus, an enhancement to the awareness of and adherence to the GINA recommendations among Chinese physicians is needed to improve patient outcomes.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!