Background And Objective: Cancer is one of the major causes of death worldwide and chemotherapies are the most significant anti-cancer therapy, in spite of the emerging precision cancer medicines in the last 2 decades. The growing interest in developing the effective chemotherapy regimen with optimal drug dosing schedule to benefit the clinical cancer patients has spawned innovative solutions involving mathematical modeling since the chemotherapy regimens are administered cyclically until the futility or the occurrence of intolerable adverse events. Thus, in this present work, we reviewed the emerging trends involved in forming a computational solution from the aspect of reinforcement learning.
Methods: Initially, this survey in-depth focused on the details of the dynamic treatment regimens from a broad perspective and then narrowed down to inspirations from reinforcement learning that were advantageous to chemotherapy dosing, including both offline reinforcement learning and supervised reinforcement learning.
Results: The insights established in the chemotherapy-planning problem associated with the Reinforcement Learning (RL) has been discussed in this study. It showed that the researchers were able to widen their perspectives in comprehending the theoretical basis, dynamic treatment regimens (DTR), use of the adaptive control on DTR, and the associated RL techniques.
Conclusions: This study reviewed the recent researches relevant to the topic, and highlighted the challenges, open questions, possible solutions, and future steps in inventing a realistic solution for the aforementioned problem.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1016/j.cmpb.2022.107280 | DOI Listing |
Nat Rev Neurosci
January 2025
Intramural Research Program, National Institute on Drug Abuse, Baltimore, MD, USA.
Transient changes in the firing of midbrain dopamine neurons have been closely tied to the unidimensional value-based prediction error contained in temporal difference reinforcement learning models. However, whereas an abundance of work has now shown how well dopamine responses conform to the predictions of this hypothesis, far fewer studies have challenged its implicit assumption that dopamine is not involved in learning value-neutral features of reward. Here, we review studies in rats and humans that put this assumption to the test, and which suggest that dopamine transients provide a much richer signal that incorporates information that goes beyond integrated value.
View Article and Find Full Text PDFNPJ Sci Learn
January 2025
Department of Educational Sciences, University of Potsdam, Karl-Liebknecht-Straße 24/25, 14476, Potsdam, Germany.
Rising interest in artificial intelligence in education reinforces the demand for evidence-based implementation. This study investigates how tutor agents' physical embodiment and anthropomorphism (student-reported sociability, animacy, agency, and disturbance) relate to affective (on-task enjoyment) and cognitive (task performance) learning within an intelligent tutoring system (ITS). Data from 56 students (M = 17.
View Article and Find Full Text PDFJ Toxicol Sci
January 2025
Department of Pharmaceutical and Environmental Sciences, Tokyo Metropolitan Institute of Public Health.
In illicit drug markets, the most recently expanding new synthetic opioid subclass is benzimidazoles, also known as nitazenes, which were originally developed as analgesics in the 1950s. The emergence of this classical, potent drug family has attracted extensive research interest in the field of forensic toxicology; however, information on their psychological and physical dependence is very limited. Herein, we evaluated the rewarding effects of four nitazene analogs using a battery of in vivo experiments, with a positive control drug (isotonitazene).
View Article and Find Full Text PDFNeural Netw
January 2025
Key Laboratory of Symbolic Computation and Knowledge Engineering (Jilin University), Changchun 130012, China; College of Computer Science and Technology, Jilin University, Changchun 130012, China; College of Software, Jilin University, Changchun 130012, China. Electronic address:
In the domain of online reinforcement learning, strategies that leverage inherent rewards for exploration tend to achieve commendable outcomes within contexts characterized by deceptive or sparse rewards. Counting through the visitation of states is an efficient count-based exploration method to get the proper intrinsic reward. However, only the novelty of the states encountered by the agent is considered in this exploration method, resulting in the over-exploration of a certain state-action pair and falling into a locally optimal solution.
View Article and Find Full Text PDFJMIR Res Protoc
January 2025
National Center for Respiratory Medicine; State Key Laboratory of Respiratory Health and Multimorbidity; National Clinical Research Center for Respiratory Diseases; Institute of Respiratory Medicine, Chinese Academy of Medical Sciences; Department of Pulmonary and Critical Care Medicine, Center of Respiratory Medicine, China-Japan Friendship Hospital, Beijing, China.
Background: Poor symptom control and exacerbations of asthma diminish quality of life and pose a significant burden to patients and society. Implementing evidence-based management as recommended by the Global Initiative for Asthma (GINA), especially introducing inhaled corticosteroid-containing treatments, has the potential to vastly reduce exacerbations and the high burden of asthma in China. However, domestic implementation of the GINA recommendations has been unsatisfactory, especially in lower-level hospitals; thus, an enhancement to the awareness of and adherence to the GINA recommendations among Chinese physicians is needed to improve patient outcomes.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!