Many aspects of hedonic behavior, including self-administration of natural and drug rewards, as well as human positive affect, follow a diurnal cycle that peaks during the species-specific active period. This variation has been linked to circadian modulation of the mesolimbic dopamine system, and is hypothesized to serve an adaptive function by driving an organism to engage with the environment during times where the opportunity for obtaining rewards is high. However, relatively little is known about whether more complex facets of hedonic behavior - in particular, reward learning - follow the same diurnal cycle. The current study aimed to address this gap by examining evidence for diurnal variation in reward learning on a well-validated probabilistic reward learning task (PRT). PRT data from a large normative sample (N = 516) of non-clinical individuals, recruited across eight studies, were examined for the current study. The PRT uses an asymmetrical reinforcement ratio to induce a behavioral response bias, and reward learning was operationalized as the strength of this response bias across blocks of the task. Results revealed significant diurnal variation in reward learning, however in contrast to patterns previously observed in other aspects of hedonic behavior, reward learning was lowest in the middle of the day. Although a diurnal pattern was also observed on a measure of more general task performance (discriminability), this did not account for the variation observed in reward learning. Taken together, these findings point to a distinct diurnal pattern in reward learning that differs from that observed in other aspects of hedonic behavior. The results of this study have important implications for our understanding of clinical disorders characterized by both circadian and reward learning disturbances, and future research is needed to confirm whether this diurnal variation has a truly circadian origin.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6174688 | PMC |
http://dx.doi.org/10.1080/07420528.2018.1459662 | DOI Listing |
Cochrane Database Syst Rev
January 2025
Nuffield Department of Primary Care Health Sciences, University of Oxford, Oxford, UK.
Background: Financial incentives (money, vouchers, or self-deposits) can be used to positively reinforce smoking cessation. They may be used as one-off rewards, or in various schedules to reward steps towards sustained smoking abstinence (known as contingency management). They have been used in workplaces, clinics, hospitals, and community settings, and to target particular populations.
View Article and Find Full Text PDFMolecules
December 2024
Department of Medicinal Chemistry, School of Pharmacy, Fudan University, 826 Zhangheng Road, Shanghai 201203, China.
As an appealing approach for discovering novel leads, the key advantage of de novo drug design lies in its ability to explore a much broader dimension of chemical space, without being confined to the knowledge of existing compounds. So far, many generative models have been described in the literature, which have completely redefined the concept of de novo drug design. However, many of them lack practical value for real-world drug discovery.
View Article and Find Full Text PDFCommun Biol
January 2025
FrontLab, Institut du Cerveau - Paris Brain Institute - ICM, INSERM, CNRS, Hôpital de la Pitié Salpêtrière, AP-HP, Sorbonne University, Paris, France.
Creative thinking involves the evaluation of one's ideas in order to select the best one, but the cognitive and neural mechanisms underlying this evaluation remain unclear. Using a combination of creativity and rating tasks, this study demonstrates that individuals attribute subjective values to their ideas, as a relative balance of their originality and adequacy. This relative balance depends on individual preferences and predicts individuals' creative abilities.
View Article and Find Full Text PDFNeurobiol Learn Mem
January 2025
School of Psychology, University of New South Wales, Australia. Electronic address:
Humans and animals use information about future access to rewards to influence their behaviour in the present, however the evidence for this is largely anecdotal. Here we use the nicotine intravenous self-administration paradigm to ask whether rats can use an auditory stimulus signalling a long (450 s) signalled time-out on the next trial to influence their nicotine intake in the present. Rats were trained to choose between low (15 µg/kg/infusion), medium (30 µg/kg/infusion) or high (60 µg/kg/infusion) doses of nicotine on any given trial.
View Article and Find Full Text PDFJ Am Med Inform Assoc
January 2025
Institute of Data Science, National University of Singapore, 117602, Singapore.
Objectives: This study introduces Smart Imitator (SI), a 2-phase reinforcement learning (RL) solution enhancing personalized treatment policies in healthcare, addressing challenges from imperfect clinician data and complex environments.
Materials And Methods: Smart Imitator's first phase uses adversarial cooperative imitation learning with a novel sample selection schema to categorize clinician policies from optimal to nonoptimal. The second phase creates a parameterized reward function to guide the learning of superior treatment policies through RL.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!