With the increase in Internet of Things (IoT) devices and network communications, but with less bandwidth growth, the resulting constraints must be overcome. Due to the network complexity and uncertainty of emergency distribution parameters in smart environments, using predetermined rules seems illogical. Reinforcement learning (RL), as a powerful machine learning approach, can handle such smart environments without a trainer or supervisor. Recently, we worked on bandwidth management in a smart environment with several fog fragments using limited shared bandwidth, where IoT devices may experience uncertain emergencies in terms of the time and sequence needed for more bandwidth for further higher-level communication. We introduced fog fragment cooperation using an RL approach under a predefined fixed threshold constraint. In this study, we promote this approach by removing the fixed level of restriction of the threshold through hierarchical reinforcement learning (HRL) and completing the cooperation qualification. At the first learning hierarchy level of the proposed approach, the best threshold level is learned over time, and the final results are used by the second learning hierarchy level, where the fog node learns the best device for helping an emergency device by temporarily lending the bandwidth. Although equipping the method to the adaptive threshold and restricting fog fragment cooperation make the learning procedure more difficult, the HRL approach increases the method's efficiency in terms of time and performance.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8587839 | PMC |
http://dx.doi.org/10.3390/s21217053 | DOI Listing |
J Exp Anal Behav
January 2025
Animal Learning and Behavior Laboratory, Departamento de Psicología Básica I, Facultad de Psicología, Universidad Nacional de Educación a Distancia (UNED), Madrid, Spain.
The development of schedule-induced drinking depends on different variables affecting the food delivered at the end of the interfood interval. There are mixed results concerning the effects of varying magnitude and/or preference of different reinforcers in the development of schedule-induced drinking, with some studies showing higher levels and other studies showing lower levels of drinking. The purpose of this study was to observe how differences in preference for a flavor of equally nutritious food pellets influence the development and maintenance of schedule-induced drinking.
View Article and Find Full Text PDFFront Robot AI
December 2024
Intelligent Robotics Group, Electrical Engineering and Automation Department, Aalto University, Helsinki, Finland.
This work considers the problem of learning cooperative policies in multi-agent settings with partially observable and non-stationary environments without a communication channel. We focus on improving information sharing between agents and propose a new multi-agent actor-critic method called (MACRPO). We propose two novel ways of integrating information across agents and time in MACRPO: First, we use a recurrent layer in the critic's network architecture and propose a new framework to use the proposed meta-trajectory to train the recurrent layer.
View Article and Find Full Text PDFSmart Learn Environ
December 2024
Department of Biochemistry, University of Nebraska - Lincoln, 1901 Vine St., Beadle N133, Lincoln, NE 68588 USA.
Unlabelled: Concept-heavy courses such as Biochemistry in life and physical science curricula are challenging for many college-aged students. It is easy for students to disengage in a lecture and not learn the subject matter while in class. To improve student learning participation, we employed a flipped format for the first half of the course and compared learning outcomes and attitudes with the traditional lecture in the second half of the course.
View Article and Find Full Text PDFNPP Digit Psychiatry Neurosci
January 2025
Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY USA.
Reinforcement learning studies propose that decision-making is guided by a tradeoff between computationally cheaper model-free (habitual) control and costly model-based (goal-directed) control. Greater model-based control is typically used under highly rewarding conditions to minimize risk and maximize gain. Although prior studies have shown impairments in sensitivity to reward value in individuals with frequent alcohol use, it is unclear how these individuals arbitrate between model-free and model-based control based on the magnitude of reward incentives.
View Article and Find Full Text PDFComput Med Imaging Graph
December 2024
Beijing Institute of Technology, No. 5, Zhong Guan Cun South Street, Beijing, 100081, China. Electronic address:
The change of layer thickness of retina is closely associated with the development of ocular diseases such as glaucoma and optic disc drusen. Optical coherence tomography (OCT) is a widely used technology to visualize the lamellar structures of retina. Accurate segmentation of retinal lamellar structures is crucial for diagnosis, treatment, and related research of ocular diseases.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!