Interest in deploying deep reinforcement learning (DRL) models on low-power edge devices, such as Autonomous Mobile Robots (AMRs) and Internet of Things (IoT) devices, has risen significantly. On-device inference enables real-time operation by eliminating the latency and reliability issues of wireless communication, and processing data locally offers privacy benefits. Deploying such energy-intensive models on power-constrained devices is not always feasible, however, which has led to the development of model compression techniques that reduce the size and computational complexity of DRL policies. Policy distillation, the most popular of these methods, lowers the number of network parameters by transferring the behavior of a large teacher network to a smaller student model, which is then deployed at the edge. This works well with deterministic policies that operate using discrete actions. However, many power-constrained real-world tasks, such as those in robotics, are formulated using continuous action spaces, which existing policy distillation methods do not support. In this work, we improve the policy distillation method to support the compression of DRL models designed to solve these continuous control tasks, with an emphasis on maintaining the stochastic nature of continuous DRL algorithms. Experiments show that our methods can be used effectively to compress such policies up to 750% while maintaining or even exceeding their teacher's performance by up to 41% in solving two popular continuous control tasks.
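The abstract does not state which loss is used to transfer a stochastic teacher's behavior to the student. As a minimal sketch only, assuming both policies output a diagonal Gaussian over actions (a mean and a standard deviation per action dimension), one common choice is to minimize the KL divergence from the teacher's action distribution to the student's. The function names `gaussian_kl` and `distillation_loss` are hypothetical and are not taken from the paper.

```python
import math
from typing import Sequence, Tuple

# Hypothetical type: one (means, stds) pair describing a policy's output for a state.
PolicyOutput = Tuple[Sequence[float], Sequence[float]]


def gaussian_kl(mu_t: Sequence[float], std_t: Sequence[float],
                mu_s: Sequence[float], std_s: Sequence[float]) -> float:
    """KL(teacher || student) between two diagonal Gaussian action distributions."""
    kl = 0.0
    for mt, st, ms, ss in zip(mu_t, std_t, mu_s, std_s):
        # Closed-form KL between two univariate Gaussians, summed over action dims.
        kl += math.log(ss / st) + (st ** 2 + (mt - ms) ** 2) / (2.0 * ss ** 2) - 0.5
    return kl


def distillation_loss(teacher_out: Sequence[PolicyOutput],
                      student_out: Sequence[PolicyOutput]) -> float:
    """Mean KL over a batch of states (assumed loss, not the paper's confirmed one)."""
    total = 0.0
    for (mu_t, std_t), (mu_s, std_s) in zip(teacher_out, student_out):
        total += gaussian_kl(mu_t, std_t, mu_s, std_s)
    return total / len(teacher_out)


# Usage: two states with 1-D actions; the student matches the teacher on the first only.
teacher = [([0.0], [1.0]), ([0.0], [1.0])]
student = [([0.0], [1.0]), ([1.0], [1.0])]
loss = distillation_loss(teacher, student)
```

Matching both the mean and the standard deviation, rather than only the mean action, is what lets a student of this kind keep the teacher's stochasticity. In practice the loss would be computed with an automatic differentiation library so that gradients can flow into the student network.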
| Download full-text PDF | Source |
|---|---|
| http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11314760 | PMC |
| http://dx.doi.org/10.3390/s24154876 | DOI Listing |