Policy Compression for Intelligent Continuous Control on Low-Power Edge Devices.

Sensors (Basel)

IDLab-Faculty of Applied Engineering, University of Antwerp-IMEC, Sint-Pietersvliet 7, 2000 Antwerp, Belgium.

Published: July 2024

Interest in deploying deep reinforcement learning (DRL) models on low-power edge devices, such as Autonomous Mobile Robots (AMRs) and Internet of Things (IoT) devices, has seen a significant rise due to the potential of performing real-time inference by eliminating the latency and reliability issues incurred from wireless communication and the privacy benefits of processing data locally. Deploying such energy-intensive models on power-constrained devices is not always feasible, however, which has led to the development of model compression techniques that can reduce the size and computational complexity of DRL policies. Policy distillation, the most popular of these methods, can be used to first lower the number of network parameters by transferring the behavior of a large teacher network to a smaller student model before deploying these students at the edge. This works well with deterministic policies that operate using discrete actions. However, many real-world tasks that are power constrained, such as in the field of robotics, are formulated using continuous action spaces, which are not supported. In this work, we improve the policy distillation method to support the compression of DRL models designed to solve these continuous control tasks, with an emphasis on maintaining the stochastic nature of continuous DRL algorithms. Experiments show that our methods can be used effectively to compress such policies up to 750% while maintaining or even exceeding their teacher's performance by up to 41% in solving two popular continuous control tasks.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11314760PMC
http://dx.doi.org/10.3390/s24154876DOI Listing

Publication Analysis

Top Keywords

continuous control
12
low-power edge
8
edge devices
8
drl models
8
policy distillation
8
control tasks
8
continuous
5
policy compression
4
compression intelligent
4
intelligent continuous
4

Similar Publications

Background: Despite numerous genetic studies on Infectious Bronchitis Virus (IBV), many strains from the Middle East remain misclassified or unclassified. Genotype 1 (GI-1) is found globally, while genotype 23 (GI-23) has emerged as the predominant genotype in the Middle East region, evolving continuously through inter- and intra-genotypic recombination. The GI-23 genotype is now enzootic in Europe and Asia.

View Article and Find Full Text PDF

Background: Immune checkpoint inhibitors (ICIs) are recommended to treat patients with deficient mismatch repair/microsatellite instability high (dMMR/MSI-H) metastatic colorectal cancer (mCRC). Pivotal trials have fixed a maximum ICI duration of 2 years, without a compelling rationale. A shorter treatment duration has the potential to improve patients' quality of life and reduce both toxicity and cost without compromising efficacy.

View Article and Find Full Text PDF

Objectives: This study sought to assess the effectiveness of nurse-led care (NLC) in patients with rheumatoid arthritis (RA).

Methods: We conducted a comprehensive search of the Cochrane Library, Web of Science, PubMed, Embase, CINAHL, ClinicalTrials.gov databases and the references from relevant literature published prior to May 2023.

View Article and Find Full Text PDF

A man in his 60s suffered from refractory, biopsy-proven subacute cutaneous lupus erythematosus that required chronic, moderate dose steroids to manage. His rash was accompanied by arthralgias and negative autoantibody testing. His subacute lupus erythematosus (SCLE) was responsive to tofacitinib, but thrombotic complications limited the use of this medication.

View Article and Find Full Text PDF

Amniotic Fluid as a Potential Treatment for Vocal Fold Scar in a Rabbit Model.

J Voice

January 2025

Department of Otolaryngology - Head and Neck Surgery, University of Utah, Salt Lake City, UT; Department of Surgery, University Utah, Salt Lake City, UT.

Objectives/hypothesis: Vocal fold (VF) injury and chronic inflammation can progress to scarring, which is notoriously difficult to treat. Human amniotic fluid (AF) has potential for VF wound healing in a rabbit model, and we hypothesized that AF would demonstrate wound healing properties superior to hyaluronic acid (HA) over time.

Study Design: Randomized, controlled trial.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!