Compositional RL Agents That Follow Language Commands in Temporal Logic.

Front Robot AI

CSAIL, MIT, Cambridge, MA, United States.

Published: July 2021

We demonstrate how a reinforcement learning agent can use compositional recurrent neural networks to learn to carry out commands specified in linear temporal logic (LTL). Our approach takes as input an LTL formula, structures a deep network according to the parse of the formula, and determines satisfying actions. This compositional structure of the network enables zero-shot generalization to significantly more complex unseen formulas. We demonstrate this ability in multiple problem domains with both discrete and continuous state-action spaces. In a symbolic domain, the agent finds a sequence of letters that satisfy a specification. In a Minecraft-like environment, the agent finds a sequence of actions that conform to a formula. In the Fetch environment, the robot finds a sequence of arm configurations that move blocks on a table to fulfill the commands. While most prior work can learn to execute one formula reliably, we develop a novel form of multi-task learning for RL agents that allows them to learn from a diverse set of tasks and generalize to a new set of diverse tasks without any additional training. The compositional structures presented here are not specific to LTL, thus opening the path to RL agents that perform zero-shot generalization in other compositional domains.
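To make the idea of structuring a network according to the parse of a formula concrete, here is a minimal sketch. It is not the authors' implementation. The nested-tuple formula encoding, the names `Cell`, `Library`, and `Node`, the hidden size, the alphabet, and the untrained random weights are all assumptions made for illustration. It shows only the structural point: one recurrent cell is kept per operator or symbol, and a larger unseen formula built from the same operators reuses those cells instead of creating new ones.

```python
import math
import random

HIDDEN = 4                   # hypothetical hidden-state size
ALPHABET = ["a", "b", "c"]   # hypothetical symbols the agent can observe
ARITY = {"not": 1, "next": 1, "eventually": 1, "always": 1,
         "and": 2, "or": 2, "until": 2}


class Cell:
    """Tiny recurrent cell: h' = tanh(W [h; inputs]). Weights are random, not trained."""

    def __init__(self, n_inputs, rng):
        cols = HIDDEN + n_inputs
        self.w = [[rng.uniform(-0.5, 0.5) for _ in range(cols)]
                  for _ in range(HIDDEN)]

    def step(self, h, inputs):
        x = h + inputs  # list concatenation of hidden state and inputs
        return [math.tanh(sum(w * v for w, v in zip(row, x))) for row in self.w]


class Library:
    """Holds one shared cell per operator or atom (assumed sharing scheme)."""

    def __init__(self, seed=0):
        self.rng = random.Random(seed)
        self.cells = {}

    def get(self, key, n_inputs):
        if key not in self.cells:
            self.cells[key] = Cell(n_inputs, self.rng)
        return self.cells[key]


class Node:
    """One node of the network, mirroring one node of the formula's parse tree."""

    def __init__(self, formula, lib):
        op, *args = formula
        if op == "atom":
            self.children = []
            self.cell = lib.get("atom:" + args[0], len(ALPHABET))
        else:
            assert len(args) == ARITY[op], "wrong number of arguments for " + op
            self.children = [Node(a, lib) for a in args]
            self.cell = lib.get(op, HIDDEN * ARITY[op])
        self.h = [0.0] * HIDDEN  # per-node state; weights are shared, state is not

    def step(self, obs):
        # Leaves read the observation; operators read their children's states.
        if self.children:
            inputs = [v for c in self.children for v in c.step(obs)]
        else:
            inputs = obs
        self.h = self.cell.step(self.h, inputs)
        return self.h


def one_hot(symbol):
    return [1.0 if s == symbol else 0.0 for s in ALPHABET]


lib = Library()
small = Node(("until", ("atom", "a"), ("atom", "b")), lib)
cells_after_small = len(lib.cells)

# A deeper formula made of the same operators adds no new cells.
large = Node(("until", ("atom", "a"),
              ("until", ("atom", "b"), ("atom", "a"))), lib)
cells_after_large = len(lib.cells)

state = large.step(one_hot("a"))
```

In a full agent, the root state would presumably feed a policy head and the cells would be trained jointly across many formulas; neither part is shown here.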


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8326833
DOI: http://dx.doi.org/10.3389/frobt.2021.689550
