We demonstrate how a reinforcement learning agent can use compositional recurrent neural networks to learn to carry out commands specified in linear temporal logic (LTL). Our approach takes as input an LTL formula, structures a deep network according to the parse of the formula, and determines satisfying actions. This compositional structure of the network enables zero-shot generalization to significantly more complex unseen formulas. We demonstrate this ability in multiple problem domains with both discrete and continuous state-action spaces. In a symbolic domain, the agent finds a sequence of letters that satisfy a specification. In a Minecraft-like environment, the agent finds a sequence of actions that conform to a formula. In the Fetch environment, the robot finds a sequence of arm configurations that move blocks on a table to fulfill the commands. While most prior work can learn to execute one formula reliably, we develop a novel form of multi-task learning for RL agents that allows them to learn from a diverse set of tasks and generalize to a new set of diverse tasks without any additional training. The compositional structures presented here are not specific to LTL, thus opening the path to RL agents that perform zero-shot generalization in other compositional domains.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8326833 | PMC |
http://dx.doi.org/10.3389/frobt.2021.689550 | DOI Listing |
Biochem Genet
January 2025
Department of Gynecology, Women's Hospital, Zhejiang University School of Medicine, Hangzhou, China.
This study aimed to identify shared gene expression related to circadian rhythm disruption in polycystic ovary syndrome (PCOS) and non-alcoholic fatty liver disease (NAFLD) to discover common diagnostic biomarkers. Visceral fat RNA samples were collected from 12 PCOS and 14 non-PCOS patients, a sample size representing the clinical situation and sufficient to capture PCOS gene expression profiles. Along with liver transcriptome profiles from NAFLD patients, these data were analyzed to identify crosstalk circadian rhythm-related genes (CRRGs) between the diseases.
View Article and Find Full Text PDFJ Mol Model
January 2025
Shanxi Jiangyang Chemical Limited Company, Taiyuan, 030041, Shanxi, China.
Context: DNAN/DNB cocrystals, as a newly developed type of energetic material, possess superior safety and thermal stability, making them a suitable alternative to traditional melt-cast explosives. Nonetheless, an exploration of the thermal degradation dynamics of the said cocrystal composite has heretofore remained uncharted. Consequently, we engaged the ReaxFF/lg force field modality to delve into the thermal dissociation processes of the DNAN/DNB cocrystal assembly across a spectrum of temperatures, encompassing 2500, 2750, 3000, 3250, and 3500 K.
View Article and Find Full Text PDFMol Biol Rep
January 2025
Zoological Survey of India, Kolkata, 700053, India.
Background: The endangered Kashmir musk deer (Moschus cupreus), native to high-altitude Himalayas, is an ecological significant and endangered ungulate, threatened by habitat loss and poaching for musk pod distributed in western Himalayan ranges of India, Nepal and Afghanistan. Despite its critical conservation status and ecological importance in regulating vegetation dynamics, knowledge gaps persist regarding its population structure and genetic diversity, hindering effective management strategies.
Methods And Results: We aimed to understand the population genetics of Kashmir musk deer in north-western Himalayas using two mitochondrial DNA (mtDNA) regions and 11 microsatellite loci.
Cancer Chemother Pharmacol
January 2025
Markey Cancer Center, University of Kentucky, Lexington, KY, USA.
Purpose: Patients with partial or complete DPD deficiency have decreased capacity to degrade fluorouracil and are at risk of developing toxicity, which can be even life-threatening.
Case: A 43-year-old man with moderately differentiated rectal adenocarcinoma on capecitabine presented to the emergency department with complaints of nausea, vomiting, diarrhea, weakness, and lower abdominal pain for several days. Laboratory findings include grade 4 neutropenia (ANC 10) and thrombocytopenia (platelets 36,000).
Curr Microbiol
January 2025
Department of Microbiology, Faculty of Science, Kasetsart University, Chatuchak, Bangkok, 10900, Thailand.
An aerobic, Gram-stain-positive, motile, coccus-shaped actinomycete, designated strain LSe6-4, was isolated from leaves of sea purslane (Sesuvium portulacastrum L.) in Thailand and subjected to a polyphasic taxonomic studies. Growth of the strain occurred at temperatures between 15 and 38 °C, and with NaCl concentrations 0-13%.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!