Explainable post hoc portfolio management financial policy of a Deep Reinforcement Learning agent.

PLoS One

Faculty of Economics and Business (ICADE), Universidad Pontificia Comillas, Madrid, Spain.

Published: January 2025

Investment policies for financial portfolio management that are computed quantitatively with modern portfolio theory techniques, such as the Markowitz model, rely on a set of assumptions that are not supported by data in high-volatility markets such as the technology sector or cryptocurrencies. Hence, quantitative researchers are looking for alternative models to tackle this problem. Concretely, portfolio management (PM) is a problem that has recently been addressed successfully by Deep Reinforcement Learning (DRL) approaches. In particular, DRL algorithms train an agent by estimating the distribution of the expected reward of every action performed by the agent given any financial state in a simulator, also called a gymnasium. However, these methods rely on Deep Neural Network models to represent this distribution. Although these models are universal approximators, capable of representing the distribution over time, they cannot explain their behaviour, which is determined by a set of parameters that are not interpretable. Critically, financial investors' policies require predictions to be interpretable, so that investors can assess whether they follow a reasonable behaviour; as a result, DRL agents are not suited to follow a particular policy or explain their actions. In this work, motivated by the goal of making DRL explainable, we developed a novel Explainable DRL (XDRL) approach for PM, integrating the Proximal Policy Optimization (PPO) DRL algorithm with the model-agnostic explainable machine learning techniques of feature importance, SHAP and LIME to enhance transparency at prediction time. By executing our methodology, we can interpret at prediction time the actions of the agent to assess whether they follow the requisites of an investment policy or to assess the risk of following the agent's suggestions. We empirically illustrate this by successfully identifying key features influencing investment decisions, which demonstrates the ability to explain the agent's actions at prediction time.
We propose the first explainable post hoc PM financial policy of a DRL agent.
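
As a rough illustration of what a post hoc, model-agnostic explanation of an agent's action can look like, the sketch below computes exact Shapley values for a toy policy with three input features. It is not the authors' implementation and does not use the SHAP library or a trained PPO agent: the feature names, the `toy_policy` function, the zero baseline and the example state are all hypothetical, chosen only so the computation stays small enough to enumerate every feature coalition.

```python
from itertools import combinations
from math import factorial

# Hypothetical feature names for a single-asset state; not taken from the paper.
FEATURES = ["return_5d", "volatility_20d", "rsi_14"]


def toy_policy(state):
    """Stand-in for a trained agent: maps a state to a portfolio weight in [0, 1]."""
    score = (
        2.0 * state["return_5d"]
        - 1.5 * state["volatility_20d"]
        + 0.5 * state["return_5d"] * state["rsi_14"]
    )
    return min(1.0, max(0.0, 0.5 + score))


def shapley_values(policy, state, baseline):
    """Exact Shapley attribution of policy(state) - policy(baseline) to each feature."""
    n = len(FEATURES)

    def value(coalition):
        # Features in the coalition take their observed value; the rest stay at baseline.
        mixed = {f: (state[f] if f in coalition else baseline[f]) for f in FEATURES}
        return policy(mixed)

    phi = {}
    for f in FEATURES:
        others = [g for g in FEATURES if g != f]
        total = 0.0
        for k in range(n):
            # Shapley weight for coalitions of size k that exclude feature f.
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                without_f = set(subset)
                total += weight * (value(without_f | {f}) - value(without_f))
        phi[f] = total
    return phi


# Hypothetical observed state and an all-zero reference state (both assumptions).
state = {"return_5d": 0.10, "volatility_20d": 0.20, "rsi_14": 0.60}
baseline = {"return_5d": 0.0, "volatility_20d": 0.0, "rsi_14": 0.0}

phi = shapley_values(toy_policy, state, baseline)
for name, contribution in sorted(phi.items(), key=lambda kv: -abs(kv[1])):
    print(f"{name}: {contribution:+.4f}")
```

The attributions sum to the difference between the action at the observed state and the action at the baseline, which is what lets a reviewer check whether a given allocation was driven by the features an investment policy allows. Exact enumeration grows exponentially with the number of features, so for realistic state spaces an approximation would be needed instead.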

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11737690 (PMC)
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0315528 (PLOS)
