Asynchronous Deep Double Dueling Q-learning for trading-signal execution in limit order book markets.

Front Artif Intell

Department of Engineering Science, Oxford-Man Institute of Quantitative Finance, University of Oxford, Oxford, United Kingdom.

Published: September 2023

We employ deep reinforcement learning (RL) to train an agent to successfully translate a high-frequency trading signal into a trading strategy that places individual limit orders. Based on the ABIDES limit order book simulator, we build a reinforcement learning OpenAI gym environment and utilize it to simulate a realistic trading environment for NASDAQ equities based on historic order book messages. To train a trading agent that learns to maximize its trading return in this environment, we use Deep Dueling Double Q-learning with the APEX (asynchronous prioritized experience replay) architecture. The agent observes the current limit order book state, its recent history, and a short-term directional forecast. To investigate the performance of RL for adaptive trading independently from a concrete forecasting algorithm, we study the performance of our approach utilizing synthetic alpha signals obtained by perturbing forward-looking returns with varying levels of noise. Here, we find that the RL agent learns an effective trading strategy for inventory management and order placing that outperforms a heuristic benchmark trading strategy having access to the same signal.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10561243PMC
http://dx.doi.org/10.3389/frai.2023.1151003DOI Listing

Publication Analysis

Top Keywords

order book
16
limit order
12
trading strategy
12
reinforcement learning
8
trading
8
agent learns
8
order
5
asynchronous deep
4
deep double
4
double dueling
4

Similar Publications

A multi-agent reinforcement learning framework for cross-domain sequential recommendation.

Neural Netw

January 2025

Key Laboratory of Knowledge Engineering with Big Data (the Ministry of Education of China), Hefei University of Technology, Hefei, 230009, Anhui, China. Electronic address:

Sequential recommendation models aim to predict the next item based on the sequence of items users interact with, ordered chronologically. However, these models face the challenge of data sparsity. Recent studies have explored cross-domain sequential recommendation, where users' interaction data across multiple source domains are leveraged to enhance recommendations in data-sparse target domains.

View Article and Find Full Text PDF

This article describes the construction and validation of an instruction manual geared toward nutritional care (NC) for people with severe obesity in the Brazilian Unified Health System (SUS). In the production of this instruction manual, a broad literature review was conducted for the identification and discussion of topics to be treated. The content and appearance validity were conducted according to the Delphi technique and to focus groups, respectively, with evaluators who were nutritionists and practitioners, from different regions of Brazil.

View Article and Find Full Text PDF

We studied freshly collected, dried and herbarized leaf fragments of two palms, namely L. and L., most commonly used for palm-leaf manuscript (PLM) production in South (S) and Southeast Asia (SE) in order to reveal differences in their phytolith assemblages.

View Article and Find Full Text PDF

In order to improve the accuracy of camera colorimetric characterization, a multi-input parameter optimization method was proposed in this paper. The input parameters of the traditional camera characterization method were generally RGB values; in the proposed method, the luminance parameter L was introduced in addition to RGB values, and the four-input parameters of RGBL were used as input parameters for the conversion model. In the experiment, 549 colors were uniformly selected from the Munsell Book of Color (Matte Edition), and the RGBL values and corresponding CIEXYZ values of the selected colors were measured by a spectroradiometer and three cameras, including an imaging luminance meter, respectively.

View Article and Find Full Text PDF

Introduction: Intrusive memories occur frequently after potentially traumatic events and form a core symptom of posttraumatic stress disorder (PTSD) if they persist. The translational approach of visuospatial interventions tries to target those intrusive memories in order to reduce their frequency predominantly using an intervention including as one component the computer game Despite promising results, the application of has critical drawbacks, e.g.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!