We investigate the potential of Multi-Objective, Deep Reinforcement Learning for stock and cryptocurrency single-asset trading: in particular, we consider a Multi-Objective algorithm which generalizes the reward functions and discount factor (i.e., these components are not specified a priori, but incorporated in the learning process). Firstly, using several important assets (BTCUSD, ETHUSDT, XRPUSDT, AAPL, SPY, NIFTY50), we verify the reward generalization property of the proposed Multi-Objective algorithm, and provide preliminary statistical evidence showing increased predictive stability over the corresponding Single-Objective strategy. Secondly, we show that the Multi-Objective algorithm has a clear edge over the corresponding Single-Objective strategy when the reward mechanism is sparse (i.e., when non-null feedback is infrequent over time). Finally, we discuss the generalization properties with respect to the discount factor. The entirety of our code is provided in open-source format.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10770001 | PMC |
http://dx.doi.org/10.1007/s00521-023-09033-7 | DOI Listing |
Sci Rep
January 2025
Xingtai Naknor Technology Co., Ltd, Xingtai, 054000, China.
The heating oil circuit plays an essential role in the heating calendering roller for the lithium battery pole piece. To achieve the optimization of the heating oil circuit, a fluid-thermal-structural coupling method and a multi-objective optimization procedure are proposed to obtain the optimal solution. A fluid-thermal-structural coupling flowchart based on the numerical modeling for the calendering roller temperature distribution is created to automate the analysis processes in the optimization iteration.
View Article and Find Full Text PDFSensors (Basel)
January 2025
School of Biomedical Engineering, Tsinghua University, Shuang Qing Road, Beijing 100084, China.
Mastoidectomy is critical in acoustic neuroma surgery, where precise planning of the bone milling area is essential for surgical navigation. The complexity of representing the irregular volumetric area and the presence of high-risk structures (e.g.
View Article and Find Full Text PDFSensors (Basel)
January 2025
Xi'an Power Supply Company, State Grid Shaanxi Electric Power Co., Ltd., Xi'an 710032, China.
Under the carbon peaking and carbon neutrality target background, efficient collaborative scheduling between distribution networks and multi-microgrids is of great significance for enhancing renewable energy accommodation and ensuring stable system operation. Therefore, this paper proposes a collaborative optimization method for the operation of distribution networks and multi-microgrids with shared energy storage based on a multi-body game. The method is modeled and solved in two stages.
View Article and Find Full Text PDFMaterials (Basel)
January 2025
School of Aeronautic Science and Engineering, Beihang University, Beijing 100191, China.
The structure of thermoset composite laminated plates is made by stacking layers of plies with different fiber orientations. Similarly, the stiffened panel structure is assembled from components with varying ply configurations, resulting in thermal residual stresses and processing-induced deformations (PIDs) during manufacturing. To mitigate the residual stresses caused by the geometric features of corner structures and the mismatch between the stiffener-skin ply orientations, which lead to PIDs in composite-stiffened panels, this study proposes a multi-objective stacking optimization strategy based on an improved adaptive genetic algorithm (IAGA).
View Article and Find Full Text PDFMaterials (Basel)
January 2025
Department of Civil Engineering, School of Mechanics and Engineering Science, Shanghai University, Shanghai 200444, China.
This paper establishes fatigue life prediction models using the soft computing method to address insufficient parameter consideration and limited computational accuracy in predicting the fatigue life of fiber-reinforced polymer (FRP) strengthened concrete beams. Five different input forms were proposed by collecting 117 sets of fatigue test data of FRP-strengthened concrete beams from the existing literature and integrating the outcomes from Pearson correlation analysis and significance testing. Using Gene Expression Programming (GEP), the effects of various input configurations on the accuracy of model predictions were examined.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!