Deep Deterministic Policy Gradient for Portfolio Management

机译：投资组合管理的深度确定性政策梯度

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Portfolio management is a financial problem that has been the subject of much research over the years. It is a planning task where an agent constantly redistributes resources across a set of assets in order to achieve investment objectives and thereby maximize return. However, it remains difficult to obtain an optimal strategy in an environment as complex and dynamic as the financial market. Our article focuses on solving this stochastic control problem in order to obtain an optimal strategy that would allow us to make profitable decisions by interacting directly with the environment. To do this, we explore the power of deep reinforcement learning which differs from traditional Machine Learning by combining the task of predicting stock behavior and analyzing the optimal course of action in a single unit, thus aligning the problem of Machine Learning with the investor's objectives. As a method, we propose to use the Deep Deterministic Policy Gradient which is an off-policy algorithm and is used for environments with continuous action spaces. The obtained results demonstrate that the model achieves a higher rate of return than the strategy of “Uniform Buy and Hold” stocks and the strategy of “Buy Best Stock in last month”.

机译：投资组合管理是多年来一直是众多研究的主题的财务问题。它是一个规划任务，代理人不断重新分配跨一组资产的资源，以实现投资目标，从而最大限度地提高回报。然而，它仍然难以在环境中获得最佳的战略，作为金融市场的复杂和动态。我们的文章侧重于解决这一随机控制问题，以获得最佳策略，使我们能够通过直接与环境进行互动来实现有利可图的决策。为此，我们探讨了深度加强学习的力量，通过组合预测库存行为的任务和分析单个单元的最佳动作的任务来探讨传统的机器学习的力量，从而使投资者目标的机器学习问题对齐。作为一种方法，我们建议使用截止策略算法的深度确定性政策梯度，并用于具有连续动作空间的环境。所获得的结果表明，该模型比“统一买卖”股票的策略实现了更高的回报率，以及“上个月购买最佳股票”的策略。

著录项

来源
《IEEE Congress on Information Science and Technology》|2021年|424-429|共6页
会议地点
作者
Firdaous Khemlichi; Hiba Chougrad; Youness Idrissi Khamlichi; Abdessamad el Boushaki; Safae Elhaj Ben Ali;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Stochastic processes; Reinforcement learning; Prediction algorithms; Reliability; Task analysis; Portfolios; Robots;

机译：随机流程;加固学习;预测算法;可靠性;任务分析;投资组合;机器人;

相似文献

外文文献
中文文献
专利

1. Deep Deterministic Policy Gradient Based Energy Management Strategy for Hybrid Electric Tracked Vehicle With Online Updating Mechanism [J] . Zhikai Ma, Qian Huo, Tao Zhang, Quality Control, Transactions . 2021,第1期

机译：在线更新机制的混合电动跟踪车辆的深度确定性政策梯度基于能量管理策略
2. An Intelligent Energy Management Strategy for Hybrid Vehicle with irrational actions using Twin Delayed Deep Deterministic Policy Gradient [J] . Zemin Eitan Liu, Quan Zhou, Yanfei Li, IFAC PapersOnLine . 2021,第10期

机译：使用双胞胎延迟的非理性行为的混合动力车辆智能能量管理策略深度确定性政策梯度
3. Deep Ensemble Reinforcement Learning with Multiple Deep Deterministic Policy Gradient Algorithm [J] . Junta Wu, Huiyun Li Mathematical Problems in Engineering: Theory, Methods and Applications . 2020,第1期

机译：具有多种深度确定性政策梯度算法的深度集成钢筋学习
4. A Deep Deterministic Policy Gradient-based Strategy for Stocks Portfolio Management [C] . Huanming Zhang, Zhengyong Jiang, Jionglong Su IEEE International Conference on Big Data Analytics . 2021

机译：基于深度确定的基于政策梯度的股票制度策略
5. Deep Reinforcement Learning for Portfolio Management [D] . Ma, Yue. 2021

机译：投资组合管理的深度增强学习
6. Implementation of Deep Deterministic Policy Gradients for Controlling Dynamic Bipedal Walking [O] . Chujun Liu, Andrew G. Lonsberry, Mark J. Nandor, 2019

机译：控制动态双足行走的深度确定性策略梯度的实现
7. A Deep Deterministic Policy Gradient-based Strategy for Stocks Portfolio Management [O] . Huanming Zhang, Zhengyong Jiang, Jionglong Su 2021

机译：基于股票投资组合管理的深度确定性政策梯度战略

Deep Deterministic Policy Gradient for Portfolio Management

摘要

著录项

相似文献

相关主题

期刊订阅