Home > Foreign Conference Proceedings > National Conferences on Artificial Intelligence > Reinforcement Learning for Trading Systems and Portfolios

Reinforcement Learning for Trading Systems and Portfolios



Abstract

We propose to train trading systems by optimizing financial objective functions via reinforcement learning. The performance functions that we consider as value functions are profit or wealth, the Sharpe ratio, and our recently proposed differential Sharpe ratio for online learning. In Moody & Wu (1997), we presented empirical results from controlled experiments that demonstrated the advantages of reinforcement learning relative to supervised learning. Here we extend our previous work to compare Q-Learning to a reinforcement learning technique based on real-time recurrent learning (RTRL) that maximizes immediate reward. Our simulation results include a spectacular demonstration of the presence of predictability in the monthly Standard and Poor's 500 stock index for the 25-year period 1970 through 1994. Our reinforcement trader achieves a simulated out-of-sample profit of over 4000% for this period, compared to a return of about 1300% for a buy-and-hold strategy (with dividends reinvested). This superior result is achieved with substantially lower risk.
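The differential Sharpe ratio mentioned above is obtained, per Moody & Wu (1997), by tracking exponential moving averages of the returns and squared returns and expanding the Sharpe ratio to first order in the EMA decay rate, yielding an instantaneous performance measure that can serve as an immediate reward. A minimal sketch of that incremental update (class and parameter names are illustrative, not from the paper's code):

```python
class DifferentialSharpe:
    """Incrementally updated differential Sharpe ratio (sketch).

    Maintains EMAs A_t of returns and B_t of squared returns:
        A_t = A_{t-1} + eta * (R_t   - A_{t-1})
        B_t = B_{t-1} + eta * (R_t^2 - B_{t-1})
    and emits the first-order expansion of the Sharpe ratio in eta:
        D_t = (B_{t-1} * dA_t - 0.5 * A_{t-1} * dB_t)
              / (B_{t-1} - A_{t-1}^2) ** 1.5
    """

    def __init__(self, eta=0.01):
        self.eta = eta  # EMA decay (adaptation) rate
        self.A = 0.0    # EMA of returns
        self.B = 0.0    # EMA of squared returns

    def update(self, R):
        """Consume one trading return R_t and return D_t."""
        dA = R - self.A
        dB = R * R - self.B
        denom = (self.B - self.A ** 2) ** 1.5
        # Guard the start-up phase, where the variance estimate is zero.
        D = (self.B * dA - 0.5 * self.A * dB) / denom if denom > 0 else 0.0
        self.A += self.eta * dA
        self.B += self.eta * dB
        return D
```

Because D_t depends only on the current return and two running averages, it can be computed online at each time step and fed directly to an RTRL-style trader as the immediate reward being maximized.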


