A Family of Robust Stochastic Operators for Reinforcement Learning

机译：用于加强学习的一家强大的随机运营商

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We consider a new family of stochastic operators for reinforcement learning that seeks to alleviate negative effects and become more robust to approximation or estimation errors. Theoretical results are established, showing that our family of operators preserve optimality and increase the action gap in a stochastic sense. Empirical results illustrate the strong benefits of our robust stochastic operators, significantly outperforming the classical Bellman and recently proposed operators.

机译：我们考虑了一个新的随机运营商，用于加强学习，寻求缓解负面影响并变得更加强大，以近似或估计误差。建立了理论结果，表明我们的运营商系列保留了最优性并增加了随机意义上的动作差距。经验结果说明了我们强大的随机运营商的强大益处，显着优于古典贝尔曼和最近提出的运营商。

著录项

来源
《Conference on Neural Information Processing Systems》|2020年|p15071-15901|共11页
会议地点
作者
Yingdong Lu; Mark S. Squillante; Chai Wah Wu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计量学;
关键词

相似文献

外文文献
中文文献
专利

1. Theoretical Analysis of Efficiency and Robustness of Softmax and Gap-Increasing Operators in Reinforcement Learning [J] . Tadashi Kozuno, Eiji Uchibe, Kenji Doya JMLR: Workshop and Conference Proceedings . 2018,第2009期

机译：增强学习中Softmax和间隙增加算子的效率和鲁棒性的理论分析
2. Theoretical Analysis of Efficiency and Robustness of Softmax and Gap-Increasing Operators in Reinforcement Learning [J] . Tadashi Kozuno, Eiji Uchibe, Kenji Doya JMLR: Workshop and Conference Proceedings . 2018,第2010期

机译：增强学习中Softmax和间隙增加算子的效率和鲁棒性的理论分析
3. Robust Reinforcement Learning for Stochastic Linear Quadratic Control with Multiplicative Noise ? [J] . Bo Pang, Zhong-Ping Jiang IFAC PapersOnLine . 2021,第7期

机译：具有乘法噪声的随机线性二次控制的鲁棒增强学习？
4. A Family of Robust Stochastic Operators for Reinforcement Learning [C] . Yingdong Lu, Mark S. Squillante, Chai Wah Wu Conference on Neural Information Processing Systems . 2020

机译：用于加强学习的一家强大的随机运营商
5. A Smoothing Framework for Stochastic Continuous-Time Reinforcement Learning Problem [D] . Hu, Bowen. 2021

机译：用于随机连续时间增强学习问题的平滑框架
6. Scaled free-energy based reinforcement learning for robust and efficient learning in high-dimensional state spaces [O] . Stefan Elfwing, Eiji Uchibe, Kenji Doya 2013

机译：基于缩放自由能的增强学习可在高维状态空间中进行健壮和高效的学习
7. Robust Model Predictive Shielding for Safe Reinforcement Learning with Stochastic Dynamics [O] . Shuo Li, Osbert Bastani 2020

机译：随机动力学安全强化学习的鲁棒模型预测屏蔽

A Family of Robust Stochastic Operators for Reinforcement Learning

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅