首页> 外文会议>Conference on Neural Information Processing Systems >A Family of Robust Stochastic Operators for Reinforcement Learning
【24h】

A Family of Robust Stochastic Operators for Reinforcement Learning

机译:用于加强学习的一家强大的随机运营商

获取原文
获取外文期刊封面目录资料

摘要

We consider a new family of stochastic operators for reinforcement learning that seeks to alleviate negative effects and become more robust to approximation or estimation errors. Theoretical results are established, showing that our family of operators preserve optimality and increase the action gap in a stochastic sense. Empirical results illustrate the strong benefits of our robust stochastic operators, significantly outperforming the classical Bellman and recently proposed operators.
机译:我们考虑了一个新的随机运营商,用于加强学习,寻求缓解负面影响并变得更加强大,以近似或估计误差。 建立了理论结果,表明我们的运营商系列保留了最优性并增加了随机意义上的动作差距。 经验结果说明了我们强大的随机运营商的强大益处,显着优于古典贝尔曼和最近提出的运营商。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号