...
首页> 外文期刊>Physical review, E. Statistical physics, plasmas, fluids, and related interdisciplinary topics >Convergence of reinforcement learning algorithms and acceleration of learning - art. no. 026706
【24h】

Convergence of reinforcement learning algorithms and acceleration of learning - art. no. 026706

机译:强化学习算法的融合和学习加速。没有。 026706

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

The techniques of reinforcement learning have been gaining increasing popularity recently. However, the question of their convergence rate is still open. We consider the problem of choosing the learning steps alpha(n), and their relation with discount gamma and exploration degree epsilon. Appropriate choices of these parameters may drastically influence the convergence rate of the techniques. From analytical examples, we conjecture optimal values of alpha(n) and then use numerical examples to verify our conjectures. [References: 20]
机译:强化学习技术最近已经越来越流行。但是,它们的收敛速度问题仍然悬而未决。我们考虑选择学习步骤alpha(n)的问题,以及它们与折扣伽玛和探索度epsilon的关系。这些参数的适当选择,可能会大大影响的技术收敛速度。从分析示例中,我们猜出了alpha(n)的最优值,然后使用数值示例来验证我们的猜想。 [参考:20]

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号