首页> 外文会议>IEEE International Conference on Computer Science and Automation Engineering >The measurement of strategy convergence for reinforcement learning in discrete state space
【24h】

The measurement of strategy convergence for reinforcement learning in discrete state space

机译:离散状态空间钢筋学习策略融合的测量

获取原文

摘要

The concept of entropy is introduced into reinforcement learning. The definitions of the local strategy entropy and global strategy entropy are proposed. The global strategy entropy is proved to be the quantitative problem-independent measurement of the learning progress, i.e. the convergence degree of the strategy. To improve the learning performance, reinforcement learning with self-adaptive learning rate is proposed based on the strategy entropy. The experimental results show that learning based on the local strategy entropy has better learning performance than those with fixed learning rates.
机译:熵的概念被引入加强学习。 提出了地方战略熵和全球战略熵的定义。 被证明,全球战略熵是对学习进度的定量问题无关的测量,即战略的收敛程度。 为了提高学习性能,基于战略熵提出了利用自适应学习率的加固学习。 实验结果表明,基于本地战略熵的学习比具有固定学习率的学习性能更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号