首页> 外文会议>IEEE Symposium Series on Computational Intelligence >Model-based Empowerment Computation for Dynamical Agents
【24h】

Model-based Empowerment Computation for Dynamical Agents

机译:基于模型的动态Agent授权计算

获取原文

摘要

Empowerment is defined as the channel capacity between action sequences and sensor information and has been studied as a kind of intrinsic rewards for survival. It is based on the mutual information conditioned by a current state, but generally calculating it needs heavy computation. This paper points out weak points of the previous methods to compute empowerment and proposes an improved method for sensorimotor environments, where their states are observed as continuous values, under the model-based setting. Then, we introduce a planning problem, whose goal is to maximize cumulative empowerment value on the articulated robots, and the behaviors after training are discussed while referring to other theories of intrinsic rewards and the control theory.
机译:授权被定义为动作序列和传感器信息之间的通道容量,已经被研究为生存的一种内在奖励。它基于以当前状态为条件的互信息,但是通常计算时需要大量计算。本文指出了先前计算能力的方法的弱点,并提出了一种针对感觉运动环境的改进方法,在基于模型的设置下,感觉运动状态被视为连续值。然后,我们提出了一个计划问题,其目标是最大化铰接式机器人的累积授权价值,并在参考其他内在奖励理论和控制理论的基础上讨论了训练后的行为。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号