首页> 外文会议>IEEE International Conference on Development and Learning and Epigenetic Robotics >What good are actions? Accelerating learning using learned action priors
【24h】

What good are actions? Accelerating learning using learned action priors

机译:什么好的行动?使用学习的动作前瞻加速学习

获取原文

摘要

The computational complexity of learning in sequential decision problems grows exponentially with the number of actions available to the agent at each state. We present a method for accelerating this process by learning action priors that express the usefulness of each action in each state. These are learned from a set of different optimal policies from many tasks in the same state space, and are used to bias exploration away from less useful actions. This is shown to improve performance for tasks in the same domain but with different goals. We extend our method to base action priors on perceptual cues rather than absolute states, allowing the transfer of these priors between tasks with differing state spaces and transition functions, and demonstrate experimentally the advantages of learning with action priors in a reinforcement learning context.
机译:顺序决策问题的学习的计算复杂性以每个状态的代理可用的动作数量呈指数级增长。我们介绍了一种通过学习表达每个州中每个动作的有用性的动作前沿加速这一过程的方法。这些来自来自同一状态空间中的许多任务的一组不同的最佳策略,并用于偏离较少的探索。这被证明可以提高同一域中的任务的性能,但具有不同的目标。我们将我们的方法扩展到对感知线索而不是绝对状态的基础动作前导者,允许在具有不同状态空间和转换功能的任务之间转移这些前瞻,并在实验上展示在加强学习背景下用动作前导者学习的优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号