首页> 外文期刊>IEICE Transactions on Information and Systems >An Approach to the Piano Mover's Problem Using Hierarchic Reinforcement Learning
【24h】

An Approach to the Piano Mover's Problem Using Hierarchic Reinforcement Learning

机译:基于层次强化学习的钢琴搬家问题研究

获取原文
获取原文并翻译 | 示例
       

摘要

We attempt to achieve corporative behavior of autonomous decentralized agents constructed via Q-Learning, which is a type of reinforcement learning. As such, in the present paper, we examine the piano mover's problem including a find-path problem. We propose a multi-agent architecture that has an external agent and internal agents. Internal agents are homogenous and can communicate with each other. The movement of the external agent depends on the composition of the actions of the internal agents. By learning how to move through the internal agents, avoidance of obstacles by the object is expected. We simulate the proposed method in a two-dimensional continuous world. Results obtained in the present investigation reveal the effectiveness of the proposed method.
机译:我们尝试实现通过Q学习构建的自主分散代理的公司行为,这是一种强化学习。因此,在本文中,我们研究了钢琴搬家工人的问题,包括寻路问题。我们提出了一种具有外部代理和内部代理的多代理架构。内部代理是同质的,可以相互通信。外部行为者的运动取决于内部行为者的行为组成。通过学习如何在内部媒介中移动,可以避免物体的障碍。我们在二维连续世界中模拟提出的方法。在本研究中获得的结果表明了该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号