An Approach to the Piano Mover's Problem Using Hierarchic Reinforcement Learning

Yuko ISHIWAKA; Tomohiro YOSHDA; Hiroshi YOKOI; Yukinori KAKAZU

首页> 外文期刊>IEICE Transactions on Information and Systems >An Approach to the Piano Mover's Problem Using Hierarchic Reinforcement Learning

【24h】

An Approach to the Piano Mover's Problem Using Hierarchic Reinforcement Learning

机译：基于层次强化学习的钢琴搬家问题研究

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We attempt to achieve corporative behavior of autonomous decentralized agents constructed via Q-Learning, which is a type of reinforcement learning. As such, in the present paper, we examine the piano mover's problem including a find-path problem. We propose a multi-agent architecture that has an external agent and internal agents. Internal agents are homogenous and can communicate with each other. The movement of the external agent depends on the composition of the actions of the internal agents. By learning how to move through the internal agents, avoidance of obstacles by the object is expected. We simulate the proposed method in a two-dimensional continuous world. Results obtained in the present investigation reveal the effectiveness of the proposed method.

机译：我们尝试实现通过Q学习构建的自主分散代理的公司行为，这是一种强化学习。因此，在本文中，我们研究了钢琴搬家工人的问题，包括寻路问题。我们提出了一种具有外部代理和内部代理的多代理架构。内部代理是同质的，可以相互通信。外部行为者的运动取决于内部行为者的行为组成。通过学习如何在内部媒介中移动，可以避免物体的障碍。我们在二维连续世界中模拟提出的方法。在本研究中获得的结果表明了该方法的有效性。

著录项

来源
《IEICE Transactions on Information and Systems》 |2004年第8期|p.2106-2113|共8页
作者
Yuko ISHIWAKA; Tomohiro YOSHDA; Hiroshi YOKOI; Yukinori KAKAZU;
展开▼
作者单位

Hakodate Institute of National College of Technology, Hakodate-shi, 042-8501 Japan;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类无线电电子学、电信技术;
关键词
reinforcement learning; piano mover's problem; heterogeneous multi-agent; find-path problem; obstacle avoidance;

机译：强化学习;钢琴演奏者问题;异构多智能体;寻找路径问题;避障;

相似文献

外文文献
中文文献
专利

1. Reinforcement Learning for a New Piano Mover [J] . Yuko Ishiwaka, Tomohiro Yoshida, Yukinori Kakazu Journal of Systemics, Cybernetics and Informatics . 2005,第4期

机译：新钢琴练习者的强化学习
2. Specialization in Hierarchical Learning Systems: A Unified Information-theoretic Approach for Supervised, Unsupervised and Reinforcement Learning [J] . Heinke Hihn, Daniel A. Braun Neural processing letters . 2020,第3期

机译：分层学习系统的专业化：统一的信息 - 监督，无监督和强化学习的理论方法
3. An Efficient Model-Free Approach for Controlling Large-Scale Canals via Hierarchical Reinforcement Learning [J] . Ren Tao, Niu Jianwei, Liu Xuefeng, IEEE transactions on industrial informatics . 2021,第6期

机译：一种有效的无模型方法，用于通过分层加固学习控制大型运河
4. Achieving corporative behavior in heterogeneous agents using hierarchic reinforcement learning - an approach to piano mover's problem [C] . Yuko Ishiwaka, Tomohiro Yoshida, Hiroshi Yokoi, IEEE Interantional Conference on Systems, Man and Cybernetics . 2002

机译：使用等级加强学习实现异构试剂的合作行为 - 一种钢琴搬家问题的方法
5. Learning state and action space hierarchies for reinforcement learning using action -dependent partitioning. [D] . Asadi, Mehran. 2006

机译：使用依赖于动作的分区来学习状态和动作空间层次结构，以进行强化学习。
6. Dual Dynamic Scheduling for Hierarchical QoS in Uplink-NOMA: A Reinforcement Learning Approach [O] . Xiangjun Li, Qimei Cui, Jinli Zhai, 2021

机译：上行链路中的分层QoS的双动态调度：强化学习方法
7. A hierarchical Bayesian approach to assess learning and guessing strategies in reinforcement learning [O] . Jessica Vera Schaaf, Marieke Jepma, Ingmar Visser, 2019

机译：评估加固学习学习和猜测策略的等级贝叶斯方法

An Approach to the Piano Mover's Problem Using Hierarchic Reinforcement Learning

摘要

著录项

相似文献

相关主题

期刊订阅