首页> 外文会议>Intelligent Robots and Systems, 2005. (IROS 2005). 2005 IEEE/RSJ International Conference on >Simultaneous learning to acquire competitive behaviors in multi-agent system based on a modular learning system
【24h】

Simultaneous learning to acquire competitive behaviors in multi-agent system based on a modular learning system

机译:基于模块化学习系统的多主体系统同时学习获取竞争行为

获取原文

摘要

Existing reinforcement learning approaches have been suffering from the policy alternation of others in multiagent dynamic environments. A typical example is the case of RoboCup competitions because other agent behaviors may cause sudden changes in state transition probabilities in which constancy is needed for the learning to converge. The keys for simultaneous learning to acquire competitive behaviors in such an environment are: a modular learning system for adaptation to the policy alternation of others; and an introduction of macro actions for simultaneous learning to reduce the search space. This paper presents a method of modular learning in a multiagent environment in which the learning agents can simultaneously learn their behaviors and adapt themselves to the situations as a consequence of the others' behaviors.
机译:在多主体动态环境中,现有的强化学习方法一直在遭受其他策略的替代。一个典型的例子是RoboCup竞赛,因为其他代理行为可能会导致状态转换概率的突然变化,而学习收敛需要恒定性。在这样的环境中,同时学习以获取竞争行为的关键是:模块化的学习系统,以适应其他人的政策变化;并介绍了用于同时学习的宏操作,以减少搜索空间。本文提出了一种在多主体环境中进行模块化学习的方法,在这种方法中,学习主体可以同时学习自己的行为,并使自己适应其他人行为的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号