首页> 外文会议>International conference on development and learning >Simultaneous Learning to Acquire Competitive Behaviors in Multi-agent System Based on Modular Learning System
【24h】

Simultaneous Learning to Acquire Competitive Behaviors in Multi-agent System Based on Modular Learning System

机译:同时学习基于模块化学习系统获取多智能体系中的竞争行为

获取原文

摘要

The existing reinforcement learning approaches have been suffering from the policy alternation of others in multiagent dynamic environments. A typical example is a case of RoboCup competitions since other agent behaviors may cause sudden changes in state transition probabilities of which constancy is needed for the learning to converge. The keys for simultaneous learning to acquire competitive behaviors in such an environment are- a modular learning system for adaptation to the policy alternation of others, and- an introduction of macro actions for simultaneous learning to reduce the search space.This paper presents a method of modular learning in a multiagent environment, by which the learning agents can simultaneously learn their behaviors and adapt themselves to the situations as consequences of the others' behaviors.
机译:现有的强化学习方法遭受多元动态环境中其他人的政策交替。典型的例子是Robocup竞争的情况,因为其他代理行为可能导致国家转换概率突然变化,所以学习融合需要持态。同时学习在这种环境中获取竞争行为的键是一种模块化学习系统,用于适应他人的政策交替,以及引入宏动作以同时学习减少搜索空间。这篇论文提出了一种方法模块化学习在多层环境中,学习代理商可以同时学习其行为,并作为其他人行为的后果调整到这种情况。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号