首页> 外文会议>Joint Workshop on Multi-Agent and Multi-Agent-Based Simulation(MABS 2004) >Toward Guidelines for Modeling Learning Agents in Multiagent-Based Simulation: Implications from Q-Learning and Sarsa Agents
【24h】

Toward Guidelines for Modeling Learning Agents in Multiagent-Based Simulation: Implications from Q-Learning and Sarsa Agents

机译:迈向基于多元素模拟中的学习代理的建议指南:Q-Learning和Sarsa代理商的影响

获取原文

摘要

Abstract. This paper focuses on how simulation results are sensitive to agent modeling in multiagent-based simulation (MABS) and investigates such sensitivity by comparing results where agents have different learning mechanisms, i.e., Q-learning and Sarsa, in the context of reinforcement learning. Through an analysis of simulation results in a bargaining game as one of the canonical examples in game theory, the following implications have been revealed: (1) even a slight difference has an essential influence on simulation results; (2) testing in static and dynamic environments highlights the different tendency of results; and (3) three stages in both Q-learning and Sarsa agents (i.e., (a) competition; (b) cooperation; and (c) learning impossible) are found in the dynamic environment, while no stage is found in the static environment. Prom these three implications, the following very rough guidelines for modeling agents can be derived: (1) cross-element validation for specifying key factors that affect simulation results; (2) a comparison of results between the static and dynamic environments for determining candidates to be investigated in detail; and (3) sensitive analysis for specifying applicable range for learning agents.
机译:抽象的。本文重点研究的模拟结果如何都通过比较结果,其中试剂具有不同的学习机制,即,Q学习和丝兰,在强化学习的上下文中基于多代理仿真(MABS)和调查这种敏感性剂建模敏感。通过在讨价还价博弈的仿真结果的分析,对博弈论的典型例子之一,将产生以下结果已经揭晓:(1)即使是微小的差别对仿真结果的重要影响; (2)在静态和动态环境亮点检测结果的不同倾向;和(3)在两个Q学习和丝兰剂(即,(a)中的竞争;(b)中的合作;及(c)的学习是不可能的)三个阶段在动态环境中发现的,而没有阶段在静态环境中发现。 PROM这三个影响,用于建模剂以下非常粗略的指导方针可以导出:(1)跨元素验证用于指定影响的模拟结果的关键因素; (2)静态和动态环境之间的结果用于确定候选详细待研究的比较;和(3)敏感分析用于确定用于学习剂适用范围。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号