An Adaptive Approach for the Exploration-Exploitation Dilemma for Learning Agents

Abstract

Learning agents must deal with the exploration-exploitation dilemma. Choosing between exploration and exploitation is very difficult in dynamic systems, particularly in large-scale ones such as economic systems. Recent research shows that this problem has neither an optimal nor a unique solution. In this paper, we propose an adaptive approach based on meta-rules that adjusts the choice between exploration and exploitation. This new approach relies on variations in the agents' performance. To validate it, we apply it to economic systems and compare it with two adaptive methods, one local and one global. Both methods were originally proposed by Wilson; we adapt them here to economic systems. Moreover, we compare different exploration strategies and focus on their influence on the agents' performance.
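
The abstract does not spell out the meta-rules themselves, so the following is only a minimal sketch of the general idea: an exploration rate that is adjusted according to variations in an agent's measured performance. Everything in it is an assumption made for illustration, not the paper's method: the epsilon-greedy agent, the function names `adapt_epsilon`, `choose_action` and `run`, the step size, the bounds on epsilon, the window length, and the toy payoff environment whose best action changes halfway through.

```python
import random


def adapt_epsilon(epsilon, prev_perf, curr_perf, step=0.05, lo=0.01, hi=0.5, tol=1e-9):
    """Hypothetical meta-rule (not taken from the paper):
    explore more when performance drops, explore less when it improves."""
    delta = curr_perf - prev_perf
    if delta < -tol:      # performance fell: the environment may have changed
        epsilon += step
    elif delta > tol:     # performance rose: keep exploiting what works
        epsilon -= step
    return min(hi, max(lo, epsilon))


def choose_action(q_values, epsilon, rng):
    """Epsilon-greedy choice: random action with probability epsilon, else greedy."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=q_values.__getitem__)


def run(payoffs_before, payoffs_after, switch_at=500, steps=1000,
        window=50, alpha=0.1, seed=0):
    """Toy dynamic environment (an assumption): mean payoffs switch at `switch_at`.
    Returns the epsilon value recorded at the end of each performance window."""
    rng = random.Random(seed)
    q = [0.0] * len(payoffs_before)
    epsilon = 0.2
    prev_avg = None
    window_sum = 0.0
    history = []
    for t in range(steps):
        payoffs = payoffs_before if t < switch_at else payoffs_after
        action = choose_action(q, epsilon, rng)
        reward = payoffs[action] + rng.gauss(0.0, 0.1)
        # Constant step size so that estimates can track a changing environment.
        q[action] += alpha * (reward - q[action])
        window_sum += reward
        if (t + 1) % window == 0:
            avg = window_sum / window
            if prev_avg is not None:
                epsilon = adapt_epsilon(epsilon, prev_avg, avg)
            prev_avg = avg
            window_sum = 0.0
            history.append(epsilon)
    return history


if __name__ == "__main__":
    eps_history = run([1.0, 0.2, 0.1], [0.1, 0.2, 1.0])
    print([round(e, 2) for e in eps_history])
```

In this sketch the rule is "local" only in the loose sense that each agent adjusts its own exploration rate from its own recent payoff; a global variant would instead compare performance aggregated over all agents. How the paper's meta-rules actually combine such signals is not stated in the abstract.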
