首页> 外文会议>International Conference on Computational Collective Intelligence >Comparison of Reinforcement and Supervised Learning Methods in Farmer-Pest Problem with Delayed Rewards
【24h】

Comparison of Reinforcement and Supervised Learning Methods in Farmer-Pest Problem with Delayed Rewards

机译:延迟奖励对农民害虫问题的加强和监督学习方法的比较

获取原文

摘要

In this paper we propose a method based on the time-window idea which allows agents to generate their strategy using supervised learning algorithms in environments with delayed rewards. It is universal and can be used in various environments. Learning speed of the proposed method and reinforcement learning algorithm are compared in a Farmer- Pest problem with delayed rewards. Farmer-Pest problem is chosen for the comparison because it is designed especially for learning algorithms benchmarking. It has several dimensions which change environment characteristics and allows to test algorithms in various conditions. This paper presents results for one reinforcement learning method (SARSA) and three supervised learning algorithms (Na?ve Bayes, C4.5 and Ripper). These algorithms are tested on configurations with various complexity.
机译:在本文中,我们提出了一种基于时间窗的想法的方法,该方法允许代理在具有延迟奖励的环境中使用监督学习算法生成策略。它是普遍的,可以在各种环境中使用。在延迟奖励的农业问题中比较了所提出的方法和强化学习算法的学习速度。选择农民害虫问题是为了比较,因为它专为学习算法基准测试而设计。它有几个尺寸改变了环境特征,并允许在各种条件下测试算法。本文提出了一种加强学习方法(SARSA)和三个监督学习算法(NA'VE BAYES,C4.5和RIPPER)的结果。这些算法在具有各种复杂性的配置上进行测试。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号