首页> 外文会议>International Conference on Computational Collective Intelligence >Comparison of Reinforcement and Supervised Learning Methods in Farmer-Pest Problem with Delayed Rewards

【24h】

Comparison of Reinforcement and Supervised Learning Methods in Farmer-Pest Problem with Delayed Rewards

机译：延迟奖励对农民害虫问题的加强和监督学习方法的比较

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we propose a method based on the time-window idea which allows agents to generate their strategy using supervised learning algorithms in environments with delayed rewards. It is universal and can be used in various environments. Learning speed of the proposed method and reinforcement learning algorithm are compared in a Farmer- Pest problem with delayed rewards. Farmer-Pest problem is chosen for the comparison because it is designed especially for learning algorithms benchmarking. It has several dimensions which change environment characteristics and allows to test algorithms in various conditions. This paper presents results for one reinforcement learning method (SARSA) and three supervised learning algorithms (Na?ve Bayes, C4.5 and Ripper). These algorithms are tested on configurations with various complexity.

机译：在本文中，我们提出了一种基于时间窗的想法的方法，该方法允许代理在具有延迟奖励的环境中使用监督学习算法生成策略。它是普遍的，可以在各种环境中使用。在延迟奖励的农业问题中比较了所提出的方法和强化学习算法的学习速度。选择农民害虫问题是为了比较，因为它专为学习算法基准测试而设计。它有几个尺寸改变了环境特征，并允许在各种条件下测试算法。本文提出了一种加强学习方法（SARSA）和三个监督学习算法（NA'VE BAYES，C4.5和RIPPER）的结果。这些算法在具有各种复杂性的配置上进行测试。

著录项

来源
《International Conference on Computational Collective Intelligence 》|2013年||共10页
会议地点
作者
Bart?omiej ?nie?yński;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 73.814083;
关键词
agent learning; supervised learning; reinforcement learning.;

机译：代理学习;监督学习;加强学习。;

相似文献

外文文献
中文文献
专利

1. Comparison of strategy learning methods in Farmer-Pest problem for various complexity environments without delays [J] . Bartlomiej Sniezynski, Jacek Dajda Journal of computational science . 2013 ,第3期

机译：各种复杂环境下农虫问题策略学习方法的比较
2. International Conference on Computational Science, ICCS 2011 Farmer-Pest Problem: A Multidimensional Problem Domain for Comparison of Agent Learning Methods [J] . Bart?omiej ?nie?yński, Jacek Dajda, Marcin Mlostek, Procedia Computer Science . 2011 ,第1期

机译：国际计算科学大会，ICCS 2011农夫-害虫问题：用于代理学习方法比较的多维问题域
3. SWIRL: A sequential windowed inverse reinforcement learning algorithm for robot tasks with delayed rewards [J] . Krishnan Sanjay, Garg Animesh, Liaw Richard, The International journal of robotics research . 2019 ,第2a3期

机译：SWIRL：顺序窗口逆强化学习算法，用于延迟奖励的机器人任务
4. Comparison of Reinforcement and Supervised Learning Methods in Farmer-Pest Problem with Delayed Rewards [C] . Bartlomiej Sniezynski International conference on computational collective intelligence . 2013

机译：延误农虫问题中强化学习与监督学习方法的比较
5. Training a Neural Network to Construct Sentences from an Inputted Word List: A Comparison Between Supervised and Reinforcement Learning Methods [D] . Black, Samuel 2018

机译：训练神经网络以从输入的单词列表构建句子：监督学习和强化学习方法之间的比较
6. Immediate reinforcement in delayed reward learning in pigeons [O] . Janet Winter, Charles C. Perkins 1982

机译：立即加强鸽子延迟奖励学习
7. A Comparison Of Supervised And Reinforcement Learning Methods On A Reinforcement Learning Task [O] . Vijaykumar Gullapalli 1992

机译：强化学习任务中监督学习和强化学习方法的比较
8. Learning from Noisy and Delayed Rewards: The Value of Reinforcement Learning to Defense Modeling and Simulation. [R] . Alt, J. K. 2012

机译：学习嘈杂和延迟奖励：强化学习对国防建模和仿真的价值。

Comparison of Reinforcement and Supervised Learning Methods in Farmer-Pest Problem with Delayed Rewards

摘要

著录项

相似文献

相关主题

期刊订阅