IEEE Symposium Series on Computational Intelligence

Apprenticeship Learning for Continuous State Spaces and Actions in a Swarm-Guidance Shepherding Task



Abstract

Apprenticeship learning (AL) is a learning scheme that uses demonstrations collected from human operators. Apprenticeship learning via inverse reinforcement learning (AL via IRL) has been one of the primary candidate approaches for obtaining a near-optimal policy that matches the quality of the human policy. The algorithm works by attempting to recover and approximate the human reward function from the demonstrations. This approach helps overcome limitations such as sensitivity to variance in the quality of human data and short-sighted decision-making that does not consider future states. However, handling continuous action and state spaces remains challenging for AL via IRL algorithms. In this paper, we propose a new AL via IRL approach that works with continuous action and state spaces. We use it to train an artificial intelligence (AI) agent that acts as a shepherd of artificial sheep-inspired swarm agents in a complex and dynamic environment. The results show that the performance of our approach matches that of the human operator, and in particular the agent's movements are smoother and more effective.
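
The abstract's core mechanism, recovering a reward function whose optimal policy matches the demonstrator's behavior, can be illustrated with the classic projection algorithm for AL via IRL (Abbeel & Ng, 2004). Below is a minimal, hypothetical NumPy sketch: the 2-D point environment, the radial-basis feature map, and the sampled-candidate action selection are illustrative stand-ins for continuous state and action spaces, not the paper's actual implementation. Each iteration sets reward weights from the feature-expectation gap, computes a policy under that reward, and projects the expert's feature expectations onto the line between successive estimates.

```python
# Illustrative sketch of apprenticeship learning via IRL (projection method,
# Abbeel & Ng 2004) on a toy continuous state/action problem. All names and
# parameters here are assumptions for demonstration, not the paper's method.
import numpy as np

rng = np.random.default_rng(0)

# Toy continuous environment: state s in R^2, action a in R^2,
# dynamics s' = s + 0.1 * clip(a), implicit goal at the origin.
def step(s, a):
    return s + 0.1 * np.clip(a, -1.0, 1.0)

# Radial-basis features phi(s) over a 3x3 grid of centers in [-1, 1]^2.
centers = np.array([[x, y] for x in (-1, 0, 1) for y in (-1, 0, 1)], float)
def phi(s):
    return np.exp(-np.sum((centers - s) ** 2, axis=1) / 0.5)

GAMMA, HORIZON = 0.95, 40

def feature_expectations(policy, n_rollouts=30):
    """Monte-Carlo estimate of mu(pi) = E[sum_t gamma^t * phi(s_t)]."""
    mu = np.zeros(len(centers))
    for _ in range(n_rollouts):
        s = rng.uniform(-1, 1, size=2)
        for t in range(HORIZON):
            mu += GAMMA ** t * phi(s) / n_rollouts
            s = step(s, policy(s))
    return mu

# Stand-in "expert": a proportional controller toward the goal, playing the
# role of the human operator's demonstrations.
expert = lambda s: -2.0 * s
mu_expert = feature_expectations(expert)

def greedy_policy(w):
    """Continuous-action policy: sample candidate actions and pick the one
    whose next state scores highest under the reward r(s) = w . phi(s)."""
    def pi(s):
        cands = rng.uniform(-1, 1, size=(16, 2))
        scores = [w @ phi(step(s, a)) for a in cands]
        return cands[int(np.argmax(scores))]
    return pi

# Projection algorithm: iteratively shrink the gap to the expert's
# feature expectations; w is the recovered reward-weight estimate.
mu_bar = feature_expectations(greedy_policy(rng.standard_normal(len(centers))))
for i in range(15):
    w = mu_expert - mu_bar                      # reward weights from the gap
    mu_i = feature_expectations(greedy_policy(w))
    d = mu_i - mu_bar                           # project mu_expert onto the
    mu_bar = mu_bar + d * (d @ (mu_expert - mu_bar)) / (d @ d + 1e-12)
    print(f"iter {i}: ||mu_expert - mu_bar|| = "
          f"{np.linalg.norm(mu_expert - mu_bar):.4f}")
```

The printed distance bounds how far the learned policy's value can fall below the expert's, which is what makes feature-expectation matching a proxy for "as good as the human policy" in the sense the abstract describes.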
