IEEE Symposium Series on Computational Intelligence

Apprenticeship Learning for Continuous State Spaces and Actions in a Swarm-Guidance Shepherding Task



Abstract

Apprenticeship learning (AL) is a learning scheme that uses demonstrations collected from human operators. Apprenticeship learning via inverse reinforcement learning (AL via IRL) has been one of the primary candidate approaches for obtaining a near-optimal policy that performs as well as the human's. The algorithm works by attempting to recover and approximate the human reward function from the demonstrations. This approach helps overcome limitations such as sensitivity to variance in the quality of human data and the short-sighted decision horizon that does not consider future states. However, handling continuous action and state spaces remains challenging for AL via IRL algorithms. In this paper, we propose a new AL via IRL approach that is able to work with continuous action and state spaces. Our approach is used to train an artificial intelligence (AI) agent acting as a shepherd of artificial sheep-inspired swarm agents in a complex and dynamic environment. The results show that the performance of our approach matches that of the human operator, and in particular, the agent's movements are smoother and more effective.
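To make the "recover the human reward function from demonstrations" step concrete, here is a minimal sketch of the feature-expectation matching idea underlying classic AL via IRL (in the style of Abbeel and Ng's projection method). All function and variable names are illustrative assumptions; the paper's actual algorithm for continuous state and action spaces differs from this discrete-feature sketch.

```python
import numpy as np

def feature_expectations(trajectories, phi, gamma=0.95):
    """Discounted average of feature vectors phi(s) over demonstration
    trajectories (each trajectory is a sequence of states)."""
    mu = None
    for traj in trajectories:
        acc = sum((gamma ** t) * phi(s) for t, s in enumerate(traj))
        mu = acc if mu is None else mu + acc
    return mu / len(trajectories)

def projection_step(mu_expert, mu_agent, mu_bar_prev):
    """One projection update: move the running estimate mu_bar toward the
    expert's feature expectations along the direction of the current
    agent's feature expectations. Returns candidate reward weights w
    (so R(s) = w . phi(s)), the margin t (stop when t is small), and
    the updated mu_bar."""
    d = mu_agent - mu_bar_prev
    den = d @ d
    if den > 0:
        num = d @ (mu_expert - mu_bar_prev)
        mu_bar = mu_bar_prev + (num / den) * d
    else:
        mu_bar = mu_bar_prev
    w = mu_expert - mu_bar
    t = np.linalg.norm(w)
    return w, t, mu_bar
```

In a full loop, one would alternate between computing reward weights `w` with `projection_step` and training a policy (here, the shepherding agent) against the reward `w . phi(s)` until the margin `t` falls below a threshold.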
