International Joint Conference on Neural Networks

Cyber-Human Approach For Learning Human Intention And Shape Robotic Behavior Based On Task Demonstration



Abstract

Recent developments in artificial intelligence have enabled the training of autonomous robots without human supervision. Even so, current models still depend on hand-engineered reward signals and carry no guarantee of matching human expectations or of operating within safety bounds. This paper proposes CyberSteer, which leverages human-robot interaction to align the goals of humans and intelligent robotic agents. From human demonstrations of a task, CyberSteer learns the intrinsic reward function that the human demonstrator pursues. The learned intrinsic function then shapes robotic behavior during training with deep reinforcement learning algorithms, removing the need for an environment-dependent or hand-engineered reward signal. Two hypotheses were tested, both seeded with demonstrations of the task or desired behavior from non-expert human operators: one trains a deep neural network to classify behavior as human-like, and the other trains a behavior-cloning deep neural network to suggest actions. CyberSteer was evaluated in Microsoft AirSim, a high-fidelity unmanned-air-system simulation environment, where a simulated aerial robot used forward-looking depth sensing to avoid collisions while flying through a cluttered forest. CyberSteer's performance is compared with behavior-cloning algorithms and with reinforcement learning algorithms guided by handcrafted reward functions. Results show that the human-learned intrinsic reward function can shape the behavior of robotic systems and guides reinforcement learning algorithms to better task performance than standard hand-crafted reward functions.
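The first hypothesis described above, training a classifier to score state-action pairs as human-like and using that score as an intrinsic reward, can be sketched roughly as follows. This is a minimal toy illustration, not the paper's method: it substitutes a logistic-regression classifier and synthetic "demonstrations" for the deep neural networks and AirSim data the paper actually uses, and all function names and feature choices here are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def features(state, action):
    # State, action, and state-action cross terms; the cross terms let a
    # linear model detect whether the action correlates with the state.
    return np.concatenate([state, action, state[:2] * action])

def make_data(n=500):
    # Hypothetical demonstrations: the "human" policy steers as a smooth
    # function of the state; negatives pair the same states with random actions.
    states = rng.normal(size=(n, 4))
    human_actions = np.tanh(states[:, :2])
    random_actions = rng.uniform(-1.0, 1.0, size=(n, 2))
    X = np.vstack([
        np.array([features(s, a) for s, a in zip(states, human_actions)]),
        np.array([features(s, a) for s, a in zip(states, random_actions)]),
    ])
    y = np.concatenate([np.ones(n), np.zeros(n)])
    return X, y

def train_classifier(X, y, lr=0.1, epochs=300):
    # Plain logistic regression trained by batch gradient descent
    # (a stand-in for the paper's deep "human-like behavior" classifier).
    w, b = np.zeros(X.shape[1]), 0.0
    for _ in range(epochs):
        p = 1.0 / (1.0 + np.exp(-(X @ w + b)))
        w -= lr * (X.T @ (p - y)) / len(y)
        b -= lr * np.mean(p - y)
    return w, b

def intrinsic_reward(w, b, state, action):
    # The probability that a state-action pair is "human-like" serves as
    # the intrinsic reward, in place of a hand-engineered reward signal.
    x = features(state, action)
    return 1.0 / (1.0 + np.exp(-(x @ w + b)))

X, y = make_data()
w, b = train_classifier(X, y)
```

In a full pipeline, `intrinsic_reward` would be queried inside the reinforcement learning loop in place of the environment's hand-crafted reward, so the agent is rewarded for acting the way the demonstrator did rather than for satisfying a manually tuned objective.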
