IEEE Transactions on Human-Machine Systems

Human-Centered Reinforcement Learning: A Survey


Abstract

Human-centered reinforcement learning (RL), in which an agent learns how to perform a task from evaluative feedback delivered by a human observer, has become increasingly popular in recent years. The ability of an RL agent to learn from human feedback has broadened its applicability to real-life problems. This paper describes state-of-the-art human-centered RL algorithms and aims to serve as a starting point for researchers who are beginning their work in human-centered RL. Moreover, the objective of this paper is to present a comprehensive survey of the recent breakthroughs in this field and provide references to the most interesting and successful works. After introducing the concepts of RL with environmental rewards, this paper discusses the origins of human-centered RL and its differences from traditional RL. We then describe different interpretations of human evaluative feedback, which have produced many human-centered RL algorithms in the past decade. In addition, we describe research on agents learning from both human evaluative feedback and environmental rewards, as well as on improving the efficiency of human-centered RL. Finally, we conclude with an overview of application areas and a discussion of future work and open questions.
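As a concrete illustration of the core idea described in the abstract, the sketch below shows a minimal TAMER-style agent that fits a tabular model of the human trainer's evaluative feedback and acts greedily on that model instead of on an environmental return. The environment interface, the `get_human_feedback` stand-in for the human observer, and all names are hypothetical assumptions for illustration; this is not the survey's own algorithm.

```python
import numpy as np

class TabularHumanRewardModel:
    """Tabular estimate H(s, a) of the expected human evaluative feedback."""

    def __init__(self, n_states, n_actions, lr=0.1):
        self.h = np.zeros((n_states, n_actions))
        self.lr = lr

    def update(self, state, action, feedback):
        # Move the estimate toward the latest scalar human signal (e.g. -1 / +1).
        self.h[state, action] += self.lr * (feedback - self.h[state, action])

    def greedy_action(self, state):
        # Act to maximize predicted human feedback, not environmental return.
        return int(np.argmax(self.h[state]))


def run_episode(env, model, get_human_feedback, max_steps=100):
    """One interaction episode: act greedily on H, update it from feedback.

    Assumptions: `env` exposes reset() -> state and step(action) ->
    (next_state, env_reward, done); `get_human_feedback(state, action)`
    stands in for the human observer and returns a scalar, or 0 when the
    trainer stays silent.
    """
    state = env.reset()
    for _ in range(max_steps):
        action = model.greedy_action(state)
        next_state, _, done = env.step(action)  # environmental reward ignored here
        feedback = get_human_feedback(state, action)
        if feedback != 0:  # learn only from explicit feedback
            model.update(state, action, feedback)
        state = next_state
        if done:
            break
```

When environmental rewards are also available, one common interpretation surveyed in the paper treats the learned feedback model as a shaping term, so the agent optimizes the environmental reward plus a weighted H(s, a) rather than either signal alone.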