Inverse Risk-Sensitive Reinforcement Learning

Ratliff Lillian J.; Mazumdar Eric

首页> 外文期刊>IEEE Transactions on Automatic Control >Inverse Risk-Sensitive Reinforcement Learning

【24h】

Inverse Risk-Sensitive Reinforcement Learning

机译：反向风险敏感的强化学习

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This work addresses the problem of inverse reinforcement learning in Markov decision processes where the decision-making agent is risk-sensitive. In particular, a risk-sensitive reinforcement learning algorithm with convergence guarantees that makes use of coherent risk metrics and models of human decision-making which have their origins in behavioral psychology and economics is presented. The risk-sensitive reinforcement learning algorithm provides the theoretical underpinning for a gradient-based inverse reinforcement learning algorithm that seeks to minimize a loss function defined on the observed behavior. It is shown that the gradient of the loss function with respect to the model parameters is well defined and computable via a contraction map argument. Evaluation of the proposed technique is performed on a Grid World example, a canonical benchmark problem.

机译：这项工作解决了在马尔可夫决策过程中逆钢筋的问题，其中决策代理<斜体>风险敏感。特别是，提供了一种具有会聚的风险敏感的强化学习算法，其利用具有它们起源于行为心理和经济学的人类决策的相干风险指标和模型。风险敏感的加强学习算法提供了基于梯度的逆钢筋学习算法的理论基础，其寻求最小化在观察到的行为上定义的损耗函数。结果表明，通过收缩图参数，损耗函数的损耗功能的梯度是很好的定义和可计算的。对所提出的技术的评估是对<斜斜体>网格世界示例进行的，是一个规范基准问题。

著录项

来源
《IEEE Transactions on Automatic Control》 |2020年第3期|1256-1263|共8页
作者
Ratliff Lillian J.; Mazumdar Eric;
展开▼
作者单位

Univ Washington Dept Elect Engn Seattle WA 98195 USA;

Univ Calif Berkeley Dept Elect Engn & Comp Sci Berkeley CA 94720 USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Autonomous systems; Markov processes; optimization; reinforcement learning;

机译：自治系统;马尔可夫进程;优化;加强学习;

相似文献

外文文献
中文文献
专利

1. Risk-sensitive inverse reinforcement learning via semi- and non-parametric methods [J] . Singh Sumeet, Lacotte Jonathan, Majumdar Anirudha, The International journal of robotics research . 2018,第13a14期

机译：通过半参数和非参数方法进行风险敏感的逆向强化学习
2. Advanced planning for autonomous vehicles using reinforcement learning and deep inverse reinforcement learning [J] . You Changxi, Lu Jianbo, Filev Dimitar, Robotics and Autonomous Systems . 2019,第期

机译：利用强化学习和深度逆钢筋学习的自治车辆先进规划
3. Risk-Sensitive Reinforcement Learning [J] . Shen Y, Tobia M, Sommer T, Neural computation . 2014,第7期

机译：风险敏感强化学习
4. Gradient-based inverse risk-sensitive reinforcement learning [C] . Eric Mazumdar, Lillian J. Ratliff, Tanner Fiez, IEEE Annual Conference on Decision and Control . 2017

机译：基于梯度的逆风险敏感强化学习
5. Min-Max Inverse Reinforcement Learning for Learning Bi-Modal Dialogue Policies [D] . Patil, Gandharv. 2020

机译：用于学习双模对话策略的最大最大逆钢筋学习
6. Neural Prediction Errors Reveal a Risk-Sensitive Reinforcement-Learning Process in the Human Brain [O] . Yael Niv, Jeffrey A. Edlund, Peter Dayan, 2012

机译：神经预测错误揭示了人脑中风险敏感的强化学习过程
7. Risk-sensitive Inverse Reinforcement Learning via Semi- and Non-Parametric Methods [O] . Singh, Sumeet, Lacotte, Jonathan, Majumdar, Anirudha, 2017

机译：半经济和风险敏感的逆向强化学习非参数方法

Inverse Risk-Sensitive Reinforcement Learning

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅