Modified reward function on abstract features in inverse reinforcement learning

Shen-yi?Chen; Hui?Qian; Jia?Fan; Zhuo-jun?Jin; Miao-liang?Zhu

首页> 外文期刊>Journal of Zhejiang university science >Modified reward function on abstract features in inverse reinforcement learning

【24h】

Modified reward function on abstract features in inverse reinforcement learning

机译：逆强化学习中对抽象特征的修正奖励函数

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We improve inverse reinforcement learning (IRL) by applying dimension reduction methods to automatically extract abstract features from human-demonstrated policies, to deal with the cases where features are either unknown or numerous. The importance rating of each abstract feature is incorporated into the reward function. Simulation is performed on a task of driving in a five-lane highway, where the controlled car has the largest fixed speed among all the cars. Performance is almost 10.6% better on average with than without importance ratings.

机译：我们通过应用降维方法自动从人类演示的策略中提取抽象特征，以处理特征未知或大量的情况，从而改进了逆强化学习（IRL）。每个抽象特征的重要性等级都包含在奖励函数中。仿真是在五车道高速公路上行驶的任务上执行的，在该车道中，受控汽车在所有汽车中具有最大的固定速度。与没有重要性等级相比，性能平均提高了近10.6％。

著录项

来源
《Journal of Zhejiang university science》 |2010年第9期|共6页
作者
Shen-yi?Chen; Hui?Qian; Jia?Fan; Zhuo-jun?Jin; Miao-liang?Zhu;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Modified reward function on abstract features in inverse reinforcement learning [J] . Shen-yi CHEN, Hui QIAN, Jia FAN, 浙江大学学报（英文版）（C辑：计算机与电子） . 2010,第009期

机译：逆强化学习中对抽象特征的修正奖励函数
2. SWIRL: A sequential windowed inverse reinforcement learning algorithm for robot tasks with delayed rewards [J] . Krishnan Sanjay, Garg Animesh, Liaw Richard, The International journal of robotics research . 2019,第2a3期

机译：SWIRL：顺序窗口逆强化学习算法，用于延迟奖励的机器人任务
3. An investor sentiment reward-based trading system using Gaussian inverse reinforcement learning algorithm [J] . Yang Steve Y., Yu Yangyang, Almandi Saud Expert Systems with Application . 2018,第DECa期

机译：基于高斯逆强化学习算法的基于投资者情绪回报的交易系统
4. A Modified Average Reward Reinforcement Learning Based on Fuzzy Reward Function [C] . Zhenkun Zhai, Wei Chen, Xiong Li, International MultiConference of Engineers and Computer Scientists . 2009

机译：基于模糊奖励功能的修改平均奖励强化学习
5. Deep Reinforcement Learning with Accelerated Reward Function Technique for Robotics Task Planning [D] . Shaikh, Shifa. 2021

机译：机器人任务规划加速奖励功能技术的深增强学习
6. Reinforcement Q-Learning Control With Reward Shaping Function for Swing Phase Control in a Semi-active Prosthetic Knee [O] . Yonatan Hutabarat, Kittipong Ekkachai, Mitsuhiro Hayashibe, 2020

机译：增强Q学习控制在半主动假肢膝关节中为摆动相位控制的奖励塑造功能
7. Active Learning for Reward Estimation in Inverse Reinforcement Learning [O] . Manuel Lopes, Francisco Melo, Luis Montesano 2009

机译：主动学习在逆向强化学习中的奖励估算

Modified reward function on abstract features in inverse reinforcement learning

摘要

著录项

相似文献

相关主题

期刊订阅