首页> 外文期刊>Journal of Zhejiang university science >Modified reward function on abstract features in inverse reinforcement learning
【24h】

Modified reward function on abstract features in inverse reinforcement learning

机译:逆强化学习中对抽象特征的修正奖励函数

获取原文
           

摘要

We improve inverse reinforcement learning (IRL) by applying dimension reduction methods to automatically extract abstract features from human-demonstrated policies, to deal with the cases where features are either unknown or numerous. The importance rating of each abstract feature is incorporated into the reward function. Simulation is performed on a task of driving in a five-lane highway, where the controlled car has the largest fixed speed among all the cars. Performance is almost 10.6% better on average with than without importance ratings.
机译:我们通过应用降维方法自动从人类演示的策略中提取抽象特征,以处理特征未知或大量的情况,从而改进了逆强化学习(IRL)。每个抽象特征的重要性等级都包含在奖励函数中。仿真是在五车道高速公路上行驶的任务上执行的,在该车道中,受控汽车在所有汽车中具有最大的固定速度。与没有重要性等级相比,性能平均提高了近10.6%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号