Analyzing the Suitability of Cost Functions for Explaining and Imitating Human Driving Behavior based on Inverse Reinforcement Learning

机译：基于逆向强化学习的成本函数在解释和模仿人类驾驶行为中的适用性分析

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Autonomous vehicles are sharing the road with human drivers. In order to facilitate interactive driving and cooperative behavior in dense traffic, a thorough understanding and representation of other traffic participants' behavior are necessary. Cost functions (or reward functions) have been widely used to describe the behavior of human drivers since they can not only explicitly incorporate the rationality of human drivers and the theory of mind (TOM), but also share similarity with the motion planning problem of autonomous vehicles. Hence, more human-like driving behavior and comprehensible trajectories can be generated to enable safer interaction and cooperation. However, the selection of cost functions in different driving scenarios is not trivial, and there is no systematic summary and analysis for cost function selection and learning from a variety of driving scenarios. In this work, we aim to investigate to what extent cost functions are suitable for explaining and imitating human driving behavior. Further, we focus on how cost functions differ from each other in different driving scenarios. Towards this goal, we first comprehensively review existing cost function structures in literature. Based on that, we point out required conditions for demonstrations to be suitable for inverse reinforcement learning (IRL). Finally, we use IRL to explore suitable features and learn cost function weights from human driven trajectories in three different scenarios.

机译：自动驾驶汽车正在与人类驾驶员共享道路。为了促进在拥挤的交通中的交互驾驶和协作行为，必须全面了解和表示其他交通参与者的行为。成本函数（或报酬函数）已被广泛用于描述人类驾驶员的行为，因为它们不仅可以明确地纳入人类驾驶员的理性和心理理论（TOM），而且与自主运动计划问题有着相似之处汽车。因此，可以生成更多类似人的驾驶行为和可理解的轨迹，以实现更安全的交互和合作。但是，在不同驾驶场景中选择成本函数并不是一件容易的事，并且没有针对成本函数选择和从各种驾驶场景中学习的系统总结和分析。在这项工作中，我们旨在研究成本函数在多大程度上适合于解释和模仿人的驾驶行为。此外，我们关注成本函数在不同驾驶场景中的区别。为了实现这一目标，我们首先全面回顾文献中现有的成本函数结构。在此基础上，我们指出了演示所需的条件，以适合进行逆向强化学习（IRL）。最后，我们使用IRL来探索合适的功能并在三种不同情况下从人类驱动的轨迹中学习成本函数权重。

著录项

来源
《IEEE International Conference on Robotics and Automation》|2020年|5481-5487|共7页
会议地点
作者
Maximilian Naumann; Liting Sun; Wei Zhan; Masayoshi Tomizuka;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Cost function; Trajectory; Vehicles; Roads; Learning (artificial intelligence); Safety; Planning;

机译：成本函数;轨迹;车辆;道路;学习（人工智能）;安全性;计划;

相似文献

外文文献
中文文献
专利

1. Rationally Inattentive Inverse Reinforcement Learning Explains YouTube Commenting Behavior [J] . William Hoiles, Vikram Krishnamurthy, Kunal Pattanayak Journal of machine learning research . 2020,第a期

机译：理性否则逆钢筋学习解释了YouTube评论行为
2. Deep Inverse Reinforcement Learning for Behavior Prediction in Autonomous Driving: Accurate Forecasts of Vehicle Motion [J] . Fernando Tharindu, Denman Simon, Sridharan Sridha, IEEE Signal Processing Magazine . 2021,第1期

机译：自主驾驶行为预测的深度逆钢筋学习：车辆运动准确预测
3. Large-scale cost function learning for path planning using deep inverse reinforcement learning [J] . Wulfmeier Markus, Rao Dushyant, Wang Dominiczeng, The International journal of robotics research . 2017,第10期

机译：使用深度逆强化学习进行路径规划的大规模成本函数学习
4. Predicting driving behavior using inverse reinforcement learning with multiple reward functions towards environmental diversity [C] . Shimosaka Masamichi, Nishi Kentaro, Sato Junichi, IEEE Intelligent Vehicles Symposium . 2015

机译：使用逆向强化学习和对环境多样性的多种奖励功能来预测驾驶行为
5. Explaining Collective Behavior with Dynamical Systems: Spatial Gradient Sensing in Eukaryotic Chemotaxis and Learning Dynamics in Multiagent Reinforcement Learning [D] . Shams, Daniel . 2019

机译：用动力系统解释集体行为：多核化趋化性的空间梯度传感和多核强化学习中的学习动态
6. Dopamine-Mediated Learning and Switching in Cortico-Striatal Circuit Explain Behavioral Changes in Reinforcement Learning [O] . Simon Hong, Okihide Hikosaka 2011

机译：多巴胺介导的学习和皮质-纹状体电路的转换解释了强化学习中的行为变化
7. Driving Behavior Modeling Using Naturalistic Human Driving Data With Inverse Reinforcement Learning [O] . Zhiyu Huang, Jingda Wu, Chen Lv 2021

机译：利用逆钢筋学习的自然主义人类驾驶数据驾驶行为建模

Analyzing the Suitability of Cost Functions for Explaining and Imitating Human Driving Behavior based on Inverse Reinforcement Learning

摘要

著录项

相似文献

相关主题

期刊订阅