首页> 外文期刊>Robotics and Autonomous Systems >Combining reinforcement learning with rule-based controllers for transparent and general decision-making in autonomous driving
【24h】

Combining reinforcement learning with rule-based controllers for transparent and general decision-making in autonomous driving

机译:基于规则的控制器结合强化学习,在自主驾驶中透明和一般决策

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

The design of high-level decision-making systems is a topical problem in the field of autonomous driving. In this paper, we combine traditional rule-based strategies and reinforcement learning (RL) with the goal of achieving transparency and robustness. On the one hand, the use of handcrafted rule-based controllers allows for transparency, i.e., it is always possible to determine why a given decision was made, but they struggle to scale to complex driving scenarios, in which several objectives need to be considered. On the other hand, black-box RL approaches enable us to deal with more complex scenarios, but they are usually hardly interpretable. In this paper, we combine the best properties of these two worlds by designing parametric rule-based controllers, in which interpretable rules can be provided by domain experts and their parameters are learned via RL. After illustrating how to apply parameter-based RL methods (PGPE) to this setting, we present extensive numerical simulations in the highway and in two urban scenarios: intersection and roundabout. For each scenario, we show the formalization as an RL problem and we discuss the results of our approach in comparison with handcrafted rule-based controllers and black-box RL techniques. (C) 2020 Elsevier B.V. All rights reserved.
机译:高级决策系统的设计是自主驾驶领域的局部问题。在本文中,我们将基于规则的战略和强化学习(RL)结合起来,实现了实现透明度和稳健性的目标。一方面,使用手工制定的规则的控制器允许透明度,即,始终可以确定为什么制作给定的决定,但他们努力扩展到复杂的驾驶场景,其中需要考虑几个目标。另一方面,黑匣子RL方法使我们能够处理更复杂的情景,但通常几乎不可解决。在本文中,我们通过设计基于参数规则的控制器来结合这两个世界的最佳特性,其中可以通过域专家提供可解释规则,并且通过RL学习其参数。在说明如何将基于参数的RL方法(PGPE)应用于此设置之后,我们在高速公路和两种城市情景中呈现广泛的数值模拟:交叉路口和环形交通枢纽。对于每个场景,我们将正规化作为RL问题显示,我们与手工规则的控制器和黑盒RL技术相比,我们讨论了我们的方法结果。 (c)2020 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号