首页> 外文会议>International Conference on Intelligent Transportation Systems >POMDP and Hierarchical Options MDP with Continuous Actions for Autonomous Driving at Intersections
【24h】

POMDP and Hierarchical Options MDP with Continuous Actions for Autonomous Driving at Intersections

机译:具有连续动作的POMDP和分层选项MDP,用于交叉口的自动驾驶

获取原文

摘要

When applying autonomous driving technology to real-world scenarios, environmental uncertainties make the development of decision-making algorithms difficult. Modeling the problem as a Partially Observable Markov Decision Process (POMDP) [1] allows the algorithm to consider these uncertainties in the decision process, which makes it more robust to real sensor characteristics. However, solving the POMDP with reinforcement learning (RL) [2] often requires storing a large number of observations. Furthermore, for continuous action spaces, the system is computationally inefficient. This paper addresses these problems by proposing to model the problem as an MDP and learn a policy with RL using hierarchical options (HOMDP). The suggested algorithm can store the state-action pairs and only uses current observations to solve a POMDP problem. We compare the results of to the time-to-collision method [3] and the proposed POMDP-with-LSTM method. Our results show that the HOMDP approach is able to improve the performance of the agent for a four-way intersection task with two-way stop signs. The HOMDP method can generate both higher-level discrete options and lower-level continuous actions with only the observations of the current step.
机译:当将自动驾驶技术应用于现实情况时,环境的不确定性使决策算法的开发变得困难。通过将问题建模为部分可观察的马尔可夫决策过程(POMDP)[1],该算法可以在决策过程中考虑这些不确定性,从而使其对实际传感器的特性更加稳健。但是,用强化学习(RL)[2]解决POMDP通常需要存储大量观察值。此外,对于连续的动作空间,该系统在计算上效率低下。本文通过提出将问题建模为MDP并使用分层选项(HOMDP)学习带有RL的策略来解决这些问题。所提出的算法可以存储状态-动作对,并且仅使用当前的观测值来解决POMDP问题。我们将碰撞时间方法[3]的结果与提出的带有LSTM的POMDP-with-LSTM方法进行了比较。我们的结果表明,HOMDP方法能够提高具有双向停车标志的四向交叉路口任务的智能体性能。仅通过对当前步骤的观察,HOMDP方法可以生成较高级别的离散选项和较低级别的连续操作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号