首页> 外文会议>International Conference on Intelligent Transportation Systems >POMDP and Hierarchical Options MDP with Continuous Actions for Autonomous Driving at Intersections
【24h】

POMDP and Hierarchical Options MDP with Continuous Actions for Autonomous Driving at Intersections

机译:POMDP和分层选项MDP,用于在交叉点处的自主驾驶的连续动作

获取原文

摘要

When applying autonomous driving technology to real-world scenarios, environmental uncertainties make the development of decision-making algorithms difficult. Modeling the problem as a Partially Observable Markov Decision Process (POMDP) [1] allows the algorithm to consider these uncertainties in the decision process, which makes it more robust to real sensor characteristics. However, solving the POMDP with reinforcement learning (RL) [2] often requires storing a large number of observations. Furthermore, for continuous action spaces, the system is computationally inefficient. This paper addresses these problems by proposing to model the problem as an MDP and learn a policy with RL using hierarchical options (HOMDP). The suggested algorithm can store the state-action pairs and only uses current observations to solve a POMDP problem. We compare the results of to the time-to-collision method [3] and the proposed POMDP-with-LSTM method. Our results show that the HOMDP approach is able to improve the performance of the agent for a four-way intersection task with two-way stop signs. The HOMDP method can generate both higher-level discrete options and lower-level continuous actions with only the observations of the current step.
机译:在将自主驾驶技术应用于现实世界的情景时,环境不确定性使得决策算法的发展变得困难。将问题建模为部分可观察的马尔可夫决策过程(POMDP)[1]允许该算法考虑决策过程中的这些不确定性,这使得对实际传感器特性更加坚固。然而,用强化学习(RL)求解POMDP [2]通常需要存储大量观察。此外,对于连续动作空间,系统计算效率低下。本文通过建议将问题模拟为MDP来解决这些问题,并使用分层选项(HOMDP)使用RL学习策略。建议的算法可以存储状态动作对,只使用当前观察来解决POMDP问题。我们比较了碰撞时间的结果[3]和提出的pomdp-with-lstm方法。我们的研究结果表明,HOMDP方法能够通过双向停止标志提高代理的代理的性能。 Homdp方法可以仅用当前步骤的观察产生更高级别的离散选项和较低级别的连续动作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号