首页> 外文会议>International Conference on Control, Automation, Robotics Vision >Behavioral decision-making for urban autonomous driving in the presence of pedestrians using Deep Recurrent Q-Network
【24h】

Behavioral decision-making for urban autonomous driving in the presence of pedestrians using Deep Recurrent Q-Network

机译:使用深频Q-Net网络在行人存在下城市自主行动的行为决策

获取原文

摘要

Decision making for autonomous driving in urban environments is challenging due to the complexity of the road structure and the uncertainty in the behavior of diverse road users. Traditional methods consist of manually designed rules as the driving policy, which require expert domain knowledge, are difficult to generalize and might give sub-optimal results as the environment gets complex. Whereas, using reinforcement learning, optimal driving policy could be learned and improved automatically through several interactions with the environment. However, current research in the field of reinforcement learning for autonomous driving is mainly focused on highway setup with little to no emphasis on urban environments. In this work, a deep reinforcement learning based decision-making approach for high-level driving behavior is proposed for urban environments in the presence of pedestrians. For this, the use of Deep Recurrent Q-Network (DRQN) is explored, a method combining state-of-the art Deep Q-Network (DQN) with a long term short term memory (LSTM) layer helping the agent gain a memory of the environment. A 3-D state representation is designed as the input combined with a well defined reward function to train the agent for learning an appropriate behavior policy in a real-world like urban simulator. The proposed method is evaluated for dense urban scenarios and compared with a rule-based approach and results show that the proposed DRQN based driving behavior decision maker outperforms the rule-based approach.
机译:由于道路结构的复杂性和不同的道路使用者行为的特性,在城市环境中的自主驾驶决策是挑战的。传统方法由手动设计的规则包括作为需要专家领域知识的驾驶策略,很难概括,并且可能会使子最佳结果作为环境变得复杂。虽然,使用强化学习,可以通过与环境的几个互动自动学习和改进最佳驾驶策略。然而,目前在自动驾驶的强化学习领域的研究主要集中在公路设置上,几乎没有强调城市环境。在这项工作中,为城市环境在行人的存在下,提出了一种基于深度的高级驾驶行为的决策方法。为此,探索了使用深频率Q-Network(DRQN),一种方法,将最先进的深Q-Network(DQN)与长期短期内存(LSTM)层组合,帮助代理增益存储器环境。 3-D状态表示设计为输入结合良好定义的奖励函数,以培训代理在城市模拟器等现实世界中学习适当的行为政策。该提出的方法被评估为密集的城市情景,并与基于规则的方法相比,结果表明,所提出的基于DRQN的驾驶行为决策者优于基于规则的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号