首页> 外文会议>International Conference on Intelligent Control and Information Processing >Comparison of Control Methods Based on Imitation Learning for Autonomous Driving
【24h】

Comparison of Control Methods Based on Imitation Learning for Autonomous Driving

机译:基于模仿学习的自动驾驶控制方法比较

获取原文

摘要

Recently, some learning-based methods such as reinforcement learning and imitation learning have been used to address the control problem for autonomous driving. Note that reinforcement learning has strong reliance on the simulation environment and requires a handcraft design of the reward function. Considering different factors in autonomous driving, a general evaluation method is still being explored. The purpose of imitation learning is to learn the control policy through human demonstrations. It is meaningful to compare the control performances of current main imitation learning methods based on the provided dataset. In this paper, we compare three typical imitation learning algorithms: Behavior cloning, Dataset Aggregation (DAgger) and Information maximizing Generative Adversarial Imitation Learning (InfoGAIL) in the The Open Racing Car Simulator (TORCS) and Car Learning to Act (CARLA) simulators, respectively. The performance of algorithms is evaluated on lane-keeping task in racing and urban environment. The experiment results show DAgger performs best in simple lane keeping problem, and InfoGAIL has the unique advantage of distinguishing different driving styles from expert demonstrations.
机译:最近,一些基于学习的方法(例如强化学习和模仿学习)已用于解决自动驾驶的控制问题。请注意,强化学习非常依赖于模拟环境,并且需要手工设计奖励功能。考虑到自动驾驶中的不同因素,仍在探索一种通用的评估方法。模仿学习的目的是通过人类示范来学习控制策略。根据提供的数据集比较当前主要模仿学习方法的控制性能非常有意义。在本文中,我们比较了三种典型的模仿学习算法:行为克隆,数据集聚合(DAgger)和最大化公开对抗模仿学习(InfoGAIL)中的信息,这些信息在开放赛车模拟器(TORCS)和汽车学习行为(CARLA)模拟器中进行了研究,分别。在赛车和城市环境中,对车道保持任务的算法性能进行了评估。实验结果表明,DAgger在简单的车道保持问题上表现最佳,而InfoGAIL具有将不同驾驶风格与专家演示区分开的独特优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号