首页> 外文会议>International Joint Conference on Neural Networks >On the Role of Reward Functions for Reinforcement Learning in the Traffic Assignment Problem
【24h】

On the Role of Reward Functions for Reinforcement Learning in the Traffic Assignment Problem

机译:奖励功能对强化学习在交通分配问题中的作用

获取原文

摘要

The traffic assignment problem (TAP) consists of assigning routes to road users in order to minimize traffic congestion. Traditional methods for solving the TAP assume the existence of a central authority who computes and dictates routes to road users. Multi-agent reinforcement learning (MARL) approaches are more realistic in solving this kind of problem because they consider that road users (agents) have complete autonomy for choosing routes. However, MARL approaches usually require a long training period in order to compute the optimal routes, which could be a major limitation in more realistic traffic scenarios. In this paper, we tackle this problem by evaluating the performance of three conceptually different reward functions, namely: expert-designed rewards, difference rewards, and intrinsically motivated rewards. In particular, our focus lies on providing a deeper understanding of the impact of these reward functions on the agents’ performance, thus contributing towards reducing congestion levels. To this end, we perform an extensive experimental evaluation on different road networks, including up to 360,600 concurrently learning agents. Our results show that, although the adopted reward functions were not able to speed up the learning process, the correct reward function choice plays an important role in the quality of the learned solution.
机译:交通分配问题(TAP)包括为道路用户分配路线,以最大程度地减少交通拥堵。解决TAP的传统方法假定存在中央主管部门,该中央主管部门负责计算和指示通往道路使用者的路线。多智能体强化学习(MARL)方法在解决此类问题时更为现实,因为他们认为道路使用者(智能体)拥有选择路线的完全自主权。但是,MARL方法通常需要很长的训练时间才能计算出最佳路线,这在更现实的交通场景中可能是一个主要限制。在本文中,我们通过评估三个概念上不同的奖励函数的性能来解决此问题,即:专家设计的奖励,差异奖励和内在动机的奖励。特别是,我们的重点在于更深入地了解这些奖励功能对代理商绩效的影响,从而有助于减少拥塞程度。为此,我们在不同的道路网络上进行了广泛的实验评估,其中包括多达360,600个并发学习代理。我们的结果表明,尽管采用的奖励函数无法加快学习过程,但正确的奖励函数选择对学习解决方案的质量起着重要作用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号