【24h】

Markov Decision Process Routing Games

机译:马尔可夫决策过程路由游戏

获取原文

摘要

We explore an extension of nonatomic routing games that we call Markov decision process routing games where each agent chooses a transition policy between nodes in a network rather than a path from an origin node to a destination node, i.e. each agent in the population solves a Markov decision process rather than a shortest path problem. We define the appropriate version of a Wardrop equilibrium as well as a potential function for this game in the finite horizon (total reward) case. This work can be thought of as a routing-game-based formulation of continuous population stochastic games (mean-field games or anonymous sequential games). We apply our model to the problem of ridesharing drivers competing for customers.
机译:我们探索非原子路由博弈的扩展,我们称其为马尔可夫决策过程路由博弈,其中每个代理选择网络节点之间的过渡策略,而不是从源节点到目的节点的路径,即总体中的每个代理都求解马尔可夫问题。决策过程,而不是最短路径问题。我们定义了Wardrop平衡的适当版本以及在有限水平(总奖励)情况下此游戏的潜在功能。可以将这项工作视为连续人口随机游戏(均场游戏或匿名连续游戏)的基于路由游戏的表述。我们将模型应用于拼车司机争夺客户的问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号