首页> 外文期刊>Transportation research >Real-world ride-hailing vehicle repositioning using deep reinforcement learning
【24h】

Real-world ride-hailing vehicle repositioning using deep reinforcement learning

机译:利用深度加固学习,现实世界乘车车辆重新定位

获取原文
获取原文并翻译 | 示例
       

摘要

We present a new practical framework based on deep reinforcement learning and decision-time planning for real-world vehicle repositioning on ride-hailing (a type of mobility-on-demand, MoD) platforms. Our approach learns the spatiotemporal state-value function using a batch training algorithm with deep value networks. The optimal repositioning action is generated ondemand through value-based policy search, which combines planning and bootstrapping with the value networks. For the large-fleet problems, we develop several algorithmic features that we incorporate into our framework and that we demonstrate to induce coordination among the algorithmically-guided vehicles. We benchmark our algorithm with baselines in a ride-hailing simulation environment to demonstrate its superiority in improving income efficiency measured by income-per-hour. We have also designed and run a real-world experiment program with regular drivers on a major ride-hailing platform. We have observed significantly positive results on key metrics comparing our method with experienced drivers who performed idle-time repositioning based on their own expertise.
机译:我们介绍了一种基于深度加强学习和决策规划的新的实用框架,以实现Ride-HaIling(一种按需类型,Mod)平台的现实车辆重新定位。我们的方法使用具有深度值网络的批量培训算法来学习时空状态值函数。通过基于价值的策略搜索生成OnDemand的最佳重新定位操作,该策略搜索将规划和引导与值网络相结合。对于大型舰队问题,我们开发了多种算法特征,我们将我们纳入我们的框架,并且我们证明在算法引导的车辆之间诱导协调。我们将我们的算法基准与基线在乘车的仿真环境中,以展示其优越性,以提高每小时收入测量的收入效率。我们还设计并运行了一个实际的实验计划,并在一个主要的乘车平台上进行了常规驱动程序。我们在与经验丰富的驱动程序进行比较的关键指标上观察到显着积极的结果,他们根据自己的专业知识进行空闲时间重新定位。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号