首页> 外文会议>ICMLA 2012;International Conference on Machine Learning and Applications >Balancing Public Cycle Sharing Schemes Using Independent Learners
【24h】

Balancing Public Cycle Sharing Schemes Using Independent Learners

机译:使用独立学习者平衡公共自行车共享计划

获取原文

摘要

This paper concerns the resource management problem arising in public cycle sharing schemes, when some docking stations become empty and remain so while others fill to capacity. To alleviate this, managing companies move bicycles between docking stations in order to maximise the number of satisfied customers while minimising the movement cost. We identify Reinforcement learning (RL) as the most promising technique for finding good movement strategies in these networks, but conventional function-approximation RL methods do not scale well here, due to the quadratic growth in number of actions with network size. We propose the use of cooperating agents, namely Independent Learners, to partition the action space. To overcome the well known issue of coordination in Independent Learners, we combine a novel scheduling approach for asynchronous learning, with a modified Gradient-descent Sarsa(?) algorithm to manage variable step-sizes. Our method competes with, and scales more favourably than, single-agent RL on a selection of simulated networks.
机译:本文涉及公共循环共享方案中出现的资源管理问题,当某些坞站变空并保持空白,而其他坞站变满时。为了减轻这种情况,管理公司在停靠站之间移动自行车,以最大程度地满足客户需求,同时最大程度地降低移动成本。我们将强化学习(RL)确定为在这些网络中寻找良好运动策略的最有前途的技术,但是由于功能数量随网络规模的二次增长,此处的常规功能近似RL方法在此处无法很好地扩展。我们建议使用合作代理(即独立学习者)来划分动作空间。为了克服独立学习者中众所周知的协调问题,我们将一种新颖的异步学习调度方法与改进的Gradient-descent Sarsa(?)算法相结合来管理可变步长。在选择的模拟网络上,我们的方法可与单代理RL竞争并在规模上比单代理RL更有利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号