Balancing Public Cycle Sharing Schemes Using Independent Learners

Abstract

This paper concerns the resource management problem that arises in public cycle sharing schemes when some docking stations become empty and remain so while others fill to capacity. To alleviate this, managing companies move bicycles between docking stations in order to maximise the number of satisfied customers while minimising the movement cost. We identify reinforcement learning (RL) as the most promising technique for finding good movement strategies in these networks, but conventional function-approximation RL methods do not scale well here because the number of actions grows quadratically with network size. We propose the use of cooperating agents, namely Independent Learners, to partition the action space. To overcome the well-known issue of coordination in Independent Learners, we combine a novel scheduling approach for asynchronous learning with a modified Gradient-descent Sarsa(λ) algorithm to manage variable step-sizes. Our method competes with, and scales more favourably than, single-agent RL on a selection of simulated networks.
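
The quadratic growth in actions, and the effect of partitioning them, can be illustrated with a minimal sketch. It assumes that one action is a move of bicycles from one docking station to another and that each learner owns the moves leaving a single station. This record does not give the paper's actual action encoding or agent assignment, so the function names `joint_actions` and `partition_by_source` are hypothetical.

```python
from itertools import permutations


def joint_actions(n_stations):
    """All ordered (source, destination) station pairs.

    Assumption: one action = one move between two distinct stations.
    """
    return list(permutations(range(n_stations), 2))


def partition_by_source(n_stations):
    """Split the joint action set so that each hypothetical learner
    only controls the moves that leave its own station."""
    parts = {station: [] for station in range(n_stations)}
    for src, dst in joint_actions(n_stations):
        parts[src].append((src, dst))
    return parts


if __name__ == "__main__":
    for n in (5, 10, 20):
        total = len(joint_actions(n))               # n * (n - 1)
        per_learner = len(partition_by_source(n)[0])  # n - 1
        print(f"{n} stations: {total} joint actions, {per_learner} per learner")
```

Under these assumptions a single agent faces n(n-1) actions, while each independent learner chooses among only n-1, which is one way to read the scaling argument in the abstract.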

Bibliographic details

Source: ICMLA 2012; International Conference on Machine Learning and Applications | 2012 | pp. 168-173 | 6 pages
Authors: Smith Jeremiah; Dickens Luke; Broda Krysia
Original format: PDF
Chinese Library Classification: automated reasoning, machine learning
Keywords: cooperative; cycle hire scheme; function approximation; independent learners; large discrete action spaces; multi-agent; reinforcement learning

Similar literature

1. Inequalities in usage of a public bicycle sharing scheme: Socio-demographic predictors of uptake and usage of the London (UK) cycle hire scheme [J]. Ogilvie F., Goodman A. Preventive Medicine: An International Journal Devoted to Practice and Theory. 2012, No. 1
2. Sharing Sustainability: How Values and Ethics Matter in Consumers' Adoption of Public Bicycle-Sharing Scheme [J]. Juelin Yin, Lixian Qian, Anusorn Singhapakdi. Journal of Business Ethics. 2018, No. 2
3. Erratum to: Sharing Sustainability: How Values and Ethics Matter in Consumers' Adoption of Public Bicycle-Sharing Scheme [J]. Yin Juelin, Qian Lixian, Singhapakdi Anusorn. Journal of Business Ethics. 2018, No. 2
4. Balancing Public Cycle Sharing Schemes Using Independent Learners [C]. Smith Jeremiah, Dickens Luke, Broda Krysia. International Conference on Machine Learning and Applications. 2012
5. A study of the responses of fourth grade, public school students to the same story read independently, read aloud, and told orally as a shared storytelling experience [D]. Morgan, Karen Ferris. 2002
6. A public health dilemma: Urban bicycle-sharing schemes [O]. Andrew Nanapragasam. 2014
7. Inequalities in usage of a public bicycle sharing scheme: Socio-demographic predictors of uptake and usage of the London (UK) cycle hire scheme [O]. Ogilvie F., Goodman A. 2012
