Distributed policy search reinforcement learning for job-shop scheduling tasks

Thomas Gabel; Martin Riedmiller

首页> 外文期刊>International Journal of Production Research >Distributed policy search reinforcement learning for job-shop scheduling tasks

【24h】

Distributed policy search reinforcement learning for job-shop scheduling tasks

机译：用于车间调度任务的分布式策略搜索强化学习

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We interpret job-shop scheduling problems as sequential decision problems that are handled by independent learning agents. These agents act completely decoupled from one another and employ probabilistic dispatching policies for which we propose a compact representation using a small set of real-valued parameters. During ongoing learning, the agents adapt these parameters using policy gradient reinforcement learning, with the aim of improving the performance of the joint policy measured in terms of a standard scheduling objective function. Moreover, we suggest a lightweight communication mechanism that enhances the agents' capabilities beyond purely reactive job dispatching. We evaluate the effectiveness of our learning approach using various deterministic as well as stochastic job-shop scheduling benchmark problems, demonstrating that the utilisation of policy gradient methods can be effective and beneficial for scheduling problems.

机译：我们将作业车间的调度问题解释为由独立的学习代理人处理的顺序决策问题。这些代理的行为完全相互分离，并采用概率分配策略，为此我们建议使用一小组实值参数进行紧凑表示。在进行中的学习期间，代理使用策略梯度强化学习来调整这些参数，以提高根据标准调度目标函数衡量的联合策略的性能。此外，我们提出了一种轻量级的通信机制，该机制可以增强代理的功能，而不仅仅是纯粹的响应式作业调度。我们使用各种确定性以及随机作业车间调度基准问题评估我们的学习方法的有效性，表明使用策略梯度方法可以有效且有益于调度问题。

著录项

来源
《International Journal of Production Research》 |2012年第1期|p.41-61|共21页
作者
Thomas Gabel; Martin Riedmiller;
展开▼
作者单位

Machine Learning Laboratory, Department of Computer Science,Albert-Ludwigs-Universitdt Freiburg, Germany;

Machine Learning Laboratory, Department of Computer Science,Albert-Ludwigs-Universitdt Freiburg, Germany;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
distributed reinforcement learning; job-shop scheduling; distributed control; mulit-agent systems; policy search;

机译：分布式强化学习;车间调度;分布式控制多代理系统;政策搜寻;
入库时间 2022-08-17 13:37:33

相似文献

外文文献
中文文献
专利

1. A self-learning genetic algorithm based on reinforcement learning for flexible job-shop scheduling problem [J] . Ronghua Chen, Bo Yang, Shi Li, Computers & Industrial Engineering . 2020,第Nova期

机译：一种基于强化学习的自学习遗传算法，用于灵活的工作店调度问题
2. Scatter search algorithm for the multiprocessor task job-shop scheduling problem [J] . Fan Kun, Wang Meng, Zhai Yafei, Computers & Industrial Engineering . 2019,第JANa期

机译：多处理器任务作业车间调度问题的分散搜索算法
3. Dynamic scheduling in a job-shop production system with reinforcement learning [J] . Csaba Kardos, Catherine Laflamme, Viola Gallina, Procedia CIRP . 2021,第Suppla1期

机译：加固学习工作店生产系统的动态调度
4. A Policy Search Method For Temporal Logic Specified Reinforcement Learning Tasks [C] . Xiao Li, Yao Ma, Calin Belta 2018 Annual American Control Conference . 2018

机译：时间逻辑指定强化学习任务的策略搜索方法
5. Multi-Task Generalization Using Practice for Distributed Deep Reinforcement Learning [D] . Pattnaik, Upasana. 2021

机译：多任务泛化使用分布式深度加强学习的实践
6. Deep Reinforcement Learning-Based Task Scheduling in IoT Edge Computing [O] . Shuran Sheng, Peng Chen, Zhimin Chen, 2021

机译：基于深度加强学习的IOT Edge Computing任务调度
7. A Policy Search Method For Temporal Logic Specified Reinforcement Learning Tasks [O] . Li, Xiao, Ma, Yao, Belta, Calin 2017

机译：时态逻辑指定强化的策略搜索方法学习任务
8. Distributed Reinforcement Learning for Policy Synchronization in Infinite-Horizon Dec-POMDPs. [R] . Banerjee, B., Kraemer, L. 2012

机译：无限地平线Dec-pOmDp中策略同步的分布式强化学习。

Distributed policy search reinforcement learning for job-shop scheduling tasks

摘要

著录项

相似文献

相关主题

期刊订阅