首页> 外文期刊>International Journal of Production Research >Distributed policy search reinforcement learning for job-shop scheduling tasks
【24h】

Distributed policy search reinforcement learning for job-shop scheduling tasks

机译:用于车间调度任务的分布式策略搜索强化学习

获取原文
获取原文并翻译 | 示例
       

摘要

We interpret job-shop scheduling problems as sequential decision problems that are handled by independent learning agents. These agents act completely decoupled from one another and employ probabilistic dispatching policies for which we propose a compact representation using a small set of real-valued parameters. During ongoing learning, the agents adapt these parameters using policy gradient reinforcement learning, with the aim of improving the performance of the joint policy measured in terms of a standard scheduling objective function. Moreover, we suggest a lightweight communication mechanism that enhances the agents' capabilities beyond purely reactive job dispatching. We evaluate the effectiveness of our learning approach using various deterministic as well as stochastic job-shop scheduling benchmark problems, demonstrating that the utilisation of policy gradient methods can be effective and beneficial for scheduling problems.
机译:我们将作业车间的调度问题解释为由独立的学习代理人处理的顺序决策问题。这些代理的行为完全相互分离,并采用概率分配策略,为此我们建议使用一小组实值参数进行紧凑表示。在进行中的学习期间,代理使用策略梯度强化学习来调整这些参数,以提高根据标准调度目标函数衡量的联合策略的性能。此外,我们提出了一种轻量级的通信机制,该机制可以增强代理的功能,而不仅仅是纯粹的响应式作业调度。我们使用各种确定性以及随机作业车间调度基准问题评估我们的学习方法的有效性,表明使用策略梯度方法可以有效且有益于调度问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号