Collaborative reinforcement learning for a two-robot job transfer flow-shop scheduling problem

Arviv Kfir; Stern Helman; Edan Yael

首页> 外文期刊>International Journal of Production Research >Collaborative reinforcement learning for a two-robot job transfer flow-shop scheduling problem

【24h】

Collaborative reinforcement learning for a two-robot job transfer flow-shop scheduling problem

机译：两机器人作业转移流水车间调度问题的协同强化学习

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

A two-robot flow-shop scheduling problem with n identical jobs and m machines is defined and evaluated for four robot collaboration levels corresponding to different levels of information sharing, learning and assessment: Full - robots work together, performing self and joint learning sharing full information; Pull - one robot decides when and if to learn from the other robot; Push - one robot may force the second to learn from it and None - each robot learns independently with no information sharing. Robots operate on parallel tracks, transporting jobs between successive machines, returning empty to a machine to move another job. The objective is to obtain a robot schedule that minimises makespan (C-max) for machines with varying processing times. A new reinforcement learning algorithm is developed, using dual Q-learning functions. A novel feature in the collaborative algorithm is the assignment of different reward functions to robots; minimising robot idle time and minimising job waiting time. Such delays increase makespan. Simulation analyses with fast, medium and slow speed robots indicated that Full collaboration with a fast-fast robot pair was best according to minimum average upper bound error. The new collaborative algorithm provides a tool for finding optimal and near-optimal solutions to difficult collaborative multi-robot scheduling problems.

机译：定义并评估了具有n个相同作业和m个机器的两机器人流水车间调度问题，并针对四个机器人协作级别进行了评估，这四个级别对应于信息共享，学习和评估的不同级别：完全-机器人协同工作，完成自我和联合学习共享信息;拉-一个机器人决定何时以及是否向另一机器人学习；推式-一个机器人可以强迫第二个机器人向其学习，而无-每个机器人都可以独立学习，而无需共享信息。机器人在平行的轨道上操作，在连续的机器之间运输作业，将空的机器返回机器以移动另一个作业。目的是获得一种机器人计划，以使具有不同处理时间的机器的制造期（C-max）最小化。使用双重Q学习功能，开发了一种新的强化学习算法。协作算法的一个新功能是将不同的奖励函数分配给机器人。最大限度地减少机械手的空闲时间并最小化作业等待时间。这样的延误增加了制造时间。使用快速，中速和慢速机器人进行的仿真分析表明，根据最小平均上限误差，与快速机器人对进行全面协作是最好的。新的协作算法提供了一种工具，可以找到困难的协作式多机器人调度问题的最佳解决方案。

著录项

来源
《International Journal of Production Research》 |2016年第4期|1196-1209|共14页
作者
Arviv Kfir; Stern Helman; Edan Yael;
展开▼
作者单位

Ben Gurion Univ Negev, Dept Ind Engn & Management, IL-84105 Beer Sheva, Israel;

Ben Gurion Univ Negev, Dept Ind Engn & Management, IL-84105 Beer Sheva, Israel;

Ben Gurion Univ Negev, Dept Ind Engn & Management, IL-84105 Beer Sheva, Israel;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Job-transfer robots; identical and non-identical robots; scheduling; flow-shop; reinforcement learning; collaboration;

机译：作业转移机器人;相同和不同的机器人;计划;流水车间;强化学习;协作;

相似文献

外文文献
中文文献
专利

1. Dynamic scheduling for flexible job shop with new job insertions by deep reinforcement learning [J] . Applied Soft Computing . 2020,第期

机译：通过深度加固学习的新工作店的灵活作业商店动态调度
2. A self-learning genetic algorithm based on reinforcement learning for flexible job-shop scheduling problem [J] . Ronghua Chen, Bo Yang, Shi Li, Computers & Industrial Engineering . 2020,第Nova期

机译：一种基于强化学习的自学习遗传算法，用于灵活的工作店调度问题
3. Dynamic scheduling in a job-shop production system with reinforcement learning [J] . Csaba Kardos, Catherine Laflamme, Viola Gallina, Procedia CIRP . 2021,第Suppla1期

机译：加固学习工作店生产系统的动态调度
4. FLOW-SHOP ROBOTIC SCHEDULING WITH COLLABORATIVE REINFORCEMENT LEARNING [C] . Helman Stern, Kfir Arviv, Yael Edan International Conference on Production Research . 2011

机译：采用协同强化学习流动店机器人调度
5. A computational study of the job-shop and the flow-shop scheduling problems. [D] . Lourenco, Helena Ramalhinho Dias. 1993

机译：车间和流水车间调度问题的计算研究。
6. Working Memory and Reinforcement Schedule Jointly Determine Reinforcement Learning in Children: Potential Implications for Behavioral Parent Training [O] . Elien Segers, Tom Beckers, Hilde Geurts, -1

机译：工作记忆和强化时间表共同决定儿童的强化学习：行为父母培训的潜在含义
7. A Reinforcement Learning Method for a Hybrid Flow-Shop Scheduling Problem [O] . 2019

机译：混合流量店调度问题的加强学习方法

Collaborative reinforcement learning for a two-robot job transfer flow-shop scheduling problem

摘要

著录项

相似文献

相关主题

期刊订阅