...
首页> 外文期刊>International Journal of Production Research >Collaborative reinforcement learning for a two-robot job transfer flow-shop scheduling problem
【24h】

Collaborative reinforcement learning for a two-robot job transfer flow-shop scheduling problem

机译:两机器人作业转移流水车间调度问题的协同强化学习

获取原文
获取原文并翻译 | 示例
           

摘要

A two-robot flow-shop scheduling problem with n identical jobs and m machines is defined and evaluated for four robot collaboration levels corresponding to different levels of information sharing, learning and assessment: Full - robots work together, performing self and joint learning sharing full information; Pull - one robot decides when and if to learn from the other robot; Push - one robot may force the second to learn from it and None - each robot learns independently with no information sharing. Robots operate on parallel tracks, transporting jobs between successive machines, returning empty to a machine to move another job. The objective is to obtain a robot schedule that minimises makespan (C-max) for machines with varying processing times. A new reinforcement learning algorithm is developed, using dual Q-learning functions. A novel feature in the collaborative algorithm is the assignment of different reward functions to robots; minimising robot idle time and minimising job waiting time. Such delays increase makespan. Simulation analyses with fast, medium and slow speed robots indicated that Full collaboration with a fast-fast robot pair was best according to minimum average upper bound error. The new collaborative algorithm provides a tool for finding optimal and near-optimal solutions to difficult collaborative multi-robot scheduling problems.
机译:定义并评估了具有n个相同作业和m个机器的两机器人流水车间调度问题,并针对四个机器人协作级别进行了评估,这四个级别对应于信息共享,学习和评估的不同级别:完全-机器人协同工作,完成自我和联合学习共享信息;拉-一个机器人决定何时以及是否向另一机器人学习;推式-一个机器人可以强迫第二个机器人向其学习,而无-每个机器人都可以独立学习,而无需共享信息。机器人在平行的轨道上操作,在连续的机器之间运输作业,将空的机器返回机器以移动另一个作业。目的是获得一种机器人计划,以使具有不同处理时间的机器的制造期(C-max)最小化。使用双重Q学习功能,开发了一种新的强化学习算法。协作算法的一个新功能是将不同的奖励函数分配给机器人。最大限度地减少机械手的空闲时间并最小化作业等待时间。这样的延误增加了制造时间。使用快速,中速和慢速机器人进行的仿真分析表明,根据最小平均上限误差,与快速机器人对进行全面协作是最好的。新的协作算法提供了一种工具,可以找到困难的协作式多机器人调度问题的最佳解决方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号