Reinforcement Learning for Production Ramp-Up: A Q-Batch Learning Approach

机译：强化学习以提高生产效率：Q批次学习方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The ramp-up process is a significant bottleneck during the development of manufacturing systems. The effort and time required to ramp-up a system is largely dependent on the effectiveness of the human decision making process to select the most promising action and improve the system. Although existing work has identified significant factors influencing ramp-up performance, little has been done to support the actual process. This work approaches ramp-up as sequence of technical changes which aim to get a manufacturing system to a desirable performance in the fastest time. A reinforcement learning approach is proposed to support decisions during ramp-up. The aim is to capture the dynamics between an operator and the system and support time reduction of the process. A batch learning approach has been identified as promising since it matches the practical aspect of decision making during ramp-up. It is combined with a Q-learning algorithm which provides theoretical foundation of optimum convergence. The learning approach has been demonstrated on a highly automated production station during its ramp-up and the generated policy was shown to have significant impact on the ramp-up time reduction.

机译：在制造系统的开发过程中，加速过程是一个重大的瓶颈。增强系统所需的精力和时间在很大程度上取决于人类决策过程选择最有希望的行动并改进系统的有效性。尽管现有工作已经确定了影响加速性能的重要因素，但几乎没有做任何工作来支持实际过程。这项工作是作为技术变更序列而逐步增加的，旨在使制造系统在最快的时间内达到理想的性能。提出了一种强化学习方法，以支持在提升过程中的决策。目的是捕获操作员与系统之间的动态关系并支持减少过程时间。批处理学习方法被认为是有前途的，因为它与提升过程中决策的实际方面相匹配。它与Q学习算法相结合，为最优收敛提供了理论基础。该学习方法已在高度自动化的生产工位上进行了演示，并且表明所生成的策略对缩短工时具有重大影响。

著录项

来源
《ICMLA 2012;International Conference on Machine Learning and Applications》|2012年|p.610-615|共6页
会议地点
作者
Doltsinis Stefanos; Ferreira Pedro; Lohse Niels;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类自动推理、机器学习;自动推理、机器学习;
关键词
Decision Making Systems; Manufacturing; Ramp-Up; Reinforcement Learning;

机译：决策系统;制造;升级;强化学习;

相似文献

外文文献
中文文献
专利

1. An MDP Model-Based Reinforcement Learning Approach for Production Station Ramp-Up Optimization: Q-Learning Analysis [J] . Doltsinis S., Ferreira P., Lohse N. IEEE Transactions on Systems, Man, and Cybernetics . 2014,第9期

机译：基于MDP模型的强化学习平台用于生产站升级优化：Q学习分析
2. Batch Reinforcement Learning for Robotic Soccer Using the Q-Batch Update-Rule [J] . Cunha Joao, Serra Rui, Lau Nuno, Journal of Intelligent & Robotic Systems: Theory & Application . 2015,第3a4期

机译：使用Q批次更新规则对机器人足球进行批次强化学习
3. Safety factor profile control with reduced central solenoid flux consumption during plasma current ramp-up phase using a reinforcement learning technique [J] . Wakatsuki T., Suzuki T., Hayashi N., Nuclear fusion . 2019,第6期

机译：使用强化学习技术，在等离子电流加速阶段降低中心螺线管通量消耗的安全系数曲线控制
4. Reinforcement Learning for Production Ramp-Up: A Q-Batch Learning Approach [C] . Doltsinis Stefanos, Ferreira Pedro, Lohse Niels International Conference on Machine Learning and Applications . 2012

机译：生产升级的强化学习：Q批处理学习方法
5. Improving Learning and Reducing Time: A Constrained Action Based Reinforcement Learning Approach [D] . Shen, Shitian. 2019

机译：改善学习和减少时间：基于约束的加强学习方法
6. Correction: Linking Individual Learning Styles to Approach-Avoidance Motivational Traits and Computational Aspects of Reinforcement Learning [O] . Kristoffer Carl Aberg, Kimberly C. Doell, Sophie Schwartz -1

机译：纠正：将个人学习风格与避免方法的动机特征和强化学习的计算方面联系起来
7. An MDP model-based reinforcement learning approach for production station ramp-up optimization: Q-learning analysis [O] . Doltsinis, Stefanos, Ferreira, Pedro, Lohse, Niels 2014

机译：基于MDP模型的强化学习方法，用于生产站的产能优化：Q学习分析

Reinforcement Learning for Production Ramp-Up: A Q-Batch Learning Approach

摘要

著录项

相似文献

相关主题

期刊订阅