A Finite Horizon Markov Decision Process Based Reinforcement Learning Control of a Rapid Thermal Processing system

Pradeep D. John; Noel Mathew Mithra

Abstract

Manufacture of ultra-large-scale integrated circuits involves accurate control of a challenging nonlinear Rapid Thermal Processing (RTP) system. The precise control of the temperature profile and the rapid ramp-up and ramp-down rates demanded by an RTP system cannot be achieved with conventional control strategies because of nonlinear and multi-time-scale effects. In this paper the control of an RTP system is reformulated as an optimal multi-step sequential decision problem using the framework of finite horizon Markov decision processes and solved using a Reinforcement Learning (RL) algorithm. Three increasingly complex RL-based control strategies are explored and compared with the existing state-of-the-art approach for controlling RTP systems. Simulation results indicate that the approach proposed in this paper achieves superior control of the temperature profile and of the ramp-up and ramp-down rates for the RTP system. (C) 2018 Elsevier Ltd. All rights reserved.
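
To make the formulation in the abstract concrete, the following is a minimal sketch of tabular Q-learning on a finite horizon Markov decision process, where the value table is indexed by time step as well as by state. It is not the authors' algorithm or plant model: the six temperature bins, the three-action set, the deterministic one-bin-per-step dynamics, the quadratic tracking penalty, and the ramp-up/hold/ramp-down reference profile are all assumptions chosen only for illustration.

```python
import numpy as np

# Hypothetical toy setup (not from the paper): temperature is discretized into
# N_LEVELS bins and each action moves the temperature by -1, 0, or +1 bin.
N_LEVELS = 6
ACTIONS = (-1, 0, 1)
REFERENCE = [0, 1, 2, 3, 3, 3, 2, 1, 0]   # assumed target bin at t = 0..HORIZON
HORIZON = len(REFERENCE) - 1


def step(state, action_index, t):
    """Deterministic toy dynamics with a quadratic tracking penalty."""
    next_state = min(max(state + ACTIONS[action_index], 0), N_LEVELS - 1)
    reward = -float((next_state - REFERENCE[t + 1]) ** 2)
    return next_state, reward


def train(episodes=5000, alpha=1.0, seed=0):
    """Tabular Q-learning with a stage-indexed table Q[t, state, action]."""
    rng = np.random.default_rng(seed)
    # The extra slice Q[HORIZON] stays zero: no reward after the final stage.
    q = np.zeros((HORIZON + 1, N_LEVELS, len(ACTIONS)))
    for _ in range(episodes):
        state = int(rng.integers(N_LEVELS))        # random initial temperature
        for t in range(HORIZON):
            a = int(rng.integers(len(ACTIONS)))    # purely exploratory behaviour
            next_state, reward = step(state, a, t)
            # Undiscounted finite horizon target: bootstrap from the next stage.
            target = reward + q[t + 1, next_state].max()
            q[t, state, a] += alpha * (target - q[t, state, a])
            state = next_state
    return q


def greedy_rollout(q, start_state=0):
    """Follow the time-dependent greedy policy from start_state."""
    states = [start_state]
    for t in range(HORIZON):
        a = int(np.argmax(q[t, states[-1]]))
        next_state, _ = step(states[-1], a, t)
        states.append(next_state)
    return states


q_table = train()
trajectory = greedy_rollout(q_table)
print(trajectory)
```

Because the horizon is finite, the greedy policy depends on the time step: the same temperature bin can call for heating early in the profile and cooling late in it, which is why the table carries the extra index `t`. The step size `alpha=1.0` is reasonable here only because the toy dynamics are deterministic; a noisy plant would need a smaller step size.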

Bibliographic details

Source
Journal of Process Control | 2018, Issue 2018 | 8 pages
Authors
Pradeep D. John; Noel Mathew Mithra
Author affiliations

VIT Univ, Sch Elect Engn, Vellore, Tamil Nadu, India;

VIT Univ, Sch Elect Engn, Vellore, Tamil Nadu, India;

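Indexing information
Original format: PDF
Language: English
Chinese Library Classification: Cybernetics, information theory (mathematical theory)
Keywords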
Reinforcement Learning; Rapid Thermal Processing; Nonlinear control; Markov Decision Process; Process control; Multivariable control;

Similar literature

1. A Finite Horizon Markov Decision Process Based Reinforcement Learning Control of a Rapid Thermal Processing system [J]. Pradeep D. John, Noel Mathew Mithra. Journal of Process Control. 2018
2. Joint Manufacturing and Onsite Microgrid System Control Using Markov Decision Process and Neural Network Integrated Reinforcement Learning [J]. Wenqing Hu, Zeyi Sun, Yunchao Zhang, Procedia Manufacturing. 2019, Issue 186
3. Partially decentralized reinforcement learning in finite, multi-agent Markov decision processes [J]. Omkar Tilak, Snehasis Mukhopadhyay. AI Communications. 2011, Issue 4
4. A Reinforcement Learning Based Algorithm for Finite Horizon Markov Decision Processes [C]. Bhatnagar, S., Abdulla, . 2006
5. DECENTRALIZED LEARNING IN GAMES AND FINITE MARKOV CHAINS (CONTROL, PROCESSES, SYSTEMS, THEORY). [D]. WHEELER, RICHARD MORGAN, JR. 1985
6. Learning to maximize reward rate: a model based on semi-Markov decision processes [O]. Arash Khodadadi, Pegah Fakhari, Jerome R. Busemeyer. 2014
7. A Reinforcement Learning Based Algorithm for Finite Horizon Markov Decision Processes [O]. Shalabh Bhatnagar, Mohammed Shahid Abdulla. 2006
