Constrained Q-Learning for Batch Process Optimization

Elton Pan; Panagiotis Petsagkourakis; Max Mowbray; Dongda Zhang; Antonio del Rio-Chanona

首页> 外文期刊>IFAC PapersOnLine >Constrained Q-Learning for Batch Process Optimization

【24h】

Constrained Q-Learning for Batch Process Optimization

机译：受限制的Q学习进行批处理优化

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Chemical process optimization and control often require satisfaction of constraints for safe operation. Reinforcement learning (RL) has been shown to be a powerful control technique that can handle nonlinear stochastic optimal control problems. Despite this promise, RL has yet to see significant translation to industrial practice due to its inability to satisfy state constraints. This work aims to address this challenge. We propose an “oracle”-assisted constrained Q-learning algorithm that guarantees the satisfaction of joint chance constraints with high probability, which is required for safety critical tasks. To that end, constraint tightening (backoffs) are introduced, which are adjusted using Broyden’s method, hence making the backoffs self-tuned. This results in a general methodology that can be integrated into approximate dynamic programming-based algorithms to guarantee constraint satisfaction with high probability. Finally, a case study is presented to compare the performance of the proposed approach with that of model predictive control (MPC). The superior performance of the proposed algorithm, in terms of constraint handling, signifies a step toward the use of RL in real world optimization and control of systems, where constraints are critical in ensuring safety.

机译：化学过程优化和控制通常需要满足安全操作的约束。钢筋学习（RL）已被证明是一种能够处理非线性随机最佳控制问题的强大控制技术。尽管这一承诺，由于无法满足国家限制，但RL尚未对工业实践进行重大翻译。这项工作旨在解决这一挑战。我们提出了一个“Oracle”的“甲骨文” - 自由度约束Q学习算法，可确保对高概率的关节机会限制满意，这是安全关键任务所必需的。为此，介绍了约束紧固（退避），使用Broyden的方法调整，因此使退避自调整。这导致一般方法可以集成到基于近似的动态编程的算法中，以确保对高概率的约束满足。最后，提出了一个案例研究以比较模型预测控制（MPC）的提出方法的性能。在约束处理方面，所提出的算法的卓越性能意味着朝着使用RL在现实世界优化和控制中使用的步骤，其中约束对于确保安全性是至关重要的。

著录项

来源
《IFAC PapersOnLine》 |2021年第3期|共6页
作者
Elton Pan; Panagiotis Petsagkourakis; Max Mowbray; Dongda Zhang; Antonio del Rio-Chanona;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类
关键词
Machine LearningBatch OptimizationProcess ControlBioprocessesQ-learningDynamic SystemsData-Driven Optimization;

机译：机器学习优化过程ControlBioProcessesQ-LearningDymentsData驱动优化;

相似文献

外文文献
中文文献
专利

1. Dynamic optimization of constrained semi-batch processes using Pontryagin's minimum principle-An effective quasi-Newton approach [J] . Erdal Aydin, Dominique Bonvin, Kai Sundmacher Computers & Chemical Engineering . 2017,第apra6期

机译：使用庞特里亚金极小原理的动态半优化约束半批量过程-一种有效的拟牛顿法
2. Constrained latent variable model predictive control for trajectory tracking and economic optimization in batch processes [J] . Godoy J. L., Gonzalez A. H., Normey-Rico J. E. Journal of Process Control . 2016,第Null期

机译：间歇过程中轨迹跟踪和经济优化的约束潜变量模型预测控制
3. Constrained Run-to-Run Optimization for Batch Process Based on Support Vector Regression Model [J] . 上海交通大学学报（英文版） . 2006,第004期
4. Constrained batch-to-batch optimal control for batch process based on kernel principal component regression model [C] . Li Ganping, Huang Tao, Zhao Jun 2012 IEEE Fifth International Conference on Advanced Computational Intelligence. . 2012

机译：基于核主成分回归模型的批次间约束的批次间最优控制
5. Constrained Optimization of a Batch Polymerization Process by Combining DoDE and DRSM Tools [D] . Bardooli, Ahmed. 2018

机译：通过组合DODE和DRSM工具约束批量聚合过程的优化
6. Constrained Deep Q-Learning Gradually Approaching Ordinary Q-Learning [O] . Shota Ohnishi, Eiji Uchibe, Yotaro Yamaguchi, 2019

机译：受约束的深度Q学习逐渐接近普通Q学习
7. Dynamic Modelling and Optimization of Polymerization Processes in Batch and Semi-batch Reactors. Dynamic Modelling and Optimization of Bulk Polymerization of Styrene, Solution Polymerization of MMA and Emulsion Copolymerization of Styrene and MMA in Batch and Semi-batch Reactors using Control Vector Parameterization Techniques. [O] . Ibrahim W. H. B. W. 2011

机译：分批和半分批反应器中聚合过程的动态建模和优化。动态和建模的苯乙烯本体聚合，MMA的溶液聚合和苯乙烯和MMA的乳液聚合在间歇和半间歇反应器中使用控制矢量参数化技术。

Constrained Q-Learning for Batch Process Optimization

摘要

著录项

相似文献

相关主题

期刊订阅