Scalable reinforcement learning for plant-wide control of vinyl acetate monomer process

Lingwei Zhu; Yunduan Cui; Go Takami; Hiroaki Kanokogi; Takamitsu Matsubara

首页> 外文期刊>Control Engineering Practice >Scalable reinforcement learning for plant-wide control of vinyl acetate monomer process

【24h】

Scalable reinforcement learning for plant-wide control of vinyl acetate monomer process

机译：可扩展的强化学习，可在整个工厂范围内控制乙酸乙烯酯单体工艺

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper explores a reinforcement learning (RL) approach that designs automatic control strategies in a large-scale chemical process control scenario as the first step for leveraging an RL method to intelligently control real-world chemical plants. The huge number of units for chemical reactions as well as feeding and recycling the materials of a typical chemical process induces a vast amount of samples and subsequent prohibitive computation complexity in RL for deriving a suitable control policy due to high-dimensional state and action spaces. To tackle this problem, a novel RL algorithm: Factorial Fast-food Dynamic Policy Programming (FFDPP) is proposed. By introducing a factorial framework that efficiently factorizes the action space, Fast-food kernel approximation that alleviates the curse of dimensionality caused by the high dimensionality of state space, into Dynamic Policy Programming (DPP) that achieves stable learning even with insufficient samples. FFDPP is evaluated in a commercial chemical plant simulator for a Vinyl Acetate Monomer (VAM) process. Experimental results demonstrate that without any knowledge of the model, the proposed method successfully learned a stable policy with reasonable computation resources to produce a larger amount of VAM product with comparative performance to a state-of-the-art model-based control.

机译：本文探索了一种强化学习（RL）方法，该方法设计了大规模化学过程控制场景中的自动控制策略，这是利用RL方法智能控制实际化工厂的第一步。用于化学反应以及进料和回收典型化学过程中的材料的大量单元会导致产生大量样本，并且由于高维状态和动作空间而导致RL难以获得适当的控制策略，从而导致RL的计算复杂度过高。为了解决这个问题，提出了一种新颖的RL算法：因子快速食品动态策略编程（FFDPP）。通过引入有效分解动作空间的阶乘框架，将减轻状态空间高维数引起的维数诅咒的Fast-food kernel逼近技术引入到动态策略编程（DPP）中，即使没有足够的样本也可以实现稳定的学习。 FFDPP在商业化工厂模拟器中评估了乙酸乙烯酯单体（VAM）工艺。实验结果表明，在不了解模型的情况下，该方法成功地学习了具有合理计算资源的稳定策略，从而能够生产出大量的VAM产品，与基于模型的最新控制具有可比的性能。

著录项

来源
《Control Engineering Practice》 |2020年第4期|104331.1-104331.10|共10页
作者
Lingwei Zhu; Yunduan Cui; Go Takami; Hiroaki Kanokogi; Takamitsu Matsubara;
展开▼
作者单位

Graduate School of Science and Technology Nora Institute of Science and Technology Takayama-cho 8916-5 Ikoma Nora Japan;

New Field Development Center Yokogawa Electric Corporation Nakacho 2-9-32 Musashino-shi Tokyo Japan;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Chemical process control; Reinforcement learning; Vinyl acetate monomer;

机译：化学过程控制;强化学习;醋酸乙烯酯单体;

相似文献

外文文献
中文文献
专利

1. Multivariable adaptive neural network predictive control in the presence of measurement time-delay; application in control of Vinyl Acetate monomer process [J] . Pazhooh Faramarz, Shahraki Farhad, Sadeghi Jafar, Journal of Process Control . 2018,第期

机译：多变量自适应神经网络在测量时间延迟存在下预测控制; 乙酸乙烯酯单体工艺中的应用
2. Plantwide control study of a vinyl acetate monomer process design [J] . Olsen DG, Svrcek WY, Young BR Chemical Engineering Communications . 2005,第12期

机译：乙酸乙烯酯单体工艺设计的全厂控制研究
3. Silica gel supported co(acac)(2) catalyst in the controlled radical polymerization of vinyl acetate: an easy and practical method to make crystallized poly(vinyl acetate) in a one step process [J] . Semsarzadeh Mohammad Ali, Sabzevari Alireza Journal of Polymer Research . 2017,第11期

机译：硅胶负载的CO（ACAC）（2）催化剂在乙酸乙烯酯的受控自由基聚合中：一种易于实用的方法，使结晶聚（乙酸乙烯酯）在一步程中
4. Just-In-Time Statistical Process Control: Adaptive Monitoring of Vinyl Acetate Monomer Process [C] . Manabu Kano, Takeaki Sakata, Shinji Hasebe IFAC World Congress . 2011

机译：立交统计过程控制：乙酸乙酸乙烯酯单体过程的自适应监测
5. Simulation and advanced process control of an extractive distillation column and a vinyl acetate monomer plant. [D] . Assef, James Z. 1998

机译：萃取蒸馏塔和乙酸乙烯酯单体装置的仿真和先进过程控制。
6. Sensors Integrated Control of PEMFC Gas Supply System Based on Large-Scale Deep Reinforcement Learning [O] . Jiawen Li, Tao Yu 2021

机译：基于大型深度增强学习的PEMFC气体供应系统的传感器集成控制
7. Controller performance of P,PI and neral network control in vinyl acetate monomer process. [O] . 2008

机译：乙酸乙烯酯单体工艺中P，PI的控制器性能和神经网络控制。

Scalable reinforcement learning for plant-wide control of vinyl acetate monomer process

摘要

著录项

相似文献

相关主题

期刊订阅