首页> 外文期刊>Control Engineering Practice >Scalable reinforcement learning for plant-wide control of vinyl acetate monomer process
【24h】

Scalable reinforcement learning for plant-wide control of vinyl acetate monomer process

机译:可扩展的强化学习,可在整个工厂范围内控制乙酸乙烯酯单体工艺

获取原文
获取原文并翻译 | 示例
           

摘要

This paper explores a reinforcement learning (RL) approach that designs automatic control strategies in a large-scale chemical process control scenario as the first step for leveraging an RL method to intelligently control real-world chemical plants. The huge number of units for chemical reactions as well as feeding and recycling the materials of a typical chemical process induces a vast amount of samples and subsequent prohibitive computation complexity in RL for deriving a suitable control policy due to high-dimensional state and action spaces. To tackle this problem, a novel RL algorithm: Factorial Fast-food Dynamic Policy Programming (FFDPP) is proposed. By introducing a factorial framework that efficiently factorizes the action space, Fast-food kernel approximation that alleviates the curse of dimensionality caused by the high dimensionality of state space, into Dynamic Policy Programming (DPP) that achieves stable learning even with insufficient samples. FFDPP is evaluated in a commercial chemical plant simulator for a Vinyl Acetate Monomer (VAM) process. Experimental results demonstrate that without any knowledge of the model, the proposed method successfully learned a stable policy with reasonable computation resources to produce a larger amount of VAM product with comparative performance to a state-of-the-art model-based control.
机译:本文探索了一种强化学习(RL)方法,该方法设计了大规模化学过程控制场景中的自动控制策略,这是利用RL方法智能控制实际化工厂的第一步。用于化学反应以及进料和回收典型化学过程中的材料的大量单元会导致产生大量样本,并且由于高维状态和动作空间而导致RL难以获得适当的控制策略,从而导致RL的计算复杂度过高。为了解决这个问题,提出了一种新颖的RL算法:因子快速食品动态策略编程(FFDPP)。通过引入有效分解动作空间的阶乘框架,将减轻状态空间高维数引起的维数诅咒的Fast-food kernel逼近技术引入到动态策略编程(DPP)中,即使没有足够的样本也可以实现稳定的学习。 FFDPP在商业化工厂模拟器中评估了乙酸乙烯酯单体(VAM)工艺。实验结果表明,在不了解模型的情况下,该方法成功地学习了具有合理计算资源的稳定策略,从而能够生产出大量的VAM产品,与基于模型的最新控制具有可比的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号