首页> 外文期刊>Computers & Chemical Engineering >Reinforcement learning of adaptive online rescheduling timing and computing time allocation
【24h】

Reinforcement learning of adaptive online rescheduling timing and computing time allocation

机译:加固自适应在线重新安排定时和计算时间分配的学习

获取原文
获取原文并翻译 | 示例
           

摘要

Mathematical optimization methods have been developed to a vast variety of complex problems in the field of process systems engineering (e.g., the scheduling of chemical batch processes). However, the use of these methods in online scheduling is hindered by the stochastic nature of the processes and prohibitively long solution times when optimized over long time horizons. The following questions are raised: When to trigger a rescheduling, how much computing resources to allocate, what optimization strategy to use, and how far ahead to schedule? We propose an approach where a reinforcement learning agent is trained to make the first two decisions (i.e., rescheduling timing and computing time allocation). Using neuroevolution of augmenting topologies (NEAT) as the reinforcement learning algorithm, the approach yields, on average, better closed-loop solutions than conventional rescheduling methods on three out of four studied routing problems. We also reflect on expanding the agent's decision-making to all four decisions.
机译:已经开发了数学优化方法在过程系统工程领域(例如,化学批处理过程的调度)中的各种复杂问题。然而,在在线调度中使用这些方法被过程的随机性质受到了在优化长时间视野优化时的过程中的随机性质。提出以下问题:何时触发重新安排,计算资源提供多少,使用哪种优化策略以及计划进入多远?我们提出了一种方法,其中培训了加强学习代理以使前两个决定(即重新安排定时和计算时间分配)。使用增强拓扑(整洁)作为加强学习算法的神经发展,该方法平均优于恒定的闭环解决方案,比四个研究的路由问题三分之一的重新分析方法。我们还反思扩大代理人的决定,以实现所有四项决定。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号