Reinforcement learning of adaptive online rescheduling timing and computing time allocation

Teemu J. Ikonen; Keijo Heljanko; Iiro Harjunkoski

首页> 外文期刊>Computers & Chemical Engineering >Reinforcement learning of adaptive online rescheduling timing and computing time allocation

【24h】

Reinforcement learning of adaptive online rescheduling timing and computing time allocation

机译：加固自适应在线重新安排定时和计算时间分配的学习

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Mathematical optimization methods have been developed to a vast variety of complex problems in the field of process systems engineering (e.g., the scheduling of chemical batch processes). However, the use of these methods in online scheduling is hindered by the stochastic nature of the processes and prohibitively long solution times when optimized over long time horizons. The following questions are raised: When to trigger a rescheduling, how much computing resources to allocate, what optimization strategy to use, and how far ahead to schedule? We propose an approach where a reinforcement learning agent is trained to make the first two decisions (i.e., rescheduling timing and computing time allocation). Using neuroevolution of augmenting topologies (NEAT) as the reinforcement learning algorithm, the approach yields, on average, better closed-loop solutions than conventional rescheduling methods on three out of four studied routing problems. We also reflect on expanding the agent's decision-making to all four decisions.

机译：已经开发了数学优化方法在过程系统工程领域（例如，化学批处理过程的调度）中的各种复杂问题。然而，在在线调度中使用这些方法被过程的随机性质受到了在优化长时间视野优化时的过程中的随机性质。提出以下问题：何时触发重新安排，计算资源提供多少，使用哪种优化策略以及计划进入多远？我们提出了一种方法，其中培训了加强学习代理以使前两个决定（即重新安排定时和计算时间分配）。使用增强拓扑（整洁）作为加强学习算法的神经发展，该方法平均优于恒定的闭环解决方案，比四个研究的路由问题三分之一的重新分析方法。我们还反思扩大代理人的决定，以实现所有四项决定。

著录项

来源
《Computers & Chemical Engineering》 |2020年第4期|106994.1-106994.17|共17页
作者
Teemu J. Ikonen; Keijo Heljanko; Iiro Harjunkoski;
展开▼
作者单位

Aalto University Department of Chemical and Metallurgical Engineering PO Box 16100 00076 Aalto Finland;

University of Helsinki Department of Computer Science PO Box 68 00014 University of Helsinki Finland Helsinki Institute for Information Technology (HIIT) Helsinki Finland;

Aalto University Department of Chemical and Metallurgical Engineering PO Box 16100 00076 Aalto Finland ABB Power Grids Research Kallstadter Str 1 68309 Mannheim Germany;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Online scheduling; Rescheduling procedures; Reinforcement learning; Decision-making; Timing; Computing resource allocation;

机译：在线调度;重新安排程序;强化学习;做决定;定时;计算资源分配;

相似文献

外文文献
中文文献
专利

1. Deep Reinforcement Learning for Performance-Aware Adaptive Resource Allocation in Mobile Edge Computing [J] . Binbin Huang, Zhongjin Li, Yunqiu Xu, Wireless communications & mobile computing . 2020,第1期

机译：移动边缘计算中性能感知自适应资源分配的深度增强学习
2. Adaptive Online Decision Method for Initial Congestion Window in 5G Mobile Edge Computing Using Deep Reinforcement Learning [J] . Xie Ruitao, Jia Xiaohua, Wu Kaishun IEEE Journal on Selected Areas in Communications . 2020,第2期

机译：利用深增强学习，5G移动边缘计算初始拥塞窗口自适应在线决策方法
3. Adaptive Online Decision Method for Initial Congestion Window in 5G Mobile Edge Computing Using Deep Reinforcement Learning [J] . Ecological restoration . 2020,第2期

机译：利用深增强学习，5G移动边缘计算初始拥塞窗口自适应在线决策方法
4. Reinforcement Learning in Railway Timetable Rescheduling [C] . Yongqiu Zhu, Hongrui Wang, Rob M.P. Goverde IEEE International Conference on Intelligent Transportation Systems . 2020

机译：铁路时刻表重新安排的加固学习
5. A Reinforcement Learning-based Framework for Resource Allocation and Task Assignment in Mobile Edge Computing Networks [D] . Hsieh, Li-Tse. 2021

机译：基于加强学习的移动边缘计算网络中的资源分配和任务分配框架
6. Modeling choice and reaction time during arbitrary visuomotor learning through the coordination of adaptive working memory and reinforcement learning [O] . Guillaume Viejo, Mehdi Khamassi, Andrea Brovelli, 2015

机译：通过自适应工作记忆和强化学习的协调对任意视觉运动学习中的选择和反应时间建模
7. Computing Resource Allocation Scheme of IOV using Deep Reinforcement Learning in Edge Computing Environment [O] . Yiwei Zhang, Min Zhang, Caixia Fan, 2021

机译：在边缘计算环境中使用深增强学习的IOV计算资源分配方案

Reinforcement learning of adaptive online rescheduling timing and computing time allocation

摘要

著录项

相似文献

相关主题

期刊订阅