Computers & Industrial Engineering

Reinforcement learning with Gaussian processes for condition-based maintenance



Abstract

Condition-based maintenance strategies are effective in enhancing reliability and safety for complex engineering systems that exhibit degradation phenomena with uncertainty. Such sequential decision-making problems are often modeled as Markov decision processes (MDPs) when the underlying process has the Markov property. Recently, reinforcement learning (RL) has become increasingly effective at addressing MDP problems with large state spaces. In this paper, we model the condition-based maintenance problem as a discrete-time continuous-state MDP without discretizing the deterioration condition of the system. Gaussian process regression is used as a function approximator to model the state transitions and the state value functions in reinforcement learning. An RL algorithm is then developed to minimize the long-run average cost (instead of the commonly used discounted reward) with iterations on the state-action value function and the state value function, respectively. We verify the capability of the proposed algorithm through simulation experiments and demonstrate its advantages in a case study on a battery maintenance decision-making problem. The proposed algorithm outperforms the discrete MDP approach by achieving lower long-run average costs.
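To make the abstract's ideas concrete, the following is a minimal illustrative sketch of GP-based relative value iteration for an average-cost maintenance MDP with a continuous degradation state. It is not the authors' algorithm: the gamma-increment degradation model, the cost values, and all parameters are assumptions, and the GP-learned transition model described in the abstract is simplified here to a known simulator. Only the state value function is approximated by a Gaussian process.

```python
# A minimal sketch of GP-based relative value iteration for a condition-based
# maintenance MDP with a continuous degradation state in [0, 1].
# All dynamics, costs, and parameters below are illustrative assumptions,
# not values from the paper.
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF

rng = np.random.default_rng(0)

# Assumed degradation dynamics: gamma-distributed deterioration increments.
def step(x, action):
    if action == 1:                      # preventive replacement: as good as new
        return 0.0
    return min(1.0, x + rng.gamma(2.0, 0.02))

# Assumed per-period cost structure (hypothetical values).
C_REPLACE, C_FAIL = 5.0, 50.0
def cost(x, action):
    if x >= 1.0:                         # failed system incurs corrective cost
        return C_FAIL
    return C_REPLACE if action == 1 else 0.0

# Support states on which the value function is learned; the GP generalizes
# the value estimates to the whole continuous state space.
X = np.linspace(0.0, 1.0, 25).reshape(-1, 1)
V = np.zeros(len(X))                     # relative value estimates
gp = GaussianProcessRegressor(kernel=RBF(length_scale=0.2), alpha=1e-3)

g = 0.0                                  # long-run average cost (gain) estimate
for it in range(60):
    gp.fit(X, V)                         # GP approximates V over continuous states
    V_new = np.empty_like(V)
    for i, x in enumerate(X[:, 0]):
        q = []
        for a in (0, 1):                 # one-step lookahead, Monte Carlo expectation
            nxt = np.array([[step(x, a)] for _ in range(30)])
            q.append(cost(x, a) + gp.predict(nxt).mean())
        V_new[i] = min(q)                # greedy backup over the two actions
    g = V_new[0]                         # gain estimate at the reference state
    V = V_new - g                        # subtract gain so values stay bounded

print(f"estimated long-run average cost per period: {g:.3f}")
```

The subtraction of the gain at a reference state is what distinguishes average-cost (relative) value iteration from the discounted formulation mentioned in the abstract: without it, the value estimates would grow without bound, since no discount factor contracts the backup.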


