...
首页> 外文期刊>Performance Evaluation >Certified policy synthesis for general Markov decision processes: An application in building automation systems
【24h】

Certified policy synthesis for general Markov decision processes: An application in building automation systems

机译:通用Markov决策流程的认证策略综合:在楼宇自动化系统中的应用

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we present an industrial application of new approximate similarity relations for Markov models, and show that they are key for the synthesis of control strategies. Typically, modern engineering systems are modelled using complex and high-order models which make the correct-by-design controller construction computationally hard. Using the new approximate similarity relations, this complexity is reduced and we provide certificates on the performance of the synthesised policies. The application deals with stochastic models for the thermal dynamics in a "smart building" setup: such building automation system set-up can be described by discrete-time Markov decision processes evolving over an uncountable state space and endowed with an output quantifying the room temperature. The new similarity relations draw a quantitative connection between different levels of model abstraction, and allow to quantitatively refine over complex models control strategies synthesised on simpler ones. The new relations, underpinned by the use of metrics, allow in particular for a useful trade-off between deviations over probability distributions on states and distances between model outputs. We develop a software toolbox supporting the application and the computational implementation of these new relations. (C) 2017 Elsevier B.V. All rights reserved.
机译:在本文中,我们提出了马尔可夫模型的新的近似相似关系的工业应用,并表明它们是控制策略综合的关键。通常,现代工程系统是使用复杂的高阶模型来建模的,这使得按设计正确校正的控制器构造在计算上变得困难。使用新的近似相似关系,可以减少这种复杂性,并且我们提供有关综合策略性能的证书。该应用程序处理“智能建筑”设置中热力学的随机模型:这种建筑物自动化系统的设置可以通过在无数状态空间上演化的离散时间马尔可夫决策过程来描述,并具有量化室温的输出。 。新的相似性关系在模型抽象的不同层次之间建立了定量联系,并允许对复杂模型上的控制策略进行定量改进,这些控制策略是在简单模型上合成的。通过使用度量支持的新关系尤其允许在状态概率分布的偏差与模型输出之间的距离之间进行有益的权衡。我们开发了一个软件工具箱,用于支持这些新关系的应用程序和计算实现。 (C)2017 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号