Efficient Timeout Synthesis in Fixed-Delay CTMC Using Policy Iteration

机译：使用策略迭代的固定延迟CTMC中有效超时合成

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We consider the fixed-delay synthesis problem for continuous-time Markov chains extended with fixed-delay transitions (fdCTMC). The goal is to synthesize concrete values of the fixed-delays (timeouts) that minimize the expected total cost incurred before reaching a given set of target states. The same problem has been considered and solved in previous works by computing an optimal policy in a certain discrete-time Markov decision process (MDP) with a huge number of actions that correspond to suitably discretized values of the timeouts. In this paper, we design a symbolic fixed-delay synthesis algorithm which avoids the explicit construction of large action spaces. Instead, the algorithm computes a small sets of "promising" candidate actions on demand. The candidate actions are selected by minimizing a certain objective function by computing its symbolic derivative and extracting a univariate polynomial whose roots are precisely the points where the derivative takes zero value. Since roots of high degree univariate polynomials can be isolated very efficiently using modern mathematical software, we achieve not only drastic memory savings but also speedup by three orders of magnitude compared to the previous methods.

机译：我们考虑使用固定延迟转换（FDCTMC）扩展连续时间马尔可夫链的固定延迟综合问题。目标是综合固定延迟（超时）的具体值，以最小化在达到给定的一组目标状态之前产生的预期总成本。通过在某个离散时间马尔可夫决策过程（MDP）中计算最佳策略，在以前的作品中考虑并解决了相同的问题，其具有与超时的适当离散的值对应的巨大动作。在本文中，我们设计了一种符号固定延迟合成算法，避免了大动作空间的显式构造。相反，该算法根据需要计算一小组“有希望的”候选操作。通过计算其符号衍生物并提取一个单变量多项式来最小化某个目标函数来选择候选操作，其根部正是衍生物需要零值的点。由于高度单变量多项式的根源可以非常有效地使用现代数学软件来孤立，因此不仅达到了剧烈的记忆节省，而且与先前的方法相比，速度三个数量级的加速。

著录项

来源
《IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems》|2016年|xxi 501 p. :|共6页
会议地点
作者
?ubo? Koren?iak; Antonín Ku?era; Vojtěch ?ehák;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN913.2-532;
关键词
Clocks; Protocols; Markov processes; Delays; Standards; Computational modeling;

机译：时钟;协议;马尔可夫进程;延迟;标准;计算建模;

相似文献

外文文献
中文文献
专利

1. AN EFFICIENT POLICY ITERATION ALGORITHM FOR DYNAMIC PROGRAMMING EQUATIONS [J] . Alla Alessandro, Falcone Maurizio, Kalise Dante SIAM Journal on Scientific Computing . 2015,第1期

机译：动态规划方程的一个有效的策略迭代算法。
2. An efficient strategy for the synthesis of syn 1,3-diols via iterative acetate aldol reactions and synthesis of atorvastatin lactone [J] . Goyal Sandeep, Patel Bhautikkumar, Sharma Ratnesh, Tetrahedron letters: The International Journal for the Rapid Publication of Preliminary Communications in Organic Chemistry . 2015,第40期

机译：通过迭代的乙酸羟醛醛缩合反应合成合成1,3-二醇的有效策略和阿托伐他汀内酯的合成
3. An Iterative Synthesis Approach to Petri Net-Based Deadlock Prevention Policy for Flexible Manufacturing Systems [J] . Uzam M., MengChu Zhou IEEE transactions on systems, man, and cybernetics. Part A, Systems and humans . 2007,第3期

机译：基于Petri网的柔性制造系统死锁预防策略的迭代综合方法
4. Efficient Timeout Synthesis in Fixed-Delay CTMC Using Policy Iteration [C] . Ľuboš Korenčiak, Antonín Kučera, Vojtěch Řehák IEEE International Symposium on Modeling, Analysis Simulation of Computer and Telecommunication Systems . 2016

机译：使用策略迭代的固定延迟CTMC中的有效超时综合
5. Efficient approximate policy iteration methods for sequential decision making in reinforcement learning. [D] . Lagoudakis, Michail G. 2003

机译：强化学习中顺序决策的有效近似策略迭代方法。
6. An Improved Pattern Synthesis Iterative Method in Planar Arrays for Obtaining Efficient Footprints with Arbitrary Boundaries [O] . Aarón Ángel Salas-Sánchez, Cibrán López-Álvarez, Juan Antonio Rodríguez-González, 2021

机译：平面阵列的改进图案综合迭代方法用于获得具有任意边界的有效占地面积
7. Efficient Timeout Synthesis in Fixed-Delay CTMC Using Policy Iteration [O] . Korenčiak, Ľuboš, Kučera, Antonín, Řehák, Vojtěch 2016

机译：利用策略迭代实现固定时延CTmC中的高效超时合成

Efficient Timeout Synthesis in Fixed-Delay CTMC Using Policy Iteration

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅