Expected Window Mean-Payoff

Benjamin Bordais; Shibashis Guha; Jean-François Raskin


Abstract

We study the expected value of the window mean-payoff measure in Markov decision processes (MDPs) and Markov chains (MCs). The window mean-payoff measure strengthens the classical mean-payoff measure by measuring the mean-payoff over a window of bounded length that slides along an infinite path. This measure ensures better stability properties than the classical mean-payoff. Window mean-payoff has been introduced previously for two-player zero-sum games. As in the case of games, we study several variants of this definition: the measure can be defined to be prefix-independent or not, and for a fixed window length or for a window length that is left parametric. For fixed window length, we provide polynomial time algorithms for the prefix-independent version for both MDPs and MCs. When the length is left parametric, the problem of computing the expected value on MDPs is as hard as computing the mean-payoff value in two-player zero-sum games, a problem for which it is not known if it can be solved in polynomial time. For the prefix-dependent version, surprisingly, the expected window mean-payoff value cannot be computed in polynomial time unless P=PSPACE. For the parametric case and the prefix-dependent case, we manage to obtain algorithms with better complexities for MCs.
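To make the sliding-window idea concrete, here is a minimal sketch that evaluates a window mean-payoff on a finite prefix of a path. It assumes one common formalization from the window-objective literature (for each position, take the best mean over windows of length at most the bound, then take the worst such value over all positions); the paper's precise definition may differ, and the names `best_window_mean`, `window_value`, and the `skip` parameter are hypothetical. The `skip` argument only illustrates why ignoring a finite prefix can change the value; it is not the paper's prefix-independent measure, which is defined on infinite paths.

```python
from fractions import Fraction


def best_window_mean(weights, start, max_len):
    """Largest mean of weights[start:start + l] over window lengths 1 <= l <= max_len.

    Assumes integer weights and that a full window of length max_len fits.
    """
    best = None
    total = 0
    for length in range(1, max_len + 1):
        total += weights[start + length - 1]
        mean = Fraction(total, length)
        if best is None or mean > best:
            best = mean
    return best


def window_value(weights, max_len, skip=0):
    """Worst best-window mean over all start positions at or after `skip`.

    Only positions where a full window of length max_len fits are considered,
    which is an artifact of working on a finite prefix (assumption).
    """
    starts = range(skip, len(weights) - max_len + 1)
    if len(starts) == 0:
        raise ValueError("sequence too short for the requested window length")
    return min(best_window_mean(weights, i, max_len) for i in starts)


if __name__ == "__main__":
    weights = [-5, 1, 1, 1, 1]
    # Value over the whole prefix, including the bad first step.
    print(window_value(weights, max_len=2))
    # Value once the first position is ignored.
    print(window_value(weights, max_len=2, skip=1))
```

On this toy sequence the early weight of -5 drags down the value when every position counts, while the value from position 1 onward is unaffected by it, which is the intuition behind distinguishing the prefix-dependent and prefix-independent variants.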

Bibliographic record

Source: LIPIcs: Leibniz International Proceedings in Informatics, 2019, Issue 1, 15 pages
Authors: Benjamin Bordais; Shibashis Guha; Jean-François Raskin
Original format: PDF
Subject classification (Chinese Library Classification): Computing technology, computer technology
Keywords: mean-payoff; Markov decision processes; synthesis

Similar literature

1. Looking at mean-payoff and total-payoff through windows [J]. Krishnendu Chatterjee, Laurent Doyen, Mickael Randour. Information and Computation. 2015, issue juna
2. Windows open and close when least expected: the private placement debt deal of the year [J]. Marine Money International. 2021, issue 1
3. US HRC prices expected to have limited window to rise [J]. SBB daily eBriefing. 2020, issue Auga20
4. Looking at Mean-Payoff Through Foggy Windows [C]. Paul Hunter, Guillermo A. Perez, Jean-Francois Raskin. International Symposium on Automated Technology for Verification and Analysis. 2015
5. Life cycle assessment of residential windows: Analyzing the environmental impact of window restoration versus window replacement [D]. Switala-Elmhurst, Katherine. 2014
6. Non-invasive prediction of implantation window in controlled hyperstimulation cycles: Can the time from the menstrual day at embryo transfer to expected menstrual cycle give a clue? [O]. İlhan Şanverdi, Enis Özkaya, Tayfun Kutlu. 2016
7. Looking at mean-payoff and total-payoff through windows [O]. Chatterjee, Krishnendu, Doyen, Laurent, Randour, Mickael. 2015
