首页> 外文期刊>Electronic Communications of the EASST >Estimating Rewards & Rare Events in Nondeterministic Systems
【24h】

Estimating Rewards & Rare Events in Nondeterministic Systems

机译:估计非确定性系统中的奖励和稀有事件

获取原文
           

摘要

Exhaustive verification can quantify critical behaviour arising from concurrency in nondeterministic models. Rare events typically entail no additional challenge, but complex systems are generally intractable. Recent work on Markov decision processes allows the extremal probabilities of a property to be estimated using Monte Carlo techniques, offering the potential to handle much larger models. Here we present algorithms to estimate extremal rewards and consider the challenges posed by rarity. We find that rewards require a different interpretation of confidence and that reachability rewards require the introduction of an auxiliary hypothesis test. We show how importance sampling can significantly improve estimation when probabilities are low, but find it is not a panacea for rare schedulers.
机译:详尽的验证可以量化非确定性模型中并发引起的关键行为。罕见事件通常不会带来其他挑战,但是复杂的系统通常很难处理。马尔可夫决策过程的最新工作允许使用蒙特卡洛技术估算财产的极值概率,从而有可能处理更大的模型。在这里,我们提出了估计极值奖励并考虑稀有性带来的挑战的算法。我们发现,奖励需要对信心的不同解释,而可达性奖励则需要引入辅助假设检验。我们展示了当概率较低时,重要性采样如何显着改善估计,但是对于稀有的调度程序来说,这并不是万灵药。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号