首页> 外文学位 >Probabilistic planning with risk-sensitive criterion
【24h】

Probabilistic planning with risk-sensitive criterion

机译:具有风险敏感准则的概率规划

获取原文
获取原文并翻译 | 示例

摘要

Probabilistic planning models Markov Decision Processes(MDPs) and Partially Observable Markov Decision Processes (POMDPs) -- are models where autonomous agents operate and make sequential decisions under uncertainty environment in successive episodes. To adapt to different scenario and objectives, many criteria and variants for MDPs and POMDPs have been proposed. The Risk-Sensitive criterion (RS-criterion) is one criterion that maximize the probability that the accumulated cost of the agent execution is less than a predefined threshold, so that the agent can avoid the situation that an exorbitantly high accumulated cost is encountered as much as possible.;Risk-Sensitive MDPs (RS-MDPs) and Risk-Sensitive POMDPs (RSPOMDPs) are risk-sensitive models that combine the RS-criterion with MDPs and POMDPs, respectively. I hypothesize that one can design novel algorithms which are specific for RS-MDPs and RS-POMDPs by applying insights gained from analyzing properties and structures of risk-sensitive models. The key observation about risk-sensitive models is that the original formalizations can be transformed to new formalizations by maintaining the consumed cost so far and considering it for decisions as well.;To validate my hypothesis, (1) I formally define the RS-MDP model and distinguish the models under different assumptions about cost. For each assumption and model; I introduce new algorithms and discuss related properties. (2) I formally propose the RS-POMDP model and discuss some deficiencies of several existing regular POMDP models. I introduce procedures to complement several existing algorithms and also provide new algorithms for RS-POMDPs. (3) I also provide theoretical properties of the risk-sensitive models as well as empirical evaluations of the new algorithms on randomly generated problems, the NAVIGATION domain from the ICAPS International Probabilistic Planning Competition, and a taxi domain generated with real-world data. For both RS-MDPs and RS-POMPs, the experiments show my algorithms are more efficient than existing algorithms, which validate my hypothesis.
机译:概率规划模型马尔可夫决策过程(MDP)和部分可观察的马尔可夫决策过程(POMDP)-是在不确定情节下连续情节中自主主体进行操作并做出顺序决策的模型。为了适应不同的场景和目标,已经提出了MDP和POMDP的许多标准和变体。风险敏感标准(RS-criterion)是一种使代理执行的累积成本小于预定义阈值的概率最大化的标准,因此,代理可以避免遇到过多的累积成本过高的情况。风险敏感型MDP(RS-MDP)和风险敏感型POMDP(RSPOMDP)是将RS准则分别与MDP和POMDP结合在一起的风险敏感模型。我假设可以通过应用从风险敏感模型的属性和结构分析中获得的见识,设计出针对RS-MDP和RS-POMDP的新颖算法。关于风险敏感模型的主要观察结果是,可以通过保持到目前为止的已消耗成本并考虑其决策来将原始形式转换为新形式。为了验证我的假设,(1)我正式定义了RS-MDP对不同成本的假设进行建模和区分。对于每个假设和模型;我将介绍新算法并讨论相关属性。 (2)我正式提出了RS-POMDP模型,并讨论了几种现有常规POMDP模型的不足之处。我介绍了一些程序,以补充现有的几种算法,并为RS-POMDP提供新的算法。 (3)我还提供了风险敏感模型的理论特性以及对随机产生的问题的新算法的实证评估,ICAPS国际概率规划竞赛中的NAVIGATION域以及使用实际数据生成的出租车域。对于RS-MDP和RS-POMP,实验表明我的算法比现有算法更有效,这验证了我的假设。

著录项

  • 作者

    Hou, Ping.;

  • 作者单位

    New Mexico State University.;

  • 授予单位 New Mexico State University.;
  • 学科 Computer science.
  • 学位 Ph.D.
  • 年度 2017
  • 页码 142 p.
  • 总页数 142
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号