首页> 外文会议>Asilomar Conference on Signals, Systems and Computers >Rationally inattentive Markov decision processes over a finite horizon
【24h】

Rationally inattentive Markov decision processes over a finite horizon

机译:有限范围内的注意力不集中的马尔可夫决策过程

获取原文

摘要

The framework of Rationally Inattentive Markov Decision Processes (RIMDPs) is an extension of Partially Observable Markov Decision Processes (POMDP) to the case when the observation kernel that governs the information gathering process is also selected by the decision maker. At each time, an observation kernel is chosen subject to a constraint on the Shannon conditional mutual information between the history of states and the current observation given the history of past observations. This set-up naturally arises in the context of networked control systems, artificial intelligence, and economic decision-making by boundedly rational agents. We show that, under certain structural assumptions on the information pattern and on the optimal policy, Bellman's Principle of Optimality can be used to derive a general dynamic programming recursion for this problem that reduces to solving a sequence of conditional rate-distortion problems.
机译:理性不专心的马尔可夫决策过程(RIMDP)的框架是部分可观察的马尔可夫决策过程(POMDP)的扩展,适用于决策者还选择管理信息收集过程的观察内核的情况。每次都选择观察核,但要考虑到状态历史和当前观察之间在给定过去观察历史的情况下对香农条件互信息的约束。这种设置自然是在网络控制系统,人工智能以及由有限理性主体进行的经济决策的背景下产生的。我们表明,在信息模式和最优策略的某些结构假设下,Bellman的最优性原理可用于得出该问题的一般动态规划递归,该递归可简化为解决一系列条件率失真问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号